WorldWideScience

Sample records for conserved dna motifs

  1. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs.
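
    The promoter-scoring step described in this abstract can be illustrated with a generic position weight matrix (PWM) scan. The sketch below is not the discriminative seeding algorithm used by the authors; it is a minimal log-odds scorer, and the aligned sites, the uniform background, and the two promoter strings are hypothetical values chosen only for illustration.

```python
import math

def build_log_odds(sites, background, pseudocount=1.0):
    """Build a log-odds position weight matrix from equal-length aligned sites."""
    length = len(sites[0])
    matrix = []
    for pos in range(length):
        # Start every base at the pseudocount so unseen bases get a finite score.
        counts = {base: pseudocount for base in "ACGT"}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        matrix.append(
            {b: math.log2((counts[b] / total) / background[b]) for b in "ACGT"}
        )
    return matrix

def best_score(promoter, matrix):
    """Return the highest log-odds score over all windows of the promoter."""
    width = len(matrix)
    return max(
        sum(matrix[i][promoter[start + i]] for i in range(width))
        for start in range(len(promoter) - width + 1)
    )

# Hypothetical RY-like sites and a uniform background (illustrative only).
sites = ["CATGCA", "CATGCA", "CATGCG", "CATGTA"]
background = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
pwm = build_log_odds(sites, background)

# Rank made-up promoters by their best-scoring window.
promoters = {"geneA": "TTTACATGCATTT", "geneB": "TTTTTTTTTTTTT"}
ranking = sorted(promoters, key=lambda g: best_score(promoters[g], pwm), reverse=True)
print(ranking)
```

    In a genome-wide setting the background frequencies would presumably come from the full promoter set of the species rather than a uniform distribution, and both strands would be scanned.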

  2. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Background: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination...

  3. Conserved XPB Core Structure and Motifs for DNA Unwinding: Implications for Pathway Selection of Transcription or Excision Repair

    Energy Technology Data Exchange (ETDEWEB)

    Fan, Li; Arvai, Andrew S.; Cooper, Priscilla K.; Iwai, Shigenori; Hanaoka, Fumio; Tainer, John A.

    2005-04-01

    The human xeroderma pigmentosum group B (XPB) helicase is essential for transcription, nucleotide excision repair, and TFIIH functional assembly. Here, we determined crystal structures of an Archaeoglobus fulgidus XPB homolog (AfXPB) that characterize two RecA-like XPB helicase domains and reveal a DNA damage recognition domain (DRD), a unique RED motif, a flexible thumb motif (ThM), and implied conformational changes within a conserved functional core. RED motif mutations dramatically reduce helicase activity, and the DRD and ThM, which flank the RED motif, appear structurally as well as functionally analogous to the MutS mismatch recognition and DNA polymerase thumb domains. Substrate specificity is altered by DNA damage, such that AfXPB unwinds dsDNA with 3' extensions, but not blunt-ended dsDNA, unless it contains a lesion, as shown for CPD or (6-4) photoproducts. Together, these results provide an unexpected mechanism of DNA unwinding with implications for XPB damage verification in nucleotide excision repair.

  4. The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.

    Science.gov (United States)

    Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko

    2013-07-01

    AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.

  5. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    OpenAIRE

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with...

  6. Conserved amino acid motifs from the novel Piv/MooV family of transposases and site-specific recombinases are required for catalysis of DNA inversion by Piv.

    Science.gov (United States)

    Tobiason, D M; Buchner, J M; Thiel, W H; Gernert, K M; Karls, A C

    2001-02-01

    Piv, a site-specific invertase from Moraxella lacunata, exhibits amino acid homology with the transposases of the IS110/IS492 family of insertion elements. The functions of conserved amino acid motifs that define this novel family of both transposases and site-specific recombinases (Piv/MooV family) were examined by mutagenesis of fully conserved amino acids within each motif in Piv. All Piv mutants altered in conserved residues were defective for in vivo inversion of the M. lacunata invertible DNA segment, but competent for in vivo binding to Piv DNA recognition sequences. Although the primary amino acid sequences of the Piv/MooV recombinases do not contain a conserved DDE motif, which defines the retroviral integrase/transposase (IN/Tnps) family, the predicted secondary structural elements of Piv align well with those of the IN/Tnps for which crystal structures have been determined. Molecular modelling of Piv based on these alignments predicts that E59, conserved as either E or D in the Piv/MooV family, forms a catalytic pocket with the conserved D9 and D101 residues. Analysis of Piv E59G confirms a role for E59 in catalysis of inversion. These results suggest that Piv and the related IS110/IS492 transposases mediate DNA recombination by a common mechanism involving a catalytic DED or DDD motif.

  7. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    Science.gov (United States)

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with tandem CYR motifs, has endo- and exonuclease activities against abasic site and other types of base damage. PALF accumulates rapidly at single-strand breaks in a poly(ADP-ribose) polymerase 1 (PARP1)-dependent manner in human cells. Indeed, PALF interacts directly with PARP1 and is required for its activation and for cellular resistance to methyl-methane sulfonate. PALF also interacts directly with KU86, LIGASEIV and phosphorylated XRCC4 proteins and possesses endo/exonuclease activity at protruding DNA ends. Various treatments that produce double-strand breaks induce formation of PALF foci, which fully coincide with γH2AX foci. Thus, PALF and the CYR motif may play important roles in DNA repair of higher eukaryotes. PMID:17396150

  8. MotifMark: Finding regulatory motifs in DNA sequences.

    Science.gov (United States)

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

    2017-07-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.

  9. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Background: DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS") but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results: Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions: Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  10. A conserved motif in the linker domain of STAT1 transcription factor is required for both recognition and release from high-affinity DNA-binding sites.

    Science.gov (United States)

    Hüntelmann, Bettina; Staab, Julia; Herrmann-Lingen, Christoph; Meyer, Thomas

    2014-01-01

    Binding to specific palindromic sequences termed gamma-activated sites (GAS) is a hallmark of gene activation by members of the STAT (signal transducer and activator of transcription) family of cytokine-inducible transcription factors. However, the precise molecular mechanisms involved in the signal-dependent finding of target genes by STAT dimers have not yet been very well studied. In this study, we have characterized a sequence motif in the STAT1 linker domain which is highly conserved among the seven human STAT proteins and includes surface-exposed residues in close proximity to the bound DNA. Using site-directed mutagenesis, we have demonstrated that a lysine residue in position 567 of the full-length molecule is required for GAS recognition. The substitution of alanine for this residue completely abolished both binding to high-affinity GAS elements and transcriptional activation of endogenous target genes in cells stimulated with interferon-γ (IFNγ), while the time course of transient nuclear accumulation and tyrosine phosphorylation were virtually unchanged. In contrast, two glutamic acid residues (E559 and E563) on each monomer are important for the dissociation of dimeric STAT1 from DNA and, when mutated to alanine, result in elevated levels of tyrosine-phosphorylated STAT1 as well as prolonged IFNγ-stimulated nuclear accumulation. In conclusion, our data indicate that the kinetics of signal-dependent GAS binding is determined by an array of glutamic acid residues located at the interior surface of the STAT1 dimer. These negatively charged residues appear to align the long axis of the STAT1 dimer in a position perpendicular to the DNA, thereby facilitating the interaction between lysine 567 and the phosphodiester backbone of a bound GAS element, which is a prerequisite for transient gene induction.

  11. 14-3-3 checkpoint regulatory proteins interact specifically with DNA repair protein human exonuclease 1 (hEXO1) via a semi-conserved motif

    DEFF Research Database (Denmark)

    Andersen, Sofie Dabros; Keijzers, Guido; Rampakakis, Emmanouil

    2012-01-01

    Human exonuclease 1 (hEXO1) acts directly in diverse DNA processing events, including replication, mismatch repair (MMR), and double strand break repair (DSBR), and it was also recently described to function as damage sensor and apoptosis inducer following DNA damage. In contrast, 14-3-3 proteins...... are specifically induced by replication inhibition leading to protein ubiquitination and degradation. We demonstrate direct and robust interaction between hEXO1 and six of the seven 14-3-3 isoforms in vitro, suggestive of a novel protein interaction network between DNA repair and cell cycle control. Binding...... and most likely a second unidentified binding motif. 14-3-3 associations do not appear to directly influence hEXO1 in vitro nuclease activity or in vitro DNA replication initiation. Moreover, specific phosphorylation variants, including hEXO1 S746A, are efficiently imported to the nucleus; to associate...

  12. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-01-01

    Protein-binding microarray (PBM) is a high-throughput platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8–10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. In particular, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/~wkc/kmerHMM. © 2013 The Author(s).
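
    The preprocessing step mentioned in this abstract, reducing probe-level data to a median binding intensity per k-mer, can be sketched as follows. This is an illustrative assumption about what such a reduction looks like, not the kmerHMM code; the function name, the probe sequences, and the intensities are made up, and details such as reverse-complement handling or normalization are omitted.

```python
from collections import defaultdict
from statistics import median

def kmer_median_intensities(probes, k):
    """Map each k-mer to the median intensity of the probes that contain it."""
    by_kmer = defaultdict(list)
    for sequence, intensity in probes:
        # Count each k-mer once per probe, even if it occurs several times.
        kmers = {sequence[i:i + k] for i in range(len(sequence) - k + 1)}
        for kmer in kmers:
            by_kmer[kmer].append(intensity)
    return {kmer: median(values) for kmer, values in by_kmer.items()}

# Hypothetical (probe sequence, signal intensity) pairs.
probes = [("ACGTAC", 10.0), ("CGTACC", 30.0), ("TTACGT", 20.0)]
medians = kmer_median_intensities(probes, k=4)

# Rank k-mers from strongest to weakest median signal.
ranked = sorted(medians, key=medians.get, reverse=True)
print(ranked[0], medians[ranked[0]])
```

    The ranked list corresponds to the stage the abstract describes as input for aligning k-mers and training the HMM; that later stage is not attempted here.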

  13. DNA motif elucidation using belief propagation.

    Science.gov (United States)

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-09-01

    Protein-binding microarray (PBM) is a high-throughput platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8–10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. In particular, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/~wkc/kmerHMM.

  14. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughput platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8–10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. In particular, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/~wkc/kmerHMM. © 2013 The Author(s).

  15. Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

    Science.gov (United States)

    Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

    2018-01-10

    Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Sequences flanking the five most frequently mutated positions in each of 42 TS genes were selected from the COSMIC database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and the methyltransferase enzyme DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies suggested that mutation is likely to occur when DNMT1 binds to and methylates this consensus DNA motif (C residues of CpG islands). Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing...
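
    The consensus sequence quoted in this abstract can be turned into a simple pattern search. The sketch below assumes that "T/AGC/GAGGA/TG" means three ambiguous positions, (T or A) G (C or G) A G G (A or T) G; that reading is an interpretation, not something the record states, and the test segment is made up.

```python
import re

# Assumed reading of "T/AGC/GAGGA/TG": each X/Y pair is one ambiguous position.
CONSENSUS = r"[TA]G[CG]AGG[AT]G"

def find_consensus(segment):
    """Return 0-based start positions of consensus matches on the given strand."""
    # A zero-width lookahead reports overlapping matches as well.
    return [m.start() for m in re.finditer(r"(?=" + CONSENSUS + r")", segment)]

# Hypothetical DNA segment containing one instance of the pattern.
print(find_consensus("CCTGCAGGAGTT"))
```

    Only the given strand is searched; scanning the reverse complement as well would be a natural extension.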

  16. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Background: In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. Results: By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions: Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  17. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.
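
    The screening idea in this abstract, checking candidate 8-mers against 3' UTRs and comparing one gene class with the rest, can be sketched as a simple occurrence comparison. The function names, the candidate 8-mers, and all sequences below are placeholders rather than data or methods from the study.

```python
def fraction_with_kmer(utrs, kmer):
    """Fraction of sequences that contain the k-mer at least once."""
    if not utrs:
        return 0.0
    return sum(kmer in utr for utr in utrs) / len(utrs)

def screen_kmers(candidates, target_utrs, other_utrs):
    """Rank candidate k-mers by how much more often they occur in the target set."""
    rows = []
    for kmer in candidates:
        target = fraction_with_kmer(target_utrs, kmer)
        other = fraction_with_kmer(other_utrs, kmer)
        rows.append((kmer, target, other))
    return sorted(rows, key=lambda row: row[1] - row[2], reverse=True)

# Placeholder 3' UTR sequences for two gene sets.
ribosomal = ["AATGTAAATAGG", "CCTGTAAATACC", "GGGGGGGGGGGG"]
others = ["ACACACACACAC", "TTTGTAAATATT", "CGCGCGCGCGCG", "ATATATATATAT"]

result = screen_kmers(["TGTAAATA", "ACACACAC"], ribosomal, others)
for kmer, in_target, in_other in result:
    print(kmer, round(in_target, 2), round(in_other, 2))
```

    A real analysis would add a significance test and control for UTR length and base composition, which this sketch leaves out.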

  18. MotifMark: Finding Regulatory Motifs in DNA Sequences

    OpenAIRE

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L.; Wang, May D.

    2017-01-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity be...

  19. A Conserved Metal Binding Motif in the Bacillus subtilis Competence Protein ComFA Enhances Transformation.

    Science.gov (United States)

    Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M

    2017-08-01

    Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis. It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE: ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.

  20. Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

    Science.gov (United States)

    Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

    2008-02-15

    KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database, implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.

  1. Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

    Directory of Open Access Journals (Sweden)

    Ahmad A. Malik

    2017-05-01

    Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine). The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid), suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score) are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity).
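The GXGGRRKK linker described in the abstract above can be searched for with a simple wildcard pattern. The following is a minimal Python sketch, not the authors' method: the function names and the toy sequence are hypothetical, and X is treated as "any single residue" as the abstract states.

```python
import re

# Linker motif quoted in the abstract; X stands for any amino acid.
LINKER = "GXGGRRKK"


def motif_to_regex(motif):
    """Translate a motif with X wildcards into a regular-expression string."""
    return "".join("." if aa == "X" else re.escape(aa) for aa in motif.upper())


def find_motif(sequence, motif=LINKER):
    """Return 0-based start positions of every match, overlaps included."""
    # A lookahead lets overlapping occurrences be reported as well.
    pattern = re.compile("(?=" + motif_to_regex(motif) + ")")
    return [m.start() for m in pattern.finditer(sequence.upper())]


# Hypothetical toy sequence for illustration, not a real dehydrin.
toy = "MSSSSSSGAGGRRKKEKKGLMEK"
print(find_motif(toy))
```

A real survey would read sequences from a FASTA file and would probably need a more permissive K-segment definition than an exact pattern; neither is attempted here.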

  2. The limits of de novo DNA motif discovery.

    Directory of Open Access Journals (Sweden)

    David Simcha

    Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery, that is, searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary.
An implementation of

  3. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360), cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R.BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.
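The recognition sequence GRCGYC quoted above uses IUPAC degenerate codes, where R stands for A or G and Y for C or T. As a small illustration, the Python sketch below expands such a site into a regular expression and lists its occurrences in a DNA string. The helper names and the toy sequence are hypothetical; only the site itself comes from the abstract.

```python
import re

# Subset of the IUPAC nucleotide codes needed here (R = purine, Y = pyrimidine).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}


def iupac_to_regex(site):
    """Expand a degenerate site such as GRCGYC into a regex string."""
    return "".join(IUPAC[base] for base in site.upper())


def find_sites(dna, site="GRCGYC"):
    """Return (0-based start, matched sequence) for every occurrence."""
    # The lookahead reports overlapping matches; group 1 holds the site itself.
    pattern = re.compile("(?=(" + iupac_to_regex(site) + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(dna.upper())]


# Hypothetical toy sequence containing two of the four concrete sites.
toy = "TTGACGTCAAGGCGCCTT"
print(find_sites(toy))
```

Because the reverse complement of GRCGYC is again GRCGYC, scanning one strand is enough for this particular site; a non-palindromic site would need a second pass over the reverse complement.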

  4. PISMA: A Visual Representation of Motif Distribution in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Rogelio Alcántara-Silva

    2017-03-01

    Full Text Available Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .

  5. Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.

    Science.gov (United States)

    Zhao, Xiaoyan; Sze, Sing-Hoi

    2011-05-01

    One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2^l) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.

  6. G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    John A Capra

    2010-07-01

    Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs) across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs). Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.
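The abstract above does not state how a G4 DNA motif was defined, so the following Python sketch is only an assumption-laden illustration: it uses a commonly seen textual pattern of four guanine runs separated by short loops, with the run length (`min_run`) and loop length (`max_loop`) exposed as hypothetical parameters rather than values taken from the study. Both strands are scanned, since a C-rich stretch on one strand is a G-rich stretch on the other.

```python
import re


def reverse_complement(dna):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def g4_regex(min_run=3, max_loop=7):
    """Four G-runs separated by three loops (assumed definition, see lead-in)."""
    run = "G{%d,}" % min_run
    loop = "[ACGT]{1,%d}?" % max_loop
    return re.compile(run + (loop + run) * 3)


def find_g4_motifs(dna, min_run=3, max_loop=7):
    """Return sorted (start, end, strand) hits in forward-strand coordinates."""
    dna = dna.upper()
    n = len(dna)
    pattern = g4_regex(min_run, max_loop)
    hits = [(m.start(), m.end(), "+") for m in pattern.finditer(dna)]
    # Hits on the reverse complement are mapped back to forward coordinates.
    for m in pattern.finditer(reverse_complement(dna)):
        hits.append((n - m.end(), n - m.start(), "-"))
    return sorted(hits)


# Hypothetical toy sequence with one candidate motif on the forward strand.
print(find_g4_motifs("TTGGGAGGGTGGGCGGGTT"))
```

The search reports non-overlapping candidates only, and a match says nothing about whether the structure forms in vivo; the conservation analysis described in the abstract is a separate step not sketched here.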

  7. BlockLogo: Visualization of peptide and sequence motif conservation

    DEFF Research Database (Denmark)

    Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

    2013-01-01

    BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...

  8. Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

    Science.gov (United States)

    Gade, Chandrasekhar Reddy; Sharma, Nagendra K

    2017-12-15

    This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNA i-motif structure with DNA cytosine pentamer (dC5) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...

  10. DNA mutation motifs in the genes associated with inherited diseases.

    Directory of Open Access Journals (Sweden)

    Michal Růžička

    Full Text Available Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs) rarely associated with mutations (coldspots) and frequently associated with mutations (hotspots) exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.
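The counting step implied by the abstract above, tallying how often each recurring 5-nucleotide segment coincides with a known mutation, might look roughly like the Python sketch below. This is an illustration under stated assumptions rather than the authors' procedure: the function name is hypothetical, a segment is counted as "hit" when any of its positions is mutated, and the significance testing that separates hotspots from coldspots is not shown.

```python
from collections import Counter


def kmer_mutation_table(seq, mutated_positions, k=5):
    """Map each k-mer to (occurrences overlapping a mutation, total occurrences).

    mutated_positions holds 0-based indices into seq (an assumed input format).
    """
    seq = seq.upper()
    mutated = set(mutated_positions)
    total, hit = Counter(), Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        total[kmer] += 1
        # Assumption: a window counts as mutated if any of its k positions is.
        if any(p in mutated for p in range(i, i + k)):
            hit[kmer] += 1
    return {kmer: (hit[kmer], total[kmer]) for kmer in total}


# Hypothetical toy input: one mutated position in a short repeat.
print(kmer_mutation_table("AAAAAA", [0]))
```

Segments with a high hit fraction would be hotspot candidates and those with a low fraction coldspot candidates, but deciding which deviations are statistically meaningful requires a test against an expected frequency that this sketch leaves out.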

  11. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

    Directory of Open Access Journals (Sweden)

    Hao Ding

    Full Text Available Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI). The Q motif, consisting of nine amino acids (GFXXPXPIQ) with an invariant glutamine (Q) residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11) gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.

  13. TOPDOM: database of conservatively located domains and motifs in proteins.

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E

    2016-09-01

    The TOPDOM database, originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins, has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. TOPDOM database is available at http://topdom.enzim.hu. The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. Contact: tusnady.gabor@ttk.mta.hu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  14. FTZ-Factor1 and Fushi tarazu interact via conserved nuclear receptor and coactivator motifs

    Science.gov (United States)

    Schwartz, Carol J.E.; Sampson, Heidi M.; Hlousek, Daniela; Percival-Smith, Anthony; Copeland, John W.R.; Simmonds, Andrew J.; Krause, Henry M.

    2001-01-01

    To activate transcription, most nuclear receptor proteins require coactivators that bind to their ligand-binding domains (LBDs). The Drosophila FTZ-Factor1 (FTZ-F1) protein is a conserved member of the nuclear receptor superfamily, but was previously thought to lack an AF2 motif, a motif that is required for ligand and coactivator binding. Here we show that FTZ-F1 does have an AF2 motif and that it is required to bind a coactivator, the homeodomain-containing protein Fushi tarazu (FTZ). We also show that FTZ contains an AF2-interacting nuclear receptor box, the first to be found in a homeodomain protein. Both interaction motifs are shown to be necessary for physical interactions in vitro and for functional interactions in developing embryos. These unexpected findings have important implications for the conserved homologs of the two proteins. PMID:11157757

  15. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    Science.gov (United States)

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. DNA motif alignment by evolving a population of Markov chains.

    Science.gov (United States)

    Bi, Chengpeng

    2009-01-30

    Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

  17. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin

    2015-01-01

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. In particular, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken together, we believe the coupling DNA motif pairs identified in this study can shed light on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  18. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun

    2015-09-27

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. In particular, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken together, we believe the coupling DNA motif pairs identified in this study can shed light on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  19. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  20. Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

    Science.gov (United States)

    Soufari, Heddy

    2017-01-01

    Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
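The binding preference reported above, two GCAC motifs separated by more than 6 nucleotides, can be turned into a simple sequence scan. The Python sketch below is a hypothetical illustration, not the assay or scoring used in the study: the function name is invented, and the upper bound on the spacer (`max_gap`) is an arbitrary assumption because the abstract gives only the lower limit.

```python
import re


def gcac_pairs(rna, min_gap=7, max_gap=30):
    """Return (first start, second start, gap) for suitably spaced GCAC pairs.

    min_gap=7 encodes "separated by >6 nt"; max_gap is an assumed cutoff.
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    starts = [m.start() for m in re.finditer("(?=GCAC)", rna)]
    pairs = []
    for i, first in enumerate(starts):
        for second in starts[i + 1:]:
            gap = second - (first + 4)  # nucleotides between the two motifs
            if min_gap <= gap <= max_gap:
                pairs.append((first, second, gap))
    return pairs


# Hypothetical toy transcript: two GCAC motifs with a 7-nt spacer.
print(gcac_pairs("GCAC" + "U" * 7 + "GCAC"))
```

A hit from this scan marks a candidate site only; whether a given transcript is actually bound would have to come from binding data such as the assays the abstract describes.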

  1. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    Energy Technology Data Exchange (ETDEWEB)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany); Gronenborn, Bruno [Institut des Sciences du Végétal, CNRS, 91198 Gif-sur-Yvette (France); Jeske, Holger, E-mail: holger.jeske@bio.uni-stuttgart.de [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany)

    2014-08-15

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis.

  2. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    International Nuclear Information System (INIS)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin; Gronenborn, Bruno; Jeske, Holger

    2014-01-01

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis

  3. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    ... machine (SVM) and its application in microarray experiment of Kashin-Beck disease. ... speed and amount of the corresponding mRNA in gene replication process. ... and revealed that some motifs may be related to the immune reactions.

  4. A single thiazole orange molecule forms an exciplex in a DNA i-motif.

    Science.gov (United States)

    Xu, Baochang; Wu, Xiangyang; Yeow, Edwin K L; Shao, Fangwei

    2014-06-18

    A fluorescent exciplex of thiazole orange (TO) is formed in a single-dye conjugated DNA i-motif. The exciplex fluorescence exhibits a large Stokes shift, high quantum yield, robust response to pH oscillation and little structural disturbance to the DNA quadruplex, which can be used to monitor the folding of high-order DNA structures.

  5. Identification of novel conserved functional motifs across most Influenza A viral strains

    Directory of Open Access Journals (Sweden)

    El-Azab Iman

    2011-01-01

    Background Influenza A virus poses a continuous threat to global public health. Design of novel universal drugs and vaccines requires a careful analysis of different strains of Influenza A viral genome from diverse hosts and subtypes. We performed a systematic in silico analysis of Influenza A viral segments of all available Influenza A viral strains and subtypes and grouped them based on host, subtype, and years isolated, and through multiple sequence alignments we extrapolated conserved regions, motifs, and accessible regions for functional mapping and annotation. Results Across all species and strains 87 highly conserved regions (conservation percentage >= 90%) and 19 functional motifs (conservation percentage = 100%) were found in PB2, PB1, PA, NP, M, and NS segments. The conservation percentage of these segments ranged between 94 - 98% in human strains (the most conserved), 85 - 93% in swine strains (the most variable), and 91 - 94% in avian strains. The most conserved segment was different in each host (PB1 for human strains, NS for avian strains, and M for swine strains). Target accessibility prediction yielded 324 accessible regions, with a single stranded probability > 0.5, of which 78 coincided with conserved regions. Some of the interesting annotations in these regions included sites for protein-protein interactions, the RNA binding groove, and the proton ion channel. Conclusions The influenza virus has evolved to adapt to its host through variations in the GC content and conservation percentage of the conserved regions. Nineteen universal conserved functional motifs were discovered, of which some were accessible regions with interesting biological functions. These regions will serve as a foundation for universal drug targets as well as universal vaccine design.
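
    The notion of a per-region conservation percentage can be illustrated with a small sketch. The definition used here (share of the most common character in each alignment column) and the names `column_conservation`, `conserved_runs`, `threshold` and `min_len` are assumptions made for illustration; the abstract does not state how the authors computed their percentages.

```python
from collections import Counter

def column_conservation(alignment):
    """Percent of rows sharing the most common character, per column.

    Assumption: gap characters are counted like any other symbol.
    """
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    scores = []
    for column in zip(*alignment):
        top_count = Counter(column).most_common(1)[0][1]
        scores.append(100.0 * top_count / len(column))
    return scores

def conserved_runs(scores, threshold=90.0, min_len=3):
    """Return half-open (start, end) runs of columns scoring >= threshold."""
    runs, start = [], None
    # A sentinel below any threshold closes a run that reaches the end.
    for i, score in enumerate(scores + [float("-inf")]):
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None
    return runs

# Toy alignment (hypothetical data, not taken from the study).
toy = ["ACGTAC", "ACGTTC", "ACGAAC", "ACGTAC"]
scores = column_conservation(toy)
regions = conserved_runs(scores)
```

    With the toy alignment, the first three columns are fully conserved and form the only run of at least three columns above the 90% threshold.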

  6. I-motif DNA structures are formed in the nuclei of human cells

    Science.gov (United States)

    Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

    2018-06-01

    Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.

  7. Novel and deviant Walker A ATP-binding motifs in bacteriophage large terminase-DNA packaging proteins

    International Nuclear Information System (INIS)

    Mitchell, Michael S.; Rao, Venigalla B.

    2004-01-01

    Bacteriophage terminases constitute a very interesting class of viral-coded multifunctional ATPase 'motors' that apparently drive directional translocation of DNA into an empty viral capsid. A common Walker A motif and other conserved signatures of a critical ATPase catalytic center are identified in the N-terminal half of numerous large terminase proteins. However, several terminases, including the well-characterized λ and SPP1 terminases, seem to lack the classic Walker A in the N-terminus. Using sequence alignment approaches, we discovered the presence of deviant Walker A motifs in these and many other phage terminases. One deviation, the presence of a lysine at the beginning of P-loop, may represent a 3D equivalent of the universally conserved lysine in the Walker A GKT/S signature. This and other novel putative Walker A motifs that first came to light through this study help define the ATPase centers of phage and viral terminases as well as elicit important insights into the molecular functioning of this fundamental motif in biological systems

  8. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    Science.gov (United States)

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  9. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    Science.gov (United States)

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to the accumulation of a huge number of DNA sequences. Many web services are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. The enormous diversity of motifs makes finding them challenging. To avoid these difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of motif discovery programs declines dramatically as the query set size increases. As a result, only a fraction of the top "peak" ChIP-Seq segments can be analyzed, or the area of analysis must be narrowed. Thus, motif discovery in massive datasets remains a challenging issue. The Argo_CUDA (Compute Unified Device Architecture) web service is designed to process massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in the 15-letter IUPAC code. Argo_CUDA is a fully exhaustive approach based on high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed motifs that correspond to known transcription factor binding sites.
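
    To make "degenerate oligonucleotide motifs written in 15-letter IUPAC code" concrete, the following minimal sketch matches a fixed-length IUPAC motif against a DNA sequence on the CPU. It is not the Argo_CUDA algorithm; the function names `matches` and `find_motif` and the toy sequence are hypothetical.

```python
# The 15-letter IUPAC nucleotide alphabet: each code maps to the bases it allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

def matches(motif, window):
    """True if every base in window is allowed by the motif code at that position."""
    return len(motif) == len(window) and all(
        base in IUPAC[code] for code, base in zip(motif, window)
    )

def find_motif(motif, sequence):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    k = len(motif)
    seq = sequence.upper()
    return [i for i in range(len(seq) - k + 1) if matches(motif, seq[i:i + k])]

# Toy example (hypothetical data): Y stands for C or T.
hits = find_motif("CAYG", "ACACGTCATGCAAG")
```

    An exhaustive search in the sense described above would enumerate candidate motifs over this alphabet and score each one; the sketch only shows the matching step for a single candidate.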

  10. PDL1 Signals through Conserved Sequence Motifs to Overcome Interferon-Mediated Cytotoxicity

    Directory of Open Access Journals (Sweden)

    Maria Gato-Cañas

    2017-08-01

    PDL1 blockade produces remarkable clinical responses, thought to occur by T cell reactivation through prevention of PDL1-PD1 T cell inhibitory interactions. Here, we find that PDL1 cell-intrinsic signaling protects cancer cells from interferon (IFN) cytotoxicity and accelerates tumor progression. PDL1 inhibited IFN signal transduction through a conserved class of sequence motifs that mediate crosstalk with IFN signaling. Abrogation of PDL1 expression or antibody-mediated PDL1 blockade strongly sensitized cancer cells to IFN cytotoxicity through a STAT3/caspase-7-dependent pathway. Moreover, somatic mutations found in human carcinomas within these PDL1 sequence motifs disrupted motif regulation, resulting in PDL1 molecules with enhanced protective activities from type I and type II IFN cytotoxicity. Overall, our results reveal a mode of action of PDL1 in cancer cells as a first line of defense against IFN cytotoxicity.

  11. Interaction of MYC with host cell factor-1 is mediated by the evolutionarily conserved Myc box IV motif.

    Science.gov (United States)

    Thomas, L R; Foshage, A M; Weissmiller, A M; Popay, T M; Grieb, B C; Qualls, S J; Ng, V; Carboneau, B; Lorey, S; Eischen, C M; Tansey, W P

    2016-07-07

    The MYC family of oncogenes encodes a set of three related transcription factors that are overexpressed in many human tumors and contribute to the cancer-related deaths of more than 70,000 Americans every year. MYC proteins drive tumorigenesis by interacting with co-factors that enable them to regulate the expression of thousands of genes linked to cell growth, proliferation, metabolism and genome stability. One effective way to identify critical co-factors required for MYC function has been to focus on sequence motifs within MYC that are conserved throughout evolution, on the assumption that their conservation is driven by protein-protein interactions that are vital for MYC activity. In addition to their DNA-binding domains, MYC proteins carry five regions of high sequence conservation known as Myc boxes (Mb). To date, four of the Mb motifs (MbI, MbII, MbIIIa and MbIIIb) have had a molecular function assigned to them, but the precise role of the remaining Mb, MbIV, and the reason for its preservation in vertebrate Myc proteins, is unknown. Here, we show that MbIV is required for the association of MYC with the abundant transcriptional coregulator host cell factor-1 (HCF-1). We show that the invariant core of MbIV resembles the tetrapeptide HCF-binding motif (HBM) found in many HCF-interaction partners, and demonstrate that MYC interacts with HCF-1 in a manner indistinguishable from the prototypical HBM-containing protein VP16. Finally, we show that rationalized point mutations in MYC that disrupt interaction with HCF-1 attenuate the ability of MYC to drive tumorigenesis in mice. Together, these data expose a molecular function for MbIV and indicate that HCF-1 is an important co-factor for MYC.

  12. Discovery of a Regulatory Motif for Human Satellite DNA Transcription in Response to BATF2 Overexpression.

    Science.gov (United States)

    Bai, Xuejia; Huang, Wenqiu; Zhang, Chenguang; Niu, Jing; Ding, Wei

    2016-03-01

    One of the basic leucine zipper transcription factors, BATF2, has been found to suppress cancer growth and migration. However, little is known about the genes downstream of BATF2. HeLa cells were stably transfected with BATF2, then chromatin immunoprecipitation-sequencing was employed to identify the DNA motifs responsive to BATF2. Comprehensive bioinformatics analyses indicated that the most significant motif discovered as TTCCATT[CT]GATTCCATTC[AG]AT was primarily distributed among the chromosome centromere regions and mostly within human type II satellite DNA. Such motifs were able to prime the transcription of type II satellite DNA in a directional and asymmetrical manner. Consistently, satellite II transcription was up-regulated in BATF2-overexpressing cells. The present study provides insight into understanding the role of BATF2 in tumours and the importance of satellite DNA in the maintenance of genomic stability. Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
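
    The bracketed motif quoted above is already valid regular-expression syntax, so occurrences can be located with a plain regex scan of both strands. This is a minimal sketch, not the authors' pipeline; the helper names `reverse_complement` and `scan` and the toy sequence are hypothetical.

```python
import re

# Motif as printed in the abstract; [CT] and [AG] are two-base alternatives.
MOTIF = "TTCCATT[CT]GATTCCATTC[AG]AT"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

def scan(seq, pattern=MOTIF):
    """Return sorted (start, strand) hits; start is a 0-based forward-strand coordinate."""
    seq = seq.upper()
    # The lookahead lets finditer report overlapping occurrences.
    regex = re.compile("(?=(" + pattern + "))")
    hits = [(m.start(), "+") for m in regex.finditer(seq)]
    for m in regex.finditer(reverse_complement(seq)):
        # Map the reverse-strand match back to forward-strand coordinates.
        hits.append((len(seq) - m.start() - len(m.group(1)), "-"))
    return sorted(hits)

# Toy sequence (hypothetical data): one motif instance flanked by two bases on each side.
site = "TTCCATTCGATTCCATTCAAT"
toy = "GG" + site + "CC"
forward_hits = scan(toy)
reverse_hits = scan(reverse_complement(toy))
```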

  13. Peptomics, identification of novel cationic Arabidopsis peptides with conserved sequence motifs

    DEFF Research Database (Denmark)

    Olsen, Addie Nina; Mundy, John; Skriver, Karen

    2002-01-01

    Arabidopsis family of 34 genes. The predicted peptides are characterized by a conserved C-terminal sequence motif and additional primary structure conservation in a core region. The majority of these genes had not previously been annotated. A subset of the predicted peptides show high overall sequence...... similarity to Rapid Alkalinization Factor (RALF), a peptide isolated from tobacco. We therefore refer to this peptide family as RALFL for RALF-Like. RT-PCR analysis confirmed that several of the Arabidopsis genes are expressed and that their expression patterns vary. The identification of a large gene family...

  14. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Directory of Open Access Journals (Sweden)

    Jie Zhu

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  15. Accurate Quantification of microRNA via Single Strand Displacement Reaction on DNA Origami Motif

    Science.gov (United States)

    Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs. PMID:23990889

  16. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Science.gov (United States)

    Zhu, Jie; Feng, Xiaolu; Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  17. Dipeptide frequency/bias analysis identifies conserved sites of nonrandomness shared by cysteine-rich motifs.

    Science.gov (United States)

    Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N

    2001-08-15

    This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.
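
    Ordered-dipeptide counting of the kind described above can be sketched in a few lines. This is not AAPAIR.TAB: the bias measure used here (observed dipeptide frequency divided by the product of the two single-residue frequencies) is an assumption chosen for illustration, and `dipeptide_stats` and the toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_stats(sequences):
    """Map each ordered dipeptide to (count, frequency, bias).

    Assumption: bias = observed frequency / expected frequency, where the
    expectation is the product of the two single-residue frequencies.
    """
    singles, pairs = Counter(), Counter()
    for seq in sequences:
        seq = seq.upper()
        singles.update(seq)
        # Overlapping windows of two residues; pairs never span two sequences.
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))
    n_single = sum(singles.values())
    n_pair = sum(pairs.values())
    stats = {}
    for dipeptide, count in pairs.items():
        freq = count / n_pair
        expected = (singles[dipeptide[0]] / n_single) * (singles[dipeptide[1]] / n_single)
        stats[dipeptide] = (count, freq, freq / expected)
    return stats

# Toy sequences (hypothetical data, not real EGF, Sushi or Laminin modules).
stats = dipeptide_stats(["CGYC", "CGFC"])
ranked = sorted(stats, key=lambda dp: stats[dp][0], reverse=True)
```

    Sorting by count or by bias gives two different views: count favors dipeptides built from abundant residues, while the ratio highlights pairs that occur more often than residue composition alone would predict.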

  18. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.

    2011-11-15

    Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. The Author(s) 2011. Published by Oxford University Press. All rights reserved.

  19. Molecular dynamics simulations of electrostatics and hydration distributions around RNA and DNA motifs

    Science.gov (United States)

    Marlowe, Ashley E.; Singh, Abhishek; Semichaevsky, Andrey V.; Yingling, Yaroslava G.

    2009-03-01

    Nucleic acid nanoparticles can self-assemble through the formation of complementary loop-loop interactions or stem-stem interactions. Presence and concentration of ions can significantly affect the self-assembly process and the stability of the nanostructure. In this presentation we use explicit molecular dynamics simulations to examine the variations in cationic distributions and hydration environment around DNA and RNA helices and loop-loop interactions. Our simulations show that the potassium and sodium ionic distributions are different around RNA and DNA motifs, which could be indicative of ion-mediated relative stability of loop-loop complexes. Moreover, in RNA loop-loop motifs ions are consistently present and exchanged through a distinct electronegative channel. We will also show how we used the specific RNA loop-loop motif to design an RNA hexagonal nanoparticle.

  20. Rtt107/Esc4 binds silent chromatin and DNA repair proteins using different BRCT motifs

    Directory of Open Access Journals (Sweden)

    Jockusch Rebecca A

    2006-11-01

    Background By screening a plasmid library for proteins that could cause silencing when targeted to the HMR locus in Saccharomyces cerevisiae, we previously reported the identification of Rtt107/Esc4 based on its ability to establish silent chromatin. In this study we aimed to determine the mechanism of Rtt107/Esc4 targeted silencing and also learn more about its biological functions. Results Targeted silencing by Rtt107/Esc4 was dependent on the SIR genes, which encode obligatory structural and enzymatic components of yeast silent chromatin. Based on its sequence, Rtt107/Esc4 was predicted to contain six BRCT motifs. This motif, originally identified in the human breast tumor suppressor gene BRCA1, is a protein interaction domain. The targeted silencing activity of Rtt107/Esc4 resided within the C-terminal two BRCT motifs, and this region of the protein bound to Sir3 in two-hybrid tests. Deletion of RTT107/ESC4 caused sensitivity to the DNA damaging agent MMS as well as to hydroxyurea. A two-hybrid screen showed that the N-terminal BRCT motifs of Rtt107/Esc4 bound to Slx4, a protein previously shown to be involved in DNA repair and required for viability in a strain lacking the DNA helicase Sgs1. Like SLX genes, RTT107/ESC4 interacted genetically with SGS1; esc4Δ sgs1Δ mutants were viable, but exhibited a slow-growth phenotype and also a synergistic DNA repair defect. Conclusion Rtt107/Esc4 binds to the silencing protein Sir3 and the DNA repair protein Slx4 via different BRCT motifs, thus providing a bridge linking silent chromatin to DNA repair enzymes.

  1. STUDYING THE INFLUENCE OF THE PYRENE INTERCALATOR TINA ON THE STABILITY OF DNA i-MOTIFS

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed A.; Pedersen, Erik Bjerregaard; Khaireldin, Nahid A.

    2012-01-01

    Certain cytosine-rich (C-rich) DNA sequences can fold into secondary structures as four-stranded i-motifs with hemiprotonated base pairs. Here we synthesized C-rich TINA-intercalating oligonucleotides by inserting a nonnucleotide pyrene moiety between two C-rich regions. The stability of their i-...

  2. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.; Rangkuti, Farania; Schramm, Michael C.; Jankovic, Boris R.; Kamau, Allan; Chowdhary, Rajesh; Archer, John A.C.; Bajic, Vladimir B.

    2011-01-01

    . These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity

  3. A conserved cysteine motif is critical for rice ceramide kinase activity and function.

    Directory of Open Access Journals (Sweden)

    Fang-Cheng Bi

    Ceramide kinase (CERK) is a key regulator of cell survival in dicotyledonous plants and animals. Much less is known about the roles of CERK and ceramides in mediating cellular processes in monocot plants. Here, we report the characterization of a ceramide kinase, OsCERK, from rice (Oryza sativa spp. Japonica cv. Nipponbare) and investigate the effects of ceramides on rice cell viability. OsCERK can complement the Arabidopsis CERK mutant acd5. Recombinant OsCERK has ceramide kinase activity with Michaelis-Menten kinetics and optimal activity at pH 7.0 and 40°C. Mg2+ activates OsCERK in a concentration-dependent manner. Importantly, a CXXXCXXC motif, conserved in all ceramide kinases and important for the activity of the human enzyme, is critical for OsCERK enzyme activity and in planta function. In a rice protoplast system, inhibition of CERK leads to cell death and the ratio of added ceramide and ceramide-1-phosphate, CERK's substrate and product, respectively, influences cell survival. Ceramide-induced rice cell death has apoptotic features and is an active process that requires both de novo protein synthesis and phosphorylation. Finally, mitochondria membrane potential loss previously associated with ceramide-induced cell death in Arabidopsis was also found in rice, but it occurred with different timing. OsCERK is a bona fide ceramide kinase with a functionally and evolutionarily conserved Cys-rich motif that plays an important role in modulating cell fate in plants. The vital function of the conserved motif in both human and rice CERKs suggests that the biochemical mechanism of CERKs is similar in animals and plants. Furthermore, ceramides induce cell death with similar features in monocot and dicot plants.
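
    Short protein motifs written with X as a wildcard, such as CXXXCXXC, translate directly into regular expressions. The sketch below assumes that X means "any single residue"; the helpers `motif_to_regex` and `find_sites` and the toy peptide are hypothetical and are not taken from the study.

```python
import re

def motif_to_regex(motif):
    """Compile an X-wildcard motif (e.g. 'CXXXCXXC') into an overlapping-match regex."""
    body = "".join("[A-Z]" if ch == "X" else re.escape(ch) for ch in motif.upper())
    # The lookahead lets finditer report overlapping occurrences.
    return re.compile("(?=(" + body + "))")

def find_sites(motif, protein):
    """Return (0-based start, matched residues) for every occurrence of the motif."""
    regex = motif_to_regex(motif)
    return [(m.start(), m.group(1)) for m in regex.finditer(protein.upper())]

# Toy peptide (hypothetical data, not the OsCERK sequence).
sites = find_sites("CXXXCXXC", "MACAAACAACGG")
```

    A match only shows that the spacing of the cysteines is present; whether the site is functional is an experimental question, as the mutational analysis in the abstract indicates.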

  4. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36), Mystique, Enigma (LMP-1), Enigma homologue (ENH), ZASP (Cypher), Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2). As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM). This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  5. New scoring schema for finding motifs in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Nowzari-Dalini Abbas

    2009-03-01

    Background: Pattern discovery in DNA sequences is one of the most fundamental problems in molecular biology, with important applications in finding regulatory signals and transcription factor binding sites. An important task in this problem is to search for (or predict) known binding sites in a new DNA sequence. To do so, all subsequences of the given DNA sequence are scored with a scoring function and the prediction is made by selecting the best score. Most of the available tools for known binding site prediction are designed on the assumption that binding site base positions are independent. Recently Tomovic and Oakeley investigated the statistical basis for either a claim of dependence or independence, to determine whether such a claim is generally true, and they presented a scoring function for binding site prediction based on the dependency between binding site base positions. Our primary objective is to investigate the scoring functions which can be used in known binding site prediction based on the assumption of dependency or independency in binding site base positions. Results: We propose a new scoring function based on the dependency between all base positions in the binding site. This scoring function uses joint information content and mutual information as a measure of dependency between positions in a transcription factor binding site. Our method for modeling dependencies is simply an extension of position-independency methods. We evaluate our new scoring function on real data sets extracted from the JASPAR and TRANSFAC databases, and compare the obtained results with two other well-known scoring functions. Conclusion: The results demonstrate that the new approach improves known binding site discovery and show that the joint information content and mutual information provide a better and more general criterion to investigate the relationships between positions in the TFBS. Our scoring function is formulated by simple
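
    The mutual-information ingredient named in this abstract can be sketched as follows. This is not the authors' scoring function, only a minimal estimate of the mutual information between two columns of a set of aligned binding sites; the function name and the four toy sites are hypothetical.

```python
from collections import Counter
from math import log2


def column_mutual_information(sites, i, j):
    """Mutual information (bits) between columns i and j of aligned sites."""
    n = len(sites)
    count_i = Counter(s[i] for s in sites)           # marginal counts, column i
    count_j = Counter(s[j] for s in sites)           # marginal counts, column j
    count_ij = Counter((s[i], s[j]) for s in sites)  # joint counts
    mi = 0.0
    for (a, b), c in count_ij.items():
        p_ab = c / n
        mi += p_ab * log2(p_ab / ((count_i[a] / n) * (count_j[b] / n)))
    return mi


# Hypothetical aligned sites: columns 0 and 1 always change together,
# while column 2 is constant and therefore carries no shared information.
toy_sites = ["ACGT", "ACGT", "TGGT", "TGGT"]
print(column_mutual_information(toy_sites, 0, 1))
print(column_mutual_information(toy_sites, 0, 2))
```

    With so few sites the estimate is heavily biased; real use would need pseudocounts or a small-sample correction.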

  6. Analysis of a conserved RGE/RGD motif in HCV E2 in mediating entry

    Directory of Open Access Journals (Sweden)

    Rong Lijun

    2009-01-01

    Background: Hepatitis C virus (HCV) encodes two transmembrane glycoproteins, E1 and E2, which form a heterodimer. E1 is believed to mediate fusion while E2 has been shown to bind cellular receptors. It is clear that HCV uses a multi-receptor complex to gain entry into susceptible cells; however, key elements of this complex remain elusive. In this study, the role of a highly conserved RGE/RGD motif of HCV E2 glycoprotein in viral entry was examined. The effect of each substitution mutation in this motif was tested by challenging susceptible cell lines with mutant HCV E1E2 pseudotyped viruses generated using a lentiviral system (HCVpp). In addition to assaying infectivity, producer cell expression and HCVpp incorporation of HCV E2 proteins, CD81 binding profiles and conformation of mutants were examined. Results: Based on these characteristics, mutants displayed either wt characteristics (high infectivity [≥ 90% of wt HCVpp], CD81 binding, E1E2 expression, incorporation into viral particles and proper conformation) or very low infectivity (≤ 20% of wt HCVpp). Only amino acid substitutions of the 3rd position (D or E) resulted in wt characteristics, as long as the negative charge was maintained or a neutral alanine was introduced. A change in charge to a positive lysine at this position disrupted HCVpp infectivity. Conclusion: Although most amino acid substitutions within this conserved motif displayed greatly reduced HCVpp infectivity, they retained soluble CD81 binding, proper E2 conformation, and incorporation into HCVpp. Our results suggest that although RGE/D is a well-defined integrin binding motif, in this case the role of these three hyperconserved amino acids does not appear to be integrin binding. As the conservation of this region extends well beyond these three amino acids, we speculate that this region may play an important role in the structure of HCV E2 or in mediating the interaction with other factor(s) during

  7. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

    2015-01-01

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8-10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
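
    To make the terms concrete, the sketch below shows how quickly the space of DNA k-mers grows (4^k) and how a mono-nucleotide and an adjacent di-nucleotide description of the same sequence differ. The feature encodings and function names are illustrative assumptions, not the models compared in the study.

```python
from itertools import product

BASES = "ACGT"


def kmer_space_size(k):
    """Number of distinct DNA k-mers."""
    return len(BASES) ** k


def all_kmers(k):
    """Lazily enumerate every DNA k-mer in lexicographic order."""
    return ("".join(p) for p in product(BASES, repeat=k))


def mono_features(seq):
    """One (position, base) feature per position: positions treated independently."""
    return [(i, base) for i, base in enumerate(seq)]


def di_features(seq):
    """One (position, dinucleotide) feature per adjacent pair of positions."""
    return [(i, seq[i:i + 2]) for i in range(len(seq) - 1)]


print(kmer_space_size(8))    # size of the 8-mer space
print(mono_features("ACGT"))
print(di_features("ACGT"))
```

    A di-nucleotide model has 16 possible values per adjacent pair instead of 4 per position, which lets it express dependencies between neighbouring bases at the cost of more parameters.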

  8. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun

    2015-06-11

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8-10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  9. A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities.

    Directory of Open Access Journals (Sweden)

    Marta Martínez-Bonet

    To identify new determinants required for Nef activity, we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121-137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck-dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structural constraints or to alterations in general protein trafficking. In addition, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as a functional region required for HIV-1 infectivity and therefore of potential interest for interfering with Nef activity during HIV-1 infection.

  10. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo; Jankovic, Boris R.; Bajic, Vladimir B.; Song, Le; Gao, Xin

    2013-01-01

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA. Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge. Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance. We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ~30% fewer error predictions relative to the other

  11. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA. Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge. Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance. We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ~30% fewer error predictions relative to the other

  12. A Conserved EAR Motif Is Required for Avirulence and Stability of the Ralstonia solanacearum Effector PopP2 In Planta

    Directory of Open Access Journals (Sweden)

    Cécile Segonzac

    2017-08-01

    Ralstonia solanacearum is the causal agent of the devastating bacterial wilt disease in many high-value Solanaceae crops. R. solanacearum secretes around 70 effectors into host cells in order to promote infection. Plants have, however, evolved specialized immune receptors that recognize corresponding effectors and confer qualitative disease resistance. In the model species Arabidopsis thaliana, the paired immune receptors RRS1 (resistance to Ralstonia solanacearum 1) and RPS4 (resistance to Pseudomonas syringae 4) cooperatively recognize the R. solanacearum effector PopP2 in the nuclei of infected cells. PopP2 is an acetyltransferase that binds to and acetylates the RRS1 WRKY DNA-binding domain, resulting in reduced RRS1-DNA association and thereby activating plant immunity. Here, we surveyed the naturally occurring variation in PopP2 sequence among the R. solanacearum strains isolated from diseased tomato and pepper fields across the Republic of Korea. Our analysis revealed high conservation of popP2 sequence, with only three polymorphic alleles present amongst 17 strains. Only one variation (a premature stop codon) caused the loss of RPS4/RRS1-dependent recognition in Arabidopsis. We also found that PopP2 harbors a putative eukaryotic transcriptional repressor motif (ethylene-responsive element binding factor-associated amphiphilic repression, or EAR), which is known to be involved in the recruitment of transcriptional co-repressors. Remarkably, mutation of the EAR motif disabled PopP2 avirulence function as measured by the development of hypersensitive response, electrolyte leakage, defense marker gene expression and bacterial growth in Arabidopsis. This lack of recognition was partially but significantly reverted by the C-terminal addition of a synthetic EAR motif. We show that the EAR motif-dependent gain of avirulence correlated with the stability of the PopP2 protein. Furthermore, we demonstrated the requirement of the PopP2 EAR motif for PTI

  13. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick Sobetzko

    2016-03-01

    Short DNA motifs are involved in a multitude of functions, for example chromosome segregation, DNA replication or mismatch repair. The distribution of such motifs is often not random, and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool, DistAMo (Distribution Analysis of DNA Motifs). The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i) genes beside the replication origin are enriched in GATCs, (ii) genome-wide GATC distribution follows a distinct pattern and (iii) genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.
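
    For orientation, the raw quantity behind any motif-abundance analysis is a simple occurrence count. The sketch below counts overlapping occurrences of a motif and checks whether the motif equals its own reverse complement, as GATC does, in which case a single-strand count already covers both strands. It is not the DistAMo algorithm, which according to the abstract uses codon redundancy; the function names and test strings are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def count_motif(seq, motif="GATC"):
    """Count occurrences of motif in seq, allowing overlaps."""
    seq, motif = seq.upper(), motif.upper()
    count = 0
    start = seq.find(motif)
    while start != -1:
        count += 1
        start = seq.find(motif, start + 1)  # step by one so overlaps are seen
    return count


def is_palindromic(motif):
    """True if the motif is identical to its reverse complement."""
    motif = motif.upper()
    return motif.translate(COMPLEMENT)[::-1] == motif


print(count_motif("GATCGATC"))
print(is_palindromic("GATC"))
```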

  14. DNA barcodes for ecology, evolution, and conservation.

    Science.gov (United States)

    Kress, W John; García-Robledo, Carlos; Uriarte, Maria; Erickson, David L

    2015-01-01

    The use of DNA barcodes, which are short gene sequences taken from a standardized portion of the genome and used to identify species, is entering a new phase of application as more and more investigations employ these genetic markers to address questions relating to the ecology and evolution of natural systems. The suite of DNA barcode markers now applied to specific taxonomic groups of organisms are proving invaluable for understanding species boundaries, community ecology, functional trait evolution, trophic interactions, and the conservation of biodiversity. The application of next-generation sequencing (NGS) technology will greatly expand the versatility of DNA barcodes across the Tree of Life, habitats, and geographies as new methodologies are explored and developed. Published by Elsevier Ltd.

  15. Human telomeric DNA: G-quadruplex, i-motif and Watson–Crick double helix

    Science.gov (United States)

    Phan, Anh Tuân; Mergny, Jean-Louis

    2002-01-01

    Human telomeric DNA composed of (TTAGGG/CCCTAA)n repeats may form a classical Watson–Crick double helix. Each individual strand is also prone to quadruplex formation: the G-rich strand may adopt a G-quadruplex conformation involving G-quartets whereas the C-rich strand may fold into an i-motif based on intercalated C·C+ base pairs. Using an equimolar mixture of the telomeric oligonucleotides d[AGGG(TTAGGG)3] and d[(CCCTAA)3CCCT], we defined which structures existed and which would be the predominant species under a variety of experimental conditions. Under near-physiological conditions of pH, temperature and salt concentration, telomeric DNA was predominantly in a double-helix form. However, at lower pH values or higher temperatures, the G-quadruplex and/or the i-motif efficiently competed with the duplex. We also present kinetic and thermodynamic data for duplex association and for G-quadruplex/i-motif unfolding. PMID:12409451
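
    The two oligonucleotides named in this abstract are exact reverse complements of each other, which is why an equimolar mixture can form a full duplex. A minimal check, using only the sequences written in the abstract (the variable and function names are illustrative):

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


g_strand = "AGGG" + "TTAGGG" * 3   # d[AGGG(TTAGGG)3]
c_strand = "CCCTAA" * 3 + "CCCT"   # d[(CCCTAA)3CCCT]

print(len(g_strand), len(c_strand))
print(reverse_complement(g_strand) == c_strand)
```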

  16. Identification of multiple distinct Snf2 subfamilies with conserved structural motifs.

    Science.gov (United States)

    Flaus, Andrew; Martin, David M A; Barton, Geoffrey J; Owen-Hughes, Tom

    2006-01-01

    The Snf2 family of helicase-related proteins includes the catalytic subunits of ATP-dependent chromatin remodelling complexes found in all eukaryotes. These act to regulate the structure and dynamic properties of chromatin and so influence a broad range of nuclear processes. We have exploited progress in genome sequencing to assemble a comprehensive catalogue of over 1300 Snf2 family members. Multiple sequence alignment of the helicase-related regions enables 24 distinct subfamilies to be identified, a considerable expansion over earlier surveys. Where information is known, there is a good correlation between biological or biochemical function and these assignments, suggesting Snf2 family motor domains are tuned for specific tasks. Scanning of complete genomes reveals all eukaryotes contain members of multiple subfamilies, whereas they are less common and not ubiquitous in eubacteria or archaea. The large sample of Snf2 proteins enables additional distinguishing conserved sequence blocks within the helicase-like motor to be identified. The establishment of a phylogeny for Snf2 proteins provides an opportunity to make informed assignments of function, and the identification of conserved motifs provides a framework for understanding the mechanisms by which these proteins function.

  17. An evolutionarily conserved glycine-tyrosine motif forms a folding core in outer membrane proteins.

    Directory of Open Access Journals (Sweden)

    Marcin Michalik

    An intimate interaction between a pair of amino acids, a tyrosine and glycine on neighboring β-strands, has been previously reported to be important for the structural stability of autotransporters. Here, we show that the conservation of this interacting pair extends to nearly all major families of outer membrane β-barrel proteins, which are thought to have originated through duplication events involving an ancestral ββ hairpin. We analyzed the function of this motif using the prototypical outer membrane protein OmpX. Stopped-flow fluorescence shows that two folding processes occur in the millisecond time regime, the rates of which are reduced in the tyrosine mutant. Folding assays further demonstrate a reduction in the yield of folded protein for the mutant compared to the wild-type, as well as a reduction in thermal stability. Taken together, our data support the idea of an evolutionarily conserved 'folding core' that affects the folding, membrane insertion, and thermal stability of outer membrane protein β-barrels.

  18. LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

    Science.gov (United States)

    Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

    2014-02-17

    As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of

  19. DNA methylation requires a DNMT1 ubiquitin interacting motif (UIM) and histone ubiquitination.

    Science.gov (United States)

    Qin, Weihua; Wolf, Patricia; Liu, Nan; Link, Stephanie; Smets, Martha; La Mastra, Federica; Forné, Ignasi; Pichler, Garwin; Hörl, David; Fellinger, Karin; Spada, Fabio; Bonapace, Ian Marc; Imhof, Axel; Harz, Hartmann; Leonhardt, Heinrich

    2015-08-01

    DNMT1 is recruited by PCNA and UHRF1 to maintain DNA methylation after replication. UHRF1 recognizes hemimethylated DNA substrates via the SRA domain, but also repressive H3K9me3 histone marks with its TTD. With systematic mutagenesis and functional assays, we could show that chromatin binding further involved UHRF1 PHD binding to unmodified H3R2. These complementation assays clearly demonstrated that the ubiquitin ligase activity of the UHRF1 RING domain is required for maintenance DNA methylation. Mass spectrometry of UHRF1-deficient cells revealed H3K18 as a novel ubiquitination target of UHRF1 in mammalian cells. With bioinformatics and mutational analyses, we identified a ubiquitin interacting motif (UIM) in the N-terminal regulatory domain of DNMT1 that binds to ubiquitinated H3 tails and is essential for DNA methylation in vivo. H3 ubiquitination and subsequent DNA methylation required UHRF1 PHD binding to H3R2. These results show the manifold regulatory mechanisms controlling DNMT1 activity that require the reading and writing of epigenetic marks by UHRF1 and illustrate the multifaceted interplay between DNA and histone modifications. The identification and functional characterization of the DNMT1 UIM suggests a novel regulatory principle and we speculate that histone H2AK119 ubiquitination might also lead to UIM-dependent recruitment of DNMT1 and DNA methylation beyond classic maintenance.

  20. Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

    Directory of Open Access Journals (Sweden)

    Launey Thomas

    2011-06-01

    Background: The interactions between PDZ (PSD-95, Dlg, ZO-1) domains and PDZ-binding motifs play central roles in signal transduction within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C-) terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins at the genome level. Results: Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V) or type-II (x-x-V-x-I/V) PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode). We first established that these PDZ-binding motifs are indeed preferentially present at the C-terminal ends of proteins. Moreover, we found specific amino acid (AA) bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant, since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions: Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.
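
    The two consensus patterns quoted in this abstract can be read as regular expressions anchored at the C-terminus. The sketch below takes the patterns exactly as written in the abstract (type-I x-x-S/T-x-I/L/V, type-II x-x-V-x-I/V) and only tests the three constrained C-terminal positions, since 'x' places no restriction. It does not implement the refined consensus sequences proposed by the authors, and the function name and example peptides are hypothetical.

```python
import re

# Only the last three residues are constrained; '$' anchors at the C-terminus.
TYPE_I = re.compile(r"[ST].[ILV]$")   # ...S/T-x-I/L/V
TYPE_II = re.compile(r"V.[IV]$")      # ...V-x-I/V


def classify_c_terminus(protein_seq):
    """Return the PDZ-binding motif classes matched by the C-terminal end."""
    seq = protein_seq.upper()
    hits = []
    if TYPE_I.search(seq):
        hits.append("type-I")
    if TYPE_II.search(seq):
        hits.append("type-II")
    return hits


# Hypothetical peptides chosen to end in each pattern.
print(classify_c_terminus("MKTAYESDV"))
print(classify_c_terminus("MKTAYEVKI"))
print(classify_c_terminus("MKTAYEAAA"))
```

    Anchoring with `$` matters: the same tripeptide in the middle of a protein would not count, in line with the abstract's point that these motifs act at the C-terminal end.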

  1. The conservation pattern of short linear motifs is highly correlated with the function of interacting protein domains

    Directory of Open Access Journals (Sweden)

    Wang Yiguo

    2008-10-01

    Background: Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often [...] Results: Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extracellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion: The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. By combining scoring matrices derived from peptide libraries with conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.

  2. Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

    Directory of Open Access Journals (Sweden)

    David Karlin

    Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P) plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second, alternatively expressed viral protein, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16 aa), several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role, and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains) that could be detected simply by comparing orthologous proteins.

  3. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze; Gehring, Christoph A; Irving, Helen R.

    2015-01-01

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  4. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze

    2015-06-09

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  5. qPMS7: a fast algorithm for finding (ℓ, d)-motifs in DNA and protein sequences.

    Directory of Open Access Journals (Sweden)

    Hieu Dinh

    Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motif discovery is an important problem in biology. For example, it is useful in the detection of transcription factor binding sites and transcriptional regulatory elements that are crucial in understanding gene function, human disease, drug design, etc. Many versions of the motif search problem have been proposed in the literature. One such is the (ℓ, d)-motif search (or Planted Motif Search (PMS)). A generalized version of the PMS problem, namely, Quorum Planted Motif Search (qPMS), is shown to accurately model motifs in real data. However, solving the qPMS problem is an extremely difficult task because a special case of it, the PMS Problem, is already NP-hard, which means that any algorithm solving it can be expected to take exponential time in the worst-case scenario. In this paper, we propose a novel algorithm named qPMS7 that tackles the qPMS problem on real data as well as challenging instances. Experimental results show that our Algorithm qPMS7 is on average 5 times faster than the state-of-the-art algorithm. The executable program of Algorithm qPMS7 is freely available on the web at http://pms.engr.uconn.edu/downloads/qPMS7.zip. Our online motif discovery tools that use Algorithm qPMS7 are freely available at http://pms.engr.uconn.edu or http://motifsearch.com.
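
    The quorum planted motif problem described in this abstract can be illustrated with a naive exhaustive search. The sketch below is not qPMS7; it simply enumerates every candidate string of length ℓ, which is only feasible for very small ℓ. The function names, the toy sequences, and the interpretation of q as a fraction of the input sequences are assumptions made for illustration.

```python
from itertools import product


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def occurs(motif, seq, d):
    """True if some window of seq is within Hamming distance d of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def quorum_planted_motifs(seqs, l, d, q=1.0, alphabet="ACGT"):
    """Brute-force (l, d)-motif search with quorum q (assumed to be a fraction).

    Returns every string of length l that occurs, with at most d mismatches,
    in at least q * len(seqs) of the sequences. Runtime grows as
    len(alphabet) ** l, so this is a teaching sketch only.
    """
    need = q * len(seqs)
    hits = []
    for cand in product(alphabet, repeat=l):
        motif = "".join(cand)
        if sum(occurs(motif, s, d) for s in seqs) >= need:
            hits.append(motif)
    return hits


# Toy input: ACGT is planted with at most one mismatch in the first three.
toy = ["TTACGTTT", "GGACCTGG", "CCAGGTCC", "AAAAAAAA"]
strict = quorum_planted_motifs(toy, l=4, d=1, q=1.0)
relaxed = quorum_planted_motifs(toy, l=4, d=1, q=0.75)
```

    With q = 1.0 the search reduces to the plain PMS problem; lowering q lets a motif be reported even when some sequences lack it, which is the sense in which the quorum version is described as a better fit for real data.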

  6. Novel essential residues of Hda for interaction with DnaA in the regulatory inactivation of DnaA: unique roles for Hda AAA Box VI and VII motifs.

    Science.gov (United States)

    Nakamura, Kenta; Katayama, Tsutomu

    2010-04-01

    Escherichia coli ATP-DnaA initiates chromosomal replication. For preventing extra-initiations, a complex of ADP-Hda and the DNA-loaded replicase clamp promotes DnaA-ATP hydrolysis, yielding inactive ADP-DnaA. However, the Hda-DnaA interaction mode remains unclear except that the Hda Box VII Arg finger (Arg-153) and DnaA sensor II Arg-334 within each AAA(+) domain are crucial for the DnaA-ATP hydrolysis. Here, we demonstrate that direct and functional interaction of ADP-Hda with DnaA requires the Hda residues Ser-152, Phe-118 and Asn-122 as well as Hda Arg-153 and DnaA Arg-334. Structural analyses suggest intermolecular interactions between Hda Ser-152 and DnaA Arg-334 and between Hda Phe-118 and the DnaA Walker B motif region, in addition to an intramolecular interaction between Hda Asn-122 and Arg-153. These interactions likely sustain a specific association of ADP-Hda and DnaA, promoting DnaA-ATP hydrolysis. Consistently, ATP-DnaA and ADP-DnaA interact with the ADP-Hda-DNA-clamp complex with similar affinities. Hda Phe-118 and Asn-122 are contained in the Box VI region, and their hydrophobic and electrostatic features are basically conserved in the corresponding residues of other AAA(+) proteins, suggesting a conserved role for Box VI. These findings indicate novel interaction mechanisms for Hda-DnaA as well as a potentially fundamental mechanism in AAA(+) protein interactions.

  7. A conserved MCM single-stranded DNA binding element is essential for replication initiation.

    Science.gov (United States)

    Froelich, Clifford A; Kang, Sukhyun; Epling, Leslie B; Bell, Stephen P; Enemark, Eric J

    2014-04-01

    The ring-shaped MCM helicase is essential to all phases of DNA replication. The complex loads at replication origins as an inactive double-hexamer encircling duplex DNA. Helicase activation converts this species to two active single hexamers that encircle single-stranded DNA (ssDNA). The molecular details of MCM DNA interactions during these events are unknown. We determined the crystal structure of the Pyrococcus furiosus MCM N-terminal domain hexamer bound to ssDNA and define a conserved MCM-ssDNA binding motif (MSSB). Intriguingly, ssDNA binds the MCM ring interior perpendicular to the central channel with defined polarity. In eukaryotes, the MSSB is conserved in several Mcm2-7 subunits, and MSSB mutant combinations in S. cerevisiae Mcm2-7 are not viable. Mutant Mcm2-7 complexes assemble and are recruited to replication origins, but are defective in helicase loading and activation. Our findings identify an important MCM-ssDNA interaction and suggest it functions during helicase activation to select the strand for translocation. DOI: http://dx.doi.org/10.7554/eLife.01993.001.

  8. Interleukin-11 binds specific EF-hand proteins via their conserved structural motifs.

    Science.gov (United States)

    Kazakov, Alexei S; Sokolov, Andrei S; Vologzhannikova, Alisa A; Permyakova, Maria E; Khorn, Polina A; Ismailov, Ramis G; Denessiouk, Konstantin A; Denesyuk, Alexander I; Rastrygina, Victoria A; Baksheeva, Viktoriia E; Zernii, Evgeni Yu; Zinchenko, Dmitry V; Glazatov, Vladimir V; Uversky, Vladimir N; Mirzabekov, Tajib A; Permyakov, Eugene A; Permyakov, Sergei E

    2017-01-01

    Interleukin-11 (IL-11) is a hematopoietic cytokine engaged in numerous biological processes and validated as a target for treatment of various cancers. IL-11 contains intrinsically disordered regions that might recognize multiple targets. Recently we found that aside from IL-11RA and gp130 receptors, IL-11 interacts with calcium sensor protein S100P. Strict calcium dependence of this interaction suggests a possibility of IL-11 interaction with other calcium sensor proteins. Here we probed specificity of IL-11 to calcium-binding proteins of various types: calcium sensors of the EF-hand family (calmodulin, S100B and neuronal calcium sensors: recoverin, NCS-1, GCAP-1, GCAP-2), calcium buffers of the EF-hand family (S100G, oncomodulin), and a non-EF-hand calcium buffer (α-lactalbumin). A specific subset of the calcium sensor proteins (calmodulin, S100B, NCS-1, GCAP-1/2) exhibits metal-dependent binding of IL-11 with dissociation constants of 1-19 μM. These proteins share several amino acid residues belonging to conservative structural motifs of the EF-hand proteins, 'black' and 'gray' clusters. Replacements of the respective S100P residues by alanine drastically decrease its affinity to IL-11, suggesting their involvement into the association process. Secondary structure and accessibility of the hinge region of the EF-hand proteins studied are predicted to control specificity and selectivity of their binding to IL-11. The IL-11 interaction with the EF-hand proteins is expected to occur under numerous pathological conditions, accompanied by disintegration of plasma membrane and efflux of cellular components into the extracellular milieu.

  9. Extensive Mutagenesis of the Conserved Box E Motif in Duck Hepatitis B Virus P Protein Reveals Multiple Functions in Replication and a Common Structure with the Primer Grip in HIV-1 Reverse Transcriptase

    OpenAIRE

    Wang, Yong-Xiang; Luo, Cheng; Zhao, Dan; Beck, Jürgen; Nassal, Michael

    2012-01-01

    Hepadnaviruses, including the pathogenic hepatitis B virus (HBV), replicate their small DNA genomes through protein-primed reverse transcription, mediated by the terminal protein (TP) domain in their P proteins and an RNA stem-loop, ϵ, on the pregenomic RNA (pgRNA). No direct structural data are available for P proteins, but their reverse transcriptase (RT) domains contain motifs that are conserved in all RTs (box A to box G), implying a similar architecture; however, experimental support for...

  10. Discovery of cell-type specific DNA motif grammar in cis-regulatory elements using random Forest.

    Science.gov (United States)

    Wang, Xin; Lin, Peijie; Ho, Joshua W K

    2018-01-19

    It has been observed that many transcription factors (TFs) can bind to different genomic loci depending on the cell type in which a TF is expressed in, even though the individual TF usually binds to the same core motif in different cell types. How a TF can bind to the genome in such a highly cell-type specific manner, is a critical research question. One hypothesis is that a TF requires co-binding of different TFs in different cell types. If this is the case, it may be possible to observe different combinations of TF motifs - a motif grammar - located at the TF binding sites in different cell types. In this study, we develop a bioinformatics method to systematically identify DNA motifs in TF binding sites across multiple cell types based on published ChIP-seq data, and address two questions: (1) can we build a machine learning classifier to predict cell-type specificity based on motif combinations alone, and (2) can we extract meaningful cell-type specific motif grammars from this classifier model. We present a Random Forest (RF) based approach to build a multi-class classifier to predict the cell-type specificity of a TF binding site given its motif content. We applied this RF classifier to two published ChIP-seq datasets of TF (TCF7L2 and MAX) across multiple cell types. Using cross-validation, we show that motif combinations alone are indeed predictive of cell types. Furthermore, we present a rule mining approach to extract the most discriminatory rules in the RF classifier, thus allowing us to discover the underlying cell-type specific motif grammar. Our bioinformatics analysis supports the hypothesis that combinatorial TF motif patterns are cell-type specific.

  11. Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

    Science.gov (United States)

    Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

    2013-01-01

    DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298

  12. A CACGTG motif of the Antirrhinum majus chalcone synthase promoter is recognized by an evolutionarily conserved nuclear protein

    International Nuclear Information System (INIS)

    Staiger, D.; Kaulen, H.; Schell, J.

    1989-01-01

    In the chalcone synthase gene of Antirrhinum majus (snapdragon), 150 base pairs of the 5' flanking region contain cis-acting signals for UV light-induced expression. A nuclear factor, designated CG-1, specifically recognizes a hexameric motif with internal dyad symmetry, CACGTG, located within this light-responsive sequence. Binding of CG-1 is influenced by C-methylation of the CpG dinucleotide in the recognition sequence. CG-1 is a factor found in a variety of dicotyledonous plant species including Nicotiana tabacum, A. majus, Petunia hybrida, Arabidopsis thaliana, and Glycine max. CACGTG motifs contained within trans-acting factor recognition sites in various other plant promoters can interact with CG-1. In addition, the binding site of the human adenovirus major late transcription factor USF can compete for CG-1 binding to the chalcone synthase promoter. This suggests an evolutionary conservation of trans-acting factor recognition sites involved in divergent mechanisms of gene control. (author)
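
    The "internal dyad symmetry" of the CACGTG hexamer means that the motif equals its own reverse complement. A minimal Python sketch of that check, together with a simple scan for exact occurrences, is given below; the helper names and the example sequence are hypothetical and are not taken from the record.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def has_dyad_symmetry(seq):
    """True if the sequence reads the same on both strands (5' to 3')."""
    return seq == reverse_complement(seq)


def find_sites(seq, motif="CACGTG"):
    """0-based start positions of exact motif matches, overlaps included."""
    return [i for i in range(len(seq) - len(motif) + 1)
            if seq[i:i + len(motif)] == motif]


# Hypothetical promoter fragment containing one CACGTG site.
fragment = "AACACGTGTT"
sites = find_sites(fragment)
```

    Because the motif is its own reverse complement, scanning one strand is sufficient; every hit on the forward strand is also a hit at the same location on the reverse strand.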

  13. Role of specific cations and water entropy on the stability of branched DNA motif structures.

    Science.gov (United States)

    Pascal, Tod A; Goddard, William A; Maiti, Prabal K; Vaidehi, Nagarajan

    2012-10-11

    DNA three-way junctions (TWJs) are important intermediates in various cellular processes and are the simplest of a family of branched nucleic acids being considered as scaffolds for biomolecular nanotechnology. Branched nucleic acids are stabilized by divalent cations such as Mg(2+), presumably due to condensation and neutralization of the negatively charged DNA backbone. However, electrostatic screening effects point to more complex solvation dynamics and a large role of interfacial waters in thermodynamic stability. Here, we report extensive computer simulations in explicit water and salt on a model TWJ and use free energy calculations to quantify the role of ionic character and strength on stability. We find that enthalpic stabilization of the first and second hydration shells by Mg(2+) accounts for 1/3 and all of the free energy gain in 50% and pure MgCl(2) solutions, respectively. The more distorted DNA molecule is actually destabilized in pure MgCl(2) compared to pure NaCl. Notably, the first shell, interfacial waters have very low translational and rotational entropy (i.e., mobility) compared to the bulk, an entropic loss that is overcompensated by increased enthalpy from additional electrostatic interactions with Mg(2+). In contrast, the second hydration shell has anomalously high entropy as it is trapped between an immobile and bulklike layer. The nonmonotonic entropic signature and long-range perturbations of the hydration shells to Mg(2+) may have implications in the molecular recognition of these motifs. For example, we find that low salt stabilizes the parallel configuration of the three-way junction, whereas at normal salt we find antiparallel configurations deduced from the NMR. We use the 2PT analysis to follow the thermodynamics of this transition and find that the free energy barrier is dominated by entropic effects that result from the decreased surface area of the antiparallel form which has a smaller number of low entropy waters in the first

  14. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    International Nuclear Information System (INIS)

    Westberg, Johan A.; Jiang, Ji; Andersson, Leif C.

    2011-01-01

    Highlights: → Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. → Central iron atom of heme and cysteine-114 of STC1 are essential for binding. → STC1 binds Fe2+ and Fe3+ heme. → STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys114 as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H2O2 induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  15. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    Energy Technology Data Exchange (ETDEWEB)

    Westberg, Johan A., E-mail: johan.westberg@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Jiang, Ji, E-mail: ji.jiang@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Andersson, Leif C., E-mail: leif.andersson@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland)

    2011-06-03

    Highlights: → Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. → Central iron atom of heme and cysteine-114 of STC1 are essential for binding. → STC1 binds Fe2+ and Fe3+ heme. → STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys114 as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H2O2 induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  16. MicroRNA genes preferentially expressed in dendritic cells contain sites for conserved transcription factor binding motifs in their promoters

    Directory of Open Access Journals (Sweden)

    Huynen Martijn A

    2011-06-01

    Background: MicroRNAs (miRNAs) play a fundamental role in the regulation of gene expression by translational repression or target mRNA degradation. Regulatory elements in miRNA promoters are less well studied, but may reveal a link between their expression and a specific cell type. Results: To explore this link in myeloid cells, miRNA expression profiles were generated from monocytes and dendritic cells (DCs). Differences in miRNA expression among monocytes, DCs and their stimulated progeny were observed. Furthermore, putative promoter regions of miRNAs that are significantly up-regulated in DCs were screened for Transcription Factor Binding Sites (TFBSs) based on TFBS motif matching score, the degree to which those TFBSs are over-represented in the promoters of the up-regulated miRNAs, and the extent of conservation of the TFBSs in mammals. Conclusions: Analysis of evolutionarily conserved TFBSs in DC promoters revealed preferential clustering of sites within 500 bp upstream of the precursor miRNAs and that many mRNAs of cognate TFs of the conserved TFBSs were indeed expressed in the DCs. Taken together, our data provide evidence that selected miRNAs expressed in DCs have evolutionarily conserved TFBSs relevant to DC biology in their promoters.

  17. Plant DNA banks for genetic resources conservation (review)

    Directory of Open Access Journals (Sweden)

    Н. Е. Волкова

    2016-12-01

    Purpose. Literature review of DNA bank creation as the current strategy of plant genetic resources conservation. Results. The current state of plant genetic resources conservation was analyzed in the context of the threat of genetic erosion. The importance of DNA banks, whose function is to store DNA samples and associated products and disseminate them for research purposes, was shown. The main DNA banks in the world were described, including the Republican DNA Bank of Human, Animals, Plants and Microorganisms at the Institute of Genetics and Cytology of the National Academy of Sciences of Belarus. Stages of DNA banking were considered: tissue sampling (usually from leaves), cell destruction, DNA extraction, DNA storage. Different methods of tissue sampling, extraction and DNA storage were compared. The need for Plant DNA Bank creation in Ukraine was highlighted. Conclusions. DNA collections are an important resource in the global effort to overcome the crisis in biodiversity, for managing world genetic resources and maximizing their potential.

  18. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

    Science.gov (United States)

    König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

    2013-01-01

    G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141

  19. Reversible Redox Activity by Ion-pH Dually Modulated Duplex Formation of i-Motif DNA with Complementary G-DNA

    Directory of Open Access Journals (Sweden)

    Soyoung Chang

    2018-04-01

    The unique biological features of supramolecular DNA have led to an increasing interest in biomedical applications such as biosensors. We have developed an i-motif and G-rich DNA conjugated single-walled carbon nanotube hybrid material, which shows reversible conformational switching upon external stimuli such as pH (5 and 8) and the presence of ions (Li+ and K+). We observed reversible electrochemical redox activity upon external stimuli in a quick and robust manner. Given the ease and the robustness of this method, we believe that pH- and ion-driven reversible DNA structure transformations will be utilized in future applications for developing novel biosensors.

  20. Design of character-based DNA barcode motif for species identification: A computational approach and its validation in fishes.

    Science.gov (United States)

    Chakraborty, Mohua; Dhar, Bishal; Ghosh, Sankar Kumar

    2017-11-01

    DNA barcodes are generally interpreted using distance-based and character-based methods. The former uses clustering of comparable groups, based on the relative genetic distance, while the latter is based on the presence or absence of discrete nucleotide substitutions. The distance-based approach has a limitation in defining a universal species boundary across the taxa as the rate of mtDNA evolution is not constant throughout the taxa. However, the character-based approach defines this more accurately using a unique set of nucleotide characters. The character-based analysis of full-length barcode has some inherent limitations, like sequencing of the full-length barcode, use of a sparse-data matrix and lack of a uniform diagnostic position for each group. A short continuous stretch of a fragment can be used to resolve the limitations. Here, we observe that a 154-bp fragment, from the transversion-rich domain of 1367 COI barcode sequences can successfully delimit species in the three most diverse orders of freshwater fishes. This fragment is used to design species-specific barcode motifs for 109 species by the character-based method, which successfully identifies the correct species using a pattern-matching program. The motifs also correctly identify geographically isolated populations of the Cypriniformes species. Further, this region is validated as a species-specific mini-barcode for freshwater fishes by successful PCR amplification and sequencing of the motif (154 bp) using the designed primers. We anticipate that use of such motifs will enhance the diagnostic power of DNA barcode, and the mini-barcode approach will greatly benefit the field-based system of rapid species identification. © 2017 John Wiley & Sons Ltd.
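
    The character-based idea in this abstract, identifying species by the presence of discrete nucleotide characters at aligned positions, can be sketched roughly as follows. This is a simplified illustration and not the authors' pipeline: it assumes one aligned, equal-length fragment per species and reports only single-position characters that occur in exactly one species. The species labels and sequences are invented.

```python
from collections import defaultdict


def diagnostic_characters(aligned):
    """Map each species to the (position, base) pairs unique to it.

    `aligned` maps a species label to one aligned fragment; all fragments
    are assumed to have the same length.
    """
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("fragments must be aligned to the same length")

    # Collect, for every (position, base), the species that carry it.
    carriers = defaultdict(list)
    for species, seq in aligned.items():
        for pos, base in enumerate(seq):
            carriers[(pos, base)].append(species)

    result = {species: [] for species in aligned}
    for (pos, base), owners in sorted(carriers.items()):
        if len(owners) == 1:  # character seen in exactly one species
            result[owners[0]].append((pos, base))
    return result


# Invented four-base "fragments" for three hypothetical species.
toy = {"sp1": "ACGT", "sp2": "ACCT", "sp3": "TCGT"}
chars = diagnostic_characters(toy)
```

    In this toy input sp1 has no single-position diagnostic character, which illustrates why a species-specific motif typically combines several positions rather than relying on one.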

  1. Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

    Energy Technology Data Exchange (ETDEWEB)

    Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Waleń, Tomasz [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); University of Warsaw, Banacha 2, 02-097 Warsaw (Poland); Piątkowski, Paweł; Potrzebowski, Wojciech [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Bujnicki, Janusz M. [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Adam Mickiewicz University, Umultowska 89, 61-614 Poznan (Poland)

    2015-03-01

    A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.

  2. Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

    International Nuclear Information System (INIS)

    Chojnowski, Grzegorz; Waleń, Tomasz; Piątkowski, Paweł; Potrzebowski, Wojciech; Bujnicki, Janusz M.

    2015-01-01

    A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx

  3. Sequence and structural analysis of the chitinase insertion domain reveals two conserved motifs involved in chitin-binding.

    Directory of Open Access Journals (Sweden)

    Hai Li

    2010-01-01

    Full Text Available Chitinases are prevalent in life and are found in species including archaea, bacteria, fungi, plants, and animals. They break down chitin, which is the second most abundant carbohydrate in nature after cellulose. Hence, they are important for maintaining a balance between carbon and nitrogen trapped as insoluble chitin in biomass. Chitinases are classified into two families, 18 and 19 glycoside hydrolases. In addition to a catalytic domain, which is a triosephosphate isomerase barrel, many family 18 chitinases contain another module, i.e., chitinase insertion domain. While numerous studies focus on the biological role of the catalytic domain in chitinase activity, the function of the chitinase insertion domain is not completely understood. Bioinformatics offers an important avenue in which to facilitate understanding the role of residues within the chitinase insertion domain in chitinase function. Twenty-seven chitinase insertion domain sequences, which include four experimentally determined structures and span five kingdoms, were aligned and analyzed using a modified sequence entropy parameter. Thirty-two positions with conserved residues were identified. The role of these conserved residues was explored by conducting a structural analysis of a number of holo-enzymes. Hydrogen bonding and van der Waals calculations revealed a distinct subset of four conserved residues constituting two sequence motifs that interact with oligosaccharides. The other conserved residues may be key to the structure, folding, and stability of this domain. Sequence and structural studies of the chitinase insertion domains conducted within the framework of evolution identified four conserved residues which clearly interact with the substrates. Furthermore, evolutionary studies propose a link between the appearance of the chitinase insertion domain and the function of family 18 chitinases in the subfamily A.
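
    The abstract above scores alignment positions with a "modified sequence entropy parameter" whose exact form is not given here. As a rough illustration of the general idea only, the sketch below computes plain Shannon entropy per alignment column and flags low-entropy columns as conserved. The function names, the 0.5-bit threshold, and the toy alignment are all assumptions made for illustration; they are not the authors' method or data.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of one alignment column, ignoring gap characters."""
    residues = [c for c in column if c != "-"]
    if not residues:
        # An all-gap column carries no residue information; never call it conserved.
        return math.inf
    counts = Counter(residues)
    total = len(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def conserved_positions(alignment, max_entropy=0.5):
    """Return 0-based column indices whose entropy is at or below max_entropy.

    max_entropy is an arbitrary illustrative threshold, not a published value.
    """
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    return [
        i
        for i in range(length)
        if column_entropy([seq[i] for seq in alignment]) <= max_entropy
    ]


# Toy alignment with made-up sequences (not taken from the study).
toy_alignment = ["GWYDA", "GWFDS", "GWYDT", "GWLDA"]
print(conserved_positions(toy_alignment))  # columns 0, 1 and 3 are invariant
```

    A real analysis would typically weight residues by physicochemical similarity or correct for sampling bias, which is presumably what a modified entropy measure addresses; plain Shannon entropy treats every substitution as equally different.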

  4. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

    Directory of Open Access Journals (Sweden)

    Pesole Graziano

    2007-02-01

    Full Text Available Abstract Background This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. Results We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. Conclusion Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.

  5. DNA in the conservation and management of African antelope

    DEFF Research Database (Denmark)

    Lorenzen, Eline

    2016-01-01

    tool in informed species conservation and sustainable wildlife management. The movement of antelope through translocations, reintroductions, and population augmentations is common practice in wildlife management. DNA-led species identification using genetic barcoding is an effective use of genetic data...... within forensics. DNA barcoding is a taxonomic method that uses a short genetic marker in an organism's DNA to identify it as belonging to a particular species....... databases, and represents a valuable reference database of antelope DNA diversity. For the evolution of antelope, sub-Saharan Africa is a region of particular intrigue. The geographic regions of sub-Saharan Africa represent unique evolutionary scenarios. Molecular data have become an increasingly important...

  6. Glycine in the conserved motif III modulates the thermostability and oxidative stress resistance of peptide deformylase in Mycobacterium tuberculosis.

    Science.gov (United States)

    Narayanan, Sai Shyam; Sokkar, Pandian; Ramachandran, Murugesan; Nampoothiri, Kesavan Madhavan

    2011-07-01

    Peptide deformylase (PDF) catalyses the removal of the N-formyl group from the nascent polypeptide during protein maturation. The PDF of Mycobacterium tuberculosis H37Rv (MtbPDF), overexpressed and purified from Escherichia coli, was characterized as an iron-containing enzyme with stability towards H2O2 and moderate thermostability. Substitution of two conserved residues (G49 and L107) from MtbPDF with the corresponding residues found in human PDF affected its deformylase activity. Among characterized PDFs, glycine (G151) in motif III instead of conserved aspartate is characteristic of M. tuberculosis. Although the G151D mutation in MtbPDF increased its deformylase activity and thermostability, it also affected enzyme stability towards H2O2. Molecular dynamics and docking results confirmed improved substrate binding and catalysis for the G151D mutant and the study provides another possible molecular basis for the stability of MtbPDF against oxidizing agents. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  7. An effective approach for annotation of protein families with low sequence similarity and conserved motifs: identifying GDSL hydrolases across the plant kingdom.

    Science.gov (United States)

    Vujaklija, Ivan; Bielen, Ana; Paradžik, Tina; Biđin, Siniša; Goldstein, Pavle; Vujaklija, Dušica

    2016-02-18

    The massive accumulation of protein sequences arising from the rapid development of high-throughput sequencing, coupled with automatic annotation, results in high levels of incorrect annotations. In this study, we describe an approach to decrease annotation errors of protein families characterized by low overall sequence similarity. The GDSL lipolytic family comprises proteins with multifunctional properties and high potential for pharmaceutical and industrial applications. The number of proteins assigned to this family has increased rapidly over the last few years. In particular, the natural abundance of GDSL enzymes reported recently in plants indicates that they could be a good source of novel GDSL enzymes. We noticed that a significant proportion of annotated sequences lack specific GDSL motif(s) or catalytic residue(s). Here, we applied motif-based sequence analyses to identify enzymes possessing conserved GDSL motifs in selected proteomes across the plant kingdom. Motif-based HMM scanning (Viterbi decoding-VD and posterior decoding-PD) and the here described PD/VD protocol were successfully applied on 12 selected plant proteomes to identify sequences with GDSL motifs. A significant number of identified GDSL sequences were novel. Moreover, our scanning approach successfully detected protein sequences lacking at least one of the essential motifs (171/820) annotated by Pfam profile search (PfamA) as GDSL. Based on these analyses we provide a curated list of GDSL enzymes from the selected plants. CLANS clustering and phylogenetic analysis helped us to gain a better insight into the evolutionary relationship of all identified GDSL sequences. Three novel GDSL subfamilies as well as unreported variations in GDSL motifs were discovered in this study. In addition, analyses of selected proteomes showed a remarkable expansion of GDSL enzymes in the lycophyte, Selaginella moellendorffii. 
Finally, we provide a general motif-HMM scanner which is easily accessible through

  8. Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.

    Directory of Open Access Journals (Sweden)

    Michael Allevato

    Full Text Available The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.
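
    The three sequence classes named in this abstract (the generic E-box CANNTG, the canonical MYC E-box CACGTG, and the non-E-box hexamer AACGTT) can be located in a sequence with simple pattern matching. The sketch below is only an illustration of that lookup: the function name, the labels, and the toy sequence are assumptions, and it says nothing about binding affinity, which the study measured experimentally. All three patterns are their own reverse complements, so scanning one strand is sufficient.

```python
import re

# Generic E-box CANNTG; the lookahead lets overlapping sites be reported.
EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")
CME = "CACGTG"       # canonical MYC E-box named in the abstract
NON_EBOX = "AACGTT"  # low-affinity non-E-box hexamer named in the abstract


def find_sites(sequence):
    """Return sorted (position, site, label) tuples for E-box and AACGTT sites."""
    seq = sequence.upper()
    hits = []
    for match in EBOX.finditer(seq):
        site = match.group(1)
        label = "CME" if site == CME else "E-box variant"
        hits.append((match.start(), site, label))
    start = seq.find(NON_EBOX)
    while start != -1:
        hits.append((start, NON_EBOX, "non-E-box"))
        start = seq.find(NON_EBOX, start + 1)
    return sorted(hits)


# Toy sequence invented for illustration; motifs shown in upper case.
toy = "ttCACGTGaaCAGCTGccAACGTTgg"
for position, site, label in find_sites(toy):
    print(position, site, label)
```

    On the toy sequence this reports a CME at position 2, an E-box variant (CAGCTG) at position 10, and the non-E-box site at position 18, all 0-based.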

  9. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Science.gov (United States)

    Grimm, Guido W.; Renner, Susanne S.; Stamatakis, Alexandros; Hemleben, Vera

    2007-01-01

    The multi-copy internal transcribed spacer (ITS) region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML) and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation) instead of the full (partly redundant) original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994) 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly. PMID:19455198

  10. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Directory of Open Access Journals (Sweden)

    Guido W. Grimm

    2006-01-01

    Full Text Available The multi-copy internal transcribed spacer (ITS) region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML) and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation) instead of the full (partly redundant) original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994) 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly.

  11. Mouse transgenesis identifies conserved functional enhancers and cis-regulatory motif in the vertebrate LIM homeobox gene Lhx2 locus.

    Directory of Open Access Journals (Sweden)

    Alison P Lee

    Full Text Available The vertebrate Lhx2 is a member of the LIM homeobox family of transcription factors. It is essential for the normal development of the forebrain, eye, olfactory system and liver as well as for the differentiation of lymphoid cells. However, despite the highly restricted spatio-temporal expression pattern of Lhx2, nothing is known about its transcriptional regulation. In mammals and chicken, Crb2, Dennd1a and Lhx2 constitute a conserved linkage block, while the intervening Dennd1a is lost in the fugu Lhx2 locus. To identify functional enhancers of Lhx2, we predicted conserved noncoding elements (CNEs) in the human, mouse and fugu Crb2-Lhx2 loci and assayed their function in transgenic mouse at E11.5. Four of the eight CNE constructs tested functioned as tissue-specific enhancers in specific regions of the central nervous system and the dorsal root ganglia (DRG), recapitulating partial and overlapping expression patterns of Lhx2 and Crb2 genes. There was considerable overlap in the expression domains of the CNEs, which suggests that the CNEs are either redundant enhancers or regulating different genes in the locus. Using a large set of CNEs (810 CNEs) associated with transcription factor-encoding genes that express predominantly in the central nervous system, we predicted four over-represented 8-mer motifs that are likely to be associated with expression in the central nervous system. Mutation of one of them in a CNE that drove reporter expression in the neural tube and DRG abolished expression in both domains indicating that this motif is essential for expression in these domains. The failure of the four functional enhancers to recapitulate the complete expression pattern of Lhx2 at E11.5 indicates that there must be other Lhx2 enhancers that are either located outside the region investigated or divergent in mammals and fishes. Other approaches such as sequence comparison between multiple mammals are required to identify and characterize such enhancers.

  12. DndEi Exhibits Helicase Activity Essential for DNA Phosphorothioate Modification and ATPase Activity Strongly Stimulated by DNA Substrate with a GAAC/GTTC Motif.

    Science.gov (United States)

    Zheng, Tao; Jiang, Pan; Cao, Bo; Cheng, Qiuxiang; Kong, Lingxin; Zheng, Xiaoqing; Hu, Qinghai; You, Delin

    2016-01-15

    Phosphorothioate (PT) modification of DNA, in which the non-bridging oxygen of the backbone phosphate group is replaced by sulfur, is governed by the DndA-E proteins in prokaryotes. To better understand the biochemical mechanism of PT modification, functional analysis of the recently found PT-modifying enzyme DndEi, which has an additional domain compared with canonical DndE, from Riemerella anatipestifer is performed in this study. The additional domain is identified as a DNA helicase, and functional deletion of this domain in vivo leads to PT modification deficiency, indicating an essential role of helicase activity in PT modification. Subsequent analysis reveals that the additional domain has an ATPase activity. Intriguingly, the ATPase activity is strongly stimulated by DNA substrate containing a GAAC/GTTC motif (i.e. the motif at which PT modifications occur in R. anatipestifer) when the additional domain and the other domain (homologous to canonical DndE) are co-expressed as a full-length DndEi. These results reveal that PT modification is a biochemical process with DNA strand separation and intense ATP hydrolysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
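
    The GAAC/GTTC notation in this abstract describes one double-stranded site: GTTC is the reverse complement of GAAC, so the same site reads GAAC on one strand and GTTC on the other. The sketch below only illustrates that relationship by scanning a toy sequence and reporting which strand carries GAAC. The function names and the toy sequence are assumptions for illustration and are unrelated to the experimental substrates used in the study.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def motif_positions(seq, motif="GAAC"):
    """Return (position, strand) pairs; '-' means the motif lies on the opposite strand."""
    seq = seq.upper()
    motif = motif.upper()
    rc = reverse_complement(motif)
    hits = []
    for i in range(len(seq) - len(motif) + 1):
        window = seq[i:i + len(motif)]
        if window == motif:
            hits.append((i, "+"))
        elif window == rc:
            hits.append((i, "-"))
    return hits


print(reverse_complement("GAAC"))         # GTTC
print(motif_positions("ATGAACCTGTTCAA"))  # toy sequence, invented for illustration
```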

  13. Bound water at protein-protein interfaces: partners, roles and hydrophobic bubbles as a conserved motif.

    Directory of Open Access Journals (Sweden)

    Mostafa H Ahmed

    Full Text Available There is a great interest in understanding and exploiting protein-protein associations as new routes for treating human disease. However, these associations are difficult to structurally characterize or model although the number of X-ray structures for protein-protein complexes is expanding. One feature of these complexes that has received little attention is the role of water molecules in the interfacial region. A data set of 4741 water molecules abstracted from 179 high-resolution (≤ 2.30 Å) X-ray crystal structures of protein-protein complexes was analyzed with a suite of modeling tools based on the HINT forcefield and hydrogen-bonding geometry. A metric termed Relevance was used to classify the general roles of the water molecules. The water molecules were found to be involved in: (a) bridging interactions with both proteins (21%), (b) favorable interactions with only one protein (53%), and (c) no interactions with either protein (26%). This trend is shown to be independent of the crystallographic resolution. Interactions with residue backbones are consistent for all classes and account for 21.5% of all interactions. Interactions with polar residues are significantly more common for the first group and interactions with non-polar residues dominate the last group. Waters interacting with both proteins stabilize on average the proteins' interaction (-0.46 kcal mol⁻¹), but the overall average contribution of a single water to the protein-protein interaction energy is unfavorable (+0.03 kcal mol⁻¹). Analysis of the waters without favorable interactions with either protein suggests that this is a conserved phenomenon: 42% of these waters have SASA ≤ 10 Å² and are thus largely buried, and 69% of these are within predominantly hydrophobic environments or "hydrophobic bubbles". Such water molecules may have an important biological purpose in mediating protein-protein interactions.

  14. Improvement of the Immunogenicity of Porcine Circovirus Type 2 DNA Vaccine by Recombinant ORF2 Gene and CpG Motifs.

    Science.gov (United States)

    Li, Jun; Shi, Jian-Li; Wu, Xiao-Yan; Fu, Fang; Yu, Jiang; Yuan, Xiao-Yuan; Peng, Zhe; Cong, Xiao-Yan; Xu, Shao-Jian; Sun, Wen-Bo; Cheng, Kai-Hui; Du, Yi-Jun; Wu, Jia-Qiang; Wang, Jin-Bao; Huang, Bao-Hua

    2015-06-01

    Adjuvants remain important for boosting immunity and improving resistance in animals. To boost the immunity induced by the porcine circovirus type 2 (PCV2) DNA vaccine, CpG motifs were inserted. In this study, the dose-effect was studied, and the immunity of PCV2 DNA vaccines by recombinant open reading frame 2 (ORF2) gene and CpG motifs was evaluated. Three-week-old Changbai piglets were inoculated intramuscularly with 200 μg, 400 μg, and 800 μg DNA vaccines containing 14 and 18 CpG motifs, respectively. Average gain and rectal temperature were recorded every day during the experiments. Blood was collected from the piglets every week after vaccination to detect changes in specific antibodies, interleukin-2, and immune cells. Tissues were collected for histopathology and polymerase chain reaction. The results indicated that, compared to the control piglets, all concentrations of the two DNA vaccines could induce PCV2-specific antibodies. A cellular immunity test showed that PCV2-specific lymphocytes proliferated and that the numbers of TH, TC, and CD3+ T-cells rose in the blood of the DNA vaccine immune groups. No distinct pathological damage or viremia occurred in pigs inoculated with the DNA vaccines, but there was some minor pathological damage in the control group. The results demonstrated that CpG motifs as an adjuvant could boost the humoral and cellular immunity of pigs to PCV2, especially in terms of cellular immunity. Of the two DNA vaccines constructed, the one containing 18 CpG motifs was more effective. This is the first report that CpG motifs inserted into the PCV2 DNA vaccine as an adjuvant could boost immunity.

  15. Loss of a highly conserved sterile alpha motif domain gene (WEEP) results in pendulous branch growth in peach trees.

    Science.gov (United States)

    Hollender, Courtney A; Pascal, Thierry; Tabb, Amy; Hadiarto, Toto; Srinivasan, Chinnathambi; Wang, Wanpeng; Liu, Zhongchi; Scorza, Ralph; Dardick, Chris

    2018-05-15

    Plant shoots typically grow upward in opposition to the pull of gravity. However, exceptions exist throughout the plant kingdom. Most conspicuous are trees with weeping or pendulous branches. While such trees have long been cultivated and appreciated for their ornamental value, the molecular basis behind the weeping habit is not known. Here, we characterized a weeping tree phenotype in Prunus persica (peach) and identified the underlying genetic mutation using a genomic sequencing approach. Weeping peach tree shoots exhibited a downward elliptical growth pattern and did not exhibit an upward bending in response to 90° reorientation. The causative allele was found to be an uncharacterized gene, Ppa013325, having a 1.8-Kb deletion spanning the 5' end. This gene, dubbed WEEP, was predominantly expressed in phloem tissues and encodes a highly conserved 129-amino acid protein containing a sterile alpha motif (SAM) domain. Silencing WEEP in the related tree species Prunus domestica (plum) resulted in more outward, downward, and wandering shoot orientations compared to standard trees, supporting a role for WEEP in directing lateral shoot growth in trees. This previously unknown regulator of branch orientation, which may also be a regulator of gravity perception or response, provides insights into our understanding of how tree branches grow in opposition to gravity and could serve as a critical target for manipulating tree architecture for improved tree shape in agricultural and horticulture applications. Copyright © 2018 the Author(s). Published by PNAS.

  16. Exploring the conserved water site and hydration of a coiled-coil trimerisation motif: a MD simulation study.

    Science.gov (United States)

    Dolenc, Jozica; Baron, Riccardo; Missimer, John H; Steinmetz, Michel O; van Gunsteren, Wilfred F

    2008-07-21

    The solvent structure and dynamics around ccbeta-p, a 17-residue peptide that forms a parallel three-stranded alpha-helical coiled coil in solution, was analysed through 10 ns explicit solvent molecular dynamics (MD) simulations at 278 and 330 K. Comparison with two corresponding simulations of the monomeric form of ccbeta-p was used to investigate the changes of hydration upon coiled-coil formation. Pronounced peaks in the solvent density distribution between residues Arg8 and Glu13 of neighbouring helices show the presence of water bridges between the helices of the ccbeta-p trimer; this is in agreement with the water sites observed in X-ray crystallography experiments. Interestingly, this water site is structurally conserved in many three-stranded coiled coils and, together with the Arg and Glu residues, forms part of a motif that determines three-stranded coiled-coil formation. Our findings show that little direct correlation exists between the solvent density distribution and the temporal ordering of water around the trimeric coiled coil. The MD-calculated effective residence times of up to 40 ps show rapid exchange of surface water molecules with the bulk phase, and indicate that the solvent distribution around biomolecules requires interpretation in terms of continuous density distributions rather than in terms of discrete molecules of water. Together, our study contributes to understanding the principles of three-stranded coiled-coil formation.

  17. Conserved helicase domain of human RecQ4 is required for strand annealing-independent DNA unwinding

    DEFF Research Database (Denmark)

    Rossi, Marie L; Ghosh, Avik K; Kulikowicz, Tomasz

    2010-01-01

    Humans have five members of the well conserved RecQ helicase family: RecQ1, Bloom syndrome protein (BLM), Werner syndrome protein (WRN), RecQ4, and RecQ5, which are all known for their roles in maintaining genome stability. BLM, WRN, and RecQ4 are associated with premature aging and cancer...... provide the first evidence that human RecQ4's unwinding is independent of strand annealing, and that it does not require the presence of excess ssDNA. Moreover, we demonstrate that a point mutation of the conserved lysine in the Walker A motif abolished helicase activity, implying that not the N...... activities and protein partners of RecQ4 are conserved with those of the other RecQ helicases....

  18. Conservation archaeogenomics: ancient DNA and biodiversity in the Anthropocene.

    Science.gov (United States)

    Hofman, Courtney A; Rick, Torben C; Fleischer, Robert C; Maldonado, Jesús E

    2015-09-01

    There is growing consensus that we have entered the Anthropocene, a geologic epoch characterized by human domination of the ecosystems of the Earth. With the future uncertain, we are faced with understanding how global biodiversity will respond to anthropogenic perturbations. The archaeological record provides perspective on human-environment relations through time and across space. Ancient DNA (aDNA) analyses of plant and animal remains from archaeological sites are particularly useful for understanding past human-environment interactions, which can help guide conservation decisions during the environmental changes of the Anthropocene. Here, we define the emerging field of conservation archaeogenomics, which integrates archaeological and genomic data to generate baselines or benchmarks for scientists, managers, and policy-makers by evaluating climatic and human impacts on past, present, and future biodiversity. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. TFII-I regulates target genes in the PI-3K and TGF-β signaling pathways through a novel DNA binding motif.

    Science.gov (United States)

    Segura-Puimedon, Maria; Borralleras, Cristina; Pérez-Jurado, Luis A; Campuzano, Victoria

    2013-09-25

    General transcription factor (TFII-I) is a multi-functional protein involved in the transcriptional regulation of critical developmental genes, encoded by the GTF2I gene located on chromosome 7q11.23. Haploinsufficiency at GTF2I has been shown to play a major role in the neurodevelopmental features of Williams-Beuren syndrome (WBS). Identification of genes regulated by TFII-I is thus critical to detect molecular determinants of WBS as well as to identify potential new targets for specific pharmacological interventions, which are currently absent. We performed a microarray screening for transcriptional targets of TFII-I in cortex and embryonic cells from Gtf2i mutant and wild-type mice. Candidate genes with altered expression were verified using real-time PCR. A novel motif shared by deregulated genes was found and chromatin immunoprecipitation assays in embryonic fibroblasts were used to document in vitro TFII-I binding to this motif in the promoter regions of deregulated genes. Interestingly, the PI3K and TGFβ signaling pathways were over-represented among TFII-I-modulated genes. In this study we have found a highly conserved DNA element, common to a set of genes regulated by TFII-I, and identified and validated novel in vivo neuronal targets of this protein affecting the PI3K and TGFβ signaling pathways. Overall, our data further contribute to unravel the complexity and variability of the different genetic programs orchestrated by TFII-I. © 2013 Elsevier B.V. All rights reserved.

  20. A conserved WW domain-like motif regulates invariant chain-dependent cell-surface transport of the NKG2D ligand ULBP2

    DEFF Research Database (Denmark)

    Uhlenbrock, Franziska Katharina; van Andel, Esther; Andresen, Lars

    2015-01-01

    that the NKG2D ligand ULBP2 traffics over an invariant chain (Ii)-dependent pathway to the cell surface. This study set out to elucidate how Ii regulates ULBP2 cell-surface transport: We discovered conserved tryptophan (Trp) residues in the primary protein sequence of ULBP1-6 but not in the related MICA....../B. Substitution of Trp to alanine resulted in cell-surface inhibition of ULBP2 in different cancer cell lines. Moreover, the mutated ULBP2 constructs were retained and not degraded inside the cell, indicating a crucial role of this conserved Trp-motif in trafficking. Finally, overexpression of Ii increased...... surface expression of wt ULBP2 while Trp-mutants could not be expressed, proposing that this Trp-motif is required for an Ii-dependent cell-surface transport of ULBP2. Aberrant soluble ULBP2 is immunosuppressive. Thus, targeting a distinct protein module on the ULBP2 sequence could counteract...

  1. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs) that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  2. Conservation of the rad21 Schizosaccharomyces pombe DNA double-strand break repair gene in mammals

    International Nuclear Information System (INIS)

    McKay, Michael J.; Spek, Peter van der; Kanaar, Roland; Smit, Bep; Bootsma, Dirk; Hoeijmakers, Jan H. J.

    1996-01-01

    Purpose/Objective: Genetic factors are likely to be major determinants of human cellular ionizing radiation sensitivity. DNA double strand breaks (dsbs) are significant ionizing radiation-induced lesions; cellular DNA dsb processing is also important in a number of other contexts. To further the understanding of DNA dsb processing in mammalian cells, we cloned and sequenced mammalian homologs of the rad21 Schizosaccharomyces pombe DNA dsb repair gene. Materials and Methods: The genes were cloned by evolutionary walking, exploiting sequence homology between the yeast and mammalian genes. Results: No major motifs indicative of a particular function were present in the predicted amino acid sequences of the mammalian genes. Alignment of the Rad21 amino acid sequence with its putative homologs showed that similarity was distributed across the length of the proteins, with more highly conserved regions at both termini. The mHR21 sp (mouse homolog of Rad21, S. pombe) and hHR21 sp (human homolog of Rad21, S. pombe) predicted proteins were 96% identical, whereas the human and S. pombe proteins were 25% identical and 47% similar. RNA blot analysis showed that mHR21 sp mRNA was abundant in all adult mouse tissues examined, with highest expression in testis and thymus. In addition to a 3.1kb mRNA transcript in all tissues, an additional 2.2kb transcript was present at a high level in post-meiotic spermatids, while expression of the 3.1kb mRNA in testis was confined to the meiotic compartment. hHR21 sp mRNA was cell cycle regulated in human cells, increasing in late S phase to a peak in G2 phase. The level of hHR21 sp transcripts was not altered by exposure of normal diploid fibroblasts to 10 Gy ionizing radiation. In situ hybridization showed mHR21 sp resided on chromosome 15D3, whereas hHR21 sp localized to the syntenic 8q24 region.
Conclusion: Cloning these novel mammalian genes and characterization of their protein products should contribute to the understanding of cellular

  3. Comparisons of Copy Number, Genomic Structure, and Conserved Motifs for α-Amylase Genes from Barley, Rice, and Wheat

    Directory of Open Access Journals (Sweden)

    Qisen Zhang

    2017-10-01

    Barley is an important crop for the production of malt and beer. However, crops such as rice and wheat are rarely used for malting. α-amylase is the key enzyme that degrades starch during malting. In this study, we compared the genomic properties, gene copies, and conserved promoter motifs of α-amylase genes in barley, rice, and wheat. In all three crops, α-amylase consists of four subfamilies designated amy1, amy2, amy3, and amy4. In wheat and barley, members of amy1 and amy2 genes are localized on chromosomes 6 and 7, respectively. In rice, members of amy1 genes are found on chromosomes 1 and 2, and amy2 genes on chromosome 6. The barley genome has six amy1 members and three amy2 members. The wheat B genome contains four amy1 members and three amy2 members, while the rice genome has three amy1 members and one amy2 member. The B genome has mostly amy1 and amy2 members among the three wheat genomes. Amy1 promoters from all three crop genomes contain a GA-responsive complex consisting of a GA-responsive element (CAATAAA), pyrimidine box (CCTTTT) and TATCCAT/C box. This study has shown that amy1 and amy2 from both wheat and barley have similar genomic properties, including exon/intron structures and GA-responsive elements on promoters, but these differ in rice. Like barley, wheat should have sufficient amy activity to degrade starch completely during malting. Other factors, such as high protein with haze issues and the lack of husk causing lautering difficulty, may limit the use of wheat for brewing.

  4. N-termini of fungal CSL transcription factors are disordered, enriched in regulatory motifs and inhibit DNA binding in fission yeast.

    Directory of Open Access Journals (Sweden)

    Martin Převorovský

    CSL (CBF1/RBP-Jκ/Suppressor of Hairless/LAG-1) transcription factors are the effector components of the Notch receptor signalling pathway, which is critical for metazoan development. The metazoan CSL proteins (class M) can also function in a Notch-independent manner. Recently, two novel classes of CSL proteins, designated F1 and F2, have been identified in fungi. The role of the fungal CSL proteins is unclear, because the Notch pathway is not present in fungi. In fission yeast, the Cbf11 and Cbf12 CSL paralogs play antagonistic roles in cell adhesion and the coordination of cell and nuclear division. Unusually long N-terminal extensions are typical for fungal and invertebrate CSL family members. In this study, we investigate the functional significance of these extended N-termini of CSL proteins. We identify 15 novel CSL family members from 7 fungal species and conduct bioinformatic analyses of a combined dataset containing 34 fungal and 11 metazoan CSL protein sequences. We show that the long, non-conserved N-terminal tails of fungal CSL proteins are likely disordered and enriched in phosphorylation sites and PEST motifs. In a case study of Cbf12 (class F2), we provide experimental evidence that the protein is proteolytically processed and that the N-terminus inhibits the Cbf12-dependent DNA binding activity in an electrophoretic mobility shift assay. This study provides insight into the characteristics of the long N-terminal tails of fungal CSL proteins that may be crucial for controlling DNA-binding and CSL function. We propose that the regulation of DNA binding by Cbf12 via its N-terminal region represents an important means by which fission yeast strikes a balance between the class F1 and class F2 paralog activities. This mode of regulation might be shared with other CSL-positive fungi, some of which are relevant to human disease and biotechnology.

  5. Counting of oligomers in sequences generated by markov chains for DNA motif discovery.

    Science.gov (United States)

    Shan, Gao; Zheng, Wei-Mou

    2009-02-01

    By means of the technique of the imbedded Markov chain, an efficient algorithm is proposed to calculate exactly the first and second moments of word counts and the probability for a word to occur at least once in random texts generated by a Markov chain. A generating function is introduced directly from the imbedded Markov chain to derive asymptotic approximations for the problem. Two Z-scores, one based on the number of sequences with hits and the other on the total number of word hits in a set of sequences, are examined for discovery of motifs on a set of promoter sequences extracted from the A. thaliana genome. Source code is available at http://www.itp.ac.cn/zheng/oligo.c.
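    As a rough illustration of the first moment mentioned in this abstract, the sketch below computes the expected number of occurrences of a word in a text generated by a first-order Markov chain. It is not the authors' imbedded Markov chain algorithm and does not cover the second moment or the Z-scores; it assumes the chain starts in its stationary distribution, and the function names and the transition table layout are hypothetical.

```python
def stationary(P, iters=1000):
    """Approximate the stationary distribution of transition table P by power iteration."""
    states = list(P)
    pi = {s: 1.0 / len(states) for s in states}
    for _ in range(iters):
        pi = {t: sum(pi[s] * P[s][t] for s in states) for t in states}
    return pi


def word_probability(word, P, pi):
    """Probability that the word starts at a given position of a stationary chain."""
    p = pi[word[0]]
    for a, b in zip(word, word[1:]):
        p *= P[a][b]
    return p


def expected_count(word, n, P):
    """Expected number of (possibly overlapping) occurrences in a text of length n."""
    positions = max(n - len(word) + 1, 0)
    return positions * word_probability(word, P, stationary(P))


# Hypothetical background model: every transition equally likely.
UNIFORM = {a: {b: 0.25 for b in "ACGT"} for a in "ACGT"}
print(expected_count("ACGT", 1003, UNIFORM))
```

    The expectation is linear in the number of start positions, which is why overlaps do not matter here; they do matter for the variance, which is where an exact method such as the one described in the abstract is needed.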

  6. Sequence-specific DNA binding activity of the cross-brace zinc finger motif of the piggyBac transposase

    Science.gov (United States)

    Morellet, Nelly; Li, Xianghong; Wieninger, Silke A; Taylor, Jennifer L; Bischerour, Julien; Moriau, Séverine; Lescop, Ewen; Bardiaux, Benjamin; Mathy, Nathalie; Assrir, Nadine; Bétermier, Mireille; Nilges, Michael; Hickman, Alison B; Dyda, Fred; Craig, Nancy L; Guittet, Eric

    2018-01-01

    The piggyBac transposase (PB) is distinguished by its activity and utility in genome engineering, especially in humans where it has highly promising therapeutic potential. Little is known, however, about the structure–function relationships of the different domains of PB. Here, we demonstrate in vitro and in vivo that its C-terminal Cysteine-Rich Domain (CRD) is essential for DNA breakage, joining and transposition and that it binds to specific DNA sequences in the left and right transposon ends, and to an additional unexpectedly internal site at the left end. Using NMR, we show that the CRD adopts the specific fold of the cross-brace zinc finger protein family. We determine the interaction interfaces between the CRD and its target, the 5′-TGCGT-3′/3′-ACGCA-5′ motifs found in the left, left internal and right transposon ends, and use NMR results to propose docking models for the complex, which are consistent with our site-directed mutagenesis data. Our results provide support for a model of the PB/DNA interactions in the context of the transpososome, which will be useful for the rational design of PB mutants with increased activity. PMID: 29385532
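    The double-stranded motif named in this abstract can be located in a sequence by scanning for the motif and for its reverse complement. The sketch below is only an illustration of that idea under simple assumptions (exact matching, uppercase A/C/G/T input); the function names and the example sequence are hypothetical and are not taken from the paper.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_motif_sites(seq, motif="TGCGT"):
    """Return (position, strand) for each exact motif hit on either strand.

    Positions are 0-based and refer to the given (top) strand.
    """
    seq = seq.upper()
    rc = reverse_complement(motif)
    k = len(motif)
    hits = []
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window == motif:
            hits.append((i, "+"))
        elif window == rc:
            # Motif lies on the bottom strand.
            hits.append((i, "-"))
    return hits


# Made-up sequence with one hit on each strand.
print(find_motif_sites("AATGCGTCCACGCAGG"))
```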

  7. Biomimetic trapping cocktail to screen reactive metabolites: use of an amino acid and DNA motif mixture as light/heavy isotope pairs differing in mass shift.

    Science.gov (United States)

    Hosaka, Shuto; Honda, Takuto; Lee, Seon Hwa; Oe, Tomoyuki

    2018-06-01

    Candidate drugs that can be metabolically transformed into reactive electrophilic products, such as epoxides, quinones, and nitroso compounds, are of special concern because subsequent covalent binding to bio-macromolecules can cause adverse drug reactions, such as allergic reactions, hepatotoxicity, and genotoxicity. Several strategies have been reported for screening reactive metabolites, such as a covalent binding assay with radioisotope-labeled drugs and a trapping method followed by LC-MS/MS analyses. Of these, a trapping method using glutathione is the most common, especially at the early stage of drug development. However, the cysteine of glutathione is not the only nucleophilic site in vivo; lysine, histidine, arginine, and DNA bases are also nucleophilic. Indeed, the glutathione trapping method tends to overlook several types of reactive metabolites, such as aldehydes, acylglucuronides, and nitroso compounds. Here, we introduce an alternate way for screening reactive metabolites as follows: A mixture of the light and heavy isotopes of simplified amino acid motifs and a DNA motif is used as a biomimetic trapping cocktail. This mixture consists of [²H₀]/[²H₃]-1-methylguanidine (arginine motif, Δ3 Da), [²H₀]/[²H₄]-2-mercaptoethanol (cysteine motif, Δ4 Da), [²H₀]/[²H₅]-4-methylimidazole (histidine motif, Δ5 Da), [²H₀]/[²H₉]-n-butylamine (lysine motif, Δ9 Da), and [¹³C₀,¹⁵N₀]/[¹³C₁,¹⁵N₂]-2'-deoxyguanosine (DNA motif, Δ3 Da). Mass tag triggered data-dependent acquisition is used to find the characteristic doublet peaks, followed by specific identification of the light isotope peak using MS/MS. Forty-two model drugs were examined using an in vitro microsome experiment to validate the strategy.
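    The doublet search described in this abstract can be pictured as pairing peaks whose m/z values differ by the mass shift of one light/heavy trapping reagent. The sketch below is a simplified illustration and not the instrument's data-dependent acquisition logic: it assumes singly charged ions, a peak list of (m/z, intensity) tuples, and hypothetical tolerance and intensity-ratio thresholds; only the nominal shifts (3, 4, 5 and 9 Da) come from the abstract.

```python
def find_doublets(peaks, shift, tol=0.05, max_ratio=2.0):
    """Return (light_mz, heavy_mz) pairs separated by `shift` +/- `tol`.

    peaks     : iterable of (mz, intensity) for singly charged ions
    shift     : expected light/heavy mass difference in Da
    tol       : allowed deviation in Da (hypothetical default)
    max_ratio : largest accepted intensity ratio between the two peaks
                (hypothetical default; assumes a roughly 1:1 reagent mixture)
    """
    peaks = sorted(peaks)
    pairs = []
    for i, (mz_light, int_light) in enumerate(peaks):
        for mz_heavy, int_heavy in peaks[i + 1:]:
            delta = mz_heavy - mz_light
            if delta > shift + tol:
                break  # peaks are sorted, so later ones are even further away
            if abs(delta - shift) <= tol and min(int_light, int_heavy) > 0:
                ratio = max(int_light, int_heavy) / min(int_light, int_heavy)
                if ratio <= max_ratio:
                    pairs.append((mz_light, mz_heavy))
    return pairs


# Made-up peak list for illustration only.
PEAKS = [(300.10, 1000), (303.12, 950), (310.00, 500), (450.20, 800), (454.22, 100)]
print(find_doublets(PEAKS, shift=3.0))
```

    The intensity-ratio filter is one possible way to reject chance pairs; in this made-up list the pair near Δ4 Da is discarded because its two peaks differ eightfold in intensity.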

  8. The conserved dileucine- and tyrosine-based motifs in MLV and MPMV envelope glycoproteins are both important to regulate a common Env intracellular trafficking

    Directory of Open Access Journals (Sweden)

    Lopez-Vergès Sandra

    2006-09-01

    Background: Retrovirus particles emerge from the assembly of two structural protein components, Gag that is translated as a soluble protein in the cytoplasm of the host cells, and Env, a type I transmembrane protein. Because both components are translated in different intracellular compartments, elucidating the mechanisms of retrovirus assembly thus requires the study of their intracellular trafficking. Results: We used a CD25 (Tac) chimera-based approach to study the trafficking of Moloney murine leukemia virus and Mason-Pfizer monkey virus Env proteins. We found that the cytoplasmic tails (CTs) of both Env conserved two major signals that control a complex intracellular trafficking. A dileucine-based motif controls the sorting of the chimeras from the trans-Golgi network (TGN) toward endosomal compartments. Env proteins then follow a retrograde transport to the TGN due to the action of a tyrosine-based motif. Mutation of either motif induces the mis-localization of the chimeric proteins and both motifs are found to mediate interactions of the viral CTs with clathrin adaptors. Conclusion: These data reveal the unexpected complexity of the intracellular trafficking of retrovirus Env proteins that cycle between the TGN and endosomes. Given that Gag proteins hijack endosomal host proteins, our work suggests that the endosomal pathway may be used by retroviruses to ensure proper encountering of viral structural Gag and Env proteins in cells, an essential step of virus assembly.

  9. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Science.gov (United States)

    Tsai, Zing Tsung-Yeh; Shiu, Shin-Han; Tsai, Huai-Kuang

    2015-08-01

    Transcription factor (TF) binding is determined by the presence of specific sequence motifs (SM) and chromatin accessibility, where the latter is influenced by both chromatin state (CS) and DNA structure (DS) properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy) that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  10. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Directory of Open Access Journals (Sweden)

    Zing Tsung-Yeh Tsai

    2015-08-01

    Transcription factor (TF) binding is determined by the presence of specific sequence motifs (SM) and chromatin accessibility, where the latter is influenced by both chromatin state (CS) and DNA structure (DS) properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as a model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy) that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  11. The Chilo iridescent virus DNA polymerase promoter contains an essential AAAAT motif

    NARCIS (Netherlands)

    Nalcacioglu, R.; Ince, I.A.; Vlak, J.M.; Demirbag, Z.; Oers, van M.M.

    2007-01-01

    The delayed-early DNA polymerase promoter of Chilo iridescent virus (CIV), officially known as Invertebrate iridescent virus, was fine mapped by constructing a series of increasing deletions and by introducing point mutations. The effects of these mutations were examined in a luciferase reporter

  12. GANN: Genetic algorithm neural networks for the detection of conserved combinations of features in DNA

    Directory of Open Access Journals (Sweden)

    Beiko Robert G

    2005-02-01

    Background: The multitude of motif detection algorithms developed to date have largely focused on the detection of patterns in primary sequence. Since sequence-dependent DNA structure and flexibility may also play a role in protein-DNA interactions, the simultaneous exploration of sequence- and structure-based hypotheses about the composition of binding sites and the ordering of features in a regulatory region should be considered as well. The consideration of structural features requires the development of new detection tools that can deal with data types other than primary sequence. Results: GANN (available at http://bioinformatics.org.au/gann) is a machine learning tool for the detection of conserved features in DNA. The software suite contains programs to extract different regions of genomic DNA from flat files and convert these sequences to indices that reflect sequence and structural composition or the presence of specific protein binding sites. The machine learning component allows the classification of different types of sequences based on subsamples of these indices, and can identify the best combinations of indices and machine learning architecture for sequence discrimination. Another key feature of GANN is the replicated splitting of data into training and test sets, and the implementation of negative controls. In validation experiments, GANN successfully merged important sequence and structural features to yield good predictive models for synthetic and real regulatory regions. Conclusion: GANN is a flexible tool that can search through large sets of sequence and structural feature combinations to identify those that best characterize a set of sequences.

  13. i-Motif of cytosine-rich human telomere DNA fragments containing natural base lesions

    Czech Academy of Sciences Publication Activity Database

    Dvořáková, Zuzana; Renčiuk, Daniel; Kejnovská, Iva; Školáková, Petra; Bednářová, Klára; Sagi, J.; Vorlíčková, Michaela

    2018-01-01

    Vol. 46, No. 4 (2018), pp. 1624-1634 ISSN 1362-4962 R&D Projects: GA ČR(CZ) GA15-06785S; GA ČR GA17-12075S; GA ČR(CZ) GJ17-19170Y; GA MŠk EF15_003/0000477 Institutional support: RVO:68081707 Keywords: pair opening kinetics * g-quadruplex dna Subject RIV: CE - Biochemistry OECD field: Biochemistry and molecular biology

  14. Evaluation of the Stability of DNA i-Motifs in the Nuclei of Living Mammalian Cells

    Czech Academy of Sciences Publication Activity Database

    Dzatko, S.; Krafčíková, M.; Haensel-Hertsch, R.; Fessl, T.; Fiala, R.; Loja, T.; Krafčík, D.; Mergny, Jean-Louis; Foldynova-Trantirkova, Silvie; Trantírek, L.

    2018-01-01

    Vol. 57, No. 8 (2018), pp. 2165-2169 ISSN 1433-7851 R&D Projects: GA MŠk EF15_003/0000477 Institutional support: RVO:68081707 Keywords: g-quadruplex * telomeric dna * base-pairs * molecular switch Subject RIV: CG - Electrochemistry OECD field: Electrochemistry (dry cells, batteries, fuel cells, corrosion metals, electrolysis) Impact factor: 11.994, year: 2016

  15. A single amino-acid change in a highly conserved motif of gp41 elicits HIV-1 neutralization and protects against CD4 depletion.

    Science.gov (United States)

    Petitdemange, Caroline; Achour, Abla; Dispinseri, Stefania; Malet, Isabelle; Sennepin, Alexis; Ho Tsong Fang, Raphaël; Crouzet, Joël; Marcelin, Anne-Geneviève; Calvez, Vincent; Scarlatti, Gabriella; Debré, Patrice; Vieillard, Vincent

    2013-09-01

    The induction of neutralizing antibodies against conserved regions of the human immunodeficiency virus type 1 (HIV-1) envelope protein is a major goal of vaccine strategies. We previously identified 3S, a critical conserved motif of gp41 that induces the NKp44L ligand of an activating NK receptor. In vivo, anti-3S antibodies protect against the natural killer (NK) cell-mediated CD4 depletion that occurs without efficient viral neutralization. Specific substitutions within the 3S peptide motif were prepared by directed mutagenesis. Virus production was monitored by measuring the p24 production. Neutralization assays were performed with immune-purified antibodies from immunized mice and a cohort of HIV-infected patients. Expression of NKp44L on CD4(+) T cells and degranulation assay on activating NK cells were both performed by flow cytometry. Here, we show that specific substitutions in the 3S motif reduce viral infection without affecting gp41 production, while decreasing both its capacity to induce NKp44L expression on CD4(+) T cells and its sensitivity to autologous NK cells. Generation of antibodies in mice against the W614 specific position in the 3S motif elicited a capacity to neutralize cross-clade viruses, notable in its magnitude, breadth, and durability. Antibodies against this 3S variant were also detected in sera from some HIV-1-infected patients, demonstrating both neutralization activity and protection against CD4 depletion. These findings suggest that a specific substitution in a 3S-based immunogen might allow the generation of specific antibodies, providing a foundation for a rational vaccine that combines a capacity to neutralize HIV-1 and to protect CD4(+) T cells.

  16. Determination of 5'-leader sequences from radically disparate strains of porcine reproductive and respiratory syndrome virus reveals the presence of highly conserved sequence motifs

    DEFF Research Database (Denmark)

    Oleksiewicz, M.B.; Bøtner, Anette; Nielsen, Jens

    1999-01-01

    We determined the untranslated 5'-leader sequence for three different isolates of porcine reproductive and respiratory syndrome virus (PRRSV): pathogenic European- and American-types, as well as an American-type vaccine strain. 5'-leader from European- and American-type PRRSV differed in length...... (220 and 190 nt, respectively), and exhibited only approximately 50% nucleotide homology. Nevertheless, highly conserved areas were identified in the leader of all 3 PRRSV isolates, which constitute candidate motifs for binding of protein(s) involved in viral replication. These comparative data provide...

  17. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d

    Directory of Open Access Journals (Sweden)

    Moffatt Barbara A

    2010-08-01

    Background: Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. Results: The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Conclusions: Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3D structure.

  18. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d.

    Science.gov (United States)

    Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J

    2010-08-03

    Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3D structure.

  19. Plant and yeast cornichon possess a conserved acidic motif required for correct targeting of plasma membrane cargos

    Czech Academy of Sciences Publication Activity Database

    Rosas-Santiago, P.; Lagunas-Goméz, D.; Yánez-Domínguez, C.; Vera-Estrella, R.; Zimmermannová, Olga; Sychrová, Hana; Pantoja, O.

    2017-01-01

    Vol. 1864, No. 10 (2017), pp. 1809-1818 ISSN 0167-4889 R&D Projects: GA MŠk(CZ) LQ1604; GA MŠk(CZ) ED1.1.00/02.0109; GA ČR(CZ) GA17-01953S Institutional support: RVO:67985823 Keywords: cornichon * ScErv14 * acidic motif * cargo selection Subject RIV: EB - Genetics; Molecular Biology OECD field: Biochemistry and molecular biology Impact factor: 4.521, year: 2016

  20. A conserved WW domain-like motif regulates invariant chain-dependent cell-surface transport of the NKG2D ligand ULBP2.

    Science.gov (United States)

    Uhlenbrock, Franziska; van Andel, Esther; Andresen, Lars; Skov, Søren

    2015-08-01

    Malignant cells expressing NKG2D ligands on their cell surface can be directly sensed and killed by NKG2D-bearing lymphocytes. To ensure this immune recognition, accumulating evidence suggests that NKG2D ligands are trafficked via alternative pathways to the cell surface. We have previously shown that the NKG2D ligand ULBP2 traffics over an invariant chain (Ii)-dependent pathway to the cell surface. This study set out to elucidate how Ii regulates ULBP2 cell-surface transport: We discovered conserved tryptophan (Trp) residues in the primary protein sequence of ULBP1-6 but not in the related MICA/B. Substitution of Trp to alanine resulted in cell-surface inhibition of ULBP2 in different cancer cell lines. Moreover, the mutated ULBP2 constructs were retained and not degraded inside the cell, indicating a crucial role of this conserved Trp-motif in trafficking. Finally, overexpression of Ii increased surface expression of wt ULBP2 while Trp-mutants could not be expressed, proposing that this Trp-motif is required for an Ii-dependent cell-surface transport of ULBP2. Aberrant soluble ULBP2 is immunosuppressive. Thus, targeting a distinct protein module on the ULBP2 sequence could counteract this abnormal expression of ULBP2.

  1. Global MYCN transcription factor binding analysis in neuroblastoma reveals association with distinct E-box motifs and regions of DNA hypermethylation.

    LENUS (Irish Health Repository)

    Murphy, Derek M

    2009-01-01

    BACKGROUND: Neuroblastoma, a cancer derived from precursor cells of the sympathetic nervous system, is a major cause of childhood cancer related deaths. The single most important prognostic indicator of poor clinical outcome in this disease is genomic amplification of MYCN, a member of a family of oncogenic transcription factors. METHODOLOGY: We applied MYCN chromatin immunoprecipitation to microarrays (ChIP-chip) using MYCN amplified/non-amplified cell lines as well as a conditional knockdown cell line to determine the distribution of MYCN binding sites within all annotated promoter regions. CONCLUSION: Assessment of E-box usage within consistently positive MYCN binding sites revealed a predominance for the CATGTG motif (p<0.0016), with significant enrichment of additional motifs CATTTG, CATCTG, CAACTG in the MYCN amplified state. For cell lines over-expressing MYCN, gene ontology analysis revealed enrichment for the binding of MYCN at promoter regions of numerous molecular functional groups including DNA helicases and mRNA transcriptional regulation. In order to evaluate MYCN binding with respect to other genomic features, we determined the methylation status of all annotated CpG islands and promoter sequences using methylated DNA immunoprecipitation (MeDIP). The integration of MYCN ChIP-chip and MeDIP data revealed a highly significant positive correlation between MYCN binding and DNA hypermethylation. This association was also detected in regions of hemizygous loss, indicating that the observed association occurs on the same homologue. In summary, these findings suggest that MYCN binding occurs more commonly at CATGTG as opposed to the classic CACGTG E-box motif, and that disease associated over expression of MYCN leads to aberrant binding to additional weaker affinity E-box motifs in neuroblastoma. 
The co-localization of MYCN binding and DNA hypermethylation further supports the dual role of MYCN, namely that of a classical transcription factor affecting the

  2. Multi-layered control of Galectin-8 mediated autophagy during adenovirus cell entry through a conserved PPxY motif in the viral capsid.

    Directory of Open Access Journals (Sweden)

    Charlotte Montespan

    2017-02-01

    Cells employ active measures to restrict infection by pathogens, even prior to responses from the innate and humoral immune defenses. In this context selective autophagy is activated upon pathogen induced membrane rupture to sequester and deliver membrane fragments and their pathogen contents for lysosomal degradation. Adenoviruses, which breach the endosome upon entry, escape this fate by penetrating into the cytosol prior to autophagosome sequestration of the ruptured endosome. We show that virus induced membrane damage is recognized through Galectin-8 and sequesters the autophagy receptors NDP52 and p62. We further show that a conserved PPxY motif in the viral membrane lytic protein VI is critical for efficient viral evasion of autophagic sequestration after endosomal lysis. Comparing the wildtype with a PPxY-mutant virus we show that depletion of Galectin-8 or suppression of autophagy in ATG5-/- MEFs rescues infectivity of the PPxY-mutant virus while depletion of the autophagy receptors NDP52 and p62 has only minor effects. Furthermore we show that wildtype viruses exploit the autophagic machinery for efficient nuclear genome delivery and control autophagosome formation via the cellular ubiquitin ligase Nedd4.2 resulting in reduced antigenic presentation. Our data thus demonstrate that a short PPxY-peptide motif in the adenoviral capsid permits multi-layered viral control of autophagic processes during entry.

  3. Quantification of Chemical and Mechanical Effects on the Formation of the G-Quadruplex and i-Motif in Duplex DNA.

    Science.gov (United States)

    Selvam, Sangeetha; Mandal, Shankar; Mao, Hanbin

    2017-09-05

    The formation of biologically significant tetraplex DNA species, such as G-quadruplexes and i-motifs, is affected by chemical (ions and pH) and mechanical [superhelicity (σ) and molecular crowding] factors. Because of the extremely challenging experimental conditions, the relative importance of these factors on tetraplex folding is unknown. In this work, we quantitatively evaluated the chemical and mechanical effects on the population dynamics of DNA tetraplexes in the insulin-linked polymorphic region using magneto-optical tweezers. By mechanically unfolding individual tetraplexes, we found that ions and pH have the largest effects on the formation of the G-quadruplex and i-motif, respectively. Interestingly, superhelicity has the second largest effect followed by molecular crowding conditions. While chemical effects are specific to tetraplex species, mechanical factors have generic influences. The predominant effect of chemical factors can be attributed to the fact that they directly change the stability of a specific tetraplex, whereas the mechanical factors, superhelicity in particular, reduce the stability of the competing species by changing the kinetics of the melting and annealing of the duplex DNA template in a nonspecific manner. The substantial dependence of tetraplexes on superhelicity provides strong support that DNA tetraplexes can serve as topological sensors to modulate fundamental cellular processes such as transcription.

  4. Cations form sequence selective motifs within DNA grooves via a combination of cation-pi and ion-dipole/hydrogen bond interactions.

    Science.gov (United States)

    Stewart, Mikaela; Dunlap, Tori; Dourlain, Elizabeth; Grant, Bryce; McFail-Isom, Lori

    2013-01-01

    The fine conformational subtleties of DNA structure modulate many fundamental cellular processes including gene activation/repression, cellular division, and DNA repair. Most of these cellular processes rely on the conformational heterogeneity of specific DNA sequences. Factors including those structural characteristics inherent in the particular base sequence as well as those induced through interaction with solvent components combine to produce fine DNA structural variation including helical flexibility and conformation. Cation-pi interactions between solvent cations or their first hydration shell waters and the faces of DNA bases form sequence selectively and contribute to DNA structural heterogeneity. In this paper, we detect and characterize the binding patterns found in cation-pi interactions between solvent cations and DNA bases in a set of high resolution x-ray crystal structures. Specifically, we found that monovalent cations (Tl⁺) and the polarized first hydration shell waters of divalent cations (Mg²⁺, Ca²⁺) form cation-pi interactions with DNA bases stabilizing unstacked conformations. When these cation-pi interactions are combined with electrostatic interactions a pattern of specific binding motifs is formed within the grooves.

  5. The Arabidopsis GAGA-Binding Factor BASIC PENTACYSTEINE6 Recruits the POLYCOMB-REPRESSIVE COMPLEX1 Component LIKE HETEROCHROMATIN PROTEIN1 to GAGA DNA Motifs.

    Science.gov (United States)

    Hecker, Andreas; Brand, Luise H; Peter, Sébastien; Simoncello, Nathalie; Kilian, Joachim; Harter, Klaus; Gaudin, Valérie; Wanke, Dierk

    2015-07-01

    Polycomb-repressive complexes (PRCs) play key roles in development by repressing a large number of genes involved in various functions. Much, however, remains to be discovered about PRC-silencing mechanisms as well as their targeting to specific genomic regions. Besides other mechanisms, GAGA-binding factors in animals can guide PRC members in a sequence-specific manner to Polycomb-responsive DNA elements. Here, we show that the Arabidopsis (Arabidopsis thaliana) GAGA-motif binding factor protein basic pentacysteine6 (BPC6) interacts with like heterochromatin protein1 (LHP1), a PRC1 component, and associates with vernalization2 (VRN2), a PRC2 component, in vivo. By using a modified DNA-protein interaction enzyme-linked immunosorbent assay, we could show that BPC6 was required and sufficient to recruit LHP1 to GAGA motif-containing DNA probes in vitro. We also found that LHP1 interacts with VRN2 and, therefore, can function as a possible scaffold between BPC6 and VRN2. The lhp1-4 bpc4 bpc6 triple mutant displayed a pleiotropic phenotype, extreme dwarfism and early flowering, which disclosed synergistic functions of LHP1 and group II plant BPC members. Transcriptome analyses supported this synergy and suggested a possible function in the concerted repression of homeotic genes, probably through histone H3 lysine-27 trimethylation. Hence, our findings suggest striking similarities between animal and plant GAGA-binding factors in the recruitment of PRC1 and PRC2 components to Polycomb-responsive DNA element-like GAGA motifs, which must have evolved through convergent evolution. © 2015 American Society of Plant Biologists. All Rights Reserved.

  6. The Rev1 interacting region (RIR) motif in the scaffold protein XRCC1 mediates a low-affinity interaction with polynucleotide kinase/phosphatase (PNKP) during DNA single-strand break repair.

    Science.gov (United States)

    Breslin, Claire; Mani, Rajam S; Fanta, Mesfin; Hoch, Nicolas; Weinfeld, Michael; Caldecott, Keith W

    2017-09-29

    The scaffold protein X-ray repair cross-complementing 1 (XRCC1) interacts with multiple enzymes involved in DNA base excision repair and single-strand break repair (SSBR) and is important for genetic integrity and normal neurological function. One of the most important interactions of XRCC1 is that with polynucleotide kinase/phosphatase (PNKP), a dual-function DNA kinase/phosphatase that processes damaged DNA termini and that, if mutated, results in ataxia with oculomotor apraxia 4 (AOA4) and microcephaly with early-onset seizures and developmental delay (MCSZ). XRCC1 and PNKP interact via a high-affinity phosphorylation-dependent interaction site in XRCC1 and a forkhead-associated domain in PNKP. Here, we identified using biochemical and biophysical approaches a second PNKP interaction site in XRCC1 that binds PNKP with lower affinity and independently of XRCC1 phosphorylation. However, this interaction nevertheless stimulated PNKP activity and promoted SSBR and cell survival. The low-affinity interaction site required the highly conserved Rev1-interacting region (RIR) motif in XRCC1 and included three critical and evolutionarily invariant phenylalanine residues. We propose a bipartite interaction model in which the previously identified high-affinity interaction acts as a molecular tether, holding XRCC1 and PNKP together and thereby promoting the low-affinity interaction identified here, which then stimulates PNKP directly. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Specific interaction of the nonstructural protein NS1 of minute virus of mice (MVM) with [ACCA](2) motifs in the centre of the right-end MVM DNA palindrome induces hairpin-primed viral DNA replication.

    Science.gov (United States)

    Willwand, Kurt; Moroianu, Adela; Hörlein, Rita; Stremmel, Wolfgang; Rommelaere, Jean

    2002-07-01

    The linear single-stranded DNA genome of minute virus of mice (MVM) is replicated via a double-stranded replicative form (RF) intermediate DNA. Amplification of viral RF DNA requires the structural transition of the right-end palindrome from a linear duplex into a double-hairpin structure, which serves for the repriming of unidirectional DNA synthesis. This conformational transition was found previously to be induced by the MVM nonstructural protein NS1. Elimination of the cognate NS1-binding sites, [ACCA](2), from the central region of the right-end palindrome next to the axis of symmetry was shown to markedly reduce the efficiency of hairpin-primed DNA replication, as measured in a reconstituted in vitro replication system. Thus, [ACCA](2) sequence motifs are essential as NS1-binding elements in the context of the structural transition of the right-end MVM palindrome.

  8. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa)

    Directory of Open Access Journals (Sweden)

    Wiedmer Stefanie

    2017-01-01

    The genus Eimeria (Apicomplexa, Coccidia) provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain.

  9. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa)

    Science.gov (United States)

    Wiedmer, Stefanie; Erdbeer, Alexander; Volke, Beate; Randel, Stephanie; Kapplusch, Franz; Hanig, Sacha; Kurth, Michael

    2017-01-01

    The genus Eimeria (Apicomplexa, Coccidia) provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain. PMID:29210668

  10. Highly Conserved Arg Residue of ERFNIN Motif of Pro-Domain is Important for pH-Induced Zymogen Activation Process in Cysteine Cathepsins K and L.

    Science.gov (United States)

    Aich, Pulakesh; Biswas, Sampa

    2018-06-01

    The pro-domain of a cysteine cathepsin contains a highly conserved Ex2Rx2Fx2Nx3Ix3N (ERFNIN) motif. The zymogen structure of cathepsins revealed that the Arg (R) residue of the motif is a central residue of a salt-bridge/H-bond network, stabilizing the scaffold of the pro-domain. The importance of the arginine is also demonstrated in studies where a single mutation (Arg → Trp) in human lysosomal cathepsin K (hCTSK) is linked to a bone-related genetic disorder, "Pycnodysostosis". In the present study, we have characterized in vitro the Arg → Trp mutant of hCTSK and the same mutant of hCTSL. The R → W mutant of hCTSK revealed that this mutation leads to an unstable zymogen that is spontaneously activated and auto-proteolytically degraded rapidly. In contrast, the same mutant of hCTSL is sufficiently stable and has proteolytic activity almost like its wild-type counterpart; however, it shows an altered zymogen activation condition in terms of pH, temperature and time. Far and near UV circular dichroism and intrinsic tryptophan fluorescence experiments have revealed that the mutation has minimal effect on the structure of the protease hCTSL. Molecular modeling studies show that the mutated Trp31 in hCTSL forms an aromatic cluster with Tyr23 and Trp30, leading to a local stabilization of the pro-domain, and compensates for the loss of the salt-bridge interaction mediated by Arg31 in the wild type. In the hCTSK-R31W mutant, due to the presence of a non-aromatic Ser30 residue, such an interaction is not possible, which may be responsible for local instability. These differences may cause detrimental effects of the R31W mutation on the regulation of the hCTSK auto-activation process compared to the altered activation process in hCTSL.

  11. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    Science.gov (United States)

    Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.

  12. The valine and lysine residues in the conserved FxVTxK motif are important for the function of phylogenetically distant plant cellulose synthases

    Energy Technology Data Exchange (ETDEWEB)

    Slabaugh, Erin; Scavuzzo-Duggan, Tess; Chaves, Arielle; Wilson, Liza; Wilson, Carmen; Davis, Jonathan K.; Cosgrove, Daniel J.; Anderson, Charles T.; Roberts, Alison W.; Haigler, Candace H.

    2015-12-08

    Cellulose synthases (CESAs) synthesize the β-1,4-glucan chains that coalesce to form cellulose microfibrils in plant cell walls. In addition to a large cytosolic (catalytic) domain, CESAs have eight predicted transmembrane helices (TMHs). However, analogous to the structure of BcsA, a bacterial CESA, predicted TMH5 in CESA may instead be an interfacial helix. This would place the conserved FxVTxK motif in the plant cell cytosol where it could function as a substrate-gating loop as occurs in BcsA. To define the functional importance of the CESA region containing FxVTxK, we tested five parallel mutations in Arabidopsis thaliana CESA1 and Physcomitrella patens CESA5 in complementation assays of the relevant cesa mutants. In both organisms, the substitution of the valine or lysine residues in FxVTxK severely affected CESA function. In Arabidopsis roots, both changes were correlated with lower cellulose anisotropy, as revealed by Pontamine Fast Scarlet. Analysis of hypocotyl inner cell wall layers by atomic force microscopy showed that two altered versions of Atcesa1 could rescue cell wall phenotypes observed in the mutant background line. Overall, the data show that the FxVTxK motif is functionally important in two phylogenetically distant plant CESAs. The results show that Physcomitrella provides an efficient model for assessing the effects of engineered CESA mutations affecting primary cell wall synthesis and that diverse testing systems can lead to nuanced insights into CESA structure–function relationships. Although CESA membrane topology needs to be experimentally determined, the results support the possibility that the FxVTxK region functions similarly in CESA and BcsA.
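
    As a rough illustration of how a motif written as FxVTxK (with x standing for any single residue) can be located in a protein sequence, the sketch below translates it into a regular expression. The function name and the example peptide are hypothetical and are not taken from the study; real CESA sequences would need to be supplied by the reader.

```python
import re

# FxVTxK with "x" = any single residue, expressed as a regular expression.
FXVTXK = re.compile(r"F.VT.K")


def find_fxvtxk(protein):
    """Return (1-based start, matched residues) for each non-overlapping hit."""
    return [(m.start() + 1, m.group()) for m in FXVTXK.finditer(protein.upper())]


# Hypothetical peptide, made up purely for illustration.
example = "MSGAFKVTSKLLQ"
print(find_fxvtxk(example))
```

    A substitution of the valine, as tested in the complementation assays, would simply stop the pattern from matching, which is a convenient way to check that a mutant construct no longer carries the motif.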

  13. Suiformes conservation: a study case of strategies for DNA utilization

    Indian Academy of Sciences (India)

    However, the amount and quality of DNA obtained is a major concern and compar- .... and an improvement of the DNA purity observable in the. A260/280 ratio (the purity ratio ... facilities at The University of Arizona, Tucson, USA. Thanks also.

  14. Comparative analysis of the full genome sequence of European bat lyssavirus type 1 and type 2 with other lyssaviruses and evidence for a conserved transcription termination and polyadenylation motif in the G-L 3' non-translated region.

    Science.gov (United States)

    Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R

    2007-04-01

    We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glycoproteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.

  15. Loop 7 of E2 enzymes: an ancestral conserved functional motif involved in the E2-mediated steps of the ubiquitination cascade.

    Directory of Open Access Journals (Sweden)

    Elena Papaleo

    The ubiquitin (Ub) system controls almost every aspect of eukaryotic cell biology. Protein ubiquitination depends on the sequential action of three classes of enzymes (E1, E2 and E3). E2 Ub-conjugating enzymes have a central role in the ubiquitination pathway, interacting with both E1 and E3, and influencing the ultimate fate of the substrates. Several E2s are characterized by an extended acidic insertion in loop 7 (L7), which if mutated is known to impair the proper E2-related functions. In the present contribution, we show that the acidic loop is a conserved ancestral motif in E2s, relying on the presence of alternate hydrophobic and acidic residues. Moreover, the dynamic properties of a subset of family 3 E2s, as well as their binary and ternary complexes with Ub and the cognate E3, have been investigated. Here we provide a model of the role of L7 in the different steps of the ubiquitination cascade of family 3 E2s. The L7 hydrophobic residues turned out to be the main determinant for the stabilization of the E2 inactive conformations by a tight network of interactions in the catalytic cleft. Moreover, phosphorylation is known from previous studies to promote E2 competent conformations for Ub charging, inducing electrostatic repulsion and acting on the L7 acidic residues. Here we show that these active conformations are stabilized by a network of hydrophobic interactions between L7 and L4, the latter being a conserved interface for E3-recruitment in several E2s. In the successive steps, L7 conserved acidic residues also provide an interaction interface for both Ub and the Rbx1 RING subdomain of the cognate E3. Our data therefore suggest a crucial role for L7 of family 3 E2s in all the E2-mediated steps of the ubiquitination cascade. Its different functions are exploited thanks to its conserved hydrophobic and acidic residues in a finely orchestrated mechanism.

  16. Conserved retinoblastoma protein-binding motif in human cytomegalovirus UL97 kinase minimally impacts viral replication but affects susceptibility to maribavir

    Directory of Open Access Journals (Sweden)

    Chou Sunwen

    2009-01-01

    The UL97 kinase has been shown to phosphorylate and inactivate the retinoblastoma protein (Rb) and has three consensus Rb-binding motifs that might contribute to this activity. Recombinant viruses containing mutations in the Rb-binding motifs generally replicated well in human foreskin fibroblasts with only a slight delay in replication kinetics. Their susceptibility to the specific UL97 kinase inhibitor, maribavir, was also examined. Mutation of the amino terminal motif, which is involved in the inactivation of Rb, also renders the virus hypersensitive to the drug and suggests that the motif may play a role in its mechanism of action.

  17. Two sequence motifs from HIF-1α bind to the DNA-binding site of p53

    OpenAIRE

    Hansson, Lars O.; Friedler, Assaf; Freund, Stefan; Rüdiger, Stefan; Fersht, Alan R.

    2002-01-01

    There is evidence that hypoxia-inducible factor-1α (HIF-1α) interacts with the tumor suppressor p53. To characterize the putative interaction, we mapped the binding of the core domain of p53 (p53c) to an array of immobilized HIF-1α-derived peptides and found two peptide-sequence motifs that bound to p53c with micromolar affinity in solution. One sequence was adjacent to and the other coincided with the two proline residues of the oxygen-dependent degradation domain (P402 and P564) that act as...

  18. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Background We consider the problem of identifying motifs, recurring or conserved patterns, in biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1) improves search efficiency compared to existing algorithms, and (2) scales well with the size of the alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA) we observed a reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobulin families. Conclusions Our algorithm reduces the computational complexity of current motif finding algorithms and demonstrates strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  19. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.
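
    The (l, d) motif search problem defined above can be made concrete with a small exhaustive sketch. This is a plain brute-force search written for illustration, not the automata-based method of the paper, and it is only practical for very small l and d. The function names and the toy sequences are assumptions introduced here.

```python
from itertools import combinations, product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def neighbors(kmer, d, alphabet="ACGT"):
    """All strings within Hamming distance d of kmer."""
    result = set()
    for k in range(d + 1):
        for positions in combinations(range(len(kmer)), k):
            for letters in product(alphabet, repeat=k):
                s = list(kmer)
                for p, c in zip(positions, letters):
                    s[p] = c
                result.add("".join(s))
    return result


def occurs(motif, seq, d):
    """True if some window of seq is within d substitutions of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def ld_motif_search(seqs, l, d, q=None):
    """Motifs of length l occurring (up to d substitutions) in >= q sequences."""
    q = len(seqs) if q is None else q
    candidates = set()
    # A motif present in at least q of n sequences must occur in one of the
    # first n - q + 1 sequences, so candidates are seeded from those only.
    for seq in seqs[: len(seqs) - q + 1]:
        for i in range(len(seq) - l + 1):
            candidates |= neighbors(seq[i:i + l], d)
    return sorted(m for m in candidates
                  if sum(occurs(m, s, d) for s in seqs) >= q)


# Toy input: ACGT occurs exactly once and with one substitution twice.
toy = ["TTACGTAA", "GGACCTCC", "ACGAGGTT"]
print("ACGT" in ld_motif_search(toy, l=4, d=1))
```

    The candidate set grows roughly as the number of l-mers times the size of each d-neighborhood, which is why instances such as (26,11) are out of reach for this kind of enumeration and motivate specialized algorithms or hardware.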

  20. EBNA-2 of herpesvirus papio diverges significantly from the type A and type B EBNA-2 proteins of Epstein-Barr virus but retains an efficient transactivation domain with a conserved hydrophobic motif.

    Science.gov (United States)

    Ling, P D; Ryon, J J; Hayward, S D

    1993-01-01

    EBNA-2 contributes to the establishment of Epstein-Barr virus (EBV) latency in B cells and to the resultant alterations in B-cell growth pattern by up-regulating expression from specific viral and cellular promoters. We have taken a comparative approach toward characterizing functional domains within EBNA-2. To this end, we have cloned and sequenced the EBNA-2 gene from the closely related baboon virus herpesvirus papio (HVP). All human EBV isolates have either a type A or type B EBNA-2 gene. However, the HVP EBNA-2 gene falls into neither the type A category nor the type B category, suggesting that the separation into these two subtypes may have been a recent evolutionary event. Comparison of the predicted amino acid sequences indicates 37% amino acid identity with EBV type A EBNA-2 and 35% amino acid identity with type B EBNA-2. To define the domains of EBNA-2 required for transcriptional activation, the DNA binding domain of the GAL4 protein was fused to overlapping segments of EBV EBNA-2. This approach identified a 40-amino-acid (40-aa) EBNA-2 activation domain located between aa 437 and 477. Transactivation ability was completely lost when the amino-terminal boundary of this domain was moved to aa 441, indicating that the motif at aa 437 to 440, Pro-Ile-Leu-Phe, contains residues critical for function. The aa 437 boundary identified in these experiments coincides precisely with a block of conserved sequences in HVP EBNA-2, and the comparable carboxy-terminal region of HVP EBNA-2 also functioned as a strong transcriptional activation domain when fused to the Gal4(1-147) protein. The EBV and HVP EBNA-2 activation domains share a mixed proline-rich, negatively charged character with a striking conservation of positionally equivalent hydrophobic residues. The importance of the individual amino acids making up the Pro-Ile-Leu-Phe motif was examined by mutagenesis. 
Any alteration of these residues was found to reduce transactivation efficiency, with changes at the

  1. Spectrometric study of the folding process of i-motif-forming DNA sequences upstream of the c-kit transcription initiation site

    International Nuclear Information System (INIS)

    Bucek, Pavel; Gargallo, Raimundo; Kudrev, Andrei

    2010-01-01

    The c-kit oncogene shows a cytosine-rich DNA region upstream of the transcription initiation site which forms an i-motif structure at slightly acidic pH values (Bucek et al.). In the present study, the pH-induced formation of the i-motif-forming sequences 5'-CCC CTC CCT CGC GCC CGC CCG-3' (ckitC1, native), 5'-CCC TTC CCT TGT GCC CGC CCG-3' (ckitC2) and 5'-CCCTT CCC TTTTT CCC T CCC T-3' (ckitC3) was studied by spectroscopic techniques, such as UV molecular absorption and circular dichroism (CD), in tandem with two multivariate data analysis methods, the hard modelling-based matrix method and the soft modelling-based MCR-ALS approach. Use of the hard chemical modelling enabled us to propose the equilibrium model, which describes spectral changes as functions of solution acidity. Additionally, the intrinsic protonation constant, K_in, and the cooperativity parameters, ω_c and ω_a, were calculated from the fitting procedure of the coupled CD and molecular absorption spectra. In the case of ckitC2 and ckitC3, the hard model correctly reproduced the spectral variations observed experimentally. The results indicated that folding was accompanied by a cooperative process, i.e. the enhancement of protonated structure stability upon protonation. In contrast, unfolding was accompanied by an anticooperative process. Finally, folding of the native sequence, ckitC1, seemed to follow a more complex mechanism.
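
    The three oligonucleotides quoted above can be compared with a short script that lists their cytosine tracts. Intramolecular i-motifs are generally associated with four C-tracts, so counting runs of three or more cytosines is a common first-pass heuristic. It is used here only as an illustration and is not the spectroscopic analysis performed in the study; the function name and the minimum tract length of 3 are assumptions.

```python
import re

# Sequences as quoted in the abstract (5' to 3'); spaces are only visual grouping.
SEQUENCES = {
    "ckitC1": "CCC CTC CCT CGC GCC CGC CCG",
    "ckitC2": "CCC TTC CCT TGT GCC CGC CCG",
    "ckitC3": "CCCTT CCC TTTTT CCC T CCC T",
}


def c_tracts(seq, min_len=3):
    """Return every run of at least min_len consecutive cytosines."""
    compact = seq.replace(" ", "").upper()
    return [m.group() for m in re.finditer(r"C{%d,}" % min_len, compact)]


for name, seq in SEQUENCES.items():
    length = len(seq.replace(" ", ""))
    print(name, length, c_tracts(seq))
```

    By this heuristic all three 21-nt sequences carry four C-tracts, and they differ mainly in the composition of the loops between the tracts.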

  2. The conserved basic residues and the charged amino acid residues at the α-helix of the zinc finger motif regulate the nuclear transport activity of triple C2H2 zinc finger proteins

    Science.gov (United States)

    Lin, Chih-Ying

    2018-01-01

    Zinc finger (ZF) motifs on proteins are frequently recognized as a structure for DNA binding. Accumulated reports indicate that ZF motifs contain a nuclear localization signal (NLS) to facilitate the transport of ZF proteins into the nucleus. We investigated the critical factors that facilitate the nuclear transport of triple C2H2 ZF proteins. Three conserved basic residues (hot spots) were identified among the ZF sequences of triple C2H2 ZF proteins that reportedly have NLS function. Additional basic residues can be found on the α-helix of the ZFs. Using the ZF domain (ZFD) of Egr-1 as a template, various mutants were constructed and expressed in cells. The nuclear transport activity of various mutants was estimated by analyzing the proportion of protein localized in the nucleus. Mutation at any hot spot of the Egr-1 ZFs reduced the nuclear transport activity. Changes of the basic residues at the α-helical region of the second ZF (ZF2) of the Egr-1 ZFD abolished the NLS activity. However, this activity can be restored by substituting the acidic residues at the homologous positions of ZF1 or ZF3 with basic residues. The restored activity dropped again when the hot spots at ZF1 or the basic residues in the α-helix of ZF3 were mutated. The variations in nuclear transport activity are linked directly to the binding activity of the ZF proteins with importins. This study was extended to other triple C2H2 ZF proteins. SP1 and KLF families, similar to Egr-1, have charged amino acid residues at the second (α2) and the third (α3) positions of the α-helix. Replacing the amino acids at α2 and α3 with acidic residues reduced the NLS activity of the SP1 and KLF6 ZFD. The reduced activity can be restored by substituting the α3 with histidine at any SP1 and KLF6 ZFD. The results show again the interchangeable role of ZFs and charged residues in the α-helix in regulating the NLS activity of triple C2H2 ZF proteins. PMID:29381770

  3. Comparative mtDNA analyses of three sympatric macropodids from a conservation area on the Huon Peninsula, Papua New Guinea.

    Science.gov (United States)

    McGreevy, Thomas J; Dabek, Lisa; Husband, Thomas P

    2016-07-01

    Matschie's tree kangaroo (Dendrolagus matschiei), New Guinea pademelon (Thylogale browni), and small dorcopsis (Dorcopsulus vanheurni) are sympatric macropodid taxa, of conservation concern, that inhabit the Yopno-Urawa-Som (YUS) Conservation Area on the Huon Peninsula, Papua New Guinea. We sequenced three partial mitochondrial DNA (mtDNA) genes from the three taxa to (i) investigate network structure; and (ii) identify conservation units within the YUS Conservation Area. All three taxa displayed a similar pattern in the spatial distribution of their mtDNA haplotypes and the Urawa and Som rivers on the Huon may have acted as a barrier to maternal gene flow. Matschie's tree kangaroo and New Guinea pademelon within the YUS Conservation Area should be managed as single conservation units because mtDNA nucleotides were not fixed for a given geographic area. However, two distinct conservation units were identified for small dorcopsis from the two different mountain ranges within the YUS Conservation Area.

  4. Fox-2 Splicing Factor Binds to a Conserved Intron Motif to Promote Inclusion of Protein 4.1R Alternative Exon 16

    Energy Technology Data Exchange (ETDEWEB)

    Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo; Lersch, Robert A.; Gee, Sherry L.; Hou, Victor C.; Lo, Annie J.; Short, Sarah A.; Chasis, Joel A.; Winkelmann, John C.; Conboy, John G.

    2006-03-01

    Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding to the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences, resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate that mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.
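
    As an illustration of what locating UGCAUG elements in an intron sequence involves, the following is a minimal Python sketch, not the authors' method. The function name find_hexamer_sites and the intron fragment are hypothetical and made up for this example; the only detail taken from the abstract is the hexamer UGCAUG itself.

```python
def find_hexamer_sites(rna: str, motif: str = "UGCAUG") -> list[int]:
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    # Accept DNA-style input by converting T to U (an assumption of this sketch).
    rna = rna.upper().replace("T", "U")
    sites = []
    start = rna.find(motif)
    while start != -1:
        sites.append(start)
        # Resume one base further on so overlapping hits are not skipped.
        start = rna.find(motif, start + 1)
    return sites


# Hypothetical intron fragment, invented for illustration only.
intron = "GUAAGUUGCAUGCCUUUGCAUGAA"
print(find_hexamer_sites(intron))  # → [6, 16]
```

    Exact string matching is enough here because the reported element is a fixed hexamer; a degenerate motif would need a pattern or a position weight matrix instead.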

  5. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process by which newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals are needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif region. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence datasets. It also shows that the false positive removal procedure can help to identify true motifs even when only 20% of the input sequences contain true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors.
Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  6. Application of DNA barcodes in wildlife conservation in Tropical East Asia.

    Science.gov (United States)

    Wilson, John-James; Sing, Kong-Wah; Lee, Ping-Shin; Wee, Alison K S

    2016-10-01

    Over the past 50 years, Tropical East Asia has lost more biodiversity than any tropical region. Tropical East Asia is a megadiverse region with an acute taxonomic impediment. DNA barcodes are short standardized DNA sequences used for taxonomic purposes and have the potential to lessen the challenges of biodiversity inventory and assessments in regions where they are most needed. We reviewed DNA barcoding efforts in Tropical East Asia relative to other tropical regions. We suggest DNA barcodes (or metabarcodes from next-generation sequencers) may be especially useful for characterizing and connecting species-level biodiversity units in inventories encompassing taxa lacking formal description (particularly arthropods) and in large-scale, minimal-impact approaches to vertebrate monitoring and population assessments through secondary sources of DNA (invertebrate derived DNA and environmental DNA). We suggest interest and capacity for DNA barcoding are slowly growing in Tropical East Asia, particularly among the younger generation of researchers who can connect with the barcoding analogy and understand the need for new approaches to the conservation challenges being faced. © 2016 Society for Conservation Biology.

  7. Characterization of Bombyx mori mitochondrial transcription factor A, a conserved regulator of mitochondrial DNA.

    Science.gov (United States)

    Sumitani, Megumi; Kondo, Mari; Kasashima, Katsumi; Endo, Hitoshi; Nakamura, Kaoru; Misawa, Toshihiko; Tanaka, Hiromitsu; Sezutsu, Hideki

    2017-04-15

    In the present study, we initially cloned and characterized a mitochondrial transcription factor A (Tfam) homologue in the silkworm, Bombyx mori. Bombyx mori TFAM (BmTFAM) localized to mitochondria in cultured silkworm and human cells, and co-localized with mtDNA nucleoids in human HeLa cells. In an immunoprecipitation analysis, BmTFAM was found to associate with human mtDNA in mitochondria, indicating its feature as a non-specific DNA-binding protein. In spite of the low identity between BmTFAM and human TFAM (26.5%), the expression of BmTFAM rescued mtDNA copy number reductions and enlarged mtDNA nucleoids in HeLa cells, which were induced by human Tfam knockdown. Thus, BmTFAM compensates for the function of human TFAM in HeLa cells, demonstrating that the mitochondrial function of TFAM is highly conserved between silkworms and humans. BmTfam mRNA was strongly expressed in early embryos. Through double-stranded RNA (dsRNA)-based RNA interference (RNAi) in silkworm embryos, we found that the knockdown of BmTFAM reduced the amount of mtDNA and induced growth retardation at the larval stage. Collectively, these results demonstrate that BmTFAM is a highly conserved mtDNA regulator and may be a good candidate for investigating and modulating mtDNA metabolism in this model organism. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Characterization of the CrbS/R Two-Component System in Pseudomonas fluorescens Reveals a New Set of Genes under Its Control and a DNA Motif Required for CrbR-Mediated Transcriptional Activation

    Directory of Open Access Journals (Sweden)

    Edgardo Sepulveda

    2017-11-01

    The CrbS/R system is a two-component signal transduction system that regulates acetate utilization in Vibrio cholerae, P. aeruginosa, and P. entomophila. CrbS is a hybrid histidine kinase that belongs to a recently identified family, in which the signaling domain is fused to an SLC5 solute symporter domain through a STAC domain. Upon activation by CrbS, CrbR activates transcription of the acs gene, which encodes an acetyl-CoA synthase (ACS), and the actP gene, which encodes an acetate/solute symporter. In this work, we characterized the CrbS/R system in Pseudomonas fluorescens SBW25. Through the quantitative proteome analysis of different mutants, we were able to identify a new set of genes under its control, which play an important role during growth on acetate. These results led us to the identification of a conserved DNA motif in the putative promoter region of acetate-utilization genes in the Gammaproteobacteria that is essential for the CrbR-mediated transcriptional activation of genes under acetate-utilizing conditions. Finally, we took advantage of the existence of a second SLC5-containing two-component signal transduction system in P. fluorescens, CbrA/B, to demonstrate that the activation of the response regulator by the histidine kinase is not dependent on substrate transport through the SLC5 domain.

  9. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination.

    Science.gov (United States)

    Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David

    2006-01-01

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  10. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  11. Structural Diversity in Conserved Regions Like the DRY-Motif among Viral 7TM Receptors-A Consequence of Evolutionary Pressure?

    DEFF Research Database (Denmark)

    Mølleskov-Jensen, Ann-Sofie; Sparre-Ulrich, Alexander Hovard; Davis-Poynter, Nicholas

    2012-01-01

    Several herpes- and poxviruses have captured chemokine receptors from their hosts and modified these to their own benefit. The human and viral chemokine receptors belong to class A 7 transmembrane (TM) receptors which are characterized by several structural motifs like the DRY-motif in TM3...... and the C-terminal tail. In the DRY-motif, the arginine residue serves important purposes by being directly involved in G protein coupling. Interestingly, among the viral receptors there is a greater diversity in the DRY-motif compared to their endogenous receptor homologous. The C-terminal receptor tail...... constitutes another regulatory region that through a number of phosphorylation sites is involved in signaling, desensitization, and internalization. Also this region is more variable among virus-encoded 7TM receptors compared to human class A receptors. In this review we will focus on these two structural...

  12. Use of ancient sedimentary DNA as a novel conservation tool for high-altitude tropical biodiversity.

    Science.gov (United States)

    Boessenkool, Sanne; McGlynn, Gayle; Epp, Laura S; Taylor, David; Pimentel, Manuel; Gizaw, Abel; Nemomissa, Sileshi; Brochmann, Christian; Popp, Magnus

    2014-04-01

    Conservation of biodiversity may in the future increasingly depend upon the availability of scientific information to set suitable restoration targets. In traditional paleoecology, sediment-based pollen provides a means to define preanthropogenic impact conditions, but problems in establishing the exact provenance and ecologically meaningful levels of taxonomic resolution of the evidence are limiting. We explored the extent to which the use of sedimentary ancient DNA (sedaDNA) may complement pollen data in reconstructing past alpine environments in the tropics. We constructed a record of afro-alpine plants retrieved from DNA preserved in sediment cores from 2 volcanic crater sites in the Albertine Rift, eastern Africa. The record extended well beyond the onset of substantial anthropogenic effects on tropical mountains. To ensure high-quality taxonomic inference from the sedaDNA sequences, we built an extensive DNA reference library covering the majority of the afro-alpine flora, by sequencing DNA from taxonomically verified specimens. Comparisons with pollen records from the same sediment cores showed that plant diversity recovered with sedaDNA improved vegetation reconstructions based on pollen records by revealing both additional taxa and providing increased taxonomic resolution. Furthermore, combining the 2 measures assisted in distinguishing vegetation change at different geographic scales; sedaDNA almost exclusively reflects local vegetation, whereas pollen can potentially originate from a wide area that in highlands in particular can span several ecozones. Our results suggest that sedaDNA may provide information on restoration targets and the nature and magnitude of human-induced environmental changes, including in high conservation priority, biodiversity hotspots, where understanding of preanthropogenic impact (or reference) conditions is highly limited. © 2013 Society for Conservation Biology.

  13. Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

    Science.gov (United States)

    Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

    2015-11-24

    Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of the human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.

  14. The role of DNA barcodes in understanding and conservation of mammal diversity in southeast Asia.

    Directory of Open Access Journals (Sweden)

    Charles M Francis

    BACKGROUND: Southeast Asia is recognized as a region of very high biodiversity, much of which is currently at risk due to habitat loss and other threats. However, many aspects of this diversity, even for relatively well-known groups such as mammals, are poorly known, limiting ability to develop conservation plans. This study examines the value of DNA barcodes, sequences of the mitochondrial COI gene, to enhance understanding of mammalian diversity in the region and hence to aid conservation planning. METHODOLOGY AND PRINCIPAL FINDINGS: DNA barcodes were obtained from nearly 1900 specimens representing 165 recognized species of bats. All morphologically or acoustically distinct species, based on classical taxonomy, could be discriminated with DNA barcodes except four closely allied species pairs. Many currently recognized species contained multiple barcode lineages, often with deep divergence suggesting unrecognized species. In addition, most widespread species showed substantial genetic differentiation across their distributions. Our results suggest that mammal species richness within the region may be underestimated by at least 50%, and there are higher levels of endemism and greater intra-specific population structure than previously recognized. CONCLUSIONS: DNA barcodes can aid conservation and research by assisting field workers in identifying species, by helping taxonomists determine species groups needing more detailed analysis, and by facilitating the recognition of the appropriate units and scales for conservation planning.

  15. DNA barcoding as a tool for coral reef conservation

    Science.gov (United States)

    Neigel, J.; Domingo, A.; Stake, J.

    2007-09-01

    DNA Barcoding (DBC) is a method for taxonomic identification of animals that is based entirely on the 5' portion of the mitochondrial gene, cytochrome oxidase subunit I (COI-5). It can be especially useful for identification of larval forms or incomplete specimens lacking diagnostic morphological characters. DBC can also facilitate the discovery of species and help define “molecular taxonomic units” in problematic groups. However, DBC is not a panacea for coral reef taxonomy. In two of the most ecologically important groups on coral reefs, the Anthozoa and Porifera, COI-5 sequences have diverged too little to be diagnostic for all species. Other problems for DBC include paraphyly in mitochondrial gene trees and lack of differentiation between hybrids and their maternal ancestors. DBC also depends on the availability of databases of COI-5 sequences, which are still in early stages of development. A global effort to barcode all fish species has demonstrated the importance of large-scale coordination and is yielding promising results. Whether or not COI-5 by itself is sufficient for species assignments has become a contentious question; it is generally advantageous to use sequences from multiple loci.

  16. The NS1 polypeptide of the murine parvovirus minute virus of mice binds to DNA sequences containing the motif [ACCA]2-3.

    Science.gov (United States)

    Cotmore, S F; Christensen, J; Nüesch, J P; Tattersall, P

    1995-03-01

    A DNA fragment containing the minute virus of mice 3' replication origin was specifically coprecipitated in immune complexes containing the virally coded NS1, but not the NS2, polypeptide. Antibodies directed against the amino- or carboxy-terminal regions of NS1 precipitated the NS1-origin complexes, but antibodies directed against NS1 amino acids 284 to 459 blocked complex formation. Using affinity-purified histidine-tagged NS1 preparations, we have shown that the specific protein-DNA interaction is of moderate affinity, being stable in 0.1 M salt but rapidly lost at higher salt concentrations. In contrast, generalized (or nonspecific) DNA binding by NS1 could be demonstrated only in low salt. Addition of ATP or gamma S-ATP enhanced specific DNA binding by wild-type NS1 severalfold, but binding was lost under conditions which favored ATP hydrolysis. NS1 molecules with mutations in a critical lysine residue (amino acid 405) in the consensus ATP-binding site bound to the origin, but this binding could not be enhanced by ATP addition. DNase I protection assays carried out with wild-type NS1 in the presence of gamma S-ATP gave footprints which extended over 43 nucleotides on both DNA strands, from the middle of the origin bubble sequence to a position some 14 bp beyond the nick site. The DNA-binding site for NS1 was mapped to a 22-bp fragment from the middle of the 3' replication origin which contains the sequence ACCAACCA. This conforms to a reiterated motif (ACCA)2-3, which occurs, in more or less degenerate form, at many sites throughout the minute virus of mice genome (J. W. Bodner, Virus Genes 2:167-182, 1989). Insertion of a single copy of the sequence (ACCA)3 was shown to be sufficient to confer NS1 binding on an otherwise unrecognized plasmid fragment. The functions of NS1 in the viral life cycle are reevaluated in the light of this result.
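
    The reiterated motif (ACCA)2-3 can be written as a regular expression, which makes it concrete what "two to three tandem copies" means when scanning a sequence. The sketch below is illustrative only: the names ACCA_REPEAT and find_acca_repeats and the test fragment are hypothetical, it searches the given strand only, and it matches exact copies, so the degenerate forms mentioned in the abstract would not be found.

```python
import re

# Two or three tandem copies of ACCA. The quantifier is greedy, so a run of
# three copies is reported as one 12-nt hit rather than as a 2-copy hit.
ACCA_REPEAT = re.compile(r"(?:ACCA){2,3}")


def find_acca_repeats(dna: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in ACCA_REPEAT.finditer(dna.upper())]


# Hypothetical fragment, invented for illustration only.
fragment = "GGTACCAACCATTGACCAACCAACCAGG"
print(find_acca_repeats(fragment))
# → [(3, 'ACCAACCA'), (14, 'ACCAACCAACCA')]
```

    Scanning the reverse complement as well would be needed to cover both strands; that step is left out to keep the sketch short.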

  17. DNA barcoding applied to ex situ tropical amphibian conservation programme reveals cryptic diversity in captive populations.

    Science.gov (United States)

    Crawford, Andrew J; Cruz, Catalina; Griffith, Edgardo; Ross, Heidi; Ibáñez, Roberto; Lips, Karen R; Driskell, Amy C; Bermingham, Eldredge; Crump, Paul

    2013-11-01

    Amphibians constitute a diverse yet still incompletely characterized clade of vertebrates, in which new species are still being discovered and described at a high rate. Amphibians are also increasingly endangered, due in part to disease-driven threats of extinctions. As an emergency response, conservationists have begun ex situ assurance colonies for priority species. The abundance of cryptic amphibian diversity, however, may cause problems for ex situ conservation. In this study we used a DNA barcoding approach to survey mitochondrial DNA (mtDNA) variation in captive populations of 10 species of Neotropical amphibians maintained in an ex situ assurance programme at El Valle Amphibian Conservation Center (EVACC) in the Republic of Panama. We combined these mtDNA sequences with genetic data from presumably conspecific wild populations sampled from across Panama, and applied genetic distance-based and character-based analyses to identify cryptic lineages. We found that three of ten species harboured substantial cryptic genetic diversity within EVACC, and an additional three species harboured cryptic diversity among wild populations, but not in captivity. Ex situ conservation efforts focused on amphibians are therefore vulnerable to an incomplete taxonomy leading to misidentification among cryptic species. DNA barcoding may therefore provide a simple, standardized protocol to identify cryptic diversity readily applicable to any amphibian community. © 2012 John Wiley & Sons Ltd.
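
    The abstract does not say which genetic distance was used. As one simple example of a distance-based comparison of barcode sequences, the sketch below computes the uncorrected p-distance (the proportion of differing sites) between two aligned sequences. The function name p_distance and the example sequences are hypothetical, and skipping gaps and ambiguous bases is an assumption of this sketch rather than a detail from the study.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance between two pre-aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical aligned barcode fragments, invented for illustration only.
captive = "ACGTACGT"
wild = "ACGTACGA"
print(p_distance(captive, wild))  # → 0.125
```

    Barcoding studies often use model-corrected distances instead; the p-distance is shown only because it is the simplest instance of the idea.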

  18. Triple basepair changes within and adjacent to the conserved YY1 motif upstream of the U3 enhancer repeats of SL3-3 murine leukemia virus cause a small but significant shortening of latency of T-lymphoma induction

    International Nuclear Information System (INIS)

    Ma Shiliang; Lovmand, Jette; Soerensen, Annette Balle; Luz, Arne; Schmidt, Joerg; Pedersen, Finn Skou

    2003-01-01

    A highly conserved sequence upstream of the transcriptional enhancer in the U3 of murine leukemia viruses (MLVs) was reported to mediate negative regulation of their expression. In transient expression studies, negative regulation was reported to be conferred by coexpression of the transcription factor YY1, which binds to a motif in the upstream conserved region (UCR). To address the function of the UCR and its YY1-motif in an in vivo model of MLV-host interactions we introduced six consecutive triple basepair mutations into this region of the potent T-lymphomagenic SL3-3 MLV. We report that all mutants have retained their replication competence and that they all, like the SL3-3 wild type (wt), induce T-cell lymphomas when injected into newborn mice of the SWR strain. However, all mutants induced disease with slightly shorter latency periods than the wt SL3-3, suggesting that the YY1 motif as well as its immediate context in the UCR have a negative effect on the pathogenicity of the virus. This result may have implications for the design of retroviral vectors.

  19. Construction of a Holliday Junction in Small Circular DNA Molecules for Stable Motifs and Two-Dimensional Lattices.

    Science.gov (United States)

    Guo, Xin; Wang, Xue-Mei; Wei, Shuai; Xiao, Shou-Jun

    2018-04-12

    Design rules for DNA nanotechnology have been mostly learnt from using linear single-stranded (ss) DNA as the source material. For example, the core structure of a typical DAO (double crossover, antiparallel, odd half-turns) tile for assembling 2D lattices is constructed from only two linear ss-oligonucleotide scaffold strands, similar to two ropes making a square knot. Herein, a new type of coupled DAO (cDAO) tile and 2D lattices of small circular ss-oligonucleotides as scaffold strands and linear ss-oligonucleotides as staple strands are reported. A cDAO tile of cDAO-c64nt (c64nt: circular 64 nucleotides), shaped as a solid parallelogram, is constructed with a Holliday junction (HJ) at the center and two HJs at both poles of a c64nt; similarly, cDAO-c84nt, shaped as a crossed quadrilateral composed of two congruent triangles, is formed with a HJ at the center and four three-way junctions at the corners of a c84nt. Perfect 2D lattices were assembled from cDAO tiles: infinite nanostructures of nanoribbons, nanotubes, and nanorings, and finite nanostructures. The structural relationship between the visible lattices imaged by AFM and the corresponding invisible secondary and tertiary molecular structures of HJs, inclination angle of hydrogen bonds against the double-helix axis, and the chirality of the tile can be interpreted very well. This work could shed new light on DNA nanotechnology with unique circular tiles. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. The adeno-associated virus major regulatory protein Rep78-c-Jun-DNA motif complex modulates AP-1 activity

    International Nuclear Information System (INIS)

    Prasad, C. Krishna; Meyers, Craig; Zhan Dejin; You Hong; Chiriva-Internati, Maurizio; Mehta, Jawahar L.; Liu Yong; Hermonat, Paul L.

    2003-01-01

    Multiple epidemiologic studies show that adeno-associated virus (AAV) is negatively associated with cervical cancer (CX CA), a cancer which is positively associated with human papillomavirus (HPV) infection. Mechanisms for this correlation may be by Rep78's (AAV's major regulatory protein) ability to bind the HPV-16 p97 promoter DNA and inhibit transcription, to bind and interfere with the functions of the E7 oncoprotein of HPV-16, and to bind a variety of HPV-important cellular transcription factors such as Sp1 and TBP. c-Jun is another important cellular factor intimately linked to the HPV life cycle, as well as keratinocyte differentiation and skin development. Skin is the natural host tissue for both HPV and AAV. In this article it is demonstrated that Rep78 directly interacts with c-Jun, both in vitro and in vivo, as analyzed by Western blot, yeast two-hybrid cDNA, and electrophoretic mobility shift-supershift assay (EMSA supershift). Addition of anti-Rep78 antibodies inhibited the EMSA supershift. Investigating the biological implications of this interaction, Rep78 inhibited the c-Jun-dependent c-jun promoter in transient and stable chloramphenicol acetyl-transferase (CAT) assays. Rep78 also inhibited c-Jun-augmented c-jun promoter as well as the HPV-16 p97 promoter activity (also c-Jun regulated) in in vitro transcription assays in T47D nuclear extracts. Finally, the Rep78-c-Jun interaction mapped to the amino-half of Rep78. The ability of Rep78 to interact with c-Jun and down-regulate AP-1-dependent transcription suggests one more mechanism by which AAV may modulate the HPV life cycle and the carcinogenesis process.

  1. Improving the Conservation of Mediterranean Chondrichthyans: The ELASMOMED DNA Barcode Reference Library.

    Directory of Open Access Journals (Sweden)

    Alessia Cariani

    Full Text Available Cartilaginous fish are particularly vulnerable to anthropogenic stressors and environmental change because of their K-selected reproductive strategy. Accurate data from scientific surveys and landings are essential to assess conservation status and to develop robust protection and management plans. Currently available data are often incomplete or incorrect as a result of inaccurate species identifications, due to a high level of morphological stasis, especially among closely related taxa. Moreover, several diagnostic characters clearly visible in adult specimens are less evident in juveniles. Here we present results generated by the ELASMOMED Consortium, a regional network aiming to sample and DNA-barcode the Mediterranean Chondrichthyans with the ultimate goal to provide a comprehensive DNA barcode reference library. This library will support and improve the molecular taxonomy of this group and the effectiveness of management and conservation measures. We successfully barcoded 882 individuals belonging to 42 species (17 sharks, 24 batoids and one chimaera), including four endemic and several threatened ones. Morphological misidentifications were found across most orders, further confirming the need for a comprehensive DNA barcoding library as a valuable tool for the reliable identification of specimens in support of taxonomists who are reviewing current identification keys. Despite low intraspecific variation among their barcode sequences and reduced sample size, five species showed preliminary evidence of phylogeographic structure. Overall, the ELASMOMED initiative further emphasizes the key role accurate DNA barcoding libraries play in establishing reliable diagnostic species-specific features in otherwise taxonomically problematic groups for biodiversity management and conservation actions.

  2. The AT-Hook motif as a versatile minor groove anchor for promoting DNA binding of transcription factor fragments (Electronic supplementary information (ESI) available: peptide synthesis, full experimental procedures and analytical data of the peptides and products obtained. See DOI: 10.1039/c5sc01415h)

    OpenAIRE

    Rodríguez, Jéssica; Mosquera, Jesús; Couceiro, Jose R.; Vázquez, M. Eugenio; Mascareñas, José L.

    2015-01-01

    We report the development of chimeric DNA binding peptides comprising a DNA binding fragment of natural transcription factors (the basic region of a bZIP protein or a monomeric zinc finger module) and an AT-Hook peptide motif. The resulting peptide conjugates display high DNA affinity and excellent sequence selectivity. Furthermore, the AT-Hook motif also favors the cell internalization of the conjugates.

  3. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
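
The abstract above does not say which statistic MET uses to call a motif "statistically overrepresented". As a minimal sketch of one common approach, assuming a one-sided hypergeometric test, the following computes the probability of seeing at least the observed number of motif-containing genes in a gene set drawn from a background. The function name and all counts are hypothetical and are not taken from MET.

```python
from math import comb


def hypergeom_enrichment_pvalue(hits_in_set, set_size, hits_in_background, background_size):
    """One-sided hypergeometric p-value: P(X >= hits_in_set).

    Assumption (not from the MET abstract): the gene set is treated as a
    random draw of `set_size` genes, without replacement, from a background
    of `background_size` genes, of which `hits_in_background` carry the motif.
    """
    max_hits = min(set_size, hits_in_background)
    total = comb(background_size, set_size)
    tail = sum(
        comb(hits_in_background, k)
        * comb(background_size - hits_in_background, set_size - k)
        for k in range(hits_in_set, max_hits + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Hypothetical counts: 8 of 10 genes of interest carry the motif,
    # against 20 of 100 genes in the background.
    print(hypergeom_enrichment_pvalue(8, 10, 20, 100))
```

A small p-value here only indicates overrepresentation relative to the chosen background; a real analysis would also correct for testing many motifs.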

  4. Structure of the Cpf1 endonuclease R-loop complex after target DNA cleavage

    DEFF Research Database (Denmark)

    Stella, Stefano; Alcón, Pablo; Montoya, Guillermo

    2017-01-01

    involved in DNA unwinding to form a CRISPR RNA (crRNA)-DNA hybrid and a displaced DNA strand. The protospacer adjacent motif (PAM) is recognized by the PAM-interacting domain. The loop-lysine helix-loop motif in this domain contains three conserved lysine residues that are inserted in a dentate manner...... and the crRNA-DNA hybrid, avoiding DNA re-annealing. Mutations in key residues reveal a mechanism linking the PAM and DNA nuclease sites. Analysis of the Cpf1 structures proposes a singular working model of RNA-guided DNA cleavage, suggesting new avenues for redesign of Cpf1....

  5. Structure of a SUMO-binding-motif mimic bound to Smt3p–Ubc9p: conservation of a noncovalent Ubiquitin-like protein–E2 complex as a platform for selective interactions within a SUMO pathway

    Science.gov (United States)

    Duda, David M.; van Waardenburg, Robert C. A. M.; Borg, Laura A.; McGarity, Sierra; Nourse, Amanda; Waddell, M. Brett; Bjornsti, Mary-Ann; Schulman, Brenda A.

    2007-01-01

    Summary The SUMO ubiquitin-like proteins play regulatory roles in cell division, transcription, DNA repair, and protein subcellular localization. Paralleling other ubiquitin-like proteins, SUMO proteins are proteolytically processed to maturity, conjugated to targets by E1-E2-E3 cascades, and subsequently recognized by specific downstream effectors containing a SUMO-binding motif (SBM). SUMO and its E2 from the budding yeast S. cerevisiae, Smt3p and Ubc9p, are encoded by essential genes. Here we describe the 1.9 Å resolution crystal structure of a noncovalent Smt3p–Ubc9p complex. Unexpectedly, a heterologous portion of the crystallized complex derived from the expression construct mimics an SBM, and binds Smt3p in a manner resembling SBM binding to human SUMO family members. In the complex, Smt3p binds a surface distal from Ubc9's catalytic cysteine. The structure implies that a single molecule of Smt3p cannot bind concurrently to both the noncovalent binding site and the catalytic cysteine of a single Ubc9p molecule. However, formation of higher-order complexes can occur, where a single Smt3p covalently linked to one Ubc9p's catalytic cysteine also binds noncovalently to another molecule of Ubc9p. Comparison with other structures from the SUMO pathway suggests that formation of the noncovalent Smt3p–Ubc9p complex occurs mutually exclusively with many other Smt3p and Ubc9p interactions in the conjugation cascade. By contrast, high-resolution insights into how Smt3p–Ubc9p can also interact with downstream recognition machineries come from contacts with the SBM mimic. Interestingly, the overall architecture of the Smt3p–Ubc9p complex is strikingly similar to recent structures from the ubiquitin pathway. The results imply that noncovalent ubiquitin-like protein–E2 complexes are conserved platforms, which function as parts of larger assemblies involved in many protein post-translational regulatory pathways. PMID:17475278

  6. DNA barcoding and traditional taxonomy: an integrated approach for biodiversity conservation.

    Science.gov (United States)

    Sheth, Bhavisha P; Thaker, Vrinda S

    2017-07-01

    Biological diversity is depleting at an alarming rate. Additionally, a vast amount of biodiversity still remains undiscovered. Taxonomy has been serving the purpose of describing, naming, and classifying species for more than 250 years. DNA taxonomy and barcoding have accelerated the rate of this process, thereby providing a tool for conservation practice. DNA barcoding and traditional taxonomy have their own inherent merits and demerits. The synergistic use of both methods, in the form of integrative taxonomy, has the potential to contribute to biodiversity conservation in a pragmatic timeframe and overcome their individual drawbacks. In this review, we discuss the basics of both these methods of biological identification (traditional taxonomy and DNA barcoding), the technical advances in integrative taxonomy, and future trends. We also present a comprehensive compilation of published examples of integrative taxonomy that refer to nine topics within biodiversity conservation. Morphological and molecular species limits were observed to be congruent in ∼41% of the 58 source studies. The majority of the studies highlighted the description of cryptic diversity through the use of molecular data, whereas research areas like endemism, biological invasion, and threatened species were less discussed in the literature.

  7. A Conserved Acidic Motif in the N-Terminal Domain of Nitrate Reductase Is Necessary for the Inactivation of the Enzyme in the Dark by Phosphorylation and 14-3-3 Binding

    Science.gov (United States)

    Pigaglio, Emmanuelle; Durand, Nathalie; Meyer, Christian

    1999-01-01

    It has previously been shown that the N-terminal domain of tobacco (Nicotiana tabacum) nitrate reductase (NR) is involved in the inactivation of the enzyme by phosphorylation, which occurs in the dark (L. Nussaume, M. Vincentz, C. Meyer, J.P. Boutin, and M. Caboche [1995] Plant Cell 7: 611–621). The activity of a mutant NR protein lacking this N-terminal domain was no longer regulated by light-dark transitions. In this study smaller deletions were performed in the N-terminal domain of tobacco NR that removed protein motifs conserved among higher plant NRs. The resulting truncated NR-coding sequences were then fused to the cauliflower mosaic virus 35S RNA promoter and introduced in NR-deficient mutants of the closely related species Nicotiana plumbaginifolia. We found that the deletion of a conserved stretch of acidic residues led to an active NR protein that was more thermosensitive than the wild-type enzyme, but it was relatively insensitive to the inactivation by phosphorylation in the dark. Therefore, the removal of this acidic stretch seems to have the same effects on NR activation state as the deletion of the N-terminal domain. A hypothetical explanation for these observations is that a specific factor that impedes inactivation remains bound to the truncated enzyme. A synthetic peptide derived from this acidic protein motif was also found to be a good substrate for casein kinase II. PMID:9880364

  8. Hybrids of the bHLH and bZIP protein motifs display different DNA-binding activities in vivo vs. in vitro.

    Directory of Open Access Journals (Sweden)

    Hiu-Kwan Chow

    Full Text Available Minimalist hybrids comprising the DNA-binding domain of bHLH/PAS (basic-helix-loop-helix/Per-Arnt-Sim) protein Arnt fused to the leucine zipper (LZ) dimerization domain from bZIP (basic region-leucine zipper) protein C/EBP were designed to bind the E-box DNA site, CACGTG, targeted by bHLHZ (basic-helix-loop-helix-zipper) proteins Myc and Max, as well as the Arnt homodimer. The bHLHZ-like structure of ArntbHLH-C/EBP comprises the Arnt bHLH domain fused to the C/EBP LZ: i.e. swap of the 330 aa PAS domain for the 29 aa LZ. In the yeast one-hybrid assay (Y1H), transcriptional activation from the E-box was strong by ArntbHLH-C/EBP, and undetectable for the truncated ArntbHLH (PAS removed), as detected via readout from the HIS3 and lacZ reporters. In contrast, fluorescence anisotropy titrations showed affinities for the E-box with ArntbHLH-C/EBP and ArntbHLH comparable to other transcription factors (K(d) 148.9 nM and 40.2 nM, respectively), but only under select conditions that maintained folded protein. Although in vivo yeast results and in vitro spectroscopic studies for ArntbHLH-C/EBP targeting the E-box correlate well, the same does not hold for ArntbHLH. As circular dichroism confirms that ArntbHLH-C/EBP is a much more strongly alpha-helical structure than ArntbHLH, we conclude that the nonfunctional ArntbHLH in the Y1H must be due to misfolding, leading to the false negative that this protein is incapable of targeting the E-box. Many experiments, including protein design and selections from large libraries, depend on protein domains remaining well-behaved in the nonnative experimental environment, especially small motifs like the bHLH (60-70 aa). Interestingly, a short helical LZ can serve as a folding- and/or solubility-enhancing tag, an important device given the focus of current research on exploration of vast networks of biomolecular interactions.

  9. Functional characterization of a conserved archaeal viral operon revealing single-stranded DNA binding, annealing and nuclease activities

    DEFF Research Database (Denmark)

    Guo, Yang; Kragelund, Birthe Brandt; White, Malcolm F.

    2015-01-01

    encoding proteins of unknown function and forming an operon with ORF207 (gp19). SIRV2 gp17 was found to be a single-stranded DNA (ssDNA) binding protein different in structure from all previously characterized ssDNA binding proteins. Mutagenesis of a few conserved basic residues suggested a U......-shaped binding path for ssDNA. The recombinant gp18 showed an ssDNA annealing activity often associated with helicases and recombinases. To gain insight into the biological role of the entire operon, we characterized SIRV2 gp19 and showed it to possess a 5'→3' ssDNA exonuclease activity, in addition...... for rudiviruses and the close interaction among the ssDNA binding, annealing and nuclease proteins strongly point to a role of the gene operon in genome maturation and/or DNA recombination that may function in viral DNA replication/repair....

  10. DNA-binding properties of the Bacillus subtilis and Aeribacillus pallidus AC6 σ(D) proteins.

    Science.gov (United States)

    Sevim, Elif; Gaballa, Ahmed; Beldüz, A Osman; Helmann, John D

    2011-01-01

    σ(D) proteins from Aeribacillus pallidus AC6 and Bacillus subtilis bound specifically, albeit weakly, to promoter DNA even in the absence of core RNA polymerase. Binding required a conserved CG motif within the -10 element, and this motif is known to be recognized by σ region 2.4 and critical for promoter activity.

  11. DNA-Binding Properties of the Bacillus subtilis and Aeribacillus pallidus AC6 σD Proteins

    Science.gov (United States)

    Sevim, Elif; Gaballa, Ahmed; Beldüz, A. Osman; Helmann, John D.

    2011-01-01

    σD proteins from Aeribacillus pallidus AC6 and Bacillus subtilis bound specifically, albeit weakly, to promoter DNA even in the absence of core RNA polymerase. Binding required a conserved CG motif within the −10 element, and this motif is known to be recognized by σ region 2.4 and critical for promoter activity. PMID:21097624

  12. DNA-Binding Properties of the Bacillus subtilis and Aeribacillus pallidus AC6 σD Proteins

    OpenAIRE

    Sevim, Elif; Gaballa, Ahmed; Beldüz, A. Osman; Helmann, John D.

    2010-01-01

    σD proteins from Aeribacillus pallidus AC6 and Bacillus subtilis bound specifically, albeit weakly, to promoter DNA even in the absence of core RNA polymerase. Binding required a conserved CG motif within the −10 element, and this motif is known to be recognized by σ region 2.4 and critical for promoter activity.

  13. Identification of conserved amino acids in the herpes simplex virus type 1 UL8 protein required for DNA synthesis and UL52 primase interaction in the virus replisome.

    Science.gov (United States)

    Muylaert, Isabella; Zhao, Zhiyuan; Andersson, Torbjörn; Elias, Per

    2012-09-28

    We have used oriS-dependent transient replication assays to search for species-specific interactions within the herpes simplex virus replisome. Hybrid replisomes derived from herpes simplex virus type 1 (HSV-1) and equine herpesvirus type 1 (EHV-1) failed to support DNA replication in cells. Moreover, the replisomes showed a preference for their cognate origin of replication. The results demonstrate that the herpesvirus replisome behaves as a molecular machine relying on functionally important interactions. We then searched for functional interactions in the replisome context by subjecting HSV-1 UL8 protein to extensive mutagenesis. 52 mutants were made by replacing single or clustered charged amino acids with alanines. Four mutants showed severe replication defects. Mutant A23 exhibited a lethal phenotype, and mutants A49, A52 and A53 had temperature-sensitive phenotypes. Mutants A49 and A53 did not interact with UL52 primase as determined by co-immunoprecipitation experiments. Using GFP-tagged UL8, we demonstrate that all mutants were unable to support formation of ICP8-containing nuclear replication foci. Extended mutagenesis suggested that a highly conserved motif corresponding to mutant A49 serves an important role for establishing a physical contact between UL8 and UL52. The replication-defective mutations affected conserved amino acids, and similar phenotypes were observed when the corresponding mutations were introduced into EHV-1 UL8.

  14. Codon based co-occurrence network motifs in human mitochondria

    Directory of Open Access Journals (Sweden)

    Pramod Shinde

    2017-10-01

    Full Text Available The nucleotide polymorphism in human mitochondrial genome (mtDNA) tolled by codon position bias plays an indispensable role in human population dispersion and expansion. Herein, we constructed genome-wide nucleotide co-occurrence networks using a massive dataset consisting of five different geographical regions and around 3000 samples for each region. We developed a powerful network model to describe complex mitochondrial evolutionary patterns between codon and non-codon positions. It was interesting to report a different evolution of Asian genomes than those of the rest which is divulged by network motifs. We found evidence that mtDNA undergoes substantial amounts of adaptive evolution, a finding which was supported by a number of previous studies. The dominance of higher order motifs indicated the importance of long-range nucleotide co-occurrence in genomic diversity. Most notably, codon motifs apparently underpinned the preferences among codon positions for co-evolution which is probably highly biased during the origin of the genetic code. Our analyses manifested that codon position co-evolution is very well conserved across human sub-populations and independently maintained within human sub-populations implying the selective role of evolutionary processes on codon position co-evolution. Ergo, this study provided a framework to investigate cooperative genomic interactions which are critical in underlying complex mitochondrial evolution.

  15. The calmodulin-binding, short linear motif, NSCaTE is conserved in L-type channel ancestors of vertebrate Cav1.2 and Cav1.3 channels.

    Directory of Open Access Journals (Sweden)

    Valentina Taiakina

    Full Text Available NSCaTE is a short linear motif of (xWxxx(I or L)xxxx), composed of residues with a high helix-forming propensity within a mostly disordered N-terminus that is conserved in L-type calcium channels from protostome invertebrates to humans. NSCaTE is an optional, lower affinity and calcium-sensitive binding site for calmodulin (CaM) which competes for CaM binding with a more ancient, C-terminal IQ domain on L-type channels. CaM bound to N- and C-terminal tails serve as dual detectors to changing intracellular Ca(2+) concentrations, promoting calcium-dependent inactivation of L-type calcium channels. NSCaTE is absent in some arthropod species, and is also lacking in vertebrate L-type isoforms, Cav1.1 and Cav1.4 channels. The pervasiveness of a methionine just downstream from NSCaTE suggests that L-type channels could generate alternative N-termini lacking NSCaTE through the choice of translational start sites. A long N-terminus with an NSCaTE motif in L-type calcium channel homolog LCav1 from pond snail Lymnaea stagnalis has a faster calcium-dependent inactivation than a shortened N-terminus lacking NSCaTE. NSCaTE effects are present in low concentrations of internal buffer (0.5 mM EGTA), but disappear in high buffer conditions (10 mM EGTA). Snail and mammalian NSCaTE have an alpha-helical propensity upon binding Ca(2+)-CaM and can saturate both CaM N-terminal and C-terminal domains in the absence of a competing IQ motif. NSCaTE evolved in ancestors of the first animals with internal organs for promoting a more rapid, calcium-sensitive inactivation of L-type channels.
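
The consensus quoted in this abstract, xWxxx(I or L)xxxx with x standing for any residue, maps directly onto a regular expression. The following is a minimal sketch of scanning a protein sequence for NSCaTE-like matches. The function name and the test peptide are hypothetical, and a pattern match says nothing about calmodulin binding in itself.

```python
import re

# x W x x x [I or L] x x x x, where x is any single-letter residue code.
# The lookahead lets overlapping candidate matches be reported as well.
NSCATE_PATTERN = re.compile(r"(?=([A-Z]W[A-Z]{3}[IL][A-Z]{4}))")


def find_nscate_like(sequence):
    """Return (0-based start, matched 10-mer) for every NSCaTE-like hit."""
    return [(m.start(), m.group(1)) for m in NSCATE_PATTERN.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical peptide, invented for illustration; not a real channel sequence.
    print(find_nscate_like("MAWQAAIRRAKGG"))
```

Because the consensus is short and degenerate, chance matches are expected in large sequence sets, so hits would normally be filtered by position (the N-terminus) and by conservation across homologs.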

  16. Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

    Science.gov (United States)

    Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

    2012-01-01

    Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) is an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.

  17. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    Science.gov (United States)

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  18. DNA testing for parentage verification in a conservation nucleus of Pantaneiro horse

    Directory of Open Access Journals (Sweden)

    Fabiana Tavares Pires de Souza Sereno

    2008-01-01

    Full Text Available We investigated the genealogy of the in situ conservation nucleus of the Pantaneiro horse using DNA microsatellites by evaluating 101 horses, the group consisting of 71 adult horses (3 stallions, 40 male and 31 mares) and 27 foals (14 colts and 13 fillies). Genomic DNA was extracted from hair roots and genotyped using 12 microsatellite markers (AHT4, AHT5, ASB2, ASB17, ASB23, HMS3, HMS6, HMS7, HTG4, HTG10, LEX33 and VHL20). The number of alleles per locus varied from 6 to 13, with a mean of 7.8 and the expected heterozygosity ranged from 0.544 to 0.734 (mean 0.644). The VHL20, ASB2, HTG10, ASB23 markers had a high (> 0.8) polymorphism information content and the total exclusion probability of the 12 microsatellite loci was 0.99. The genealogical study of the Pantaneiro horse using genetic markers was efficient in detecting mistakes during paternity and maternity designation and is an important tool which can be used together with traditional systems of animal identification. The use of genetic markers is recommended in the systematic control of the genealogical registrations and conservation plans to improve genetic aspects of the Pantaneiro horse.
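
The per-locus statistics named in this abstract (expected heterozygosity, polymorphism information content and a combined exclusion probability) have standard textbook definitions. The following is a minimal sketch of those definitions. The abstract does not state which estimators or software the authors used, the allele frequencies below are hypothetical, and the combined exclusion formula assumes independent loci.

```python
def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for the allele frequencies at one locus."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphism_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1.0 - homozygosity - cross_term


def combined_exclusion_probability(per_locus):
    """1 - product(1 - PE_l), assuming the loci are independent."""
    not_excluded = 1.0
    for pe in per_locus:
        not_excluded *= 1.0 - pe
    return 1.0 - not_excluded


if __name__ == "__main__":
    # Hypothetical allele frequencies for a single six-allele locus.
    freqs = [0.30, 0.25, 0.20, 0.15, 0.05, 0.05]
    print(expected_heterozygosity(freqs))
    print(polymorphism_information_content(freqs))
```

PIC is never larger than the expected heterozygosity for the same frequencies, which is why a locus can show moderate heterozygosity and still fall below a PIC threshold.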

  19. Conservation

    NARCIS (Netherlands)

    Noteboom, H.P.

    1985-01-01

    The IUCN/WWF Plants Conservation Programme 1984 — 1985. World Wildlife Fund chose plants to be the subject of their fund-raising campaign in the period 1984 — 1985. The objectives were to: 1. Use information techniques to achieve the conservation objectives of the Plants Programme – to save plants;

  20. Conservation.

    Science.gov (United States)

    National Audubon Society, New York, NY.

    This set of teaching aids consists of seven Audubon Nature Bulletins, providing the teacher and student with informational reading on various topics in conservation. The bulletins have these titles: Plants as Makers of Soil, Water Pollution Control, The Ground Water Table, Conservation--To Keep This Earth Habitable, Our Threatened Air Supply,…

  1. Characterization of Staphylococcus aureus Primosomal DnaD Protein: Highly Conserved C-Terminal Region Is Crucial for ssDNA and PriA Helicase Binding but Not for DnaA Protein-Binding and Self-Tetramerization.

    Directory of Open Access Journals (Sweden)

    Yen-Hua Huang

    Full Text Available The role of DnaD in the recruitment of replicative helicase has been identified. However, knowledge of the DNA, PriA, and DnaA binding mechanism of this protein for the DnaA- and PriA-directed replication primosome assemblies is limited. We characterized the DNA-binding properties of DnaD from Staphylococcus aureus (SaDnaD) and analyzed its interactions with SaPriA and SaDnaA. The gel filtration chromatography analysis of purified SaDnaD and its deletion mutant proteins (SaDnaD1-195, SaDnaD1-200 and SaDnaD1-204) showed a stable tetramer in solution. This finding indicates that the C-terminal region aa 196-228 is not crucial for SaDnaD oligomerization. SaDnaD forms distinct complexes with ssDNA of different lengths. In fluorescence titrations, SaDnaD bound to ssDNA with a binding-site size of approximately 32 nt. A stable complex of SaDnaD1-195, SaDnaD1-200, and SaDnaD1-204 with ssDNA dT40 was undetectable, indicating that the C-terminal region of SaDnaD (particularly aa 205-228) is crucial for ssDNA binding. The SPR results revealed that SaDnaD1-195 can interact with SaDnaA but not with SaPriA, which may indicate that DnaD has different binding sites for PriA and DnaA. Both SaDnaD and SaDnaDY176A mutant proteins, but not SaDnaD1-195, can significantly stimulate the ATPase activity of SaPriA. Hence, the stimulation effect mainly resulted from direct contact within the protein-protein interaction, not via the DNA-protein interaction. Kinetic studies revealed that the SaDnaD-SaPriA interaction increases the Vmax of the SaPriA ATPase fivefold without significantly affecting the Km. These results indicate that the conserved C-terminal region is crucial for ssDNA and PriA helicase binding, but not for DnaA protein-binding and self-tetramerization.
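
The kinetic statement in this abstract (a fivefold higher Vmax with an unchanged Km) has a simple consequence if the ATPase is assumed to follow plain Michaelis-Menten kinetics: the rate rises by the same factor of five at every substrate concentration. The sketch below illustrates that point. The Vmax, Km and ATP concentrations are hypothetical placeholders, not values from the study.

```python
def michaelis_menten_rate(substrate, vmax, km):
    """Initial rate v = Vmax * [S] / (Km + [S])."""
    return vmax * substrate / (km + substrate)


if __name__ == "__main__":
    # Hypothetical parameters in arbitrary units, chosen only for illustration.
    km = 0.2
    vmax_alone = 1.0
    vmax_stimulated = 5.0 * vmax_alone  # "fivefold" as stated in the abstract

    for atp in (0.02, 0.2, 2.0):
        fold = michaelis_menten_rate(atp, vmax_stimulated, km) / michaelis_menten_rate(
            atp, vmax_alone, km
        )
        print(atp, fold)
```

A change in Km would instead make the fold change depend on substrate concentration, so a constant ratio across concentrations is the signature of a pure Vmax effect under this model.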

  2. Altered response hierarchy and increased T-cell breadth upon HIV-1 conserved element DNA vaccination in macaques.

    Directory of Open Access Journals (Sweden)

    Viraj Kulkarni

    Full Text Available HIV sequence diversity and potential decoy epitopes are hurdles in the development of an effective AIDS vaccine. A DNA vaccine candidate comprising highly conserved p24(gag) elements (CE) induced robust immunity in all 10 vaccinated macaques, whereas full-length gag DNA vaccination elicited responses to these conserved elements in only 5 of 11 animals, targeting fewer CE per animal. Importantly, boosting CE-primed macaques with DNA expressing full-length p55(gag) increased both magnitude of CE responses and breadth of Gag immunity, demonstrating alteration of the hierarchy of epitope recognition in the presence of pre-existing CE-specific responses. Inclusion of a conserved element immunogen provides a novel and effective strategy to broaden responses against highly diverse pathogens by avoiding decoy epitopes, while focusing responses to critical viral elements for which few escape pathways exist.

  3. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB) is the source of this information; however, present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the Distributed Annotation System (DAS) protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  4. Conservative fragments in bacterial 16S rRNA genes and primer design for 16S ribosomal DNA amplicons in metagenomic studies

    KAUST Repository

    Wang, Yong; Qian, Pei-Yuan

    2009-01-01

    Bacterial 16S ribosomal DNA (rDNA) amplicons have been widely used in the classification of uncultured bacteria inhabiting environmental niches. Primers targeting conservative regions of the rDNAs are used to generate amplicons of variant regions

  5. Paradoxical DNA repair and peroxide resistance gene conservation in Bacillus pumilus SAFR-032.

    Directory of Open Access Journals (Sweden)

    Jason Gioia

    BACKGROUND: Bacillus spores are notoriously resistant to unfavorable conditions such as UV radiation, gamma-radiation, H2O2, desiccation, chemical disinfection, or starvation. Bacillus pumilus SAFR-032 survives standard decontamination procedures of the Jet Propulsion Lab spacecraft assembly facility, and both spores and vegetative cells of this strain exhibit elevated resistance to UV radiation and H2O2 compared to other Bacillus species. PRINCIPAL FINDINGS: The genome of B. pumilus SAFR-032 was sequenced and annotated. Lists of genes relevant to DNA repair and the oxidative stress response were generated and compared to B. subtilis and B. licheniformis. Differences in conservation of genes, gene order, and protein sequences are highlighted because they potentially explain the extreme resistance phenotype of B. pumilus. The B. pumilus genome includes genes not found in B. subtilis or B. licheniformis and conserved genes with sequence divergence, but paradoxically lacks several genes that function in UV or H2O2 resistance in other Bacillus species. SIGNIFICANCE: This study identifies several candidate genes for further research into UV and H2O2 resistance. These findings will help explain the resistance of B. pumilus and are applicable to understanding sterilization survival strategies of microbes.

  6. Inferring Pongo conservation units: a perspective based on microsatellite and mitochondrial DNA analyses.

    Science.gov (United States)

    Kanthaswamy, Sreetharan; Kurushima, Jennifer D; Smith, David Glenn

    2006-10-01

    In order to define evolutionarily significant and management units (ESUs and MUs) among subpopulations of Sumatran (Pongo pygmaeus abelii) and Bornean (P. p. pygmaeus) orangutans, we determined their genetic relationships. We analyzed partial sequences of four mitochondrial genes and nine autosomal microsatellite loci of 70 orangutans to test two hypotheses regarding the population structure within Borneo and the genetic distinction between Bornean and Sumatran orangutans. Our data show that Bornean orangutans consist of two genetic clusters: the western and eastern clades. Each taxon exhibits relatively distinct mtDNA and nuclear genetic distributions that are likely attributable to genetic drift. These groups, however, do not warrant designations as separate conservation MUs because they demonstrate no demographic independence and only moderate genetic differentiation. Our findings also indicate relatively high levels of overall genetic diversity within Borneo, suggesting that observed habitat fragmentation and erosion during the last three decades had limited influence on genetic variability. Because the mtDNA of Bornean and Sumatran orangutans are not strictly reciprocally monophyletic, we recommend treating these populations as separate MUs and discontinuing inter-island translocation of animals unless absolutely necessary.

  7. HIV-1 p24(gag) derived conserved element DNA vaccine increases the breadth of immune response in mice.

    Directory of Open Access Journals (Sweden)

    Viraj Kulkarni

    Viral diversity is considered a major impediment to the development of an effective HIV-1 vaccine. Despite this diversity, certain protein segments are nearly invariant across the known HIV-1 Group M sequences. We developed immunogens based on the highly conserved elements from the p24(gag) region according to two principles: the immunogen must (i) include strictly conserved elements of the virus that cannot mutate readily, and (ii) exclude both HIV regions capable of mutating without limiting virus viability, and also immunodominant epitopes located in variable regions. We engineered two HIV-1 p24(gag) DNA immunogens that express 7 highly Conserved Elements (CE) of 12-24 amino acids in length and differ by only 1 amino acid in each CE ('toggle site'), together covering >99% of the HIV-1 Group M sequences. Altering intracellular trafficking of the immunogens changed protein localization, stability, and also the nature of elicited immune responses. Immunization of C57BL/6 mice with p55(gag) DNA induced poor, CD4(+) mediated cellular responses, to only 2 of the 7 CE; in contrast, vaccination with p24CE DNA induced cross-clade reactive, robust T cell responses to 4 of the 7 CE. The responses were multifunctional and composed of both CD4(+) and CD8(+) T cells with mature cytotoxic phenotype. These findings provide a method to increase immune response to universally conserved Gag epitopes, using the p24CE immunogen. p24CE DNA vaccination induced humoral immune responses similar in magnitude to those induced by p55(gag), which recognize the virus encoded p24(gag) protein. The inclusion of DNA immunogens composed of conserved elements is a promising vaccine strategy to induce broader immunity by CD4(+) and CD8(+) T cells to additional regions of Gag compared to vaccination with p55(gag) DNA, achieving maximal cross-clade reactive cellular and humoral responses.

  8. Two Tetrahymena G-DNA-binding proteins, TGP1 and TGP3, share novel motifs and may play a role in micronuclear division

    OpenAIRE

    Lu, Quan; Henderson, Eric

    2000-01-01

    G-DNA is a four-stranded DNA structure with diverse putative biological roles. We have previously purified and cloned a novel G-DNA-binding protein TGP1 from the ciliate Tetrahymena thermophila. Here we report the molecular cloning of TGP3, an additional G-DNA-binding protein from the same organism. The TGP3 cDNA encodes a 365 amino acid protein that is homologous to TGP1 (34% identity and 44% similarity). The proteins share a sequence pattern that contains two novel repetitive and homologous...

  9. Extreme conservation of the psaA/psaB intercistronic spacer reveals a translational motif coincident with the evolution of land plants.

    Science.gov (United States)

    Peredo, Elena L; Les, Donald H; King, Ursula M; Benoit, Lori K

    2012-12-01

    Although chloroplast transcriptional and translational mechanisms were derived originally from prokaryote endosymbionts, chloroplasts retain comparatively few genes as a consequence of the overall transfer to the nucleus of functions associated formerly with prokaryotic genomes. Various modifications reflect other evolutionary shifts toward eukaryotic regulation such as posttranscriptional transcript cleavage with individually processed cistrons in operons and gene expression regulated by nuclear-encoded sigma factors. We report a notable exception for the psaA-psaB-rps14 operon of land plant (embryophyte) chloroplasts, where the first two cistrons are separated by a spacer region to which no significant role had been attributed. We infer an important function of this region, as indicated by the conservation of identical, structurally significant sequences across embryophytes and their ancestral protist lineages, which diverged some 0.5 billion years ago. The psaA/psaB spacers of embryophytes and their progenitors exhibit few sequence and length variants, with most modeled transcripts resolving the same secondary structure: a loop with projecting Shine-Dalgarno site and well-defined stem that interacts with adjacent coding regions to sequester the psaB start codon. Although many functions of the original endosymbiont have been usurped by nuclear genes or interactions, conserved functional elements of embryophyte psaA/psaB spacers provide compelling evidence that translation of psaB is regulated here by a cis-acting mechanism comparable to those common in prokaryotes. Modeled transcripts also indicate that spacer variants in some plants (e.g., aquatic genus Najas) potentially reflect ecological adaptations to facilitate temperature-regulated translation of psaB.

  10. Defective recovery of semi-conservative DNA synthesis in xeroderma pigmentosum cells following split-dose ultraviolet irradiation

    International Nuclear Information System (INIS)

    Moustacchi, E.; Ehmann, U.K.; Friedberg, E.C.

    1979-01-01

    In normal human fibroblasts the authors observe an enhancement of the recovery of the rate of semi-conservative DNA synthesis after split-dose UV-irradiation relative to a single total UV dose. The enhanced recovery is totally absent in both a xeroderma pigmentosum variant line and two xeroderma pigmentosum lines belonging to complementation groups A and C. (Auth.)

  11. The tyrosine Y250(2.39) in Frizzled 4 defines a conserved motif important for structural integrity of the receptor and recruitment of Disheveled.

    Science.gov (United States)

    Strakova, Katerina; Matricon, Pierre; Yokota, Chika; Arthofer, Elisa; Bernatik, Ondrej; Rodriguez, David; Arenas, Ernest; Carlsson, Jens; Bryja, Vitezslav; Schulte, Gunnar

    2017-10-01

    Frizzleds (FZDs) are unconventional G protein-coupled receptors, which activate diverse intracellular signaling pathways via the phosphoprotein Disheveled (DVL) and heterotrimeric G proteins. The interaction interplay of FZDs with DVL and G proteins is complex, involves different regions of FZD and the potential dynamics are poorly understood. In the present study, we aimed to characterize the function of a highly conserved tyrosine (Y250(2.39)) in the intracellular loop 1 (IL1) of human FZD(4). We have found Y250(2.39) to be crucial for DVL2 interaction and DVL2 translocation to the plasma membrane. Mutant FZD(4)-Y250(2.39)F, impaired in DVL2 binding, was defective in both β-catenin-dependent and β-catenin-independent WNT signaling induced in Xenopus laevis embryos. The same mutant maintained interaction with the heterotrimeric G proteins Gα(12) and Gα(13) and was able to mediate WNT-induced G protein dissociation and G protein-dependent YAP/TAZ signaling. We conclude from modeling and dynamics simulation efforts that Y250(2.39) is important for the structural integrity of the FZD-DVL, but not for the FZD-G protein interface and hypothesize that the interaction network of Y250(2.39) and H348(4.46) plays a role in specifying downstream signaling pathways induced by the receptor. Copyright © 2017. Published by Elsevier Inc.

  12. Conserved Patterns of Microbial Immune Escape: Pathogenic Microbes of Diverse Origin Target the Human Terminal Complement Inhibitor Vitronectin via a Single Common Motif.

    Directory of Open Access Journals (Sweden)

    Teresia Hallström

    Pathogenicity of many microbes relies on their capacity to resist innate immunity, and to survive and persist in an immunocompetent human host, microbes have developed highly efficient and sophisticated complement evasion strategies. Here we show that different human pathogens including Gram-negative and Gram-positive bacteria, as well as the fungal pathogen Candida albicans, acquire the human terminal complement regulator vitronectin to their surface. By using truncated vitronectin fragments we found that all analyzed microbial pathogens (n = 13) bound human vitronectin via the same C-terminal heparin-binding domain (amino acids 352-374). This specific interaction leaves the terminal complement complex (TCC) regulatory region of vitronectin accessible, allowing inhibition of C5b-7 membrane insertion and C9 polymerization. Vitronectin complexed with the various microbes and corresponding proteins was thus functionally active and inhibited complement-mediated C5b-9 deposition. Taken together, diverse microbial pathogens expressing different structurally unrelated vitronectin-binding molecules interact with host vitronectin via the same conserved region to allow versatile control of the host innate immune response.

  13. Possible conservation units of the sun bear (Helarctos malayanus) in Sarawak based on variation of mtDNA control region.

    Science.gov (United States)

    Onuma, Manabu; Suzuki, Masatsugu; Ohtaishi, Noriyuki

    2006-11-01

    The mitochondrial DNA control region of the sun bear (Helarctos malayanus) was sequenced using 21 DNA samples collected from confiscated sun bears to identify conservation units, such as evolutionarily significant units and management units, in Sarawak, Borneo Island. A total of 10 haplotypes were observed, indicating the presence of at least two lineages in the sun bear population in Sarawak. Presumably, these two lineages could represent evolutionarily significant units. However, the geographical distributions of the two lineages remained unknown due to the lack of information regarding the exact capture locations of the confiscated sun bears. It is essential to elucidate the geographical distributions of these lineages in order to create a proper conservation plan for the sun bears in Sarawak. Therefore, further studies examining the haplotype distributions using DNA samples from known localities are essential.

  14. cDNA cloning of the basement membrane chondroitin sulfate proteoglycan core protein, bamacan: a five domain structure including coiled-coil motifs

    DEFF Research Database (Denmark)

    Wu, R R; Couchman, J R

    1997-01-01

    Basement membranes contain several proteoglycans, and those bearing heparan sulfate glycosaminoglycans such as perlecan and agrin usually predominate. Most mammalian basement membranes also contain chondroitin sulfate, and a core protein, bamacan, has been partially characterized. We have now....... The protein sequence has low overall homology, apart from very small NH2- and COOH-terminal motifs. At the junctions between the distal globular domains and the coiled-coil regions lie glycosylation sites, with up to three N-linked oligosaccharides and probably three chondroitin chains. Three other Ser...

  15. CompariMotif: quick and easy comparisons of sequence motifs.

    Science.gov (United States)

    Edwards, Richard J; Davey, Norman E; Shields, Denis C

    2008-05-15

    CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/
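
    As a rough illustration of what a motif-motif comparison scored by shared information content can look like, the sketch below slides one simple regular-expression motif along another and sums a per-position score wherever the two motifs are compatible. It is not CompariMotif's actual algorithm or scoring scheme: the function names, the restriction to fixed residues, bracketed classes and `.` wildcards, and the scaling (fixed residue = 1.0, wildcard = 0.0) are all assumptions made for this example.

```python
import math
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def parse_motif(motif):
    """Split a simple regex motif into per-position sets of allowed residues.

    Only fixed residues (R), bracketed classes ([KR]) and wildcards (.)
    are handled; this is a deliberately small subset of regex syntax.
    """
    positions = []
    for token in re.findall(r"\[[A-Z]+\]|[A-Z]|\.", motif):
        if token == ".":
            positions.append(set(AMINO_ACIDS))
        else:
            positions.append(set(token.strip("[]")))
    return positions

def information_content(allowed):
    """Hypothetical scaling: 1.0 for a fixed residue, 0.0 for a wildcard."""
    return 1.0 - math.log(len(allowed)) / math.log(len(AMINO_ACIDS))

def shared_information(motif_a, motif_b):
    """Best summed score over all ungapped offsets of motif_b against motif_a."""
    a, b = parse_motif(motif_a), parse_motif(motif_b)
    best = 0.0
    for offset in range(-(len(b) - 1), len(a)):
        score, compatible = 0.0, True
        for i, allowed_b in enumerate(b):
            j = i + offset
            if 0 <= j < len(a):
                if not (a[j] & allowed_b):
                    compatible = False  # overlapping positions share no residue
                    break
                # Credit only the information both positions carry.
                score += min(information_content(a[j]),
                             information_content(allowed_b))
        if compatible:
            best = max(best, score)
    return best
```

    Under these assumptions, an exact match such as `RGD` against `RGD` scores one unit per position, while the degenerate variant `[KR]GD` scores slightly less against `RGD` because its first position carries less information.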

  16. Mutational analysis of the RecJ exonuclease of Escherichia coli: identification of phosphoesterase motifs.

    Science.gov (United States)

    Sutera, V A; Han, E S; Rajman, L A; Lovett, S T

    1999-10-01

    The recJ gene, identified in Escherichia coli, encodes a Mg(2+)-dependent 5'-to-3' exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding proteins similar to RecJ are found in some eubacteria, including Bacillus and Helicobacter, and in the archaea. Among this divergent set of sequences, seven conserved motifs emerge. We demonstrate here that amino acids within six of these motifs are essential for both the biochemical and genetic functions of E. coli RecJ. These motifs may define interactions with Mg(2+) ions or substrate DNA. A large family of proteins more distantly related to RecJ is present in archaea, eubacteria, and eukaryotes, including a hypothetical protein in the MgPa adhesin operon of Mycoplasma, a domain of putative polyA polymerases in Synechocystis and Aquifex, PRUNE of Drosophila, and an exopolyphosphatase (PPX1) of Saccharomyces cerevisiae. Because these six RecJ motifs are shared between exonucleases and exopolyphosphatases, they may constitute an ancient phosphoesterase domain now found in all kingdoms of life.

  17. Sequence-based Screening for Rare Enzymes: New Insights into the World of AMDases Reveal a Conserved Motif and 58 Novel Enzymes Clustering in Eight Distinct Families.

    Directory of Open Access Journals (Sweden)

    Janine Maimanakos

    2016-08-01

    Arylmalonate-Decarboxylases (AMDases, EC 4.1.1.76) are very rare and mostly underexplored enzymes. Currently only four known and biochemically characterized representatives exist. However, their ability to decarboxylate α-disubstituted malonic acid derivatives to optically pure products without cofactors makes them attractive and promising candidates for use as biocatalysts in industrial processes. Until now, AMDases could not be separated from other members of the aspartate/glutamate racemase superfamily based on their gene sequences. Within this work, a search algorithm was developed that enables a reliable prediction of AMDase activity for potential candidates. Based on specific sequence patterns and screening methods, 58 novel AMDase candidate genes could be identified in this work. Thereby, AMDases with the conserved sequence pattern of Bordetella bronchiseptica’s prototype appeared to be limited to the classes of Alpha-, Beta- and Gammaproteobacteria. Amino acid homologies and comparison of gene surrounding sequences enabled the classification of eight enzyme clusters. Particularly striking is the accumulation of genes coding for different transporters of the TTT family, TRAP transporters and ABC transporters as well as genes coding for mandelate racemases/muconate lactonizing enzymes that might be involved in substrate uptake or degradation of AMDase products. Further, three novel AMDases were characterized which showed a high enantiomeric excess (>99%) of the (R)-enantiomer of flurbiprofen. These are the recombinant AmdA and AmdV from Variovorax sp. strains HH01 and HH02, originating from soil, and AmdP from Polymorphum gilvum, found by a database search. Altogether our findings give new insights into the class of AMDases and reveal many previously unknown enzyme candidates with high potential for bioindustrial processes.

  18. Staphylococcus aureus MurC participates in L-alanine recognition via histidine 343, a conserved motif in the shallow hydrophobic pocket.

    Science.gov (United States)

    Kurokawa, Kenji; Nishida, Satoshi; Ishibashi, Mihoko; Mizumura, Hikaru; Ueno, Kohji; Yutsudo, Takashi; Maki, Hideki; Murakami, Kazuhisa; Sekimizu, Kazuhisa

    2008-03-01

    UDP-N-acetylmuramic acid:L-alanine ligase, which is encoded by the murC gene, is indispensable for bacterial peptidoglycan biosynthesis and an important target for the development of antibacterial agents. The structure of MurC ligase with substrates has been described; however, little validation via studying the effects of mutations on the structure of MurC has been performed. In this study, we carried out a functional in vitro and in vivo characterization of Staphylococcus aureus MurCH343Y protein that has a temperature-sensitive mutation of a conserved residue in the predicted shallow hydrophobic pocket that holds a short L-alanine side chain. Purified H343Y and wild-type MurC had K(m) values for L-alanine of 3.2 and 0.44 mM, respectively, whereas there was no significant difference in their K(m) values for ATP and UDP-N-acetylmuramic acid, suggesting the specific alteration of L-alanine recognition in MurCH343Y protein. In a synthetic medium that excluded L-alanine, S. aureus murCH343Y mutant cells showed an allele-specific slow growth phenotype that was suppressed by addition of L-alanine. These results suggest that His343 of S. aureus MurC is essential for high-affinity binding to L-alanine both in vitro and in vivo and provide experimental evidence supporting the structural information of MurC ligase.

  19. Mapping biodiversity and setting conservation priorities for SE Queensland's rainforests using DNA barcoding.

    Science.gov (United States)

    Shapcott, Alison; Forster, Paul I; Guymer, Gordon P; McDonald, William J F; Faith, Daniel P; Erickson, David; Kress, W John

    2015-01-01

    Australian rainforests have been fragmented due to past climatic changes and more recently landscape change as a result of clearing for agriculture and urban spread. The subtropical rainforests of South Eastern Queensland are significantly more fragmented than the tropical World Heritage listed northern rainforests and are subject to much greater human population pressures. The Australian rainforest flora is relatively taxonomically rich at the family level, but less so at the species level. Current methods to assess biodiversity based on species numbers fail to adequately capture this richness at higher taxonomic levels. We developed a DNA barcode library for the SE Queensland rainforest flora to support a methodology for biodiversity assessment that incorporates both taxonomic diversity and phylogenetic relationships. We placed our SE Queensland phylogeny based on a three marker DNA barcode within a larger international rainforest barcode library and used this to calculate phylogenetic diversity (PD). We compared phylodiversity measures, species composition and richness and ecosystem diversity of the SE Queensland rainforest estate to identify which bio subregions contain the greatest rainforest biodiversity, subregion relationships and their level of protection. We identified areas of highest conservation priority. Diversity was not correlated with rainforest area in SE Queensland subregions but PD was correlated with both the percent of the subregion occupied by rainforest and the diversity of regional ecosystems (RE) present. The patterns of species diversity and phylogenetic diversity suggest a strong influence of historical biogeography. Some subregions contain significantly more PD than expected by chance, consistent with the concept of refugia, while others were significantly phylogenetically clustered, consistent with recent range expansions.
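
    The abstract does not spell out how PD was computed. A common definition (Faith's PD) is the total branch length of the subtree that connects a set of taxa, here taken to include the path down to the root. The sketch below works through that definition on a toy tree; the tree encoding (a child-to-parent dictionary), the function name and the branch lengths are invented for illustration and are not taken from the study.

```python
def phylogenetic_diversity(parents, taxa):
    """Sum branch lengths on the paths from each taxon to the root,
    counting every branch once (rooted Faith-style PD).

    parents maps each non-root node to (parent_node, branch_length).
    """
    counted = set()
    total = 0.0
    for taxon in taxa:
        node = taxon
        # Walk rootward until reaching the root or an already counted branch.
        while node in parents and node not in counted:
            parent, length = parents[node]
            total += length
            counted.add(node)
            node = parent
    return total

# Hypothetical four-node tree: ((A:1.0, B:2.0)n1:0.5, C:3.0)root
toy_tree = {
    "A": ("n1", 1.0),
    "B": ("n1", 2.0),
    "n1": ("root", 0.5),
    "C": ("root", 3.0),
}
```

    On this toy tree, the sister taxa A and B share the internal branch n1, so their joint PD (3.5) is smaller than the sum of their individual values (1.5 + 2.5); this is how the measure rewards assemblages made up of distantly related lineages.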

  20. Mapping Biodiversity and Setting Conservation Priorities for SE Queensland’s Rainforests Using DNA Barcoding

    Science.gov (United States)

    Shapcott, Alison; Forster, Paul I.; Guymer, Gordon P.; McDonald, William J. F.; Faith, Daniel P.; Erickson, David; Kress, W. John

    2015-01-01

    Australian rainforests have been fragmented due to past climatic changes and more recently landscape change as a result of clearing for agriculture and urban spread. The subtropical rainforests of South Eastern Queensland are significantly more fragmented than the tropical World Heritage listed northern rainforests and are subject to much greater human population pressures. The Australian rainforest flora is relatively taxonomically rich at the family level, but less so at the species level. Current methods to assess biodiversity based on species numbers fail to adequately capture this richness at higher taxonomic levels. We developed a DNA barcode library for the SE Queensland rainforest flora to support a methodology for biodiversity assessment that incorporates both taxonomic diversity and phylogenetic relationships. We placed our SE Queensland phylogeny based on a three marker DNA barcode within a larger international rainforest barcode library and used this to calculate phylogenetic diversity (PD). We compared phylodiversity measures, species composition and richness and ecosystem diversity of the SE Queensland rainforest estate to identify which bio subregions contain the greatest rainforest biodiversity, subregion relationships and their level of protection. We identified areas of highest conservation priority. Diversity was not correlated with rainforest area in SE Queensland subregions but PD was correlated with both the percent of the subregion occupied by rainforest and the diversity of regional ecosystems (RE) present. The patterns of species diversity and phylogenetic diversity suggest a strong influence of historical biogeography. Some subregions contain significantly more PD than expected by chance, consistent with the concept of refugia, while others were significantly phylogenetically clustered, consistent with recent range expansions. PMID:25803607

  1. Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

    Science.gov (United States)

    Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

    2012-01-01

    Previously, we identified 8-bp-long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1 bp to 30 bp (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235
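
    The two counting steps mentioned in this abstract, tallying split 8-mers of the form X4-N(gap)-X4 and counting genomic occurrences of an 11-mer, can be sketched as follows. This is a minimal illustration rather than the authors' pipeline: the function names are hypothetical, the input is assumed to be an uppercase A/C/G/T string, and counting on both strands is an assumption of this example.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def count_both_strands(sequence, motif):
    """Count (possibly overlapping) motif occurrences on either strand."""
    targets = {motif, reverse_complement(motif)}
    k = len(motif)
    return sum(sequence[i:i + k] in targets
               for i in range(len(sequence) - k + 1))

def split_8mer_counts(sequence, min_gap=1, max_gap=30):
    """Tally every (left 4-mer, gap length, right 4-mer) in the sequence."""
    counts = Counter()
    for gap in range(min_gap, max_gap + 1):
        span = 4 + gap + 4  # total footprint of X4-N(gap)-X4
        for i in range(len(sequence) - span + 1):
            left = sequence[i:i + 4]
            right = sequence[i + 4 + gap:i + span]
            counts[(left, gap, right)] += 1
    return counts
```

    Applied to promoter sequences, the keys of `split_8mer_counts` with unusually high counts at one particular gap length would be the candidates for TFBS pairs at a fixed spacing; deciding what counts as unusually high needs a background model that this sketch leaves out.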

  2. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Armadillo (ARM) repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  3. Memetic algorithms for de novo motif-finding in biomedical sequences.

    Science.gov (United States)

    Bi, Chengpeng

    2012-09-01

    The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies are employed in the algorithm design, not only to explore the multiple sequence local alignment space efficiently, but also to uncover the molecular signals effectively. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. The simulation shows that MaMotif is the most time-efficient algorithm compared with others, that is, it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm compares favorably with or is superior to other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary micro

  4. Semi-conservative synthesis of DNA in UV-sensitive mutant cells of Chinese hamster after UV-irradiation

    International Nuclear Information System (INIS)

    Vikhanskaya, F.L.; Khrebtukova, I.A.; Manuilova, E.S.

    1985-01-01

    A study was made of the rate of semi-conservative DNA synthesis in asynchronous UV-resistant (clone V79) and UV-sensitive clones (VII and XII) of Chinese hamster cells after UV-irradiation. In all 3 clones studied, UV-irradiation (5-30 J/m²) induced a decrease in the rate of DNA synthesis during the subsequent 1-2 h. In the resistant clone (V79) recovery of DNA synthesis rate started after the first 2 h post-irradiation (5 J/m²) and by the 3rd hour reached its maximum value, which constituted 70% of that observed in control, non-irradiated cells. The UV-sensitive mutant clones VII and XII showed no recovery in the rate of DNA synthesis during 6-7 h post-irradiation. The results obtained show that the survival of cells is correlated with the ability of DNA synthesis to recover after UV-irradiation in 3 clones studied. The observed recovery of UV-inhibited DNA synthesis in mutant clones may be due to certain defects in DNA repair. (orig.)

  5. Unique C. elegans telomeric overhang structures reveal the evolutionarily conserved properties of telomeric DNA

    Czech Academy of Sciences Publication Activity Database

    Školáková, Petra; Foldynová-Trantírková, Silvie; Bednářová, Klára; Fiala, R.; Vorlíčková, Michaela; Trantírek, L.

    2015-01-01

    Vol. 43, No. 9 (2015), pp. 4733-4745. ISSN 0305-1048. R&D Projects: GA ČR(CZ) GA13-28310S; GA ČR(CZ) GAP205/12/0466. Institutional support: RVO:68081707; RVO:60077344. Keywords: NUCLEASE HYPERSENSITIVE ELEMENT * G-QUADRUPLEX STRUCTURES * I-MOTIF. Subject RIV: BO - Biophysics. Impact factor: 9.202, year: 2015

  6. Algal MIPs, high diversity and conserved motifs.

    Science.gov (United States)

    Anderberg, Hanna I; Danielson, Jonas Å H; Johanson, Urban

    2011-04-21

    Major intrinsic proteins (MIPs), also named aquaporins, form channels facilitating the passive transport of water and other small polar molecules across membranes. MIPs are particularly abundant and diverse in terrestrial plants but little is known about their evolutionary history. In an attempt to investigate the origin of the plant MIP subfamilies, genomes of chlorophyte algae, the sister group of charophyte algae and land plants, were searched for MIP encoding genes. A total of 22 MIPs were identified in the nine analysed genomes and phylogenetic analyses classified them into seven subfamilies. Two of these, Plasma membrane Intrinsic Proteins (PIPs) and GlpF-like Intrinsic Proteins (GIPs), are also present in land plants and divergence dating supports a common origin of these algal and land plant MIPs, predating the evolution of terrestrial plants. The subfamilies unique to algae were named MIPA to MIPE to facilitate the use of a common nomenclature for plant MIPs reflecting phylogenetically stable groups. All of the investigated genomes contained at least one MIP gene but only a few species encoded MIPs belonging to more than one subfamily. Our results suggest that at least two of the seven subfamilies found in land plants were present already in an algal ancestor. The total variation of MIPs and the number of different subfamilies in chlorophyte algae is likely to be even higher than that found in land plants. Our analyses indicate that genetic exchanges between several of the algal subfamilies have occurred. The PIP1 and PIP2 groups and the Ca2+ gating appear to be specific to land plants whereas the pH gating is a more ancient characteristic shared by all PIPs. Further studies are needed to discern the function of the algal-specific subfamilies MIPA-E and to fully understand the evolutionary relationship of algal and terrestrial plant MIPs.

  7. Algal MIPs, high diversity and conserved motifs

    Directory of Open Access Journals (Sweden)

    Johanson Urban

    2011-04-01

    Background Major intrinsic proteins (MIPs), also named aquaporins, form channels facilitating the passive transport of water and other small polar molecules across membranes. MIPs are particularly abundant and diverse in terrestrial plants but little is known about their evolutionary history. In an attempt to investigate the origin of the plant MIP subfamilies, genomes of chlorophyte algae, the sister group of charophyte algae and land plants, were searched for MIP encoding genes. Results A total of 22 MIPs were identified in the nine analysed genomes and phylogenetic analyses classified them into seven subfamilies. Two of these, Plasma membrane Intrinsic Proteins (PIPs) and GlpF-like Intrinsic Proteins (GIPs), are also present in land plants and divergence dating supports a common origin of these algal and land plant MIPs, predating the evolution of terrestrial plants. The subfamilies unique to algae were named MIPA to MIPE to facilitate the use of a common nomenclature for plant MIPs reflecting phylogenetically stable groups. All of the investigated genomes contained at least one MIP gene but only a few species encoded MIPs belonging to more than one subfamily. Conclusions Our results suggest that at least two of the seven subfamilies found in land plants were present already in an algal ancestor. The total variation of MIPs and the number of different subfamilies in chlorophyte algae is likely to be even higher than that found in land plants. Our analyses indicate that genetic exchanges between several of the algal subfamilies have occurred. The PIP1 and PIP2 groups and the Ca2+ gating appear to be specific to land plants whereas the pH gating is a more ancient characteristic shared by all PIPs. Further studies are needed to discern the function of the algal specific subfamilies MIPA-E and to fully understand the evolutionary relationship of algal and terrestrial plant MIPs.

  8. Aberrant DNA Methylation in Human iPSCs Associates with MYC-Binding Motifs in a Clone-Specific Manner Independent of Genetics.

    Science.gov (United States)

    Panopoulos, Athanasia D; Smith, Erin N; Arias, Angelo D; Shepard, Peter J; Hishida, Yuriko; Modesto, Veronica; Diffenderfer, Kenneth E; Conner, Clay; Biggs, William; Sandoval, Efren; D'Antonio-Chronowska, Agnieszka; Berggren, W Travis; Izpisua Belmonte, Juan Carlos; Frazer, Kelly A

    2017-04-06

    Induced pluripotent stem cells (iPSCs) show variable methylation patterns between lines, some of which reflect aberrant differences relative to embryonic stem cells (ESCs). To examine whether this aberrant methylation results from genetic variation or non-genetic mechanisms, we generated human iPSCs from monozygotic twins to investigate how genetic background, clone, and passage number contribute. We found that aberrantly methylated CpGs are enriched in regulatory regions associated with MYC protein motifs and affect gene expression. We classified differentially methylated CpGs as being associated with genetic and/or non-genetic factors (clone and passage), and we found that aberrant methylation preferentially occurs at CpGs associated with clone-specific effects. We further found that clone-specific effects play a strong role in recurrent aberrant methylation at specific CpG sites across different studies. Our results argue that a non-genetic biological mechanism underlies aberrant methylation in iPSCs and that it is likely based on a probabilistic process involving MYC that takes place during or shortly after reprogramming. Published by Elsevier Inc.

  9. Synthesis of a Hoechst 32258 Analogue Amino Acid Building Block for Direct Incorporation of a Fluorescent High-Affinity DNA Binding Motif into Peptides

    DEFF Research Database (Denmark)

    Harrit, Niels; Behrens, Carsten; Nielsen, P. E.

    2001-01-01

    The synthesis of a new versatile "Hoechst 33258-like" Boc-protected amino acid building block for peptide synthesis is described. It is demonstrated that this new ligand is an effective mimic of Hoechst 33258 in terms of DNA affinity and sequence specificity. Furthermore, this minor groove binder...

  10. DNA barcoding for conservation, seed banking and ecological restoration of Acacia in the Midwest of Western Australia.

    Science.gov (United States)

    Nevill, Paul G; Wallace, Mark J; Miller, Joseph T; Krauss, Siegfried L

    2013-11-01

    We used DNA barcoding to address an important conservation issue in the Midwest of Western Australia, working on Australia's largest genus of flowering plant. We tested whether or not currently recommended plant DNA barcoding regions (matK and rbcL) were able to discriminate Acacia taxa of varying phylogenetic distances, and ultimately identify an ambiguously labelled seed collection from a mine-site restoration project. Although matK successfully identified the unknown seed as the rare and conservation priority listed A. karina, and was able to resolve six of the eleven study species, this region was difficult to amplify and sequence. In contrast, rbcL was straightforward to recover and align, but could not determine the origin of the seed and only resolved 3 of the 11 species. Other chloroplast regions (rpl32-trnL, psbA-trnH, trnL-F and trnK) had mixed success resolving the studied taxa. In general, species were better resolved in multilocus data sets compared to single-locus data sets. We recommend using the formal barcoding regions supplemented with data from other plastid regions, particularly rpl32-trnL, for barcoding in Acacia. Our study demonstrates the novel use of DNA barcoding for seed identification and illustrates the practical potential of DNA barcoding for the growing discipline of restoration ecology. © 2013 John Wiley & Sons Ltd.

  11. Import of desired nucleic acid sequences using addressing motif of mitochondrial ribosomal 5S-rRNA for fluorescent in vivo hybridization of mitochondrial DNA and RNA.

    Science.gov (United States)

    Zelenka, Jaroslav; Alán, Lukáš; Jabůrek, Martin; Ježek, Petr

    2014-04-01

    Based on the matrix-addressing sequence of mitochondrial ribosomal 5S-rRNA (termed MAM), which is naturally imported into mitochondria, we have constructed an import system for in vivo targeting of mitochondrial DNA (mtDNA) or mt-mRNA, in order to provide fluorescence hybridization of the desired sequences. Thus DNA oligonucleotides were constructed, containing the 5'-flanked T7 RNA polymerase promoter. After in vitro transcription and fluorescent labeling with Alexa Fluor® 488 or 647 dye, we obtained the fluorescent "L-ND5 probe" containing MAM and exemplar cargo, i.e., annealing sequence to a short portion of ND5 mRNA and to the light-strand mtDNA complementary to the heavy strand nd5 mt gene (5'-end 21 base pair sequence). For mitochondrial in vivo fluorescent hybridization, HepG2 cells were treated with dequalinium micelles, containing the fluorescent probes, bringing the probes proximally to the mitochondrial outer membrane and to the natural import system. A verification of import into the mitochondrial matrix of cultured HepG2 cells was provided by confocal microscopy colocalizations. Transfections using lipofectamine or probes without 5S-rRNA addressing MAM sequence or with MAM only were ineffective. Alternatively, the same DNA oligonucleotides with 5'-CACC overhang (substituting T7 promoter) were transcribed from the tetracycline-inducible pENTRH1/TO vector in human embryonic kidney T-REx®-293 cells, while mitochondrial matrix localization after import of the resulting unlabeled RNA was detected by PCR. The MAM-containing probe was then enriched by three orders of magnitude over the natural ND5 mRNA in the mitochondrial matrix. In conclusion, we present a proof-of-principle for mitochondrial in vivo hybridization and mitochondrial nucleic acid import.

  12. Identity and functions of CxxC-derived motifs.

    Science.gov (United States)

    Fomenko, Dmitri E; Gladyshev, Vadim N

    2003-09-30

    Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.
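    The motif classes named in this abstract (CxxC, CxxS, CxxT, TxxC, SxxC) can be located in a protein sequence with a simple pattern scan. The following is a minimal sketch only, not the authors' search procedure: it ignores the downstream alpha-helix requirement described above, and the function name `find_motifs` and the toy sequence are hypothetical.

```python
import re

# Patterns for the motif classes named in the abstract; "x" is any residue.
MOTIFS = {
    "CxxC": "C..C",
    "CxxS": "C..S",
    "CxxT": "C..T",
    "TxxC": "T..C",
    "SxxC": "S..C",
}


def find_motifs(protein):
    """Return {motif_name: [0-based start positions]}, overlapping hits included."""
    sequence = protein.upper()
    hits = {}
    for name, pattern in MOTIFS.items():
        # A lookahead reports overlapping matches that a plain search would skip.
        regex = re.compile("(?=(" + pattern + "))")
        hits[name] = [match.start() for match in regex.finditer(sequence)]
    return hits


# Hypothetical toy sequence, not a real protein.
example = "MKCGPCAATWCPHSLLSAACQT"
for motif_name, positions in find_motifs(example).items():
    print(motif_name, positions)
```

    A realistic screen would add a secondary-structure prediction step to keep only hits followed by a helix, as the abstract describes.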

  13. Conserved number of U2 snDNA sites in Piabina argentea, Piabarchus stramineus and two Bryconamericus species (Characidae, Stevardiinae)

    Directory of Open Access Journals (Sweden)

    Diovani Piscor

    2018-03-01

    The chromosomal location of 5S rRNA and U2 snRNA genes of Piabina argentea, Piabarchus stramineus and two Bryconamericus species from two different Brazilian river basins were investigated, in order to contribute to the understanding of evolutionary characteristics of these repetitive DNAs in the subfamily Stevardiinae. The diploid chromosome number was 2n = 52 for Bryconamericus cf. iheringii, Bryconamericus turiuba, Piabarchus stramineus and Piabina argentea. The 5S rDNA clusters were located on one chromosome pair in P. stramineus and B. cf. iheringii, and on two pairs in B. turiuba and P. argentea. The U2 snDNA clusters were located on one pair in all species. Two-color FISH experiments showed that the co-localization between 5S rDNA and U2 snDNA in P. stramineus can represent a marker for this species. Thus, the present study demonstrated that the number of U2 snDNA clusters observed for the four species was conserved, but particular characteristics can be found in the genome of each species.

  14. DNA analysis indicates that Asian elephants are native to Borneo and are therefore a high priority for conservation.

    Directory of Open Access Journals (Sweden)

    Prithiviraj Fernando

    2003-10-01

    The origin of Borneo's elephants is controversial. Two competing hypotheses argue that they are either indigenous, tracing back to the Pleistocene, or were introduced, descending from elephants imported in the 16th-18th centuries. Taxonomically, they have either been classified as a unique subspecies or placed under the Indian or Sumatran subspecies. If shown to be a unique indigenous population, this would extend the natural species range of the Asian elephant by 1300 km, and therefore Borneo elephants would have much greater conservation importance than if they were a feral population. We compared DNA of Borneo elephants to that of elephants from across the range of the Asian elephant, using a fragment of mitochondrial DNA, including part of the hypervariable d-loop, and five autosomal microsatellite loci. We find that Borneo's elephants are genetically distinct, with molecular divergence indicative of a Pleistocene colonisation of Borneo and subsequent isolation. We reject the hypothesis that Borneo's elephants were introduced. The genetic divergence of Borneo elephants warrants their recognition as a separate evolutionarily significant unit. Thus, interbreeding Borneo elephants with those from other populations would be contraindicated in ex situ conservation, and their genetic distinctiveness makes them one of the highest priority populations for Asian elephant conservation.

  15. Crystallization and preliminary X-ray diffraction analysis of motif N from Saccharomyces cerevisiae Dbf4

    International Nuclear Information System (INIS)

    Matthews, Lindsay A.; Duong, Andrew; Prasad, Ajai A.; Duncker, Bernard P.; Guarné, Alba

    2009-01-01

    The Cdc7–Dbf4 complex plays an instrumental role in the initiation of DNA replication and is a target of replication-checkpoint responses in Saccharomyces cerevisiae. Cdc7 is a conserved serine/threonine kinase whose activity depends on association with its regulatory subunit, Dbf4. A conserved sequence near the N-terminus of Dbf4 (motif N) is necessary for the interaction of Cdc7–Dbf4 with the checkpoint kinase Rad53. To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. A complete native data set was collected at 100 K from crystals that diffracted X-rays to 2.75 Å resolution and structure determination is currently under way.

  16. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

    2008-01-01

    . Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...

  17. Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation

    Science.gov (United States)

    Dubbed "Tom's T" by Dhruba Chattoraj, the unusually conserved thymine at position +7 in bacteriophage P1 plasmid RepA DNA binding sites rises above repressor and acceptor sequence logos. The T appears to represent base flipping prior to helix opening in this DNA replication initiation protein.
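    Conservation in a sequence logo is measured in bits per position. As a rough illustration, the sketch below computes the uncorrected information content of each column of a set of aligned DNA sites as 2 minus the Shannon entropy. It omits the small-sample correction used in published logos, and the function name `information_content` and the four toy sites are hypothetical, not RepA binding-site data.

```python
import math
from collections import Counter


def information_content(sites):
    """Per-position information (bits) for equal-length aligned DNA sites.

    Uses 2 - H, where H is the Shannon entropy of the observed base
    frequencies; no small-sample correction is applied.
    """
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    bits = []
    for column in zip(*sites):
        counts = Counter(column)
        total = len(column)
        entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
        bits.append(2.0 - entropy)
    return bits


# Hypothetical toy alignment: two invariant columns, one partly
# conserved column and one fully variable column.
toy_sites = ["ACGT", "ACGA", "ACTC", "ACAG"]
print(information_content(toy_sites))
```

    An invariant column reaches the 2-bit maximum for DNA, while a column with all four bases at equal frequency carries 0 bits.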

  18. Historic DNA for taxonomy and conservation: A case-study of a century-old Hawaiian hawkmoth type (Lepidoptera: Sphingidae).

    Directory of Open Access Journals (Sweden)

    Anna K Hundsdoerfer

    Analysing historic DNA from museum specimens offers the unique opportunity to study the molecular systematics and phylogenetics of rare and possibly extinct taxa. In the Hawaiian fauna, the hawkmoth, Hyles calida calida, occurs on several of the main islands and is quite frequent, whereas Hyles c. hawaiiensis is restricted to the Island of Hawaii where it appears to be very rare. Analysis of mitochondrial DNA sequences shows that Hyles c. hawaiiensis differs from the nominotypical subspecies by an average p-distance of 2.8%, which is of a similar order of magnitude to that found between other species of Hyles, suggesting that Hyles c. hawaiiensis should perhaps be awarded species status, although more data are required for a formal taxonomic revision. Given the rarity of this taxon, these analyses should be undertaken urgently so that conservation measures can be implemented before it becomes extinct.
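    The p-distance quoted above is the proportion of aligned sites at which two sequences differ. A minimal sketch of that calculation follows. Skipping gap and N positions is an assumption made here (the abstract does not say how such sites were treated), and the function name and toy sequences are hypothetical.

```python
def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or an ambiguous 'N'
    are skipped (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Hypothetical toy alignment: 1 difference over 10 comparable sites.
print(p_distance("ACGTACGTAC", "ACGTTCGTAC"))
```

    An average p-distance such as the 2.8% reported here would be the mean of this value over all pairwise comparisons between the two taxa.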

  19. The Use of DNA Barcoding in Identification and Conservation of Rosewood (Dalbergia spp.)

    DEFF Research Database (Denmark)

    Hartvig, Ida; Czako, Mihaly; Kjaer, Erik Dahl

    2015-01-01

    The genus Dalbergia contains many valuable timber species threatened by illegal logging and deforestation, but knowledge on distributions and threats is often limited and accurate species identification difficult. The aim of this study was to apply DNA barcoding methods to support conservation efforts of Dalbergia species in Indochina. We used the recommended rbcL, matK and ITS barcoding markers on 95 samples covering 31 species of Dalbergia, and tested their discrimination ability with both traditional distance-based as well as different model-based machine learning methods. We specifically...

  20. Identification of group specific motifs in Beta-lactamase family of proteins

    Directory of Open Access Journals (Sweden)

    Saxena Akansha

    2009-12-01

    Background Beta-lactamases are one of the most serious threats to public health. In order to combat this threat we need to study the molecular and functional diversity of these enzymes and identify signatures specific to these enzymes. These signatures will enable us to develop inhibitors and diagnostic probes specific to lactamases. The existing classification of beta-lactamases was developed nearly 30 years ago when few lactamases were available. The DLact database contains more than 2000 beta-lactamases, which can be used to study the molecular diversity and to identify signatures specific to this family. Methods A set of 2020 beta-lactamase proteins available in the DLact database http://59.160.102.202/DLact were classified using graph-based clustering of Best Bi-Directional Hits. Non-redundant (> 90 percent identical) protein sequences from each group were aligned using T-Coffee and annotated using information available in literature. Motifs specific to each group were predicted using the PRATT program. Results The graph-based classification of beta-lactamase proteins resulted in the formation of six groups (four major groups containing 191, 726, 774 and 73 proteins, and two minor groups containing 50 and 8 proteins). Based on the information available in literature, we found that each of the four major groups corresponds to one of the four classes proposed by Ambler. The two minor groups were novel and do not contain molecular signatures of beta-lactamase proteins reported in literature. The group-specific motifs showed high sensitivity (> 70%) and very high specificity (> 90%). The motifs from three groups (corresponding to class A, C and D) had a high level of conservation at DNA as well as protein level whereas the motifs from the fourth group (corresponding to class B) showed conservation at only protein level.
Conclusion The graph-based classification of beta-lactamase proteins corresponds with the classification proposed by Ambler, thus there is
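    The sensitivity and specificity figures reported for the group-specific motifs in this record can be read as follows: sensitivity is the fraction of group members that contain the motif, and specificity is the fraction of non-members that lack it. The sketch below illustrates the arithmetic with a regular expression standing in for a motif. The pattern, the function name and the toy sequences are all hypothetical and are not taken from the DLact data or from PRATT output.

```python
import re


def motif_sensitivity_specificity(pattern, group_seqs, other_seqs):
    """Return (sensitivity, specificity) of a regex motif for one group.

    sensitivity = TP / (TP + FN) over sequences inside the group
    specificity = TN / (TN + FP) over sequences outside the group
    """
    if not group_seqs or not other_seqs:
        raise ValueError("both sequence sets must be non-empty")
    regex = re.compile(pattern)
    true_pos = sum(1 for seq in group_seqs if regex.search(seq))
    false_pos = sum(1 for seq in other_seqs if regex.search(seq))
    true_neg = len(other_seqs) - false_pos
    return true_pos / len(group_seqs), true_neg / len(other_seqs)


# Hypothetical toy data: 3 of 4 group members match, 1 of 5 outsiders matches.
group = ["AASTAKLL", "MMSVFKAA", "GGSTTAAA", "PPSLLKGG"]
others = ["AAAAGGGG", "SPPKTTTT", "LLLLMMMM", "QQQQWWWW", "EEEERRRR"]
print(motif_sensitivity_specificity("S..K", group, others))
```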

  1. [Personal motif in art].

    Science.gov (United States)

    Gerevich, József

    2015-01-01

    One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.

  2. Antibacterial small molecules targeting the conserved TOPRIM domain of DNA gyrase.

    Directory of Open Access Journals (Sweden)

    Scott S Walker

    To combat the threat of antibiotic-resistant Gram-negative bacteria, novel agents that circumvent established resistance mechanisms are urgently needed. Our approach was to focus first on identifying bioactive small molecules followed by chemical lead prioritization and target identification. Within this annotated library of bioactives, we identified a small molecule with activity against efflux-deficient Escherichia coli and other sensitized Gram-negatives. Further studies suggested that this compound inhibited DNA replication and selection for resistance identified mutations in a subunit of E. coli DNA gyrase, a type II topoisomerase. Our initial compound demonstrated weak inhibition of DNA gyrase activity while optimized compounds demonstrated significantly improved inhibition of E. coli and Pseudomonas aeruginosa DNA gyrase and caused cleaved complex stabilization, a hallmark of certain bactericidal DNA gyrase inhibitors. Amino acid substitutions conferring resistance to this new class of DNA gyrase inhibitors reside exclusively in the TOPRIM domain of GyrB and are not associated with resistance to the fluoroquinolones, suggesting a novel binding site for a gyrase inhibitor.

  3. Mitochondrial DNA haplotype distribution patterns in Pinus ponderosa (Pinaceae): range-wide evolutionary history and implications for conservation.

    Science.gov (United States)

    Potter, Kevin M; Hipkins, Valerie D; Mahalovich, Mary F; Means, Robert E

    2013-08-01

    Ponderosa pine (Pinus ponderosa Douglas ex P. Lawson & C. Lawson) exhibits complicated patterns of morphological and genetic variation across its range in western North America. This study aims to clarify P. ponderosa evolutionary history and phylogeography using a highly polymorphic mitochondrial DNA marker, with results offering insights into how geographical and climatological processes drove the modern evolutionary structure of tree species in the region. We amplified the mtDNA nad1 second intron minisatellite region for 3,100 trees representing 104 populations, and sequenced all length variants. We estimated population-level haplotypic diversity and determined diversity partitioning among varieties, races and populations. After aligning sequences of minisatellite repeat motifs, we evaluated evolutionary relationships among haplotypes. The geographical structuring of the 10 haplotypes corresponded with division between Pacific and Rocky Mountain varieties. Pacific haplotypes clustered with high bootstrap support, and appear to have descended from Rocky Mountain haplotypes. A greater proportion of diversity was partitioned between Rocky Mountain races than between Pacific races. Areas of highest haplotypic diversity were the southern Sierra Nevada mountain range in California, northwestern California, and southern Nevada. Pinus ponderosa haplotype distribution patterns suggest a complex phylogeographic history not revealed by other genetic and morphological data, or by the sparse paleoecological record. The results appear consistent with long-term divergence between the Pacific and Rocky Mountain varieties, along with more recent divergences not well-associated with race. Pleistocene refugia may have existed in areas of high haplotypic diversity, as well as the Great Basin, Southwestern United States/northern Mexico, and the High Plains.

  4. Missense mutations located in structural p53 DNA-binding motifs are associated with extremely poor survival in chronic lymphocytic leukemia.

    Science.gov (United States)

    Trbusek, Martin; Smardova, Jana; Malcikova, Jitka; Sebejova, Ludmila; Dobes, Petr; Svitakova, Miluse; Vranova, Vladimira; Mraz, Marek; Francova, Hana Skuhrova; Doubek, Michael; Brychtova, Yvona; Kuglik, Petr; Pospisilova, Sarka; Mayer, Jiri

    2011-07-01

    There is a distinct connection between TP53 defects and poor prognosis in chronic lymphocytic leukemia (CLL). It remains unclear whether patients harboring TP53 mutations represent a homogenous prognostic group. We evaluated the survival of patients with CLL and p53 defects identified at our institution by p53 yeast functional assay and complementary interphase fluorescence in situ hybridization analysis detecting del(17p) from 2003 to 2010. A defect of the TP53 gene was identified in 100 of 550 patients. p53 mutations were strongly associated with the deletion of 17p and the unmutated IgVH locus (both P DBMs), structurally well-defined parts of the DNA-binding domain, manifested a clearly shorter median survival (12 months) compared with patients having missense mutations outside DBMs (41 months; P = .002) or nonmissense alterations (36 months; P = .005). The difference in survival was similar in the analysis limited to patients harboring mutation accompanied by del(17p) and was also confirmed in a subgroup harboring TP53 defect at diagnosis. The patients with p53 DBMs mutation (at diagnosis) also manifested a short median time to first therapy (TTFT; 1 month). The substantially worse survival and the short TTFT suggest a strong mutated p53 gain-of-function phenotype in patients with CLL with DBMs mutations. The impact of p53 DBMs mutations on prognosis and response to therapy should be analyzed in investigative clinical trials.

  5. SOXE transcription factors form selective dimers on non-compact DNA motifs through multifaceted interactions between dimerization and high-mobility group domains.

    Science.gov (United States)

    Huang, Yong-Heng; Jankowski, Aleksander; Cheah, Kathryn S E; Prabhakar, Shyam; Jauch, Ralf

    2015-05-27

    The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM : HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SOXE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins.

  6. SOXE transcription factors form selective dimers on non-compact DNA motifs through multifaceted interactions between dimerization and high-mobility group domains

    Science.gov (United States)

    Huang, Yong-Heng; Jankowski, Aleksander; Cheah, Kathryn S. E.; Prabhakar, Shyam; Jauch, Ralf

    2015-01-01

    The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM : HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SOXE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins. PMID:26013289

  7. Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

    Science.gov (United States)

    Wang, Ying; Ding, Jun; Daniell, Henry; Hu, Haiyan; Li, Xiaoman

    2012-09-01

    Chloroplasts play critical roles in land plant cells. Despite their importance and the availability of at least 200 sequenced chloroplast genomes, the number of known DNA regulatory sequences in chloroplast genomes is limited. In this paper, we designed computational methods to systematically study putative DNA regulatory sequences in intergenic regions near chloroplast genes in seven plant species and in promoter sequences of nuclear genes in Arabidopsis and rice. We found that -35/-10 elements alone cannot explain the transcriptional regulation of chloroplast genes. We also concluded that there are unlikely to be motifs shared by the intergenic sequences of most chloroplast genes, indicating that these genes are regulated differently. Finally and surprisingly, we found that five conserved motifs, each of which occurs in no more than six chloroplast intergenic sequences, are significantly shared by promoters of nuclear genes encoding chloroplast proteins. By integrating information from gene function annotation, protein subcellular localization analyses, protein-protein interaction data, and gene expression data, we further showed support for the functionality of these conserved motifs. Our study implies the existence of unknown nuclear-encoded transcription factors that regulate both chloroplast genes and nuclear genes encoding chloroplast proteins, which sheds light on the transcriptional regulation of chloroplast genes.

  8. Environmental DNA for freshwater fish monitoring: insights for conservation within a protected area

    Directory of Open Access Journals (Sweden)

    Sara Fernandez

    2018-03-01

    Background Many fish species have been introduced in wild ecosystems around the world to provide food or leisure, deliberately or from farm escapes. Some of those introductions have had large ecological effects. The North American native rainbow trout (Oncorhynchus mykiss Walbaum, 1792) is one of the most widely farmed fish species in the world. It was first introduced in Spain in the late 19th century for sport fishing (Elvira 1995) and nowadays is used there for both fishing and aquaculture. On the other hand, the European native brown trout (Salmo trutta L.) is catalogued as vulnerable in Spain. Detecting native and invasive fish populations in ecosystem monitoring is crucial, but it may be difficult from conventional sampling methods such as electrofishing. These techniques encompass some mortality, thus are not adequate for some ecosystems such as protected areas. Environmental DNA (eDNA) analysis is a sensitive and non-invasive method that can be especially useful for rare and low-density species detection and inventory in water bodies. Methods In this study we employed two eDNA based methods (qPCR and nested PCR-RFLP) to detect salmonid species from mountain streams within a protected area, The Biosphere Reserve and Natural Park of Redes (Upper Nalón Basin, Asturias, Northern Spain), where brown trout is the only native salmonid. We also measured some habitat variables to see how appropriate for salmonids the area is. The sampling area is located upstream of impassable dams and contains one rainbow trout fish farm. Results Employing qPCR methodology, brown trout eDNA was detected in all the nine sampling sites surveyed, while nested PCR-RFLP method failed to detect it in two sampling points. Rainbow trout eDNA was detected with both techniques at all sites in the Nalón River (n1, n2 and n3). Salmonid habitat units and water quality were high from the area studied. Discussion In this study, a high quantity of rainbow trout eDNA was found

  9. Environmental DNA for freshwater fish monitoring: insights for conservation within a protected area.

    Science.gov (United States)

    Fernandez, Sara; Sandin, Miguel M; Beaulieu, Paul G; Clusa, Laura; Martinez, Jose L; Ardura, Alba; García-Vázquez, Eva

    2018-01-01

    Many fish species have been introduced in wild ecosystems around the world to provide food or leisure, deliberately or from farm escapes. Some of those introductions have had large ecological effects. The north American native rainbow trout (Oncorhynchus mykiss Walbaum, 1792) is one of the most widely farmed fish species in the world. It was first introduced in Spain in the late 19th century for sport fishing (Elvira 1995) and nowadays is used there for both fishing and aquaculture. On the other hand, the European native brown trout (Salmo trutta L.) is catalogued as vulnerable in Spain. Detecting native and invasive fish populations in ecosystem monitoring is crucial, but it may be difficult from conventional sampling methods such as electrofishing. These techniques encompass some mortality, thus are not adequate for some ecosystems as the case of protected areas. Environmental DNA (eDNA) analysis is a sensitive and non-invasive method that can be especially useful for rare and low-density species detection and inventory in water bodies. In this study we employed two eDNA based methods (qPCR and nested PCR-RFLP) to detect salmonid species from mountain streams within a protected area, The Biosphere Reserve and Natural Park of Redes (Upper Nalón Basin, Asturias, Northern Spain), where brown trout is the only native salmonid. We also measured some habitat variables to see how appropriate for salmonids the area is. The sampling area is located upstream impassable dams and contains one rainbow trout fish farm. Employing qPCR methodology, brown trout eDNA was detected in all the nine sampling sites surveyed, while nested PCR-RFLP method failed to detect it in two sampling points. Rainbow trout eDNA was detected with both techniques at all sites in the Nalón River (n1, n2 and n3). Salmonid habitat units and water quality were high from the area studied. In this study, a high quantity of rainbow trout eDNA was found upstream and downstream of a fish farm located

  10. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  11. Elucidating the evolutionary conserved DNA-binding specificities of WRKY transcription factors by molecular dynamics and in vitro binding assays

    Science.gov (United States)

    Brand, Luise H.; Fischer, Nina M.; Harter, Klaus; Kohlbacher, Oliver; Wanke, Dierk

    2013-01-01

    WRKY transcription factors constitute a large protein family in plants that is involved in the regulation of developmental processes and responses to biotic or abiotic stimuli. The question arises how stimulus-specific responses are mediated given that the highly conserved WRKY DNA-binding domain (DBD) exclusively recognizes the ‘TTGACY’ W-box consensus. We speculated that the W-box consensus might be more degenerate and yet undetected differences in the W-box consensus of WRKYs of different evolutionary descent exist. The phylogenetic analysis of WRKY DBDs suggests that they evolved from an ancestral group IIc-like WRKY early in the eukaryote lineage. A direct descent of group IIc WRKYs supports a monophyletic origin of all other group II and III WRKYs from group I by loss of an N-terminal DBD. Group I WRKYs are of paraphyletic descent and evolved multiple times independently. By homology modeling, molecular dynamics simulations and in vitro DNA–protein interaction-enzyme-linked immunosorbent assay with AtWRKY50 (IIc), AtWRKY33 (I) and AtWRKY11 (IId) DBDs, we revealed differences in DNA-binding specificities. Our data imply that other components are essentially required besides the W-box-specific binding to DNA to facilitate a stimulus-specific WRKY function. PMID:23975197
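    As an illustration of what recognizing the 'TTGACY' W-box consensus means at the sequence level, the following is a minimal sketch that scans both strands of a DNA string for the consensus. It assumes the standard IUPAC reading of Y as C or T; the function names are hypothetical and are not taken from the study, which used homology modeling, molecular dynamics and in vitro binding assays rather than a pattern scan.

```python
import re

# IUPAC code Y denotes a pyrimidine (C or T), so TTGACY expands to TTGAC[CT].
# The lookahead lets overlapping hits be reported.
W_BOX = re.compile(r"(?=(TTGAC[CT]))")
W_BOX_LEN = 6


def reverse_complement(seq):
    """Return the reverse complement of an uppercase ACGT string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_w_boxes(seq):
    """Return sorted (position, strand, match) tuples for W-box hits.

    Positions are 0-based and always refer to the leftmost base of the
    hit on the forward strand, also for hits on the reverse strand.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in W_BOX.finditer(seq)]
    n = len(seq)
    for m in W_BOX.finditer(reverse_complement(seq)):
        # Convert a start on the reverse strand to forward coordinates.
        hits.append((n - m.start() - W_BOX_LEN, "-", m.group(1)))
    return sorted(hits)
```

    For example, find_w_boxes("AATTGACCGG") reports one forward-strand hit at position 2. A scan like this only finds exact consensus matches; the more degenerate binding preferences discussed in the abstract would need a position weight matrix or similar model.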

  12. The effect of tributyltin chloride on Caenorhabditis elegans germline is mediated by a conserved DNA damage checkpoint pathway.

    Science.gov (United States)

    Cheng, Zhe; Tian, Huimin; Chu, Hongran; Wu, Jianjian; Li, Yingying; Wang, Yanhai

    2014-03-21

    Tributyltin (TBT), one of the environmental pollutants, has been shown to impact the reproduction of animals. However, due to the lack of appropriate animal model, analysis of the affected molecular pathways in germ cells is lagging and has been particularly challenging. In the present study, we investigated the effects of tributyltin chloride (TBTCL) on the nematode Caenorhabditis elegans germline. We show that exposure of C. elegans to TBTCL causes significantly elevated level of sterility and embryonic lethality. TBTCL exposure results in an increased number of meiotic DNA double-strand breaks in germ cells, subsequently leading to activated DNA damage checkpoint. Exposing C. elegans to TBTCL causes dose- and time-dependent germline apoptosis. This apoptotic response was blocked in loss-of-function mutants of hus-1 (op241), mrt-2 (e2663) and p53/cep-1 (gk138), indicating that checkpoints and p53 are essential for mediating TBTCL-induced germ cell apoptosis. Moreover, TBTCL exposure can inhibit germ cell proliferation, which is also mediated by the conserved checkpoint pathway. We thereby propose that TBT exhibits its effects on the germline by inducing DNA damage and impaired maintenance of genomic integrity. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

  13. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

    Science.gov (United States)

    Nagar, Anurag; Hahsler, Michael

    2013-01-01

    Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to
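    The idea of comparing sequence segments through p-mer frequency counts, rather than by alignment, can be sketched as follows. This is a simplified illustration only: the choice of p = 3, the Manhattan distance between count profiles, and the function names are assumptions made here, and the stream-clustering step that builds the quasi-alignments is not shown.

```python
from collections import Counter


def pmer_counts(segment, p=3):
    """Count every overlapping p-mer (substring of length p) in a segment."""
    return Counter(segment[i:i + p] for i in range(len(segment) - p + 1))


def pmer_distance(a, b, p=3):
    """Manhattan distance between the p-mer count profiles of two segments.

    Computing it takes time linear in the segment lengths, whereas exact
    edit distance by dynamic programming takes quadratic time.
    """
    ca, cb = pmer_counts(a, p), pmer_counts(b, p)
    # Counter returns 0 for p-mers that are absent from one profile.
    return sum(abs(ca[k] - cb[k]) for k in ca.keys() | cb.keys())
```

    Identical segments have distance 0, and a single substitution changes at most p of the overlapping p-mers, which is why small profile distances tend to indicate small edit distances.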

  14. Poxvirus uracil-DNA glycosylase-An unusual member of the family I uracil-DNA glycosylases: Poxvirus Uracil-DNA Glycosylase

    Energy Technology Data Exchange (ETDEWEB)

    Schormann, Norbert [Department of Medicine, University of Alabama at Birmingham, Birmingham Alabama 35294; Zhukovskaya, Natalia [Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia Pennsylvania 19104; Bedwell, Gregory [Department of Microbiology, University of Alabama at Birmingham, Birmingham Alabama 35294; Nuth, Manunya [Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia Pennsylvania 19104; Gillilan, Richard [MacCHESS (Macromolecular Diffraction Facility at CHESS) Cornell University, Ithaca New York 14853; Prevelige, Peter E. [Department of Microbiology, University of Alabama at Birmingham, Birmingham Alabama 35294; Ricciardi, Robert P. [Department of Microbiology, School of Dental Medicine, University of Pennsylvania, Philadelphia Pennsylvania 19104; Abramson Cancer Center, School of Medicine, University of Pennsylvania, Philadelphia Pennsylvania 19104; Banerjee, Surajit [Department of Chemistry and Chemical Biology, Cornell University, and NE-CAT Argonne Illinois 60439; Chattopadhyay, Debasish [Department of Medicine, University of Alabama at Birmingham, Birmingham Alabama 35294]

    2016-11-02

    We report that uracil-DNA glycosylases are ubiquitous enzymes, which play a key role repairing damages in DNA and in maintaining genomic integrity by catalyzing the first step in the base excision repair pathway. Within the superfamily of uracil-DNA glycosylases family I enzymes or UNGs are specific for recognizing and removing uracil from DNA. These enzymes feature conserved structural folds, active site residues and use common motifs for DNA binding, uracil recognition and catalysis. Within this family the enzymes of poxviruses are unique and most remarkable in terms of amino acid sequences, characteristic motifs and more importantly for their novel non-enzymatic function in DNA replication. UNG of vaccinia virus, also known as D4, is the most extensively characterized UNG of the poxvirus family. D4 forms an unusual heterodimeric processivity factor by attaching to a poxvirus-specific protein A20, which also binds to the DNA polymerase E9 and recruits other proteins necessary for replication. D4 is thus integrated in the DNA polymerase complex, and its DNA-binding and DNA scanning abilities couple DNA processivity and DNA base excision repair at the replication fork. In conclusion, the adaptations necessary for taking on the new function are reflected in the amino acid sequence and the three-dimensional structure of D4. We provide an overview of the current state of the knowledge on the structure-function relationship of D4.

  15. The ARTT motif and a unified structural understanding of substrate recognition in ADP-ribosylating bacterial toxins and eukaryotic ADP-ribosyltransferases

    Energy Technology Data Exchange (ETDEWEB)

    Han, S.; Tainer, J.A.

    2001-08-01

    ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing an NAD-binding pocket formed by the two perpendicular β-sheet cores has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransferases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT

  16. The KYxxL motif in Rad17 protein is essential for the interaction with the 9–1–1 complex

    Energy Technology Data Exchange (ETDEWEB)

    Fukumoto, Yasunori, E-mail: fukumoto@faculty.chiba-u.jp [Laboratory of Molecular Cell Biology, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba 260-8675 (Japan); Ikeuchi, Masayoshi; Nakayama, Yuji [Department of Biochemistry & Molecular Biology, Kyoto Pharmaceutical University, Kyoto 607-8414 (Japan); Yamaguchi, Naoto, E-mail: nyama@faculty.chiba-u.jp [Laboratory of Molecular Cell Biology, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba 260-8675 (Japan)

    2016-09-02

    ATR-dependent DNA damage checkpoint is the major DNA damage checkpoint against UV irradiation and DNA replication stress. The Rad17–RFC and Rad9–Rad1–Hus1 (9–1–1) complexes interact with each other to contribute to ATR signaling, however, the precise regulatory mechanism of the interaction has not been established. Here, we identified a conserved sequence motif, KYxxL, in the AAA+ domain of Rad17 protein, and demonstrated that this motif is essential for the interaction with the 9–1–1 complex. We also show that UV-induced Rad17 phosphorylation is increased in the Rad17 KYxxL mutants. These data indicate that the interaction with the 9–1–1 complex is not required for Rad17 protein to be an efficient substrate for the UV-induced phosphorylation. Our data also raise the possibility that the 9–1–1 complex plays a negative regulatory role in the Rad17 phosphorylation. We also show that the nucleotide-binding activity of Rad17 is required for its nuclear localization. - Highlights: • We have identified a conserved KYxxL motif in Rad17 protein. • The KYxxL motif is crucial for the interaction with the 9–1–1 complex. • The KYxxL motif is dispensable or inhibitory for UV-induced Rad17 phosphorylation. • Nucleotide binding of Rad17 is required for its nuclear localization.

  17. Cloning of the cDNA for murine von Willebrand factor and identification of orthologous genes reveals the extent of conservation among diverse species.

    Science.gov (United States)

    Chitta, Mohan S; Duhé, Roy J; Kermode, John C

    2007-05-01

    Interaction of von Willebrand factor (VWF) with circulating platelets promotes hemostasis when a blood vessel is injured. The A1 domain of VWF is responsible for the initial interaction with platelets and is well conserved among species. Knowledge of the cDNA and genomic DNA sequences for human VWF allowed us to predict the cDNA sequence for murine VWF in silico and amplify its entire coding region by RT-PCR. The murine VWF cDNA has an open reading frame of 8,442 bp, encoding a protein of 2,813 amino acid residues with 83% identity to human pre-pro-VWF. The same strategy was used to predict in silico the cDNA sequence for the ortholog of VWF in a further six species. Many of these predictions diverged substantially from the putative Reference Sequences derived by ab initio methods. Our predicted sequences indicated that the VWF gene has a conserved structure of 52 exons in all seven mammalian species examined, as well as in the chicken. There is a minor structural variation in the pufferfish Takifugu rubripes insofar as the VWF gene in this species has 53 exons. Comparison of the translated amino acid sequences also revealed a high degree of conservation. In particular, the cysteine residues are conserved precisely throughout both the pro-peptide and the mature VWF sequence in all species, with a minor exception in the pufferfish VWF ortholog where two adjacent cysteine residues are omitted. The marked conservation of cysteine residues emphasizes the importance of the intricate pattern of disulfide bonds in governing the structure of pro-VWF and regulating the function of the mature VWF protein. It should also be emphasized that many of the conserved features of the VWF gene and protein were obscured when the comparison among species was based on the putative Reference Sequences instead of our predicted cDNA sequences.

  18. cDNA cloning and sequencing of human fibrillarin, a conserved nucleolar protein recognized by autoimmune antisera

    International Nuclear Information System (INIS)

    Aris, J.P.; Blobel, G.

    1991-01-01

    The authors have isolated a 1.1-kilobase cDNA clone that encodes human fibrillarin by screening a hepatoma library in parallel with DNA probes derived from the fibrillarin genes of Saccharomyces cerevisiae (NOP1) and Xenopus laevis. RNA blot analysis indicates that the corresponding mRNA is ∼1,300 nucleotides in length. Human fibrillarin expressed in vitro migrates on SDS gels as a 36-kDa protein that is specifically immunoprecipitated by antisera from humans with scleroderma autoimmune disease. Human fibrillarin contains an amino-terminal repetitive domain ∼75-80 amino acids in length that is rich in glycine and arginine residues and is similar to amino-terminal domains in the yeast and Xenopus fibrillarins. The occurrence of a putative RNA-binding domain and an RNP consensus sequence within the protein is consistent with the association of fibrillarin with small nucleolar RNAs. Protein sequence alignments show that 67% of amino acids from human fibrillarin are identical to those in yeast fibrillarin and that 81% are identical to those in Xenopus fibrillarin. This identity suggests the evolutionary conservation of an important function early in the pathway for ribosome biosynthesis

  19. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16,384 cores on a supercomputer. Copyright is held by the owner/author(s).

  20. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist for identifying candidate protein motifs at the whole genome level. However, a much needed and important task is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paper presents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifs is viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for model training and functional prediction. The mutual information of a motif and a GO term association is found to be a very useful feature. We take advantage of the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correct association. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical results demonstrated that the methods have a great potential for detecting and augmenting information about the functions of newly discovered candidate protein motifs.
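    The mutual information feature mentioned in this abstract can be illustrated with a small sketch. It treats "sequence matches the motif" and "sequence is annotated with the GO term" as two binary variables and computes their mutual information from a 2x2 table of sequence counts. The exact definition and any smoothing used in the paper may differ; the function name and argument layout are assumptions made here.

```python
from math import log2


def mutual_information(n11, n10, n01, n00):
    """Mutual information (in bits) of two binary variables.

    n11: sequences with both the motif and the GO term
    n10: sequences with the motif only
    n01: sequences with the GO term only
    n00: sequences with neither
    """
    n = n11 + n10 + n01 + n00
    motif_yes, motif_no = n11 + n10, n01 + n00
    term_yes, term_no = n11 + n01, n10 + n00
    mi = 0.0
    for joint, row, col in (
        (n11, motif_yes, term_yes),
        (n10, motif_yes, term_no),
        (n01, motif_no, term_yes),
        (n00, motif_no, term_no),
    ):
        if joint:  # empty cells contribute 0 by the usual convention
            # p(m,g) * log2( p(m,g) / (p(m) * p(g)) ), written with counts
            mi += (joint / n) * log2(joint * n / (row * col))
    return mi
```

    A value near 0 means that matching the motif says little about whether the term is assigned, while larger values indicate a stronger association. In the approach described above, such a score would be one input among several frequency-based features to the logistic regression classifier.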

  1. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.

    2013-11-11

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  2. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.; Doxey, A. C.; Wenger, A. M.; Bejerano, G.

    2013-01-01

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  3. DNA Metabarcoding Reveals Diet Overlap between the Endangered Walia Ibex and Domestic Goats - Implications for Conservation

    Science.gov (United States)

    Gebremedhin, Berihun; Flagstad, Øystein; Bekele, Afework; Chala, Desalegn; Bakkestuen, Vegar; Boessenkool, Sanne; Popp, Magnus; Gussarova, Galina; Schrøder-Nielsen, Audun; Nemomissa, Sileshi; Brochmann, Christian; Stenseth, Nils Chr.

    2016-01-01

    Human population expansion and associated degradation of the habitat of many wildlife species cause loss of biodiversity and species extinctions. The small Simen Mountains National Park in Ethiopia is one of the last strongholds for the preservation of a number of afro-alpine mammals, plants and birds, and it is home to the rare endemic Walia ibex, Capra walie. The narrow distribution range of this species as well as potential competition for resources with livestock, especially with domestic goat, Capra hircus, may compromise its future survival. Based on a curated afro-alpine taxonomic reference library constructed for plant taxon identification, we investigated the diet of the Walia ibex and addressed the dietary overlap with domestic goat using DNA metabarcoding of faecal samples. Faeces of both species were collected from different localities in the National Park. We show that both species are browsers, with forbs, shrubs and trees comprising the largest proportion of their diet, supplemented by grasses. There was a considerable overlap in dietary preferences. Several of the preferred diet items of the Walia ibex (Alchemilla sp., Hypericum revolutum, Erica arborea and Rumex sp.) were also among the most preferred diet items of the domestic goat. These results indicate that there is potential for competition between the two species, especially during the dry season, when resources are limited. Our findings, in combination with the expected increase in domestic herbivores, suggest that management plans should consider the potential threat posed by domestic goats to ensure future survival of the endangered Walia ibex. PMID:27416020

  4. MtDNA COI-COII marker and drone congregation area: an efficient method to establish and monitor honeybee (Apis mellifera L.) conservation centres.

    Science.gov (United States)

    Bertrand, Bénédicte; Alburaki, Mohamed; Legout, Hélène; Moulin, Sibyle; Mougel, Florence; Garnery, Lionel

    2015-05-01

    Honeybee subspecies have been affected by human activities in Europe over the past few decades. One such example is the importation of nonlocal subspecies of bees which has had an adverse impact on the geographical repartition and subsequently on the genetic diversity of the black honeybee Apis mellifera mellifera. To restore the original diversity of this local honeybee subspecies, different conservation centres were set up in Europe. In this study, we established a black honeybee conservation centre Conservatoire de l'Abeille Noire d'Ile de France (CANIF) in the region of Ile-de-France, France. CANIF's honeybee colonies were intensively studied over a 3-year period. This study included a drone congregation area (DCA) located in the conservation centre. MtDNA COI-COII marker was used to evaluate the genetic diversity of CANIF's honeybee populations and the drones found and collected from the DCA. The same marker (mtDNA) was used to estimate the interactions and the haplotype frequency between CANIF's honeybee populations and 10 surrounding honeybee apiaries located outside of the CANIF. Our results indicate that the colonies of the conservation centre and the drones of the DCA show similar stable profiles compared to the surrounding populations with lower level of introgression. The mtDNA marker used on both DCA and colonies of the conservation centre seems to be an efficient approach to monitor and maintain the genetic diversity of the protected honeybee populations. © 2014 John Wiley & Sons Ltd.

  5. RNA motif search with data-driven element ordering.

    Science.gov (United States)

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  6. RecO protein initiates DNA recombination and strand annealing through two alternative DNA binding mechanisms.

    Science.gov (United States)

    Ryzhikov, Mikhail; Gupta, Richa; Glickman, Michael; Korolev, Sergey

    2014-10-17

    Recombination mediator proteins (RMPs) are important for genome stability in all organisms. Several RMPs support two alternative reactions: initiation of homologous recombination and DNA annealing. We examined mechanisms of RMPs in both reactions with Mycobacterium smegmatis RecO (MsRecO) and demonstrated that MsRecO interacts with ssDNA by two distinct mechanisms. Zinc stimulates MsRecO binding to ssDNA during annealing, whereas the recombination function is zinc-independent and is regulated by interaction with MsRecR. Thus, different structural motifs or conformations of MsRecO are responsible for interaction with ssDNA during annealing and recombination. Neither annealing nor recombinase loading depends on MsRecO interaction with the conserved C-terminal tail of single-stranded (ss) DNA-binding protein (SSB), which is known to bind Escherichia coli RecO. However, similarly to E. coli proteins, MsRecO and MsRecOR do not dismiss SSB from ssDNA, suggesting that RMPs form a complex with SSB-ssDNA even in the absence of binding to the major protein interaction motif. We propose that alternative conformations of such complexes define the mechanism by which RMPs initiate the repair of stalled replication and support two different functions during recombinational repair of DNA breaks. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. A speedup technique for (l, d)-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS), (l, d)-motif search (or Planted Motif Search (PMS)), and Edit-distance-based Motif Search (EMS). In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm always identifies the motifs, whereas an approximate algorithm may fail to identify some or all of the motifs. The exact version of the PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speed up PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very
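
    The (l, d)-motif (PMS) problem described above can be illustrated with a minimal brute-force sketch. This is not the speedup technique of the paper, only the naive exact baseline: enumerate every candidate string of length l and keep those that occur in every input sequence with at most d mismatches. The function names and the toy sequences are hypothetical, and the running time is exponential in l.

```python
from itertools import product


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(candidate, seq, d):
    """True if some window of seq is within Hamming distance d of candidate."""
    l = len(candidate)
    return any(
        hamming(candidate, seq[i:i + l]) <= d
        for i in range(len(seq) - l + 1)
    )


def planted_motif_search(seqs, l, d, alphabet="ACGT"):
    """Naive exact PMS: test all |alphabet|**l candidates (illustrative only)."""
    motifs = []
    for letters in product(alphabet, repeat=l):
        candidate = "".join(letters)
        if all(occurs_within(candidate, s, d) for s in seqs):
            motifs.append(candidate)
    return motifs


# Toy input (made up for illustration, not data from the paper).
toy_seqs = ["ACGTAC", "TTACGA", "GACGTT"]
print(planted_motif_search(toy_seqs, 3, 0))  # ['ACG']
```

    Because every candidate is checked against every window of every sequence, the sketch only works for very small l; practical exact algorithms prune this search space.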

  8. Joining inventory by parataxonomists with DNA barcoding of a large complex tropical conserved wildland in northwestern Costa Rica.

    Directory of Open Access Journals (Sweden)

    Daniel H Janzen

    Full Text Available BACKGROUND: The many components of conservation through biodiversity development of a large complex tropical wildland, Area de Conservacion Guanacaste (ACG), thrive on knowing what its biodiversity and natural history are. For 32 years a growing team of Costa Rican parataxonomists has conducted biodiversity inventory of ACG caterpillars, their food plants, and their parasitoids. In 2003, DNA barcoding was added to the inventory process. METHODOLOGY/PRINCIPAL FINDINGS: We describe some of the salient consequences for the parataxonomists of barcoding becoming part of a field biodiversity inventory process that has centuries of tradition. From the barcoding results, the parataxonomists, as well as other downstream users, gain a more fine-scale and greater understanding of the specimens they find, rear, photograph, database and deliver. The parataxonomists also need to adjust to collecting more specimens of what appear to be the "same species"--cryptic species that cannot be distinguished by eye or even food plant alone--while having to work with the name changes and taxonomic uncertainty that come with discovering that what looked like one species may be many. CONCLUSIONS/SIGNIFICANCE: These career parataxonomists, despite their lack of formal higher education, have proven very capable of absorbing and working around the additional complexity and requirements for accuracy and detail that are generated by adding barcoding to the field base of the ACG inventory. In the process, they have also gained a greater understanding of the fine details of phylogeny, relatedness, evolution, and species-packing in their own tropical complex ecosystems. There is no reason to view DNA barcoding as incompatible in any way with tropical biodiversity inventory as conducted by parataxonomists. Their year-round on-site inventory effort lends itself well to the sampling patterns and sample sizes needed to build a thorough barcode library. Furthermore, the biological

  9. The canine MHC class Ia allele DLA-88*508:01 presents diverse self- and canine distemper virus-origin peptides of varying length that have a conserved binding motif.

    Science.gov (United States)

    Ross, Peter; Nemec, Paige S; Kapatos, Alexander; Miller, Keith R; Holmes, Jennifer C; Suter, Steven E; Buntzman, Adam S; Soderblom, Erik J; Collins, Edward J; Hess, Paul R

    2018-03-01

    Ideally, CD8+ T-cell responses against virally infected or malignant cells are defined at the level of the specific peptide and restricting MHC class I element, a determination not yet made in the dog. To advance the discovery of canine CTL epitopes, we sought to determine whether a putative classical MHC class Ia gene, Dog Leukocyte Antigen (DLA)-88, presents peptides from a viral pathogen, canine distemper virus (CDV). To investigate this possibility, DLA-88*508:01, an allele prevalent in Golden Retrievers, was expressed as a FLAG-tagged construct in canine histiocytic cells to allow affinity purification of peptide-DLA-88 complexes and subsequent elution of bound peptides. Pattern analysis of self peptide sequences, which were determined by liquid chromatography-tandem mass spectrometry (LC-MS/MS), permitted binding preferences to be inferred. DLA-88*508:01 binds peptides that are 9-to-12 amino acids in length, with a modest preference for 9- and 11-mers. Hydrophobic residues are favored at positions 2 and 3, as are K, R or F residues at the C-terminus. Testing motif-matched and -unmatched synthetic peptides via peptide-MHC surface stabilization assay using a DLA-88*508:01-transfected, TAP-deficient RMA-S line supported these conclusions. With CDV infection, 22 viral peptides ranging from 9-to-12 residues in length were identified in DLA-88*508:01 eluates by LC-MS/MS. Combined motif analysis and surface stabilization assay data suggested that 11 of these 22 peptides, derived from CDV hemagglutinin, large polymerase, matrix, nucleocapsid, and V proteins, were processed and presented, and thus, potential targets of anti-viral CTL in DLA-88*508:01-bearing dogs. The presentation of diverse self and viral peptides indicates that DLA-88 is a classical MHC class Ia gene. Copyright © 2018 Elsevier B.V. All rights reserved.
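
    The binding preferences reported above (length 9 to 12, hydrophobic residues at positions 2 and 3, K, R or F at the C-terminus) can be turned into a simple screening rule. The sketch below is an illustration only: the abstract describes these residues as favored, not required, so treating them as hard constraints is a simplifying assumption, as is the particular set of amino acids counted as hydrophobic. The function name and the test peptides are hypothetical.

```python
# Assumption: which residues count as "hydrophobic" is not specified in the
# abstract; this set is one common convention, chosen for illustration.
HYDROPHOBIC = set("AVILMFW")
C_TERMINAL = set("KRF")  # C-terminal residues named in the abstract


def matches_motif(peptide):
    """True if a peptide satisfies the simplified motif as hard constraints."""
    p = peptide.upper()
    if not 9 <= len(p) <= 12:
        return False
    # Positions 2 and 3 are 1-based in the abstract, so indices 1 and 2 here.
    return p[1] in HYDROPHOBIC and p[2] in HYDROPHOBIC and p[-1] in C_TERMINAL


# Made-up peptides, not sequences from the study.
for pep in ["ALVGGGGGK", "AGVGGGGGK", "ALVGGGGK"]:
    print(pep, matches_motif(pep))
```

    A scoring scheme that weights each position would reflect "preference" better than this yes/no filter, but it would need residue frequencies that the abstract does not give.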

  10. Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

    Science.gov (United States)

    Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

    2018-01-01

    Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed on linking this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…

  11. SSTRAP: A computational model for genomic motif discovery ...

    African Journals Online (AJOL)

    Computational methods can potentially provide high-quality prediction of biological molecules such as DNA binding sites and Transcription factors and therefore reduce the time needed for experimental verification and challenges associated with experimental methods. These biological molecules or motifs have significant ...

  12. Some AFLP amplicons are highly conserved DNA sequences mapping to the same linkage groups in two F2 populations of carrot

    Directory of Open Access Journals (Sweden)

    Santos Carlos A.F.

    2002-01-01

    Full Text Available Amplified fragment length polymorphism (AFLP) is a fast and reliable tool to generate a large number of DNA markers. In two unrelated F2 populations of carrot (Daucus carota L.), Brasilia x HCM and B493 x QAL (wild carrot), it was hypothesized that DNA (1) digested with the same restriction endonuclease enzymes and amplified with the same primer combination and (2) sharing the same position in polyacrylamide gels should be conserved sequences. To test this hypothesis AFLP fragments from polyacrylamide gels were eluted, reamplified, separated in agarose gels, purified, cloned and sequenced. Among thirty-one paired fragments from each F2 population, twenty-six had identity greater than 91% and five presented identity of 24% to 44%. Among the twenty-six conserved AFLPs only one mapped to different linkage groups in the two populations while four of the five less-conserved bands mapped to different linkage groups. Of eight SCAR (sequence characterized amplified regions) primers tested, one conserved AFLP resulted in co-dominant markers in both populations. Screening among 14 carrot inbreds or cultivars with three AFLP-SCAR primers revealed clear and polymorphic PCR products, with similar molecular sizes on agarose gels. The development of co-dominant markers based on conserved AFLP fragments will be useful to detect seed mixtures among hybrids, to improve and to merge linkage maps and to study diversity and phylogenetic relationships.

  13. Role of conserved cysteine residues in Herbaspirillum seropedicae NifA activity.

    Science.gov (United States)

    Oliveira, Marco A S; Baura, Valter A; Aquino, Bruno; Huergo, Luciano F; Kadowaki, Marco A S; Chubatsu, Leda S; Souza, Emanuel M; Dixon, Ray; Pedrosa, Fábio O; Wassem, Roseli; Monteiro, Rose A

    2009-01-01

    Herbaspirillum seropedicae is an endophytic diazotrophic bacterium that associates with economically important crops. NifA protein, the transcriptional activator of nif genes in H. seropedicae, binds to nif promoters and, together with RNA polymerase-sigma(54) holoenzyme, catalyzes the formation of open complexes to allow transcription initiation. The activity of H. seropedicae NifA is controlled by ammonium and oxygen levels, but the mechanisms of such control are unknown. Oxygen sensitivity is attributed to a conserved motif of cysteine residues in NifA that spans the central AAA+ domain and the interdomain linker that connects the AAA+ domain to the C-terminal DNA binding domain. Here we mutagenized this conserved motif of cysteines and assayed the activity of mutant proteins in vivo. We also purified the mutant variants of NifA and tested their capacity to bind to the nifB promoter region. Chimeric proteins between H. seropedicae NifA, an oxygen-sensitive protein, and Azotobacter vinelandii NifA, an oxygen-tolerant protein, were constructed and showed that the oxygen response is conferred by the central AAA+ and C-terminal DNA binding domains of H. seropedicae NifA. We conclude that the conserved cysteine motif is essential for NifA activity, although single cysteine-to-serine mutants are still competent at binding DNA.

  14. Relaxed selection against accidental binding of transcription factors with conserved chromatin contexts.

    Science.gov (United States)

    Babbitt, G A

    2010-10-15

    The spurious (or nonfunctional) binding of transcription factors (TF) to the wrong locations on DNA presents a formidable challenge to genomes given the relatively low ceiling for sequence complexity within the short lengths of most binding motifs. The high potential for the occurrence of random motifs and subsequent nonfunctional binding of many transcription factors should theoretically lead to natural selection against the occurrence of spurious motifs throughout the genome. However, because of the active influence that chromatin can exert over eukaryotic gene regulation, it may also be expected that many supposed spurious binding sites could escape purifying selection if (A) they simply occur in regions of high nucleosome occupancy or (B) their surrounding chromatin was dynamically involved in their identity and function. We compared nucleosome occupancy and the presence/absence of functionally conserved chromatin context to the strength of selection against spurious binding of various TF binding motifs in Saccharomyces yeast. While we find no direct relationship with nucleosome occupancy, we find strong evidence that transcription factors spatially associated with evolutionarily conserved chromatin states are under relaxed selection against accidental binding. Transcription factors (with/without) a conserved chromatin context were found to occur, on average, at (87.7%/49.3%) of their expected frequencies. Functional binding motifs with conserved chromatin contexts were also significantly shorter in length and more often clustered. These results indicate a role of chromatin context dependency in relaxing selection against spurious binding in nearly half of all TF binding motifs throughout the yeast genome. 2010 Elsevier B.V. All rights reserved.
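
    The comparison of observed motif occurrences with their expected frequencies can be illustrated with a small sketch. The abstract does not state how expected frequencies were computed, so the model below (independent bases drawn at the genome's own base composition) is an assumption made purely for illustration; the function names and the toy sequence are hypothetical.

```python
from collections import Counter


def count_occurrences(motif, genome):
    """Count (possibly overlapping) exact occurrences of motif in genome."""
    k = len(motif)
    return sum(1 for i in range(len(genome) - k + 1) if genome[i:i + k] == motif)


def expected_count(motif, genome):
    """Expected occurrences under an independent-base model (an assumption)."""
    n = len(genome)
    if n < len(motif):
        return 0.0
    freqs = Counter(genome)
    p = 1.0
    for base in motif:
        p *= freqs[base] / n  # probability of this base at any one position
    return p * (n - len(motif) + 1)  # number of possible start positions


def observed_over_expected(motif, genome):
    """Ratio below 1 suggests under-representation; above 1, enrichment."""
    expected = expected_count(motif, genome)
    if expected == 0:
        return float("nan")
    return count_occurrences(motif, genome) / expected


toy_genome = "ACGT" * 25  # made-up sequence, 100 bases
print(count_occurrences("ACG", toy_genome))  # 25
print(expected_count("ACG", toy_genome))     # 1.53125
```

    A ratio of about 0.49, for example, would correspond to a motif occurring at roughly half the frequency this null model predicts.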

  15. Methodological considerations for detection of terrestrial small-body salamander eDNA and implications for biodiversity conservation

    Science.gov (United States)

    Walker, Donald M.; Leys, Jacob E.; Dunham, Kelly E.; Oliver, Joshua C.; Schiller, Emily E.; Stephenson, Kelsey S.; Kimrey, John T.; Wooten, Jessica; Rogers, Mark W.

    2017-01-01

    Environmental DNA (eDNA) can be used as an assessment tool to detect populations of threatened species and provide fine-scale data required to make management decisions. The objectives of this project were to use quantitative PCR (qPCR) to: (i) detect spiked salamander DNA in soil, (ii) quantify eDNA degradation over time, (iii) determine detectability of salamander eDNA in a terrestrial environment using soil, faeces, and skin swabs, (iv) detect salamander eDNA in a mesocosm experiment. Salamander eDNA was positively detected in 100% of skin swabs and 66% of faecal samples and concentrations did not differ between the two sources. However, eDNA was not detected in soil samples collected from directly underneath wild-caught living salamanders. Salamander genomic DNA (gDNA) was detected in all qPCR reactions when spiked into soil at 10.0, 5.0, and 1.0 ng/g soil and spike concentration had a significant effect on detected concentrations. Only 33% of samples showed recoverable eDNA when spiked with 0.25 ng/g soil, which was the low end of eDNA detection. To determine the rate of eDNA degradation, gDNA (1 ng/g soil) was spiked into soil and quantified over seven days. Salamander eDNA concentrations decreased across days, but eDNA was still amplifiable at day 7. Salamander eDNA was detected in two of 182 mesocosm soil samples over 12 weeks (n = 52 control samples; n = 65 presence samples; n = 65 eviction samples). The discrepancy in detection success between experiments indicates the potential challenges for this method to be used as a monitoring technique for small-bodied wild terrestrial salamander populations.

  16. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf) remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS); this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  17. Conservative fragments in bacterial 16S rRNA genes and primer design for 16S ribosomal DNA amplicons in metagenomic studies

    KAUST Repository

    Wang, Yong

    2009-10-09

    Bacterial 16S ribosomal DNA (rDNA) amplicons have been widely used in the classification of uncultured bacteria inhabiting environmental niches. Primers targeting conservative regions of the rDNAs are used to generate amplicons of variant regions that are informative in taxonomic assignment. One problem is that the percentage coverage and application scope of the primers used in previous studies are largely unknown. In this study, conservative fragments of available rDNA sequences were first mined and then used to search for candidate primers within the fragments by measuring the coverage rate, defined as the percentage of bacterial sequences containing the target. Thirty predicted primers with a high coverage rate (>90%) were identified, which were basically located in the same conservative regions as known primers in previous reports, whereas 30% of the known primers were associated with a coverage rate of <90%. The application scope of the primers was also examined by calculating the percentages of failed detections in bacterial phyla. Primers A519-539, E969-983, E1063-1081, U515 and E517 are highly recommended because of their high coverage in almost all phyla. As expected, the three predominant phyla, Firmicutes, Gemmatimonadetes and Proteobacteria, are best covered by the predicted primers. The primers recommended in this report shall facilitate a comprehensive and reliable survey of bacterial diversity in metagenomic studies. © 2009 Wang, Qian.
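
    The coverage rate defined above, the percentage of sequences containing the primer target, is straightforward to compute. The sketch below is a minimal illustration and not the pipeline used in the study: it assumes a match means the primer occurs somewhere in the sequence on the given strand, with IUPAC degenerate bases expanded into a regular expression. The function names, the primer string, and the sequences are all made up for the example.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def primer_regex(primer):
    """Compile a (possibly degenerate) primer into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in primer.upper()))


def coverage_rate(primer, sequences):
    """Percentage of sequences that contain at least one primer match."""
    if not sequences:
        return 0.0
    pattern = primer_regex(primer)
    hits = sum(1 for seq in sequences if pattern.search(seq.upper()))
    return 100.0 * hits / len(sequences)


# Toy data for illustration only (not real 16S sequences).
toy_sequences = ["AAGTGCCAGCAGCC", "TTGTGCCAGCCGCC", "ACGTACGT", "GTGCCAGCAG"]
print(coverage_rate("GTGCCAGCMGCC", toy_sequences))  # 50.0
```

    Grouping the sequences by phylum before calling `coverage_rate` would give the per-phylum failure percentages that the abstract uses to assess application scope.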

  18. POWRS: position-sensitive motif discovery.

    Directory of Open Access Journals (Sweden)

    Ian W Davis

    Full Text Available Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm "POWRS" (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.

  19. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Science.gov (United States)

    Oliveira, Graziele Pereira; Andrade, Ana Cláudia dos Santos Pereira; Rodrigues, Rodrigo Araújo Lima; Arantes, Thalita Souza; Boratto, Paulo Victor Miranda; Silva, Ludmila Karen dos Santos; Dornas, Fábio Pio; Trindade, Giliane de Souza; Drumond, Betânia Paiva; La Scola, Bernard; Kroon, Erna Geessien; Abrahão, Jônatas Santos

    2017-01-01

    For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations. PMID:28117683

  20. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Directory of Open Access Journals (Sweden)

    Graziele Pereira Oliveira

    2017-01-01

    Full Text Available For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations.

  1. Superior induction of T cell responses to conserved HIV-1 regions by electroporated alphavirus replicon DNA compared to that with conventional plasmid DNA vaccine.

    Science.gov (United States)

    Knudsen, Maria L; Mbewe-Mvula, Alice; Rosario, Maximillian; Johansson, Daniel X; Kakoulidou, Maria; Bridgeman, Anne; Reyes-Sandoval, Arturo; Nicosia, Alfredo; Ljungberg, Karl; Hanke, Tomás; Liljeström, Peter

    2012-04-01

    Vaccination using "naked" DNA is a highly attractive strategy for induction of pathogen-specific immune responses; however, it has been only weakly immunogenic in humans. Previously, we constructed DNA-launched Semliki Forest virus replicons (DREP), which stimulate pattern recognition receptors and induce augmented immune responses. Also, in vivo electroporation was shown to enhance immune responses induced by conventional DNA vaccines. Here, we combine these two approaches and show that in vivo electroporation increases CD8(+) T cell responses induced by DREP and consequently decreases the DNA dose required to induce a response. The vaccines used in this study encode the multiclade HIV-1 T cell immunogen HIVconsv, which is currently being evaluated in clinical trials. Using intradermal delivery followed by electroporation, the DREP.HIVconsv DNA dose could be reduced to as low as 3.2 ng to elicit frequencies of HIV-1-specific CD8(+) T cells comparable to those induced by 1 μg of a conventional pTH.HIVconsv DNA vaccine, representing a 625-fold molar reduction in dose. Responses induced by both DREP.HIVconsv and pTH.HIVconsv were further increased by heterologous vaccine boosts employing modified vaccinia virus Ankara MVA.HIVconsv and attenuated chimpanzee adenovirus ChAdV63.HIVconsv. Using the same HIVconsv vaccines, the mouse observations were supported by an at least 20-fold-lower dose of DNA vaccine in rhesus macaques. These data point toward a strategy for overcoming the low immunogenicity of DNA vaccines in humans and strongly support further development of the DREP vaccine platform for clinical evaluation.

  2. Chromosome-wide mapping of DNA methylation patterns in normal and malignant prostate cells reveals pervasive methylation of gene-associated and conserved intergenic sequences

    Directory of Open Access Journals (Sweden)

    De Marzo Angelo M

    2011-06-01

    Full Text Available Abstract Background DNA methylation has been linked to genome regulation and dysregulation in health and disease respectively, and methods for characterizing genomic DNA methylation patterns are rapidly emerging. We have developed/refined methods for enrichment of methylated genomic fragments using the methyl-binding domain of the human MBD2 protein (MBD2-MBD) followed by analysis with high-density tiling microarrays. This MBD-chip approach was used to characterize DNA methylation patterns across all non-repetitive sequences of human chromosomes 21 and 22 at high-resolution in normal and malignant prostate cells. Results Examining this data using computational methods that were designed specifically for DNA methylation tiling array data revealed widespread methylation of both gene promoter and non-promoter regions in cancer and normal cells. In addition to identifying several novel cancer hypermethylated 5' gene upstream regions that mediated epigenetic gene silencing, we also found several hypermethylated 3' gene downstream, intragenic and intergenic regions. The hypermethylated intragenic regions were highly enriched for overlap with intron-exon boundaries, suggesting a possible role in regulation of alternative transcriptional start sites, exon usage and/or splicing. The hypermethylated intergenic regions showed significant enrichment for conservation across vertebrate species. A sampling of these newly identified promoter (ADAMTS1 and SCARF2 genes) and non-promoter (downstream or within DSCR9, C21orf57 and HLCS genes) hypermethylated regions were effective in distinguishing malignant from normal prostate tissues and/or cell lines. Conclusions Comparison of chromosome-wide DNA methylation patterns in normal and malignant prostate cells revealed significant methylation of gene-proximal and conserved intergenic sequences. Such analyses can be easily extended for genome-wide methylation analysis in health and disease.

  3. A proposed vestigial translation initiation motif in VP1 of hepatitis A virus.

    Science.gov (United States)

    Kang, Jeong-Ah; Funkhouser, Ann W

    2002-07-01

    The internal ribosome entry site (IRES) of picornaviruses has a 3' polypyrimidine tract (PPT) 16-24 bases upstream of an AUG triplet (PPT/AUG motif). This motif is critical in determining the efficiency of cap-independent translation. HAV has a conserved PPT/AUG motif consisting of a nine base sequence (AGGUUUUUC) 23 bases upstream of the preferred AUG start codon. This HAV-specific PPT/AUG motif is repeated and conserved in VP1 of HAV, but not of other picornaviruses. We proposed that the PPT/AUG motif in the open reading frame initiated translation and/or had an impact on the life cycle of the virus. In vitro translation of mutant bicistronic mRNAs and growth in cell culture of mutant viruses provided no evidence that the VP1 PPT/AUG motif had any impact on either translation or growth. HAV differs from other picornaviruses in its inefficient growth in cell culture. Since the HAV-specific PPT/AUG motif is found in only 1 in 300,000 reported viral sequences outside the hepatovirus genus, this motif may be a vestigial translation initiation element and may have played a role in determining the unusual phenotype of HAV.

  4. Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.

    Science.gov (United States)

    Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D

    2003-08-15

    DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.

  5. DNA nanotechnology: On-command molecular Trojans

    Science.gov (United States)

    Niemeyer, Christof M.

    2017-12-01

    Lipid-motif-decorated DNA nanocapsules filled with photoresponsive polymers are capable of delivering signalling molecules into target organisms for biological perturbations at high spatiotemporal resolution.

  6. Purification and functional motifs of the recombinant ATPase of orf virus.

    Science.gov (United States)

    Lin, Fong-Yuan; Chan, Kun-Wei; Wang, Chi-Young; Wong, Min-Liang; Hsu, Wei-Li

    2011-10-01

    Our previous study showed that the recombinant ATPase encoded by the A32L gene of orf virus displayed ATP hydrolysis activity as predicted from its amino acid sequence. This viral ATPase contains four known functional motifs (motifs I-IV) and a novel AYDG motif; they are essential for the ATP hydrolysis reaction by binding ATP and magnesium ions. Motifs I and II correspond to the Walker A and B motifs of the typical ATPase, respectively. To examine the biochemical roles of these five conserved motifs, recombinant ATPases of five deletion mutants derived from the Taiping strain were expressed and purified. Their ATPase functions were assayed and compared with those of two wild-type strains, Taiping and Nantou, isolated in Taiwan. Our results showed that mutants with deletions at motifs I-III or IV exhibited lower activity than the wild type. Interestingly, deletion of the AYDG motif decreased the ATPase activity more significantly than deletions of motifs I-IV. Divalent ions such as magnesium and calcium were essential for ATPase activity. Moreover, our recombinant proteins of orf virus also demonstrated GTPase activity, though weaker than the original ATPase activity.

  7. Motif signatures of transcribed enhancers

    KAUST Repository

    Kleftogiannis, Dimitrios

    2017-09-14

    In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancers genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow discriminating successfully a) TrEn from random controls, proxy of non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.

  8. DNA from the past informs ex situ conservation for the future: an "extinct" species of Galápagos tortoise identified in captivity.

    Directory of Open Access Journals (Sweden)

    Michael A Russello

    2010-01-01

    Although it is not unusual to find captive relicts of species lost in the wild, rarely are presumed extinct species rediscovered outside of their native range. A recent study detected living descendants of an extinct Galápagos tortoise species (Chelonoidis elephantopus) once endemic to Floreana Island on the neighboring island of Isabela. This finding adds to the growing cryptic diversity detected among these species in the wild. There also exists a large number of Galápagos tortoises in captivity of ambiguous origin. The recently accumulated population-level haplotypic and genotypic data now available for C. elephantopus add a critical reference population to the existing database of 11 extant species for investigating the origin of captive individuals of unknown ancestry. We reanalyzed mitochondrial DNA control region haplotypes and microsatellite genotypes of 156 captive individuals using an expanded reference database that included all extant Galápagos tortoise species as well as the extinct species from Floreana. Nine individuals (six females and three males) exhibited strong signatures of Floreana ancestry and a high probability of assignment to C. elephantopus as detected by Bayesian assignment and clustering analyses of empirical and simulated data. One male with high assignment probability to C. elephantopus based on microsatellite genotypic data also possessed a "Floreana-like" mitochondrial DNA haplotype. Historical DNA analysis of museum specimens has provided critical spatial and temporal components to ecological, evolutionary, taxonomic and conservation-related research, but rarely has it informed ex situ species recovery efforts. Here, the availability of population-level genotypic data from the extinct C. elephantopus enabled the identification of nine Galápagos tortoise individuals of substantial conservation value that were previously misassigned to extant species of varying conservation status. As all captive individuals of C

  9. Mutations in the putative zinc-binding motif of UL52 demonstrate a complex interdependence between the UL5 and UL52 subunits of the human herpes simplex virus type 1 helicase/primase complex.

    Science.gov (United States)

    Chen, Yan; Carrington-Lawrence, Stacy D; Bai, Ping; Weller, Sandra K

    2005-07-01

    Herpes simplex virus type 1 (HSV-1) encodes a heterotrimeric helicase-primase (UL5/8/52) complex. UL5 contains seven motifs found in helicase superfamily 1, and UL52 contains conserved motifs found in primases. The contributions of each subunit to the biochemical activities of the complex, however, remain unclear. We have previously demonstrated that a mutation in the putative zinc finger at UL52 C terminus abrogates not only primase but also ATPase, helicase, and DNA-binding activities of a UL5/UL52 subcomplex, indicating a complex interdependence between the two subunits. To test this hypothesis and to further investigate the role of the zinc finger in the enzymatic activities of the helicase-primase, a series of mutations were constructed in this motif. They differed in their ability to complement a UL52 null virus: totally defective, partial complementation, and potentiating. In this study, four of these mutants were studied biochemically after expression and purification from insect cells infected with recombinant baculoviruses. All mutants show greatly reduced primase activity. Complementation-defective mutants exhibited severe defects in ATPase, helicase, and DNA-binding activities. Partially complementing mutants displayed intermediate levels of these activities, except that one showed a wild-type level of helicase activity. These data suggest that the UL52 zinc finger motif plays an important role in the activities of the helicase-primase complex. The observation that mutations in UL52 affected helicase, ATPase, and DNA-binding activities indicates that UL52 binding to DNA via the zinc finger may be necessary for loading UL5. Alternatively, UL5 and UL52 may share a DNA-binding interface.

  10. Perception Enhancement using Visual Attributes in Sequence Motif Visualization

    OpenAIRE

    Oon, Yin; Lee, Nung; Kok, Wei

    2016-01-01

    Sequence logo is a well-accepted scientific method to visualize the conservation characteristics of biological sequence motifs. Previous studies found that using sequence logo graphical representation for scientific evidence reports or arguments could seriously cause biases and misinterpretation by users. This study investigates on the visual attributes performance of a sequence logo in helping users to perceive and interpret the information based on preattentive theories and Gestalt principl...

  11. Nuclear and Mitochondrial DNA Analyses of Golden Eagles (Aquila chrysaetos canadensis) from Three Areas in Western North America; Initial Results and Conservation Implications.

    Directory of Open Access Journals (Sweden)

    Erica H Craig

    Understanding the genetics of a population is a critical component of developing conservation strategies. We used archived tissue samples from golden eagles (Aquila chrysaetos canadensis) in three geographic regions of western North America to conduct a preliminary study of the genetics of the North American subspecies, and to provide data for United States Fish and Wildlife Service (USFWS) decision-making for golden eagle management. We used a combination of mitochondrial DNA (mtDNA) D-loop sequences and 16 nuclear DNA (nDNA) microsatellite loci to investigate the extent of gene flow among our sampling areas in Idaho, California and Alaska and to determine if we could distinguish birds from the different geographic regions based on their genetic profiles. Our results indicate high genetic diversity, low genetic structure and high connectivity. Nuclear DNA Fst values between Idaho and California were low but significantly different from zero (0.026). Bayesian clustering methods indicated a single population, and we were unable to distinguish summer breeding residents from different regions. Results of the mtDNA AMOVA showed that most of the haplotype variation (97%) was within the geographic populations while 3% variation was partitioned among them. One haplotype was common to all three areas. One region-specific haplotype was detected in California and one in Idaho, but additional sampling is required to determine if these haplotypes are unique to those geographic areas or a sampling artifact. We discuss potential sources of the high gene flow for this species including natal and breeding dispersal, floaters, and changes in migratory behavior as a result of environmental factors such as climate change and habitat alteration. Our preliminary findings can help inform the USFWS in development of golden eagle management strategies and provide a basis for additional research into the complex dynamics of the North American subspecies.

  12. Nuclear and Mitochondrial DNA Analyses of Golden Eagles (Aquila chrysaetos canadensis) from Three Areas in Western North America; Initial Results and Conservation Implications.

    Science.gov (United States)

    Craig, Erica H; Adams, Jennifer R; Waits, Lisette P; Fuller, Mark R; Whittington, Diana M

    2016-01-01

    Understanding the genetics of a population is a critical component of developing conservation strategies. We used archived tissue samples from golden eagles (Aquila chrysaetos canadensis) in three geographic regions of western North America to conduct a preliminary study of the genetics of the North American subspecies, and to provide data for United States Fish and Wildlife Service (USFWS) decision-making for golden eagle management. We used a combination of mitochondrial DNA (mtDNA) D-loop sequences and 16 nuclear DNA (nDNA) microsatellite loci to investigate the extent of gene flow among our sampling areas in Idaho, California and Alaska and to determine if we could distinguish birds from the different geographic regions based on their genetic profiles. Our results indicate high genetic diversity, low genetic structure and high connectivity. Nuclear DNA Fst values between Idaho and California were low but significantly different from zero (0.026). Bayesian clustering methods indicated a single population, and we were unable to distinguish summer breeding residents from different regions. Results of the mtDNA AMOVA showed that most of the haplotype variation (97%) was within the geographic populations while 3% variation was partitioned among them. One haplotype was common to all three areas. One region-specific haplotype was detected in California and one in Idaho, but additional sampling is required to determine if these haplotypes are unique to those geographic areas or a sampling artifact. We discuss potential sources of the high gene flow for this species including natal and breeding dispersal, floaters, and changes in migratory behavior as a result of environmental factors such as climate change and habitat alteration. Our preliminary findings can help inform the USFWS in development of golden eagle management strategies and provide a basis for additional research into the complex dynamics of the North American subspecies.

  13. Nuclear and mitochondrial DNA analyses of golden eagles (Aquila chrysaetos canadensis) from three areas in western North America; initial results and conservation implications

    Science.gov (United States)

    Craig, Erica H; Adams, Jennifer R.; Waits, Lisette P.; Fuller, Mark R.; Whittington, Diana M.

    2016-01-01

    Understanding the genetics of a population is a critical component of developing conservation strategies. We used archived tissue samples from golden eagles (Aquila chrysaetos canadensis) in three geographic regions of western North America to conduct a preliminary study of the genetics of the North American subspecies, and to provide data for United States Fish and Wildlife Service (USFWS) decision-making for golden eagle management. We used a combination of mitochondrial DNA (mtDNA) D-loop sequences and 16 nuclear DNA (nDNA) microsatellite loci to investigate the extent of gene flow among our sampling areas in Idaho, California and Alaska and to determine if we could distinguish birds from the different geographic regions based on their genetic profiles. Our results indicate high genetic diversity, low genetic structure and high connectivity. Nuclear DNA Fst values between Idaho and California were low but significantly different from zero (0.026). Bayesian clustering methods indicated a single population, and we were unable to distinguish summer breeding residents from different regions. Results of the mtDNA AMOVA showed that most of the haplotype variation (97%) was within the geographic populations while 3% variation was partitioned among them. One haplotype was common to all three areas. One region-specific haplotype was detected in California and one in Idaho, but additional sampling is required to determine if these haplotypes are unique to those geographic areas or a sampling artifact. We discuss potential sources of the high gene flow for this species including natal and breeding dispersal, floaters, and changes in migratory behavior as a result of environmental factors such as climate change and habitat alteration. Our preliminary findings can help inform the USFWS in development of golden eagle management strategies and provide a basis for additional research into the complex dynamics of the North American subspecies.

  14. Conserved structural chemistry for incision activity in structurally non-homologous apurinic/apyrimidinic endonuclease APE1 and endonuclease IV DNA repair enzymes.

    Energy Technology Data Exchange (ETDEWEB)

    Tsutakawa, Susan E.; Shin, David S.; Mol, Clifford D.; Izum, Tadahide; Arvai, Andrew S.; Mantha, Anil K.; Szczesny, Bartosz; Ivanov, Ivaylo N.; Hosfield, David J.; Maiti, Buddhadev; Pique, Mike E.; Frankel, Kenneth A.; Hitomi, Kenichi; Cunningham, Richard P.; Mitra, Sankar; Tainer, John A.

    2013-03-22

    Non-coding apurinic/apyrimidinic (AP) sites in DNA form spontaneously and as DNA base excision repair intermediates are the most common toxic and mutagenic in vivo DNA lesion. For repair, AP sites must be processed by 5' AP endonucleases in initial stages of base repair. Human APE1 and bacterial Nfo represent the two conserved 5' AP endonuclease families in the biosphere; they both recognize AP sites and incise the phosphodiester backbone 5' to the lesion, yet they lack similar structures and metal ion requirements. Here, we determined and analyzed crystal structures of a 2.4 Å resolution APE1-DNA product complex with Mg(2+) and a 0.92 Å Nfo with three metal ions. Structural and biochemical comparisons of these two evolutionarily distinct enzymes characterize key APE1 catalytic residues that are potentially functionally similar to Nfo active site components, as further tested and supported by computational analyses. We observe a magnesium-water cluster in the APE1 active site, with only Glu-96 forming the direct protein coordination to the Mg(2+). Despite differences in structure and metal requirements of APE1 and Nfo, comparison of their active site structures surprisingly reveals strong geometric conservation of the catalytic reaction, with APE1 catalytic side chains positioned analogously to Nfo metal positions, suggesting surprising functional equivalence between Nfo metal ions and APE1 residues. The finding that APE1 residues are positioned to substitute for Nfo metal ions is supported by the impact of mutations on activity. Collectively, the results illuminate the activities of residues, metal ions, and active site features for abasic site endonucleases.

  15. Conserved Organisation of 45S rDNA Sites and rDNA Gene Copy Number among Major Clades of Early Land Plants

    Czech Academy of Sciences Publication Activity Database

    Rosato, M.; Kovařík, Aleš; Garilleti, R.; Rosselló, J. A.

    2016-01-01

    Vol. 11, No. 9 (2016), article no. e0162544. E-ISSN 1932-6203. R&D Projects: GA ČR GBP501/12/G090. Institutional support: RVO:68081707. Keywords: molecular cytogenetic analyses * nuclear ribosomal dna. Subject RIV: BO - Biophysics. Impact factor: 2.806, year: 2016

  16. Solution NMR structure of the HLTF HIRAN domain: a conserved module in SWI2/SNF2 DNA damage tolerance proteins

    International Nuclear Information System (INIS)

    Korzhnev, Dmitry M.; Neculai, Dante; Dhe-Paganon, Sirano; Arrowsmith, Cheryl H.; Bezsonova, Irina

    2016-01-01

    HLTF is a SWI2/SNF2-family ATP-dependent chromatin remodeling enzyme that acts in the error-free branch of DNA damage tolerance (DDT), a cellular mechanism that enables replication of damaged DNA while leaving damage repair for a later time. Human HLTF and a closely related protein SHPRH, as well as their yeast homologue Rad5, are multi-functional enzymes that share E3 ubiquitin-ligase activity required for activation of the error-free DDT. HLTF and Rad5 also function as ATP-dependent dsDNA translocases and possess replication fork reversal activities. Thus, they can convert Y-shaped replication forks into X-shaped Holliday junction structures that allow error-free replication over DNA lesions. The fork reversal activity of HLTF is dependent on 3′-ssDNA-end binding activity of its N-terminal HIRAN domain. Here we present the solution NMR structure of the human HLTF HIRAN domain, an OB-like fold module found in organisms from bacteria (as a stand-alone domain) to plants, fungi and metazoan (in combination with SWI2/SNF2 helicase-like domain). The obtained structure of free HLTF HIRAN is similar to recently reported structures of its DNA bound form, while the NMR analysis also reveals that the DNA binding site of the free domain exhibits conformational heterogeneity. Sequence comparison of N-terminal regions of HLTF, SHPRH and Rad5 aided by knowledge of the HLTF HIRAN structure suggests that the SHPRH N-terminus also includes an uncharacterized structured module, exhibiting weak sequence similarity with HIRAN regions of HLTF and Rad5, and potentially playing a similar functional role.

  17. Solution NMR structure of the HLTF HIRAN domain: a conserved module in SWI2/SNF2 DNA damage tolerance proteins

    Energy Technology Data Exchange (ETDEWEB)

    Korzhnev, Dmitry M. [University of Connecticut Health, Department of Molecular Biology and Biophysics (United States); Neculai, Dante [Zhejiang University, School of Medicine (China); Dhe-Paganon, Sirano [Dana-Farber Cancer Institute, Department of Cancer Biology (United States); Arrowsmith, Cheryl H. [University of Toronto, Structural Genomics Consortium (Canada); Bezsonova, Irina, E-mail: bezsonova@uchc.edu [University of Connecticut Health, Department of Molecular Biology and Biophysics (United States)

    2016-11-15

    HLTF is a SWI2/SNF2-family ATP-dependent chromatin remodeling enzyme that acts in the error-free branch of DNA damage tolerance (DDT), a cellular mechanism that enables replication of damaged DNA while leaving damage repair for a later time. Human HLTF and a closely related protein SHPRH, as well as their yeast homologue Rad5, are multi-functional enzymes that share E3 ubiquitin-ligase activity required for activation of the error-free DDT. HLTF and Rad5 also function as ATP-dependent dsDNA translocases and possess replication fork reversal activities. Thus, they can convert Y-shaped replication forks into X-shaped Holliday junction structures that allow error-free replication over DNA lesions. The fork reversal activity of HLTF is dependent on 3′-ssDNA-end binding activity of its N-terminal HIRAN domain. Here we present the solution NMR structure of the human HLTF HIRAN domain, an OB-like fold module found in organisms from bacteria (as a stand-alone domain) to plants, fungi and metazoan (in combination with SWI2/SNF2 helicase-like domain). The obtained structure of free HLTF HIRAN is similar to recently reported structures of its DNA bound form, while the NMR analysis also reveals that the DNA binding site of the free domain exhibits conformational heterogeneity. Sequence comparison of N-terminal regions of HLTF, SHPRH and Rad5 aided by knowledge of the HLTF HIRAN structure suggests that the SHPRH N-terminus also includes an uncharacterized structured module, exhibiting weak sequence similarity with HIRAN regions of HLTF and Rad5, and potentially playing a similar functional role.

  18. Selection against spurious promoter motifs correlates with translational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the σ⁷⁰ subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.
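    Over- or under-representation of a motif is commonly judged by comparing its observed count with the count expected from base composition. The sketch below is a minimal version of that idea and not the authors' method: it assumes the canonical -10 hexamer TATAAT, a zero-order (independent bases) background and a single strand, and the name `observed_expected` is hypothetical.

```python
from collections import Counter


def observed_expected(seq: str, motif: str = "TATAAT") -> tuple[int, float]:
    """Return (observed count, expected count) of `motif` in `seq`.

    Assumptions: exact matches only, one strand, and a zero-order
    background in which each base occurs independently at its overall
    frequency in `seq`.
    """
    seq, motif = seq.upper(), motif.upper()
    n, k = len(seq), len(motif)
    if n < k:
        return 0, 0.0
    windows = n - k + 1
    # Overlapping occurrences are counted, one per start position.
    observed = sum(1 for i in range(windows) if seq[i:i + k] == motif)
    freqs = Counter(seq)
    p = 1.0
    for base in motif:
        p *= freqs[base] / n  # probability of this base under the background
    return observed, p * windows


obs, exp = observed_expected("TATAATGC")
```

    An observed/expected ratio below 1 in a region would be read as under-representation; a real analysis would also need a higher-order background model and a significance test.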

  19. DnaA protein DNA-binding domain binds to Hda protein to promote inter-AAA+ domain interaction involved in regulatory inactivation of DnaA.

    Science.gov (United States)

    Keyamura, Kenji; Katayama, Tsutomu

    2011-08-19

    Chromosomal replication is initiated from the replication origin oriC in Escherichia coli by the active ATP-bound form of DnaA protein. The regulatory inactivation of DnaA (RIDA) system, a complex of the ADP-bound Hda and the DNA-loaded replicase clamp, represses extra initiations by facilitating DnaA-bound ATP hydrolysis, yielding the inactive ADP-bound form of DnaA. However, the mechanisms involved in promoting the DnaA-Hda interaction have not been determined except for the involvement of an interaction between the AAA+ domains of the two. This study revealed that DnaA Leu-422 and Pro-423 residues within DnaA domain IV, including a typical DNA-binding HTH motif, are specifically required for RIDA-dependent ATP hydrolysis in vitro and that these residues support efficient interaction with the DNA-loaded clamp·Hda complex and with Hda in vitro. Consistently, substitutions of these residues caused accumulation of ATP-bound DnaA in vivo and oriC-dependent inhibition of cell growth. Leu-422 plays a more important role in these activities than Pro-423. By contrast, neither of these residues is crucial for DNA replication from oriC, although they are highly conserved in DnaA orthologues. Structural analysis of a DnaA·Hda complex model suggested that these residues make contact with residues in the vicinity of the Hda AAA+ sensor I that participates in formation of a nucleotide-interacting surface. Together, the results show that functional DnaA-Hda interactions require a second interaction site within DnaA domain IV in addition to the AAA+ domain and suggest that these interactions are crucial for the formation of RIDA complexes that are active for DnaA-ATP hydrolysis.

  20. DnaA Protein DNA-binding Domain Binds to Hda Protein to Promote Inter-AAA+ Domain Interaction Involved in Regulatory Inactivation of DnaA*

    Science.gov (United States)

    Keyamura, Kenji; Katayama, Tsutomu

    2011-01-01

    Chromosomal replication is initiated from the replication origin oriC in Escherichia coli by the active ATP-bound form of DnaA protein. The regulatory inactivation of DnaA (RIDA) system, a complex of the ADP-bound Hda and the DNA-loaded replicase clamp, represses extra initiations by facilitating DnaA-bound ATP hydrolysis, yielding the inactive ADP-bound form of DnaA. However, the mechanisms involved in promoting the DnaA-Hda interaction have not been determined except for the involvement of an interaction between the AAA+ domains of the two. This study revealed that DnaA Leu-422 and Pro-423 residues within DnaA domain IV, including a typical DNA-binding HTH motif, are specifically required for RIDA-dependent ATP hydrolysis in vitro and that these residues support efficient interaction with the DNA-loaded clamp·Hda complex and with Hda in vitro. Consistently, substitutions of these residues caused accumulation of ATP-bound DnaA in vivo and oriC-dependent inhibition of cell growth. Leu-422 plays a more important role in these activities than Pro-423. By contrast, neither of these residues is crucial for DNA replication from oriC, although they are highly conserved in DnaA orthologues. Structural analysis of a DnaA·Hda complex model suggested that these residues make contact with residues in the vicinity of the Hda AAA+ sensor I that participates in formation of a nucleotide-interacting surface. Together, the results show that functional DnaA-Hda interactions require a second interaction site within DnaA domain IV in addition to the AAA+ domain and suggest that these interactions are crucial for the formation of RIDA complexes that are active for DnaA-ATP hydrolysis. PMID:21708944

  1. Characterization of genetic diversity in chickpea using SSR markers, Start Codon Targeted Polymorphism (SCoT) and Conserved DNA-Derived Polymorphism (CDDP).

    Science.gov (United States)

    Hajibarat, Zahra; Saidi, Abbas; Hajibarat, Zohreh; Talebi, Reza

    2015-07-01

    To evaluate the genetic diversity among 48 genotypes of chickpea comprising cultivars, landraces and internationally developed improved lines, genetic distances were evaluated using three different molecular marker techniques: Simple Sequence Repeat (SSR), Start Codon Targeted (SCoT) and Conserved DNA-derived Polymorphism (CDDP). Average polymorphism information content (PIC) for SSR, SCoT and CDDP markers was 0.47, 0.45 and 0.45, respectively, and this revealed that the three marker types were equally suited to the assessment of diversity amongst genotypes. Cluster analysis for SSR and SCoT divided the genotypes into three distinct clusters and, using CDDP marker data, genotypes grouped into five clusters. There were positive significant correlation (r = 0.43, P SSR markers. These results suggest that the efficiency of SSR, SCoT and CDDP markers was relatively the same in fingerprinting of chickpea genotypes. To our knowledge, this is the first detailed report of using a targeted DNA region molecular marker (CDDP) for genetic diversity analysis in chickpea in comparison with SCoT and SSR markers. Overall, our results support the suitability of SCoT and CDDP markers for genetic diversity analysis in chickpea, given their high rates of polymorphism and their potential for genome diversity and germplasm conservation.
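    For readers unfamiliar with PIC, one widely used definition (Botstein et al., 1980) is PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², where p_i is the frequency of allele i at a locus. The abstract does not say which formula the authors applied, so the sketch below is illustrative only, and the function name `pic` is hypothetical.

```python
from itertools import combinations


def pic(allele_freqs: list[float]) -> float:
    """Polymorphism information content for one locus (Botstein et al. form).

    `allele_freqs` may be frequencies or raw counts; values are normalised
    so that they sum to 1 before the formula is applied.
    """
    total = sum(allele_freqs)
    p = [f / total for f in allele_freqs]
    homozygosity = sum(x * x for x in p)
    # Correction term over all unordered pairs of distinct alleles.
    cross = sum(2 * (a * a) * (b * b) for a, b in combinations(p, 2))
    return 1.0 - homozygosity - cross


two_equal_alleles = pic([0.5, 0.5])   # 1 - 0.5 - 0.125
monomorphic = pic([1.0])              # a single allele carries no information
```

    A per-marker average such as the 0.47 reported for SSR would then be the mean of this value over loci, again assuming this definition was the one used.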

  2. Discriminative Motif Discovery via Simulated Evolution and Random Under-Sampling

    OpenAIRE

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the sta...

  3. Efficacy of the core DNA barcodes in identifying processed and poorly conserved plant materials commonly used in South African traditional medicine

    Directory of Open Access Journals (Sweden)

    Ledile Mankga

    2013-12-01

    Medicinal plants cover a broad range of taxa, which may be phylogenetically less related but morphologically very similar. Such morphological similarity between species may lead to misidentification and inappropriate use. Also the substitution of a medicinal plant by a cheaper alternative (e.g. other non-medicinal plant species), either due to misidentification, or deliberately to cheat consumers, is an issue of growing concern. In this study, we used DNA barcoding to identify commonly used medicinal plants in South Africa. Using the core plant barcodes, matK and rbcLa, obtained from processed and poorly conserved materials sold at the muthi traditional medicine market, we tested the efficacy of the barcodes in species discrimination. Based on genetic divergence, PCR amplification efficiency and the BLAST algorithm, we revealed varied discriminatory potentials for the DNA barcodes. In general, the barcodes exhibited high discriminatory power, indicating their effectiveness in verifying the identity of the most common plant species traded in South African medicinal markets. The BLAST algorithm successfully matched 61% of the queries against a reference database, suggesting that most of the information supplied by sellers at traditional medicinal markets in South Africa is correct. Our findings reinforce the utility of the DNA barcoding technique in limiting false identification that can harm public health.

  4. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2018-03-11

    Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core and the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.

  5. The Verrucomicrobia LexA-binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

    Directory of Open Access Journals (Sweden)

    Ivan Erill

    2016-07-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  6. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  7. Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes

    Directory of Open Access Journals (Sweden)

    Kistler Corby

    2010-03-01

    Background Fusarium graminearum (Fg), a major fungal pathogen of cultivated cereals, is responsible for billions of dollars in agricultural losses. There is a growing interest in understanding the transcriptional regulation of this organism, especially the regulation of genes underlying its pathogenicity. The generation of whole genome sequence assemblies for Fg and three closely related Fusarium species provides a unique opportunity for such a study. Results Applying comparative genomics approaches, we developed a computational pipeline to systematically discover evolutionarily conserved regulatory motifs in the promoter, downstream and intronic regions of Fg genes, based on the multiple alignments of sequenced Fusarium genomes. Using this method, we discovered 73 candidate regulatory motifs in the promoter regions. Nearly 30% of these motifs are highly enriched in promoter regions of Fg genes that are associated with a specific functional category. Through comparison to Saccharomyces cerevisiae (Sc) and Schizosaccharomyces pombe (Sp), we observed conservation of transcription factors (TFs), their binding sites and the target genes regulated by these TFs related to pathways known to respond to stress conditions or phosphate metabolism. In addition, this study revealed 69 and 39 conserved motifs in the downstream regions and the intronic regions, respectively, of Fg genes. The top intronic motif is the splice donor site. For the downstream regions, we noticed an intriguing absence of the mammalian and Sc poly-adenylation signals among the list of conserved motifs. Conclusion This study provides the first comprehensive list of candidate regulatory motifs in Fg, and underscores the power of comparative genomics in revealing functional elements among related genomes. The conservation of regulatory pathways among the Fusarium genomes and the two yeast species reveals their functional significance, and provides new insights in their

  8. Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember [Coffee and Cocoa in the Creation of Distinctive Jember Batik Motifs]

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2015-06-01

    ABSTRACT Jember batik has long been identified with the tobacco-leaf motif. The visual rendering of the tobacco leaf in Jember batik motifs is rather weak: it lacks character, because the resulting motif looks like a drawing of leaves in general. It is therefore necessary to create a distinctive Jember batik motif design whose inspiration is drawn from other natural resources of Jember that have specific, characteristic shapes, so that a stronger motif identity can be obtained. These distinctive natural products of Jember are coffee and cocoa. The purpose of this art creation is to produce new batik motifs with a distinctive Jember character. The method comprised data collection, in-depth observation of the objects of creation, a review of the sources of inspiration, motif design, and realization as batik. This work produced six batik motifs: (1) Motif Uwoh Kopi; (2) Motif Godong Kopi; (3) Motif Ceplok Kakao; (4) Motif Kakao Raja; (5) Motif Kakao Biru; and (6) Motif Wiji Mukti. According to an "aesthetic taste" assessment, the most widely liked motifs were Motif Uwoh Kopi and Motif Kakao Raja. Keywords: Motif Woh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti

  9. Translational Control of Host Gene Expression by a Cys-Motif Protein Encoded in a Bracovirus.

    Directory of Open Access Journals (Sweden)

    Eunseong Kim

    Translational control is a strategy that various viruses use to manipulate their hosts to suppress acute antiviral response. Polydnaviruses, a group of insect double-stranded DNA viruses symbiotic to some endoparasitoid wasps, are divided into two genera: ichnovirus (IV) and bracovirus (BV). In IV, some Cys-motif genes are known as host translation-inhibitory factors (HTIF). The genome of the endoparasitoid wasp Cotesia plutellae contains a Cys-motif gene (Cp-TSP13) homologous to an HTIF known as teratocyte-secretory protein 14 (TSP14) of Microplitis croceipes. Cp-TSP13 consists of 129 amino acid residues with a predicted molecular weight of 13.987 kDa and pI value of 7.928. The genomic DNA region encoding its open reading frame has three introns. Cp-TSP13 possesses six conserved cysteine residues, like other Cys-motif genes functioning as HTIF. Cp-TSP13 was expressed in Plutella xylostella larvae parasitized by C. plutellae. C. plutellae bracovirus (CpBV) was purified and injected into non-parasitized P. xylostella that expressed Cp-TSP13. Cp-TSP13 was cloned into a eukaryotic expression vector and used to infect Sf9 cells to transiently express Cp-TSP13. The synthesized Cp-TSP13 protein was detected in culture broth. An overlaying experiment showed that the purified Cp-TSP13 entered hemocytes. It was localized in the cytosol. Recombinant Cp-TSP13 significantly inhibited protein synthesis of secretory proteins when it was added to in vitro cultured fat body. In addition, the recombinant Cp-TSP13 directly inhibited the translation of fat body mRNAs in an in vitro translation assay using rabbit reticulocyte lysate. Moreover, the recombinant Cp-TSP13 significantly suppressed cellular immune responses by inhibiting hemocyte-spreading behavior. It also exhibited significant insecticidal activities by both injection and feeding routes. These results indicate that Cp-TSP13 is a viral HTIF.

  10. DNA-based identification reveals illegal trade of threatened shark species in a global elasmobranch conservation hotspot.

    Science.gov (United States)

    Feitosa, Leonardo Manir; Martins, Ana Paula Barbosa; Giarrizzo, Tommaso; Macedo, Wagner; Monteiro, Iann Leonardo; Gemaque, Romário; Nunes, Jorge Luiz Silva; Gomes, Fernanda; Schneider, Horácio; Sampaio, Iracilda; Souza, Rosália; Sales, João Bráullio; Rodrigues-Filho, Luís Fernando; Tchaicka, Lígia; Carvalho-Costa, Luís Fernando

    2018-02-20

    Here, we report trading of endangered shark species in a world hotspot for elasmobranch conservation in Brazil. Data on shark fisheries are scarce in Brazil, although the northern and northeastern regions have the highest indices of shark bycatch. Harvest is made primarily with processed carcasses lacking head and fins, which hampers reliable species identification and law enforcement on illegal catches. We used partial sequences of two mitochondrial genes (COI and/or NADH2) to identify 17 shark species from 427 samples being harvested and marketed on the northern coast of Brazil. Nine species (53%) are listed under some extinction threat category according to Brazilian law and international authorities (IUCN - International Union for Conservation of Nature; CITES - Convention on International Trade of Endangered Species of Wild Fauna and Flora). The number increases to 13 (76%) if we also consider the Near Threatened category. Hammerhead sharks are under threat worldwide, and composed 18.7% of samples, with Sphyrna mokarran being the fourth most common species among samples. As illegal trade of threatened shark species is a worldwide conservation problem, molecular identification of processed meat or specimens lacking diagnostic body parts is a highly effective tool for species identification and law enforcement.

  11. The Helicobacter pylori HpyAXII restriction–modification system limits exogenous DNA uptake by targeting GTAC sites but shows asymmetric conservation of the DNA methyltransferase and restriction endonuclease components

    Science.gov (United States)

    Humbert, Olivier; Salama, Nina R.

    2008-01-01

    The naturally competent organism Helicobacter pylori encodes a large number of restriction–modification (R–M) systems that consist of a restriction endonuclease and a DNA methyltransferase. R–M systems are not only believed to limit DNA exchange among bacteria but may also have other cellular functions. We report a previously uncharacterized H. pylori type II R–M system, M.HpyAXII/R.HpyAXII. We show that this system targets GTAC sites, which are rare in the H. pylori chromosome but numerous in ribosomal RNA genes. As predicted, this type II R–M system showed attributes of a selfish element. Deletion of the methyltransferase M.HpyAXII is lethal when associated with an active endonuclease R.HpyAXII unless compensated by adaptive mutation or gene amplification. R.HpyAXII effectively restricted both unmethylated plasmid and chromosomal DNA during natural transformation and was predicted to belong to the novel ‘half pipe’ structural family of endonucleases. Analysis of a panel of clinical isolates revealed that R.HpyAXII was functional in a small number of H. pylori strains (18.9%, n = 37), whereas the activity of M.HpyAXII was highly conserved (92%, n = 50), suggesting that GTAC methylation confers a selective advantage to H. pylori. However, M.HpyAXII activity did not enhance H. pylori fitness during stomach colonization of a mouse infection model. PMID:18978016

  12. MUTATION ON WD DIPEPTIDE MOTIFS OF THE p48 SUBUNIT OF CHROMATIN ASSEMBLY FACTOR-1 CAUSING VIABILITY AND GROWTH OF DT40 CHICKEN B CELL LINE

    Directory of Open Access Journals (Sweden)

    Ahyar Ahmad

    2010-07-01

    Chromatin assembly factor-1 (CAF-1), a protein complex consisting of three subunits, p150, p60, and p48, is highly conserved from yeast to humans and facilitates nucleosome assembly of newly replicated DNA. The p48 subunit, CAF-1p48 (p48), with seven WD (Trp-Asp) repeat motifs, is a member of the WD protein family. The immunoprecipitation experiment revealed that the β-propeller structure of p48 was less stringent for its binding to HDAC-1, but more stringent for its binding to both histone H4 and CAF-1p60 but not to ASF-1, indicating that the proper β-propeller structure of p48 is essential for binding to these two proteins, histone H4 and CAF-1p60. Complementation experiments, involving missense and truncated mutants of FLAG-tagged p48, revealed that mutations of each of the seven WD dipeptide motifs, like both the N-terminal and C-terminal truncated mutations, could not rescue the tet-induced lethality. These results indicate not only that p48 is essential for the viability of vertebrate cells, although the yeast p48 homolog is nonessential, but also that all seven WD dipeptide motifs are necessary for the maintenance of the proper structure of p48, which is fundamentally important for cell viability. Keywords: Chromatin assembly factor-1, complementation experiments, viability

  13. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Simply comparing the motif count p-values in each sequence is not sufficient to decide whether this motif is significantly more exceptional in one sequence than in the other. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly suited to small counts. For large counts, we advise using the likelihood ratio test, which is asymptotic but strongly correlated with the exact binomial test and very simple to use.
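
    The two kinds of test named in this abstract can be illustrated with a minimal sketch. It assumes the simplest setting only: non-overlapping occurrences, so that the counts in the two sequences are independent Poisson variables, and expected counts `e1` and `e2` supplied by some background model. The function names and the two-sided p-value convention are illustrative choices, not the authors' implementation, and the correction for overlapping motifs mentioned in the abstract is not reproduced.

```python
from math import comb, erfc, log, sqrt


def exact_binomial_pvalue(n1, n2, e1, e2):
    """Two-sided exact test of equal exceptionality.

    Under the null hypothesis the two Poisson intensities are proportional
    to the expected counts e1 and e2, so conditional on n = n1 + n2 the
    count n1 follows Binomial(n, e1 / (e1 + e2)).
    """
    n = n1 + n2
    p = e1 / (e1 + e2)
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    observed = pmf[n1]
    # Sum the probabilities of all outcomes at most as likely as the observed one.
    return min(1.0, sum(q for q in pmf if q <= observed * (1 + 1e-9)))


def likelihood_ratio_pvalue(n1, n2, e1, e2):
    """Asymptotic likelihood ratio test (chi-square, 1 degree of freedom)."""
    n = n1 + n2
    p = e1 / (e1 + e2)
    fitted = (n * p, n * (1 - p))

    def term(obs, fit):
        return obs * log(obs / fit) if obs > 0 else 0.0

    g = 2.0 * (term(n1, fitted[0]) + term(n2, fitted[1]))
    # Survival function of a chi-square variable with 1 degree of freedom.
    return erfc(sqrt(max(g, 0.0) / 2.0))


if __name__ == "__main__":
    # Hypothetical counts: 12 occurrences where 5 are expected,
    # versus 6 occurrences where 5 are expected.
    print(exact_binomial_pvalue(12, 6, 5.0, 5.0))
    print(likelihood_ratio_pvalue(12, 6, 5.0, 5.0))
```

    The exact version enumerates the whole binomial distribution, which is cheap for small counts; for large counts the chi-square approximation avoids that enumeration, in line with the abstract's recommendation.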

  14. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease.

    Science.gov (United States)

    Anders, Carolin; Niewoehner, Ole; Duerst, Alessia; Jinek, Martin

    2014-09-25

    The CRISPR-associated protein Cas9 is an RNA-guided endonuclease that cleaves double-stranded DNA bearing sequences complementary to a 20-nucleotide segment in the guide RNA. Cas9 has emerged as a versatile molecular tool for genome editing and gene expression control. RNA-guided DNA recognition and cleavage strictly require the presence of a protospacer adjacent motif (PAM) in the target DNA. Here we report a crystal structure of Streptococcus pyogenes Cas9 in complex with a single-molecule guide RNA and a target DNA containing a canonical 5'-NGG-3' PAM. The structure reveals that the PAM motif resides in a base-paired DNA duplex. The non-complementary strand GG dinucleotide is read out via major-groove interactions with conserved arginine residues from the carboxy-terminal domain of Cas9. Interactions with the minor groove of the PAM duplex and the phosphodiester group at the +1 position in the target DNA strand contribute to local strand separation immediately upstream of the PAM. These observations suggest a mechanism for PAM-dependent target DNA melting and RNA-DNA hybrid formation. Furthermore, this study establishes a framework for the rational engineering of Cas9 enzymes with novel PAM specificities.
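
    As a small illustration of the sequence constraint described above (a 20-nucleotide target segment followed by a 5'-NGG-3' PAM), the following sketch lists candidate sites on one strand of a DNA string. The function name `find_ngg_sites` and its return format are hypothetical; the sketch ignores the reverse strand, mismatches and everything structural that the paper actually studies.

```python
import re


def find_ngg_sites(seq, guide_len=20):
    """Return (start, protospacer, pam) for every window on the given strand
    where guide_len bases are immediately followed by an NGG trinucleotide."""
    seq = seq.upper()
    # A lookahead lets overlapping candidate sites be reported as well.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % guide_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    example = "ACGT" * 5 + "AGGCC"  # made-up sequence, not from the paper
    for start, protospacer, pam in find_ngg_sites(example):
        print(start, protospacer, pam)
```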

  15. Nucleotide-mimetic synthetic ligands for DNA-recognizing enzymes: One-step purification of Pfu DNA polymerase.

    Science.gov (United States)

    Melissis, S; Labrou, N E; Clonis, Y D

    2006-07-28

    The commercial availability of DNA polymerases has revolutionized molecular biotechnology and certain sectors of the bio-industry. Therefore, the development of affinity adsorbents for purification of DNA polymerases is of academic interest and practical importance. In the present study we describe the design, synthesis and evaluation of a combinatorial library of novel affinity ligands for the purification of DNA polymerases (Pols). Pyrococcus furiosus DNA polymerase (Pfu Pol) was employed as a proof-of-principle example. Affinity ligand design was based on mimicking the natural interactions between deoxynucleoside-triphosphates (dNTPs) and the B-motif, a conserved structural moiety found in the Pol-I and Pol-II families of enzymes. Solid-phase 'structure-guided' combinatorial chemistry was used to construct a library of 26 variants of the B-motif-binding 'lead' ligand X-Trz-Y (X is a purine derivative and Y is an aliphatic/aromatic sulphonate or phosphonate derivative) using 1,3,5-triazine (Trz) as the scaffold for assembly. The 'lead' ligand showed complementarity against a Lys and a Tyr residue of the polymerase B-motif. The ligand library was screened for its ability to bind and purify Pfu Pol from Escherichia coli extract. One immobilized ligand (oABSAd), bearing 9-aminoethyladenine (AEAd) and sulfanilic acid (oABS) linked on the triazine scaffold, displayed the highest purifying ability and binding capacity (0.55 mg Pfu Pol/g wet gel). Adsorption equilibrium studies with this affinity ligand and Pfu Pol determined a dissociation constant (K(D)) of 83 nM for the respective complex. The oABSAd affinity adsorbent was exploited in the development of a facile Pfu Pol purification protocol, affording homogeneous enzyme (>99% purity) in a single chromatography step. Quality control tests showed that Pfu Pol purified on the B-motif-complementing ligand is free of nucleic acids and contaminating nuclease activities and is therefore suitable for experimental use.

  16. Distribution of CpG Motifs in Upstream Gene Domains in a Reef Coral and Sea Anemone: Implications for Epigenetics in Cnidarians.

    Directory of Open Access Journals (Sweden)

    Adam G Marsh

    Coral reefs are under assault from stressors including global warming, ocean acidification, and urbanization. Knowing how these factors impact the future fate of reefs requires delineating stress responses across ecological, organismal and cellular scales. Recent advances in coral reef biology have integrated molecular processes with ecological fitness and have identified putative suites of temperature acclimation genes in a Scleractinian coral Acropora hyacinthus. We wondered what unique characteristics of these genes determined their coordinate expression in response to temperature acclimation, and whether or not other corals and cnidarians would likewise possess these features. Here, we focus on cytosine methylation as an epigenetic DNA modification that is responsive to environmental stressors. We identify common conserved patterns of cytosine-guanosine dinucleotide (CpG) motif frequencies in upstream promoter domains of different functional gene groups in two cnidarian genomes: a coral (Acropora digitifera) and an anemone (Nematostella vectensis). Our analyses show that CpG motif frequencies are prominent in the promoter domains of functional genes associated with environmental adaptation, particularly those identified in A. hyacinthus. Densities of CpG sites in upstream promoter domains near the transcriptional start site (TSS) are 1.38x higher than genomic background levels upstream of -2000 bp from the TSS. The increase in CpG usage suggests selection to allow for DNA methylation events to occur more frequently within 1 kb of the TSS. In addition, observed shifts in CpG densities among functional groups of genes suggest a potential role for epigenetic DNA methylation within promoter domains to impact functional gene expression responses in A. digitifera and N. vectensis. Identifying promoter epigenetic sequence motifs among genes within specific functional groups establishes an approach to describe integrated cellular responses to

  17. Distribution of CpG Motifs in Upstream Gene Domains in a Reef Coral and Sea Anemone: Implications for Epigenetics in Cnidarians.

    Science.gov (United States)

    Marsh, Adam G; Hoadley, Kenneth D; Warner, Mark E

    2016-01-01

    Coral reefs are under assault from stressors including global warming, ocean acidification, and urbanization. Knowing how these factors impact the future fate of reefs requires delineating stress responses across ecological, organismal and cellular scales. Recent advances in coral reef biology have integrated molecular processes with ecological fitness and have identified putative suites of temperature acclimation genes in a Scleractinian coral Acropora hyacinthus. We wondered what unique characteristics of these genes determined their coordinate expression in response to temperature acclimation, and whether or not other corals and cnidarians would likewise possess these features. Here, we focus on cytosine methylation as an epigenetic DNA modification that is responsive to environmental stressors. We identify common conserved patterns of cytosine-guanosine dinucleotide (CpG) motif frequencies in upstream promoter domains of different functional gene groups in two cnidarian genomes: a coral (Acropora digitifera) and an anemone (Nematostella vectensis). Our analyses show that CpG motif frequencies are prominent in the promoter domains of functional genes associated with environmental adaptation, particularly those identified in A. hyacinthus. Densities of CpG sites in upstream promoter domains near the transcriptional start site (TSS) are 1.38x higher than genomic background levels upstream of -2000 bp from the TSS. The increase in CpG usage suggests selection to allow for DNA methylation events to occur more frequently within 1 kb of the TSS. In addition, observed shifts in CpG densities among functional groups of genes suggest a potential role for epigenetic DNA methylation within promoter domains to impact functional gene expression responses in A. digitifera and N. vectensis. Identifying promoter epigenetic sequence motifs among genes within specific functional groups establishes an approach to describe integrated cellular responses to environmental stress in
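
    The density comparison reported in this abstract (CpG sites near the TSS relative to a more distal background) can be sketched as a ratio of dinucleotide densities. The helper names and the example sequences below are made up for illustration, and the windowing and normalization used in the study itself are not specified here.

```python
def cpg_density(seq):
    """Number of CG dinucleotides per dinucleotide position in seq."""
    seq = seq.upper()
    if len(seq) < 2:
        return 0.0
    return seq.count("CG") / (len(seq) - 1)


def cpg_enrichment(proximal, background):
    """Ratio of CpG density in a TSS-proximal window to a background window."""
    base = cpg_density(background)
    if base == 0.0:
        raise ValueError("background window contains no CpG sites")
    return cpg_density(proximal) / base


if __name__ == "__main__":
    proximal = "TTACGCGATCGGACGTTA"    # hypothetical near-TSS window
    background = "TTATGCAATCGGATATTA"  # hypothetical distal window
    print(round(cpg_enrichment(proximal, background), 2))
```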

  18. Conservation of the LexA repressor binding site in Deinococcus radiodurans

    Directory of Open Access Journals (Sweden)

    Khan Feroz

    2008-03-01

    The LexA protein is a transcriptional repressor of the bacterial SOS DNA repair system, which comprises a set of DNA repair and cellular survival genes that are induced in response to DNA damage. Its varied DNA binding motifs have been characterized and reported in Escherichia coli, Bacillus subtilis, rhizobia family members, a marine magnetotactic bacterium, Salmonella typhimurium and, recently, Mycobacterium tuberculosis, and this motif information has been used in our theoretical analysis to detect novel LexA-regulated genes in the genome of the radio-resistant Deinococcus radiodurans. This bacterium showed the presence of an SOS-box-like consensus sequence in the upstream sequences of 3166 genes with >60% motif score similarity percentage (MSSP) on both strands. Attempts to identify LexA-binding sites and the composition of the putative SOS regulon in D. radiodurans have been unsuccessful so far. To resolve the problem, we performed theoretical analysis with modifications on a reported data set of genes related to DNA repair (61 genes), stress response (145 genes) and some unusual predicted operons (21 clusters). Expression of some of the predicted SOS-box-regulated operon members was then examined through previously reported microarray data, which confirm the expression of only a single predicted operon, i.e. DRB0143 (AAA superfamily NTPase related to 5-methylcytosine specific restriction enzyme subunit McrB) and DRB0144 (homolog of the McrC subunit of the McrBC restriction modification system). The methodology involved weight matrix construction through the CONSENSUS algorithm using information from the conserved upstream sequences of eight known genes, including dinB, tagC, lexA, recA, uvrB and yneA of B. subtilis and lexA and recA of D. radiodurans, through the phylogenetic footprinting method, and later detection of similar conserved SOS-box-like LexA binding motifs through both the RSAT and PoSSuMsearch programs. The resultant DNA consensus sequence had highly conserved 14 bp SOS
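
    The general idea behind the weight-matrix search described above can be sketched as follows: build a log-odds matrix from a few aligned binding sites, then scan both strands of a sequence and keep windows that score above a fraction of the maximum possible score. This is a generic illustration, not the CONSENSUS, RSAT or PoSSuMsearch procedures; the toy sites, the pseudocount, the uniform background and the reading of the 60% cutoff as a fraction of the maximum score are all assumptions.

```python
from math import log2


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Log-odds position weight matrix from equal-length aligned sites."""
    total = len(sites) + 4 * pseudocount
    pwm = []
    for i in range(len(sites[0])):
        column = [site[i] for site in sites]
        pwm.append({
            base: log2(((column.count(base) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan(pwm, seq, min_fraction=0.6):
    """Return (strand, offset, window, percent_of_max) for windows scoring at
    least min_fraction of the best achievable score. Offsets on the '-' strand
    are relative to the reverse-complemented sequence."""
    width = len(pwm)
    max_score = sum(max(col.values()) for col in pwm)
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for i in range(len(s) - width + 1):
            window = s[i:i + width]
            score = sum(col[base] for col, base in zip(pwm, window))
            if score >= min_fraction * max_score:
                hits.append((strand, i, window, round(100 * score / max_score, 1)))
    return hits


if __name__ == "__main__":
    toy_sites = ["TGTTC", "TGTTC", "TGTAC"]  # made-up sites, for illustration only
    for hit in scan(build_pwm(toy_sites), "AAATGTTCAAA"):
        print(hit)
```

    Expressing the threshold relative to the maximum score makes cutoffs comparable across matrices of different widths, which is one plausible reading of a percentage-based similarity score.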

  19. [Cloning of cDNA for RNA polymerase subunit from the fission yeast Schizosaccharomyces pombe by heterospecific complementation in Saccharomyces cerevisiae].

    Science.gov (United States)

    Shpakovskiĭ, G V; Lebedenko, E N; Thuriaux, P

    1997-02-01

    The rpb10 cDNA of the fission yeast Schizosaccharomyces pombe, encoding one of the five small subunits common to all three nuclear DNA-dependent RNA polymerases, was isolated from an expression cDNA library by two independent approaches: PCR-based screening and direct suppression by means of heterospecific complementation of a temperature-sensitive mutant defective in the corresponding gene of Saccharomyces cerevisiae. The cloned Sz. pombe cDNA encodes a protein, Rpb10, of 71 amino acids with a molecular mass of 8,275 Da, sharing 51 amino acids (71% identity) with the subunit ABC10 beta of RNA polymerases I-III from S. cerevisiae. All eukaryotic members of this protein family have the same general organization, featuring two highly conserved motifs (RCFT/SCGK and RYCCRRM) around an atypical zinc finger and an additional invariant HVDLIEK motif toward the C-terminal end. The last motif is characteristic only of homologs from eukaryotes. In keeping with this remarkable structural conservation, the Sz. pombe cDNA also fully complemented a S. cerevisiae deletion mutant lacking subunit ABC10 beta (null allele rpb10-delta 1::HIS3).

  20. A novel type of DNA-binding protein interacts with a conserved sequence in an early nodulin ENOD12 promoter

    DEFF Research Database (Denmark)

    Christiansen, H; Hansen, A C; Vijn, I

    1996-01-01

    The pea genes PsENOD12A and PsENOD12B are expressed in the root hairs shortly after infection with the nitrogen-fixing bacterium Rhizobium leguminosarum bv. viciae or after application of purified Nod factors. A 199 bp promoter fragment of the PsENOD12B gene contains sufficient information for Nod...... factor-induced tissue-specific expression. We have isolated a Vicia sativa cDNA encoding a 1641 amino acid protein, ENBP1, that interacts with the 199 bp ENOD12 promoter. Two different DNA-binding domains were identified in ENBP1. A domain containing six AT-hooks interacts specifically with an AT...... of the ENBP1 transcript in cells expressing ENOD12 strongly suggest that ENBP1 is a transcription factor involved in the regulation of ENOD12. Finally, the C-terminal region of ENBP1 shows strong homology to a protein from rat that is specifically expressed in testis tissue. Publication date: 1996-Dec...

  1. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....
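
    To make the notion of synchronization in a small motif concrete, the following sketch integrates Kuramoto phase oscillators on an arbitrary adjacency matrix with a plain Euler scheme and reports the usual order parameter r (1 means full phase synchrony). The coupling strength, step size, natural frequencies and the triangle motif are illustrative assumptions; the paper's own fitness measure and network setup are not reproduced.

```python
import cmath
import math
import random


def kuramoto_order(adj, omega, coupling, steps=10000, dt=0.01, seed=0):
    """Integrate d(theta_i)/dt = omega_i + K * sum_j A_ij * sin(theta_j - theta_i)
    and return the final order parameter r = |mean(exp(i * theta))|."""
    rng = random.Random(seed)
    n = len(adj)
    theta = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    for _ in range(steps):
        rates = [
            omega[i] + coupling * sum(
                adj[i][j] * math.sin(theta[j] - theta[i]) for j in range(n)
            )
            for i in range(n)
        ]
        theta = [t + dt * rate for t, rate in zip(theta, rates)]
    return abs(sum(cmath.exp(1j * t) for t in theta)) / n


if __name__ == "__main__":
    triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]  # fully connected 3-node motif
    print(kuramoto_order(triangle, omega=[1.0, 1.0, 1.0], coupling=1.0))
    print(kuramoto_order(triangle, omega=[0.5, 1.0, 1.5], coupling=0.1))
```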

  2. How many species and under what names? Using DNA barcoding and GenBank data for west Central African amphibian conservation.

    Directory of Open Access Journals (Sweden)

    Jessica L Deichmann

    the taxonomy of complex groups. Our methods provide an example of how non-taxonomists and parataxonomists working in understudied parts of the world with limited geographic sampling and comparative morphological material can use DNA barcoding and publicly available sequence data (GenBank) to rapidly identify the number of species and assign tentative names to aid in urgent conservation management actions and contribute to taxonomic resolution.

  3. A unique uracil-DNA binding protein of the uracil DNA glycosylase superfamily.

    Science.gov (United States)

    Sang, Pau Biak; Srinath, Thiruneelakantan; Patil, Aravind Goud; Woo, Eui-Jeon; Varshney, Umesh

    2015-09-30

    Uracil DNA glycosylases (UDGs) are an important group of DNA repair enzymes, which pioneer the base excision repair pathway by recognizing and excising uracil from DNA. Based on two short conserved sequences (motifs A and B), UDGs have been classified into six families. Here we report a novel UDG, UdgX, from Mycobacterium smegmatis and other organisms. UdgX specifically recognizes uracil in DNA, forms a tight complex stable to sodium dodecyl sulphate, 2-mercaptoethanol, urea and heat treatment, and shows no detectable uracil excision. UdgX shares highest homology to family 4 UDGs possessing Fe-S cluster. UdgX possesses a conserved sequence, KRRIH, which forms a flexible loop playing an important role in its activity. Mutations of H in the KRRIH sequence to S, G, A or Q lead to gain of uracil excision activity in MsmUdgX, establishing it as a novel member of the UDG superfamily. Our observations suggest that UdgX marks the uracil-DNA for its repair by a RecA dependent process. Finally, we observed that the tight binding activity of UdgX is useful in detecting uracils in the genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Discriminative motif discovery via simulated evolution and random under-sampling.

    Directory of Open Access Journals (Sweden)

    Tao Song

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  5. Discriminative motif discovery via simulated evolution and random under-sampling.

    Science.gov (United States)

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  6. The ORF59 DNA polymerase processivity factor homologs of Old World primate RV2 rhadinoviruses are highly conserved nuclear antigens expressed in differentiated epithelium in infected macaques

    Directory of Open Access Journals (Sweden)

    Burnside Kellie L

    2009-11-01

    Background: ORF59 DNA polymerase processivity factor of the human rhadinovirus, Kaposi's sarcoma-associated herpesvirus (KSHV), is required for efficient copying of the genome during virus replication. KSHV ORF59 is antigenic in the infected host and is used as a marker for virus activation and replication. Results: We cloned, sequenced and expressed the genes encoding related ORF59 proteins from the RV1 rhadinovirus homologs of KSHV from chimpanzee (PtrRV1) and three species of macaques (RFHVMm, RFHVMn and RFHVMf), and have compared them with ORF59 proteins obtained from members of the more distantly-related RV2 rhadinovirus lineage infecting the same non-human primate species (PtrRV2, RRV, MneRV2, and MfaRV2, respectively). We found that ORF59 homologs of the RV1 and RV2 Old World primate rhadinoviruses are highly conserved with distinct phylogenetic clustering of the two rhadinovirus lineages. RV1 and RV2 ORF59 C-terminal domains exhibit a strong lineage-specific conservation. Rabbit antiserum was developed against a C-terminal polypeptide that is highly conserved between the macaque RV2 ORF59 sequences. This antiserum showed strong reactivity towards ORF59 encoded by the macaque RV2 rhadinoviruses, RRV (rhesus) and MneRV2 (pig-tail), with no cross reaction to human or macaque RV1 ORF59 proteins. Using this antiserum and RT-qPCR, we determined that RRV ORF59 is expressed early after permissive infection of both rhesus primary fetal fibroblasts and African green monkey kidney epithelial cells (Vero) in vitro. RRV- and MneRV2-infected foci showed strong nuclear expression of ORF59 that correlated with production of infectious progeny virus. Immunohistochemical studies of an MneRV2-infected macaque revealed strong nuclear expression of ORF59 in infected cells within the differentiating layer of epidermis, corroborating previous observations that differentiated epithelial cells are permissive for replication of KSHV-like rhadinoviruses

  7. DNA nanotechnology

    Science.gov (United States)

    Seeman, Nadrian C.; Sleiman, Hanadi F.

    2018-01-01

    DNA is the molecule that stores and transmits genetic information in biological systems. The field of DNA nanotechnology takes this molecule out of its biological context and uses its information to assemble structural motifs and then to connect them together. This field has had a remarkable impact on nanoscience and nanotechnology, and has been revolutionary in our ability to control molecular self-assembly. In this Review, we summarize the approaches used to assemble DNA nanostructures and examine their emerging applications in areas such as biophysics, diagnostics, nanoparticle and protein assembly, biomolecule structure determination, drug delivery and synthetic biology. The introduction of orthogonal interactions into DNA nanostructures is discussed, and finally, a perspective on the future directions of this field is presented.

  8. A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

    Directory of Open Access Journals (Sweden)

    Asita Elengoe

    2015-01-01

    Currently, the protein interaction of the Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with the p53 motif remains to be elucidated. The NBD-p53 motif complex enhances p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using the STRING version 9.1 program. Then, we modeled the three-dimensional structure of the p53 motif through homology modeling and determined the binding affinity and stability of the NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. The human DNA binding domain of the p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 kcal/mol and −9.90 kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bond, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with the p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure-based drugs in cancer therapy.

  9. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2014-02-20

    ChIP-Seq (chromatin immunoprecipitation sequencing) facilitates motif finding because ChIP-Seq experiments narrow the search to binding site locations. Recent motif finding tools facilitate motif detection by providing user-friendly Web interfaces. In this work, we reviewed nine motif finding Web tools that are capable of detecting binding site motifs in ChIP-Seq data. We showed that each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommend that users apply multiple motif finding Web tools that implement different algorithms to obtain significant motifs, overlapping similar motifs, and non-overlapping motifs. Finally, we provide suggestions for the future development of motif finding Web tools that better assist researchers in finding motifs in ChIP-Seq data.

  10. Mcm10 regulates DNA replication elongation by stimulating the CMG replicative helicase.

    Science.gov (United States)

    Lõoke, Marko; Maloney, Michael F; Bell, Stephen P

    2017-02-01

    Activation of the Mcm2-7 replicative DNA helicase is the committed step in eukaryotic DNA replication initiation. Although Mcm2-7 activation requires binding of the helicase-activating proteins Cdc45 and GINS (forming the CMG complex), an additional protein, Mcm10, drives initial origin DNA unwinding by an unknown mechanism. We show that Mcm10 binds a conserved motif located between the oligonucleotide/oligosaccharide fold (OB-fold) and A subdomain of Mcm2. Although buried in the interface between these domains in Mcm2-7 structures, mutations predicted to separate the domains and expose this motif restore growth to conditional-lethal MCM10 mutant cells. We found that, in addition to stimulating initial DNA unwinding, Mcm10 stabilizes Cdc45 and GINS association with Mcm2-7 and stimulates replication elongation in vivo and in vitro. Furthermore, we identified a lethal allele of MCM10 that stimulates initial DNA unwinding but is defective in replication elongation and CMG binding. Our findings expand the roles of Mcm10 during DNA replication and suggest a new model for Mcm10 function as an activator of the CMG complex throughout DNA replication. © 2017 Lõoke et al.; Published by Cold Spring Harbor Laboratory Press.

  11. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2011-01-01

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network.

  12. Motif discovery in ranked lists of sequences

    DEFF Research Database (Denmark)

    Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

    2016-01-01

    Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery.

  13. iFORM: Incorporating Find Occurrence of Regulatory Motifs.

    Science.gov (United States)

    Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

    2016-01-01

    Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.
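The abstract above mentions integrating several programs with Fisher's combined probability test. As a rough illustration of that test only (not of iFORM's actual code, which is not described here), the sketch below combines independent p-values with the statistic X = -2 Σ ln p_i, which follows a chi-square distribution with 2k degrees of freedom under the null hypothesis. The function name `fisher_combined_pvalue` is hypothetical, and the closed-form tail probability used here holds because the degrees of freedom are even.

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent p-values with Fisher's method.

    Hypothetical helper for illustration; assumes every p-value lies in (0, 1].
    Returns (statistic, combined_p).
    """
    k = len(pvalues)
    if k == 0:
        raise ValueError("at least one p-value is required")
    if any(p <= 0.0 or p > 1.0 for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    # Fisher's statistic: X = -2 * sum(ln p_i), chi-square with 2k d.o.f.
    statistic = -2.0 * sum(math.log(p) for p in pvalues)

    # For even degrees of freedom 2k the chi-square survival function is
    # exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    half = statistic / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return statistic, math.exp(-half) * total


if __name__ == "__main__":
    # Five made-up p-values, e.g. one per motif-scanning program.
    stat, p = fisher_combined_pvalue([0.04, 0.10, 0.02, 0.30, 0.07])
    print(f"X = {stat:.3f}, combined p = {p:.5f}")
```

With a single p-value the function returns that p-value unchanged, which is a quick sanity check on the formula. The method assumes the individual tests are independent, which may not hold when several programs score the same site.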

  14. WildSpan: mining structured motifs from protein sequences

    Directory of Open Access Journals (Sweden)

    Chen Chien-Yu

    2011-03-01

    Background: Automatic extraction of motifs from biological sequences is an important research problem in the study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards) are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions) that incorporates several pruning strategies to largely reduce the mining cost. Results: WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode

  15. MotifNet: a web-server for network motif analysis.

    Science.gov (United States)

    Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

    2017-06-15

    Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. Contact: estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved.
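To make the notion of a network motif in the abstract above concrete, here is a minimal sketch that counts one classic three-node pattern, the feed-forward loop (edges a→b, b→c and a→c), in a small directed graph. This is not MotifNet's algorithm; the function name `count_feed_forward_loops` and the toy edge list are assumptions for illustration, and the brute-force enumeration is only practical for small graphs.

```python
from itertools import permutations


def count_feed_forward_loops(edges):
    """Count feed-forward loops (a->b, b->c, a->c) in a directed graph.

    Hypothetical helper for illustration. `edges` is an iterable of
    (source, target) pairs. Each assignment of distinct nodes to the roles
    (a, b, c) is counted once; occurrences need not be induced subgraphs.
    """
    edge_set = set(edges)
    nodes = {node for edge in edge_set for node in edge}
    count = 0
    for a, b, c in permutations(nodes, 3):
        if (a, b) in edge_set and (b, c) in edge_set and (a, c) in edge_set:
            count += 1
    return count


if __name__ == "__main__":
    # Toy regulatory network (made-up labels).
    toy_edges = [("X", "Y"), ("Y", "Z"), ("X", "Z"), ("Z", "W"), ("X", "W")]
    print(count_feed_forward_loops(toy_edges))
```

Deciding whether a pattern is a motif in the sense of the abstract would further require comparing this count with counts from randomized networks, a step this sketch leaves out.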

  16. Molecular forensics in avian conservation: a DNA-based approach for identifying mammalian predators of ground-nesting birds and eggs.

    Science.gov (United States)

    Hopken, Matthew W; Orning, Elizabeth K; Young, Julie K; Piaggio, Antoinette J

    2016-01-07

    The greater sage-grouse (Centrocercus urophasianus) is a ground-nesting bird from the Northern Rocky Mountains and a species at risk of extinction in multiple U.S. states and Canada. Herein we report results from a proof of concept that mitochondrial and nuclear DNAs from mammalian predator saliva could be non-invasively collected from depredated greater sage-grouse eggshells and carcasses and used for predator species identification. Molecular forensic approaches have been applied to identify predators from depredated remains as one strategy to better understand predator-prey dynamics and guide management strategies. This can aid conservation efforts by correctly identifying predators most likely to impact threatened and endangered species. DNA isolated from non-invasive samples around nesting sites (e.g. fecal or hair samples) is one method that can increase the success and accuracy of predator species identification when compared to relying on nest remains alone. Predator saliva DNA was collected from depredated eggshells and carcasses using swabs. We sequenced two partial fragments of two mitochondrial genes and obtained microsatellite genotypes using canid specific primers for species and individual identification, respectively. Using this multilocus approach we were able to identify predators, at least down to family, from 11 out of 14 nests (79%) and three out of seven carcasses (47%). Predators detected most frequently were canids (86%), while other taxa included rodents, a striped skunk, and cattle. We attempted to match the genotypes of individual coyotes obtained from eggshells and carcasses with those obtained from fecal samples and coyotes collected in the areas, but no genotype matches were found. Predation is a main cause of nest failure in ground-nesting birds and can impact reproduction and recruitment. To inform predator management for ground-nesting bird conservation, accurate identification of predator species is necessary. Considering

  17. Molecular features of the complementarity determining region 3 motif of the T cell population and subsets in the blood of patients with chronic severe hepatitis B

    Directory of Open Access Journals (Sweden)

    Yang Jiezuan

    2011-12-01

    Background: T cell receptor (TCR) reflects the status and function of T cells. We previously developed a gene melting spectral pattern (GMSP) assay, which rapidly detects clonal expansion of the T cell receptor β variable gene (TCRBV) in patients with HBV by using quantitative real-time reverse transcription PCR (qRT-PCR) with DNA melting curve analysis. However, the molecular profiles of TCRBV in peripheral blood mononuclear cells (PBMCs) and CD8+, CD8- cell subsets from chronic severe hepatitis B (CSHB) patients have not been well described. Methods: Human PBMCs were separated and sorted into CD8+ and CD8- cell subsets using density gradient centrifugation and magnetic activated cell sorting (MACS). The molecular features of the TCRBV CDR3 motif were determined using GMSP analysis; the TCRBV families were cloned and sequenced when the GMSP profile showed a single peak, indicative of a monoclonal population. Results: The number of skewed TCRBV in the CD8+ cell subset was significantly higher than that of the CD8- cell subset as assessed by GMSP analysis. The TCRBV11 and BV7 were expressed more frequently than other members of the TCRBV family in PBMCs and CD8+, CD8- subsets. Also, relatively conserved amino acid motifs were detected in the TCRBV22, BV18 and BV11 CDR3 in PBMCs among patients with CSHB. Conclusions: The molecular features of the TCRBV CDR3 were markedly different among PBMCs and CD8+, CD8- cell subsets derived from CSHB patients. Analysis of the TCRBV expression in the CD8+ subset was more accurate in assessing the status and function of circulating T cells. The expression of TCRBV11, BV7 and the relatively conserved CDR3 amino acid motifs could also help to predict and treat patients with CSHB.

  18. Violation of an evolutionarily conserved immunoglobulin diversity gene sequence preference promotes production of dsDNA-specific IgG antibodies.

    Directory of Open Access Journals (Sweden)

    Aaron Silva-Sanchez

    Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3), which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH) gene segment sequence content by reading frame (RF) is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1), which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies.

  19. The NTP-binding motif in cowpea mosaic virus B polyprotein is essential for viral replication

    NARCIS (Netherlands)

    Peters, S A; Verver, J; Nollen, E A; van Lent, J W; Wellink, J; van Kammen, A

    1994-01-01

    We have assessed the functional importance of the NTP-binding motif (NTBM) in the cowpea mosaic virus (CPMV) B-RNA-encoded 58K domain by changing two conserved amino acids within the consensus A and B sites (GKSRTGK500S and MDD545, respectively). Both Lys-500 to Thr and Asp-545 to Pro substitutions

  20. The WSXWS motif in cytokine receptors is a molecular switch involved in receptor activation

    DEFF Research Database (Denmark)

    Dagil, Robert; Knudsen, Maiken J.; Olsen, Johan Gotthardt

    2012-01-01

    The prolactin receptor (PRLR) is activated by binding of prolactin in a 2:1 complex, but the activation mechanism is poorly understood. PRLR has a conserved WSXWS motif generic to cytokine class I receptors. We have determined the nuclear magnetic resonance solution structure of the membrane...

  1. Efficient sequential and parallel algorithms for planted motif search.

    Science.gov (United States)

    Nicolae, Marius; Rajasekaran, Sanguthevar

    2014-01-31

    Motif searching is an important step in the detection of rare events occurring in a set of DNA or protein sequences. One formulation of the problem is known as (l,d)-motif search or Planted Motif Search (PMS). In PMS we are given two integers l and d and n biological sequences. We want to find all sequences of length l that appear in each of the input sequences with at most d mismatches. The PMS problem is NP-complete. PMS algorithms are typically evaluated on certain instances considered challenging. Despite ample research in the area, a considerable performance gap exists because many state of the art algorithms have large runtimes even for moderately challenging instances. This paper presents a fast exact parallel PMS algorithm called PMS8. PMS8 is the first algorithm to solve the challenging (l,d) instances (25,10) and (26,11). PMS8 is also efficient on instances with larger l and d such as (50,21). We include a comparison of PMS8 with several state of the art algorithms on multiple problem instances. This paper also presents necessary and sufficient conditions for 3 l-mers to have a common d-neighbor. The program is freely available at http://engr.uconn.edu/~man09004/PMS8/. We present PMS8, an efficient exact algorithm for Planted Motif Search. PMS8 introduces novel ideas for generating common neighborhoods. We have also implemented a parallel version for this algorithm. PMS8 can solve instances not solved by any previous algorithms.
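The (l,d)-motif definition in the abstract above can be illustrated with a brute-force search. The sketch below is not PMS8 and does not use its neighborhood-generation ideas; it simply enumerates every string within Hamming distance d of some l-mer in the first sequence and keeps those that occur, with at most d mismatches, in all remaining sequences. The function names and the DNA alphabet constant are assumptions for illustration, and the running time grows quickly with l and d.

```python
from itertools import combinations, product

ALPHABET = "ACGT"  # assumed DNA alphabet for this sketch


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def neighbors(lmer, d):
    """All strings over ALPHABET within Hamming distance d of lmer."""
    result = set()
    for k in range(d + 1):
        for positions in combinations(range(len(lmer)), k):
            for letters in product(ALPHABET, repeat=k):
                candidate = list(lmer)
                for pos, letter in zip(positions, letters):
                    candidate[pos] = letter
                result.add("".join(candidate))
    return result


def occurs_within(motif, seq, d):
    """True if some l-mer of seq is within distance d of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def planted_motif_search(seqs, l, d):
    """Brute-force (l, d)-motif search; returns a sorted list of motifs."""
    first, rest = seqs[0], seqs[1:]
    candidates = set()
    # Any valid motif must be a d-neighbor of some l-mer of the first sequence.
    for i in range(len(first) - l + 1):
        candidates |= neighbors(first[i:i + l], d)
    return sorted(m for m in candidates
                  if all(occurs_within(m, s, d) for s in rest))


if __name__ == "__main__":
    toy = ["ACGTACGT", "TTACGTTT", "GGGACGAG"]  # made-up sequences
    print(planted_motif_search(toy, l=4, d=1))
```

Restricting candidates to neighbors of l-mers from one sequence is a standard pruning step, since a motif absent from that neighborhood cannot occur in the first sequence within d mismatches.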

  2. Short sequence motifs, overrepresented in mammalian conserved non-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNA sequences of multicellular eukaryotes is under selective constraint. In particular, ~5 percent of the human genome consists of conserved non-coding sequences (CNSs). CNSs differ from other genomic sequences in their nucleotide composition and must play important functional roles, which mostly remain obscure. Results: We investigated relative abundances of short sequence motifs in all human CNSs present in the human/mouse whole-genome alignments vs. three background sets of sequences: (i) weakly conserved or unconserved non-coding sequences (non-CNSs); (ii) near-promoter sequences (located between nucleotides -500 and -1500, relative to a start of transcription); and (iii) random sequences with the same nucleotide composition as that of CNSs. When compared to non-CNSs and near-promoter sequences, CNSs possess an excess of AT-rich motifs, often containing runs of identical nucleotides. In contrast, when compared to random sequences, CNSs contain an excess of GC-rich motifs which, however, lack CpG dinucleotides. Thus, abundance of short sequence motifs in human CNSs, taken as a whole, is mostly determined by their overall compositional properties and not by overrepresentation of any specific short motifs. These properties are: (i) high AT-content of CNSs, (ii) a tendency, probably due to context-dependent mutation, of A's and T's to clump, (iii) presence of short GC-rich regions, and (iv) avoidance of CpG contexts, due to their hypermutability. Only a small number of short motifs, overrepresented in all human CNSs are similar to binding sites of transcription factors from the FOX family. Conclusion: Human CNSs as a whole appear to be too broad a class of sequences to possess strong footprints of any short sequence-specific functions. Such footprints should be studied at the level of functional subclasses of CNSs, such as those which flank genes with a particular pattern of expression. Overall properties of CNSs are affected by

  3. Modulation of i-motif thermodynamic stability by the introduction of UNA (unlocked nucleic acid) monomers

    DEFF Research Database (Denmark)

    Pasternak, Anna; Wengel, Jesper

    2011-01-01

    The influence of acyclic RNA derivatives, UNA (unlocked nucleic acid) monomers, on i-DNA thermodynamic stability has been investigated. The 22 nt human telomeric fragment was chosen as the model sequence for stability studies. UNA monomers modulate i-motif stability in a position-depending manner...

  4. Microbial expression of proteins containing long repetitive Arg-Gly-Asp cell adhesive motifs created by overlap elongation PCR

    International Nuclear Information System (INIS)

    Kurihara, Hiroyuki; Shinkai, Masashige; Nagamune, Teruyuki

    2004-01-01

    We developed a novel method for creating repetitive DNA libraries using overlap elongation PCR, and prepared a DNA library encoding repetitive Arg-Gly-Asp (RGD) cell adhesive motifs. We obtained DNAs of various lengths encoding repetitive RGD from a short monomer DNA (18 bp) after a thermal cyclic reaction without a DNA template for amplification, and isolated DNAs encoding 2, 21, and 43 repeats of the RGD motif. We cloned these DNAs into a protein expression vector and overexpressed them as thioredoxin fusion proteins: RGD2, RGD21, and RGD43, respectively. The solubility of RGD43 in water was low and it formed a fibrous precipitate in water. Scanning electron microscopy revealed that RGD43 formed a branched 3D-network structure in the solid state. To evaluate the function of the cell adhesive motifs in RGD43, mouse fibroblast cells were cultivated on the RGD43 scaffold. The fibroblast cells adhered to the RGD43 scaffold and extended long filopodia.

  5. Population Structure of mtDNA Variation due to Pleistocene Fluctuations in the South American Maned Wolf (Chrysocyon brachyurus, Illiger, 1815): Management Units for Conservation.

    Science.gov (United States)

    González, Susana; Cosse, Mariana; Franco, María del Rosario; Emmons, Louise; Vynne, Carly; Duarte, José Maurício Barbanti; Beccacesi, Marcelo D; Maldonado, Jesús E

    2015-01-01

    The maned wolf (Chrysocyon brachyurus) is one of the largest South American canids, and conservation across this charismatic carnivore's large range is presently hampered by a lack of knowledge about possible natural subdivisions which could influence the population's viability. To elucidate the phylogeographic patterns and demographic history of the species, we used 2 mtDNA markers (D-loop and cytochrome b) from 87 individuals collected throughout their range, in Argentina, Bolivia, Brazil, and Uruguay. We found moderate levels of haplotype and nucleotide diversity, and the 14 D-loop haplotypes were closely related. Genetic structure results revealed 4 groups, and when coupled with model inferences from a coalescent analysis, suggested that maned wolves have undergone demographic fluctuations due to changes in climate and habitat during the Pleistocene glaciation period approximately 24000 years before present (YBP). This genetic signature points to an event that occurred within the timing estimated for the start of the contraction of the Cerrado around 50000 YBP. Our results reveal a genetic signature of population size expansion followed by contraction during Pleistocene interglaciations, which had similar impacts on other South American mammals. The 4 groups should for now be considered management units, within which future monitoring efforts should be conducted independently. © The American Genetic Association 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  6. Sequence-specific high mobility group box factors recognize 10-12-base pair minor groove motifs

    DEFF Research Database (Denmark)

    van Beest, M; Dooijes, D; van De Wetering, M

    2000-01-01

    Sequence-specific high mobility group (HMG) box factors bind and bend DNA via interactions in the minor groove. Three-dimensional NMR analyses have provided the structural basis for this interaction. The cognate HMG domain DNA motif is generally believed to span 6-8 bases. However, alignment...

  7. One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

    Science.gov (United States)

    Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

    2014-12-01

    G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.

  8. Hunting Motifs in Situla Art

    Directory of Open Access Journals (Sweden)

    Andrej Preložnik

    2013-07-01

    Situla art developed as an echo of the toreutic style which had spread from the Near East through the Phoenicians, Greeks and Etruscans as far as the Veneti, Raeti, Histri, and their eastern neighbours in the region of Dolenjska (Lower Carniola). An Early Iron Age phenomenon (c. 600–300 BC), it represents the major and most arresting form of the contemporary visual arts in an area stretching from the foot of the Apennines in the south to the Drava and Sava rivers in the east. Indeed, individual pieces have found their way across the Alpine passes and all the way north to the Danube. In the world and art of the situlae, a prominent role is accorded to animals. They are displayed in numerous representations of human activities on artefacts crafted in the classic situla style – that is, between the late 6th and early 5th centuries BC – as passive participants (e.g. in pageants or in harness) or as an active element of the situla narrative. The most typical example of the latter is the hunting scene. Today we know at least four objects decorated exclusively with hunting themes, and a number of situlae and other larger vessels where hunting scenes are embedded in composite narratives. All this suggests a popularity unparalleled by any other genre. Clearly recognisable are various hunting techniques and weapons, each associated with a particular type of game (Fig. 1). The chase of a stag with javelin, horse and hound is depicted on the long-familiar and repeatedly published fibula of Zagorje (Fig. 2). It displays a hound mauling the stag’s back and a hunter on horseback pursuing a hind, her neck already pierced by the javelin. To judge by the (so far unnoticed) shaft end under the stag’s muzzle, the hunter would have been brandishing a second javelin as well, like the warrior of the Vače fibula or the rider of the Nesactium situla, presumably himself a hunter. Many parallels to his motif are known from Greece, Etruria, and

  9. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Science.gov (United States)

    Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui

    2012-01-01

    Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/
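
    The position-by-position scoring that the abstract names as the bottleneck can be sketched on the CPU as follows. This is a generic log-odds motif scan, not the HMS algorithm or the GPUmotif code; the matrix values, the uniform background, the pseudocount, and the function names are assumptions made for illustration.

```python
from math import log2

def log_odds_matrix(pfm, background=0.25, pseudocount=0.5):
    """Turn a position frequency matrix (one {base: count} dict per
    motif column) into log2-odds scores against a uniform background."""
    pwm = []
    for column in pfm:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            b: log2(((column.get(b, 0) + pseudocount) / total) / background)
            for b in "ACGT"
        })
    return pwm

def scan(sequence, pwm):
    """Score every window of the sequence, one start position at a time."""
    w = len(pwm)
    return [
        sum(pwm[j][sequence[i + j]] for j in range(w))
        for i in range(len(sequence) - w + 1)
    ]

# Hypothetical 4-column motif resembling TATA (made-up counts).
pfm = [{"T": 10}, {"A": 10}, {"T": 8, "A": 2}, {"A": 10}]
pwm = log_odds_matrix(pfm)
scores = scan("GGTATAGG", pwm)
best = max(range(len(scores)), key=scores.__getitem__)
```

    Each window score is independent of the others, which is the kind of workload that maps naturally onto many parallel threads; how GPUmotif actually partitions the work is not stated in the abstract.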

  10. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Directory of Open Access Journals (Sweden)

    Pooya Zandevakili

    Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  11. Improved i-motif thermal stability by insertion of anthraquinone monomers

    DEFF Research Database (Denmark)

    Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

    2017-01-01

    In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...

  12. Footprinting of Chlorella virus DNA ligase bound at a nick in duplex DNA.

    Science.gov (United States)

    Odell, M; Shuman, S

    1999-05-14

    The 298-amino acid ATP-dependent DNA ligase of Chlorella virus PBCV-1 is the smallest eukaryotic DNA ligase known. The enzyme has intrinsic specificity for binding to nicked duplex DNA. To delineate the ligase-DNA interface, we have footprinted the enzyme binding site on DNA and the DNA binding site on ligase. The size of the exonuclease III footprint of ligase bound a single nick in duplex DNA is 19-21 nucleotides. The footprint is asymmetric, extending 8-9 nucleotides on the 3'-OH side of the nick and 11-12 nucleotides on the 5'-phosphate side. The 5'-phosphate moiety is essential for the binding of Chlorella virus ligase to nicked DNA. Here we show that the 3'-OH moiety is not required for nick recognition. The Chlorella virus ligase binds to a nicked ligand containing 2',3'-dideoxy and 5'-phosphate termini, but cannot catalyze adenylation of the 5'-end. Hence, the 3'-OH is important for step 2 chemistry even though it is not itself chemically transformed during DNA-adenylate formation. A 2'-OH cannot substitute for the essential 3'-OH in adenylation at a nick or even in strand closure at a preadenylated nick. The protein side of the ligase-DNA interface was probed by limited proteolysis of ligase with trypsin and chymotrypsin in the presence and absence of nicked DNA. Protease accessible sites are clustered within a short segment from amino acids 210-225 located distal to conserved motif V. The ligase is protected from proteolysis by nicked DNA. Protease cleavage of the native enzyme prior to DNA addition results in loss of DNA binding. These results suggest a bipartite domain structure in which the interdomain segment either comprises part of the DNA binding site or undergoes a conformational change upon DNA binding. The domain structure of Chlorella virus ligase inferred from the solution experiments is consistent with the structure of T7 DNA ligase determined by x-ray crystallography.

  13. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed; Mansour, Essam; Kalnis, Panos

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern

  14. Deciphering functional glycosaminoglycan motifs in development.

    Science.gov (United States)

    Townley, Robert A; Bülow, Hannes E

    2018-03-23

    Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby responsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. Bayesian centroid estimation for motif discovery.

    Science.gov (United States)

    Carvalho, Luis

    2013-01-01

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.
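
    The difference between a centroid estimator and the maximum a posteriori (MAP) estimator can be seen in a toy case. The sketch below assumes plain Hamming loss, whereas the abstract speaks of a refined loss function that is not specified here; the posterior values and function names are invented for illustration. Under Hamming loss, the centroid keeps each candidate site whose marginal posterior probability exceeds 1/2.

```python
# Toy posterior over binding-site configurations for three candidate
# positions (1 = site present). Probabilities are made up.
posterior = {
    (1, 0, 0): 0.4,
    (0, 1, 1): 0.3,
    (0, 1, 0): 0.3,
}

def map_estimate(posterior):
    """Return the single most probable configuration."""
    return max(posterior, key=posterior.get)

def centroid_estimate(posterior):
    """Minimize expected Hamming loss: keep each position whose
    marginal probability of being a site exceeds 1/2."""
    n = len(next(iter(posterior)))
    marginals = [
        sum(p for cfg, p in posterior.items() if cfg[i])
        for i in range(n)
    ]
    return tuple(int(m > 0.5) for m in marginals)

map_cfg = map_estimate(posterior)
centroid_cfg = centroid_estimate(posterior)
```

    Here the MAP estimate is (1, 0, 0), the single likeliest configuration, while the centroid is (0, 1, 0), because the second position is a site in 60% of the posterior mass. The two estimators disagree whenever the posterior is spread over several similar configurations.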

  16. Bayesian centroid estimation for motif discovery.

    Directory of Open Access Journals (Sweden)

    Luis Carvalho

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  17. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

    Science.gov (United States)

    Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

    2011-07-01

    The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.

  18. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    Administrator

    2011-10-19

    Oct 19, 2011 ... gene expression values of controls and ... each sample g_i = {x_i1, x_i2, ..., x_im, y_i}, with class label y_i = 1 or y_i = −1 ...

  19. Interaction of a nodule specific, trans-acting factor with distinct DNA elements in the soybean leghaemoglobin lbc(3) 5' upstream region

    DEFF Research Database (Denmark)

    Jensen, Erik Østergaard; Marcker, Kjeld A; Schell, J

    1988-01-01

    Nuclear extracts from soybean nodules, leaves and roots were used to investigate protein-DNA interactions in the 5' upstream (promoter) region of the soybean leghaemoglobin lbc(3) gene. Two distinct regions were identified which strongly bind a nodule specific factor. A Bal31 deletion analysis......, but with different affinities. Elements 1 and 2 share a common motif, although their AT-rich DNA sequences differ. Element 2 is highly conserved at an analogous position in other soybean lb gene 5' upstream regions. Publication date: 1988-May...

  20. The PCNA interaction protein box sequence in Rad54 is an integral part of its ATPase domain and is required for efficient DNA repair and recombination

    DEFF Research Database (Denmark)

    Burgess, Rebecca C; Sebesta, Marek; Sisakova, Alexandra

    2013-01-01

    Rad54 is an ATP-driven translocase involved in the genome maintenance pathway of homologous recombination (HR). Although its activity has been implicated in several steps of HR, its exact role(s) at each step are still not fully understood. We have identified a new interaction between Rad54...... and the replicative DNA clamp, proliferating cell nuclear antigen (PCNA). This interaction was only mildly weakened by the mutation of two key hydrophobic residues in the highly-conserved PCNA interaction motif (PIP-box) of Rad54 (Rad54-AA). Intriguingly, the rad54-AA mutant cells displayed sensitivity to DNA damage...

  1. Native characterization of nucleic acid motif thermodynamics via non-covalent catalysis

    Science.gov (United States)

    Wang, Chunyan; Bae, Jin H.; Zhang, David Yu

    2016-01-01

    DNA hybridization thermodynamics is critical for accurate design of oligonucleotides for biotechnology and nanotechnology applications, but parameters currently in use are inaccurately extrapolated based on limited quantitative understanding of thermal behaviours. Here, we present a method to measure the ΔG° of DNA motifs at temperatures and buffer conditions of interest, with significantly better accuracy (6- to 14-fold lower s.e.) than prior methods. The equilibrium constant of a reaction with thermodynamics closely approximating that of a desired motif is numerically calculated from directly observed reactant and product equilibrium concentrations; a DNA catalyst is designed to accelerate equilibration. We measured the ΔG° of terminal fluorophores, single-nucleotide dangles and multinucleotide dangles, in temperatures ranging from 10 to 45 °C. PMID:26782977
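
    The step from observed equilibrium concentrations to ΔG° rests on the standard relation ΔG° = −RT ln K_eq. The sketch below assumes a reaction with two reactants and two products, so that K_eq is dimensionless; the concentrations, the function names, and the choice of kcal/mol are illustrative assumptions, not values or code from the paper.

```python
from math import log, prod

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def equilibrium_constant(products, reactants):
    """K_eq as the ratio of products of equilibrium concentrations (M)."""
    return prod(products) / prod(reactants)

def delta_g_standard(k_eq, temp_celsius):
    """Standard free energy change, ΔG° = -R * T * ln(K_eq), in kcal/mol."""
    return -R_KCAL * (temp_celsius + 273.15) * log(k_eq)

# Made-up equilibrium concentrations for a reaction A + B <-> C + D.
k_eq = equilibrium_constant(products=[2e-8, 2e-8], reactants=[1e-8, 4e-9])
dg = delta_g_standard(k_eq, temp_celsius=25.0)
```

    With these numbers K_eq is 10 and ΔG° is about −1.36 kcal/mol at 25 °C. Because ΔG° depends on ln K_eq, a given relative error in the measured concentrations translates into a roughly constant absolute error in ΔG°.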

  2. Mitochondrial and Y chromosome haplotype motifs as diagnostic markers of Jewish ancestry: a reconsideration.

    Directory of Open Access Journals (Sweden)

    Sergio Tofanelli

    2014-11-01

    Several authors have proposed haplotype motifs based on site variants at the mitochondrial genome (mtDNA) and the non-recombining portion of the Y chromosome (NRY) to trace the genealogies of Jewish people. Here, we analyzed their main approaches and tested the feasibility of adopting motifs as ancestry markers through construction of a large database of mtDNA and NRY haplotypes from public genetic genealogical repositories. We verified the reliability of Jewish ancestry prediction based on the Cohen and Levite Modal Haplotypes in their classical 6 STR marker format or in the extended 12 STR format, as well as four founder mtDNA lineages (HVS-I segments) accounting for about 40% of the current population of Ashkenazi Jews. For this purpose we compared haplotype composition in individuals of self-reported Jewish ancestry with the rest of European, African or Middle Eastern samples, to test for non-random association of ethno-geographic groups and haplotypes. Overall, NRY and mtDNA based motifs, previously reported to differentiate between groups, were found to be more represented in Jewish compared to non-Jewish groups. However, this seems to stem from common ancestors of Jewish lineages being rather recent with respect to ancestors of non-Jewish lineages with the same haplotype signatures. Moreover, the polyphyly of haplotypes which contain the proposed motifs and the misuse of constant mutation rates heavily affected previous attempts to correctly date the origin of common ancestries. Accordingly, our results stress the limitations of using the above haplotype motifs as reliable Jewish ancestry predictors and show their inadequacy for forensic or genealogical purposes.

  3. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole

    2010-01-01

    is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...

  4. Analisis Unsur Matematika pada Motif Sulam Usus

    Directory of Open Access Journals (Sweden)

    Fredi Ganda Putra

    2017-12-01

    According to the researchers' interview sources, sulam usus (intestine embroidery) began as a genuine craft art. It is called intestine embroidery because the technique joins strands of cloth resembling intestines, shaped according to a pattern and embroidered together with thread. The technique was originally used to make a covering for the customary women's garment of Lampung, often referred to as bebe. Yet many people, including many who live in Lampung, still do not know or recognize intestine embroidery, because most know only tapis as the characteristic textile of Lampung, although intestine embroidery is another product of the same culture. Many also do not know that intestine embroidery motifs embody mathematical knowledge. The research question is whether intestine embroidery motifs contain mathematical elements based on the concept of geometry, and the purpose of the study is to determine this. The subjects of the study were 4 people selected by purposive sampling. Descriptive analysis of the data and discussion yielded the following: (1) Intestine embroidery motifs carry both mathematical and cultural meaning, often called ethnomathematics (Etnomatematika). In terms of cultural meaning, intestine embroidery is linked to earlier culture, such as Hindu-Buddhist beliefs, and its motifs and decorative patterns resemble ornamental varieties found elsewhere in Indonesia. (2) In terms of the relationship between intestine embroidery motifs and mathematics, the motifs contain mathematical elements such as geometric elements in the form of one-dimensional and two-dimensional geometry, and the

  5. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    Science.gov (United States)

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels. An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome

  6. Arabidopsis DNA methyltransferase AtDNMT2 associates with histone deacetylase AtHD2s activity

    Energy Technology Data Exchange (ETDEWEB)

    Song, Yuan [Key Laboratory of Arid and Grassland Agroecology, Ministry of Education, School of Life Science, Lanzhou University, Lanzhou 730000 (China); Southern Crop Protection and Food Research Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, ON, Canada N5V4T3 (Canada); Wu, Keqiang [Institute of Plant Biology, National Taiwan University, Taipei 106, Taiwan (China); Dhaubhadel, Sangeeta [Southern Crop Protection and Food Research Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, ON, Canada N5V4T3 (Canada); An, Lizhe, E-mail: lizhean@lzu.edu.cn [Key Laboratory of Arid and Grassland Agroecology, Ministry of Education, School of Life Science, Lanzhou University, Lanzhou 730000 (China); Tian, Lining, E-mail: tianl@agr.gc.ca [Southern Crop Protection and Food Research Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, ON, Canada N5V4T3 (Canada)

    2010-05-28

    DNA methyltransferase2 (DNMT2) is always deemed to be enigmatic, because it contains highly conserved DNA methyltransferase motifs but lacks the DNA methylation catalytic capability. Here we show that Arabidopsis DNA methyltransferase2 (AtDNMT2) is localized in nucleus and associates with histone deacetylation. Bimolecular fluorescence complementation and pull-down assays show AtDNMT2 interacts with type-2 histone deacetylases (AtHD2s), a unique type of histone deacetylase family in plants. Through analyzing the expression of AtDNMT2: β-glucuronidase (GUS) fusion protein, we demonstrate that AtDNMT2 has the ability to repress gene expression at transcription level. Meanwhile, the expression of AtDNMT2 gene is altered in athd2c mutant plants. We propose that AtDNMT2 possibly involves in the activity of histone deacetylation and plant epigenetic regulatory network.

  7. Arabidopsis DNA methyltransferase AtDNMT2 associates with histone deacetylase AtHD2s activity

    International Nuclear Information System (INIS)

    Song, Yuan; Wu, Keqiang; Dhaubhadel, Sangeeta; An, Lizhe; Tian, Lining

    2010-01-01

    DNA methyltransferase2 (DNMT2) is always deemed to be enigmatic, because it contains highly conserved DNA methyltransferase motifs but lacks the DNA methylation catalytic capability. Here we show that Arabidopsis DNA methyltransferase2 (AtDNMT2) is localized in nucleus and associates with histone deacetylation. Bimolecular fluorescence complementation and pull-down assays show AtDNMT2 interacts with type-2 histone deacetylases (AtHD2s), a unique type of histone deacetylase family in plants. Through analyzing the expression of AtDNMT2: β-glucuronidase (GUS) fusion protein, we demonstrate that AtDNMT2 has the ability to repress gene expression at transcription level. Meanwhile, the expression of AtDNMT2 gene is altered in athd2c mutant plants. We propose that AtDNMT2 possibly involves in the activity of histone deacetylation and plant epigenetic regulatory network.

  8. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    To know the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%, far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  9. Direct AUC optimization of regulatory motifs.

    Science.gov (United States)

    Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

    2017-07-15

    The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the many computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have proven promising for harnessing the large volume of accumulated high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real-world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Preliminary results also show that CDAUC may be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequence specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . Contact: dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
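
    The AUC criterion named in this abstract can be illustrated with a small sketch. The code below is not CDAUC; it only computes the rank-based AUC for motif scores on bound (positive) and unbound (negative) sequences. The function name and the example scores are assumptions made for illustration.

```python
def auc(pos_scores, neg_scores):
    """Rank-based AUC: the fraction of (positive, negative) pairs in which
    the positive sequence scores higher; ties count as half a pair."""
    if not pos_scores or not neg_scores:
        raise ValueError("need at least one score in each class")
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical motif scores for bound and unbound sequences.
bound = [0.9, 0.8, 0.4]
unbound = [0.5, 0.2]
print(auc(bound, unbound))
```

    Because this quantity depends only on the ordering of the scores, moving a single motif parameter changes it only when some positive/negative pair swaps order, which is consistent with the piece-wise constant sub-problem described in the abstract.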

  10. High antiviral effect of TiO2·PL–DNA nanocomposites targeted to conservative regions of (−)RNA and (+)RNA of influenza A virus in cell culture

    Directory of Open Access Journals (Sweden)

    Asya S. Levina

    2016-08-01

    Background: The development of new antiviral drugs based on nucleic acids is under scrutiny. An important problem in this respect is to find the most vulnerable conservative regions in the viral genome as targets for the action of these agents. Another challenge is the development of an efficient system for their delivery into cells. To solve this problem, we proposed a TiO2·PL–DNA nanocomposite consisting of titanium dioxide nanoparticles and polylysine (PL)-containing oligonucleotides. Results: The TiO2·PL–DNA nanocomposites bearing the DNA fragments targeted to different conservative regions of (−)RNA and (+)RNA of segment 5 of influenza A virus (IAV) were studied for their antiviral activity in MDCK cells infected with the H1N1, H5N1, and H3N2 virus subtypes. Within the negative strand of each of the studied strains, the efficiency of DNA fragments increased in the direction of its 3'-end. Thus, the DNA fragment aimed at the 3'-noncoding region of (−)RNA was the most efficient and inhibited the reproduction of different IAV subtypes by 3–4 orders of magnitude. Although to a lesser extent, the DNA fragments targeted at the AUG region of (+)RNA and the corresponding region of (−)RNA were also active. For all studied viral subtypes, the nanocomposites bearing the DNA fragments targeted to (−)RNA appeared to be more efficient than those containing fragments aimed at the corresponding (+)RNA regions. Conclusion: The proposed TiO2·PL–DNA nanocomposites can be successfully used for highly efficient and site-specific inhibition of influenza A virus of different subtypes. Some patterns of localization of the most vulnerable regions in IAV segment 5 for the action of DNA-based drugs were found. The (−)RNA strand of IAV segment 5 appeared to be more sensitive than the (+)RNA strand.

  11. The MARVEL transmembrane motif of occludin mediates oligomerization and targeting to the basolateral surface in epithelia.

    Science.gov (United States)

    Yaffe, Yakey; Shepshelovitch, Jeanne; Nevo-Yassaf, Inbar; Yeheskel, Adva; Shmerling, Hedva; Kwiatek, Joanna M; Gaus, Katharina; Pasmanik-Chor, Metsada; Hirschberg, Koret

    2012-08-01

    Occludin (Ocln), a MARVEL-motif-containing protein, is found in all tight junctions. MARVEL motifs are comprised of four transmembrane helices associated with the localization to or formation of diverse membrane subdomains by interacting with the proximal lipid environment. The functions of the Ocln MARVEL motif are unknown. Bioinformatics sequence- and structure-based analyses demonstrated that the MARVEL domain of Ocln family proteins has distinct evolutionarily conserved sequence features that are consistent with its basolateral membrane localization. Live-cell microscopy, fluorescence resonance energy transfer (FRET) and bimolecular fluorescence complementation (BiFC) were used to analyze the intracellular distribution and self-association of fluorescent-protein-tagged full-length human Ocln or the Ocln MARVEL motif excluding the cytosolic C- and N-termini (amino acids 60-269, FP-MARVEL-Ocln). FP-MARVEL-Ocln efficiently arrived at the plasma membrane (PM) and was sorted to the basolateral PM in filter-grown polarized MDCK cells. A series of conserved aromatic amino acids within the MARVEL domain were found to be associated with Ocln dimerization using BiFC. FP-MARVEL-Ocln inhibited membrane pore growth during Triton-X-100-induced solubilization and was shown to increase the membrane-ordered state using Laurdan, a lipid dye. These data demonstrate that the Ocln MARVEL domain mediates self-association and correct sorting to the basolateral membrane.

  12. Solution NMR characterization of Sgf73(1-104) indicates that Zn ion is required to stabilize zinc finger motif

    International Nuclear Information System (INIS)

    Lai, Chaohua; Wu, Minhao; Li, Pan; Shi, Chaowei; Tian, Changlin; Zang, Jianye

    2010-01-01

    The zinc finger motif contains a zinc ion coordinated by several conserved amino acid residues. The yeast Sgf73 protein was identified as a component of the SAGA (Spt/Ada/Gcn5 acetyltransferase) multi-subunit complex and is known to contain two zinc finger motifs. Sgf73(1-104), containing the first zinc finger motif, was necessary to modulate the deubiquitinase activity of the SAGA complex. Here, Sgf73(1-104) was over-expressed using a bacterial expression system and purified for solution NMR (nuclear magnetic resonance) structural studies. Secondary structure and site-specific relaxation analyses of Sgf73(1-104) were carried out after solution NMR backbone assignment. Solution NMR and circular dichroism analysis of Sgf73(1-104) after zinc ion removal with the chelating reagent EDTA (ethylene-diamine-tetraacetic acid) demonstrated that the zinc ion was required to maintain a stable conformation of the zinc finger motif.

  13. The C-Terminal RpoN Domain of sigma54 Forms an unpredicted Helix-Turn-Helix Motif Similar to domains of sigma70

    Energy Technology Data Exchange (ETDEWEB)

    Doucleff, Michaeleen; Malak, Lawrence T.; Pelton, Jeffrey G.; Wemmer, David E.

    2005-11-01

    The σ subunit of prokaryotic RNA polymerase allows gene-specific transcription initiation. Two σ families have been identified, σ⁷⁰ and σ⁵⁴, which use distinct mechanisms to initiate transcription and share no detectable sequence homology. Although the σ⁷⁰-type factors have been well characterized structurally by x-ray crystallography, no high-resolution structural information is available for the σ⁵⁴-type factors. Here we present the NMR-derived structure of the C-terminal domain of σ⁵⁴ from Aquifex aeolicus. This domain (Thr323 to Gly389), which contains the highly conserved RpoN box sequence, consists of a poorly structured N-terminal tail followed by a three-helix bundle, which is surprisingly similar to domains of the σ⁷⁰-type proteins. Residues of the RpoN box, which have previously been shown to be critical for DNA binding, form the second helix of an unpredicted helix-turn-helix motif. This structure's homology with other DNA binding proteins, combined with previous biochemical data, suggests how the C-terminal domain of σ⁵⁴ binds to DNA.

  14. Assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data.

    Directory of Open Access Journals (Sweden)

    Yaron Orenstein

    The new technology of protein binding microarrays (PBMs) allows simultaneous measurement of the binding intensities of a transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key computational challenge is inferring the binding motif from these data. We present a systematic comparison of four methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.
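
    The positional weight matrix and the information content mentioned in this abstract can be sketched briefly. The code below is a generic illustration, not one of the four methods compared in the paper; the aligned sites, the pseudocount and the uniform background of 0.25 per base are assumptions.

```python
import math

BASES = "ACGT"


def build_pfm(sites, pseudocount=1.0):
    """Per-position base frequencies from equal-length aligned sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    pfm = []
    for i in range(length):
        counts = {b: pseudocount for b in BASES}
        for s in sites:
            counts[s[i]] += 1
        total = sum(counts.values())
        pfm.append({b: counts[b] / total for b in BASES})
    return pfm


def information_content(pfm):
    """Total information in bits, relative to a uniform background."""
    return sum(
        2.0 + sum(p * math.log2(p) for p in col.values() if p > 0)
        for col in pfm
    )


def log_odds_score(pfm, window, background=0.25):
    """Log-odds score of one window against a uniform background."""
    return sum(math.log2(pfm[i][b] / background) for i, b in enumerate(window))


def best_hit(pfm, seq):
    """Return (score, start) of the highest-scoring window in seq."""
    w = len(pfm)
    return max((log_odds_score(pfm, seq[i:i + w]), i)
               for i in range(len(seq) - w + 1))


# Hypothetical aligned binding sites.
pfm = build_pfm(["ACGT", "ACGT", "ACGA"])
print(information_content(pfm), best_hit(pfm, "TTACGTTT"))
```

    A matrix with frequencies close to uniform has an information content near zero bits, while a matrix built from nearly identical sites approaches two bits per position; this is the quantity behind the low-information versus highly informative distinction drawn in the abstract.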

  15. Highly scalable Ab initio genomic motif identification

    KAUST Repository

    Marchand, Benoit; Bajic, Vladimir B.; Kaushik, Dinesh

    2011-01-01

    We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motif-finding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.
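
    The efficiency figure in this abstract can be unpacked with a short calculation. The sketch below assumes the usual definition of parallel efficiency relative to a reference core count; the function names are illustrative and no timings from the paper are used.

```python
def relative_efficiency(t_ref, cores_ref, t, cores):
    """Efficiency on `cores` relative to a run on `cores_ref`:
    ideal time (t_ref * cores_ref / cores) divided by measured time t."""
    return (t_ref * cores_ref) / (t * cores)


def relative_speedup(efficiency, cores, cores_ref):
    """Speedup over the reference run implied by a relative efficiency."""
    return efficiency * cores / cores_ref


# 94% efficiency on 65,536 cores relative to 256 cores (figures from the abstract).
print(relative_speedup(0.94, 65536, 256))
```

    Under this definition, 94% efficiency over a 256-fold increase in cores corresponds to roughly a 240-fold speedup over the 256-core run. The overall factor of more than 250,000 reported against the original serial code also includes the serial optimizations, which is why it can exceed the core count.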

  16. Radiation and desiccation response motif mediates radiation induced gene expression in D. radiodurans

    International Nuclear Information System (INIS)

    Anaganti, Narasimha; Basu, Bhakti; Apte, Shree Kumar

    2015-01-01

    Deinococcus radiodurans is an extremophile that withstands lethal doses of several DNA damaging agents such as gamma irradiation, UV rays, desiccation and chemical mutagens. The organism responds to DNA damage by inducing expression of several DNA repair genes. At least 25 radiation inducible gene promoters harbour a 17 bp palindromic sequence known as the radiation and desiccation response motif (RDRM), implicated in gamma radiation inducible gene expression. However, mechanistic details of gamma radiation-responsive up-regulation in gene expression remain enigmatic. The promoters of the highly radiation induced genes ddrB (DR0070), gyrB (DR0906), gyrA (DR1913), a hypothetical gene (DR1143) and recA (DR2338) from D. radiodurans were cloned in a green fluorescent protein (GFP)-based promoter probe shuttle vector pKG and their promoter activity was assessed in both E. coli and D. radiodurans. The gyrA, gyrB and DR1143 gene promoters were active in E. coli, although the ddrB and recA promoters showed very weak activity. In D. radiodurans, all five promoters were induced several fold following 6 kGy gamma irradiation. The highest induction was observed for the ddrB promoter (25 fold), followed by the DR1143 promoter (15 fold). The induction in the activity of the gyrB, gyrA and recA promoters was 5, 3 and 2 fold, respectively. To assess the role of RDRM, the 17 bp palindromic sequence was deleted from these promoters. The promoters devoid of the RDRM sequence displayed an increase in basal expression activity, but the radiation-responsive induction in promoter activity was completely lost. The substitution of two conserved bases of the RDRM sequence yielded decreased radiation induction of the PDR0070 promoter. Deletion of 5 bases from the 5'-end of the PDR0070 RDRM increased basal promoter activity, but radiation induction was completely abolished. Replacement of RDRM with a nonspecific sequence of PDR0070 resulted in loss of basal expression and radiation induction. The results demonstrate that

  17. Target motifs affecting natural immunity by a constitutive CRISPR-Cas system in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Cristóbal Almendros

    Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated (cas) genes constitute the CRISPR-Cas systems of various bacteria and archaea and bring about the degradation of invading nucleic acids containing sequences (protospacers) that are complementary to repeat-intervening spacers. It has been demonstrated that the base sequence identity of a protospacer with the cognate spacer and the presence of a protospacer adjacent motif (PAM) influence CRISPR-mediated interference efficiency. By using an original transformation assay with plasmids targeted by a resident spacer, here we show that natural CRISPR-mediated immunity against invading DNA occurs in wild type Escherichia coli. Unexpectedly, the strongest activity is observed with protospacer adjoining nucleotides (interference motifs) that differ from the PAM both in sequence and location. Hence, our results document for the first time native CRISPR activity in E. coli and demonstrate that positions next to the PAM in invading DNA influence their recognition and degradation by these prokaryotic immune systems.

  18. CombiMotif: A new algorithm for network motifs discovery in protein-protein interaction networks

    Science.gov (United States)

    Luo, Jiawei; Li, Guanghui; Song, Dan; Liang, Cheng

    2014-12-01

    Discovering motifs in protein-protein interaction networks is becoming a major challenge in computational biology, since the distribution of the number of network motifs can reveal significant systemic differences among species. However, this task can be computationally expensive because it involves graph isomorphism detection. In this paper, we present a new algorithm (CombiMotif) that incorporates combinatorial techniques to count non-induced occurrences of subgraph topologies in the form of trees. The efficiency of our algorithm is demonstrated by comparing the obtained results with the current state-of-the-art subgraph counting algorithms. We also show major differences between unicellular and multicellular organisms. The datasets and source code of CombiMotif are freely available upon request.
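
    The idea of counting non-induced tree occurrences combinatorially, rather than by enumerating and testing subgraphs, can be shown on the simplest tree shape. The sketch below is not the CombiMotif algorithm; it is a minimal example for star-shaped trees only, and the function names and the toy edge list are assumptions.

```python
from math import comb


def degrees(edges):
    """Degree of each node in a simple undirected graph given as an edge
    list; self-loops and repeated edges are ignored."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return {node: len(nbrs) for node, nbrs in adj.items()}


def count_stars(edges, leaves):
    """Non-induced occurrences of a star with `leaves` leaves (leaves >= 2).
    Each node of degree d is the centre of C(d, leaves) such stars."""
    if leaves < 2:
        raise ValueError("leaves must be at least 2")
    return sum(comb(d, leaves) for d in degrees(edges).values())


# Toy interaction network: a triangle.
triangle = [(1, 2), (2, 3), (1, 3)]
print(count_stars(triangle, 2))
```

    For the triangle, the count of three-node paths (stars with two leaves) is 3, even though no induced three-node path exists there, because non-induced counting ignores the extra edge between the end nodes. Only node degrees are needed, so no isomorphism test is performed.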

  19. Motif III in superfamily 2 "helicases" helps convert the binding energy of ATP into a high-affinity RNA binding site in the yeast DEAD-box protein Ded1.

    Science.gov (United States)

    Banroques, Josette; Doère, Monique; Dreyfus, Marc; Linder, Patrick; Tanner, N Kyle

    2010-03-05

    Motif III in the putative helicases of superfamily 2 is highly conserved in both its sequence and its structural context. It typically consists of the sequence alcohol-alanine-alcohol (S/T-A-S/T). Historically, it was thought to link ATPase activity with a "helicase" strand displacement activity that disrupts RNA or DNA duplexes. DEAD-box proteins constitute the largest family of superfamily 2; they are RNA-dependent ATPases and ATP-dependent RNA binding proteins that, in some cases, are able to disrupt short RNA duplexes. We made mutations of motif III (S-A-T) in the yeast DEAD-box protein Ded1 and analyzed in vivo phenotypes and in vitro properties. Moreover, we made a tertiary model of Ded1 based on the solved structure of Vasa. We used Ded1 because it has relatively high ATPase and RNA binding activities; it is able to displace moderately stable duplexes at a large excess of substrate. We find that the alanine and the threonine in the second and third positions of motif III are more important than the serine, but that mutations of all three residues have strong phenotypes. We purified the wild-type and various mutants expressed in Escherichia coli. We found that motif III mutations affect the RNA-dependent hydrolysis of ATP (k(cat)), but not the affinity for ATP (K(m)). Moreover, mutations alter and reduce the affinity for single-stranded RNA and subsequently reduce the ability to disrupt duplexes. We obtained intragenic suppressors of the S-A-C mutant that compensate for the mutation by enhancing the affinity for ATP and RNA. We conclude that motif III and the binding energy of gamma-PO(4) of ATP are used to coordinate motifs I, II, and VI and the two RecA-like domains to create a high-affinity single-stranded RNA binding site. It also may help activate the beta,gamma-phosphoanhydride bond of ATP. (c) 2009 Elsevier Ltd. All rights reserved.
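
    The consensus given for motif III, alcohol-alanine-alcohol (S/T-A-S/T), can be written as a simple pattern. The sketch below is only an illustration of that consensus; the function name and the peptide strings are hypothetical, and a three-residue pattern will match many unrelated positions by chance, so it cannot locate motif III without the structural context the abstract refers to.

```python
import re

# S or T, then A, then S or T; the lookahead also reports overlapping matches.
MOTIF_III = re.compile(r"(?=([ST]A[ST]))")


def find_motif_iii(protein):
    """Return (0-based start, matched tripeptide) for each S/T-A-S/T match."""
    return [(m.start(), m.group(1)) for m in MOTIF_III.finditer(protein.upper())]


# Hypothetical peptide fragment, not a real Ded1 sequence.
print(find_motif_iii("MKSATLLTAS"))
```

    The S-A-C mutant mentioned in the abstract would not match this pattern, because cysteine falls outside the S/T class at the third position.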

  20. Structural and functional analyses of DNA-sensing and immune activation by human cGAS.

    Science.gov (United States)

    Kato, Kazuki; Ishii, Ryohei; Goto, Eiji; Ishitani, Ryuichiro; Tokunaga, Fuminori; Nureki, Osamu

    2013-01-01

    The detection of cytosolic DNA, derived from pathogens or host cells, by cytosolic receptors is essential for appropriate host immune responses. Cyclic GMP-AMP synthase (cGAS) is a newly identified cytosolic DNA receptor that produces cyclic GMP-AMP, which activates stimulator of interferon genes (STING), resulting in TBK1-IRF3 pathway activation followed by the production of type I interferons. Here we report the crystal structure of human cGAS. The structure revealed that a cluster of lysine and arginine residues forms the positively charged DNA binding surface of human cGAS, which is important for the STING-dependent immune activation. A structural comparison with other previously determined cGASs and our functional analyses suggested that a conserved zinc finger motif and a leucine residue on the DNA binding surface are crucial for the DNA-specific immune response of human cGAS, consistent with previous work. These structural features properly orient the DNA binding to cGAS, which is critical for DNA-induced cGAS activation and STING-dependent immune activation. Furthermore, we showed that the cGAS-induced activation of STING also involves the activation of the NF-κB and IRF3 pathways. Our results indicated that cGAS is a DNA sensor that efficiently activates the host immune system by inducing two distinct pathways.

  1. Structural and functional analyses of DNA-sensing and immune activation by human cGAS.

    Directory of Open Access Journals (Sweden)

    Kazuki Kato

    The detection of cytosolic DNA, derived from pathogens or host cells, by cytosolic receptors is essential for appropriate host immune responses. Cyclic GMP-AMP synthase (cGAS) is a newly identified cytosolic DNA receptor that produces cyclic GMP-AMP, which activates stimulator of interferon genes (STING), resulting in TBK1-IRF3 pathway activation followed by the production of type I interferons. Here we report the crystal structure of human cGAS. The structure revealed that a cluster of lysine and arginine residues forms the positively charged DNA binding surface of human cGAS, which is important for the STING-dependent immune activation. A structural comparison with other previously determined cGASs and our functional analyses suggested that a conserved zinc finger motif and a leucine residue on the DNA binding surface are crucial for the DNA-specific immune response of human cGAS, consistent with previous work. These structural features properly orient the DNA binding to cGAS, which is critical for DNA-induced cGAS activation and STING-dependent immune activation. Furthermore, we showed that the cGAS-induced activation of STING also involves the activation of the NF-κB and IRF3 pathways. Our results indicated that cGAS is a DNA sensor that efficiently activates the host immune system by inducing two distinct pathways.

  2. Identification of the divergent calmodulin binding motif in yeast Ssb1/Hsp75 protein and in other HSP70 family members.

    Science.gov (United States)

    Heinen, R C; Diniz-Mendes, L; Silva, J T; Paschoalin, V M F

    2006-11-01

    Yeast soluble proteins were fractionated by calmodulin-agarose affinity chromatography and the Ca2+/calmodulin-binding proteins were analyzed by SDS-PAGE. One prominent protein of 66 kDa was excised from the gel, digested with trypsin and the masses of the resultant fragments were determined by MALDI/MS. Twenty-one of 38 monoisotopic peptide masses obtained after tryptic digestion were matched to the heat shock protein Ssb1/Hsp75, covering 37% of its sequence. Computational analysis of the primary structure of Ssb1/Hsp75 identified a unique potential amphipathic alpha-helix in its N-terminal ATPase domain with features of target regions for Ca2+/calmodulin binding. This region, which shares 89% similarity to the experimentally determined calmodulin-binding domain from mouse Hsc70, is conserved in nearly half of the 113 members of the HSP70 family investigated, from yeast to plants and animals. Based on the sequence of this region, phylogenetic analysis grouped the HSP70s in three distinct branches. Two of them comprise the non-calmodulin binding Hsp70s BIP/GR78, a subfamily of eukaryotic HSP70 localized in the endoplasmic reticulum, and DnaK, a subfamily of prokaryotic HSP70. A third heterogeneous group is formed by eukaryotic cytosolic HSP70s containing the new calmodulin-binding motif and other cytosolic HSP70s whose sequences do not conform to that conserved motif, indicating that not all eukaryotic cytosolic Hsp70s are targets for calmodulin regulation. Furthermore, the calmodulin-binding domain found in eukaryotic HSP70s is also the target for binding of Bag-1, an enhancer of the ADP/ATP exchange activity of Hsp70s. A model in which calmodulin displaces Bag-1 and modulates Ssb1/Hsp75 chaperone activity is discussed.

  3. Identification of the divergent calmodulin binding motif in yeast Ssb1/Hsp75 protein and in other HSP70 family members

    Directory of Open Access Journals (Sweden)

    R.C. Heinen

    2006-11-01

    Yeast soluble proteins were fractionated by calmodulin-agarose affinity chromatography and the Ca2+/calmodulin-binding proteins were analyzed by SDS-PAGE. One prominent protein of 66 kDa was excised from the gel, digested with trypsin and the masses of the resultant fragments were determined by MALDI/MS. Twenty-one of 38 monoisotopic peptide masses obtained after tryptic digestion were matched to the heat shock protein Ssb1/Hsp75, covering 37% of its sequence. Computational analysis of the primary structure of Ssb1/Hsp75 identified a unique potential amphipathic alpha-helix in its N-terminal ATPase domain with features of target regions for Ca2+/calmodulin binding. This region, which shares 89% similarity to the experimentally determined calmodulin-binding domain from mouse Hsc70, is conserved in nearly half of the 113 members of the HSP70 family investigated, from yeast to plants and animals. Based on the sequence of this region, phylogenetic analysis grouped the HSP70s in three distinct branches. Two of them comprise the non-calmodulin binding Hsp70s BIP/GR78, a subfamily of eukaryotic HSP70 localized in the endoplasmic reticulum, and DnaK, a subfamily of prokaryotic HSP70. A third heterogeneous group is formed by eukaryotic cytosolic HSP70s containing the new calmodulin-binding motif and other cytosolic HSP70s whose sequences do not conform to that conserved motif, indicating that not all eukaryotic cytosolic Hsp70s are targets for calmodulin regulation. Furthermore, the calmodulin-binding domain found in eukaryotic HSP70s is also the target for binding of Bag-1, an enhancer of the ADP/ATP exchange activity of Hsp70s. A model in which calmodulin displaces Bag-1 and modulates Ssb1/Hsp75 chaperone activity is discussed.

  4. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor

    Energy Technology Data Exchange (ETDEWEB)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill (e-mail: ccheon@sookmyung.ac.kr)

    2016-03-25

    TOR (target of rapamycin) kinase signaling plays a central role as a regulator of growth and proliferation in all eukaryotic cells, and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in a significant decrease in rDNA transcription, demonstrating the feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. Highlights: • Multiple regions on the Arabidopsis Raptor protein were found to be involved in substrate binding. • The N-terminal end of the Arabidopsis ribosomal S6 kinase 1 (AtS6K1) was responsible for interacting with AtRaptor1. • The Raptor-interacting fragment of AtS6K1 could be utilized as an effective inhibitor of plant TOR signaling.

  5. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor

    International Nuclear Information System (INIS)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill

    2016-01-01

    TOR (target of rapamycin) kinase signaling plays a central role as a regulator of growth and proliferation in all eukaryotic cells, and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in a significant decrease in rDNA transcription, demonstrating the feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. Highlights: • Multiple regions on the Arabidopsis Raptor protein were found to be involved in substrate binding. • The N-terminal end of the Arabidopsis ribosomal S6 kinase 1 (AtS6K1) was responsible for interacting with AtRaptor1. • The Raptor-interacting fragment of AtS6K1 could be utilized as an effective inhibitor of plant TOR signaling.

  6. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor.

    Science.gov (United States)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill

    2016-03-25

    TOR (target of rapamycin) kinase signaling plays a central role as a regulator of growth and proliferation in all eukaryotic cells, and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in a significant decrease in rDNA transcription, demonstrating the feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Ancient mtDNA genetic variants modulate mtDNA transcription and replication.

    Directory of Open Access Journals (Sweden)

    Sarit Suissa

    2009-05-01

    Although the functional consequences of mitochondrial DNA (mtDNA) genetic backgrounds (haplotypes, haplogroups) have been demonstrated by both disease association studies and cell culture experiments, it is not clear which of the mutations within the haplogroup carry functional implications and which are "evolutionary silent hitchhikers". We set forth to study the functionality of haplogroup-defining mutations within the mtDNA transcription/replication regulatory region by in vitro transcription, hypothesizing that haplogroup-defining mutations occurring within regulatory motifs of mtDNA could affect these processes. We thus screened >2500 complete human mtDNAs representing all major populations worldwide for natural variation in experimentally established protein binding sites and regulatory regions comprising a total of 241 bp in each mtDNA. Our screen revealed 77/241 sites showing point mutations that could be divided into non-fixed (57/77, 74%) and haplogroup/sub-haplogroup-defining changes (i.e., population fixed changes, 20/77, 26%). The variant defining Caucasian haplogroup J (C295T) increased the binding of TFAM (Electro Mobility Shift Assay) and the capacity of in vitro L-strand transcription, especially of a shorter transcript that maps immediately upstream of conserved sequence block 1 (CSB1), a region associated with RNA priming of mtDNA replication. Consistent with this finding, cybrids (i.e., cells sharing the same nuclear genetic background but differing in their mtDNA backgrounds) harboring haplogroup J mtDNA had a >2 fold increase in mtDNA copy number, as compared to cybrids containing haplogroup H, with no apparent differences in steady state levels of mtDNA-encoded transcripts. Hence, a haplogroup J regulatory region mutation affects mtDNA replication or stability, which may partially account for the phenotypic impact of this haplogroup. Our analysis thus demonstrates, for the first time, the functional impact of particular mtDNA

  8. Rif1 controls DNA replication by directing Protein Phosphatase 1 to reverse Cdc7-mediated phosphorylation of the MCM complex.

    Science.gov (United States)

    Hiraga, Shin-Ichiro; Alvino, Gina M; Chang, Fujung; Lian, Hui-Yong; Sridhar, Akila; Kubota, Takashi; Brewer, Bonita J; Weinreich, Michael; Raghuraman, M K; Donaldson, Anne D

    2014-02-15

    Initiation of eukaryotic DNA replication requires phosphorylation of the MCM complex by Dbf4-dependent kinase (DDK), composed of Cdc7 kinase and its activator, Dbf4. We report here that budding yeast Rif1 (Rap1-interacting factor 1) controls DNA replication genome-wide and describe how Rif1 opposes DDK function by directing Protein Phosphatase 1 (PP1)-mediated dephosphorylation of the MCM complex. Deleting RIF1 partially compensates for the limited DDK activity in a cdc7-1 mutant strain by allowing increased, premature phosphorylation of Mcm4. PP1 interaction motifs within the Rif1 N-terminal domain are critical for its repressive effect on replication. We confirm that Rif1 interacts with PP1 and that PP1 prevents premature Mcm4 phosphorylation. Remarkably, our results suggest that replication repression by Rif1 is itself also DDK-regulated through phosphorylation near the PP1-interacting motifs. Based on our findings, we propose that Rif1 is a novel PP1 substrate targeting subunit that counteracts DDK-mediated phosphorylation during replication. Fission yeast and mammalian Rif1 proteins have also been implicated in regulating DNA replication. Since PP1 interaction sites are evolutionarily conserved within the Rif1 sequence, it is likely that replication control by Rif1 through PP1 is a conserved mechanism.

  9. Annotating RNA motifs in sequences and alignments.

    Science.gov (United States)

    Gardner, Paul P; Eldai, Hisham

    2015-01-01

    RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of an RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Dynamic motifs in socio-economic networks

    Science.gov (United States)

    Zhang, Xin; Shao, Shuai; Stanley, H. Eugene; Havlin, Shlomo

    2014-12-01

    Socio-economic networks are of central importance in economic life. We develop a method of identifying and studying motifs in socio-economic networks by focusing on “dynamic motifs,” i.e., evolutionary connection patterns that, because of “node acquaintances” in the network, occur much more frequently than random patterns. We examine two evolving bipartite networks: i) the world-wide commercial ship chartering market and ii) the ship build-to-order market. We find similar dynamic motifs in both bipartite networks, even though they describe different economic activities. We also find that “influence” and “persistence” are strong factors in the interaction behavior of organizations. When two companies are doing business with the same customer, it is highly probable that another customer who currently has a business relationship with only one of these two companies will become a customer of the second in the future. This is the effect of influence. Persistence means that companies with close business ties to customers tend to maintain their relationships over a long period of time.

  11. A phylogenetic study of SPBP and RAI1: evolutionary conservation of chromatin binding modules.

    Directory of Open Access Journals (Sweden)

    Sagar Darvekar

    Our genome is assembled into an array of highly dynamic nucleosome structures allowing spatial and temporal access to DNA. The nucleosomes are subject to a wide array of post-translational modifications, altering the DNA-histone interaction and serving as docking sites for proteins exhibiting effector or "reader" modules. The nuclear proteins SPBP and RAI1 are composed of several putative "reader" modules which may be able to recognise a set of histone modification marks. Here we have performed a phylogenetic study of their putative reader modules: the C-terminal ePHD/ADD-like domain, a novel nucleosome binding region and an AT-hook motif. Interaction studies in vitro and in yeast cells suggested that, despite the extraordinarily long loop region in their ePHD/ADD-like chromatin binding domains, the C-terminal regions of both proteins seem to adopt a cross-braced topology of zinc finger interactions similar to other structurally determined ePHD/ADD structures. Both their ePHD/ADD-like domain and their novel nucleosome binding domain are highly conserved in vertebrate evolution, and construction of a phylogenetic tree displayed two well supported clusters representing SPBP and RAI1, respectively. Their genome and domain organisation suggest that SPBP and RAI1 arose from a gene duplication event. The phylogenetic tree suggests that this duplication happened early in vertebrate evolution, since only one gene was identified in insects and lancelet. Finally, experimental data confirm that the conserved novel nucleosome binding region of RAI1 has the ability to bind the nucleosome core and histones. However, an adjacent conserved AT-hook motif as identified in SPBP is not present in RAI1, and deletion of the novel nucleosome binding region of RAI1 did not significantly affect its nuclear localisation.

  12. Cloning, expression, purification, crystallization and preliminary X-ray diffraction analysis of the central zinc-binding domain of the human Mcm10 DNA-replication factor

    International Nuclear Information System (INIS)

    Jung, Nam Young; Bae, Won Jin; Chang, Jeong Ho; Kim, Young Chang; Cho, Yunje

    2008-01-01

    The initiation of eukaryotic DNA replication requires the tightly controlled assembly of a set of replication factors. Mcm10 is a highly conserved nuclear protein that plays a key role in the initiation and elongation processes of DNA replication by providing a physical link between the Mcm2–7 complex and DNA polymerases. The central domain, which contains the CCCH zinc-binding motif, is most conserved within Mcm10 and binds to DNA and several proteins, including proliferative cell nuclear antigen. In this study, the central domain of human Mcm10 was crystallized using the hanging-drop vapour-diffusion method in the presence of PEG 3350. An X-ray diffraction data set was collected to a resolution of 2.6 Å on a synchrotron beamline. The crystals formed belonged to space group R3, with unit-cell parameters a = b = 99.5, c = 133.0 Å. According to Matthews coefficient calculations, the crystals were predicted to contain six MCM10 central domain molecules in the asymmetric unit.
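
The Matthews coefficient calculation mentioned above can be sketched as follows. This is only an illustration under stated assumptions: the cell is taken in the hexagonal setting of R3 (a = b, gamma = 120°), which has nine asymmetric units per cell, and the molecular mass of the central domain is not given in the abstract, so the 10 kDa value below is a hypothetical placeholder and not a figure from the study.

```python
import math

def hexagonal_cell_volume(a, c):
    """Volume in cubic angstroms of a cell with a = b, alpha = beta = 90, gamma = 120 degrees."""
    return a * a * c * math.sin(math.radians(120.0))

def matthews_coefficient(cell_volume, asu_per_cell, molecules_per_asu, mass_da):
    """V_M = cell volume / (total protein mass in the cell), in cubic angstroms per dalton."""
    return cell_volume / (asu_per_cell * molecules_per_asu * mass_da)

def solvent_fraction(v_m):
    """Matthews' approximation: the protein occupies about 1.23 / V_M of the cell."""
    return 1.0 - 1.23 / v_m

# Unit-cell parameters quoted in the abstract.
volume = hexagonal_cell_volume(a=99.5, c=133.0)

# Assumptions: 9 asymmetric units per cell (R3, hexagonal axes) and a
# HYPOTHETICAL molecular mass of 10 kDa, used only to exercise the formula.
v_m = matthews_coefficient(volume, asu_per_cell=9, molecules_per_asu=6, mass_da=10_000.0)

print(f"cell volume      = {volume:.0f} A^3")
print(f"V_M (placeholder) = {v_m:.2f} A^3/Da")
print(f"solvent fraction  = {solvent_fraction(v_m):.2f}")
```

In practice the number of molecules per asymmetric unit is varied until V_M falls in the range typical for protein crystals, so the real molecular mass would have to be substituted before drawing any conclusion.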

  13. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed

    2011-07-20

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein's function. However, the short length of the motifs and their variable degree of conservation make their identification hard, since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).

  14. Homologous regions of Fen1 and p21Cip1 compete for binding to the same site on PCNA: a potential mechanism to co-ordinate DNA replication and repair.

    Science.gov (United States)

    Warbrick, E; Lane, D P; Glover, D M; Cox, L S

    1997-05-15

    Following genomic damage, the cessation of DNA replication is co-ordinated with onset of DNA repair; this co-ordination is essential to avoid mutation and genomic instability. To investigate these phenomena, we have analysed proteins that interact with PCNA, which is required for both DNA replication and repair. One such protein is p21Cip1, which inhibits DNA replication through its interaction with PCNA, while allowing repair to continue. We have identified an interaction between PCNA and the structure specific nuclease, Fen1, which is involved in DNA replication. Deletion analysis suggests that p21Cip1 and Fen1 bind to the same region of PCNA. Within Fen1 and its homologues a small region (10 amino acids) is sufficient for PCNA binding, which contains an 8 amino acid conserved PCNA-binding motif. This motif shares critical residues with the PCNA-binding region of p21Cip1. A PCNA binding peptide from p21Cip1 competes with Fen1 peptides for binding to PCNA, disrupts the Fen1-PCNA complex in replicating cell extracts, and concomitantly inhibits DNA synthesis. Competition between homologous regions of Fen1 and p21Cip1 for binding to the same site on PCNA may provide a mechanism to co-ordinate the functions of PCNA in DNA replication and repair.

  15. High-Resolution Profiling of Drosophila Replication Start Sites Reveals a DNA Shape and Chromatin Signature of Metazoan Origins

    Directory of Open Access Journals (Sweden)

    Federico Comoglio

    2015-05-01

    At every cell cycle, faithful inheritance of metazoan genomes requires the concerted activation of thousands of DNA replication origins. However, the genetic and chromatin features defining metazoan replication start sites remain largely unknown. Here, we delineate the origin repertoire of the Drosophila genome at high resolution. We address the role of origin-proximal G-quadruplexes and suggest that they transiently stall replication forks in vivo. We dissect the chromatin configuration of replication origins and identify a rich spatial organization of chromatin features at initiation sites. DNA shape and chromatin configurations, not strict sequence motifs, mark and predict origins in higher eukaryotes. We further examine the link between transcription and origin firing and reveal that modulation of origin activity across cell types is intimately linked to cell-type-specific transcriptional programs. Our study unravels conserved origin features and provides unique insights into the relationship among DNA topology, chromatin, transcription, and replication initiation across metazoa.

  16. [Three regions of Rpb10 mini-subunit of nuclear RNA polymerases are strictly conserved in all eukaryotes].

    Science.gov (United States)

    Shpakovskiĭ, G V; Lebedenko, E N

    1996-12-01

    The rpb10+ cDNA from the fission yeast Schizosaccharomyces pombe was cloned using two independent approaches (PCR and genetic suppression). The cloned cDNA encoded the Rpb10 subunit common to all three RNA polymerases. Comparison of the deduced amino acid sequence of the Sz. pombe Rpb10 subunit (71 amino acid residues) with those of the homologous subunits of RNA polymerases I, II, and III from Saccharomyces cerevisiae and Homo sapiens revealed that heptapeptides RCFT/SCGK (residues 6-12), RYCCRRM (residues 43-49), and HVDLIEK (residues 53-59) were evolutionarily the most conserved structural motifs of these subunits. It is shown that the Rpb10 subunit from Sz. pombe can substitute for its homolog (ABC10 beta) in the baker's yeast S. cerevisiae.
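
As a small illustration of how the three conserved heptapeptides could be located in a protein sequence, the sketch below writes the "T/S" alternative as the character class `[TS]` and scans with regular expressions. The 71-residue sequence is a hypothetical toy built from alanine filler with the motifs placed at the reported positions; it is not the real Rpb10 sequence, and the function name is illustrative.

```python
import re

# Heptapeptides quoted in the abstract; "RCFT/SCGK" becomes RCF[TS]CGK.
MOTIF_PATTERNS = ["RCF[TS]CGK", "RYCCRRM", "HVDLIEK"]

def find_motifs(sequence):
    """Return (pattern, start, end, matched_text) tuples with 1-based inclusive positions."""
    hits = []
    for pattern in MOTIF_PATTERNS:
        for match in re.finditer(pattern, sequence):
            hits.append((pattern, match.start() + 1, match.end(), match.group()))
    return sorted(hits, key=lambda hit: hit[1])

# HYPOTHETICAL toy sequence (alanine filler), not the real protein.
TOY_SEQUENCE = (
    "A" * 5 + "RCFTCGK" + "A" * 30 + "RYCCRRM" + "A" * 3 + "HVDLIEK" + "A" * 12
)

for pattern, start, end, text in find_motifs(TOY_SEQUENCE):
    print(f"{pattern:12s} residues {start}-{end}: {text}")
```

Running the same function over aligned homologs from other species would be one simple way to check which positions tolerate substitution, although a profile-based method is more appropriate for anything beyond exact matches.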

  17. CONTEMPORARY USAGE OF TRADITIONAL TURKISH MOTIFS IN PRODUCT DESIGNS

    Directory of Open Access Journals (Sweden)

    Tulay Gumuser

    2012-12-01

    The aim of this study is to identify traditional Turkish motifs and their relation to present-day industrial designs. Traditional Turkish motifs played a very important role from the 16th century onwards. The arts of the Ottoman Empire were used because of their symbolic meanings and unique styles. Examining these motifs, we encounter the Tiger Stripe, Three Spot (Çintemani), Rumi, Hatayi, Penç, Cloud, Crescent, Star, Crown, Hyacinth, Tulip and Carnation motifs. Nowadays, Turkish designers have begun to use these traditional Turkish motifs in their designs to create distinctiveness and awareness in world design. Examples of industrial designs that use Turkish motifs have survived and carry Ottoman heritage and historical value. In this study, the Turkish motifs are examined with a focus on their use in contemporary Turkish industrial designs.

  18. Application of Typical Maluku Ornaments for the Development of Batik Motif Designs (Aplikasi Ornamen Khas Maluku untuk Pengembangan Desain Motif Batik)

    Directory of Open Access Journals (Sweden)

    Masiswo Masiswo

    2016-04-01

    Maluku has a rich decorative cultural heritage in the form of ethnic ornaments, which embody the arts and craft skills handed down by its ancestors. This heritage survives to this day and can still be enjoyed as a source of spiritual satisfaction. To sustain the traditional ethnic values embodied in Maluku ornaments, the ornaments were developed into batik motifs on cloth. The development emphasizes the representation of ornament forms applied to batik craft as motifs characteristic of Maluku. Three alternative batik motif designs derived from typical Maluku ornaments were created, made into product prototypes, and tested for color fastness. In the test of color fastness to wet rubbing, the "Motif Siwa" prototype was rated excellent, and the "Siwa Talang" and "Matahari Siwa Talang" motifs were rated good. Keywords: design, Maluku, batik motif, ornament

  19. Tight regulation of the Epstein-Barr virus setpoint: interindividual differences in Epstein-Barr virus DNA load are conserved after HIV infection

    NARCIS (Netherlands)

    Piriou, Erwan; van Dort, Karel; Otto, Sigrid; van Oers, Marinus H. J.; van Baarle, Debbie

    2008-01-01

    Healthy individuals carry a constant number of Epstein-Barr virus-infected B cells in the peripheral blood over time. Here, we show that interindividual differences in Epstein-Barr virus DNA levels are maintained after HIV infection, providing evidence for the existence of an individual Epstein-Barr

  20. Unlocked nucleic acids with a pyrene-modified uracil: Synthesis, hybridization studies, fluorescent properties and i-motif stability

    DEFF Research Database (Denmark)

    Perlíková, P.; Karlsen, K.K.; Pedersen, E.B.

    2014-01-01

    The synthesis of two new phosphoramidite building blocks for the incorporation of 5-(pyren-1-yl)uracilyl unlocked nucleic acid (UNA) monomers into oligonucleotides has been developed. Monomers containing a pyrene-modified nucleobase component were found to destabilize an i-motif structure at pH 5...... intensities upon hybridization to DNA or RNA. Efficient quenching of fluorescence of pyrene-modified UNA monomers was observed after formation of i-motif structures at pH 5.2. The stabilizing/destabilizing effect of pyrene-modified nucleic acids might be useful for designing antisense oligonucleotides...

  1. Conservation of batik: Conseptual framework of design and process development

    Science.gov (United States)

    Syamwil, Rodia

    2018-03-01

    Developing the Conservation Batik concept has become critical because of the decline of traditional batik, an intangible cultural heritage of humanity. Printed batik, polluting processes, and new design streams are consequences of the batik industry's transformation into a creative industry. Conservation Batik was proposed to answer the threats to traditional batik in terms of technique, process, and motif. However, creativity is also critical to satisfying consumers. Research and development was conducted, starting with initial research to formulate the concept and an exploration of ideas for designing conservation motifs. In the development steps, a cyclical process was used to produce motifs with high preference ratings in terms of aesthetics, productivity, and efficiency. Data were collected through bibliographic study, documentation, observation, and interviews, and analyzed with qualitative methods. The concept of Conservation Batik was adopted from the principles of the Universitas Negeri Semarang (UNNES) vision, as well as theoretical analyses and expert judgment. Conservation Batik is assessed from three aspects: design, process, and consumer preferences. Conservation means the effort of safeguarding, promoting, maintaining, and preserving. The Conservation Batik concept can be interpreted as batik with: (1) traditional values and authenticity; (2) philosophical meanings; (3) an eco-friendly process with minimum waste; (4) conservation as a source of design ideas; and (5) a revival of classic motifs.

  2. Kerawang Carvings of Aceh Gayo as Inspiration for Creating Batik Motifs Characteristic of Gayo (UKIRAN KERAWANG ACEH GAYO SEBAGAI INSPIRASI PENCIPTAAN MOTIF BATIK KHAS GAYO)

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2016-12-01

    The batik industry has begun to develop in Gayo, but the region does not yet have its own characteristic batik motifs. It is therefore necessary to create batik motifs characteristic of Gayo, taking inspiration from the carvings found on traditional houses, commonly called kerawang Gayo carvings. The purpose of this art creation is to produce batik motifs with Gayo characteristics. The method comprised idea exploration, design, and realization as batik motifs. Six Gayo batik motifs were created: (1) Motif Ceplok Gayo; (2) Motif Gayo Tegak; (3) Motif Gayo Lurus; (4) Motif Parang Gayo; (5) Motif Gayo Lembut; and (6) Motif Geometris Gayo. A preference test of the motifs with fifty respondents showed that Motif Ceplok Gayo was chosen most often (19%), followed by Motif Parang Gayo (18%), Motif Gayo Lembut (17%), Motif Geometris Gayo (17%), Motif Gayo Lurus (15%) and Motif Gayo Tegak (14%). On average the motifs were well received by respondents, so all of them are suitable for production as batik characteristic of Gayo. Keywords: batik Gayo, Motif Ceplok Gayo, Motif Parang Gayo.

  3. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it to a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways, and the evaluation showed that the speed and scalability of this algorithm outperform those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manage the whole process, from building the signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and the mechanisms underlying their regulatory activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.
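
To make the idea of a regulatory motif concrete, the sketch below searches a small signed network for one well-known pattern, the feed-forward loop (X regulates Y, Y regulates Z, and X also regulates Z directly). It is a brute-force enumeration over node triples and is not RMOD's path-tree algorithm, whose details the abstract does not give; the network, node names and function name are hypothetical.

```python
from itertools import permutations

def find_feed_forward_loops(edges):
    """Find feed-forward loops in a signed directed network.

    edges maps (source, target) to +1 (activation) or -1 (inhibition).
    Returns (x, y, z, coherent) tuples, where coherent is True when the
    sign of the indirect path x -> y -> z equals the sign of x -> z.
    """
    nodes = sorted({node for edge in edges for node in edge})
    loops = []
    for x, y, z in permutations(nodes, 3):
        if (x, y) in edges and (y, z) in edges and (x, z) in edges:
            indirect_sign = edges[(x, y)] * edges[(y, z)]
            loops.append((x, y, z, indirect_sign == edges[(x, z)]))
    return loops

# HYPOTHETICAL toy signaling network.
TOY_NETWORK = {
    ("A", "B"): +1,
    ("B", "C"): +1,
    ("A", "C"): +1,   # A-B-C is a coherent feed-forward loop
    ("C", "D"): -1,
    ("A", "D"): +1,   # A-C-D is an incoherent feed-forward loop
}

for x, y, z, coherent in find_feed_forward_loops(TOY_NETWORK):
    kind = "coherent" if coherent else "incoherent"
    print(f"{x} -> {y} -> {z} with direct {x} -> {z}: {kind}")
```

Enumerating every triple costs on the order of n³ checks, which is workable for toy networks only; this scaling problem is what motivates specialised subgraph search structures for large signaling networks.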

  4. Requirement for asparagine in the aquaporin NPA sequence signature motifs for cation exclusion

    DEFF Research Database (Denmark)

    Wree, Dorothea; Wu, Binghua; Zeuthen, Thomas

    2011-01-01

    Two highly conserved NPA motifs are a hallmark of the aquaporin (AQP) family. The NPA triplets form N-terminal helix capping structures with the Asn side chains located in the centre of the water or solute-conducting channel, and are considered to play an important role in AQP selectivity. Although...... interchangeable at both NPA sites without affecting protein expression or water, glycerol and methylamine permeability. However, other mutations in the NPA region led to reduced permeability (S186C and S186D), to nonfunctional channels (N64D), or even to lack of protein expression (S186A and S186T). Using...... electrophysiology, we found that an analogous mammalian AQP1 N76S mutant excluded protons and potassium ions, but leaked sodium ions, providing an argument for the overwhelming prevalence of Asn over other amino acids. We conclude that, at the first position in the NPA motifs, only Asn provides efficient helix cap...

  5. Does the evolutionary conservation of microsatellite loci imply function?

    Energy Technology Data Exchange (ETDEWEB)

    Shriver, M.D.; Deka, R.; Ferrell, R.E. [Univ. of Pittsburgh, PA (United States)] [and others]

    1994-09-01

    Microsatellites are highly polymorphic tandem arrays of short (1-6 bp) sequence motifs which have been found widely distributed in the genomes of all eukaryotes. We have analyzed allele frequency data on 16 microsatellite loci typed in the great apes (human, chimp, orangutan, and gorilla). The majority of these loci (13) were isolated from human genomic libraries; three were cloned from chimpanzee genomic DNA. Most of these loci are not only present in all ape species, but are polymorphic with comparable levels of heterozygosity and have alleles which overlap in size. The extent of divergence of allele frequencies among these four species was studied using the stepwise-weighted genetic distance (Dsw), which was previously shown to conform to linearity with evolutionary time since divergence for loci where mutations occur in a stepwise fashion. The phylogenetic tree of the great apes constructed from this distance matrix was consistent with the expected topology, with a high bootstrap confidence (82%) for the human/chimp clade. However, the allele frequency distributions of these species are 10 times more similar to each other than expected when they were calibrated with a conservative estimate of the time since separation of humans and the apes. These results are in agreement with sequence-based surveys of microsatellites which have demonstrated that they are highly (90%) conserved over short periods of evolutionary time (< 10 million years) and moderately (30%) conserved over long periods of evolutionary time (> 60-80 million years). This evolutionary conservation has prompted some authors to speculate that there are functional constraints on microsatellite loci. In contrast, the presence of directional bias of mutations with constraints and/or selection against aberrant sized alleles can explain these results.

  6. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

    Science.gov (United States)

    Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide), represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred because neighboring alleles are more easily separated and distinguished from each other, which makes the interpretation of electropherograms and the calling of true alleles more reliable. In the present work, to characterize a set of core SSR markers with long-core motifs for fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially chosen long-core motif SSR markers covering all 15 linkage groups of the tea plant genome. A set of 6 SSR markers was selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10^-5, which is quite low and confirms the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, to discriminate the clonal tea cultivars quickly, a cultivar identification diagram (CID) was established using these core markers; it fully reflects the identification process and shows immediately which SSR markers are needed to identify a cultivar chosen among the tested ones. The results suggest that the long-core motif SSR markers used in the investigation contribute to the accurate and efficient identification of clonal tea cultivars and enable the protection of intellectual property.
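
The two marker statistics quoted above can be sketched from allele frequencies as follows. The abstract does not say which estimators the authors used, so this assumes the commonly cited forms: PIC = 1 − Σpᵢ² − Σ 2pᵢ²pⱼ² (summed over allele pairs i < j) and the per-locus probability of identity PID = Σpᵢ⁴ + Σ(2pᵢpⱼ)², with the combined PID taken as the product over loci under an independence assumption. The allele frequencies are hypothetical.

```python
from itertools import combinations
from math import prod

def pic(freqs):
    """Polymorphic information content of one locus from its allele frequencies."""
    homozygosity = sum(p * p for p in freqs)
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - correction

def probability_of_identity(freqs):
    """Probability that two random individuals share a genotype at one locus."""
    same_homozygote = sum(p ** 4 for p in freqs)
    same_heterozygote = sum((2 * p * q) ** 2 for p, q in combinations(freqs, 2))
    return same_homozygote + same_heterozygote

def combined_pid(loci):
    """Product of per-locus PID values, assuming the loci are independent."""
    return prod(probability_of_identity(freqs) for freqs in loci)

# HYPOTHETICAL allele frequencies for three markers (each list sums to 1).
toy_loci = [
    [0.25, 0.25, 0.25, 0.25],
    [0.5, 0.5],
    [0.4, 0.3, 0.2, 0.1],
]

for index, freqs in enumerate(toy_loci, start=1):
    print(f"marker {index}: PIC = {pic(freqs):.3f}, PID = {probability_of_identity(freqs):.3f}")
print(f"combined PID = {combined_pid(toy_loci):.5f}")
```

Each added informative marker multiplies the combined PID by a factor below one, which is why a small core set can be enough to discriminate cultivars.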

  7. SiteBinder: an improved approach for comparing multiple protein structural motifs.

    Science.gov (United States)

    Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

    2012-02-27

    There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.

  8. Structural motifs of pre-nucleation clusters.

    Science.gov (United States)

    Zhang, Y; Türkmen, I R; Wassermann, B; Erko, A; Rühl, E

    2013-10-07

    Structural motifs of pre-nucleation clusters prepared in single, optically levitated supersaturated aqueous aerosol microparticles containing CaBr2 as a model system are reported. Cluster formation is identified by means of X-ray absorption in the Br K-edge regime. The salt concentration beyond the saturation point is varied by controlling the humidity in the ambient atmosphere surrounding the 15-30 μm microdroplets. This leads to the formation of metastable supersaturated liquid particles. Distinct spectral shifts in near-edge spectra as a function of salt concentration are observed, in which the energy position of the Br K-edge is red-shifted by up to 7.1 ± 0.4 eV if the dilute solution is compared to the solid. The K-edge positions of supersaturated solutions are found between these limits. The changes in electronic structure are rationalized in terms of the formation of pre-nucleation clusters. This assumption is verified by spectral simulations using first-principle density functional theory and molecular dynamics calculations, in which structural motifs are considered, explaining the experimental results. These consist of solvated CaBr2 moieties, rather than building blocks forming calcium bromide hexahydrates, the crystal system that is formed by drying aqueous CaBr2 solutions.

  9. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-11-16

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. The PDZ-binding motif of Yes-associated protein is required for its co-activation of TEAD-mediated CTGF transcription and oncogenic cell transforming activity

    International Nuclear Information System (INIS)

    Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji; Miura, Ryota; Hirayama, Jun; Nishina, Hiroshi

    2014-01-01

    Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains a highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription

  11. The PDZ-binding motif of Yes-associated protein is required for its co-activation of TEAD-mediated CTGF transcription and oncogenic cell transforming activity

    Energy Technology Data Exchange (ETDEWEB)

    Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji; Miura, Ryota; Hirayama, Jun, E-mail: hirayama.dbio@mri.tmd.ac.jp; Nishina, Hiroshi, E-mail: nishina.dbio@mri.tmd.ac.jp

    2014-01-17

    Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains a highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription.

  12. Characterization and evolution of the mitochondrial DNA control region in hornbills (Bucerotiformes).

    Science.gov (United States)

    Delport, Wayne; Ferguson, J Willem H; Bloomer, Paulette

    2002-06-01

    We determined the mitochondrial DNA control region sequences of six Bucerotiformes. Hornbills have the typical avian gene order and their control region is similar to other avian control regions in that it is partitioned into three domains: two variable domains that flank a central conserved domain. Two characteristics of the hornbill control region sequence differ from that of other birds. First, domain I is AT rich as opposed to AC rich, and second, the control region is approximately 500 bp longer than that of other birds. Both these deviations from typical avian control region sequence are explainable on the basis of repeat motifs in domain I of the hornbill control region. The repeat motifs probably originated from a duplication of CSB-1 as has been determined in chicken, quail, and snowgoose. Furthermore, the hornbill repeat motifs probably arose before the divergence of hornbills from each other but after the divergence of hornbills from other avian taxa. The mitochondrial control region of hornbills is suitable for both phylogenetic and population studies, with domains I and II probably more suited to population and phylogenetic analyses, respectively.

  13. Parole, Sintagmatik, dan Paradigmatik Motif Batik Mega Mendung

    Directory of Open Access Journals (Sweden)

    Rudi - Nababan

    2012-04-01

    ABSTRACT Any discussion of traditional batik is closely tied to the system that organizes its accompanying fine-arts elements, whether the pattern of the motif or the technique of its making. The Mega Mendung motif of Cirebon has patterns and rules that traditionally differ from those of motifs from other areas. Semiotic analysis, especially using the concepts of Saussure and Peirce, shows that batik with a Cirebon motif, here the Mega Mendung motif, has a parole and langue system, as a unique fine-arts language in batik, and a visual syntagmatic and paradigmatic structure. As a fine-arts language, the batik motif is also related to the sign system as symbol and icon. Keywords: visual semiotic, Cirebon’s batik.

  14. Crammed signaling motifs in the T-cell receptor.

    Science.gov (United States)

    Borroto, Aldo; Abia, David; Alarcón, Balbino

    2014-09-01

    Although the T cell antigen receptor (TCR) is long known to contain multiple signaling subunits (CD3γ, CD3δ, CD3ɛ and CD3ζ), their role in signal transduction is still not well understood. The presence of at least one immunoreceptor tyrosine-based activation motif (ITAM) in each CD3 subunit has led to the idea that the multiplication of such elements essentially serves to amplify signals. However, the evolutionary conservation of non-ITAM sequences suggests that each CD3 subunit is likely to have specific non-redundant roles at some stage of development or in mature T cell function. The CD3ɛ subunit is paradigmatic because in a relatively short cytoplasmic sequence (∼55 amino acids) it contains several docking sites for proteins involved in intracellular trafficking and signaling, proteins whose relevance in T cell activation is slowly starting to be revealed. In this review we will summarize our current knowledge on the signaling effectors that bind directly to the TCR and we will propose a hierarchy in their response to TCR triggering. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Rapid identification of DNA-binding proteins by mass spectrometry

    DEFF Research Database (Denmark)

    Nordhoff, E.; Korgsdam, A.-M.; Jørgensen, H.F.

    1999-01-01

    We report a protocol for the rapid identification of DNA-binding proteins. Immobilized DNA probes harboring a specific sequence motif are incubated with cell or nuclear extract. Proteins are analyzed directly off the solid support by matrix-assisted laser desorption/ionization time-of-flight mass...... was validated by the identification of known prokaryotic and eukaryotic DNA-binding proteins, and its use provided evidence that poly(ADP-ribose) polymerase exhibits DNA sequence-specific binding to DNA....

  16. Conservation science in a terrorist age: the impact of airport security screening on the viability and DNA integrity of frozen felid spermatozoa.

    Science.gov (United States)

    Gloor, Kayleen T; Winget, Doug; Swanson, William F

    2006-09-01

    In response to growing terrorism concerns, the Transportation Security Administration now requires that all checked baggage at U.S. airports be scanned through a cabinet x-ray system, which may increase risk of radiation damage to transported biologic samples and other sensitive genetic material. The objective of this study was to investigate the effect of these new airport security regulations on the viability and DNA integrity of frozen felid spermatozoa. Semen was collected from two domestic cats (Felis silvestris catus) and one fishing cat (Prionailurus viverrinus), cryopreserved in plastic freezing straws, and transferred into liquid nitrogen dry shippers for security screening. Treatment groups included frozen samples from each male scanned once or three times using a Transportation Security Administration-operated cabinet x-ray system, in addition to non-scanned samples (i.e., negative control) and samples previously scanned three times and exposed to five additional high-intensity x-ray bursts (i.e., positive control). Dosimeters placed in empty dry shippers were used to quantify radiation exposure. Following treatment, straws were thawed and spermatozoa analyzed for post-thaw motility (percentage motile and rate of progressive movement), acrosome status, and DNA integrity using single-cell gel electrophoresis (i.e., the comet assay). Dosimeter measurements determined that each airport screening procedure produced approximately 16 mrem of radiation exposure. Our results indicated that all levels of radiation exposure adversely affected (P 0.05) among treatment groups. Results also showed that the amount of double-stranded DNA damage was greater (P cat species scanned three times compared to samples scanned once or negative controls. Findings suggest that new airport security measures may cause radiation-induced damage to frozen spermatozoa and other valuable biologic samples transported on passenger aircraft and that alternative modes of sample

  17. Evidence on How a Conserved Glycine in the Hinge Region of HapR Regulates Its DNA Binding Ability: LESSONS FROM A NATURAL VARIANT.

    Energy Technology Data Exchange (ETDEWEB)

    M Dongre; N Singh; C Dureja; N Peddada; A Solanki; F Ashish; S Raychaudhuri

    2011-12-31

    HapR has been recognized as a quorum-sensing master regulator in Vibrio cholerae. Because it controls a plethora of disparate cellular events, the absence of a functional HapR affects the physiology of V. cholerae to a great extent. In the current study, we pursued an understanding of an observation of a natural protease-deficient non-O1, non-O139 variant V. cholerae strain V2. Intriguingly, a nonfunctional HapR (henceforth designated as HapRV2) harboring a substitution of glycine to aspartate at position 39 of the N-terminal hinge region has been identified. An in vitro gel shift assay clearly suggested the inability of HapRV2 to interact with various cognate promoters. Reinstatement of glycine at position 39 restores DNA binding ability of HapRV2 (HapRV2G), thereby rescuing the protease-negative phenotype of this strain. The elution profile of HapRV2 and HapRV2G proteins in size-exclusion chromatography and their circular dichroism spectra did not reflect any significant differences to explain the functional discrepancies between the two proteins. To gain insight into the structure-function relationship of these two proteins, we acquired small/wide angle x-ray scattering data from samples of the native and G39D mutant. Although Guinier analysis and indirect Fourier transformation of scattering indicated only a slight difference in the shape parameters, structure reconstruction using dummy amino acids concluded that although HapR adopts a 'Y' shape similar to its crystal structure, the G39D mutation in hinge drastically altered the DNA binding domains by bringing them in close proximity. This altered spatial orientation of the helix-turn-helix domains in this natural variant provides the first structural evidence on the functional role of the hinge region in quorum sensing-related DNA-binding regulatory proteins of Vibrio spp.

  18. Defining the plasticity of transcription factor binding sites by Deconstructing DNA consensus sequences: the PhoP-binding sites among gamma/enterobacteria.

    Directory of Open Access Journals (Sweden)

    Oscar Harari

    2010-07-01

    Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. The detection of the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg(2+) homeostasis in several bacterial species. PhoP is particularly interesting; it is highly conserved in different gamma/enterobacteria, regulating not only ancestral genes but also governing the expression of dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (i.e., termed submotifs) using a machine learning method inspired by the "Divide & Conquer" strategy. By partitioning a motif into sub-patterns, computational advantages for classification were produced, resulting in the discovery of new members of a regulon, and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analysis. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not due to the binding sites, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target

  19. Good news for conservation: mitochondrial and microsatellite DNA data detect limited genetic signatures of inter-basin fish transfer in Thymallus thymallus (Salmonidae) from the Upper Drava River

    Directory of Open Access Journals (Sweden)

    Meraner A.

    2013-06-01

    In the last few decades, numerous populations of European grayling, Thymallus thymallus, have been suffering from stocking-induced genetic admixture of foreign strains into wild populations. Concordantly, genetic introgression was also reported for grayling stocks inhabiting the Upper Drava River, but all published genetic data were based on specimens caught at least a decade ago, when stocking load was strong. Here, we applied mitochondrial control region sequencing and nuclear microsatellite genotyping to Upper Drava grayling fry collections and reference samples to update patterns and extent of human-mediated introgression. In contrast to previous data, we found near-complete genetic integrity of Drava grayling, with only limited genetic signatures of trans-basin stocking with grayling of Northern Alpine Danubian origin. Recent hybridisation was detected only twice among sixty-nine samples, while several cases of later-generation hybrids were disclosed by linking mitochondrial sequence to nuclear genetic data. The observed past, but very limited recent, genetic introgression in grayling from the Upper Drava seems to reflect shifting stocking trends, changing from massive introduction of trans-basin fish to more conservation-oriented strategies during the last 27 years. In a conservation context, we encourage pursuing the use of local wild grayling for supportive- and captive-breeding, but underline the need for genetic approaches in brood-stock selection programs. Finally, our integrated results from sibship reconstruction validate our strictly fry-based sampling scheme, thus offering a reasonable alternative also for other rheophilic fish species with similar life-history characteristics.

  20. GNG Motifs Can Replace a GGG Stretch during G-Quadruplex Formation in a Context Dependent Manner.

    Directory of Open Access Journals (Sweden)

    Kohal Das

    G-quadruplexes are one of the most commonly studied non-B DNA structures. Generally, these structures are formed from a minimum of four tracts of three guanines each, with connecting loops ranging from one to seven nucleotides. Recent studies have reported deviations from this general convention. One such deviation is the involvement of bulges in the guanine tracts. In this study, guanines along with bulges, also referred to as GNG motifs, have been extensively studied using the recently reported HOX11 breakpoint fragile region I as a model template. Using a strategic mutagenesis approach, we show that the contribution from continuous G-tracts may be dispensable during G-quadruplex formation when such motifs are flanked by GNGs. Importantly, the positioning and number of GNG/GNGNG can also influence the formation of G-quadruplexes. Further, we assessed three genomic regions from the HIF1 alpha, VEGF and SHOX genes for G-quadruplex formation using GNG motifs. We show that the HIF1 alpha sequence harbouring GNG motifs can fold into an intramolecular G-quadruplex. In contrast, GNG motifs in the mutant VEGF sequence could not participate in structure formation, suggesting that the usage of GNG is context dependent. Importantly, we show that when two continuous stretches of guanines are flanked by two independent GNG motifs in a naturally occurring sequence (SHOX), it can fold into an intramolecular G-quadruplex. Finally, we show the specific binding of the G-quadruplex binding protein Nucleolin and the G-quadruplex antibody BG4 to the SHOX G-quadruplex. Overall, our study provides novel insights into the role of GNG motifs in G-quadruplex structure formation, which may have both physiological and pathological implications.
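
    The sequence convention in this abstract (four tracts of at least three guanines, separated by loops of one to seven nucleotides) and its GNG relaxation can be written as a pair of regular expressions. The sketch below is illustrative only: the names are hypothetical, and treating a bulged tract as exactly "G, one non-G base, G" is an assumption rather than the authors' definition. Since the study reports that GNG usage is context dependent, a match marks a candidate sequence, not a confirmed structure.

```python
import re

LOOP = "[ACGT]{1,7}"                 # connecting loop of one to seven nucleotides
CANONICAL_TRACT = "G{3,}"            # continuous run of at least three guanines
RELAXED_TRACT = "(?:G{3,}|G[ACT]G)"  # assumption: a bulged tract is G, one non-G base, G


def g4_pattern(tract):
    """Compile a pattern of four tracts joined by three loops."""
    return re.compile(LOOP.join([tract] * 4))


CANONICAL_G4 = g4_pattern(CANONICAL_TRACT)
RELAXED_G4 = g4_pattern(RELAXED_TRACT)


def find_candidates(seq, pattern):
    """Return (start, matched_sequence) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in pattern.finditer(seq.upper())]


# Example sequences chosen for illustration (not taken from the study)
canonical_seq = "GGGTTAGGGTTAGGGTTAGGG"
bulged_seq = "GGGTTAGTGTTAGGGTTAGGG"  # second tract replaced by GTG

print(find_candidates(canonical_seq, CANONICAL_G4))
print(find_candidates(bulged_seq, CANONICAL_G4))
print(find_candidates(bulged_seq, RELAXED_G4))
```

    The bulged sequence is missed by the canonical pattern but picked up by the relaxed one, which is the kind of candidate that would then need experimental confirmation.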

  1. Novel Strategy for Discrimination of Transcription Factor Binding Motifs Employing Mathematical Neural Network

    Science.gov (United States)

    Sugimoto, Asuka; Sumi, Takuya; Kang, Jiyoung; Tateno, Masaru

    2017-07-01

    Recognition in biological macromolecular systems, such as DNA-protein recognition, is one of the most crucial problems to solve toward understanding the fundamental mechanisms of various biological processes. Since specific base sequences of genome DNA are discriminated by proteins, such as transcription factors (TFs), finding TF binding motifs (TFBMs) in whole genome DNA sequences is currently a central issue in interdisciplinary biophysical and information sciences. In the present study, a novel strategy to create a discriminant function for discrimination of TFBMs by constituting mathematical neural networks (NNs) is proposed, together with a method to determine the boundary of signals (TFBMs) and noise in the NN-score (output) space. This analysis also leads to the mathematical limitation of discrimination in the recognition of features representing TFBMs, in an information geometrical manifold. Thus, the present strategy enables the identification of the whole space of TFBMs, right up to the noise boundary.

  2. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs) and 'functional' (partial subgraphs). Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.
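
    To illustrate how a per-node subgraph count can be read directly from a directed adjacency matrix, the sketch below counts, for each node, the directed 3-cycles it participates in, using the diagonal of the cubed adjacency matrix. It covers a single motif only and is not the paper's full fingerprint vector or its conversion matrix; the function names are hypothetical, and a binary adjacency matrix without self-loops is assumed. Because other edges among the three nodes are ignored, this is a count in the 'functional' (partial subgraph) sense used in the abstract.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def three_cycle_participation(adj):
    """Number of directed 3-cycles through each node.

    adj[i][j] == 1 means an edge i -> j. With no self-loops, every closed
    walk of length 3 starting at node i is a directed triangle through i,
    so the count is the i-th diagonal entry of adj cubed.
    """
    a3 = matmul(matmul(adj, adj), adj)
    return [a3[i][i] for i in range(len(adj))]


# Edges: 0 -> 1, 1 -> 2, 2 -> 0 form one cycle; 2 -> 3 is a dangling edge
adj = [
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [1, 0, 0, 1],
    [0, 0, 0, 0],
]
print(three_cycle_participation(adj))  # [1, 1, 1, 0]
```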

  3. Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.

    Science.gov (United States)

    Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M

    1997-01-01

    RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620

  4. Rekayasa Pengembangan Desain Motif Batik Khas Melayu

    Directory of Open Access Journals (Sweden)

    Eustasia Sri Murwati

    2016-04-01

    ABSTRACT (translated from Indonesian) The development of batik design through design engineering based on Malay ornamentation covers the development of motifs and processes, including the choice of color composition. The processes most often used are dyeing, wax removal and overdyeing; or colet (brush-applied coloring), dyeing and wax removal; or dyeing followed by wax removal, which is called Batik Kelengan. Every island in Indonesia has its own distinctive culture and arts, known through regional patterns and ornamentation, as well as ornaments favored by people from that region or from other regions. These conditions encourage the growth of craft industries that make use of artistic elements. The motifs obtained are: Ayam Berlaga, Bungo Matahari, Kuntum Bersanding, Lancang Kuning, Encong Kerinci, Durian Pecah, Bungo Bintang, Bungo Pauh Kecil, Riang-riang, Bungo Nagaro. From these designs, the three best products were selected by five assessors expert in batik design, namely the Durian Pecah, Ayam Berlaga and Bungo Matahari motifs. Design diversification drew on the artistic elements and skills of the Malay ethnic group, namely selecting Malay ornamentation and batik motifs to apply to clothing materials with attractive color compositions, so that the products meet consumer tastes. Batik diversity was improved by enhancing product design, among other things by rendering Malay ornamentation in a batik process that uses a variety of colors so that the color composition is adequate. The result was high-quality batik products with Malay ornamentation and color compositions that suit the character of Malay ornamentation. Product design engineering, aimed at obtaining a design formulation and a feasible process with an emphasis on environmentally friendly technology, was carried out through an alternative approach, namely the creation of new design forms. Keywords: design, batik, design engineering, ornamentation, Malay. ABSTRACT Development of batik design through

  5. Transnationalism as a motif in family stories.

    Science.gov (United States)

    Stone, Elizabeth; Gomez, Erica; Hotzoglou, Despina; Lipnitsky, Jane Y

    2005-12-01

    Family stories have long been recognized as a vehicle for assessing components of a family's emotional and social life, including the degree to which an immigrant family has been willing to assimilate. Transnationalism, defined as living in one or more cultures and maintaining connections to both, is now increasingly common. A qualitative study of family stories in the family of those who appear completely "American" suggests that an affiliation with one's home country is nevertheless detectable in the stories via motifs such as (1) positively connotated home remedies, (2) continuing denigration of home country "enemies," (3) extensive knowledge of the home country history and politics, (4) praise of endogamy and negative assessment of exogamy, (5) superiority of home country to America, and (6) beauty of home country. Furthermore, an awareness of which model--assimilationist or transnational--governs a family's experience may help clarify a clinician's understanding of a family's strengths, vulnerabilities, and mode of framing their cultural experiences.

  6. Insights into the Pathogenesis of Anaplastic Large-Cell Lymphoma through Genome-wide DNA Methylation Profiling

    Directory of Open Access Journals (Sweden)

    Melanie R. Hassler

    2016-10-01

    Aberrant DNA methylation patterns in malignant cells allow insight into tumor evolution and development and can be used for disease classification. Here, we describe the genome-wide DNA methylation signatures of NPM-ALK-positive (ALK+) and NPM-ALK-negative (ALK−) anaplastic large-cell lymphoma (ALCL). We find that ALK+ and ALK− ALCL share common DNA methylation changes for genes involved in T cell differentiation and immune response, including TCR and CTLA-4, without an ALK-specific impact on tumor DNA methylation in gene promoters. Furthermore, we uncover a close relationship between global ALCL DNA methylation patterns and those in distinct thymic developmental stages and observe tumor-specific DNA hypomethylation in regulatory regions that are enriched for conserved transcription factor binding motifs such as AP1. Our results indicate similarity between ALCL tumor cells and thymic T cell subsets and a direct relationship between ALCL oncogenic signaling and DNA methylation through transcription factor induction and occupancy.

  7. Control of DEMETER DNA demethylase gene transcription in male and female gamete companion cells in Arabidopsis thaliana.

    Science.gov (United States)

    Park, Jin-Sup; Frost, Jennifer M; Park, Kyunghyuk; Ohr, Hyonhwa; Park, Guen Tae; Kim, Seohyun; Eom, Hyunjoo; Lee, Ilha; Brooks, Janie S; Fischer, Robert L; Choi, Yeonhee

    2017-02-21

    The DEMETER (DME) DNA glycosylase initiates active DNA demethylation via the base-excision repair pathway and is vital for reproduction in Arabidopsis thaliana. DME-mediated DNA demethylation is preferentially targeted to small, AT-rich, and nucleosome-depleted euchromatic transposable elements, influencing expression of adjacent genes and leading to imprinting in the endosperm. In the female gametophyte, DME expression and subsequent genome-wide DNA demethylation are confined to the companion cell of the egg, the central cell. Here, we show that, in the male gametophyte, DME expression is limited to the companion cell of sperm, the vegetative cell, and to a narrow window of time: immediately after separation of the companion cell lineage from the germline. We define transcriptional regulatory elements of DME using reporter genes, showing that a small region, which surprisingly lies within the DME gene, controls its expression in male and female companion cells. DME expression from this minimal promoter is sufficient to rescue seed abortion and the aberrant DNA methylome associated with the null dme-2 mutation. Within this minimal promoter, we found short, conserved enhancer sequences necessary for the transcriptional activities of DME and combined predicted binding motifs with published transcription factor binding coordinates to produce a list of candidate upstream pathway members in the genetic circuitry controlling DNA demethylation in gamete companion cells. These data show how DNA demethylation is regulated to facilitate endosperm gene imprinting and potential transgenerational epigenetic regulation, without subjecting the germline to potentially deleterious transposable element demethylation.

  8. Motif decomposition of the phosphotyrosine proteome reveals a new N-terminal binding motif for SHIP2

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Hanke, S.; Hinsby, A. M.

    2008-01-01

    set of 481 unique phosphotyrosine (Tyr(P)) peptides by sequence similarity to known ligands of the Src homology 2 (SH2) and the phosphotyrosine binding (PTB) domains. From 20 clusters we extracted 16 known and four new interaction motifs. Using quantitative mass spectrometry we pulled down Tyr......(P)-specific binding partners for peptides corresponding to the extracted motifs. We confirmed numerous previously known interaction motifs and found 15 new interactions mediated by phosphosites not previously known to bind SH2 or PTB. Remarkably, a novel hydrophobic N-terminal motif ((L/V/I)(L/V/I)pY) was identified...

  9. Role of NH2-terminal hydrophobic motif in the subcellular localization of ATP-binding cassette protein subfamily D: Common features in eukaryotic organisms

    International Nuclear Information System (INIS)

    Lee, Asaka; Asahina, Kota; Okamoto, Takumi; Kawaguchi, Kosuke; Kostsin, Dzmitry G.; Kashiwayama, Yoshinori; Takanashi, Kojiro; Yazaki, Kazufumi; Imanaka, Tsuneo; Morita, Masashi

    2014-01-01

    Highlights: • ABCD proteins are classified by the presence or absence of an NH2-terminal hydrophobic segment. • ABCD proteins with the segment are targeted to peroxisomes. • ABCD proteins without the segment are targeted to the endoplasmic reticulum. • The role of the segment in organelle targeting is conserved in eukaryotic organisms. - Abstract: In mammals, four ATP-binding cassette (ABC) proteins belonging to subfamily D have been identified. ABCD1–3 possess the NH2-terminal hydrophobic region and are targeted to peroxisomes, while ABCD4, which lacks the region, is targeted to the endoplasmic reticulum (ER). Based on hydropathy plot analysis, we found that several eukaryotes have ABCD protein homologs lacking the NH2-terminal hydrophobic segment (H0 motif). To investigate whether the role of the NH2-terminal H0 motif in subcellular localization is conserved across species, we expressed ABCD proteins from several species (metazoan, plant and fungi) in fusion with GFP in CHO cells and examined their subcellular localization. ABCD proteins possessing the NH2-terminal H0 motif were localized to peroxisomes, while ABCD proteins lacking this region lost this capacity. In addition, the deletion of the NH2-terminal H0 motif of ABCD protein resulted in their localization to the ER. These results suggest that the role of the NH2-terminal H0 motif in organelle targeting is widely conserved in living organisms.

  10. Conservation Value

    OpenAIRE

    Tisdell, Clement A.

    2010-01-01

    This paper outlines the significance of the concept of conservation value and discusses ways in which it is determined paying attention to views stemming from utilitarian ethics and from deontological ethics. The importance of user costs in relation to economic decisions about the conservation and use of natural resources is emphasised. Particular attention is given to competing views about the importance of conserving natural resources in order to achieve economic sustainability. This then l...

  11. Net (ERP/SAP2) one of the Ras-inducible TCFs, has a novel inhibitory domain with resemblance to the helix-loop-helix motif.

    Science.gov (United States)

    Maira, S M; Wurtz, J M; Wasylyk, B

    1996-11-01

    The three ternary complex factors (TCFs), Net (ERP/SAP-2), ELK-1 and SAP-1, are highly related ets oncogene family members that participate in the response of the cell to Ras and growth signals. Understanding the different roles of these factors will provide insights into how the signals result in coordinate regulation of the cell. We show that Net inhibits transcription under basal conditions, in which SAP-1a is inactive and ELK-1 stimulates. Repression is mediated by the NID, the Net Inhibitory Domain of about 50 amino acids, which autoregulates the Net protein and also inhibits when it is isolated in a heterologous fusion protein. Net is particularly sensitive to Ras activation. Ras activates Net through the C-domain, which is conserved between the three TCFs, and the NID is an efficient inhibitor of Ras activation. The NID, as well as more C-terminal sequences, inhibit DNA binding. Net is more refractory to DNA binding than the other TCFs, possibly due to the presence of multiple inhibitory elements. The NID may adopt a helix-loop-helix (HLH) structure, as evidenced by homology to other HLH motifs, structure predictions, model building and mutagenesis of critical residues. The sequence resemblance with myogenic factors suggested that Net may form complexes with the same partners. Indeed, we found that Net can interact in vivo with the basic HLH factor, E47. We propose that Net is regulated at the level of its latent DNA-binding activity by protein interactions and/or phosphorylation. Net may form complexes with HLH proteins as well as SRF on specific promoter sequences. The identification of the novel inhibitory domain provides a new inroad into exploring the different roles of the ternary complex factors in growth control and transformation.

  12. Relative Stabilities of Conserved and Non-Conserved Structures in the OB-Fold Superfamily

    Directory of Open Access Journals (Sweden)

    Andrei T. Alexandrescu

    2009-05-01

    The OB-fold is a diverse structure superfamily based on a β-barrel motif that is often supplemented with additional non-conserved secondary structures. Previous deletion mutagenesis and NMR hydrogen exchange studies of three OB-fold proteins showed that the structural stabilities of sites within the conserved β-barrels were larger than sites in non-conserved segments. In this work we examined a database of 80 representative domain structures currently classified as OB-folds, to establish the basis of this effect. Residue-specific values were obtained for the number of Cα-Cα distance contacts, sequence hydrophobicities, crystallographic B-factors, and theoretical B-factors calculated from a Gaussian Network Model. All four parameters point to a larger average flexibility for the non-conserved structures compared to the conserved β-barrels. The theoretical B-factors and contact densities show the highest sensitivity. Our results suggest a model of protein structure evolution in which novel structural features develop at the periphery of conserved motifs. Core residues are more resistant to structural changes during evolution since their substitution would disrupt a larger number of interactions. Similar factors are likely to account for the differences in stability to unfolding between conserved and non-conserved structures.

  13. Promzea: a pipeline for discovery of co-regulatory motifs in maize and other plant species and its application to the anthocyanin and phlobaphene biosynthetic pathways and the Maize Development Atlas.

    Science.gov (United States)

    Liseron-Monfils, Christophe; Lewis, Tim; Ashlock, Daniel; McNicholas, Paul D; Fauteux, François; Strömvik, Martina; Raizada, Manish N

    2013-03-15

    The discovery of genetic networks and cis-acting DNA motifs underlying their regulation is a major objective of transcriptome studies. The recent release of the maize genome (Zea mays L.) has facilitated in silico searches for regulatory motifs. Several algorithms exist to predict cis-acting elements, but none have been adapted for maize. A benchmark data set was used to evaluate the accuracy of three motif discovery programs: BioProspector, Weeder and MEME. Analysis showed that each motif discovery tool had limited accuracy and appeared to retrieve a distinct set of motifs. Therefore, using the benchmark, statistical filters were optimized to reduce the false discovery ratio, and then remaining motifs from all programs were combined to improve motif prediction. These principles were integrated into a user-friendly pipeline for motif discovery in maize called Promzea, available at http://www.promzea.org and on the Discovery Environment of the iPlant Collaborative website. Promzea was subsequently expanded to include rice and Arabidopsis. Within Promzea, a user enters cDNA sequences or gene IDs; corresponding upstream sequences are retrieved from the maize genome. Predicted motifs are filtered, combined and ranked. Promzea searches the chosen plant genome for genes containing each candidate motif, providing the user with the gene list and corresponding gene annotations. Promzea was validated in silico using a benchmark data set: the Promzea pipeline showed a 22% increase in nucleotide sensitivity compared to the best standalone program tool, Weeder, with equivalent nucleotide specificity. Promzea was also validated by its ability to retrieve the experimentally defined binding sites of transcription factors that regulate the maize anthocyanin and phlobaphene biosynthetic pathways. Promzea predicted additional promoter motifs, and genome-wide motif searches by Promzea identified 127 non-anthocyanin/phlobaphene genes that each contained all five predicted promoter

  14. Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

    Science.gov (United States)

    Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

    2017-11-01

    Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  15. Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

    Directory of Open Access Journals (Sweden)

    Nils E. R. Zimmermann

    2017-11-01

    Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  16. MPN+, a putative catalytic motif found in a subset of MPN domain proteins from eukaryotes and prokaryotes, is critical for Rpn11 function

    Directory of Open Access Journals (Sweden)

    Hofmann Kay

    2002-09-01

    Background Three macromolecular assemblages, the lid complex of the proteasome, the COP9-Signalosome (CSN) and the eIF3 complex, all consist of multiple proteins harboring MPN and PCI domains. Up to now, no specific function for any of these proteins has been defined, nor has the importance of these motifs been elucidated. In particular Rpn11, a lid subunit, serves as the paradigm for MPN-containing proteins as it is highly conserved and important for proteasome function. Results We have identified a sequence motif, termed the MPN+ motif, which is highly conserved in a subset of MPN domain proteins such as Rpn11 and Csn5/Jab1, but is not present outside of this subfamily. The MPN+ motif consists of five polar residues that resemble the active site residues of hydrolytic enzyme classes, particularly that of metalloproteases. By using site-directed mutagenesis, we show that the MPN+ residues are important for the function of Rpn11, while a highly conserved Cys residue outside of the MPN+ motif is not essential. Single amino acid substitutions in MPN+ residues all show similar phenotypes, including slow growth, sensitivity to temperature and amino acid analogs, and general proteasome-dependent proteolysis defects. Conclusions The MPN+ motif is abundant in certain MPN-domain proteins, including newly identified proteins of eukaryotes, bacteria and archaea thought to act outside of the traditional large PCI/MPN complexes. The putative catalytic nature of the MPN+ motif makes it a good candidate for a pivotal enzymatic function, possibly a proteasome-associated deubiquitinating activity and a CSN-associated Nedd8/Rub1-removing activity.

  17. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR) and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur) binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis of gene expression data is available from the first author upon request.
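    The abstract above does not say which FDR procedure was used, and the authors' SAS code is not reproduced here. As a minimal sketch, assuming the widely used Benjamini-Hochberg step-up procedure, the filtering of motif candidates by FDR could look like the following. The function name, the `alpha` default and the p-values are hypothetical and chosen only for illustration.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return the sorted indices of hypotheses rejected at FDR level alpha.

    Assumption: Benjamini-Hochberg step-up procedure; the record above does
    not state which FDR method the authors actually applied.
    """
    m = len(p_values)
    # Indices of the p-values, from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest 1-based rank k with p_(k) <= (k / m) * alpha.
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff = rank
    # Reject every hypothesis ranked at or below that cutoff.
    return sorted(order[:cutoff])


# Hypothetical p-values for six motif candidates (invented for illustration).
candidate_p_values = [0.001, 0.008, 0.039, 0.041, 0.20, 0.74]
retained = benjamini_hochberg(candidate_p_values, alpha=0.05)
print(retained)
```

    The step-up rule keeps every candidate ranked at or below the largest rank that passes its threshold, which is why a candidate can be retained even if its own p-value misses its individual threshold.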

  18. RNA recognition motif (RRM)-containing proteins in Bombyx mori

    African Journals Online (AJOL)

    STORAGESEVER

    2009-03-20

    Mar 20, 2009 ... Recognition Motif (RRM), sometimes referred to as. RNP1, is one of the first identified domains for RNA interaction. RRM is very common ..... Apart from the RRM motif, eIF3-S9 has a Trp-Asp. (WD) repeat domain, Poly (A) ...

  19. Fingerprint motifs of phytases | Fan | African Journal of Biotechnology

    African Journals Online (AJOL)

    Among the total of potential 173 phytases gained in 11 plant genomes through MAST, PAPhys are the major phytases, and HAPhys are the minor, and other phytase groups are not found in planta. Keywords: Phytase, fingerprint motif, multiple EM for motif elicitation (MEME), MAST African Journal of Biotechnology Vol.

  20. Environmental influences on DNA curvature

    DEFF Research Database (Denmark)

    Ussery, David; Higgins, C.F.; Bolshoy, A.

    1999-01-01

    DNA curvature plays an important role in many biological processes. To study environmental influences on DNA curvature we compared the anomalous migration on polyacrylamide gels of ligation ladders of 11 specifically-designed oligonucleotides. At low temperatures (25 °C and below) most......, whilst spermine enhanced the anomalous migration of a different set of sequences. Sequences with a GGC motif exhibited greater curvature than predicted by the presently-used angles for the nearest-neighbour wedge model and are especially sensitive to Mg2+. The data have implications for models...... for DNA curvature and for environmentally-sensitive DNA conformations in the regulation of gene expression....

  1. The LINKS motif zippers trans-acyltransferase polyketide synthase assembly lines into a biosynthetic megacomplex.

    Science.gov (United States)

    Gay, Darren C; Wagner, Drew T; Meinke, Jessica L; Zogzas, Charles E; Gay, Glen R; Keatinge-Clay, Adrian T

    2016-03-01

    Polyketides such as the clinically-valuable antibacterial agent mupirocin are constructed by architecturally-sophisticated assembly lines known as trans-acyltransferase polyketide synthases. Organelle-sized megacomplexes composed of several copies of trans-acyltransferase polyketide synthase assembly lines have been observed by others through transmission electron microscopy to be located at the Bacillus subtilis plasma membrane, where the synthesis and export of the antibacterial polyketide bacillaene takes place. In this work we analyze ten crystal structures of trans-acyltransferase polyketide synthases ketosynthase domains, seven of which are reported here for the first time, to characterize a motif capable of zippering assembly lines into a megacomplex. While each of the three-helix LINKS (Laterally-INteracting Ketosynthase Sequence) motifs is observed to similarly dock with a spatially-reversed copy of itself through hydrophobic and ionic interactions, the amino acid sequences of this motif are not conserved. Such a code is appropriate for mediating homotypic contacts between assembly lines to ensure the ordered self-assembly of a noncovalent, yet tightly-knit, enzymatic network. LINKS-mediated lateral interactions would also have the effect of bolstering the vertical association of the polypeptides that comprise a polyketide synthase assembly line. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. Identification of sequence motifs significantly associated with antisense activity

    Directory of Open Access Journals (Sweden)

    Peek Andrew S

    2007-06-01

    Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequence's antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic

  3. A sialoreceptor binding motif in the Mycoplasma synoviae adhesin VlhA.

    Directory of Open Access Journals (Sweden)

    Meghan May

    Mycoplasma synoviae depends on its adhesin VlhA to mediate cytadherence to sialylated host cell receptors. Allelic variants of VlhA arise through recombination between an assemblage of promoterless vlhA pseudogenes and a single transcription promoter site, creating lineages of M. synoviae that each express a different vlhA allele. The predicted full-length VlhA sequences adjacent to the promoter of nine lineages of M. synoviae varying in avidity of cytadherence were aligned with that of the reference strain MS53 and with a 60-a.a. hemagglutinating VlhA C-terminal fragment from a Tunisian lineage of strain WVU1853(T). Seven different sequence variants of an imperfectly conserved, single-copy, 12-a.a. candidate cytadherence motif were evident amid the flanking variable residues of the 11 total sequences examined. The motif was predicted to adopt a short hairpin structure in a low-complexity region near the C-terminus of VlhA. Biotinylated synthetic oligopeptides representing four selected variants of the 12-a.a. motif, with the whole synthesized 60-a.a. fragment as a positive control, differed (P<0.01) in the extent they bound to chicken erythrocyte membranes. All bound to a greater extent (P<0.01) than scrambled or irrelevant VlhA domain negative control peptides did. Experimentally introduced branched-chain amino acid (BCAA) substitutions Val3Ile and Leu7Ile did not significantly alter binding, whereas fold-destabilizing substitutions Thr4Gly and Ala9Gly tended to reduce it (P<0.05). Binding was also reduced to background levels (P<0.01) when the peptides were exposed to desialylated membranes, or were pre-saturated with free sialic acid before exposure to untreated membranes. From this evidence we conclude that the motif P-X-(BCAA)-X-F-X-(BCAA)-X-A-K-X-G binds sialic acid and likely mediates VlhA-dependent M. synoviae attachment to host cells. This conserved mechanism retains the potential for fine-scale rheostasis in binding avidity, which could be a
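
    The 12-residue consensus reported in this record, P-X-(BCAA)-X-F-X-(BCAA)-X-A-K-X-G, maps directly onto a regular expression. The sketch below assumes that X stands for any residue written as an uppercase one-letter code and that BCAA means valine, leucine or isoleucine. The function name and the test sequences are hypothetical and are not taken from any VlhA allele.

```python
import re

# X = any residue (uppercase one-letter code, an assumption of this sketch);
# BCAA = branched-chain amino acid, i.e. Val (V), Leu (L) or Ile (I).
SIALORECEPTOR_MOTIF = re.compile(r"P[A-Z][VLI][A-Z]F[A-Z][VLI][A-Z]AK[A-Z]G")


def find_motif(protein_seq):
    """Return (0-based start, matched 12-mer) for each non-overlapping match."""
    return [(m.start(), m.group())
            for m in SIALORECEPTOR_MOTIF.finditer(protein_seq.upper())]


# Hypothetical sequences, invented purely for illustration.
wild_type = "MKTPSVTFDLNAKQGEE"
ala9gly = "MKTPSVTFDLNGKQGEE"   # conserved Ala at motif position 9 replaced by Gly

print(find_motif(wild_type))
print(find_motif(ala9gly))
```

    A pattern match only reports whether a sequence fits the consensus. Motif position 4 is an X, so a Thr4Gly variant still matches even though the abstract reports reduced binding for it; the binding differences between variants are experimental results that a regular expression cannot capture.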

  4. Four signature motifs define the first class of structurally related large coiled-coil proteins in plants.

    Directory of Open Access Journals (Sweden)

    Meier Iris

    2002-04-01

    Background Animal and yeast proteins containing long coiled-coil domains are involved in attaching other proteins to the large, solid-state components of the cell. One subgroup of long coiled-coil proteins are the nuclear lamins, which are involved in attaching chromatin to the nuclear envelope and have recently been implicated in inherited human diseases. In contrast to other eukaryotes, long coiled-coil proteins have been barely investigated in plants. Results We have searched the completed Arabidopsis genome and have identified a family of structurally related long coiled-coil proteins. Filament-like plant proteins (FPP) were identified by sequence similarity to a tomato cDNA that encodes a coiled-coil protein which interacts with the nuclear envelope-associated protein, MAF1. The FPP family is defined by four novel unique sequence motifs and by two clusters of long coiled-coil domains separated by a non-coiled-coil linker. All family members are expressed in a variety of Arabidopsis tissues. A homolog sharing the structural features was identified in the monocot rice, indicating conservation among angiosperms. Conclusion Except for myosins, this is the first characterization of a family of long coiled-coil proteins in plants. The tomato homolog of the FPP family binds in a yeast two-hybrid assay to a nuclear envelope-associated protein. This might suggest that FPP family members function in nuclear envelope biology. Because the full Arabidopsis genome does not appear to contain genes for lamins, it is of interest to investigate other long coiled-coil proteins, which might functionally replace lamins in the plant kingdom.

  5. How conserved are the bacterial communities associated with aphids? A detailed assessment of the Brevicoryne brassicae (Hemiptera: Aphididae) using 16S rDNA.

    Science.gov (United States)

    Clark, E L; Daniell, T J; Wishart, J; Hubbard, S F; Karley, A J

    2012-12-01

    Aphids harbor a community of bacteria that include obligate and facultative endosymbionts belonging to the Enterobacteriaceae along with opportunistic, commensal, or pathogenic bacteria. This study represents the first detailed analysis of the identity and diversity of the bacterial community associated with the cabbage aphid, Brevicoryne brassicae (L.). 16S rDNA sequence analysis revealed that the community of bacteria associated with B. brassicae was diverse, with at least four different bacterial community types detected among aphid lines, collected from widely dispersed sites in Northern Britain. The bacterial sequence types isolated from B. brassicae showed little similarity to any bacterial endosymbionts characterized in insects; instead, they were closely related to free-living extracellular bacterial species that have been isolated from the aphid gut or that are known to be present in the environment, suggesting that they are opportunistic bacteria transmitted between the aphid gut and the environment. To quantify variation in bacterial community between aphid lines, which was driven largely by differences in the proportions of two dominant bacterial orders, the Pseudomonales and the Enterobacteriales, we developed a novel real-time (Taqman) qPCR assay. By improving our knowledge of aphid microbial ecology, and providing novel molecular tools to examine the presence and function of the microbial community, this study forms the basis of further research to explore the influence of the extracellular bacterial community on aphid fitness, pest status, and susceptibility to control by natural enemies.

  6. Principal component analysis for predicting transcription-factor binding motifs from array-derived data

    Directory of Open Access Journals (Sweden)

    Vincenti Matthew P

    2005-11-01

    Background The responses to interleukin 1 (IL-1) in human chondrocytes constitute a complex regulatory mechanism, where multiple transcription factors interact combinatorially with transcription-factor binding motifs (TFBMs). In order to select a critical set of TFBMs from genomic DNA information and array-derived data, an efficient algorithm to solve a combinatorial optimization problem is required. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful to predict TFBMs at nearly no computational cost and evaluate varying modelling conditions. Singular value decomposition (SVD) is a powerful method to derive primary components of a given matrix. Applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs. Results The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC-3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3') were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC-BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPARγ, STAF, ROAZ, and NFκB, and their significance was evaluated numerically using Monte Carlo simulation and genetic algorithm. Conclusion The described SVD-based prediction is an analytical method to provide a set of potential TFBMs involved in transcriptional regulation. The results would be useful to evaluate analytically a contribution of individual DNA sequences.
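
    The abstract does not give the exact definition of the promoter matrix, so the following is only a rough sketch under stated assumptions: rows are promoters, columns are all possible k-mers, and entries are plain occurrence counts. The leading singular value and right singular vector are obtained by power iteration rather than a full SVD, so the example needs nothing beyond the standard library. All function names and sequences are hypothetical.

```python
from itertools import product


def kmer_count_matrix(promoters, k):
    """Rows = promoters, columns = all 4**k k-mers, entries = occurrence counts.

    Assumption: a plain count matrix; the paper's promoter matrix may be
    defined differently.
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    index = {kmer: j for j, kmer in enumerate(kmers)}
    matrix = [[0] * len(kmers) for _ in promoters]
    for i, seq in enumerate(promoters):
        for pos in range(len(seq) - k + 1):
            j = index.get(seq[pos:pos + k])
            if j is not None:  # skip windows containing N or other symbols
                matrix[i][j] += 1
    return kmers, matrix


def leading_right_singular_vector(matrix, iterations=200):
    """Power iteration on A^T A; returns (largest singular value, unit vector)."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    v = [1.0 / n_cols ** 0.5] * n_cols
    sigma = 0.0
    for _ in range(iterations):
        av = [sum(a * x for a, x in zip(row, v)) for row in matrix]  # A v
        atav = [sum(matrix[i][j] * av[i] for i in range(n_rows))     # A^T (A v)
                for j in range(n_cols)]
        norm = sum(x * x for x in atav) ** 0.5
        if norm == 0.0:
            break
        v = [x / norm for x in atav]
        sigma = norm ** 0.5  # |A^T A v| tends to sigma**2 for unit v
    return sigma, v


# Hypothetical promoter fragments, invented for illustration only.
promoters = ["CAGGCTTCGCCCAAAGG", "GGGAACAGGCATGGGTT", "ACCCACGTCCCAGGCAA"]
kmers, A = kmer_count_matrix(promoters, k=5)
sigma, v = leading_right_singular_vector(A)
top = sorted(zip(v, kmers), reverse=True)[:3]  # k-mers with the largest loadings
```

    In practice a numerical library would compute the full decomposition; power iteration is used here only to keep the sketch dependency-free, and it recovers just the first component.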

  7. Motif statistics and spike correlations in neuronal networks

    International Nuclear Information System (INIS)

    Hu, Yu; Shea-Brown, Eric; Trousdale, James; Josić, Krešimir

    2013-01-01

    Motifs are patterns of subgraphs of complex networks. We studied the impact of such patterns of connectivity on the level of correlated, or synchronized, spiking activity among pairs of cells in a recurrent network of integrate and fire neurons. For a range of network architectures, we find that the pairwise correlation coefficients, averaged across the network, can be closely approximated using only three statistics of network connectivity. These are the overall network connection probability and the frequencies of two second order motifs: diverging motifs, in which one cell provides input to two others, and chain motifs, in which two cells are connected via a third intermediary cell. Specifically, the prevalence of diverging and chain motifs tends to increase correlation. Our method is based on linear response theory, which enables us to express spiking statistics using linear algebra, and a resumming technique, which extrapolates from second order motifs to predict the overall effect of coupling on network correlation. Our motif-based results seek to isolate the effect of network architecture perturbatively from a known network state. (paper)
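
    The three connectivity statistics named in this abstract can be read off a directed adjacency matrix. The sketch below counts them empirically, assuming that `adj[i][j] == 1` means cell i provides input to cell j and that each frequency is normalized by the number of possible placements of the motif. The paper's own normalization is not given in the abstract and may differ, and the function name is hypothetical.

```python
def motif_statistics(adj):
    """Connection probability and second-order motif frequencies.

    Assumptions: adj[i][j] == 1 means cell i provides input to cell j,
    self-connections are ignored, and the network has at least 3 cells.
    """
    n = len(adj)
    out_deg = [sum(adj[i][j] for j in range(n) if j != i) for i in range(n)]
    in_deg = [sum(adj[j][i] for j in range(n) if j != i) for i in range(n)]
    # Number of reciprocal partners per cell (i -> k and k -> i).
    recip = [sum(1 for j in range(n) if j != i and adj[i][j] and adj[j][i])
             for i in range(n)]

    edges = sum(out_deg)
    # Diverging motif: one cell projects to two distinct other cells.
    diverging = sum(d * (d - 1) // 2 for d in out_deg)
    # Chain motif: i -> k -> j with i != j, so reciprocal pairs are excluded.
    chain = sum(in_deg[k] * out_deg[k] - recip[k] for k in range(n))

    return {
        "connection_probability": edges / (n * (n - 1)),
        "diverging_frequency": diverging / (n * (n - 1) * (n - 2) / 2),
        "chain_frequency": chain / (n * (n - 1) * (n - 2)),
    }


# Hypothetical 3-cell network: 0 -> 1, 0 -> 2, 1 -> 2.
example = [
    [0, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
]
stats = motif_statistics(example)
```

    In the example network, cell 0 forms one diverging motif and the path 0 -> 1 -> 2 forms one chain motif, out of three and six possible placements respectively.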

  8. Computational analyses of synergism in small molecular network motifs.

    Directory of Open Access Journals (Sweden)

    Yili Zhang

    2014-03-01

    Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically) to alter the responses of the motifs to stimuli. Synergism (or antagonism) was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.

  9. Triadic motifs in the dependence networks of virtual societies

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing