WorldWideScience

Sample records for identifying cis-regulatory sequences

  1. A method for selecting cis-acting regulatory sequences that respond to small molecule effectors

    Directory of Open Access Journals (Sweden)

    Allas Ülar

    2010-08-01

    Full Text Available Abstract Background Several cis-acting regulatory sequences functioning at the level of mRNA or nascent peptide and specifically influencing transcription or translation have been described. These regulatory elements often respond to specific chemicals. Results We have developed a method that allows us to select cis-acting regulatory sequences that respond to diverse chemicals. The method is based on the β-lactamase gene containing a random sequence inserted into the beginning of the ORF. Several rounds of selection are used to isolate sequences that suppress β-lactamase expression in response to the compound under study. We have isolated sequences that respond to erythromycin, troleandomycin, chloramphenicol, meta-toluate and homoserine lactone. By introducing synonymous and non-synonymous mutations we have shown that at least in the case of erythromycin the sequences act at the peptide level. We have also tested the cross-activities of the constructs and found that in most cases the sequences respond most strongly to the compound on which they were isolated. Conclusions Several selected peptides showed ligand-specific changes in amino acid frequencies, but no consensus motif could be identified. This is consistent with previous observations on natural cis-acting peptides, showing that it is often impossible to demonstrate a consensus. Applying the currently developed method on a larger scale, by selecting and comparing an extended set of sequences, might allow the sequence rules underlying the activity of cis-acting regulatory peptides to be identified.

  2. On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions

    Science.gov (United States)

    Tarpine, Ryan; Istrail, Sorin

    The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.

  3. Close Sequence Comparisons are Sufficient to Identify Humancis-Regulatory Elements

    Energy Technology Data Exchange (ETDEWEB)

    Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

    2005-12-01

    Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.

  4. Evolution of Cis-Regulatory Elements and Regulatory Networks in Duplicated Genes of Arabidopsis.

    Science.gov (United States)

    Arsovski, Andrej A; Pradinuk, Julian; Guo, Xu Qiu; Wang, Sishuo; Adams, Keith L

    2015-12-01

    Plant genomes contain large numbers of duplicated genes that contribute to the evolution of new functions. Following duplication, genes can exhibit divergence in their coding sequence and their expression patterns. Changes in the cis-regulatory element landscape can result in changes in gene expression patterns. High-throughput methods developed recently can identify potential cis-regulatory elements on a genome-wide scale. Here, we use a recent comprehensive data set of DNase I sequencing-identified cis-regulatory binding sites (footprints) at single-base-pair resolution to compare binding sites and network connectivity in duplicated gene pairs in Arabidopsis (Arabidopsis thaliana). We found that duplicated gene pairs vary greatly in their cis-regulatory element architecture, resulting in changes in regulatory network connectivity. Whole-genome duplicates (WGDs) have approximately twice as many footprints in their promoters left by potential regulatory proteins than do tandem duplicates (TDs). The WGDs have a greater average number of footprint differences between paralogs than TDs. The footprints, in turn, result in more regulatory network connections between WGDs and other genes, forming denser, more complex regulatory networks than shown by TDs. When comparing regulatory connections between duplicates, WGDs had more pairs in which the two genes are either partially or fully diverged in their network connections, but fewer genes with no network connections than the TDs. There is evidence of younger TDs and WGDs having fewer unique connections compared with older duplicates. This study provides insights into cis-regulatory element evolution and network divergence in duplicated genes. © 2015 American Society of Plant Biologists. All Rights Reserved.

  5. RNA-ID, a Powerful Tool for Identifying and Characterizing Regulatory Sequences.

    Science.gov (United States)

    Brule, C E; Dean, K M; Grayhack, E J

    2016-01-01

    The identification and analysis of sequences that regulate gene expression is critical because regulated gene expression underlies biology. RNA-ID is an efficient and sensitive method to discover and investigate regulatory sequences in the yeast Saccharomyces cerevisiae, using fluorescence-based assays to detect green fluorescent protein (GFP) relative to a red fluorescent protein (RFP) control in individual cells. Putative regulatory sequences can be inserted either in-frame or upstream of a superfolder GFP fusion protein whose expression, like that of RFP, is driven by the bidirectional GAL1,10 promoter. In this chapter, we describe the methodology to identify and study cis-regulatory sequences in the RNA-ID system, explaining features and variations of the RNA-ID reporter, as well as some applications of this system. We describe in detail the methods to analyze a single regulatory sequence, from construction of a single GFP variant to assay of variants by flow cytometry, as well as modifications required to screen libraries of different strains simultaneously. We also describe subsequent analyses of regulatory sequences. © 2016 Elsevier Inc. All rights reserved.

  6. RNA-ID, a highly sensitive and robust method to identify cis-regulatory sequences using superfolder GFP and a fluorescence-based assay.

    Science.gov (United States)

    Dean, Kimberly M; Grayhack, Elizabeth J

    2012-12-01

    We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression.

  7. Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.

    Science.gov (United States)

    Pierstorff, Nora; Bergman, Casey M; Wiehe, Thomas

    2006-12-01

    Predicting cis-regulatory modules (CRMs) in higher eukaryotes is a challenging computational task. Commonly used methods to predict CRMs based on the signal of transcription factor binding sites (TFBS) are limited by prior information about transcription factor specificity. More general methods that bypass the reliance on TFBS models are needed for comprehensive CRM prediction. We have developed a method to predict CRMs called CisPlusFinder that identifies high density regions of perfect local ungapped sequences (PLUSs) based on multiple species conservation. By assuming that PLUSs contain core TFBS motifs that are locally overrepresented, the method attempts to capture the expected features of CRM structure and evolution. Applied to a benchmark dataset of CRMs involved in early Drosophila development, CisPlusFinder predicts more annotated CRMs than all other methods tested. Using the REDfly database, we find that some 'false positive' predictions in the benchmark dataset correspond to recently annotated CRMs. Our work demonstrates that CRM prediction methods that combine comparative genomic data with statistical properties of DNA may achieve reasonable performance when applied genome-wide in the absence of an a priori set of known TFBS motifs. The program CisPlusFinder can be downloaded at http://jakob.genetik.uni-koeln.de/bioinformatik/people/nora/nora.html. All software is licensed under the Lesser GNU Public License (LGPL).

  8. Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

    Science.gov (United States)

    Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

    2005-09-01

    We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.

  9. Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.

    Science.gov (United States)

    Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W

    2018-05-31

    In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.

  10. An in vivo cis-regulatory screen at the type 2 diabetes associated TCF7L2 locus identifies multiple tissue-specific enhancers.

    Directory of Open Access Journals (Sweden)

    Daniel Savic

    Full Text Available Genome-wide association studies (GWAS have repeatedly shown an association between non-coding variants in the TCF7L2 locus and risk for type 2 diabetes (T2D, implicating a role for cis-regulatory variation within this locus in disease etiology. Supporting this hypothesis, we previously localized complex regulatory activity to the TCF7L2 T2D-associated interval using an in vivo bacterial artificial chromosome (BAC enhancer-trapping reporter strategy. To follow-up on this broad initial survey of the TCF7L2 regulatory landscape, we performed a fine-mapping enhancer scan using in vivo mouse transgenic reporter assays. We functionally interrogated approximately 50% of the sequences within the T2D-associated interval, utilizing sequence conservation within this 92-kb interval to determine the regulatory potential of all evolutionary conserved sequences that exhibited conservation to the non-eutherian mammal opossum. Included in this study was a detailed functional interrogation of sequences spanning both protective and risk alleles of single nucleotide polymorphism (SNP rs7903146, which has exhibited allele-specific enhancer function in pancreatic beta cells. Using these assays, we identified nine segments regulating various aspects of the TCF7L2 expression profile and that constitute nearly 70% of the sequences tested. These results highlight the regulatory complexity of this interval and support the notion that a TCF7L2 cis-regulatory disruption leads to T2D predisposition.

  11. Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.

    Science.gov (United States)

    Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P

    2015-04-23

    With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.

  12. Comparative genome sequencing of drosophila pseudoobscura: Chromosomal, gene and cis-element evolution

    Energy Technology Data Exchange (ETDEWEB)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Todd, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catherine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenee; Verduzco, Daniel; Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

    2004-04-01

    The genome sequence of a second fruit fly, D. pseudoobscura, presents an opportunity for comparative analysis of a primary model organism D. melanogaster. The vast majority of Drosophila genes have remained on the same arm, but within each arm gene order has been extensively reshuffled leading to the identification of approximately 1300 syntenic blocks. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 35 My since divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome wide average consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than control sequences between the species but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a picture of repeat mediated chromosomal rearrangement, and high co-adaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.

  13. Pathogenic adaptation of intracellular bacteria by rewiring a cis-regulatory input function.

    Science.gov (United States)

    Osborne, Suzanne E; Walthers, Don; Tomljenovic, Ana M; Mulder, David T; Silphaduang, Uma; Duong, Nancy; Lowden, Michael J; Wickham, Mark E; Waller, Ross F; Kenney, Linda J; Coombes, Brian K

    2009-03-10

    The acquisition of DNA by horizontal gene transfer enables bacteria to adapt to previously unexploited ecological niches. Although horizontal gene transfer and mutation of protein-coding sequences are well-recognized forms of pathogen evolution, the evolutionary significance of cis-regulatory mutations in creating phenotypic diversity through altered transcriptional outputs is not known. We show the significance of regulatory mutation for pathogen evolution by mapping and then rewiring a cis-regulatory module controlling a gene required for murine typhoid. Acquisition of a binding site for the Salmonella pathogenicity island-2 regulator, SsrB, enabled the srfN gene, ancestral to the Salmonella genus, to play a role in pathoadaptation of S. typhimurium to a host animal. We identified the evolved cis-regulatory module and quantified the fitness gain that this regulatory output accrues for the bacterium using competitive infections of host animals. Our findings highlight a mechanism of pathogen evolution involving regulatory mutation that is selected because of the fitness advantage the new regulatory output provides the incipient clones.

  14. cisMEP: an integrated repository of genomic epigenetic profiles and cis-regulatory modules in Drosophila.

    Science.gov (United States)

    Yang, Tzu-Hsien; Wang, Chung-Ching; Hung, Po-Cheng; Wu, Wei-Sheng

    2014-01-01

    Cis-regulatory modules (CRMs), or the DNA sequences required for regulating gene expression, play the central role in biological researches on transcriptional regulation in metazoan species. Nowadays, the systematic understanding of CRMs still mainly resorts to computational methods due to the time-consuming and small-scale nature of experimental methods. But the accuracy and reliability of different CRM prediction tools are still unclear. Without comparative cross-analysis of the results and combinatorial consideration with extra experimental information, there is no easy way to assess the confidence of the predicted CRMs. This limits the genome-wide understanding of CRMs. It is known that transcription factor binding and epigenetic profiles tend to determine functions of CRMs in gene transcriptional regulation. Thus integration of the genome-wide epigenetic profiles with systematically predicted CRMs can greatly help researchers evaluate and decipher the prediction confidence and possible transcriptional regulatory functions of these potential CRMs. However, these data are still fragmentary in the literatures. Here we performed the computational genome-wide screening for potential CRMs using different prediction tools and constructed the pioneer database, cisMEP (cis-regulatory module epigenetic profile database), to integrate these computationally identified CRMs with genomic epigenetic profile data. cisMEP collects the literature-curated TFBS location data and nine genres of epigenetic data for assessing the confidence of these potential CRMs and deciphering the possible CRM functionality. cisMEP aims to provide a user-friendly interface for researchers to assess the confidence of different potential CRMs and to understand the functions of CRMs through experimentally-identified epigenetic profiles. The deposited potential CRMs and experimental epigenetic profiles for confidence assessment provide experimentally testable hypotheses for the molecular mechanisms

  15. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

    DEFF Research Database (Denmark)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.

    2005-01-01

    years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences......We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each...... between the species-but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence...

  16. In silico analysis of cis-acting regulatory elements in 5' regulatory regions of sucrose transporter gene families in rice (Oryza sativa Japonica) and Arabidopsis thaliana.

    Science.gov (United States)

    Ibraheem, Omodele; Botha, Christiaan E J; Bradley, Graeme

    2010-12-01

    The regulation of gene expression involves a multifarious regulatory system. Each gene contains a unique combination of cis-acting regulatory sequence elements in the 5' regulatory region that determines its temporal and spatial expression. Cis-acting regulatory elements are essential transcriptional gene regulatory units; they control many biological processes and stress responses. Thus a full understanding of the transcriptional gene regulation system will depend on successful functional analyses of cis-acting elements. Cis-acting regulatory elements present within the 5' regulatory region of the sucrose transporter gene families in rice (Oryza sativa Japonica cultivar-group) and Arabidopsis thaliana, were identified using a bioinformatics approach. The possible cis-acting regulatory elements were predicted by scanning 1.5kbp of 5' regulatory regions of the sucrose transporter genes translational start sites, using Plant CARE, PLACE and Genomatix Matinspector professional databases. Several cis-acting regulatory elements that are associated with plant development, plant hormonal regulation and stress response were identified, and were present in varying frequencies within the 1.5kbp of 5' regulatory region, among which are; A-box, RY, CAT, Pyrimidine-box, Sucrose-box, ABRE, ARF, ERE, GARE, Me-JA, ARE, DRE, GA-motif, GATA, GT-1, MYC, MYB, W-box, and I-box. This result reveals the probable cis-acting regulatory elements that possibly are involved in the expression and regulation of sucrose transporter gene families in rice and Arabidopsis thaliana during cellular development or environmental stress conditions. Copyright © 2010 Elsevier Ltd. All rights reserved.

  17. Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees.

    Science.gov (United States)

    Chabot, Adrien; Shrit, Ralla A; Blekhman, Ran; Gilad, Yoav

    2007-08-01

    Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To hone in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.

  18. Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

    Energy Technology Data Exchange (ETDEWEB)

    Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

    2003-06-01

    OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.

  19. Barcoded DNA-tag reporters for multiplex cis-regulatory analysis.

    Directory of Open Access Journals (Sweden)

    Jongmin Nam

    Full Text Available Cis-regulatory DNA sequences causally mediate patterns of gene expression, but efficient experimental analysis of these control systems has remained challenging. Here we develop a new version of "barcoded" DNA-tag reporters, "Nanotags" that permit simultaneous quantitative analysis of up to 130 distinct cis-regulatory modules (CRMs. The activities of these reporters are measured in single experiments by the NanoString RNA counting method and other quantitative procedures. We demonstrate the efficiency of the Nanotag method by simultaneously measuring hourly temporal activities of 126 CRMs from 46 genes in the developing sea urchin embryo, otherwise a virtually impossible task. Nanotags are also used in gene perturbation experiments to reveal cis-regulatory responses of many CRMs at once. Nanotag methodology can be applied to many research areas, ranging from gene regulatory networks to functional and evolutionary genomics.

  20. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  1. Detection of Weakly Conserved Ancestral Mammalian RegulatorySequences by Primate Comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng,Jan-Fang; Rubin, Edward M.; Boffelli, Dario

    2006-06-01

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detectcryptic functional elements, which are too weakly conserved among mammalsto distinguish from nonfunctional DNA. To address this problem, weexplored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  2. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences

    OpenAIRE

    Lescot, Magali; Déhais, Patrice; Thijs, Gert; Marchal, Kathleen; Moreau, Yves; Van de Peer, Yves; Rouzé, Pierre; Rombauts, Stephane

    2002-01-01

    PlantCARE is a database of plant cis-acting regulatory elements, enhancers and repressors. Regulatory elements are represented by positional matrices, consensus sequences and individual sites on particular promoter sequences. Links to the EMBL, TRANSFAC and MEDLINE databases are provided when available. Data about the transcription sites are extracted mainly from the literature, supplemented with an increasing number of in silico predicted data. Apart from a general description for specific t...

  3. Nomadic enhancers: tissue-specific cis-regulatory elements of yellow have divergent genomic positions among Drosophila species.

    Directory of Open Access Journals (Sweden)

    Gizem Kalay

    2010-11-01

    Full Text Available cis-regulatory DNA sequences known as enhancers control gene expression in space and time. They are central to metazoan development and are often responsible for changes in gene regulation that contribute to phenotypic evolution. Here, we examine the sequence, function, and genomic location of enhancers controlling tissue- and cell-type specific expression of the yellow gene in six Drosophila species. yellow is required for the production of dark pigment, and its expression has evolved largely in concert with divergent pigment patterns. Using Drosophila melanogaster as a transgenic host, we examined the expression of reporter genes in which either 5' intergenic or intronic sequences of yellow from each species controlled the expression of Green Fluorescent Protein. Surprisingly, we found that sequences controlling expression in the wing veins, as well as sequences controlling expression in epidermal cells of the abdomen, thorax, and wing, were located in different genomic regions in different species. By contrast, sequences controlling expression in bristle-associated cells were located in the intron of all species. Differences in the precise pattern of spatial expression within the developing epidermis of D. melanogaster transformants usually correlated with adult pigmentation in the species from which the cis-regulatory sequences were derived, which is consistent with cis-regulatory evolution affecting yellow expression playing a central role in Drosophila pigmentation divergence. Sequence comparisons among species favored a model in which sequential nucleotide substitutions were responsible for the observed changes in cis-regulatory architecture. Taken together, these data demonstrate frequent changes in yellow cis-regulatory architecture among Drosophila species. Similar analyses of other genes, combining in vivo functional tests of enhancer activity with in silico comparative genomics, are needed to determine whether the pattern of

  4. ChIP-Seq-Annotated Heliconius erato Genome Highlights Patterns of cis-Regulatory Evolution in Lepidoptera

    Directory of Open Access Journals (Sweden)

    James J. Lewis

    2016-09-01

    Full Text Available Uncovering phylogenetic patterns of cis-regulatory evolution remains a fundamental goal for evolutionary and developmental biology. Here, we characterize the evolution of regulatory loci in butterflies and moths using chromatin immunoprecipitation sequencing (ChIP-seq annotation of regulatory elements across three stages of head development. In the process we provide a high-quality, functionally annotated genome assembly for the butterfly, Heliconius erato. Comparing cis-regulatory element conservation across six lepidopteran genomes, we find that regulatory sequences evolve at a pace similar to that of protein-coding regions. We also observe that elements active at multiple developmental stages are markedly more conserved than elements with stage-specific activity. Surprisingly, we also find that stage-specific proximal and distal regulatory elements evolve at nearly identical rates. Our study provides a benchmark for genome-wide patterns of regulatory element evolution in insects, and it shows that developmental timing of activity strongly predicts patterns of regulatory sequence evolution.

  5. Using hexamers to predict cis-regulatory motifs in Drosophila

    Directory of Open Access Journals (Sweden)

    Kibler Dennis

    2005-10-01

    Full Text Available Abstract Background Cis-regulatory modules (CRMs are short stretches of DNA that help regulate gene expression in higher eukaryotes. They have been found up to 1 megabase away from the genes they regulate and can be located upstream, downstream, and even within their target genes. Due to the difficulty of finding CRMs using biological and computational techniques, even well-studied regulatory systems may contain CRMs that have not yet been discovered. Results We present a simple, efficient method (HexDiff based only on hexamer frequencies of known CRMs and non-CRM sequence to predict novel CRMs in regulatory systems. On a data set of 16 gap and pair-rule genes containing 52 known CRMs, predictions made by HexDiff had a higher correlation with the known CRMs than several existing CRM prediction algorithms: Ahab, Cluster Buster, MSCAN, MCAST, and LWF. After combining the results of the different algorithms, 10 putative CRMs were identified and are strong candidates for future study. The hexamers used by HexDiff to distinguish between CRMs and non-CRM sequence were also analyzed and were shown to be enriched in regulatory elements. Conclusion HexDiff provides an efficient and effective means for finding new CRMs based on known CRMs, rather than known binding sites.

  6. PlantCARE, a plant cis-acting regulatory element database

    OpenAIRE

    Rombauts, Stephane; Déhais, Patrice; Van Montagu, Marc; Rouzé, Pierre

    1999-01-01

    PlantCARE is a database of plant cis- acting regulatory elements, enhancers and repressors. Besides the transcription motifs found on a sequence, it also offers a link to the EMBL entry that contains the full gene sequence as well as a description of the conditions in which a motif becomes functional. The information on these sites is given by matrices, consensus and individual site sequences on particular genes, depending on the available information. PlantCARE is a relational database avail...

  7. Identification of a cis-regulatory element by transient analysis of co-ordinately regulated genes

    Directory of Open Access Journals (Sweden)

    Allan Andrew C

    2008-07-01

    Full Text Available Abstract Background Transcription factors (TFs co-ordinately regulate target genes that are dispersed throughout the genome. This co-ordinate regulation is achieved, in part, through the interaction of transcription factors with conserved cis-regulatory motifs that are in close proximity to the target genes. While much is known about the families of transcription factors that regulate gene expression in plants, there are few well characterised cis-regulatory motifs. In Arabidopsis, over-expression of the MYB transcription factor PAP1 (PRODUCTION OF ANTHOCYANIN PIGMENT 1 leads to transgenic plants with elevated anthocyanin levels due to the co-ordinated up-regulation of genes in the anthocyanin biosynthetic pathway. In addition to the anthocyanin biosynthetic genes, there are a number of un-associated genes that also change in expression level. This may be a direct or indirect consequence of the over-expression of PAP1. Results Oligo array analysis of PAP1 over-expression Arabidopsis plants identified genes co-ordinately up-regulated in response to the elevated expression of this transcription factor. Transient assays on the promoter regions of 33 of these up-regulated genes identified eight promoter fragments that were transactivated by PAP1. Bioinformatic analysis on these promoters revealed a common cis-regulatory motif that we showed is required for PAP1 dependent transactivation. Conclusion Co-ordinated gene regulation by individual transcription factors is a complex collection of both direct and indirect effects. Transient transactivation assays provide a rapid method to identify direct target genes from indirect target genes. Bioinformatic analysis of the promoters of these direct target genes is able to locate motifs that are common to this sub-set of promoters, which is impossible to identify with the larger set of direct and indirect target genes. While this type of analysis does not prove a direct interaction between protein and DNA

  8. Statistical significance of cis-regulatory modules

    Directory of Open Access Journals (Sweden)

    Smith Andrew D

    2007-01-01

    Full Text Available Abstract Background It is becoming increasingly important for researchers to be able to scan through large genomic regions for transcription factor binding sites or clusters of binding sites forming cis-regulatory modules. Correspondingly, there has been a push to develop algorithms for the rapid detection and assessment of cis-regulatory modules. While various algorithms for this purpose have been introduced, most are not well suited for rapid, genome scale scanning. Results We introduce methods designed for the detection and statistical evaluation of cis-regulatory modules, modeled as either clusters of individual binding sites or as combinations of sites with constrained organization. In order to determine the statistical significance of module sites, we first need a method to determine the statistical significance of single transcription factor binding site matches. We introduce a straightforward method of estimating the statistical significance of single site matches using a database of known promoters to produce data structures that can be used to estimate p-values for binding site matches. We next introduce a technique to calculate the statistical significance of the arrangement of binding sites within a module using a max-gap model. If the module scanned for has defined organizational parameters, the probability of the module is corrected to account for organizational constraints. The statistical significance of single site matches and the architecture of sites within the module can be combined to provide an overall estimation of statistical significance of cis-regulatory module sites. Conclusion The methods introduced in this paper allow for the detection and statistical evaluation of single transcription factor binding sites and cis-regulatory modules. The features described are implemented in the Search Tool for Occurrences of Regulatory Motifs (STORM and MODSTORM software.

  9. Dynamic SPR monitoring of yeast nuclear protein binding to a cis-regulatory element

    International Nuclear Information System (INIS)

    Mao, Grace; Brody, James P.

    2007-01-01

    Gene expression is controlled by protein complexes binding to short specific sequences of DNA, called cis-regulatory elements. Expression of most eukaryotic genes is controlled by dozens of these elements. Comprehensive identification and monitoring of these elements is a major goal of genomics. In pursuit of this goal, we are developing a surface plasmon resonance (SPR) based assay to identify and monitor cis-regulatory elements. To test whether we could reliably monitor protein binding to a regulatory element, we immobilized a 16 bp region of Saccharomyces cerevisiae chromosome 5 onto a gold surface. This 16 bp region of DNA is known to bind several proteins and thought to control expression of the gene RNR1, which varies through the cell cycle. We synchronized yeast cell cultures, and then sampled these cultures at a regular interval. These samples were processed to purify nuclear lysate, which was then exposed to the sensor. We found that nuclear protein binds this particular element of DNA at a significantly higher rate (as compared to unsynchronized cells) during G1 phase. Other time points show levels of DNA-nuclear protein binding similar to the unsynchronized control. We also measured the apparent association complex of the binding to be 0.014 s -1 . We conclude that (1) SPR-based assays can monitor DNA-nuclear protein binding and that (2) for this particular cis-regulatory element, maximum DNA-nuclear protein binding occurs during G1 phase

  10. Validation of Skeletal Muscle cis-Regulatory Module Predictions Reveals Nucleotide Composition Bias in Functional Enhancers

    Science.gov (United States)

    Kwon, Andrew T.; Chou, Alice Yi; Arenillas, David J.; Wasserman, Wyeth W.

    2011-01-01

    We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions. PMID:22144875

  11. Validation of skeletal muscle cis-regulatory module predictions reveals nucleotide composition bias in functional enhancers.

    Directory of Open Access Journals (Sweden)

    Andrew T Kwon

    2011-12-01

    Full Text Available We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions.

  12. A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model

    Directory of Open Access Journals (Sweden)

    Haitao Guo

    2017-01-01

    Full Text Available The discovery of cis-regulatory modules (CRMs is the key to understanding mechanisms of transcription regulation. Since CRMs have specific regulatory structures that are the basis for the regulation of gene expression, how to model the regulatory structure of CRMs has a considerable impact on the performance of CRM identification. The paper proposes a CRM discovery algorithm called ComSPS. ComSPS builds a regulatory structure model of CRMs based on HMM by exploring the rules of CRM transcriptional grammar that governs the internal motif site arrangement of CRMs. We test ComSPS on three benchmark datasets and compare it with five existing methods. Experimental results show that ComSPS performs better than them.

  13. A New Algorithm for Identifying Cis-Regulatory Modules Based on Hidden Markov Model

    Science.gov (United States)

    2017-01-01

    The discovery of cis-regulatory modules (CRMs) is the key to understanding mechanisms of transcription regulation. Since CRMs have specific regulatory structures that are the basis for the regulation of gene expression, how to model the regulatory structure of CRMs has a considerable impact on the performance of CRM identification. The paper proposes a CRM discovery algorithm called ComSPS. ComSPS builds a regulatory structure model of CRMs based on HMM by exploring the rules of CRM transcriptional grammar that governs the internal motif site arrangement of CRMs. We test ComSPS on three benchmark datasets and compare it with five existing methods. Experimental results show that ComSPS performs better than them. PMID:28497059

  14. Implications of duplicated cis-regulatory elements in the evolution of metazoans: the DDI model or how simplicity begets novelty.

    Science.gov (United States)

    Jiménez-Delgado, Senda; Pascual-Anaya, Juan; Garcia-Fernàndez, Jordi

    2009-07-01

    The discovery that most regulatory genes were conserved among animals from distant phyla challenged the ideas that gene duplication and divergence of homologous coding sequences were the basis for major morphological changes in metazoan evolution. In recent years, however, the interest for the roles, conservation and changes of non-coding sequences grew-up in parallel with genome sequencing projects. Presently, many independent studies are highlighting the importance that subtle changes in cis-regulatory regions had in the evolution of morphology trough the Animal Kingdom. Here we will show and discuss some of these studies, and underscore the future of cis-Evo-Devo research. Nevertheless, we would also explore how gene duplication, which includes duplication of regulatory regions, may have been critical for spatial or temporal co-option of new regulatory networks, causing the deployment of new transcriptome scenarios, and how these induced morphological changes were critical for the evolution of new forms. Forty years after Susumu Ohno famous sentence 'natural selection merely modifies, while redundancy creates', we suggest the alternative: 'natural selection modifies, while redundancy of cis-regulatory elements innovates', and propose the Duplication-Degeneration-Innovation model to explain the increased evolvability of duplicated cis-regulatory regions. Paradoxically, making regulation simpler by subfunctionalization paved the path for future complexity or, in other words, 'to make it simple to make it complex'.

  15. In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.

    Science.gov (United States)

    Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E

    2018-01-01

    DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.

  16. Brachyury, Foxa2 and the cis-Regulatory Origins of the Notochord.

    Directory of Open Access Journals (Sweden)

    Diana S José-Edwards

    2015-12-01

    Full Text Available A main challenge of modern biology is to understand how specific constellations of genes are activated to differentiate cells and give rise to distinct tissues. This study focuses on elucidating how gene expression is initiated in the notochord, an axial structure that provides support and patterning signals to embryos of humans and all other chordates. Although numerous notochord genes have been identified, the regulatory DNAs that orchestrate development and propel evolution of this structure by eliciting notochord gene expression remain mostly uncharted, and the information on their configuration and recurrence is still quite fragmentary. Here we used the simple chordate Ciona for a systematic analysis of notochord cis-regulatory modules (CRMs, and investigated their composition, architectural constraints, predictive ability and evolutionary conservation. We found that most Ciona notochord CRMs relied upon variable combinations of binding sites for the transcription factors Brachyury and/or Foxa2, which can act either synergistically or independently from one another. Notably, one of these CRMs contains a Brachyury binding site juxtaposed to an (AC microsatellite, an unusual arrangement also found in Brachyury-bound regulatory regions in mouse. In contrast, different subsets of CRMs relied upon binding sites for transcription factors of widely diverse families. Surprisingly, we found that neither intra-genomic nor interspecific conservation of binding sites were reliably predictive hallmarks of notochord CRMs. We propose that rather than obeying a rigid sequence-based cis-regulatory code, most notochord CRMs are rather unique. Yet, this study uncovered essential elements recurrently used by divergent chordates as basic building blocks for notochord CRMs.

  17. Brachyury, Foxa2 and the cis-Regulatory Origins of the Notochord.

    Science.gov (United States)

    José-Edwards, Diana S; Oda-Ishii, Izumi; Kugler, Jamie E; Passamaneck, Yale J; Katikala, Lavanya; Nibu, Yutaka; Di Gregorio, Anna

    2015-12-01

    A main challenge of modern biology is to understand how specific constellations of genes are activated to differentiate cells and give rise to distinct tissues. This study focuses on elucidating how gene expression is initiated in the notochord, an axial structure that provides support and patterning signals to embryos of humans and all other chordates. Although numerous notochord genes have been identified, the regulatory DNAs that orchestrate development and propel evolution of this structure by eliciting notochord gene expression remain mostly uncharted, and the information on their configuration and recurrence is still quite fragmentary. Here we used the simple chordate Ciona for a systematic analysis of notochord cis-regulatory modules (CRMs), and investigated their composition, architectural constraints, predictive ability and evolutionary conservation. We found that most Ciona notochord CRMs relied upon variable combinations of binding sites for the transcription factors Brachyury and/or Foxa2, which can act either synergistically or independently from one another. Notably, one of these CRMs contains a Brachyury binding site juxtaposed to an (AC) microsatellite, an unusual arrangement also found in Brachyury-bound regulatory regions in mouse. In contrast, different subsets of CRMs relied upon binding sites for transcription factors of widely diverse families. Surprisingly, we found that neither intra-genomic nor interspecific conservation of binding sites were reliably predictive hallmarks of notochord CRMs. We propose that rather than obeying a rigid sequence-based cis-regulatory code, most notochord CRMs are rather unique. Yet, this study uncovered essential elements recurrently used by divergent chordates as basic building blocks for notochord CRMs.

  18. Bounded search for de novo identification of degenerate cis-regulatory elements

    Directory of Open Access Journals (Sweden)

    Khetani Radhika S

    2006-05-01

    Full Text Available Abstract Background The identification of statistically overrepresented sequences in the upstream regions of coregulated genes should theoretically permit the identification of potential cis-regulatory elements. However, in practice many cis-regulatory elements are highly degenerate, precluding the use of an exhaustive word-counting strategy for their identification. While numerous methods exist for inferring base distributions using a position weight matrix, recent studies suggest that the independence assumptions inherent in the model, as well as the inability to reach a global optimum, limit this approach. Results In this paper, we report PRISM, a degenerate motif finder that leverages the relationship between the statistical significance of a set of binding sites and that of the individual binding sites. PRISM first identifies overrepresented, non-degenerate consensus motifs, then iteratively relaxes each one into a high-scoring degenerate motif. This approach requires no tunable parameters, thereby lending itself to unbiased performance comparisons. We therefore compare PRISM's performance against nine popular motif finders on 28 well-characterized S. cerevisiae regulons. PRISM consistently outperforms all other programs. Finally, we use PRISM to predict the binding sites of uncharacterized regulons. Our results support a proposed mechanism of action for the yeast cell-cycle transcription factor Stb1, whose binding site has not been determined experimentally. Conclusion The relationship between statistical measures of the binding sites and the set as a whole leads to a simple means of identifying the diverse range of cis-regulatory elements to which a protein binds. This approach leverages the advantages of word-counting, in that position dependencies are implicitly accounted for and local optima are more easily avoided. While we sacrifice guaranteed optimality to prevent the exponential blowup of exhaustive search, we prove that the error

  19. Direct activation of a notochord cis-regulatory module by Brachyury and FoxA in the ascidian Ciona intestinalis.

    Science.gov (United States)

    Passamaneck, Yale J; Katikala, Lavanya; Perrone, Lorena; Dunn, Matthew P; Oda-Ishii, Izumi; Di Gregorio, Anna

    2009-11-01

    The notochord is a defining feature of the chordate body plan. Experiments in ascidian, frog and mouse embryos have shown that co-expression of Brachyury and FoxA class transcription factors is required for notochord development. However, studies on the cis-regulatory sequences mediating the synergistic effects of these transcription factors are complicated by the limited knowledge of notochord genes and cis-regulatory modules (CRMs) that are directly targeted by both. We have identified an easily testable model for such investigations in a 155-bp notochord-specific CRM from the ascidian Ciona intestinalis. This CRM contains functional binding sites for both Ciona Brachyury (Ci-Bra) and FoxA (Ci-FoxA-a). By combining point mutation analysis and misexpression experiments, we demonstrate that binding of both transcription factors to this CRM is necessary and sufficient to activate transcription. To gain insights into the cis-regulatory criteria controlling its activity, we investigated the organization of the transcription factor binding sites within the 155-bp CRM. The 155-bp sequence contains two Ci-Bra binding sites with identical core sequences but opposite orientations, only one of which is required for enhancer activity. Changes in both orientation and spacing of these sites substantially affect the activity of the CRM, as clusters of identical sites found in the Ciona genome with different arrangements are unable to activate transcription in notochord cells. This work presents the first evidence of a synergistic interaction between Brachyury and FoxA in the activation of an individual notochord CRM, and highlights the importance of transcription factor binding site arrangement for its function.

  20. LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

    Science.gov (United States)

    Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

    2014-02-17

    As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of

  1. Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana.

    Science.gov (United States)

    Zhang, Weixiong; Ruan, Jianhua; Ho, Tuan-Hua David; You, Youngsook; Yu, Taotao; Quatrano, Ralph S

    2005-07-15

    A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.

  2. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    Science.gov (United States)

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  3. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

    Directory of Open Access Journals (Sweden)

    Kacy L Gordon

    2015-05-01

    Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2 from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.

  4. Creating and validating cis-regulatory maps of tissue-specific gene expression regulation

    Science.gov (United States)

    O'Connor, Timothy R.; Bailey, Timothy L.

    2014-01-01

    Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088

  5. Mutations in the newly identified RAX regulatory sequence are not a frequent cause of micro/anophthalmia.

    Science.gov (United States)

    Chassaing, Nicolas; Vigouroux, Adeline; Calvas, Patrick

    2009-06-01

    Microphthalmia and anophthalmia are at the severe end of the spectrum of abnormalities in ocular development. A few genes (SOX2, OTX2, RAX, and CHX10) have been implicated in isolated micro/anophthalmia, but causative mutations of these genes explain less than a quarter of these developmental defects. A specifically conserved SOX2/OTX2-mediated RAX expression regulatory sequence has recently been identified. We postulated that mutations in this sequence could lead to micro/anophthalmia, and thus we performed molecular screening of this regulatory element in patients suffering from micro/anophthalmia. Fifty-one patients suffering from nonsyndromic microphthalmia (n = 40) or anophthalmia (n = 11) were included in this study after negative molecular screening for SOX2, OTX2, RAX, and CHX10 mutations. Mutation screening of the RAX regulatory sequence was performed by direct sequencing for these patients. No mutations were identified in the highly conserved RAX regulatory sequence in any of the 51 patients. Mutations in the newly identified RAX regulatory sequence do not represent a frequent cause of nonsyndromic micro/anophthalmia.

  6. Alignment and prediction of cis-regulatory modules based on a probabilistic model of evolution.

    Directory of Open Access Journals (Sweden)

    Xin He

    2009-03-01

    Full Text Available Cross-species comparison has emerged as a powerful paradigm for predicting cis-regulatory modules (CRMs and understanding their evolution. The comparison requires reliable sequence alignment, which remains a challenging task for less conserved noncoding sequences. Furthermore, the existing models of DNA sequence evolution generally do not explicitly treat the special properties of CRM sequences. To address these limitations, we propose a model of CRM evolution that captures different modes of evolution of functional transcription factor binding sites (TFBSs and the background sequences. A particularly novel aspect of our work is a probabilistic model of gains and losses of TFBSs, a process being recognized as an important part of regulatory sequence evolution. We present a computational framework that uses this model to solve the problems of CRM alignment and prediction. Our alignment method is similar to existing methods of statistical alignment but uses the conserved binding sites to improve alignment. Our CRM prediction method deals with the inherent uncertainties of binding site annotations and sequence alignment in a probabilistic framework. In simulated as well as real data, we demonstrate that our program is able to improve both alignment and prediction of CRM sequences over several state-of-the-art methods. Finally, we used alignments produced by our program to study binding site conservation in genome-wide binding data of key transcription factors in the Drosophila blastoderm, with two intriguing results: (i the factor-bound sequences are under strong evolutionary constraints even if their neighboring genes are not expressed in the blastoderm and (ii binding sites in distal bound sequences (relative to transcription start sites tend to be more conserved than those in proximal regions. Our approach is implemented as software, EMMA (Evolutionary Model-based cis-regulatory Module Analysis, ready to be applied in a broad biological context.

  7. Microevolution of cis-regulatory elements: an example from the pair-rule segmentation gene fushi tarazu in the Drosophila melanogaster subgroup.

    Directory of Open Access Journals (Sweden)

    Mohammed Bakkali

    Full Text Available The importance of non-coding DNAs that control transcription is ever noticeable, but the characterization and analysis of the evolution of such DNAs presents challenges not found in the analysis of coding sequences. In this study of the cis-regulatory elements of the pair rule segmentation gene fushi tarazu (ftz I report the DNA sequences of ftz's zebra element (promoter and a region containing the proximal enhancer from a total of 45 fly lines belonging to several populations of the species Drosophila melanogaster, D. simulans, D. sechellia, D. mauritiana, D. yakuba, D. teissieri, D. orena and D. erecta. Both elements evolve at slower rate than ftz synonymous sites, thus reflecting their functional importance. The promoter evolves more slowly than the average for ftz's coding sequence while, on average, the enhancer evolves more rapidly, suggesting more functional constraint and effective purifying selection on the former. Comparative analysis of the number and nature of base substitutions failed to detect significant evidence for positive/adaptive selection in transcription-factor-binding sites. These seem to evolve at similar rates to regions not known to bind transcription factors. Although this result reflects the evolutionary flexibility of the transcription factor binding sites, it also suggests a complex and still not completely understood nature of even the characterized cis-regulatory sequences. The latter seem to contain more functional parts than those currently identified, some of which probably transcription factor binding. This study illustrates ways in which functional assignments of sequences within cis-acting sequences can be used in the search for adaptive evolution, but also highlights difficulties in how such functional assignment and analysis can be carried out.

  8. PlantPAN: Plant promoter analysis navigator, for identifying combinatorial cis-regulatory elements with distance constraint in plant gene groups

    Directory of Open Access Journals (Sweden)

    Huang Hsien-Da

    2008-11-01

    Full Text Available Abstract Background The elucidation of transcriptional regulation in plant genes is important area of research for plant scientists, following the mapping of various plant genomes, such as A. thaliana, O. sativa and Z. mays. A variety of bioinformatic servers or databases of plant promoters have been established, although most have been focused only on annotating transcription factor binding sites in a single gene and have neglected some important regulatory elements (tandem repeats and CpG/CpNpG islands in promoter regions. Additionally, the combinatorial interaction of transcription factors (TFs is important in regulating the gene group that is associated with the same expression pattern. Therefore, a tool for detecting the co-regulation of transcription factors in a group of gene promoters is required. Results This study develops a database-assisted system, PlantPAN (Plant Promoter Analysis Navigator, for recognizing combinatorial cis-regulatory elements with a distance constraint in sets of plant genes. The system collects the plant transcription factor binding profiles from PLACE, TRANSFAC (public release 7.0, AGRIS, and JASPER databases and allows users to input a group of gene IDs or promoter sequences, enabling the co-occurrence of combinatorial transcription factor binding sites (TFBSs within a defined distance (20 bp to 200 bp to be identified. Furthermore, the new resource enables other regulatory features in a plant promoter, such as CpG/CpNpG islands and tandem repeats, to be displayed. The regulatory elements in the conserved regions of the promoters across homologous genes are detected and presented. Conclusion In addition to providing a user-friendly input/output interface, PlantPAN has numerous advantages in the analysis of a plant promoter. Several case studies have established the effectiveness of PlantPAN. This novel analytical resource is now freely available at http://PlantPAN.mbc.nctu.edu.tw.

  9. Characterization of a putative cis-regulatory element that controls transcriptional activity of the pig uroplakin II gene promoter

    International Nuclear Information System (INIS)

    Kwon, Deug-Nam; Park, Mi-Ryung; Park, Jong-Yi; Cho, Ssang-Goo; Park, Chankyu; Oh, Jae-Wook; Song, Hyuk; Kim, Jae-Hwan; Kim, Jin-Hoi

    2011-01-01

    Highlights: → The sequences of -604 to -84 bp of the pUPII promoter contained the region of a putative negative cis-regulatory element. → The core promoter was located in the 5F-1. → Transcription factor HNF4 can directly bind in the pUPII core promoter region, which plays a critical role in controlling promoter activity. → These features of the pUPII promoter are fundamental to development of a target-specific vector. -- Abstract: Uroplakin II (UPII) is a one of the integral membrane proteins synthesized as a major differentiation product of mammalian urothelium. UPII gene expression is bladder specific and differentiation dependent, but little is known about its transcription response elements and molecular mechanism. To identify the cis-regulatory elements in the pig UPII (pUPII) gene promoter region, we constructed pUPII 5' upstream region deletion mutants and demonstrated that each of the deletion mutants participates in controlling the expression of the pUPII gene in human bladder carcinoma RT4 cells. We also identified a new core promoter region and putative negative cis-regulatory element within a minimal promoter region. In addition, we showed that hepatocyte nuclear factor 4 (HNF4) can directly bind in the pUPII core promoter (5F-1) region, which plays a critical role in controlling promoter activity. Transient cotransfection experiments showed that HNF4 positively regulates pUPII gene promoter activity. Thus, the binding element and its binding protein, HNF4 transcription factor, may be involved in the mechanism that specifically regulates pUPII gene transcription.

  10. Computational and molecular dissection of an X-box cis-Regulatory module

    OpenAIRE

    Warrington, Timothy Burton

    2015-01-01

    Ciliopathies are a class of human diseases marked by dysfunction of the cellular organelle, cilia. While many of the molecular components that make up cilia have been identified and studied, comparatively little is understood about the transcriptional regulation of genes encoding these components. The conserved transcription factor Regulatory Factor X (RFX)/DAF-19, which acts through binding to the cis-regulatory motif known as X-box, has been shown to regulate ciliary genes in many animals f...

  11. A New Approach to Sequence Analysis Exemplified by Identification of cis-Elements in Abscisic Acid Inducible Promoters

    DEFF Research Database (Denmark)

    Busk, Peter Kamp; Hallin, Peter Fischer; Salomon, Jesper

    -regulatory elements. We have developed a method for identifying short, conserved motifs in biological sequences such as proteins, DNA and RNA5. This method was used for analysis of approximately 2000 Arabidopsis thaliana promoters that have been shown by DNA array analysis to be induced by abscisic acid6....... These promoters were compared to 28000 promoters that are not induced by abscisic acid. The analysis identified previously described ABA-inducible promoter elements such as ABRE, CE3 and CRT1 but also new cis-elements were found. Furthermore, the list of DNA elements could be used to predict ABA...

  12. CRX ChIP-seq reveals the cis-regulatory architecture of mouse photoreceptors

    NARCIS (Netherlands)

    J.C. Corbo (Joseph); K.A. Lawrence (Karen); M. Karlstetter (Marcus); C.A. Myers (Connie); M. Abdelaziz (Musa); W. Dirkes (William); K. Weigelt (Karin); M. Seifert (Martin); V. Benes (Vladimir); L.G. Fritsche (Lars); B.H.F. Weber (Bernhard); T. Langmann (Thomas)

    2010-01-01

    textabstractApproximately 98% of mammalian DNA is noncoding, yet we understand relatively little about the function of this enigmatic portion of the genome. The cis-regulatory elements that control gene expression reside in noncoding regions and can be identified by mapping the binding sites of

  13. CisSERS: Customizable In Silico Sequence Evaluation for Restriction Sites.

    Science.gov (United States)

    Sharpe, Richard M; Koepke, Tyson; Harper, Artemus; Grimes, John; Galli, Marco; Satoh-Cruz, Mio; Kalyanaraman, Ananth; Evans, Katherine; Kramer, David; Dhingra, Amit

    2016-01-01

    High-throughput sequencing continues to produce an immense volume of information that is processed and assembled into mature sequence data. Data analysis tools are urgently needed that leverage the embedded DNA sequence polymorphisms and consequent changes to restriction sites or sequence motifs in a high-throughput manner to enable biological experimentation. CisSERS was developed as a standalone open source tool to analyze sequence datasets and provide biologists with individual or comparative genome organization information in terms of presence and frequency of patterns or motifs such as restriction enzymes. Predicted agarose gel visualization of the custom analyses results was also integrated to enhance the usefulness of the software. CisSERS offers several novel functionalities, such as handling of large and multiple datasets in parallel, multiple restriction enzyme site detection and custom motif detection features, which are seamlessly integrated with real time agarose gel visualization. Using a simple fasta-formatted file as input, CisSERS utilizes the REBASE enzyme database. Results from CisSERS enable the user to make decisions for designing genotyping by sequencing experiments, reduced representation sequencing, 3'UTR sequencing, and cleaved amplified polymorphic sequence (CAPS) molecular markers for large sample sets. CisSERS is a java based graphical user interface built around a perl backbone. Several of the applications of CisSERS including CAPS molecular marker development were successfully validated using wet-lab experimentation. Here, we present the tool CisSERS and results from in-silico and corresponding wet-lab analyses demonstrating that CisSERS is a technology platform solution that facilitates efficient data utilization in genomics and genetics studies.

  14. Conserved cis-regulatory regions in a large genomic landscape control SHH and BMP-regulated Gremlin1 expression in mouse limb buds

    Directory of Open Access Journals (Sweden)

    Zuniga Aimée

    2012-08-01

    Full Text Available Abstract Background Mouse limb bud is a prime model to study the regulatory interactions that control vertebrate organogenesis. Major aspects of limb bud development are controlled by feedback loops that define a self-regulatory signalling system. The SHH/GREM1/AER-FGF feedback loop forms the core of this signalling system that operates between the posterior mesenchymal organiser and the ectodermal signalling centre. The BMP antagonist Gremlin1 (GREM1 is a critical node in this system, whose dynamic expression is controlled by BMP, SHH, and FGF signalling and key to normal progression of limb bud development. Previous analysis identified a distant cis-regulatory landscape within the neighbouring Formin1 (Fmn1 locus that is required for Grem1 expression, reminiscent of the genomic landscapes controlling HoxD and Shh expression in limb buds. Results Three highly conserved regions (HMCO1-3 were identified within the previously defined critical genomic region and tested for their ability to regulate Grem1 expression in mouse limb buds. Using a combination of BAC and conventional transgenic approaches, a 9 kb region located ~70 kb downstream of the Grem1 transcription unit was identified. This region, termed Grem1 Regulatory Sequence 1 (GRS1, is able to recapitulate major aspects of Grem1 expression, as it drives expression of a LacZ reporter into the posterior and, to a lesser extent, in the distal-anterior mesenchyme. Crossing the GRS1 transgene into embryos with alterations in the SHH and BMP pathways established that GRS1 depends on SHH and is modulated by BMP signalling, i.e. integrates inputs from these pathways. Chromatin immunoprecipitation revealed interaction of endogenous GLI3 proteins with the core cis-regulatory elements in the GRS1 region. As GLI3 is a mediator of SHH signal transduction, these results indicated that SHH directly controls Grem1 expression through the GRS1 region. Finally, all cis-regulatory regions within the Grem1

  15. Functional dissection of the promoter of the pollen-specific gene NTP303 reveals a novel pollen-specific, and conserved cis-regulatory element.

    Science.gov (United States)

    Weterings, K; Schrauwen, J; Wullems, G; Twell, D

    1995-07-01

    Regulatory elements within the promoter of the pollen-specific NTP303 gene from tobacco were analysed by transient and stable expression analyses. Analysis of precisely targeted mutations showed that the NTP303 promoter is not regulated by any of the previously described pollen-specific cis-regulatory elements. However, two adjacent regions from -103 to -86 bp and from -86 to -59 bp were shown to contain sequences which positively regulated the NTP303 promoter. Both of these regions were capable of driving pollen-specific expression from a heterologous promoter, independent of orientation and in an additive manner. The boundaries of the minimal, functional NTP303 promoter were determined to lie within the region -86 to -51 bp. The sequence AAATGA localized from -94 to -89 bp was identified as a novel cis-acting element, of which the TGA triplet was shown to comprise an active part. This element was shown to be completely conserved in the similarly regulated promoter of the Bp 10 gene from Brassica napus encoding a homologue of the NTP303 gene.

  16. CisSERS: Customizable In Silico Sequence Evaluation for Restriction Sites.

    Directory of Open Access Journals (Sweden)

    Richard M Sharpe

    Full Text Available High-throughput sequencing continues to produce an immense volume of information that is processed and assembled into mature sequence data. Data analysis tools are urgently needed that leverage the embedded DNA sequence polymorphisms and consequent changes to restriction sites or sequence motifs in a high-throughput manner to enable biological experimentation. CisSERS was developed as a standalone open source tool to analyze sequence datasets and provide biologists with individual or comparative genome organization information in terms of presence and frequency of patterns or motifs such as restriction enzymes. Predicted agarose gel visualization of the custom analyses results was also integrated to enhance the usefulness of the software. CisSERS offers several novel functionalities, such as handling of large and multiple datasets in parallel, multiple restriction enzyme site detection and custom motif detection features, which are seamlessly integrated with real time agarose gel visualization. Using a simple fasta-formatted file as input, CisSERS utilizes the REBASE enzyme database. Results from CisSERS enable the user to make decisions for designing genotyping by sequencing experiments, reduced representation sequencing, 3'UTR sequencing, and cleaved amplified polymorphic sequence (CAPS molecular markers for large sample sets. CisSERS is a java based graphical user interface built around a perl backbone. Several of the applications of CisSERS including CAPS molecular marker development were successfully validated using wet-lab experimentation. Here, we present the tool CisSERS and results from in-silico and corresponding wet-lab analyses demonstrating that CisSERS is a technology platform solution that facilitates efficient data utilization in genomics and genetics studies.

  17. An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

    Science.gov (United States)

    Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

    2016-08-09

    Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance

  18. A HLA class I cis-regulatory element whose activity can be modulated by hormones.

    Science.gov (United States)

    Sim, B C; Hui, K M

    1994-12-01

    To elucidate the basis of the down-regulation in major histocompatibility complex (MHC) class I gene expression and to identify possible DNA-binding regulatory elements that have the potential to interact with class I MHC genes, we have studied the transcriptional regulation of class I HLA genes in human breast carcinoma cells. A 9 base pair (bp) negative cis-regulatory element (NRE) has been identified using band-shift assays employing DNA sequences derived from the 5'-flanking region of HLA class I genes. This 9-bp element, GTCATGGCG, located within exon I of the HLA class I gene, can potently inhibit the expression of a heterologous thymidine kinase (TK) gene promoter and the HLA enhancer element. Furthermore, this regulatory element can exert its suppressive function in either the sense or anti-sense orientation. More interestingly, NRE can suppress dexamethasone-mediated gene activation in the context of the reported glucocorticoid-responsive element (GRE) in MCF-7 cells but has no influence on the estrogen-mediated transcriptional activation of MCF-7 cells in the context of the reported estrogen-responsive element (ERE). Furthermore, the presence of such a regulatory element within the HLA class I gene whose activity can be modulated by hormones correlates well with our observation that the level of HLA class I gene expression can be down-regulated by hormones in human breast carcinoma cells. Such interactions between negative regulatory elements and specific hormone trans-activators are novel and suggest a versatile form of transcriptional control.

  19. Characterization of Cer-1 cis-regulatory region during early Xenopus development.

    Science.gov (United States)

    Silva, Ana Cristina; Filipe, Mário; Steinbeisser, Herbert; Belo, José António

    2011-05-01

    Cerberus-related molecules are well-known Wnt, Nodal, and BMP inhibitors that have been implicated in different processes including anterior–posterior patterning and left–right asymmetry. In both mouse and frog, two Cerberus-related genes have been isolated, mCer-1 and mCer-2, and Xcer and Xcoco, respectively. Until now, little is known about the mechanisms involved in their transcriptional regulation. Here, we report a heterologous analysis of the mouse Cerberus-1 gene upstream regulatory regions, responsible for its expression in the visceral endodermal cells. Our analysis showed that the consensus sequences for a TATA, CAAT, or GC boxes were absent but a TGTGG sequence was present at position -172 to -168 bp, relative to the ATG. Using a series of deletion constructs and transient expression in Xenopus embryos, we found that a fragment of 1.4 kb of Cer-1 promoter sequence could reproduce the endogenous expression pattern of Xenopus cerberus. A 0.7-kb mcer-1 upstream region was able to drive reporter expression to the involuting mesendodermal cells, while further deletions abolished reporter gene expression. Our results suggest that although no sequence similarity was found between mouse and Xenopus cerberus cis-regulatory regions, the signaling cascades regulating cerberus expression, during gastrulation, is conserved.

  20. Predicting tissue specific cis-regulatory modules in the human genome using pairs of co-occurring motifs

    Directory of Open Access Journals (Sweden)

    Girgis Hani Z

    2012-02-01

    Full Text Available Abstract Background Researchers seeking to unlock the genetic basis of human physiology and diseases have been studying gene transcription regulation. The temporal and spatial patterns of gene expression are controlled by mainly non-coding elements known as cis-regulatory modules (CRMs and epigenetic factors. CRMs modulating related genes share the regulatory signature which consists of transcription factor (TF binding sites (TFBSs. Identifying such CRMs is a challenging problem due to the prohibitive number of sequence sets that need to be analyzed. Results We formulated the challenge as a supervised classification problem even though experimentally validated CRMs were not required. Our efforts resulted in a software system named CrmMiner. The system mines for CRMs in the vicinity of related genes. CrmMiner requires two sets of sequences: a mixed set and a control set. Sequences in the vicinity of the related genes comprise the mixed set, whereas the control set includes random genomic sequences. CrmMiner assumes that a large percentage of the mixed set is made of background sequences that do not include CRMs. The system identifies pairs of closely located motifs representing vertebrate TFBSs that are enriched in the training mixed set consisting of 50% of the gene loci. In addition, CrmMiner selects a group of the enriched pairs to represent the tissue-specific regulatory signature. The mixed and the control sets are searched for candidate sequences that include any of the selected pairs. Next, an optimal Bayesian classifier is used to distinguish candidates found in the mixed set from their control counterparts. Our study proposes 62 tissue-specific regulatory signatures and putative CRMs for different human tissues and cell types. These signatures consist of assortments of ubiquitously expressed TFs and tissue-specific TFs. Under controlled settings, CrmMiner identified known CRMs in noisy sets up to 1:25 signal-to-noise ratio. CrmMiner was

  1. Prediction of tissue-specific cis-regulatory modules using Bayesian networks and regression trees

    Directory of Open Access Journals (Sweden)

    Chen Xiaoyu

    2007-12-01

    Full Text Available Abstract Background In vertebrates, a large part of gene transcriptional regulation is operated by cis-regulatory modules. These modules are believed to be regulating much of the tissue-specificity of gene expression. Results We develop a Bayesian network approach for identifying cis-regulatory modules likely to regulate tissue-specific expression. The network integrates predicted transcription factor binding site information, transcription factor expression data, and target gene expression data. At its core is a regression tree modeling the effect of combinations of transcription factors bound to a module. A new unsupervised EM-like algorithm is developed to learn the parameters of the network, including the regression tree structure. Conclusion Our approach is shown to accurately identify known human liver and erythroid-specific modules. When applied to the prediction of tissue-specific modules in 10 different tissues, the network predicts a number of important transcription factor combinations whose concerted binding is associated to specific expression.

  2. [Analysis of cis-regulatory element distribution in gene promoters of Gossypium raimondii and Arabidopsis thaliana].

    Science.gov (United States)

    Sun, Gao-Fei; He, Shou-Pu; Du, Xiong-Ming

    2013-10-01

    Cotton genomic studies have boomed since the release of Gossypium raimondii draft genome. In this study, cis-regulatory element (CRE) in 1 kb length sequence upstream 5' UTR of annotated genes were selected and scanned in the Arabidopsis thaliana (At) and Gossypium raimondii (Gr) genomes, based on the database of PLACE (Plant cis-acting Regulatory DNA Elements). According to the definition of this study, 44 (12.3%) and 57 (15.5%) CREs presented "peak-like" distribution in the 1 kb selected sequences of both genomes, respectively. Thirty-four of them were peak-like distributed in both genomes, which could be further categorized into 4 types based on their core sequences. The coincidence of TATABOX peak position and their actual position ((-) -30 bp) indicated that the position of a common CRE was conservative in different genes, which suggested that the peak position of these CREs was their possible actual position of transcription factors. The position of a common CRE was also different between the two genomes due to stronger length variation of 5' UTR in Gr than At. Furthermore, most of the peak-like CREs were located in the region of -110 bp-0 bp, which suggested that concentrated distribution might be conductive to the interaction of transcription factors, and then regulate the gene expression in downstream.

  3. Characterization of Putative cis-Regulatory Elements in Genes Preferentially Expressed in Arabidopsis Male Meiocytes

    Directory of Open Access Journals (Sweden)

    Junhua Li

    2014-01-01

    Full Text Available Meiosis is essential for plant reproduction because it is the process during which homologous chromosome pairing, synapsis, and meiotic recombination occur. The meiotic transcriptome is difficult to investigate because of the size of meiocytes and the confines of anther lobes. The recent development of isolation techniques has enabled the characterization of transcriptional profiles in male meiocytes of Arabidopsis. Gene expression in male meiocytes shows unique features. The direct interaction of transcription factors (TFs with DNA regulatory sequences forms the basis for the specificity of transcriptional regulation. Here, we identified putative cis-regulatory elements (CREs associated with male meiocyte-expressed genes using in silico tools. The upstream regions (1 kb of the top 50 genes preferentially expressed in Arabidopsis meiocytes possessed conserved motifs. These motifs are putative binding sites of TFs, some of which share common functions, such as roles in cell division. In combination with cell-type-specific analysis, our findings could be a substantial aid for the identification and experimental verification of the protein-DNA interactions for the specific TFs that drive gene expression in meiocytes.

  4. Network-directed cis-mediator analysis of normal prostate tissue expression profiles reveals downstream regulatory associations of prostate cancer susceptibility loci.

    Science.gov (United States)

    Larson, Nicholas B; McDonnell, Shannon K; Fogarty, Zach; Larson, Melissa C; Cheville, John; Riska, Shaun; Baheti, Saurabh; Weber, Alexandra M; Nair, Asha A; Wang, Liang; O'Brien, Daniel; Davila, Jaime; Schaid, Daniel J; Thibodeau, Stephen N

    2017-10-17

    Large-scale genome-wide association studies have identified multiple single-nucleotide polymorphisms associated with risk of prostate cancer. Many of these genetic variants are presumed to be regulatory in nature; however, follow-up expression quantitative trait loci (eQTL) association studies have to-date been restricted largely to cis -acting associations due to study limitations. While trans -eQTL scans suffer from high testing dimensionality, recent evidence indicates most trans -eQTL associations are mediated by cis -regulated genes, such as transcription factors. Leveraging a data-driven gene co-expression network, we conducted a comprehensive cis -mediator analysis using RNA-Seq data from 471 normal prostate tissue samples to identify downstream regulatory associations of previously identified prostate cancer risk variants. We discovered multiple trans -eQTL associations that were significantly mediated by cis -regulated transcripts, four of which involved risk locus 17q12, proximal transcription factor HNF1B , and target trans -genes with known HNF response elements ( MIA2 , SRC , SEMA6A , KIF12 ). We additionally identified evidence of cis -acting down-regulation of MSMB via rs10993994 corresponding to reduced co-expression of NDRG1 . The majority of these cis -mediator relationships demonstrated trans -eQTL replicability in 87 prostate tissue samples from the Gene-Tissue Expression Project. These findings provide further biological context to known risk loci and outline new hypotheses for investigation into the etiology of prostate cancer.

  5. Functional evolution of cis-regulatory modules at a homeotic gene in Drosophila.

    Directory of Open Access Journals (Sweden)

    Margaret C W Ho

    2009-11-01

    Full Text Available It is a long-held belief in evolutionary biology that the rate of molecular evolution for a given DNA sequence is inversely related to the level of functional constraint. This belief holds true for the protein-coding homeotic (Hox genes originally discovered in Drosophila melanogaster. Expression of the Hox genes in Drosophila embryos is essential for body patterning and is controlled by an extensive array of cis-regulatory modules (CRMs. How the regulatory modules functionally evolve in different species is not clear. A comparison of the CRMs for the Abdominal-B gene from different Drosophila species reveals relatively low levels of overall sequence conservation. However, embryonic enhancer CRMs from other Drosophila species direct transgenic reporter gene expression in the same spatial and temporal patterns during development as their D. melanogaster orthologs. Bioinformatic analysis reveals the presence of short conserved sequences within defined CRMs, representing gap and pair-rule transcription factor binding sites. One predicted binding site for the gap transcription factor KRUPPEL in the IAB5 CRM was found to be altered in Superabdominal (Sab mutations. In Sab mutant flies, the third abdominal segment is transformed into a copy of the fifth abdominal segment. A model for KRUPPEL-mediated repression at this binding site is presented. These findings challenge our current understanding of the relationship between sequence evolution at the molecular level and functional activity of a CRM. While the overall sequence conservation at Drosophila CRMs is not distinctive from neighboring genomic regions, functionally critical transcription factor binding sites within embryonic enhancer CRMs are highly conserved. These results have implications for understanding mechanisms of gene expression during embryonic development, enhancer function, and the molecular evolution of eukaryotic regulatory modules.

  6. Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters.

    Science.gov (United States)

    Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo

    2005-02-01

    cis-Acting regulatory elements are important molecular switches involved in the transcriptional regulation of a dynamic network of gene activities controlling various biological processes, including abiotic stress responses, hormone responses and developmental processes. In particular, understanding regulatory gene networks in stress response cascades depends on successful functional analyses of cis-acting elements. The ever-improving accuracy of transcriptome expression profiling has led to the identification of various combinations of cis-acting elements in the promoter regions of stress-inducible genes involved in stress and hormone responses. Here we discuss major cis-acting elements, such as the ABA-responsive element (ABRE) and the dehydration-responsive element/C-repeat (DRE/CRT), that are a vital part of ABA-dependent and ABA-independent gene expression in osmotic and cold stress responses.

  7. PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches

    Science.gov (United States)

    Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David

    2001-01-01

    Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681

  8. Accelerated Evolution of Conserved Noncoding Sequences in theHuman Genome

    Energy Technology Data Exchange (ETDEWEB)

    Prambhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, EdwardM.

    2006-07-06

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detect"cryptic" functional elements, which are too weakly conserved amongmammals to distinguish from nonfunctional DNA. To address this problem,we explored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  9. Lmx1b-targeted cis-regulatory modules involved in limb dorsalization.

    Science.gov (United States)

    Haro, Endika; Watson, Billy A; Feenstra, Jennifer M; Tegeler, Luke; Pira, Charmaine U; Mohan, Subburaman; Oberg, Kerby C

    2017-06-01

    Lmx1b is a homeodomain transcription factor responsible for limb dorsalization. Despite striking double-ventral (loss-of-function) and double-dorsal (gain-of-function) limb phenotypes, no direct gene targets in the limb have been confirmed. To determine direct targets, we performed a chromatin immunoprecipitation against Lmx1b in mouse limbs at embryonic day 12.5 followed by next-generation sequencing (ChIP-seq). Nearly 84% ( n =617) of the Lmx1b-bound genomic intervals (LBIs) identified overlap with chromatin regulatory marks indicative of potential cis -regulatory modules (PCRMs). In addition, 73 LBIs mapped to CRMs that are known to be active during limb development. We compared Lmx1b-bound PCRMs with genes regulated by Lmx1b and found 292 PCRMs within 1 Mb of 254 Lmx1b-regulated genes. Gene ontological analysis suggests that Lmx1b targets extracellular matrix production, bone/joint formation, axonal guidance, vascular development, cell proliferation and cell movement. We validated the functional activity of a PCRM associated with joint-related Gdf5 that provides a mechanism for Lmx1b-mediated joint modification and a PCRM associated with Lmx1b that suggests a role in autoregulation. This is the first report to describe genome-wide Lmx1b binding during limb development, directly linking Lmx1b to targets that accomplish limb dorsalization. © 2017. Published by The Company of Biologists Ltd.

  10. A saturation screen for cis-acting regulatory DNA in the Hox genes of Ciona intestinalis

    Energy Technology Data Exchange (ETDEWEB)

    Keys, David N.; Lee, Byung-in; Di Gregorio, Anna; Harafuji, Naoe; Detter, Chris; Wang, Mei; Kahsai, Orsalem; Ahn, Sylvia; Arellano, Andre; Zhang, Quin; Trong, Stephan; Doyle, Sharon A.; Satoh, Noriyuki; Satou, Yutaka; Saiga, Hidetoshi; Christian, Allen; Rokhsar, Dan; Hawkins, Trevor L.; Levine, Mike; Richardson, Paul

    2005-01-05

    A screen for the systematic identification of cis-regulatory elements within large (>100 kb) genomic domains containing Hox genes was performed by using the basal chordate Ciona intestinalis. Randomly generated DNA fragments from bacterial artificial chromosomes containing two clusters of Hox genes were inserted into a vector upstream of a minimal promoter and lacZ reporter gene. A total of 222 resultant fusion genes were separately electroporated into fertilized eggs, and their regulatory activities were monitored in larvae. In sum, 21 separable cis-regulatory elements were found. These include eight Hox linked domains that drive expression in nested anterior-posterior domains of ectodermally derived tissues. In addition to vertebrate-like CNS regulation, the discovery of cis-regulatory domains that drive epidermal transcription suggests that C. intestinalis has arthropod-like Hox patterning in the epidermis.

  11. Identification of putative cis-regulatory elements in Cryptosporidium parvum by de novo pattern finding

    Directory of Open Access Journals (Sweden)

    Kissinger Jessica C

    2007-01-01

    Full Text Available Abstract Background Cryptosporidium parvum is a unicellular eukaryote in the phylum Apicomplexa. It is an obligate intracellular parasite that causes diarrhea and is a significant AIDS-related pathogen. Cryptosporidium parvum is not amenable to long-term laboratory cultivation or classical molecular genetic analysis. The parasite exhibits a complex life cycle, a broad host range, and fundamental mechanisms of gene regulation remain unknown. We have used data from the recently sequenced genome of this organism to uncover clues about gene regulation in C. parvum. We have applied two pattern finding algorithms MEME and AlignACE to identify conserved, over-represented motifs in the 5' upstream regions of genes in C. parvum. To support our findings, we have established comparative real-time -PCR expression profiles for the groups of genes examined computationally. Results We find that groups of genes that share a function or belong to a common pathway share upstream motifs. Different motifs are conserved upstream of different groups of genes. Comparative real-time PCR studies show co-expression of genes within each group (in sub-sets during the life cycle of the parasite, suggesting co-regulation of these genes may be driven by the use of conserved upstream motifs. Conclusion This is one of the first attempts to characterize cis-regulatory elements in the absence of any previously characterized elements and with very limited expression data (seven genes only. Using de novo pattern finding algorithms, we have identified specific DNA motifs that are conserved upstream of genes belonging to the same metabolic pathway or gene family. We have demonstrated the co-expression of these genes (often in subsets using comparative real-time-PCR experiments thus establishing evidence for these conserved motifs as putative cis-regulatory elements. Given the lack of prior information concerning expression patterns and organization of promoters in C. parvum we

  12. Patterns of cis regulatory variation in diverse human populations.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    Full Text Available The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for

  13. Systematic identification of cis-regulatory sequences active in mouse and human embryonic stem cells.

    Directory of Open Access Journals (Sweden)

    Marica Grskovic

    2007-08-01

    Full Text Available Understanding the transcriptional regulation of pluripotent cells is of fundamental interest and will greatly inform efforts aimed at directing differentiation of embryonic stem (ES cells or reprogramming somatic cells. We first analyzed the transcriptional profiles of mouse ES cells and primordial germ cells and identified genes upregulated in pluripotent cells both in vitro and in vivo. These genes are enriched for roles in transcription, chromatin remodeling, cell cycle, and DNA repair. We developed a novel computational algorithm, CompMoby, which combines analyses of sequences both aligned and non-aligned between different genomes with a probabilistic segmentation model to systematically predict short DNA motifs that regulate gene expression. CompMoby was used to identify conserved overrepresented motifs in genes upregulated in pluripotent cells. We show that the motifs are preferentially active in undifferentiated mouse ES and embryonic germ cells in a sequence-specific manner, and that they can act as enhancers in the context of an endogenous promoter. Importantly, the activity of the motifs is conserved in human ES cells. We further show that the transcription factor NF-Y specifically binds to one of the motifs, is differentially expressed during ES cell differentiation, and is required for ES cell proliferation. This study provides novel insights into the transcriptional regulatory networks of pluripotent cells. Our results suggest that this systematic approach can be broadly applied to understanding transcriptional networks in mammalian species.

  14. A cis-regulatory sequence driving metabolic insecticide resistance in mosquitoes: functional characterisation and signatures of selection.

    Science.gov (United States)

    Wilding, Craig S; Smith, Ian; Lynd, Amy; Yawson, Alexander Egyir; Weetman, David; Paine, Mark J I; Donnelly, Martin J

    2012-09-01

    Although cytochrome P450 (CYP450) enzymes are frequently up-regulated in mosquitoes resistant to insecticides, no regulatory motifs driving these expression differences with relevance to wild populations have been identified. Transposable elements (TEs) are often enriched upstream of those CYP450s involved in insecticide resistance, leading to the assumption that they contribute regulatory motifs that directly underlie the resistance phenotype. A partial CuRE1 (Culex Repetitive Element 1) transposable element is found directly upstream of CYP9M10, a cytochrome P450 implicated previously in larval resistance to permethrin in the ISOP450 strain of Culex quinquefasciatus, but is absent from the equivalent genomic region of a susceptible strain. Via expression of CYP9M10 in Escherichia coli we have now demonstrated time- and NADPH-dependant permethrin metabolism, prerequisites for confirmation of a role in metabolic resistance, and through qPCR shown that CYP9M10 is >20-fold over-expressed in ISOP450 compared to a susceptible strain. In a fluorescent reporter assay the region upstream of CYP9M10 from ISOP450 drove 10× expression compared to the equivalent region (lacking CuRE1) from the susceptible strain. Close correspondence with the gene expression fold-change implicates the upstream region including CuRE1 as a cis-regulatory element involved in resistance. Only a single CuRE1 bearing allele, identical to the CuRE1 bearing allele in the resistant strain, is found throughout Sub-Saharan Africa, in contrast to the diversity encountered in non-CuRE1 alleles. This suggests a single origin and subsequent spread due to selective advantage. CuRE1 is detectable using a simple diagnostic. When applied to C. quinquefasciatus larvae from Ghana we have demonstrated a significant association with permethrin resistance in multiple field sites (mean Odds Ratio = 3.86) suggesting this marker has relevance to natural populations of vector mosquitoes. However, when CuRE1 was excised

  15. Genetic mapping uncovers cis-regulatory landscape of RNA editing.

    Science.gov (United States)

    Ramaswami, Gokul; Deng, Patricia; Zhang, Rui; Anna Carbone, Mary; Mackay, Trudy F C; Li, Jin Billy

    2015-09-16

    Adenosine-to-inosine (A-to-I) RNA editing, catalysed by ADAR enzymes conserved in metazoans, plays an important role in neurological functions. Although the fine-tuning mechanism provided by A-to-I RNA editing is important, the underlying rules governing ADAR substrate recognition are not well understood. We apply a quantitative trait loci (QTL) mapping approach to identify genetic variants associated with variability in RNA editing. With very accurate measurement of RNA editing levels at 789 sites in 131 Drosophila melanogaster strains, here we identify 545 editing QTLs (edQTLs) associated with differences in RNA editing. We demonstrate that many edQTLs can act through changes in the local secondary structure for edited dsRNAs. Furthermore, we find that edQTLs located outside of the edited dsRNA duplex are enriched in secondary structure, suggesting that distal dsRNA structure beyond the editing site duplex affects RNA editing efficiency. Our work will facilitate the understanding of the cis-regulatory code of RNA editing.

  16. Identifying cis-mediators for trans-eQTLs across many human tissues using genomic mediation analysis.

    Science.gov (United States)

    Yang, Fan; Wang, Jiebiao; Pierce, Brandon L; Chen, Lin S

    2017-11-01

    The impact of inherited genetic variation on gene expression in humans is well-established. The majority of known expression quantitative trait loci (eQTLs) impact expression of local genes ( cis -eQTLs). More research is needed to identify effects of genetic variation on distant genes ( trans -eQTLs) and understand their biological mechanisms. One common trans -eQTLs mechanism is "mediation" by a local ( cis ) transcript. Thus, mediation analysis can be applied to genome-wide SNP and expression data in order to identify transcripts that are " cis -mediators" of trans -eQTLs, including those " cis -hubs" involved in regulation of many trans -genes. Identifying such mediators helps us understand regulatory networks and suggests biological mechanisms underlying trans -eQTLs, both of which are relevant for understanding susceptibility to complex diseases. The multitissue expression data from the Genotype-Tissue Expression (GTEx) program provides a unique opportunity to study cis -mediation across human tissue types. However, the presence of complex hidden confounding effects in biological systems can make mediation analyses challenging and prone to confounding bias, particularly when conducted among diverse samples. To address this problem, we propose a new method: Genomic Mediation analysis with Adaptive Confounding adjustment (GMAC). It enables the search of a very large pool of variables, and adaptively selects potential confounding variables for each mediation test. Analyses of simulated data and GTEx data demonstrate that the adaptive selection of confounders by GMAC improves the power and precision of mediation analysis. Application of GMAC to GTEx data provides new insights into the observed patterns of cis -hubs and trans -eQTL regulation across tissue types. © 2017 Yang et al.; Published by Cold Spring Harbor Laboratory Press.

  17. Strand-specific RNA-seq reveals widespread occurrence of novel cis-natural antisense transcripts in rice

    Directory of Open Access Journals (Sweden)

    Lu Tingting

    2012-12-01

    Full Text Available Abstract Background Cis-natural antisense transcripts (cis-NATs are RNAs transcribed from the antisense strand of a gene locus, and are complementary to the RNA transcribed from the sense strand. Common techniques including microarray approach and analysis of transcriptome databases are the major ways to globally identify cis-NATs in various eukaryotic organisms. Genome-wide in silico analysis has identified a large number of cis-NATs that may generate endogenous short interfering RNAs (nat-siRNAs, which participate in important biogenesis mechanisms for transcriptional and post-transcriptional regulation in rice. However, the transcriptomes are yet to be deeply sequenced to comprehensively investigate cis-NATs. Results We applied high-throughput strand-specific complementary DNA sequencing technology (ssRNA-seq to deeply sequence mRNA for assessing sense and antisense transcripts that were derived under salt, drought and cold stresses, and normal conditions, in the model plant rice (Oryza sativa. Combined with RAP-DB genome annotation (the Rice Annotation Project Database build-5 data set, 76,013 transcripts corresponding to 45,844 unique gene loci were assembled, in which 4873 gene loci were newly identified. Of 3819 putative rice cis-NATs, 2292 were detected as expressed and giving rise to small RNAs from their overlapping regions through integrated analysis of ssRNA-seq data and small RNA data. Among them, 503 cis-NATs seemed to be associated with specific conditions. The deep sequence data from isolated epidermal cells of rice seedlings further showed that 54.0% of cis-NATs were expressed simultaneously in a population of homogenous cells. Nearly 9.7% of rice transcripts were involved in one-to-one or many-to-many cis-NATs formation. Furthermore, only 17.4-34.7% of 223 many-to-many cis-NAT groups were all expressed and generated nat-siRNAs, indicating that only some cis-NAT groups may be involved in complex regulatory networks. Conclusions

  18. In silico discovery of transcription regulatory elements in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Le Roch Karine G

    2008-02-01

    Full Text Available Abstract Background With the sequence of the Plasmodium falciparum genome and several global mRNA and protein life cycle expression profiling projects now completed, elucidating the underlying networks of transcriptional control important for the progression of the parasite life cycle is highly pertinent to the development of new anti-malarials. To date, relatively little is known regarding the specific mechanisms the parasite employs to regulate gene expression at the mRNA level, with studies of the P. falciparum genome sequence having revealed few cis-regulatory elements and associated transcription factors. Although it is possible the parasite may evoke mechanisms of transcriptional control drastically different from those used by other eukaryotic organisms, the extreme AT-rich nature of P. falciparum intergenic regions (~90% AT presents significant challenges to in silico cis-regulatory element discovery. Results We have developed an algorithm called Gene Enrichment Motif Searching (GEMS that uses a hypergeometric-based scoring function and a position-weight matrix optimization routine to identify with high-confidence regulatory elements in the nucleotide-biased and repeat sequence-rich P. falciparum genome. When applied to promoter regions of genes contained within 21 co-expression gene clusters generated from P. falciparum life cycle microarray data using the semi-supervised clustering algorithm Ontology-based Pattern Identification, GEMS identified 34 putative cis-regulatory elements associated with a variety of parasite processes including sexual development, cell invasion, antigenic variation and protein biosynthesis. Among these candidates were novel motifs, as well as many of the elements for which biological experimental evidence already exists in the Plasmodium literature. To provide evidence for the biological relevance of a cell invasion-related element predicted by GEMS, reporter gene and electrophoretic mobility shift assays

  19. kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets

    Science.gov (United States)

    Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.

    2013-01-01

    Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147

  20. Combinatorial Cis-regulation in Saccharomyces Species

    Directory of Open Access Journals (Sweden)

    Aaron T. Spivak

    2016-03-01

    Full Text Available Transcriptional control of gene expression requires interactions between the cis-regulatory elements (CREs controlling gene promoters. We developed a sensitive computational method to identify CRE combinations with conserved spacing that does not require genome alignments. When applied to seven sensu stricto and sensu lato Saccharomyces species, 80% of the predicted interactions displayed some evidence of combinatorial transcriptional behavior in several existing datasets including: (1 chromatin immunoprecipitation data for colocalization of transcription factors, (2 gene expression data for coexpression of predicted regulatory targets, and (3 gene ontology databases for common pathway membership of predicted regulatory targets. We tested several predicted CRE interactions with chromatin immunoprecipitation experiments in a wild-type strain and strains in which a predicted cofactor was deleted. Our experiments confirmed that transcription factor (TF occupancy at the promoters of the CRE combination target genes depends on the predicted cofactor while occupancy of other promoters is independent of the predicted cofactor. Our method has the additional advantage of identifying regulatory differences between species. By analyzing the S. cerevisiae and S. bayanus genomes, we identified differences in combinatorial cis-regulation between the species and showed that the predicted changes in gene regulation explain several of the species-specific differences seen in gene expression datasets. In some instances, the same CRE combinations appear to regulate genes involved in distinct biological processes in the two different species. The results of this research demonstrate that (1 combinatorial cis-regulation can be inferred by multi-genome analysis and (2 combinatorial cis-regulation can explain differences in gene expression between species.

  1. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.

    Science.gov (United States)

    Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou

    2011-11-01

    Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.

  2. Genome-wide identification and quantification of cis- and trans-regulated genes responding to Marek’s disease virus infection via analysis of allele-specific expression

    Directory of Open Access Journals (Sweden)

    Sean eMaceachern

    2012-01-01

    Full Text Available Marek’s disease (MD is a commercially important neoplastic disease of chickens caused by Marek’s disease virus (MDV, an oncogenic alphaherpesvirus. Selecting for increased genetic resistance to MD is a control strategy that can augment vaccinal control measures. To identify high-confidence candidate MD resistance genes, we conducted a genome-wide screen for allele-specific expression (ASE amongst F1 progeny of two inbred chicken lines that differ in MD resistance. High throughput sequencing was used to profile transcriptomes from pools of uninfected and infected individuals at 4 days post-infection to identify any genes showing ASE in response to MDV infection. RNA sequencing identified 22,655 single nucleotide polymorphisms (SNPs of which 5,360 in 3,773 genes exhibited significant allelic imbalance. Illumina GoldenGate assays were subsequently used to quantify regulatory variation controlled at the gene (cis and elsewhere in the genome (trans by examining differences in expression between F1 individuals and artificial F1 RNA pools over 6 time periods in 1,536 of the most significant SNPs identified by RNA sequencing. Allelic imbalance as a result of cis-regulatory changes was confirmed in 861 of the 1,233 GoldenGate assays successfully examined. Furthermore we have identified 7 genes that display trans-regulation only in infected animals and approximately 500 SNP that show a complex interaction between cis- and trans-regulatory changes. Our results indicate ASE analyses are a powerful approach to identify regulatory variation responsible for differences in transcript abundance in genes underlying complex traits. And the genes with SNPs exhibiting ASE provide a strong foundation to further investigate the causative polymorphisms and genetic mechanisms for MD resistance. Finally, the methods used here for identifying specific genes and SNPs may have practical implications for applying marker-assisted selection to complex traits that are

  3. Identifying Cis-Regulatory Changes Involved in the Evolution of Aerobic Fermentation in Yeasts

    Science.gov (United States)

    Lin, Zhenguo; Wang, Tzi-Yuan; Tsai, Bing-Shi; Wu, Fang-Ting; Yu, Fu-Jung; Tseng, Yu-Jung; Sung, Huang-Mo; Li, Wen-Hsiung

    2013-01-01

    Gene regulation change has long been recognized as an important mechanism for phenotypic evolution. We used the evolution of yeast aerobic fermentation as a model to explore how gene regulation has evolved and how this process has contributed to phenotypic evolution and adaptation. Most eukaryotes fully oxidize glucose to CO2 and H2O in mitochondria to maximize energy yield, whereas some yeasts, such as Saccharomyces cerevisiae and its relatives, predominantly ferment glucose into ethanol even in the presence of oxygen, a phenomenon known as aerobic fermentation. We examined the genome-wide gene expression levels among 12 different yeasts and found that a group of genes involved in the mitochondrial respiration process showed the largest reduction in gene expression level during the evolution of aerobic fermentation. Our analysis revealed that the downregulation of these genes was significantly associated with massive loss of binding motifs of Cbf1p in the fermentative yeasts. Our experimental assays confirmed the binding of Cbf1p to the predicted motif and the activator role of Cbf1p. In summary, our study laid a foundation to unravel the long-time mystery about the genetic basis of evolution of aerobic fermentation, providing new insights into understanding the role of cis-regulatory changes in phenotypic evolution. PMID:23650209

  4. Deciphering Cis-Regulatory Element Mediated Combinatorial Regulation in Rice under Blast Infected Condition.

    Directory of Open Access Journals (Sweden)

    Arindam Deb

    Full Text Available Combinations of cis-regulatory elements (CREs present at the promoters facilitate the binding of several transcription factors (TFs, thereby altering the consequent gene expressions. Due to the eminent complexity of the regulatory mechanism, the combinatorics of CRE-mediated transcriptional regulation has been elusive. In this work, we have developed a new methodology that quantifies the co-occurrence tendencies of CREs present in a set of promoter sequences; these co-occurrence scores are filtered in three consecutive steps to test their statistical significance; and the significantly co-occurring CRE pairs are presented as networks. These networks of co-occurring CREs are further transformed to derive higher order of regulatory combinatorics. We have further applied this methodology on the differentially up-regulated gene-sets of rice tissues under fungal (Magnaporthe infected conditions to demonstrate how it helps to understand the CRE-mediated combinatorial gene regulation. Our analysis includes a wide spectrum of biologically important results. The CRE pairs having a strong tendency to co-occur often exhibit very similar joint distribution patterns at the promoters of rice. We couple the network approach with experimental results of plant gene regulation and defense mechanisms and find evidences of auto and cross regulation among TF families, cross-talk among multiple hormone signaling pathways, similarities and dissimilarities in regulatory combinatorics between different tissues, etc. Our analyses have pointed a highly distributed nature of the combinatorial gene regulation facilitating an efficient alteration in response to fungal attack. All together, our proposed methodology could be an important approach in understanding the combinatorial gene regulation. It can be further applied to unravel the tissue and/or condition specific combinatorial gene regulation in other eukaryotic systems with the availability of annotated genomic

  5. Preaxial polydactyly/triphalangeal thumb is associated with changed transcription factor-binding affinity in a family with a novel point mutation in the long-range cis-regulatory element ZRS

    DEFF Research Database (Denmark)

    Farooq, Muhammad; Troelsen, Jesper T; Boyd, Mette

    2010-01-01

    A cis-regulatory sequence also known as zone of polarizing activity (ZPA) regulatory sequence (ZRS) located in intron 5 of LMBR1 is essential for expression of sonic hedgehog (SHH) in the developing posterior limb bud mesenchyme. Even though many point mutations causing preaxial duplication defects...... demonstrated a marked difference between wild-type and the mutant probe, which uniquely bound one or several transcription factors extracted from Caco-2 cells. This finding supports a model in which ectopic anterior SHH expression in the developing limb results from abnormal binding of one or more...

  6. A robust approach to identifying tissue-specific gene expression regulatory variants using personalized human induced pluripotent stem cells.

    Directory of Open Access Journals (Sweden)

    Je-Hyuk Lee

    2009-11-01

    Full Text Available Normal variation in gene expression due to regulatory polymorphisms is often masked by biological and experimental noise. In addition, some regulatory polymorphisms may become apparent only in specific tissues. We derived human induced pluripotent stem (iPS cells from adult skin primary fibroblasts and attempted to detect tissue-specific cis-regulatory variants using in vitro cell differentiation. We used padlock probes and high-throughput sequencing for digital RNA allelotyping and measured allele-specific gene expression in primary fibroblasts, lymphoblastoid cells, iPS cells, and their differentiated derivatives. We show that allele-specific expression is both cell type and genotype-dependent, but the majority of detectable allele-specific expression loci remains consistent despite large changes in the cell type or the experimental condition following iPS reprogramming, except on the X-chromosome. We show that our approach to mapping cis-regulatory variants reduces in vitro experimental noise and reveals additional tissue-specific variants using skin-derived human iPS cells.

  7. Cis-regulatory signatures of orthologous stress-associated bZIP transcription factors from rice, sorghum and Arabidopsis based on phylogenetic footprints

    Directory of Open Access Journals (Sweden)

    Xu Fuyu

    2012-09-01

    Full Text Available Abstract Background The potential contribution of upstream sequence variation to the unique features of orthologous genes is just beginning to be unraveled. A core subset of stress-associated bZIP transcription factors from rice (Oryza sativa formed ten clusters of orthologous groups (COG with genes from the monocot sorghum (Sorghum bicolor and dicot Arabidopsis (Arabidopsis thaliana. The total cis-regulatory information content of each stress-associated COG was examined by phylogenetic footprinting to reveal ortholog-specific, lineage-specific and species-specific conservation patterns. Results The most apparent pattern observed was the occurrence of spatially conserved ‘core modules’ among the COGs but not among paralogs. These core modules are comprised of various combinations of two to four putative transcription factor binding site (TFBS classes associated with either developmental or stress-related functions. Outside the core modules are specific stress (ABA, oxidative, abiotic, biotic or organ-associated signals, which may be functioning as ‘regulatory fine-tuners’ and further define lineage-specific and species-specific cis-regulatory signatures. Orthologous monocot and dicot promoters have distinct TFBS classes involved in disease and oxidative-regulated expression, while the orthologous rice and sorghum promoters have distinct combinations of root-specific signals, a pattern that is not particularly conserved in Arabidopsis. Conclusions Patterns of cis-regulatory conservation imply that each ortholog has distinct signatures, further suggesting that they are potentially unique in a regulatory context despite the presumed conservation of broad biological function during speciation. Based on the observed patterns of conservation, we postulate that core modules are likely primary determinants of basal developmental programming, which may be integrated with and further elaborated by additional intrinsic or extrinsic signals in

  8. Human apolipoprotein CIII gene expression is regulated by positive and negative cis-acting elements and tissue-specific protein factors

    International Nuclear Information System (INIS)

    Reue, K.; Leff, T.; Breslow, J.L.

    1988-01-01

    Apolipoprotein CIII (apoCIII) is a major protein constituent of triglyceride-rich lipoproteins and is synthesized primarily in the liver. Cis-acting DNA elements required for liver-specific apoCIII gene transcription were identified with transient expression assays in the human hepatoma (HepG2) and epithelial carcinoma (HeLa) cell lines. In liver cells, 821 nucleotides of the human apoCIII gene 5'-flanking sequence were required for maximum levels of gene expression, while the proximal 110 nucleotides alone were sufficient. No expression was observed in similar studies with HeLa cells. The level of expression was modulated by a combination of positive and negative cis-acting sequences, which interact with distinct sets of proteins from liver and HeLa cell nuclear extracts. The proximal positive regulatory region shares homology with similarly located sequences of other genes strongly expressed in the liver, including α 1 -antitrypsin and other apolipoprotein genes. The negative regulatory region is striking homologous to the human β-interferon gene regulatory element. The distal positive region shares homology with some viral enhancers and has properties of a tissue-specific enhancer. The regulation of the apoCIII gene is complex but shares features with other genes, suggesting shuffling of regulatory elements as a common mechanism for cell type-specific gene expression

  9. Cis-regulatory elements in the primate brain: from functional specialization to neurodegeneration

    NARCIS (Netherlands)

    Vermunt, Marit W.

    2017-01-01

    Over the last decade, the noncoding part of the genome has been shown to harbour thousands of cis-regulatory elements, such as enhancers, that activate well-defined gene expression programs. Here, we charted active enhancers in a multiplicity of human brain regions to understand the role of

  10. Plasticity of the cis-regulatory input function of a gene.

    Directory of Open Access Journals (Sweden)

    Avraham E Mayo

    2006-04-01

    Full Text Available The transcription rate of a gene is often controlled by several regulators that bind specific sites in the gene's cis-regulatory region. The combined effect of these regulators is described by a cis-regulatory input function. What determines the form of an input function, and how variable is it with respect to mutations? To address this, we employ the well-characterized lac operon of Escherichia coli, which has an elaborate input function, intermediate between Boolean AND-gate and OR-gate logic. We mapped in detail the input function of 12 variants of the lac promoter, each with different point mutations in the regulator binding sites, by means of accurate expression measurements from living cells. We find that even a few mutations can significantly change the input function, resulting in functions that resemble Pure AND gates, OR gates, or single-input switches. Other types of gates were not found. The variant input functions can be described in a unified manner by a mathematical model. The model also lets us predict which functions cannot be reached by point mutations. The input function that we studied thus appears to be plastic, in the sense that many of the mutations do not ruin the regulation completely but rather result in new ways to integrate the inputs.

  11. Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Haberer Georg

    2009-06-01

    Full Text Available Abstract Background The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. Results In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. Conclusion Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.

  12. Evolution of New cis-Regulatory Motifs Required for Cell-Specific Gene Expression in Caenorhabditis.

    Directory of Open Access Journals (Sweden)

    Michalis Barkoulas

    2016-09-01

    Full Text Available Patterning of C. elegans vulval cell fates relies on inductive signaling. In this induction event, a single cell, the gonadal anchor cell, secretes LIN-3/EGF and induces three out of six competent precursor cells to acquire a vulval fate. We previously showed that this developmental system is robust to a four-fold variation in lin-3/EGF genetic dose. Here using single-molecule FISH, we find that the mean level of expression of lin-3 in the anchor cell is remarkably conserved. No change in lin-3 expression level could be detected among C. elegans wild isolates and only a low level of change-less than 30%-in the Caenorhabditis genus and in Oscheius tipulae. In C. elegans, lin-3 expression in the anchor cell is known to require three transcription factor binding sites, specifically two E-boxes and a nuclear-hormone-receptor (NHR binding site. Mutation of any of these three elements in C. elegans results in a dramatic decrease in lin-3 expression. Yet only a single E-box is found in the Drosophilae supergroup of Caenorhabditis species, including C. angaria, while the NHR-binding site likely only evolved at the base of the Elegans group. We find that a transgene from C. angaria bearing a single E-box is sufficient for normal expression in C. elegans. Even a short 58 bp cis-regulatory fragment from C. angaria with this single E-box is able to replace the three transcription factor binding sites at the endogenous C. elegans lin-3 locus, resulting in the wild-type expression level. Thus, regulatory evolution occurring in cis within a 58 bp lin-3 fragment, results in a strict requirement for the NHR binding site and a second E-box in C. elegans. This single-cell, single-molecule, quantitative and functional evo-devo study demonstrates that conserved expression levels can hide extensive change in cis-regulatory site requirements and highlights the evolution of new cis-regulatory elements required for cell-specific gene expression.

  13. RegRNA: an integrated web server for identifying regulatory RNA motifs and elements

    OpenAIRE

    Huang, Hsi-Yuan; Chien, Chia-Hung; Jen, Kuan-Hua; Huang, Hsien-Da

    2006-01-01

    Numerous regulatory structural motifs have been identified as playing essential roles in transcriptional and post-transcriptional regulation of gene expression. RegRNA is an integrated web server for identifying the homologs of regulatory RNA motifs and elements against an input mRNA sequence. Both sequence homologs and structural homologs of regulatory RNA motifs can be recognized. The regulatory RNA motifs supported in RegRNA are categorized into several classes: (i) motifs in mRNA 5′-untra...

  14. BET Bromodomain Inhibition Releases the Mediator Complex from Select cis-Regulatory Elements.

    Science.gov (United States)

    Bhagwat, Anand S; Roe, Jae-Seok; Mok, Beverly Y L; Hohmann, Anja F; Shi, Junwei; Vakoc, Christopher R

    2016-04-19

    The bromodomain and extraterminal (BET) protein BRD4 can physically interact with the Mediator complex, but the relevance of this association to the therapeutic effects of BET inhibitors in cancer is unclear. Here, we show that BET inhibition causes a rapid release of Mediator from a subset of cis-regulatory elements in the genome of acute myeloid leukemia (AML) cells. These sites of Mediator eviction were highly correlated with transcriptional suppression of neighboring genes, which are enriched for targets of the transcription factor MYB and for functions related to leukemogenesis. A shRNA screen of Mediator in AML cells identified the MED12, MED13, MED23, and MED24 subunits as performing a similar regulatory function to BRD4 in this context, including a shared role in sustaining a block in myeloid maturation. These findings suggest that the interaction between BRD4 and Mediator has functional importance for gene-specific transcriptional activation and for AML maintenance. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Deep sequencing reveals double mutations in cis of MPL exon 10 in myeloproliferative neoplasms.

    Science.gov (United States)

    Pietra, Daniela; Brisci, Angela; Rumi, Elisa; Boggi, Sabrina; Elena, Chiara; Pietrelli, Alessandro; Bordoni, Roberta; Ferrari, Maurizio; Passamonti, Francesco; De Bellis, Gianluca; Cremonesi, Laura; Cazzola, Mario

    2011-04-01

    Somatic mutations of MPL exon 10, mainly involving a W515 substitution, have been described in JAK2 (V617F)-negative patients with essential thrombocythemia and primary myelofibrosis. We used direct sequencing and high-resolution melt analysis to identify mutations of MPL exon 10 in 570 patients with myeloproliferative neoplasms, and allele specific PCR and deep sequencing to further characterize a subset of mutated patients. Somatic mutations were detected in 33 of 221 patients (15%) with JAK2 (V617F)-negative essential thrombocythemia or primary myelofibrosis. Only one patient with essential thrombocythemia carried both JAK2 (V617F) and MPL (W515L). High-resolution melt analysis identified abnormal patterns in all the MPL mutated cases, while direct sequencing did not detect the mutant MPL in one fifth of them. In 3 cases carrying double MPL mutations, deep sequencing analysis showed identical load and location in cis of the paired lesions, indicating their simultaneous occurrence on the same chromosome.

  16. Bifunctional cis-Abienol Synthase from Abies balsamea Discovered by Transcriptome Sequencing and Its Implications for Diterpenoid Fragrance Production*

    Science.gov (United States)

    Zerbe, Philipp; Chiang, Angela; Yuen, Macaire; Hamberger, Björn; Hamberger, Britta; Draper, Jason A.; Britton, Robert; Bohlmann, Jörg

    2012-01-01

    The labdanoid diterpene alcohol cis-abienol is a major component of the aromatic oleoresin of balsam fir (Abies balsamea) and serves as a valuable bioproduct material for the fragrance industry. Using high-throughput 454 transcriptome sequencing and metabolite profiling of balsam fir bark tissue, we identified candidate diterpene synthase sequences for full-length cDNA cloning and functional characterization. We discovered a bifunctional class I/II cis-abienol synthase (AbCAS), along with the paralogous levopimaradiene/abietadiene synthase and isopimaradiene synthase, all of which are members of the gymnosperm-specific TPS-d subfamily. The AbCAS-catalyzed formation of cis-abienol proceeds via cyclization and hydroxylation at carbon C-8 of a postulated carbocation intermediate in the class II active site, followed by cleavage of the diphosphate group and termination of the reaction sequence without further cyclization in the class I active site. This reaction mechanism is distinct from that of synthases of the isopimaradiene- or levopimaradiene/abietadiene synthase type, which employ deprotonation reactions in the class II active site and secondary cyclizations in the class I active site, leading to tricyclic diterpenes. Comparative homology modeling suggested the active site residues Asp-348, Leu-617, Phe-696, and Gly-723 as potentially important for the specificity of AbCAS. As a class I/II bifunctional enzyme, AbCAS is a promising target for metabolic engineering of cis-abienol production. PMID:22337889

  17. Changes in cis-regulatory elements of a key floral regulator are associated with divergence of inflorescence architectures.

    Science.gov (United States)

    Kusters, Elske; Della Pina, Serena; Castel, Rob; Souer, Erik; Koes, Ronald

    2015-08-15

    Higher plant species diverged extensively with regard to the moment (flowering time) and position (inflorescence architecture) at which flowers are formed. This seems largely caused by variation in the expression patterns of conserved genes that specify floral meristem identity (FMI), rather than changes in the encoded proteins. Here, we report a functional comparison of the promoters of homologous FMI genes from Arabidopsis, petunia, tomato and Antirrhinum. Analysis of promoter-reporter constructs in petunia and Arabidopsis, as well as complementation experiments, showed that the divergent expression of leafy (LFY) and the petunia homolog aberrant leaf and flower (ALF) results from alterations in the upstream regulatory network rather than cis-regulatory changes. The divergent expression of unusual floral organs (UFO) from Arabidopsis, and the petunia homolog double top (DOT), however, is caused by the loss or gain of cis-regulatory promoter elements, which respond to trans-acting factors that are expressed in similar patterns in both species. Introduction of pUFO:UFO causes no obvious defects in Arabidopsis, but in petunia it causes the precocious and ectopic formation of flowers. This provides an example of how a change in a cis-regulatory region can account for a change in the plant body plan. © 2015. Published by The Company of Biologists Ltd.

  18. Multiple Functional Variants in cis Modulate PDYN Expression.

    Science.gov (United States)

    Babbitt, Courtney C; Silverman, Jesse S; Haygood, Ralph; Reininga, Jennifer M; Rockman, Matthew V; Wray, Gregory A

    2010-02-01

    Understanding genetic variation and its functional consequences within cis-regulatory regions remains an important challenge in human genetics and evolution. Here, we present a fine-scale functional analysis of segregating variation within the cis-regulatory region of prodynorphin, a gene that encodes an endogenous opioid precursor with roles in cognition and disease. In order to characterize the functional consequences of segregating variation in cis in a region under balancing selection in different human populations, we examined associations between specific polymorphisms and gene expression in vivo and in vitro. We identified five polymorphisms within the 5' flanking region that affect transcript abundance: a 68-bp repeat recognized in prior studies, as well as two microsatellites and two single nucleotide polymorphisms not previously implicated as functional variants. The impact of these variants on transcription differs by brain region, sex, and cell type, implying interactions between cis genotype and the differentiated state of cells. The effects of individual variants on expression level are not additive in some combinations, implying epistatic interactions between nearby variants. These data reveal an unexpectedly complex relationship between segregating genetic variation and its expression-trait consequences and highlights the importance of close functional scrutiny of natural genetic variation within even relatively well-studied cis-regulatory regions.

  19. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    Science.gov (United States)

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome

  20. Detection of discriminative sequence patterns in the neighborhood of proline cis peptide bonds and their functional annotation

    Directory of Open Access Journals (Sweden)

    Papaloukas Costas

    2009-04-01

    Full Text Available Abstract Background Polypeptides are composed of amino acids covalently bonded via a peptide bond. The majority of peptide bonds in proteins is found to occur in the trans conformation. In spite of their infrequent occurrence, cis peptide bonds play a key role in the protein structure and function, as well as in many significant biological processes. Results We perform a systematic analysis of regions in protein sequences that contain a proline cis peptide bond in order to discover non-random associations between the primary sequence and the nature of proline cis/trans isomerization. For this purpose an efficient pattern discovery algorithm is employed which discovers regular expression-type patterns that are overrepresented (i.e. appear frequently repeated in a set of sequences. Four types of pattern discovery are performed: i exact pattern discovery, ii pattern discovery using a chemical equivalency set, iii pattern discovery using a structural equivalency set and iv pattern discovery using certain amino acids' physicochemical properties. The extracted patterns are carefully validated using a specially implemented scoring function and a significance measure (i.e. log-probability estimate indicative of their specificity. The score threshold for the first three types of pattern discovery is 0.90 while for the last type of pattern discovery 0.80. Regarding the significance measure, all patterns yielded values in the range [-9, -31] which ensure that the derived patterns are highly unlikely to have emerged by chance. Among the highest scoring patterns, most of them are consistent with previous investigations concerning the neighborhood of cis proline peptide bonds, and many new ones are identified. Finally, the extracted patterns are systematically compared against the PROSITE database, in order to gain insight into the functional implications of cis prolyl bonds. Conclusion Cis patterns with matches in the PROSITE database fell mostly into two

  1. Changes in Cis-regulatory Elements during Morphological Evolution

    Directory of Open Access Journals (Sweden)

    Yu-Lee Paul

    2012-10-01

    Full Text Available How have animals evolved new body designs (morphological evolution? This requires explanations both for simple morphological changes, such as differences in pigmentation and hair patterns between different Drosophila populations and species, and also for more complex changes, such as differences in the forelimbs of mice and bats, and the necks of amphibians and reptiles. The genetic changes and pathways involved in these evolutionary steps require identification. Many, though not all, of these events occur by changes in cis-regulatory (enhancer elements within developmental genes. Enhancers are modular, each affecting expression in only one or a few tissues. Therefore it is possible to add, remove or alter an enhancer without producing changes in multiple tissues, and thereby avoid widespread (pleiotropic deleterious effects. Ideally, for a given step in morphological evolution it is necessary to identify (i the change in phenotype, (ii the changes in gene expression, (iii the DNA region, enhancer or otherwise, affected, (iv the mutation involved, (v the nature of the transcription or other factors that bind to this site. In practice these data are incomplete for most of the published studies upon morphological evolution. Here, the investigations are categorized according to how far these analyses have proceeded.

  2. Thermodynamic state ensemble models of cis-regulation.

    Directory of Open Access Journals (Sweden)

    Marc S Sherman

    Full Text Available A major goal in computational biology is to develop models that accurately predict a gene's expression from its surrounding regulatory DNA. Here we present one class of such models, thermodynamic state ensemble models. We describe the biochemical derivation of the thermodynamic framework in simple terms, and lay out the mathematical components that comprise each model. These components include (1 the possible states of a promoter, where a state is defined as a particular arrangement of transcription factors bound to a DNA promoter, (2 the binding constants that describe the affinity of the protein-protein and protein-DNA interactions that occur in each state, and (3 whether each state is capable of transcribing. Using these components, we demonstrate how to compute a cis-regulatory function that encodes the probability of a promoter being active. Our intention is to provide enough detail so that readers with little background in thermodynamics can compose their own cis-regulatory functions. To facilitate this goal, we also describe a matrix form of the model that can be easily coded in any programming language. This formalism has great flexibility, which we show by illustrating how phenomena such as competition between transcription factors and cooperativity are readily incorporated into these models. Using this framework, we also demonstrate that Michaelis-like functions, another class of cis-regulatory models, are a subset of the thermodynamic framework with specific assumptions. By recasting Michaelis-like functions as thermodynamic functions, we emphasize the relationship between these models and delineate the specific circumstances representable by each approach. Application of thermodynamic state ensemble models is likely to be an important tool in unraveling the physical basis of combinatorial cis-regulation and in generating formalisms that accurately predict gene expression from DNA sequence.

  3. Coevolution within a transcriptional network by compensatory trans and cis mutations

    KAUST Repository

    Kuo, D.

    2010-10-26

    Transcriptional networks have been shown to evolve very rapidly, prompting questions as to how such changes arise and are tolerated. Recent comparisons of transcriptional networks across species have implicated variations in the cis-acting DNA sequences near genes as the main cause of divergence. What is less clear is how these changes interact with trans-acting changes occurring elsewhere in the genetic circuit. Here, we report the discovery of a system of compensatory trans and cis mutations in the yeast AP-1 transcriptional network that allows for conserved transcriptional regulation despite continued genetic change. We pinpoint a single species, the fungal pathogen Candida glabrata, in which a trans mutation has occurred very recently in a single AP-1 family member, distinguishing it from its Saccharomyces ortholog. Comparison of chromatin immunoprecipitation profiles between Candida and Saccharomyces shows that, despite their different DNA-binding domains, the AP-1 orthologs regulate a conserved block of genes. This conservation is enabled by concomitant changes in the cis-regulatory motifs upstream of each gene. Thus, both trans and cis mutations have perturbed the yeast AP-1 regulatory system in such a way as to compensate for one another. This demonstrates an example of “coevolution” between a DNA-binding transcription factor and its cis-regulatory site, reminiscent of the coevolution of protein binding partners.

  4. The cis-regulatory element CCACGTGG is involved in ABA and water-stress responses of the maize gene rab28.

    Science.gov (United States)

    Pla, M; Vilardell, J; Guiltinan, M J; Marcotte, W R; Niogret, M F; Quatrano, R S; Pagès, M

    1993-01-01

    The maize gene rab28 has been identified as ABA-inducible in embryos and vegetative tissues. It is also induced by water stress in young leaves. The proximal promoter region contains the conserved cis-acting element CCACGTGG (ABRE) reported for ABA induction in other plant genes. Transient expression assays in rice protoplasts indicate that a 134 bp fragment (-194 to -60 containing the ABRE) fused to a truncated cauliflower mosaic virus promoter (35S) is sufficient to confer ABA-responsiveness upon the GUS reporter gene. Gel retardation experiments indicate that nuclear proteins from tissues in which the rab28 gene is expressed can interact specifically with this 134 bp DNA fragment. Nuclear protein extracts from embryo and water-stressed leaves generate specific complexes of different electrophoretic mobility which are stable in the presence of detergent and high salt. However, by DMS footprinting the same guanine-specific contacts with the ABRE in both the embryo and leaf binding activities were detected. These results indicate that the rab28 promoter sequence CCACGTGG is a functional ABA-responsive element, and suggest that distinct regulatory factors with apparent similar affinity for the ABRE sequence may be involved in the hormone action during embryo development and in vegetative tissues subjected to osmotic stress.

  5. DNA watermarks in non-coding regulatory sequences

    Directory of Open Access Journals (Sweden)

    Pyka Martin

    2009-07-01

    Full Text Available Abstract Background DNA watermarks can be applied to identify the unauthorized use of genetically modified organisms. It has been shown that coding regions can be used to encrypt information into living organisms by using the DNA-Crypt algorithm. Yet, if the sequence of interest presents a non-coding DNA sequence, either the function of a resulting functional RNA molecule or a regulatory sequence, such as a promoter, could be affected. For our studies we used the small cytoplasmic RNA 1 in yeast and the lac promoter region of Escherichia coli. Findings The lac promoter was deactivated by the integrated watermark. In addition, the RNA molecules displayed altered configurations after introducing a watermark, but surprisingly were functionally intact, which has been verified by analyzing the growth characteristics of both wild type and watermarked scR1 transformed yeast cells. In a third approach we introduced a second overlapping watermark into the lac promoter, which did not affect the promoter activity. Conclusion Even though the watermarked RNA and one of the watermarked promoters did not show any significant differences compared to the wild type RNA and wild type promoter region, respectively, it cannot be generalized that other RNA molecules or regulatory sequences behave accordingly. Therefore, we do not recommend integrating watermark sequences into regulatory regions.

  6. Spatially conserved regulatory elements identified within human and mouse Cd247 gene using high-throughput sequencing data from the ENCODE project

    DEFF Research Database (Denmark)

    Pundhir, Sachin; Hannibal, Tine Dahlbæk; Bang-Berthelsen, Claus Heiner

    2014-01-01

    . In this study, we have utilized the wealth of high-throughput sequencing data produced during the Encyclopedia of DNA Elements (ENCODE) project to identify spatially conserved regulatory elements within the Cd247 gene from human and mouse. We show the presence of two transcription factor binding sites...

  7. Cis-regulatory control of the nuclear receptor Coup-TF gene in the sea urchin Paracentrotus lividus embryo.

    Directory of Open Access Journals (Sweden)

    Lamprini G Kalampoki

    Full Text Available Coup-TF, an orphan member of the nuclear receptor super family, has a fundamental role in the development of metazoan embryos. The study of the gene's regulatory circuit in the sea urchin embryo will facilitate the placement of this transcription factor in the well-studied embryonic Gene Regulatory Network (GRN. The Paracentrotus lividus Coup-TF gene (PlCoup-TF is expressed throughout embryonic development preferentially in the oral ectoderm of the gastrula and the ciliary band of the pluteus stage. Two overlapping λ genomic clones, containing three exons and upstream sequences of PlCoup-TF, were isolated from a genomic library. The transcription initiation site was determined and 5' deletions and individual segments of a 1930 bp upstream region were placed ahead of a GFP reporter cassette and injected into fertilized P.lividus eggs. Module a (-532 to -232, was necessary and sufficient to confer ciliary band expression to the reporter. Comparison of P.lividus and Strongylocentrotus purpuratus upstream Coup-TF sequences, revealed considerable conservation, but none within module a. 5' and internal deletions into module a, defined a smaller region that confers ciliary band specific expression. Putative regulatory cis-acting elements (RE1, RE2 and RE3 within module a, were specifically bound by proteins in sea urchin embryonic nuclear extracts. Site-specific mutagenesis of these elements resulted in loss of reporter activity (RE1 or ectopic expression (RE2, RE3. It is proposed that sea urchin transcription factors, which bind these three regulatory sites, are necessary for spatial and quantitative regulation of the PlCoup-TF gene at pluteus stage sea urchin embryos. These findings lead to the future identification of these factors and to the hierarchical positioning of PlCoup-TF within the embryonic GRN.

  8. XcisClique: analysis of regulatory bicliques

    Directory of Open Access Journals (Sweden)

    Grene Ruth

    2006-04-01

    Full Text Available Abstract Background Modeling of cis-elements or regulatory motifs in promoter (upstream regions of genes is a challenging computational problem. In this work, set of regulatory motifs simultaneously present in the promoters of a set of genes is modeled as a biclique in a suitably defined bipartite graph. A biologically meaningful co-occurrence of multiple cis-elements in a gene promoter is assessed by the combined analysis of genomic and gene expression data. Greater statistical significance is associated with a set of genes that shares a common set of regulatory motifs, while simultaneously exhibiting highly correlated gene expression under given experimental conditions. Methods XcisClique, the system developed in this work, is a comprehensive infrastructure that associates annotated genome and gene expression data, models known cis-elements as regular expressions, identifies maximal bicliques in a bipartite gene-motif graph; and ranks bicliques based on their computed statistical significance. Significance is a function of the probability of occurrence of those motifs in a biclique (a hypergeometric distribution, and on the new sum of absolute values statistic (SAV that uses Spearman correlations of gene expression vectors. SAV is a statistic well-suited for this purpose as described in the discussion. Results XcisClique identifies new motif and gene combinations that might indicate as yet unidentified involvement of sets of genes in biological functions and processes. It currently supports Arabidopsis thaliana and can be adapted to other organisms, assuming the existence of annotated genomic sequences, suitable gene expression data, and identified regulatory motifs. A subset of Xcis Clique functionalities, including the motif visualization component MotifSee, source code, and supplementary material are available at https://bioinformatics.cs.vt.edu/xcisclique/.

  9. Location analysis for the estrogen receptor-? reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    OpenAIRE

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-? (ER?)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ER?-bound loci and quantify the incidence of ERE sequences under two stringencies of detection:

  10. Network perturbation by recurrent regulatory variants in cancer.

    Directory of Open Access Journals (Sweden)

    Kiwon Jang

    2017-03-01

    Full Text Available Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes.

  11. Generation of Chimeric RNAs by cis-splicing of adjacent genes (cis-SAGe) in mammals.

    Science.gov (United States)

    Zhuo, Jian-Shu; Jing, Xiao-Yan; Du, Xin; Yang, Xiu-Qin

    2018-02-20

    Chimeric RNA molecules, possessing exons from two or more independent genes, are traditionally believed to be produced by chromosome rearrangement. However, recent studies revealed that cis-splicing of adjacent genes (cis- SAGe) is one of the major mechanisms underlying the formation of chimeric RNAs. cis-SAGe refers to intergenic splicing of directly adjacent genes with the same transcriptional orientation, resulting in read-through transcripts, termed chimeric RNAs, which contain sequences from two or more parental genes. cis-SAGe was first identified in tumor cells, since then its potential in carcinogenesis has attracted extensive attention. More and more scientists are focusing on it. With the development of research, cis-SAGe was found to be ubiquitous in various normal tissues, and might make a crucial contribution to the formation of novel genes in the evolution of genomes. In this review, we summarize the splicing pattern, expression characteristics, possible mechanisms, and significance of cis-SAGe in mammals. This review will be helpful for general understanding of the current status and development tendency of cis-SAGe.

  12. Two negative cis-regulatory regions involved in fruit-specific promoter activity from watermelon (Citrullus vulgaris S.).

    Science.gov (United States)

    Yin, Tao; Wu, Hanying; Zhang, Shanglong; Lu, Hongyu; Zhang, Lingxiao; Xu, Yong; Chen, Daming; Liu, Jingmei

    2009-01-01

    A 1.8 kb 5'-flanking region of the large subunit of ADP-glucose pyrophosphorylase, isolated from watermelon (Citrullus vulgaris S.), has fruit-specific promoter activity in transgenic tomato plants. Two negative regulatory regions, from -986 to -959 and from -472 to -424, were identified in this promoter region by fine deletion analyses. Removal of both regions led to constitutive expression in epidermal cells. Gain-of-function experiments showed that these two regions were sufficient to inhibit RFP (red fluorescent protein) expression in transformed epidermal cells when fused to the cauliflower mosaic virus (CaMV) 35S minimal promoter. Gel mobility shift experiments demonstrated the presence of leaf nuclear factors that interact with these two elements. A TCCAAAA motif was identified in these two regions, as well as one in the reverse orientation, which was confirmed to be a novel specific cis-element. A quantitative beta-glucuronidase (GUS) activity assay of stable transgenic tomato plants showed that the activities of chimeric promoters harbouring only one of the two cis-elements, or both, were approximately 10-fold higher in fruits than in leaves. These data confirm that the TCCAAAA motif functions as a fruit-specific element by inhibiting gene expression in leaves.

  13. Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

    Science.gov (United States)

    Raynard, Steven J; Baker, Mark D

    2004-01-01

    In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARS enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.

  14. Sequence-based model of gap gene regulatory network.

    Science.gov (United States)

    Kozlov, Konstantin; Gursky, Vitaly; Kulakovskiy, Ivan; Samsonova, Maria

    2014-01-01

    The detailed analysis of transcriptional regulation is crucially important for understanding biological processes. The gap gene network in Drosophila attracts large interest among researches studying mechanisms of transcriptional regulation. It implements the most upstream regulatory layer of the segmentation gene network. The knowledge of molecular mechanisms involved in gap gene regulation is far less complete than that of genetics of the system. Mathematical modeling goes beyond insights gained by genetics and molecular approaches. It allows us to reconstruct wild-type gene expression patterns in silico, infer underlying regulatory mechanism and prove its sufficiency. We developed a new model that provides a dynamical description of gap gene regulatory systems, using detailed DNA-based information, as well as spatial transcription factor concentration data at varying time points. We showed that this model correctly reproduces gap gene expression patterns in wild type embryos and is able to predict gap expression patterns in Kr mutants and four reporter constructs. We used four-fold cross validation test and fitting to random dataset to validate the model and proof its sufficiency in data description. The identifiability analysis showed that most model parameters are well identifiable. We reconstructed the gap gene network topology and studied the impact of individual transcription factor binding sites on the model output. We measured this impact by calculating the site regulatory weight as a normalized difference between the residual sum of squares error for the set of all annotated sites and for the set with the site of interest excluded. The reconstructed topology of the gap gene network is in agreement with previous modeling results and data from literature. We showed that 1) the regulatory weights of transcription factor binding sites show very weak correlation with their PWM score; 2) sites with low regulatory weight are important for the model output; 3

  15. Rare and common regulatory variation in population-scale sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Stephen B Montgomery

    2011-07-01

    Full Text Available Population-scale genome sequencing allows the characterization of functional effects of a broad spectrum of genetic variants underlying human phenotypic variation. Here, we investigate the influence of rare and common genetic variants on gene expression patterns, using variants identified from sequencing data from the 1000 genomes project in an African and European population sample and gene expression data from lymphoblastoid cell lines. We detect comparable numbers of expression quantitative trait loci (eQTLs when compared to genotypes obtained from HapMap 3, but as many as 80% of the top expression quantitative trait variants (eQTVs discovered from 1000 genomes data are novel. The properties of the newly discovered variants suggest that mapping common causal regulatory variants is challenging even with full resequencing data; however, we observe significant enrichment of regulatory effects in splice-site and nonsense variants. Using RNA sequencing data, we show that 46.2% of nonsynonymous variants are differentially expressed in at least one individual in our sample, creating widespread potential for interactions between functional protein-coding and regulatory variants. We also use allele-specific expression to identify putative rare causal regulatory variants. Furthermore, we demonstrate that outlier expression values can be due to rare variant effects, and we approximate the number of such effects harboured in an individual by effect size. Our results demonstrate that integration of genomic and RNA sequencing analyses allows for the joint assessment of genome sequence and genome function.

  16. Phylogeny based discovery of regulatory elements

    Directory of Open Access Journals (Sweden)

    Cohen Barak A

    2006-05-01

    Full Text Available Abstract Background Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. Results We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. Conclusion The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs.

  17. MicroRNA signature of cis-platin resistant vs. cis-platin sensitive ovarian cancer cell lines

    Directory of Open Access Journals (Sweden)

    Kumar Smriti

    2011-09-01

    Full Text Available Abstract Background Ovarian cancer is the leading cause of death from gynecologic cancer in women worldwide. According to the National Cancer Institute, ovarian cancer has the highest mortality rate among all the reproductive cancers in women. Advanced stage diagnosis and chemo/radio-resistance is a major obstacle in treating advanced ovarian cancer. The most commonly employed chemotherapeutic drug for ovarian cancer treatment is cis-platin. As with most chemotherapeutic drugs, many patients eventually become resistant to cis-platin and therefore, diminishing its effect. The efficacy of current treatments may be improved by increasing the sensitivity of cancer cells to chemo/radiation therapies. Methods The present study is focused on identifying the differential expression of regulatory microRNAs (miRNAs between cis-platin sensitive (A2780, and cis-platin resistant (A2780/CP70 cell lines. Cell proliferation assays were conducted to test the sensitivity of the two cell lines to cis-platin. Differential expression patterns of miRNA between cis-platin sensitive and cis-platin resistant cell lines were analyzed using novel LNA technology. Results Our results revealed changes in expression of 11 miRNAs out of 1,500 miRNAs analyzed. Out of the 11 miRNAs identified, 5 were up-regulated in the A2780/CP70 cell line and 6 were down regulated as compared to cis-platin sensitive A2780 cells. Our microRNA data was further validated by quantitative real-time PCR for these selected miRNAs. Ingenuity Pathway Analysis (IPA and Kyoto Encyclopedia of Genes and Genomes (KEGG analysis was performed for the selected miRNAs and their putative targets to identify the potential pathways and networks involved in cis-platin resistance. Conclusions Our data clearly showed the differential expression of 11 miRNAs in cis-platin resistant cells, which could potentially target many important pathways including MAPK, TGF-β signaling, actin cytoskeleton, ubiquitin mediated

  18. Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium.

    Science.gov (United States)

    Yan, Hong-Bin; Lou, Zhong-Zi; Li, Li; Brindley, Paul J; Zheng, Yadong; Luo, Xuenong; Hou, Junling; Guo, Aijiang; Jia, Wan-Zhong; Cai, Xuepeng

    2014-06-04

    Cysticercosis remains a major neglected tropical disease of humanity in many regions, especially in sub-Saharan Africa, Central America and elsewhere. Owing to the emerging drug resistance and the inability of current drugs to prevent re-infection, identification of novel vaccines and chemotherapeutic agents against Taenia solium and related helminth pathogens is a public health priority. The T. solium genome and the predicted proteome were reported recently, providing a wealth of information from which new interventional targets might be identified. In order to characterize and classify the entire repertoire of protease-encoding genes of T. solium, which act fundamental biological roles in all life processes, we analyzed the predicted proteins of this cestode through a combination of bioinformatics tools. Functional annotation was performed to yield insights into the signaling processes relevant to the complex developmental cycle of this tapeworm and to highlight a suite of the proteases as potential intervention targets. Within the genome of this helminth parasite, we identified 200 open reading frames encoding proteases from five clans, which correspond to 1.68% of the 11,902 protein-encoding genes predicted to be present in its genome. These proteases include calpains, cytosolic, mitochondrial signal peptidases, ubiquitylation related proteins, and others. Many not only show significant similarity to proteases in the Conserved Domain Database but have conserved active sites and catalytic domains. KEGG Automatic Annotation Server (KAAS) analysis indicated that ~60% of these proteases share strong sequence identities with proteins of the KEGG database, which are involved in human disease, metabolic pathways, genetic information processes, cellular processes, environmental information processes and organismal systems. Also, we identified signal peptides and transmembrane helices through comparative analysis with classes of important regulatory proteases

  19. Eucalyptus ESTs involved in the production of 9-cis epoxycarotenoid dioxygenase, a regulatory enzyme of abscisic acid production

    Directory of Open Access Journals (Sweden)

    Iraê A. Guerrini

    2005-01-01

    Full Text Available Abscisic acid (ABA regulates stress responses in plants, and genomic tools can help us to understand the mechanisms involved in that process. FAPESP, a Brazilian research foundation, in association with four private forestry companies, has established the FORESTs database (https://forests.esalq.usp.br. A search was carried out in the Eucalyptus expressed sequence tag database to find ESTs involved with 9-cis epoxycarotenoid dioxygenase (NCED, the regulatory enzyme for ABA biosynthesis, using the basic local BLAST alignment tool. We found four clusters (EGEZLV2206B11.g, EGJMWD2252H08.g, EGBFRT3107F10.g, and EGEQFB1200H10.g, which represent similar sequences of the gene that produces NCED. Data showed that the EGBFRT3107F10.g cluster was similar to the maize (Zea mays NCED enzyme, while EGEZLV2206B11.g and EGJMWD2252H08.g clusters were similar to the avocado (Persea americana NCED enzyme. All Eucalyptus clusters were expressed in several tissues, especially in flower buds, where ABA has a special participation during the floral development process.

  20. Minimal and contributing sequence determinants of the cis-acting locus of transfer (clt) of streptomycete plasmid pIJ101 occur within an intrinsically curved plasmid region.

    Science.gov (United States)

    Ducote, M J; Prakash, S; Pettis, G S

    2000-12-01

    Efficient interbacterial transfer of streptomycete plasmid pIJ101 requires the pIJ101 tra gene, as well as a cis-acting plasmid function known as clt. Here we show that the minimal pIJ101 clt locus consists of a sequence no greater than 54 bp in size that includes essential inverted-repeat and direct-repeat sequences and is located in close proximity to the 3' end of the korB regulatory gene. Evidence that sequences extending beyond the minimal locus and into the korB open reading frame influence clt transfer function and demonstration that clt-korB sequences are intrinsically curved raise the possibility that higher-order structuring of DNA and protein within this plasmid region may be an inherent feature of efficient pIJ101 transfer.

  1. Identification of Cis-Acting Promoter Elements in Cold- and Dehydration-Induced Transcriptional Pathways in Arabidopsis, Rice, and Soybean

    Science.gov (United States)

    Maruyama, Kyonoshin; Todaka, Daisuke; Mizoi, Junya; Yoshida, Takuya; Kidokoro, Satoshi; Matsukura, Satoko; Takasaki, Hironori; Sakurai, Tetsuya; Yamamoto, Yoshiharu Y.; Yoshiwara, Kyouko; Kojima, Mikiko; Sakakibara, Hitoshi; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko

    2012-01-01

    The genomes of three plants, Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and soybean (Glycine max), have been sequenced, and their many genes and promoters have been predicted. In Arabidopsis, cis-acting promoter elements involved in cold- and dehydration-responsive gene expression have been extensively analysed; however, the characteristics of such cis-acting promoter sequences in cold- and dehydration-inducible genes of rice and soybean remain to be clarified. In this study, we performed microarray analyses using the three species, and compared characteristics of identified cold- and dehydration-inducible genes. Transcription profiles of the cold- and dehydration-responsive genes were similar among these three species, showing representative upregulated (dehydrin/LEA) and downregulated (photosynthesis-related) genes. All (46 = 4096) hexamer sequences in the promoters of the three species were investigated, revealing the frequency of conserved sequences in cold- and dehydration-inducible promoters. A core sequence of the abscisic acid-responsive element (ABRE) was the most conserved in dehydration-inducible promoters of all three species, suggesting that transcriptional regulation for dehydration-inducible genes is similar among these three species, with the ABRE-dependent transcriptional pathway. In contrast, for cold-inducible promoters, the conserved hexamer sequences were diversified among these three species, suggesting the existence of diverse transcriptional regulatory pathways for cold-inducible genes among the species. PMID:22184637

  2. Multiple cis-regulatory elements are involved in the complex regulation of the sieve element-specific MtSEO-F1 promoter from Medicago truncatula.

    Science.gov (United States)

    Bucsenez, M; Rüping, B; Behrens, S; Twyman, R M; Noll, G A; Prüfer, D

    2012-09-01

    The sieve element occlusion (SEO) gene family includes several members that are expressed specifically in immature sieve elements (SEs) in the developing phloem of dicotyledonous plants. To determine how this restricted expression profile is achieved, we analysed the SE-specific Medicago truncatula SEO-F1 promoter (PMtSEO-F1) by constructing deletion, substitution and hybrid constructs and testing them in transgenic tobacco plants using green fluorescent protein as a reporter. This revealed four promoter regions, each containing cis-regulatory elements that activate transcription in SEs. One of these segments also contained sufficient information to suppress PMtSEO-F1 transcription in the phloem companion cells (CCs). Subsequent in silico analysis revealed several candidate cis-regulatory elements that PMtSEO-F1 shares with other SEO promoters. These putative sieve element boxes (PSE boxes) are promising candidates for cis-regulatory elements controlling the SE-specific expression of PMtSEO-F1. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.

  3. In silico detection of sequence variations modifying transcriptional regulation.

    Directory of Open Access Journals (Sweden)

    Malin C Andersen

    2008-01-01

    Full Text Available Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers. The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation.

  4. In Silico Detection of Sequence Variations Modifying Transcriptional Regulation

    Science.gov (United States)

    Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob

    2008-01-01

    Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319

  5. Structural and functional analysis of mouse Msx1 gene promoter: sequence conservation with human MSX1 promoter points at potential regulatory elements.

    Science.gov (United States)

    Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E

    1998-06-01

    Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.

  6. Uniform, optimal signal processing of mapped deep-sequencing data.

    Science.gov (United States)

    Kumar, Vibhor; Muratani, Masafumi; Rayan, Nirmala Arul; Kraus, Petra; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam

    2013-07-01

    Despite their apparent diversity, many problems in the analysis of high-throughput sequencing data are merely special cases of two general problems, signal detection and signal estimation. Here we adapt formally optimal solutions from signal processing theory to analyze signals of DNA sequence reads mapped to a genome. We describe DFilter, a detection algorithm that identifies regulatory features in ChIP-seq, DNase-seq and FAIRE-seq data more accurately than assay-specific algorithms. We also describe EFilter, an estimation algorithm that accurately predicts mRNA levels from as few as 1-2 histone profiles (R ∼0.9). Notably, the presence of regulatory motifs in promoters correlates more with histone modifications than with mRNA levels, suggesting that histone profiles are more predictive of cis-regulatory mechanisms. We show by applying DFilter and EFilter to embryonic forebrain ChIP-seq data that regulatory protein identification and functional annotation are feasible despite tissue heterogeneity. The mathematical formalism underlying our tools facilitates integrative analysis of data from virtually any sequencing-based functional profile.

  7. Single nucleotide polymorphism in transcriptional regulatory regions and expression of environmentally responsive genes

    International Nuclear Information System (INIS)

    Wang, Xuting; Tomso, Daniel J.; Liu Xuemei; Bell, Douglas A.

    2005-01-01

    Single nucleotide polymorphisms (SNPs) in the human genome are DNA sequence variations that can alter an individual's response to environmental exposure. SNPs in gene coding regions can lead to changes in the biological properties of the encoded protein. In contrast, SNPs in non-coding gene regulatory regions may affect gene expression levels in an allele-specific manner, and these functional polymorphisms represent an important but relatively unexplored class of genetic variation. The main challenge in analyzing these SNPs is a lack of robust computational and experimental methods. Here, we first outline mechanisms by which genetic variation can impact gene regulation, and review recent findings in this area; then, we describe a methodology for bioinformatic discovery and functional analysis of regulatory SNPs in cis-regulatory regions using the assembled human genome sequence and databases on sequence polymorphism and gene expression. Our method integrates SNP and gene databases and uses a set of computer programs that allow us to: (1) select SNPs, from among the >9 million human SNPs in the NCBI dbSNP database, that are similar to cis-regulatory element (RE) consensus sequences; (2) map the selected dbSNP entries to the human genome assembly in order to identify polymorphic REs near gene start sites; (3) prioritize the candidate polymorphic RE containing genes by searching the existing genotype and gene expression data sets. The applicability of this system has been demonstrated through studies on p53 responsive elements and is being extended to additional pathways and environmentally responsive genes

  8. Identifying noncoding risk variants using disease-relevant gene regulatory networks.

    Science.gov (United States)

    Gao, Long; Uzun, Yasin; Gao, Peng; He, Bing; Ma, Xiaoke; Wang, Jiahui; Han, Shizhong; Tan, Kai

    2018-02-16

    Identifying noncoding risk variants remains a challenging task. Because noncoding variants exert their effects in the context of a gene regulatory network (GRN), we hypothesize that explicit use of disease-relevant GRNs can significantly improve the inference accuracy of noncoding risk variants. We describe Annotation of Regulatory Variants using Integrated Networks (ARVIN), a general computational framework for predicting causal noncoding variants. It employs a set of novel regulatory network-based features, combined with sequence-based features to infer noncoding risk variants. Using known causal variants in gene promoters and enhancers in a number of diseases, we show ARVIN outperforms state-of-the-art methods that use sequence-based features alone. Additional experimental validation using reporter assay further demonstrates the accuracy of ARVIN. Application of ARVIN to seven autoimmune diseases provides a holistic view of the gene subnetwork perturbed by the combinatorial action of the entire set of risk noncoding mutations.

  9. Differential trypanosome surface coat regulation by a CCCH protein that co-associates with procyclin mRNA cis-elements.

    Directory of Open Access Journals (Sweden)

    Pegine Walrad

    2009-02-01

    Full Text Available The genome of Trypanosoma brucei is unusual in being regulated almost entirely at the post-transcriptional level. In terms of regulation, the best-studied genes are procyclins, which encode a family of major surface GPI-anchored glycoproteins (EP1, EP2, EP3, GPEET that show differential expression in the parasite's tsetse-fly vector. Although procyclin mRNA cis-regulatory sequences have provided the paradigm for post-transcriptional control in kinetoplastid parasites, trans-acting regulators of procyclin mRNAs are unidentified, despite intensive effort over 15 years. Here we identify the developmental regulator, TbZFP3, a CCCH-class predicted RNA binding protein, as an isoform-specific regulator of Procyclin surface coat expression in trypanosomes. We demonstrate (i that endogenous TbZFP3 shows sequence-specific co-precipitation of EP1 and GPEET, but not EP2 and EP3, procyclin mRNA isoforms, (ii that ectopic overexpression of TbZFP3 does not perturb the mRNA abundance of procyclin transcripts, but rather that (iii their protein expression is regulated in an isoform-specific manner, as evidenced by mass spectrometric analysis of the Procyclin expression signature in the transgenic cell lines. The TbZFP3 mRNA-protein complex (TbZFP3mRNP is identified as a trans-regulator of differential surface protein expression in trypanosomes. Moreover, its sequence-specific interactions with procyclin mRNAs are compatible with long-established predictions for Procyclin regulation. Combined with the known association of TbZFP3 with the translational apparatus, this study provides a long-sought missing link between surface protein cis-regulatory signals and the gene expression machinery in trypanosomes.

  10. Characterization of upstream sequences of the LIM2 gene that bind developmentally regulated and lens-specific proteins

    Institute of Scientific and Technical Information of China (English)

    HSU Heng; Robert L. CHURCH

    2004-01-01

    During lens development, lens epithelial cells differentiate into fiber cells. To date, four major lens fiber cell intrinsic membrane proteins (MIP) ranging in size from 70 kD to 19 kD have been characterized. The second most abundant lens fiber cell intrinsic membrane protein is MP19. This protein probably is involved with lens cell communication and relates with cataractogenesis. The aim of this research is to characterize upstream sequences of the MP19 (also called LIM2) gene that bind developmentally regulated and lens-specific proteins. We have used the gel mobility assays and corresponding competition experiments to identify and characterize cis elements within approximately 500 bases of LIM2 upstream sequences. Our studies locate the positions of some cis elements, including a "CA" repeat, a methylation Hha I island, an FnuD II site, an Ap1 and an Ap2 consensus sequences, and identify some specific cis elements which relate to lens-specific transcription of LIM2. Our experiments also preliminarily identify trans factors which bind to specific cis elements of the LIM2 promoter and/or regulate transcription of LIM2. We conclude that developmental regulation and coordination of the MP 19 gene in ocular lens fiber cells is controlled by the presence of specific cis elements that bind regulatory trans factors that affect LIM2 gene expression. DNA methylation is one mechanism of controlling LIM2 gene expression during lens development.

  11. Single nucleotide resolution RNA-seq uncovers new regulatory mechanisms in the opportunistic pathogen Streptococcus agalactiae.

    Science.gov (United States)

    Rosinski-Chupin, Isabelle; Sauvage, Elisabeth; Sismeiro, Odile; Villain, Adrien; Da Cunha, Violette; Caliot, Marie-Elise; Dillies, Marie-Agnès; Trieu-Cuot, Patrick; Bouloc, Philippe; Lartigue, Marie-Frédérique; Glaser, Philippe

    2015-05-30

    Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing. Our study identified 1,210 transcription start sites (TSSs) and 655 transcript ends as well as 39 riboswitches and cis-regulatory regions, 39 cis-antisense non-coding RNAs and 47 small RNAs potentially acting in trans. Among these putative regulatory RNAs, ten were differentially expressed in response to an acid stress and two riboswitches sensed directly or indirectly the pH modification. Strikingly, 15% of the TSSs identified were associated with the incorporation of pseudo-templated nucleotides, showing that reiterative transcription is a pervasive process in S. agalactiae. In particular, 40% of the TSSs upstream genes involved in nucleotide metabolism show reiterative transcription potentially regulating gene expression, as exemplified for pyrG and thyA encoding the CTP synthase and the thymidylate synthase respectively. This comprehensive map of the transcriptome at the single nucleotide resolution led to the discovery of new regulatory mechanisms in S. agalactiae. It also provides the basis for in depth analyses of transcriptional networks in S. agalactiae and of the regulatory role of reiterative transcription following variations of intra-cellular nucleotide pools.

  12. BLSSpeller: exhaustive comparative discovery of conserved cis-regulatory elements.

    Science.gov (United States)

    De Witte, Dieter; Van de Velde, Jan; Decap, Dries; Van Bel, Michiel; Audenaert, Pieter; Demeester, Piet; Dhoedt, Bart; Vandepoele, Klaas; Fostier, Jan

    2015-12-01

    The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected. We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays. BLSSpeller was written in Java. Source code and manual are available at http://bioinformatics.intec.ugent.be/blsspeller Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  13. The noncoding human genome and the future of personalised medicine.

    Science.gov (United States)

    Cowie, Philip; Hay, Elizabeth A; MacKenzie, Alasdair

    2015-01-30

    Non-coding cis-regulatory sequences act as the 'eyes' of the genome and their role is to perceive, organise and relay cellular communication information to RNA polymerase II at gene promoters. The evolution of these sequences, that include enhancers, silencers, insulators and promoters, has progressed in multicellular organisms to the extent that cis-regulatory sequences make up as much as 10% of the human genome. Parallel evidence suggests that 75% of polymorphisms associated with heritable disease occur within predicted cis-regulatory sequences that effectively alter the 'perception' of cis-regulatory sequences or render them blind to cell communication cues. Cis-regulatory sequences also act as major functional targets of epigenetic modification thus representing an important conduit through which changes in DNA-methylation affects disease susceptibility. The objectives of the current review are (1) to describe what has been learned about identifying and characterising cis-regulatory sequences since the sequencing of the human genome; (2) to discuss their role in interpreting cell signalling pathways pathways; and (3) outline how this role may be altered by polymorphisms and epigenetic changes. We argue that the importance of the cis-regulatory genome for the interpretation of cellular communication pathways cannot be overstated and understanding its role in health and disease will be critical for the future development of personalised medicine.

  14. PReMod: a database of genome-wide mammalian cis-regulatory module predictions.

    Science.gov (United States)

    Ferretti, Vincent; Poitras, Christian; Bergeron, Dominique; Coulombe, Benoit; Robert, François; Blanchette, Mathieu

    2007-01-01

    We describe PReMod, a new database of genome-wide cis-regulatory module (CRM) predictions for both the human and the mouse genomes. The prediction algorithm, described previously in Blanchette et al. (2006) Genome Res., 16, 656-668, exploits the fact that many known CRMs are made of clusters of phylogenetically conserved and repeated transcription factors (TF) binding sites. Contrary to other existing databases, PReMod is not restricted to modules located proximal to genes, but in fact mostly contains distal predicted CRMs (pCRMs). Through its web interface, PReMod allows users to (i) identify pCRMs around a gene of interest; (ii) identify pCRMs that have binding sites for a given TF (or a set of TFs) or (iii) download the entire dataset for local analyses. Queries can also be refined by filtering for specific chromosomal regions, for specific regions relative to genes or for the presence of CpG islands. The output includes information about the binding sites predicted within the selected pCRMs, and a graphical display of their distribution within the pCRMs. It also provides a visual depiction of the chromosomal context of the selected pCRMs in terms of neighboring pCRMs and genes, all of which are linked to the UCSC Genome Browser and the NCBI. PReMod: http://genomequebec.mcgill.ca/PReMod.

  15. REDfly: a Regulatory Element Database for Drosophila.

    Science.gov (United States)

    Gallo, Steven M; Li, Long; Hu, Zihua; Halfon, Marc S

    2006-02-01

    Bioinformatics studies of transcriptional regulation in the metazoa are significantly hindered by the absence of readily available data on large numbers of transcriptional cis-regulatory modules (CRMs). Even the richly annotated Drosophila melanogaster genome lacks extensive CRM information. We therefore present here a database of Drosophila CRMs curated from the literature complete with both DNA sequence and a searchable description of the gene expression pattern regulated by each CRM. This resource should greatly facilitate the development of computational approaches to CRM discovery as well as bioinformatics analyses of regulatory sequence properties and evolution.

  16. Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element.

    Science.gov (United States)

    Watanabe, Kenneth A; Homayouni, Arielle; Gu, Lingkun; Huang, Kuan-Ying; Ho, Tuan-Hua David; Shen, Qingxi J

    2017-09-01

    Seeds serve as a great model to study plant responses to drought stress, which is largely mediated by abscisic acid (ABA). The ABA responsive element (ABRE) is a key cis-regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA-induced genes in rice aleurone cells, suggesting other ABREs may exist. To identify novel ABREs, RNA sequencing was performed on aleurone cells of rice seeds treated with 20 μM ABA. Gibbs sampling was used to identify enriched elements, and particle bombardment-mediated transient expression studies were performed to verify the function. Gene ontology analysis was performed to predict the roles of genes containing the novel ABREs. This study revealed 2443 ABA-inducible genes and a novel ABRE, designated as ABREN, which was experimentally verified to mediate ABA signalling in rice aleurone cells. Many of the ABREN-containing genes are predicted to be involved in stress responses and transcription. Analysis of other species suggests that the ABREN may be monocot specific. This study also revealed interesting expression patterns of genes involved in ABA metabolism and signalling. Collectively, this study advanced our understanding of diverse cis-regulatory sequences and the transcriptomes underlying ABA responses in rice aleurone cells. © 2017 John Wiley & Sons Ltd.

  17. KIRMES: kernel-based identification of regulatory modules in euchromatic sequences.

    Science.gov (United States)

    Schultheiss, Sebastian J; Busch, Wolfgang; Lohmann, Jan U; Kohlbacher, Oliver; Rätsch, Gunnar

    2009-08-15

    Understanding transcriptional regulation is one of the main challenges in computational biology. An important problem is the identification of transcription factor (TF) binding sites in promoter regions of potential TF target genes. It is typically approached by position weight matrix-based motif identification algorithms using Gibbs sampling, or heuristics to extend seed oligos. Such algorithms succeed in identifying single, relatively well-conserved binding sites, but tend to fail when it comes to the identification of combinations of several degenerate binding sites, as those often found in cis-regulatory modules. We propose a new algorithm that combines the benefits of existing motif finding with the ones of support vector machines (SVMs) to find degenerate motifs in order to improve the modeling of regulatory modules. In experiments on microarray data from Arabidopsis thaliana, we were able to show that the newly developed strategy significantly improves the recognition of TF targets. The python source code (open source-licensed under GPL), the data for the experiments and a Galaxy-based web service are available at http://www.fml.mpg.de/raetsch/suppl/kirmes/.

  18. A 20 bp cis-acting element is both necessary and sufficient to mediate elicitor response of a maize PRms gene.

    Science.gov (United States)

    Raventós, D; Jensen, A B; Rask, M B; Casacuberta, J M; Mundy, J; San Segundo, B

    1995-01-01

    Transient gene expression assays in barley aleurone protoplasts were used to identify a cis-regulatory element involved in the elicitor-responsive expression of the maize PRms gene. Analysis of transcriptional fusions between PRms 5' upstream sequences and a chloramphenicol acetyltransferase reporter gene, as well as chimeric promoters containing PRms promoter fragments or repeated oligonucleotides fused to a minimal promoter, delineated a 20 bp sequence which functioned as an elicitor-response element (ERE). This sequence contains a motif (-246 AATTGACC) similar to sequences found in promoters of other pathogen-responsive genes. The analysis also indicated that an enhancing sequence(s) between -397 and -296 is required for full PRms activation by elicitors. The protein kinase inhibitor staurosporine was found to completely block the transcriptional activation induced by elicitors. These data indicate that protein phosphorylation is involved in the signal transduction pathway leading to PRms expression.

  19. Characterization of the Promoter Region of an Arabidopsis Gene for 9-cis-Epoxycarotenoid Dioxygenase Involved in Dehydration-Inducible Transcription

    Science.gov (United States)

    Behnam, Babak; Iuchi, Satoshi; Fujita, Miki; Fujita, Yasunari; Takasaki, Hironori; Osakabe, Yuriko; Yamaguchi-Shinozaki, Kazuko; Kobayashi, Masatomo; Shinozaki, Kazuo

    2013-01-01

    Plants respond to dehydration stress and tolerate water-deficit status through complex physiological and cellular processes. Many genes are induced by water deficit. Abscisic acid (ABA) plays important roles in tolerance to dehydration stress by inducing many stress genes. ABA is synthesized de novo in response to dehydration. Most of the genes involved in ABA biosynthesis have been identified, and they are expressed mainly in leaf vascular tissues. Of the products of such genes, 9-cis-epoxycarotenoid dioxygenase (NCED) is a key enzyme in ABA biosynthesis. One of the five NCED genes in Arabidopsis, AtNCED3, is significantly induced by dehydration. To understand the regulatory mechanism of the early stages of the dehydration stress response, it is important to analyse the transcriptional regulatory systems of AtNCED3. In the present study, we found that an overlapping G-box recognition sequence (5′-CACGTG-3′) at −2248 bp from the transcriptional start site of AtNCED3 is an important cis-acting element in the induction of the dehydration response. We discuss the possible transcriptional regulatory system of dehydration-responsive AtNCED3 expression, and how this may control the level of ABA under water-deficit conditions. PMID:23604098

  20. Two potential hookworm DAF-16 target genes, SNR-3 and LPP-1: gene structure, expression profile, and implications of a cis-regulatory element in the regulation of gene expression.

    Science.gov (United States)

    Gao, Xin; Goggin, Kevin; Dowling, Camille; Qian, Jason; Hawdon, John M

    2015-01-08

    Hookworms infect nearly 700 million people, causing anemia and developmental stunting in heavy infections. Little is known about the genomic structure or gene regulation in hookworms, although recent publication of draft genome assemblies has allowed the first investigations of these topics to be undertaken. The transcription factor DAF-16 mediates multiple developmental pathways in the free living nematode Caenorhabditis elegans, and is involved in the recovery from the developmentally arrested L3 in hookworms. Identification of downstream targets of DAF-16 will provide a better understanding of the molecular mechanism of hookworm infection. Genomic Fragment 2.23 containing a DAF-16 binding element (DBE) was used to identify overlapping complementary expressed sequence tags (ESTs). These sequences were used to search a draft assembly of the Ancylostoma caninum genome, and identified two neighboring genes, snr-3 and lpp-1, in a tail-to-tail orientation. Expression patterns of both genes during parasitic development were determined by qRT-PCR. DAF-16 dependent cis-regulatory activity of fragment 2.23 was investigated using an in vitro reporter system. The snr-3 gene spans approximately 5.6 kb in the genome and contains 3 exons and 2 introns, and contains the DBE in its 3' untranslated region. Downstream from snr-3 in a tail-to-tail arrangement is the gene lpp-1. The lpp-1 gene spans more than 6 kb and contains 10 exons and 9 introns. The A. caninum genome contains 2 apparent splice variants, but there are 7 splice variants in the A. ceylanicum genome. While the gene order is similar, the gene structures of the hookworm genes differ from their C. elegans orthologs. Both genes show peak expression in the late L4 stage. Using a cell culture based expression system, fragment 2.23 was found to have both DAF-16-dependent promoter and enhancer activity that required an intact DBE. Two putative DAF-16 targets were identified by genome wide screening for DAF-16 binding

  1. A transcription factor collective defines the HSN serotonergic neuron regulatory landscape.

    Science.gov (United States)

    Lloret-Fernández, Carla; Maicas, Miren; Mora-Martínez, Carlos; Artacho, Alejandro; Jimeno-Martín, Ángela; Chirivella, Laura; Weinberg, Peter; Flames, Nuria

    2018-03-22

    Cell differentiation is controlled by individual transcription factors (TFs) that together activate a selection of enhancers in specific cell types. How these combinations of TFs identify and activate their target sequences remains poorly understood. Here, we identify the cis -regulatory transcriptional code that controls the differentiation of serotonergic HSN neurons in Caenorhabditis elegans . Activation of the HSN transcriptome is directly orchestrated by a collective of six TFs. Binding site clusters for this TF collective form a regulatory signature that is sufficient for de novo identification of HSN neuron functional enhancers. Among C. elegans neurons, the HSN transcriptome most closely resembles that of mouse serotonergic neurons. Mouse orthologs of the HSN TF collective also regulate serotonergic differentiation and can functionally substitute for their worm counterparts which suggests deep homology. Our results identify rules governing the regulatory landscape of a critically important neuronal type in two species separated by over 700 million years. © 2018, Lloret-Fernández et al.

  2. Bioinformatic analysis of cis-regulatory interactions between progesterone and estrogen receptors in breast cancer

    Directory of Open Access Journals (Sweden)

    Matloob Khushi

    2014-11-01

    Full Text Available Chromatin factors interact with each other in a cell and sequence-specific manner in order to regulate transcription and a wealth of publically available datasets exists describing the genomic locations of these interactions. Our recently published BiSA (Binding Sites Analyser database contains transcription factor binding locations and epigenetic modifications collected from published studies and provides tools to analyse stored and imported data. Using BiSA we investigated the overlapping cis-regulatory role of estrogen receptor alpha (ERα and progesterone receptor (PR in the T-47D breast cancer cell line. We found that ERα binding sites overlap with a subset of PR binding sites. To investigate further, we re-analysed raw data to remove any biases introduced by the use of distinct tools in the original publications. We identified 22,152 PR and 18,560 ERα binding sites (<5% false discovery rate with 4,358 overlapping regions among the two datasets. BiSA statistical analysis revealed a non-significant overall overlap correlation between the two factors, suggesting that ERα and PR are not partner factors and do not require each other for binding to occur. However, Monte Carlo simulation by Binary Interval Search (BITS, Relevant Distance, Absolute Distance, Jaccard and Projection tests by Genometricorr revealed a statistically significant spatial correlation of binding regions on chromosome between the two factors. Motif analysis revealed that the shared binding regions were enriched with binding motifs for ERα, PR and a number of other transcription and pioneer factors. Some of these factors are known to co-locate with ERα and PR binding. Therefore spatially close proximity of ERα binding sites with PR binding sites suggests that ERα and PR, in general function independently at the molecular level, but that their activities converge on a specific subset of transcriptional targets.

  3. Impacts of Neanderthal-Introgressed Sequences on the Landscape of Human Gene Expression.

    Science.gov (United States)

    McCoy, Rajiv C; Wakefield, Jon; Akey, Joshua M

    2017-02-23

    Regulatory variation influencing gene expression is a key contributor to phenotypic diversity, both within and between species. Unfortunately, RNA degrades too rapidly to be recovered from fossil remains, limiting functional genomic insights about our extinct hominin relatives. Many Neanderthal sequences survive in modern humans due to ancient hybridization, providing an opportunity to assess their contributions to transcriptional variation and to test hypotheses about regulatory evolution. We developed a flexible Bayesian statistical approach to quantify allele-specific expression (ASE) in complex RNA-seq datasets. We identified widespread expression differences between Neanderthal and modern human alleles, indicating pervasive cis-regulatory impacts of introgression. Brain regions and testes exhibited significant downregulation of Neanderthal alleles relative to other tissues, consistent with natural selection influencing the tissue-specific regulatory landscape. Our study demonstrates that Neanderthal-inherited sequences are not silent remnants of ancient interbreeding but have measurable impacts on gene expression that contribute to variation in modern human phenotypes. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Two cis-acting elements responsible for posttranscriptional trans-regulation of gene expression of human T-cell leukemia virus type I

    International Nuclear Information System (INIS)

    Seiki, Motoharu; Inoue, Junichiro; Hidaka, Makoto; Yoshida, Mitsuaki

    1988-01-01

    The pX sequence of human T-cell leukemia virus type I codes for two nuclear proteins, p40 tax and p27 rex and a cytoplasmic protein, p21 X-III . p40 tax activates transcription from the long terminal repeat (LTR), whereas p27 rex modulates posttranscriptional processing to accumulate gag and env mRNAs that retain intron sequences. In this paper, the authors identify two cis-acting sequence elements needed for regulation by p27 rex : a 5' splice signal and a specific sequence in the 3' LTR. These two sequence elements are sufficient for regulation by p27 rex ; expression of a cellular gene (metallothionein I) became sensitive to rex regulation when the LTR was inserted at the 3' end of this gene. The requirement for these two elements suggests and unusual regulatory mechanism of RNA processing in the nucleus

  5. Computational exploration of cis-regulatory modules in rhythmic expression data using the "Exploration of Distinctive CREs and CRMs" (EDCC) and "CRM Network Generator" (CNG) programs.

    Science.gov (United States)

    Bekiaris, Pavlos Stephanos; Tekath, Tobias; Staiger, Dorothee; Danisman, Selahattin

    2018-01-01

    Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, "Exploration of Distinctive CREs and CRMs" (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, "CRM Network Generator" (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression.

  6. Third-Generation Sequencing and Analysis of Four Complete Pig Liver Esterase Gene Sequences in Clones Identified by Screening BAC Library.

    Science.gov (United States)

    Zhou, Qiongqiong; Sun, Wenjuan; Liu, Xiyan; Wang, Xiliang; Xiao, Yuncai; Bi, Dingren; Yin, Jingdong; Shi, Deshi

    2016-01-01

    Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analysis of complete gene sequences of PLE family by screening from a Rongchang pig BAC library and third-generation PacBio gene sequencing. After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome used as a template for Polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis. Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found that, not only did the PLE exon sequences of the four genes show a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720bps reverse complement sequences that may have an important function in the regulation of PLE gene expression. This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for

  7. Identification of cis-regulatory sequences that activate transcription in the suspensor of plant embryos.

    Science.gov (United States)

    Kawashima, Tomokazu; Wang, Xingjun; Henry, Kelli F; Bi, Yuping; Weterings, Koen; Goldberg, Robert B

    2009-03-03

    Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the scarlet runner bean (Phaseolus coccineus) G564 gene to understand how genes are activated specifically within the suspensor during early embryo development. Previously, we showed that the G564 upstream region has a block of tandem repeats, which contain a conserved 10-bp motif (GAAAAG(C)/(T)GAA), and that deletion of these repeats results in a loss of suspensor transcription. Here, we use gain-of-function (GOF) experiments with transgenic globular-stage tobacco embryos to show that only 1 of the 5 tandem repeats is required to drive suspensor-specific transcription. Fine-scale deletion and scanning mutagenesis experiments with 1 tandem repeat uncovered a 54-bp region that contains all of the sequences required to activate transcription in the suspensor, including the 10-bp motif (GAAAAGCGAA) and a similar 10-bp-like motif (GAAAAACGAA). Site-directed mutagenesis and GOF experiments indicated that both the 10-bp and 10-bp-like motifs are necessary, but not sufficient to activate transcription in the suspensor, and that a sequence (TTGGT) between the 10-bp and the 10-bp-like motifs is also necessary for suspensor transcription. Together, these data identify sequences that are required to activate transcription in the suspensor of a plant embryo after fertilization.

  8. Elements in the transcriptional regulatory region flanking herpes simplex virus type 1 oriS stimulate origin function.

    Science.gov (United States)

    Wong, S W; Schaffer, P A

    1991-05-01

    Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.

  9. Nutritional control of gene expression in Drosophila larvae via TOR, Myc and a novel cis-regulatory element

    Directory of Open Access Journals (Sweden)

    Grewal Savraj S

    2010-01-01

    Full Text Available Abstract Background Nutrient availability is a key determinant of eukaryotic cell growth. In unicellular organisms many signaling and transcriptional networks link nutrient availability to the expression of metabolic genes required for growth. However, less is known about the corresponding mechanisms that operate in metazoans. We used gene expression profiling to explore this issue in developing Drosophila larvae. Results We found that starvation for dietary amino acids (AA's leads to dynamic changes in transcript levels of many metabolic genes. The conserved insulin/PI3K and TOR signaling pathways mediate nutrition-dependent growth in Drosophila and other animals. We found that many AA starvation-responsive transcripts were also altered in TOR mutants. In contrast, although PI3K overexpression induced robust changes in the expression of many metabolic genes, these changes showed limited overlap with the AA starvation expression profile. We did however identify a strong overlap between genes regulated by the transcription factor, Myc, and AA starvation-responsive genes, particularly those involved in ribosome biogenesis, protein synthesis and mitochondrial function. The consensus Myc DNA binding site is enriched in promoters of these AA starvation genes, and we found that Myc overexpression could bypass dietary AA to induce expression of these genes. We also identified another sequence motif (Motif 1 enriched in the promoters of AA starvation-responsive genes. We showed that Motif 1 was both necessary and sufficient to mediate transcriptional responses to dietary AA in larvae. Conclusions Our data suggest that many of the transcriptional effects of amino acids are mediated via signaling through the TOR pathway in Drosophila larvae. We also find that these transcriptional effects are mediated through at least two mechanisms: via the transcription factor Myc, and via the Motif 1 cis-regulatory element. These studies begin to elucidate a nutrient

  10. Mapping cis-Regulatory Domains in the Human Genome UsingMulti-Species Conservation of Synteny

    Energy Technology Data Exchange (ETDEWEB)

    Ahituv, Nadav; Prabhakar, Shyam; Poulin, Francis; Rubin, EdwardM.; Couronne, Olivier

    2005-06-13

    Our inability to associate distant regulatory elements with the genes that they regulate has largely precluded their examination for sequence alterations contributing to human disease. One major obstacle is the large genomic space surrounding targeted genes in which such elements could potentially reside. In order to delineate gene regulatory boundaries we used whole-genome human-mouse-chicken (HMC) and human-mouse-frog (HMF) multiple alignments to compile conserved blocks of synteny (CBS), under the hypothesis that these blocks have been kept intact throughout evolution at least in part by the requirement of regulatory elements to stay linked to the genes that they regulate. A total of 2,116 and 1,942 CBS>200 kb were assembled for HMC and HMF respectively, encompassing 1.53 and 0.86 Gb of human sequence. To support the existence of complex long-range regulatory domains within these CBS we analyzed the prevalence and distribution of chromosomal aberrations leading to position effects (disruption of a genes regulatory environment), observing a clear bias not only for mapping onto CBS but also for longer CBS size. Our results provide a genome wide data set characterizing the regulatory domains of genes and the conserved regulatory elements within them.

  11. Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

    Directory of Open Access Journals (Sweden)

    Guo Xiang

    2008-12-01

    regulatory motifs in other species. These results suggest that these two motifs are likely to represent transcription factor binding sites in Theileria. Conclusion Theileria genomes are highly compact, with selection seemingly favoring short introns and intergenic regions. Three over-represented sequence motifs were independently identified in intergenic regions of both Theileria species, and the evidence suggests that at least two of them play a role in transcriptional control in T. parva. These are prime candidates for experimental validation of transcription factor binding sites in this single-celled eukaryotic parasite. Sequences similar to two of these Theileria motifs are conserved in Plasmodium hinting at the possibility of common regulatory machinery across the phylum Apicomplexa.

  12. Identification of a cis-regulatory region of a gene in Arabidopsis thaliana whose induction by dehydration is mediated by abscisic acid and requires protein synthesis.

    Science.gov (United States)

    Iwasaki, T; Yamaguchi-Shinozaki, K; Shinozaki, K

    1995-05-20

    In Arabidopsis thaliana, the induction of a dehydration-responsive gene, rd22, is mediated by abscisic acid (ABA) but the gene does not include any sequence corresponding to the consensus ABA-responsive element (ABRE), RYACGTGGYR, in its promoter region. The cis-regulatory region of the rd22 promoter was identified by monitoring the expression of beta-glucuronidase (GUS) activity in leaves of transgenic tobacco plants transformed with chimeric gene fusions constructed between 5'-deleted promoters of rd22 and the coding region of the GUS reporter gene. A 67-bp nucleotide fragment corresponding to positions -207 to -141 of the rd22 promoter conferred responsiveness to dehydration and ABA on a non-responsive promoter. The 67-bp fragment contains the sequences of the recognition sites for some transcription factors, such as MYC, MYB, and GT-1. The fact that accumulation of rd22 mRNA requires protein synthesis raises the possibility that the expression of rd22 might be regulated by one of these trans-acting protein factors whose de novo synthesis is induced by dehydration or ABA. Although the structure of the RD22 protein is very similar to that of a non-storage seed protein, USP, of Vicia faba, the expression of the GUS gene driven by the rd22 promoter in non-stressed transgenic Arabidopsis plants was found mainly in flowers and bolted stems rather than in seeds.

  13. Small RNAs and the regulation of cis-natural antisense transcripts in Arabidopsis

    Directory of Open Access Journals (Sweden)

    Lonardi Stefano

    2008-01-01

    Full Text Available Abstract Background In spite of large intergenic spaces in plant and animal genomes, 7% to 30% of genes in the genomes encode overlapping cis-natural antisense transcripts (cis-NATs. The widespread occurrence of cis-NATs suggests an evolutionary advantage for this type of genomic arrangement. Experimental evidence for the regulation of two cis-NAT gene pairs by natural antisense transcripts-generated small interfering RNAs (nat-siRNAs via the RNA interference (RNAi pathway has been reported in Arabidopsis. However, the extent of siRNA-mediated regulation of cis-NAT genes is still unclear in any genome. Results The hallmarks of RNAi regulation of NATs are 1 inverse regulation of two genes in a cis-NAT pair by environmental and developmental cues and 2 generation of siRNAs by cis-NAT genes. We examined Arabidopsis transcript profiling data from public microarray databases to identify cis-NAT pairs whose sense and antisense transcripts show opposite expression changes. A subset of the cis-NAT genes displayed negatively correlated expression profiles as well as inverse differential expression changes under at least one of the examined developmental stages or treatment conditions. By searching the Arabidopsis Small RNA Project (ASRP and Massively Parallel Signature Sequencing (MPSS small RNA databases as well as our stress-treated small RNA dataset, we found small RNAs that matched at least one gene in 646 pairs out of 1008 (64% protein-coding cis-NAT pairs, which suggests that siRNAs may regulate the expression of many cis-NAT genes. 209 putative siRNAs have the potential to target more than one gene and half of these small RNAs could target multiple members of a gene family. Furthermore, the majority of the putative siRNAs within the overlapping regions tend to target only one transcript of a given NAT pair, which is consistent with our previous finding on salt- and bacteria-induced nat-siRNAs. In addition, we found that genes encoding plastid- or

  14. Genome-wide decoding of hierarchical modular structure of transcriptional regulation by cis-element and expression clustering.

    Science.gov (United States)

    Leyfer, Dmitriy; Weng, Zhiping

    2005-09-01

    A holistic approach to the study of cellular processes is identifying both gene-expression changes and regulatory elements promoting such changes. Cellular regulatory processes can be viewed as transcriptional modules (TMs), groups of coexpressed genes regulated by groups of transcription factors (TFs). We set out to devise a method that would identify TMs while avoiding arbitrary thresholds on TM sizes and number. Assuming that gene expression is determined by TFs that bind to the gene's promoter, clustering of genes based on TF binding sites (cis-elements) should create gene groups similar to those obtained by gene expression clustering. Intersections between the expression and cis-element-based gene clusters reveal TMs. Statistical significance assigned to each TM allows identification of regulatory units of any size. Our method correctly identifies the number and sizes of TMs on simulated datasets. We demonstrate that yeast experimental TMs are biologically relevant by comparing them with MIPS and GO categories. Our modules are in statistically significant agreement with TMs from other research groups. This work suggests that there is no preferential division of biological processes into regulatory units; each degree of partitioning exhibits a slice of biological network revealing hierarchical modular organization of transcriptional regulation.

  15. Regulatory sequences driving expression of the sea urchin Otp homeobox gene in oral ectoderm cells.

    Science.gov (United States)

    Cavalieri, Vincenzo; Bernardo, Maria Di; Spinelli, Giovanni

    2007-01-01

    PlOtp (Orthopedia), a homeodomain-containing transcription factor, has been recently characterized as a key regulator of the morphogenesis of the skeletal system in the embryo of the sea urchin Paracentrotus lividus. Otp acts as a positive regulator in a subset of oral ectodermal cells which transmit short-range signals to the underlying primary mesenchyme cells where skeletal synthesis is initiated. To shed some light on the molecular mechanisms involved in such a process, we begun a functional analysis of the cis-regulatory sequences of the Otp gene. Congruent with the spatial expression profile of the endogenous Otp gene, we found that while a DNA region from -494 to +358 is shown to drive in vivo GFP reporter expression in the oral ectoderm, but also in the foregut, a larger region spanning from -2044 to +358 is needed to give firmly established tissue specificity. Microinjection of PCR-amplified DNA constructs, truncated in the 5' regulatory region, and determination of GFP mRNA level in injected embryos allowed the identification of a 5'-flanking fragment of 184bp in length, essential for expression of the transgene in the oral ectoderm of pluteus stage embryos. Finally, we conducted DNAse I-footprinting assays in nuclear extracts for the 184bp region and detected two protected sequences. Data bank search indicates that these sites contain consensus binding sites for transcription factors.

  16. Both positive and negative regulatory elements mediate expression of a photoregulated CAB gene from Nicotiana plumbaginifolia.

    Science.gov (United States)

    Castresana, C; Garcia-Luque, I; Alonso, E; Malik, V S; Cashmore, A R

    1988-01-01

    We have analyzed promoter regulatory elements from a photoregulated CAB gene (Cab-E) isolated from Nicotiana plumbaginifolia. These studies have been performed by introducing chimeric gene constructs into tobacco cells via Agrobacterium tumefaciens-mediated transformation. Expression studies on the regenerated transgenic plants have allowed us to characterize three positive and one negative cis-acting elements that influence photoregulated expression of the Cab-E gene. Within the upstream sequences we have identified two positive regulatory elements (PRE1 and PRE2) which confer maximum levels of photoregulated expression. These sequences contain multiple repeated elements related to the sequence-ACCGGCCCACTT-. We have also identified within the upstream region a negative regulatory element (NRE) extremely rich in AT sequences, which reduces the level of gene expression in the light. We have defined a light regulatory element (LRE) within the promoter region extending from -396 to -186 bp which confers photoregulated expression when fused to a constitutive nopaline synthase ('nos') promoter. Within this region there is a 132-bp element, extending from -368 to -234 bp, which on deletion from the Cab-E promoter reduces gene expression from high levels to undetectable levels. Finally, we have demonstrated for a full length Cab-E promoter conferring high levels of photoregulated expression, that sequences proximal to the Cab-E TATA box are not replaceable by corresponding sequences from a 'nos' promoter. This contrasts with the apparent equivalence of these Cab-E and 'nos' TATA box-proximal sequences in truncated promoters conferring low levels of photoregulated expression. Images PMID:2901343

  17. Screening for sequence-specific RNA-BPs by comprehensive UV crosslinking

    Directory of Open Access Journals (Sweden)

    Le Meuth-Metzinger Valerie

    2002-06-01

    Full Text Available Abstract Background Specific cis-elements and the associated trans-acting factors have been implicated in the post-transcriptional regulation of gene expression. In the era of genome wide analyses identifying novel trans-acting factors and cis-regulatory elements is a step towards understanding coordinated gene expression. UV-crosslink analysis is a standard method used to identify RNA-binding proteins. Uridine is traditionally used to radiolabel substrate RNAs, however, proteins binding to cis-elments particularly uridine poor will be weakly or not detected. We evaluate here the possibility of using UV-crosslinking with RNA substrates radiolabeled with each of the four ribonucleotides as an approach for screening for novel sequence specific RNA-binding proteins. Results The radiolabeled RNA substrates were derived from the 3'UTRs of the cloned Eg and c-mos Xenopus laevis maternal mRNAs. Specific, but not identical, uv-crosslinking signals were obtained, some of which corresponded to already identified proteins. A signal for a novel 90 kDa protein was observed with the c-mos 3'UTR radiolabeled with both CTP and GTP but not with UTP. The binding site of the 90 kDa RNA-binding protein was localised to a 59-nucleotide portion of the c-mos 3'UTR. Conclusion That the 90 kDa signal was detected with RNAs radiolabeled with CTP or GTP but not UTP illustrates the advantage of radiolabeling all four nucleotides in a UV-crosslink based screen. This method can be used for both long and short RNAs and does not require knowledge of the cis-acting sequence. It should be amenable to high throughput screening for RNA binding proteins.

  18. Novel 9-cis/all-trans β-carotene isomerases from plastidic oil bodies in Dunaliella bardawil catalyze the conversion of all-trans to 9-cis β-carotene.

    Science.gov (United States)

    Davidi, Lital; Pick, Uri

    2017-06-01

    We identified and demonstrated the function of 9-cis/all-trans β-carotene isomerases in plastidic globules of Dunaliella bardawil, the species accumulating the highest levels of 9-cis β-carotene that is essential for humans. The halotolerant alga Dunaliella bardawil is unique in that it accumulates under light stress high levels of β-carotene in plastidic lipid globules. The pigment is composed of two major isomers: all-trans β-carotene, the common natural form of this pigment, and 9-cis β-carotene. The biosynthetic pathway of β-carotene is known, but it is not clear how the 9-cis isomer is formed. We identified in plastidic lipid globules that were isolated from D. bardawil two proteins with high sequence homology to the D27 protein-a 9-cis/all-trans β-carotene isomerase from rice (Alder et al. Science 335:1348-1351, 2012). The proteins are enriched in the oil globules by 6- to 17-fold compared to chloroplast proteins. The expression of the corresponding genes, 9-cis-βC-iso1 and 9-cis-βC-iso2, is enhanced under light stress. The synthetic proteins catalyze in vitro conversion of all-trans to 9-cis β-carotene. Expression of the 9-cis-βC-iso1 or of 9-cis-βC-iso2 genes in an E. coli mutant line that harbors β-carotene biosynthesis genes enhanced the conversion of all-trans into 9-cis β-carotene. These results suggest that 9-cis-βC-ISO1 and 9-cis-βC-ISO2 proteins are responsible for the formation of 9-cis β-carotene in D. bardawil under stress conditions.

  19. Ancient and recent positive selection transformed opioid cis-regulation in humans.

    Directory of Open Access Journals (Sweden)

    Matthew V Rockman

    2005-12-01

    Full Text Available Changes in the cis-regulation of neural genes likely contributed to the evolution of our species' unique attributes, but evidence of a role for natural selection has been lacking. We found that positive natural selection altered the cis-regulation of human prodynorphin, the precursor molecule for a suite of endogenous opioids and neuropeptides with critical roles in regulating perception, behavior, and memory. Independent lines of phylogenetic and population genetic evidence support a history of selective sweeps driving the evolution of the human prodynorphin promoter. In experimental assays of chimpanzee-human hybrid promoters, the selected sequence increases transcriptional inducibility. The evidence for a change in the response of the brain's natural opioids to inductive stimuli points to potential human-specific characteristics favored during evolution. In addition, the pattern of linked nucleotide and microsatellite variation among and within modern human populations suggests that recent selection, subsequent to the fixation of the human-specific mutations and the peopling of the globe, has favored different prodynorphin cis-regulatory alleles in different parts of the world.

  20. Speeding cis-trans regulation discovery by phylogenomic analyses coupled with screenings of an arrayed library of Arabidopsis transcription factors.

    Directory of Open Access Journals (Sweden)

    Gabriel Castrillo

    Full Text Available Transcriptional regulation is an important mechanism underlying gene expression and has played a crucial role in evolution. The number, position and interactions between cis-elements and transcription factors (TFs determine the expression pattern of a gene. To identify functionally relevant cis-elements in gene promoters, a phylogenetic shadowing approach with a lipase gene (LIP1 was used. As a proof of concept, in silico analyses of several Brassicaceae LIP1 promoters identified a highly conserved sequence (LIP1 element that is sufficient to drive strong expression of a reporter gene in planta. A collection of ca. 1,200 Arabidopsis thaliana TF open reading frames (ORFs was arrayed in a 96-well format (RR library and a convenient mating based yeast one hybrid (Y1H screening procedure was established. We constructed an episomal plasmid (pTUY1H to clone the LIP1 element and used it as bait for Y1H screenings. A novel interaction with an HD-ZIP (AtML1 TF was identified and abolished by a 2 bp mutation in the LIP1 element. A role of this interaction in transcriptional regulation was confirmed in planta. In addition, we validated our strategy by reproducing the previously reported interaction between a MYB-CC (PHR1 TF, a central regulator of phosphate starvation responses, with a conserved promoter fragment (IPS1 element containing its cognate binding sequence. Finally, we established that the LIP1 and IPS1 elements were differentially bound by HD-ZIP and MYB-CC family members in agreement with their genetic redundancy in planta. In conclusion, combining in silico analyses of orthologous gene promoters with Y1H screening of the RR library represents a powerful approach to decipher cis- and trans-regulatory codes.

  1. PAA, WSH, and CIS Overview Self-Study #47656

    Energy Technology Data Exchange (ETDEWEB)

    Schroeder, Rachel Anne [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2017-09-14

    This course presents an overview of the Department of Energy’s (DOE’s) regulatory requirements relevant to the Price-Anderson Amendments Act (PAAA, also referred to as nuclear safety), worker safety and health (WSH), and classified information security (CIS) that are enforceable under the DOE enforcement program; describes the DOE enforcement process; and provides an overview of Los Alamos National Laboratory’s (LANL’s) internal compliance program relative to these DOE regulatory requirements. The LANL PAAA Program is responsible for maintaining LANL’s internal compliance program, which ensures the prompt identification, screening, and reporting of noncompliances to DOE regulatory requirements pertaining to nuclear safety, WSH, and CIS to build the strongest mitigation position for the Laboratory with respect to civil or other penalties.

  2. The Evolution of Bony Vertebrate Enhancers at Odds with Their Coding Sequence Landscape.

    Science.gov (United States)

    Yousaf, Aisha; Sohail Raza, Muhammad; Ali Abbasi, Amir

    2015-08-06

    Enhancers lie at the heart of transcriptional and developmental gene regulation. Therefore, changes in enhancer sequences usually disrupt the target gene expression and result in disease phenotypes. Despite the well-established role of enhancers in development and disease, evolutionary sequence studies are lacking. The current study attempts to unravel the puzzle of bony vertebrates' conserved noncoding elements (CNE) enhancer evolution. Bayesian phylogenetics of enhancer sequences spotlights promising interordinal relationships among placental mammals, proposing a closer relationship between humans and laurasiatherians while placing rodents at the basal position. Clock-based estimates of enhancer evolution provided a dynamic picture of interspecific rate changes across the bony vertebrate lineage. Moreover, coelacanth in the study augmented our appreciation of the vertebrate cis-regulatory evolution during water-land transition. Intriguingly, we observed a pronounced upsurge in enhancer evolution in land-dwelling vertebrates. These novel findings triggered us to further investigate the evolutionary trend of coding as well as CNE nonenhancer repertoires, to highlight the relative evolutionary dynamics of diverse genomic landscapes. Surprisingly, the evolutionary rates of enhancer sequences were clearly at odds with those of the coding and the CNE nonenhancer sequences during vertebrate adaptation to land, with land vertebrates exhibiting significantly reduced rates of coding sequence evolution in comparison to their fast evolving regulatory landscape. The observed variation in tetrapod cis-regulatory elements caused the fine-tuning of associated gene regulatory networks. Therefore, the increased evolutionary rate of tetrapods' enhancer sequences might be responsible for the variation in developmental regulatory circuits during the process of vertebrate adaptation to land. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for

  3. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes

    DEFF Research Database (Denmark)

    Parker, Brian John; Moltke, Ida; Roth, Adam

    2011-01-01

    a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein...

  4. Transformation of Migration Flows Between the Russian Far East and CIS and non-CIS States

    Directory of Open Access Journals (Sweden)

    Motrich E. L.

    2010-06-01

    Full Text Available Basic trends in the migration processes in the Russian Far East are shown. Special emphasis is placed on the transformation of migration interactions with CIS and non-CIS countries both at the level of the region as a whole, and at the level of the Far Eastern territories of the Russian Federation. An extent of using foreign labor in different periods of the Russian Far East socio-economic development and the regulatory support of this process are shown. Prospects for attracting and utilizing foreign labor are stated

  5. Computational methods in sequence and structure prediction

    Science.gov (United States)

    Lang, Caiyi

    This dissertation is organized into two parts. In the first part, we will discuss three computational methods for cis-regulatory element recognition in three different gene regulatory networks as the following: (a) Using a comprehensive "Phylogenetic Footprinting Comparison" method, we will investigate the promoter sequence structures of three enzymes (PAL, CHS and DFR) that catalyze sequential steps in the pathway from phenylalanine to anthocyanins in plants. Our result shows there exists a putative cis-regulatory element "AC(C/G)TAC(C)" in the upstream of these enzyme genes. We propose this cis-regulatory element to be responsible for the genetic regulation of these three enzymes and this element, might also be the binding site for MYB class transcription factor PAP1. (b) We will investigate the role of the Arabidopsis gene glutamate receptor 1.1 (AtGLR1.1) in C and N metabolism by utilizing the microarray data we obtained from AtGLR1.1 deficient lines (antiAtGLR1.1). We focus our investigation on the putatively co-regulated transcript profile of 876 genes we have collected in antiAtGLR1.1 lines. By (a) scanning the occurrence of several groups of known abscisic acid (ABA) related cisregulatory elements in the upstream regions of 876 Arabidopsis genes; and (b) exhaustive scanning of all possible 6-10 bps motif occurrence in the upstream regions of the same set of genes, we are able to make a quantative estimation on the enrichment level of each of the cis-regulatory element candidates. We finally conclude that one specific cis-regulatory element group, called "ABRE" elements, are statistically highly enriched within the 876-gene group as compared to their occurrence within the genome. (c) We will introduce a new general purpose algorithm, called "fuzzy REDUCE1", which we have developed recently for automated cis-regulatory element identification. In the second part, we will discuss our newly devised protein design framework. With this framework we have developed

  6. Association analysis identifies ZNF750 regulatory variants in psoriasis

    Directory of Open Access Journals (Sweden)

    Birnbaum Ramon Y

    2011-12-01

    Full Text Available Abstract Background Mutations in the ZNF750 promoter and coding regions have been previously associated with Mendelian forms of psoriasis and psoriasiform dermatitis. ZNF750 encodes a putative zinc finger transcription factor that is highly expressed in keratinocytes and represents a candidate psoriasis gene. Methods We examined whether ZNF750 variants were associated with psoriasis in a large case-control population. We sequenced the promoter and exon regions of ZNF750 in 716 Caucasian psoriasis cases and 397 Caucasian controls. Results We identified a total of 47 variants, including 38 rare variants of which 35 were novel. Association testing identified two ZNF750 haplotypes associated with psoriasis (p ZNF750 promoter and 5' UTR variants displayed a 35-55% reduction of ZNF750 promoter activity, consistent with the promoter activity reduction seen in a Mendelian psoriasis family with a ZNF750 promoter variant. However, the rare promoter and 5' UTR variants identified in this study did not strictly segregate with the psoriasis phenotype within families. Conclusions Two haplotypes of ZNF750 and rare 5' regulatory variants of ZNF750 were found to be associated with psoriasis. These rare 5' regulatory variants, though not causal, might serve as a genetic modifier of psoriasis.

  7. Evolution of cichlid vision via trans-regulatory divergence

    Directory of Open Access Journals (Sweden)

    O’Quin Kelly E

    2012-12-01

    Full Text Available Abstract Background Phenotypic evolution may occur through mutations that affect either the structure or expression of protein-coding genes. Although the evolution of color vision has historically been attributed to structural mutations within the opsin genes, recent research has shown that opsin regulatory mutations can also tune photoreceptor sensitivity and color vision. Visual sensitivity in African cichlid fishes varies as a result of the differential expression of seven opsin genes. We crossed cichlid species that express different opsin gene sets and scanned their genome for expression Quantitative Trait Loci (eQTL responsible for these differences. Our results shed light on the role that different structural, cis-, and trans-regulatory mutations play in the evolution of color vision. Results We identified 11 eQTL that contribute to the divergent expression of five opsin genes. On three linkage groups, several eQTL formed regulatory “hotspots” associated with the expression of multiple opsins. Importantly, however, the majority of the eQTL we identified (8/11 or 73% occur on linkage groups located trans to the opsin genes, suggesting that cichlid color vision has evolved primarily via trans-regulatory divergence. By modeling the impact of just two of these trans-regulatory eQTL, we show that opsin regulatory mutations can alter cichlid photoreceptor sensitivity and color vision at least as much as opsin structural mutations can. Conclusions Combined with previous work, we demonstrate that the evolution of cichlid color vision results from the interplay of structural, cis-, and especially trans-regulatory loci. Although there are numerous examples of structural and cis-regulatory mutations that contribute to phenotypic evolution, our results suggest that trans-regulatory mutations could contribute to phenotypic divergence more commonly than previously expected, especially in systems like color vision, where compensatory changes in the

  8. Complete re-sequencing of a 2Mb topological domain encompassing the FTO/IRXB genes identifies a novel obesity-associated region upstream of IRX5

    DEFF Research Database (Denmark)

    Hunt, Lilian E; Noyvert, Boris; Bhaw-Rosun, Leena

    2015-01-01

    BACKGROUND: Association studies have identified a number of loci that contribute to an increased body mass index (BMI), the strongest of which is in the first intron of the FTO gene on human chromosome 16q12.2. However, this region is both non-coding and under strong linkage disequilibrium, making...... it recalcitrant to functional interpretation. Furthermore, the FTO gene is located within a complex cis-regulatory landscape defined by a topologically associated domain that includes the IRXB gene cluster, a trio of developmental regulators. Consequently, at least three genes in this interval have been...... implicated in the aetiology of obesity. METHODS: Here, we sequence a 2 Mb region encompassing the FTO, RPGRIP1L and IRXB cluster genes in 284 individuals from a well-characterised study group of Danish men containing extremely overweight young adults and controls. We further replicate our findings both...

  9. Integrative modeling of eQTLs and cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs.

    Directory of Open Access Journals (Sweden)

    Christopher D Brown

    Full Text Available Genetic variants in cis-regulatory elements or trans-acting regulators frequently influence the quantity and spatiotemporal distribution of gene transcription. Recent interest in expression quantitative trait locus (eQTL mapping has paralleled the adoption of genome-wide association studies (GWAS for the analysis of complex traits and disease in humans. Under the hypothesis that many GWAS associations tag non-coding SNPs with small effects, and that these SNPs exert phenotypic control by modifying gene expression, it has become common to interpret GWAS associations using eQTL data. To fully exploit the mechanistic interpretability of eQTL-GWAS comparisons, an improved understanding of the genetic architecture and causal mechanisms of cell type specificity of eQTLs is required. We address this need by performing an eQTL analysis in three parts: first we identified eQTLs from eleven studies on seven cell types; then we integrated eQTL data with cis-regulatory element (CRE data from the ENCODE project; finally we built a set of classifiers to predict the cell type specificity of eQTLs. The cell type specificity of eQTLs is associated with eQTL SNP overlap with hundreds of cell type specific CRE classes, including enhancer, promoter, and repressive chromatin marks, regions of open chromatin, and many classes of DNA binding proteins. These associations provide insight into the molecular mechanisms generating the cell type specificity of eQTLs and the mode of regulation of corresponding eQTLs. Using a random forest classifier with cell specific CRE-SNP overlap as features, we demonstrate the feasibility of predicting the cell type specificity of eQTLs. We then demonstrate that CREs from a trait-associated cell type can be used to annotate GWAS associations in the absence of eQTL data for that cell type. We anticipate that such integrative, predictive modeling of cell specificity will improve our ability to understand the mechanistic basis of human

  10. Pleiotropy constrains the evolution of protein but not regulatory sequences in a transcription regulatory network influencing complex social behaviours

    Directory of Open Access Journals (Sweden)

    Daria eMolodtsova

    2014-12-01

    Full Text Available It is increasingly apparent that genes and networks that influence complex behaviour are evolutionary conserved, which is paradoxical considering that behaviour is labile over evolutionary timescales. How does adaptive change in behaviour arise if behaviour is controlled by conserved, pleiotropic, and likely evolutionary constrained genes? Pleiotropy and connectedness are known to constrain the general rate of protein evolution, prompting some to suggest that the evolution of complex traits, including behaviour, is fuelled by regulatory sequence evolution. However, we seldom have data on the strength of selection on mutations in coding and regulatory sequences, and this hinders our ability to study how pleiotropy influences coding and regulatory sequence evolution. Here we use population genomics to estimate the strength of selection on coding and regulatory mutations for a transcriptional regulatory network that influences complex behaviour of honey bees. We found that replacement mutations in highly connected transcription factors and target genes experience significantly stronger negative selection relative to weakly connected transcription factors and targets. Adaptively evolving proteins were significantly more likely to reside at the periphery of the regulatory network, while proteins with signs of negative selection were near the core of the network. Interestingly, connectedness and network structure had minimal influence on the strength of selection on putative regulatory sequences for both transcription factors and their targets. Our study indicates that adaptive evolution of complex behaviour can arise because of positive selection on protein-coding mutations in peripheral genes, and on regulatory sequence mutations in both transcription factors and their targets throughout the network.

  11. Piecing together cis-regulatory networks: insights from epigenomics studies in plants.

    Science.gov (United States)

    Huang, Shao-Shan C; Ecker, Joseph R

    2018-05-01

    5-Methylcytosine, a chemical modification of DNA, is a covalent modification found in the genomes of both plants and animals. Epigenetic inheritance of phenotypes mediated by DNA methylation is well established in plants. Most of the known mechanisms of establishing, maintaining and modifying DNA methylation have been worked out in the reference plant Arabidopsis thaliana. Major functions of DNA methylation in plants include regulation of gene expression and silencing of transposable elements (TEs) and repetitive sequences, both of which have parallels in mammalian biology, involve interaction with the transcriptional machinery, and may have profound effects on the regulatory networks in the cell. Methylome and transcriptome dynamics have been investigated in development and environmental responses in Arabidopsis and agriculturally and ecologically important plants, revealing the interdependent relationship among genomic context, methylation patterns, and expression of TE and protein coding genes. Analyses of methylome variation among plant natural populations and species have begun to quantify the extent of genetic control of methylome variation vs. true epimutation, and model the evolutionary forces driving methylome evolution in both short and long time scales. The ability of DNA methylation to positively or negatively modulate binding affinity of transcription factors (TFs) provides a natural link from genome sequence and methylation changes to transcription. Technologies that allow systematic determination of methylation sensitivities of TFs, in native genomic and methylation context without confounding factors such as histone modifications, will provide baseline datasets for building cell-type- and individual-specific regulatory networks that underlie the establishment and inheritance of complex traits. This article is categorized under: Laboratory Methods and Technologies > Genetic/Genomic Methods Biological Mechanisms > Regulatory Biology. © 2017 Wiley

  12. A de novo 1.58 Mb deletion, including MAP2K6 and mapping 1.28 Mb upstream to SOX9, identified in a patient with Pierre Robin sequence and osteopenia with multiple fractures.

    Science.gov (United States)

    Smyk, Marta; Roeder, Elizabeth; Cheung, Sau Wai; Szafranski, Przemyslaw; Stankiewicz, Paweł

    2015-08-01

    Defects of long-range regulatory elements of dosage-sensitive genes represent an under-recognized mechanism underlying genetic diseases. Haploinsufficiency of SOX9, the gene essential for development of testes and differentiation of chondrocytes, results in campomelic dysplasia, a skeletal malformation syndrome often associated with sex reversal. Chromosomal rearrangements with breakpoints mapping up to 1.6 Mb up- and downstream to SOX9, and disrupting its distant cis-regulatory elements, have been described in patients with milder forms of campomelic dysplasia, Pierre Robin sequence, and sex reversal. We present an ∼1.58 Mb deletion mapping ∼1.28 Mb upstream to SOX9 that encompasses its putative long-range cis-regulatory element(s) and MAP2K6 in a patient with Pierre Robin sequence and osteopenia with multiple fractures. Low bone mass panel testing using massively parallel sequencing of 23 nuclear genes, including COL1A1 and COL1A2 was negative. Based on the previous mouse model of Map2k6, suggesting that Sox9 is likely a downstream target of the p38 MAPK pathway, and our previous chromosome conformation capture-on-chip (4C) data showing potential interactions between SOX9 promoter and MAP2K6, we hypothesize that deletion of MAP2K6 might have affected SOX9 expression and contributed to our patient's phenotype. © 2015 Wiley Periodicals, Inc.

  13. Genome-wide methylation analysis identified sexually dimorphic methylated regions in hybrid tilapia

    Science.gov (United States)

    Wan, Zi Yi; Xia, Jun Hong; Lin, Grace; Wang, Le; Lin, Valerie C. L.; Yue, Gen Hua

    2016-01-01

    Sexual dimorphism is an interesting biological phenomenon. Previous studies showed that DNA methylation might play a role in sexual dimorphism. However, the overall picture of the genome-wide methylation landscape in sexually dimorphic species remains unclear. We analyzed the DNA methylation landscape and transcriptome in hybrid tilapia (Oreochromis spp.) using whole genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-seq). We found 4,757 sexually dimorphic differentially methylated regions (DMRs), with significant clusters of DMRs located on chromosomal regions associated with sex determination. CpG methylation in promoter regions was negatively correlated with the gene expression level. MAPK/ERK pathway was upregulated in male tilapia. We also inferred active cis-regulatory regions (ACRs) in skeletal muscle tissues from WGBS datasets, revealing sexually dimorphic cis-regulatory regions. These results suggest that DNA methylation contribute to sex-specific phenotypes and serve as resources for further investigation to analyze the functions of these regions and their contributions towards sexual dimorphisms. PMID:27782217

  14. TFpredict and SABINE: sequence-based prediction of structural and functional characteristics of transcription factors.

    Directory of Open Access Journals (Sweden)

    Johannes Eichner

    Full Text Available One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1 discriminates TFs from other proteins, (2 determines the structural superclass of TFs, (3 identifies the DNA-binding domains of TFs and (4 predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/.

  15. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  16. The upstream regulatory sequence of the light harvesting complex Lhcf2 gene of the marine diatom Phaeodactylum tricornutum enhances transcription in an orientation- and distance-independent fashion.

    Science.gov (United States)

    Russo, Monia Teresa; Annunziata, Rossella; Sanges, Remo; Ferrante, Maria Immacolata; Falciatore, Angela

    2015-12-01

    Diatoms are a key phytoplankton group in the contemporary ocean, showing extraordinary adaptation capacities to rapidly changing environments. The recent availability of whole genome sequences from representative species has revealed distinct features in their genomes, like novel combinations of genes encoding distinct metabolisms and a significant number of diatom-specific genes. However, the regulatory mechanisms driving diatom gene expression are still largely uncharacterized. Considering the wide variety of fields of study orbiting diatoms, ranging from ecology, evolutionary biology to biotechnology, it is thus essential to increase our understanding of fundamental gene regulatory processes such as transcriptional regulation. To this aim, we explored the functional properties of the 5'-flanking region of the Phaeodatylum tricornutum Lhcf2 gene, encoding a member of the Light Harvesting Complex superfamily and we showed that this region enhances transcription of a GUS reporter gene in an orientation- and distance-independent fashion. This represents the first example of a cis-regulatory sequence with enhancer-like features discovered in diatoms and it is instrumental for the generation of novel genetic tools and diatom exploitation in different areas of study. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Retinal Expression of the Drosophila eyes absent Gene Is Controlled by Several Cooperatively Acting Cis-regulatory Elements

    Science.gov (United States)

    Neuman, Sarah D.; Bashirullah, Arash; Kumar, Justin P.

    2016-01-01

    The eyes absent (eya) gene of the fruit fly, Drosophila melanogaster, is a member of an evolutionarily conserved gene regulatory network that controls eye formation in all seeing animals. The loss of eya leads to the complete elimination of the compound eye while forced expression of eya in non-retinal tissues is sufficient to induce ectopic eye formation. Within the developing retina eya is expressed in a dynamic pattern and is involved in tissue specification/determination, cell proliferation, apoptosis, and cell fate choice. In this report we explore the mechanisms by which eya expression is spatially and temporally governed in the developing eye. We demonstrate that multiple cis-regulatory elements function cooperatively to control eya transcription and that spacing between a pair of enhancer elements is important for maintaining correct gene expression. Lastly, we show that the loss of eya expression in sine oculis (so) mutants is the result of massive cell death and a progressive homeotic transformation of retinal progenitor cells into head epidermis. PMID:27930646

  18. Transcriptome sequencing from diverse human populations reveals differentiated regulatory architecture.

    Directory of Open Access Journals (Sweden)

    Alicia R Martin

    2014-08-01

    Full Text Available Large-scale sequencing efforts have documented extensive genetic variation within the human genome. However, our understanding of the origins, global distribution, and functional consequences of this variation is far from complete. While regulatory variation influencing gene expression has been studied within a handful of populations, the breadth of transcriptome differences across diverse human populations has not been systematically analyzed. To better understand the spectrum of gene expression variation, alternative splicing, and the population genetics of regulatory variation in humans, we have sequenced the genomes, exomes, and transcriptomes of EBV transformed lymphoblastoid cell lines derived from 45 individuals in the Human Genome Diversity Panel (HGDP. The populations sampled span the geographic breadth of human migration history and include Namibian San, Mbuti Pygmies of the Democratic Republic of Congo, Algerian Mozabites, Pathan of Pakistan, Cambodians of East Asia, Yakut of Siberia, and Mayans of Mexico. We discover that approximately 25.0% of the variation in gene expression found amongst individuals can be attributed to population differences. However, we find few genes that are systematically differentially expressed among populations. Of this population-specific variation, 75.5% is due to expression rather than splicing variability, and we find few genes with strong evidence for differential splicing across populations. Allelic expression analyses indicate that previously mapped common regulatory variants identified in eight populations from the International Haplotype Map Phase 3 project have similar effects in our seven sampled HGDP populations, suggesting that the cellular effects of common variants are shared across diverse populations. Together, these results provide a resource for studies analyzing functional differences across populations by estimating the degree of shared gene expression, alternative splicing, and

  19. Expression of the central growth regulator BIG BROTHER is regulated by multiple cis-elements

    Directory of Open Access Journals (Sweden)

    Breuninger Holger

    2012-03-01

    Full Text Available Abstract Background Much of the organismal variation we observe in nature is due to differences in organ size. The observation that even closely related species can show large, stably inherited differences in organ size indicates a strong genetic component to the control of organ size. Despite recent progress in identifying factors controlling organ growth in plants, our overall understanding of this process remains limited, partly because the individual factors have not yet been connected into larger regulatory pathways or networks. To begin addressing this aim, we have studied the upstream regulation of expression of BIG BROTHER (BB, a central growth-control gene in Arabidopsis thaliana that prevents overgrowth of organs. Final organ size and BB expression levels are tightly correlated, implying the need for precise control of its expression. BB expression mirrors proliferative activity, yet the gene functions to limit proliferation, suggesting that it acts in an incoherent feedforward loop downstream of growth activators to prevent over-proliferation. Results To investigate the upstream regulation of BB we combined a promoter deletion analysis with a phylogenetic footprinting approach. We were able to narrow down important, highly conserved, cis-regulatory elements within the BB promoter. Promoter sequences of other Brassicaceae species were able to partially complement the A. thaliana bb-1 mutant, suggesting that at least within the Brassicaceae family the regulatory pathways are conserved. Conclusions This work underlines the complexity involved in precise quantitative control of gene expression and lays the foundation for identifying important upstream regulators that determine BB expression levels and thus final organ size.

  20. A minimal murine Msx-1 gene promoter. Organization of its cis-regulatory motifs and their role in transcriptional activation in cells in culture and in transgenic mice.

    Science.gov (United States)

    Takahashi, T; Guron, C; Shetty, S; Matsui, H; Raghow, R

    1997-09-05

    To dissect the cis-regulatory elements of the murine Msx-1 promoter, which lacks a conventional TATA element, a putative Msx-1 promoter DNA fragment (from -1282 to +106 base pairs (bp)) or its congeners containing site-specific alterations were fused to luciferase reporter and introduced into NIH3T3 and C2C12 cells, and the expression of luciferase was assessed in transient expression assays. The functional consequences of the sequential 5' deletions of the promotor revealed that multiple positive and negative regulatory elements participate in regulating transcription of the Msx-1 gene. Surprisingly, however, the optimal expression of Msx-1 promoter in either NIH3T3 or C2C12 cells required only 165 bp of the upstream sequence to warrant detailed examination of its structure. Therefore, the functional consequences of site-specific deletions and point mutations of the cis-acting elements of the minimal Msx-1 promoter were systematically examined. Concomitantly, potential transcriptional factor(s) interacting with the cis-acting elements of the minimal promoter were also studied by gel electrophoretic mobility shift assays and DNase I footprinting. Combined analyses of the minimal promoter by DNase I footprinting, electrophoretic mobility shift assays, and super shift assays with specific antibodies revealed that 5'-flanking regions from -161 to -154 and from -26 to -13 of the Msx-1 promoter contains an authentic E box (proximal E box), capable of binding a protein immunologically related to the upstream stimulating factor 1 (USF-1) and a GC-rich sequence motif which can bind to Sp1 (proximal Sp1), respectively. Additionally, we observed that the promoter activation was seriously hampered if the proximal E box was removed or mutated, and the promoter activity was eliminated completely if the proximal Sp1 site was similarly altered. Absolute dependence of the Msx-1 minimal promoter on Sp1 could be demonstrated by transient expression assays in the Sp1-deficient

  1. POEM: Identifying joint additive effects on regulatory circuits

    Directory of Open Access Journals (Sweden)

    Maya eBotzman

    2016-04-01

    Full Text Available Motivation: Expression Quantitative Trait Locus (eQTL mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress towards a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such ‘modularization’ approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic effects.Results: Here we present POEM (Pairwise effect On Expression Modules, a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs.Availability: The software described in this article is available at csgi.tau.ac.il/POEM/.

  2. Ready access to functionally embellished cis-hydrindanes and cis-decalins: protecting group-free total syntheses of (±)-Nootkatone and (±)-Noreremophilane.

    Science.gov (United States)

    Handore, Kishor L; Seetharamsingh, B; Reddy, D Srinivasa

    2013-08-16

    A simple and efficient synthesis of functionalized cis-hydrindanes and cis-decalins was achieved using a sequential Diels-Alder/aldol approach in a highly diastereoselective manner. The scope of this method was tested with a variety of substrates and was successfully applied to the synthesis of two natural products in racemic form. The highlights of the present work provide ready access to 13 new cis-hydrindanes/cis-decalins, a protecting group-free total synthesis of an insect repellent Nootkatone, and the first synthesis of a Noreremophilane using the shortest sequence.

  3. Variant-aware saturating mutagenesis using multiple Cas9 nucleases identifies regulatory elements at trait-associated loci.

    Science.gov (United States)

    Canver, Matthew C; Lessard, Samuel; Pinello, Luca; Wu, Yuxuan; Ilboudo, Yann; Stern, Emily N; Needleman, Austen J; Galactéros, Frédéric; Brugnara, Carlo; Kutlar, Abdullah; McKenzie, Colin; Reid, Marvin; Chen, Diane D; Das, Partha Pratim; A Cole, Mitchel; Zeng, Jing; Kurita, Ryo; Nakamura, Yukio; Yuan, Guo-Cheng; Lettre, Guillaume; Bauer, Daniel E; Orkin, Stuart H

    2017-04-01

    Cas9-mediated, high-throughput, saturating in situ mutagenesis permits fine-mapping of function across genomic segments. Disease- and trait-associated variants identified in genome-wide association studies largely cluster at regulatory loci. Here we demonstrate the use of multiple designer nucleases and variant-aware library design to interrogate trait-associated regulatory DNA at high resolution. We developed a computational tool for the creation of saturating-mutagenesis libraries with single or multiple nucleases with incorporation of variants. We applied this methodology to the HBS1L-MYB intergenic region, which is associated with red-blood-cell traits, including fetal hemoglobin levels. This approach identified putative regulatory elements that control MYB expression. Analysis of genomic copy number highlighted potential false-positive regions, thus emphasizing the importance of off-target analysis in the design of saturating-mutagenesis experiments. Together, these data establish a widely applicable high-throughput and high-resolution methodology to identify minimal functional sequences within large disease- and trait-associated regions.

  4. 9-cis-retinoic acid represses estrogen-induced expression of the very low density apolipoprotein II gene.

    Science.gov (United States)

    Schippers, I J; Kloppenburg, M; Snippe, L; Ab, G

    1994-11-01

    The chicken very low density apolipoprotein II (apoVLDLII) gene is estrogen-inducible and specifically expressed in liver. We examined the possible involvement of the retinoid X receptor (RXR) and its ligand 9-cis-retinoic acid (9-cis-RA) in the activation of the apoVLDLII promoter. We first concentrated on a potential RXR recognition site, which deviates at only one position from a perfect direct A/GGGTCA repeat spaced by one nucleotide (DR-1) and was earlier identified as a common HNF-4/COUP-TF recognition site. However, band shift analysis revealed that this imperfect DR-1 motif does not interact with RXR alpha-homodimers. In accordance with this observation we found that this regulatory element does not mediate transactivation through RXR alpha in the presence of 9-cis-RA. However, our experiments revealed another, unexpected, effect of 9-cis-RA. Instead of stimulating, 9-cis-RA attenuated estrogen-induced expression of transfected estrogen-responsive VLDL-CAT reporter plasmids. This repression appeared to take place through the main estrogen response element (ERE) of the gene. Importantly, 9-cis-RA also strongly repressed the estrogen-induced expression of the endogenous apoVLDLII gene in cultured chicken hepatoma cells.

  5. Isolation of Persicaria minor sesquiterpene synthase promoter and its deletions for transgenic Arabidopsis thaliana

    Science.gov (United States)

    Omar, Aimi Farehah; Ismail, Ismanizan

    2016-11-01

    Sesquiterpene synthase (SS) catalyzes the formation of sesquiterpenes from farnesyl diphosphate (FDP) via carbocation intermediates. In this study, the promoter region of sesquiterpene synthase was isolated from Persicaria minor to identify possible cis-acting elements in the promoter. The full-length PmSS promoter of P. minor is 1824-bp sequences. The sequence was analyzed and several putative cis-acting regulatory elements were identified. Three cis-acting regulatory elements were selected for deletion analysis which are cis-acting element involved in wound responsiveness (WUN), cis - acting element involved in defense and stress responsiveness (TC) and cis-acting element involved in ABA responsiveness (ABRE). Series of deletions were conducted to assess the promoter activity producing three truncated fragments promoter; Prom 2 1606-bp, Prom 3 1144- bp, and Prom 4 921-bp. The full-length promoter and its deletion series were cloned into the pBGWFS7 vector which contain β-glucuronidase (GUS) gene and green fluorescent protein (GFP) as the reporter gene. All constructs were successfully transformed into Arabidopsis thaliana based on PCR of positive BASTA resistance plants.

  6. Directed evolution of toluene dioxygenase from Pseudomonas putida for improved selectivity toward cis-indandiol during indene bioconversion.

    Science.gov (United States)

    Zhang, N; Stewart, B G; Moore, J C; Greasham, R L; Robinson, D K; Buckland, B C; Lee, C

    2000-10-01

    Toluene dioxygenase (TDO) from Pseudomonas putida F1 converts indene to a mixture of cis-indandiol (racemic), 1-indenol, and 1-indanone. The desired product, cis-(1S,2R)-indandiol, is a potential key intermediate in the chemical synthesis of indinavir sulfate (Crixivan), Merck's HIV-1 protease inhibitor for the treatment of AIDS. To reduce the undesirable byproducts 1-indenol and 1-indanone formed during indene bioconversion, the recombinant TDO expressed in Escherichia coli was evolved by directed evolution using the error-prone polymerase chain reaction (epPCR) method. High-throughput fluorometric and spectrophotometric assays were developed for rapid screening of the mutant libraries in a 96-well format. Mutants with reduced 1-indenol by-product formation were identified, and the individual indene bioconversion product profiles of the selected mutants were confirmed by HPLC. Changes in the amino acid sequence of the mutant enzymes were identified by analyzing the nucleotide sequence of the genes. A mutant with the most desirable product profile from each library, defined as the most reduced 1-indenol concentration and with the highest cis-(1S,2R)-indandiol enantiomeric excess, was used to perform each subsequent round of mutagenesis. After three rounds of mutagenesis and screening, mutant 1C4-3G was identified to have a threefold reduction in 1-indenol formation over the wild type (20% vs 60% of total products) and a 40% increase of product (cis-indandiol) yield.

  7. A novel method for predicting activity of cis-regulatory modules, based on a diverse training set.

    Science.gov (United States)

    Yang, Wei; Sinha, Saurabh

    2017-01-01

    With the rapid emergence of technologies for locating cis-regulatory modules (CRMs) genome-wide, the next pressing challenge is to assign precise functions to each CRM, i.e. to determine the spatiotemporal domains or cell-types where it drives expression. A popular approach to this task is to model the typical k-mer composition of a set of CRMs known to drive a common expression pattern, and assign that pattern to other CRMs exhibiting a similar k-mer composition. This approach does not rely on prior knowledge of transcription factors relevant to the CRM or their binding motifs, and is thus more widely applicable than motif-based methods for predicting CRM activity, but is also prone to false positive predictions. We present a novel strategy to improve the above-mentioned approach: to predict if a CRM drives a specific gene expression pattern, assess not only how similar the CRM is to other CRMs with similar activity but also to CRMs with distinct activities. We use a state-of-the-art statistical method to quantify a CRM's sequence similarity to many different training sets of CRMs, and employ a classification algorithm to integrate these similarity scores into a single prediction of the CRM's activity. This strategy is shown to significantly improve CRM activity prediction over current approaches. Our implementation of the new method, called IMMBoost, is freely available as source code, at https://github.com/weiyangedward/IMMBoost CONTACT: sinhas@illinois.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Investment in the CEE/CIS region

    International Nuclear Information System (INIS)

    Lemierre, J.

    2002-01-01

    The energy investments in the Central and Eastern European region and the Commonwealth of Independent States (CIS) region are discussed in this Keynote Address. The message is addressed to regulators and governments. The restructuring of old industries to save energy is highlighted. The regulatory system must undergo a substantial reform. Another message is placed for investors in the energy field. (R.P.)

  9. Identification of cis-acting regulatory elements in the human oxytocin gene promoter.

    Science.gov (United States)

    Richard, S; Zingg, H H

    1991-12-01

    The expression of hormone-inducible genes is determined by the interaction of trans-acting factors with hormone-inducible elements and elements mediating basal and cell-specific expression. We have shown earlier that the gene encoding the hypothalamic nonapeptide oxytocin (OT) is under the control of an estrogen response element (ERE). The present study was aimed at identifying cis-acting elements mediating basal expression of the OT gene. A construct containing sequences -381 to +36 of the human OT gene was linked to a reporter gene and transiently transfected into a series of neuronal and nonneuronal cell lines. Expression of this construct was cell specific: it was highest in the neuroblastoma-derived cell line, Neuro-2a, and lowest in NIH 3T3 and JEG-3 cells. By 5' deletion analysis, we determined that a segment from -49 to +36 was capable of mediating cells-pecific promoter activity. Within this segment, we identified three proximal promoter elements (PPE-1, PPE-2, and PPE-3) that are each required for promoter activity. Most notably, mutation of a conserved purine-rich element (GAGAGA) contained within PPE-2 leads to a 10-fold decrease in promoter strength. Gel mobility shift analysis with three different double-stranded oligonucleotides demonstrated that each proximal promoter element binds distinct nuclear factors. In each case, only the homologous oligonucleotide, but neither of the oligonucleotides corresponding to adjacent elements, was able to act as a competitor. Thus, a different set of factors appears to bind independently to each element. By reinserting the homologous ERE or a heterologous glucocorticoid response element upstream of intact or altered proximal promoter segments we determined that removal or mutation of proximal promoter elements decreases basal expression, but does not abrogate the hormone responsiveness of the promoter. In conclusion, these results indicate that an important component of the transcriptional activity of the OT

  10. The MTP1 promoters from Arabidopsis halleri reveal cis-regulating elements for the evolution of metal tolerance.

    Science.gov (United States)

    Fasani, Elisa; DalCorso, Giovanni; Varotto, Claudio; Li, Mingai; Visioli, Giovanna; Mattarozzi, Monica; Furini, Antonella

    2017-06-01

    In the hyperaccumulator Arabidopsis halleri, the zinc (Zn) vacuolar transporter MTP1 is a key component of hypertolerance. Because protein sequences and functions are highly conserved between A. halleri and Arabidopsis thaliana, Zn tolerance in A. halleri may reflect the constitutively higher MTP1 expression compared with A. thaliana, based on copy number expansion and different cis regulation. Three MTP1 promoters were characterized in A. halleri ecotype I16. The comparison with the A. thaliana MTP1 promoter revealed different expression profiles correlated with specific cis-acting regulatory elements. The MTP1 5' untranslated region, highly conserved among A. thaliana, Arabidopsis lyrata and A. halleri, contains a dimer of MYB-binding motifs in the A. halleri promoters absent in the A. thaliana and A. lyrata sequences. Site-directed mutagenesis of these motifs revealed their role for expression in trichomes. A. thaliana mtp1 transgenic lines expressing AtMTP1 controlled by the native A. halleri promoter were more Zn-tolerant than lines carrying mutations on MYB-binding motifs. Differences in Zn tolerance were associated with different distribution of Zn among plant organs and in trichomes. The different cis-acting elements in the MTP1 promoters of A. halleri, particularly the MYB-binding sites, are probably involved in the evolution of Zn tolerance. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  11. The lncRNA Malat1 Is Dispensable for Mouse Development but Its Transcription Plays a cis-Regulatory Role in the Adult

    Directory of Open Access Journals (Sweden)

    Bin Zhang

    2012-07-01

    Full Text Available Genome-wide studies have identified thousands of long noncoding RNAs (lncRNAs lacking protein-coding capacity. However, most lncRNAs are expressed at a very low level, and in most cases there is no genetic evidence to support their in vivo function. Malat1 (metastasis associated lung adenocarcinoma transcript 1 is among the most abundant and highly conserved lncRNAs, and it exhibits an uncommon 3′-end processing mechanism. In addition, its specific nuclear localization, developmental regulation, and dysregulation in cancer are suggestive of it having a critical biological function. We have characterized a Malat1 loss-of-function genetic model that indicates that Malat1 is not essential for mouse pre- and postnatal development. Furthermore, depletion of Malat1 does not affect global gene expression, splicing factor level and phosphorylation status, or alternative pre-mRNA splicing. However, among a small number of genes that were dysregulated in adult Malat1 knockout mice, many were Malat1 neighboring genes, thus indicating a potential cis-regulatory role of Malat1 gene transcription.

  12. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

    Directory of Open Access Journals (Sweden)

    Pesole Graziano

    2007-02-01

    Full Text Available Abstract Background This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. Results We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. Conclusion Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.

  13. Sequence conservation and combinatorial complexity of Drosophila neural precursor cell enhancers

    Directory of Open Access Journals (Sweden)

    Kuzin Alexander

    2008-08-01

    Full Text Available Abstract Background The presence of highly conserved sequences within cis-regulatory regions can serve as a valuable starting point for elucidating the basis of enhancer function. This study focuses on regulation of gene expression during the early events of Drosophila neural development. We describe the use of EvoPrinter and cis-Decoder, a suite of interrelated phylogenetic footprinting and alignment programs, to characterize highly conserved sequences that are shared among co-regulating enhancers. Results Analysis of in vivo characterized enhancers that drive neural precursor gene expression has revealed that they contain clusters of highly conserved sequence blocks (CSBs made up of shorter shared sequence elements which are present in different combinations and orientations within the different co-regulating enhancers; these elements contain either known consensus transcription factor binding sites or consist of novel sequences that have not been functionally characterized. The CSBs of co-regulated enhancers share a large number of sequence elements, suggesting that a diverse repertoire of transcription factors may interact in a highly combinatorial fashion to coordinately regulate gene expression. We have used information gained from our comparative analysis to discover an enhancer that directs expression of the nervy gene in neural precursor cells of the CNS and PNS. Conclusion The combined use EvoPrinter and cis-Decoder has yielded important insights into the combinatorial appearance of fundamental sequence elements required for neural enhancer function. Each of the 30 enhancers examined conformed to a pattern of highly conserved blocks of sequences containing shared constituent elements. These data establish a basis for further analysis and understanding of neural enhancer function.

  14. Identification and characterization of cis-acting elements involved in the regulation of ABA- and/or GA-mediated LuPLR1 gene expression and lignan biosynthesis in flax (Linum usitatissimum L.) cell cultures.

    Science.gov (United States)

    Corbin, Cyrielle; Renouard, Sullivan; Lopez, Tatiana; Lamblin, Frédéric; Lainé, Eric; Hano, Christophe

    2013-03-15

    Pinoresinol lariciresinol reductase 1, encoded by the LuPLR1 gene in flax (Linum usitatissimum L.), is responsible for the biosynthesis of (+)-secoisolariciresinol, a cancer chemopreventive phytoestrogenic lignan accumulated in high amount in the hull of flaxseed. Our recent studies have demonstrated a key role of abscisic acid (ABA) in the regulation of LuPLR1 gene expression and thus of the (+)-secoisolariciresinol synthesis during the flax seedcoat development. It is well accepted that gibberellins (GA) and ABA play antagonistic roles in the regulation of numerous developmental processes; therefore it is of interest to clarify their respective effects on lignan biosynthesis. Herein, using flax cell suspension cultures, we demonstrate that LuPLR1 gene expression and (+)-secoisolariciresinol synthesis are up-regulated by ABA and down-regulated by GA. The LuPLR1 gene promoter analysis and mutation experiments allow us to identify and characterize two important cis-acting sequences (ABRE and MYB2) required for these regulations. These results imply that a cross-talk between ABA and GA signaling orchestrated by transcription factors is involved in the regulation of lignan biosynthesis. This is particularly evidenced in the case of the ABRE cis-regulatory sequence of LuPLR1 gene promoter that appears to be a common target sequence of GA and ABA signals. Copyright © 2012 Elsevier GmbH. All rights reserved.

  15. Mapping of cis-regulatory sites in the promoter of testis-specific stellate genes of Drosophila melanogaster.

    Science.gov (United States)

    Olenkina, O M; Egorova, K S; Aravin, A A; Naumova, N M; Gvozdev, V A; Olenina, L V

    2012-11-01

    Tandem Stellate genes organized into two clusters in heterochromatin and euchromatin of the X-chromosome are part of the Ste-Su(Ste) genetic system required for maintenance of male fertility and reproduction of Drosophila melanogaster. Stellate genes encode a regulatory subunit of protein kinase CK2 and are the main targets of germline-specific piRNA-silencing; their derepression leads to appearance of protein crystals in spermatocytes, meiotic disturbances, and male sterility. A short promoter region of 134 bp appears to be sufficient for testis-specific transcription of Stellate, and it contains three closely located cis-regulatory elements called E-boxes. By using reporter analysis, we confirmed a strong functionality of the E-boxes in the Stellate promoter for in vivo transcription. Using selective mutagenesis, we have shown that the presence of the central E-box 2 is preferable to maintain a high-level testis-specific transcription of the reporter gene under the Stellate promoter. The Stellate promoter provides transcription even in heterochromatin, and corresponding mRNAs are translated with the generation of full-size protein products in case of disturbances in the piRNA-silencing process. We have also shown for the first time that the activity of the Stellate promoter is determined by chromatin context of the X-chromosome in male germinal cells, and it increases at about twofold when relocating in autosomes.

  16. Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

    Directory of Open Access Journals (Sweden)

    Pietro Liò

    Full Text Available A major goal of bioinformatics is the characterization of transcription factors and the transcriptional programs they regulate. Given the speed of genome sequencing, we would like to quickly annotate regulatory sequences in newly-sequenced genomes. In such cases, it would be helpful to predict sequence motifs by using experimental data from closely related model organism. Here we present a general algorithm that allow to identify transcription factor binding sites in one newly sequenced species by performing Bayesian regression on the annotated species. First we set the rationale of our method by applying it within the same species, then we extend it to use data available in closely related species. Finally, we generalise the method to handle the case when a certain number of experiments, from several species close to the species on which to make inference, are available. In order to show the performance of the method, we analyse three functionally related networks in the Ascomycota. Two gene network case studies are related to the G2/M phase of the Ascomycota cell cycle; the third is related to morphogenesis. We also compared the method with MatrixReduce and discuss other types of validation and tests. The first network is well known and provides a biological validation test of the method. The two cell cycle case studies, where the gene network size is conserved, demonstrate an effective utility in annotating new species sequences using all the available replicas from model species. The third case, where the gene network size varies among species, shows that the combination of information is less powerful but is still informative. Our methodology is quite general and could be extended to integrate other high-throughput data from model organisms.

  17. In silico Analysis of osr40c1 Promoter Sequence Isolated from Indica Variety Pokkali

    Directory of Open Access Journals (Sweden)

    W.S.I. de Silva

    2017-07-01

    Full Text Available The promoter region of a drought and abscisic acid (ABA inducible gene, osr40c1, was isolated from a salt-tolerant indica rice variety Pokkali, which is 670 bp upstream of the putative translation start codon. In silico promoter analysis of resulted sequence showed that at least 15 types of putative motifs were distributed within the sequence, including two types of common promoter elements, TATA and CAAT boxes. Additionally, several putative cis-acing regulatory elements which may be involved in regulation of osr40c1 expression under different conditions were found in the 5′-upstream region of osr40c1. These are ABA-responsive element, light-responsive elements (ATCT-motif, Box I, G-box, GT1-motif, Gap-box and Sp1, myeloblastosis oncogene response element (CCAAT-box, auxin responsive element (TGA-element, gibberellin-responsive element (GARE-motif and fungal-elicitor responsive elements (Box E and Box-W1. A putative regulatory element, required for endosperm-specific pattern of gene expression designated as Skn-1 motif, was also detected in the Pokkali osr40c1 promoter region. In conclusion, the bioinformatic analysis of osr40c1 promoter region isolated from indica rice variety Pokkali led to the identification of several important stress-responsive cis-acting regulatory elements, and therefore, the isolated promoter sequence could be employed in rice genetic transformation to mediate expression of abiotic stress induced genes.

  18. In Silico Expression Analysis.

    Science.gov (United States)

    Bolívar, Julio; Hehl, Reinhard; Bülow, Lorenz

    2016-01-01

    Information on the specificity of cis-sequences enables the design of functional synthetic plant promoters that are responsive to specific stresses. Potential cis-sequences may be experimentally tested, however, correlation of genomic sequence with gene expression data enables an in silico expression analysis approach to bioinformatically assess the stress specificity of candidate cis-sequences prior to experimental verification. The present chapter demonstrates an example for the in silico validation of a potential cis-regulatory sequence responsive to cold stress. The described online tool can be applied for the bioinformatic assessment of cis-sequences responsive to most abiotic and biotic stresses of plants. Furthermore, a method is presented based on a reverted in silico expression analysis approach that predicts highly specific potentially functional cis-regulatory elements for a given stress.

  19. Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

    Science.gov (United States)

    Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

    2015-07-01

    A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  20. In silico Analysis of osr40c1 Promoter Sequence Isolated from Indica Variety Pokkali

    OpenAIRE

    W.S.I. de Silva; M.M.N. Perera; K.L.N.S. Perera; A.M. Wickramasuriya; G.A.U. Jayasekera

    2017-01-01

    The promoter region of a drought and abscisic acid (ABA) inducible gene, osr40c1, was isolated from a salt-tolerant indica rice variety Pokkali, which is 670 bp upstream of the putative translation start codon. In silico promoter analysis of resulted sequence showed that at least 15 types of putative motifs were distributed within the sequence, including two types of common promoter elements, TATA and CAAT boxes. Additionally, several putative cis-acing regulatory elements which may be involv...

  1. FDA's Activities Supporting Regulatory Application of "Next Gen" Sequencing Technologies.

    Science.gov (United States)

    Wilson, Carolyn A; Simonyan, Vahan

    2014-01-01

    Applications of next-generation sequencing (NGS) technologies require availability and access to an information technology (IT) infrastructure and bioinformatics tools for large amounts of data storage and analyses. The U.S. Food and Drug Administration (FDA) anticipates that the use of NGS data to support regulatory submissions will continue to increase as the scientific and clinical communities become more familiar with the technologies and identify more ways to apply these advanced methods to support development and evaluation of new biomedical products. FDA laboratories are conducting research on different NGS platforms and developing the IT infrastructure and bioinformatics tools needed to enable regulatory evaluation of the technologies and the data sponsors will submit. A High-performance Integrated Virtual Environment, or HIVE, has been launched, and development and refinement continues as a collaborative effort between the FDA and George Washington University to provide the tools to support these needs. The use of a highly parallelized environment facilitated by use of distributed cloud storage and computation has resulted in a platform that is both rapid and responsive to changing scientific needs. The FDA plans to further develop in-house capacity in this area, while also supporting engagement by the external community, by sponsoring an open, public workshop to discuss NGS technologies and data formats standardization, and to promote the adoption of interoperability protocols in September 2014. Next-generation sequencing (NGS) technologies are enabling breakthroughs in how the biomedical community is developing and evaluating medical products. One example is the potential application of this method to the detection and identification of microbial contaminants in biologic products. In order for the U.S. Food and Drug Administration (FDA) to be able to evaluate the utility of this technology, we need to have the information technology infrastructure and

  2. Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend

    Directory of Open Access Journals (Sweden)

    McCallion Andrew S

    2009-01-01

    Full Text Available Abstract Background Transcriptional regulatory elements are central to development and interspecific phenotypic variation. Current regulatory element prediction tools rely heavily upon conservation for prediction of putative elements. Recent in vitro observations from the ENCODE project combined with in vivo analyses at the zebrafish phox2b locus suggests that a significant fraction of regulatory elements may fall below commonly applied metrics of conservation. We propose to explore these observations in vivo at the human PHOX2B locus, and also evaluate the potential evidence for genome-wide applicability of these observations through a novel analysis of extant data. Results Transposon-based transgenic analysis utilizing a tiling path proximal to human PHOX2B in zebrafish recapitulates the observations at the zebrafish phox2b locus of both conserved and non-conserved regulatory elements. Analysis of human sequences conserved with previously identified zebrafish phox2b regulatory elements demonstrates that the orthologous sequences exhibit overlapping regulatory control. Additionally, analysis of non-conserved sequences scattered over 135 kb 5' to PHOX2B, provides evidence of non-conserved regulatory elements positively biased with close proximity to the gene. Furthermore, we provide a novel analysis of data from the ENCODE project, finding a non-uniform distribution of regulatory elements consistent with our in vivo observations at PHOX2B. These observations remain largely unchanged when one accounts for the sequence repeat content of the assayed intervals, when the intervals are sub-classified by biological role (developmental versus non-developmental, or by gene density (gene desert versus non-gene desert. Conclusion While regulatory elements frequently display evidence of evolutionary conservation, a fraction appears to be undetected by current metrics of conservation. In vivo observations at the PHOX2B locus, supported by our analyses of in

  3. Prediction of transcriptional regulatory elements for plant hormone responses based on microarray data

    Directory of Open Access Journals (Sweden)

    Yamaguchi-Shinozaki Kazuko

    2011-02-01

    Full Text Available Abstract Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the utilization of this data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We utilized such microarray data for prediction of cis-regulatory elements with an octamer-based approach. Our test prediction of a drought-responsive RD29A promoter with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit with the experimentally identified regulatory elements. With this succession, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. Totally 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG that have been extracted as position-dependent cis-regulatory elements with the aid of their feature of preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses.

  4. Transcriptome landscape of Lactococcus lactis reveals many novel RNAs including a small regulatory RNA involved in carbon uptake and metabolism.

    Science.gov (United States)

    van der Meulen, Sjoerd B; de Jong, Anne; Kok, Jan

    2016-01-01

    RNA sequencing has revolutionized genome-wide transcriptome analyses, and the identification of non-coding regulatory RNAs in bacteria has thus increased concurrently. Here we reveal the transcriptome map of the lactic acid bacterial paradigm Lactococcus lactis MG1363 by employing differential RNA sequencing (dRNA-seq) and a combination of manual and automated transcriptome mining. This resulted in a high-resolution genome annotation of L. lactis and the identification of 60 cis-encoded antisense RNAs (asRNAs), 186 trans-encoded putative regulatory RNAs (sRNAs) and 134 novel small ORFs. Based on the putative targets of asRNAs, a novel classification is proposed. Several transcription factor DNA binding motifs were identified in the promoter sequences of (a)sRNAs, providing insight in the interplay between lactococcal regulatory RNAs and transcription factors. The presence and lengths of 14 putative sRNAs were experimentally confirmed by differential Northern hybridization, including the abundant RNA 6S that is differentially expressed depending on the available carbon source. For another sRNA, LLMGnc_147, functional analysis revealed that it is involved in carbon uptake and metabolism. L. lactis contains 13% leaderless mRNAs (lmRNAs) that, from an analysis of overrepresentation in GO classes, seem predominantly involved in nucleotide metabolism and DNA/RNA binding. Moreover, an A-rich sequence motif immediately following the start codon was uncovered, which could provide novel insight in the translation of lmRNAs. Altogether, this first experimental genome-wide assessment of the transcriptome landscape of L. lactis and subsequent sRNA studies provide an extensive basis for the investigation of regulatory RNAs in L. lactis and related lactococcal species.

  5. RNA regulatory elements and polyadenylation in plants

    Directory of Open Access Journals (Sweden)

    Arthur G. Hunt

    2012-01-01

    Full Text Available Alternative poly(A site choice (also known as alternative polyadenylation, or APA has the potential to affect gene expression in qualitative and quantitative ways. Alternative polyadenylation may affect as many as 82% of all expressed genes in a plant. The consequences of APA include the generation of transcripts with differing 3’-UTRs (and thus differing potential regulatory potential and of transcripts with differing protein-coding potential. Genome-wide studies of possible APA suggest a linkage with pre-mRNA splicing, and indicate a coincidence of and perhaps cooperation between RNA regulatory elements that affect splicing efficiency and the recognition of novel intronic poly(A sites. These studies also raise the possibility of the existence of a novel class of polyadenylation-related cis elements that are distinct from the well-characterized plant polyadenylation signal. Many potential APA events, however, have not been associated with identifiable cis elements. The present state of the field reveals a broad scope of APA, and also numerous opportunities for research into mechanisms that govern both choice and regulation of poly(A sites in plants.

  6. Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus

    Directory of Open Access Journals (Sweden)

    Victoria L. Pritchard

    2017-01-01

    Full Text Available Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus, an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats.

  7. Enhanced regulatory sequence prediction using gapped k-mer features.

    Science.gov (United States)

    Ghandi, Mahmoud; Lee, Dongwon; Mohammad-Noori, Morteza; Beer, Michael A

    2014-07-01

    Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.

  8. Enhanced regulatory sequence prediction using gapped k-mer features.

    Directory of Open Access Journals (Sweden)

    Mahmoud Ghandi

    2014-07-01

    Full Text Available Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.

  9. Divergently overlapping cis-encoded antisense RNA regulating toxin-antitoxin systems from E. coli: hok/sok, ldr/rdl, symE/symR.

    Science.gov (United States)

    Kawano, Mitsuoki

    2012-12-01

    Toxin-antitoxin (TA) systems are categorized into three classes based on the type of antitoxin. In type I TA systems, the antitoxin is a small antisense RNA that inhibits translation of small toxic proteins by binding to the corresponding mRNAs. Those type I TA systems were originally identified as plasmid stabilization modules rendering a post-segregational killing (PSK) effect on the host cells. The type I TA loci also exist on the Escherichia coli chromosome but their biological functions are less clear. Genetic organization and regulatory elements of hok/sok and ldr/rdl families are very similar and the toxins are predicted to contain a transmembrane domain, but otherwise share no detectable sequence similarity. This review will give an overview of the type I TA modules of E. coli K-12, especially hok/sok, ldr/rdl and SOS-inducible symE/symR systems, which are regulated by divergently overlapping cis-encoded antisense RNAs.

  10. Computational methods to dissect cis-regulatory transcriptional ...

    Indian Academy of Sciences (India)

    The formation of diverse cell types from an invariant set of genes is governed by biochemical and molecular processes that regulate gene activity. A complete understanding of the regulatory mechanisms of gene expression is the major function of genomics. Computational genomics is a rapidly emerging area for ...

  11. Shared regulatory sites are abundant in the human genome and shed light on genome evolution and disease pleiotropy.

    Science.gov (United States)

    Tong, Pin; Monahan, Jack; Prendergast, James G D

    2017-03-01

    Large-scale gene expression datasets are providing an increasing understanding of the location of cis-eQTLs in the human genome and their role in disease. However, little is currently known regarding the extent of regulatory site-sharing between genes. This is despite it having potentially wide-ranging implications, from the determination of the way in which genetic variants may shape multiple phenotypes to the understanding of the evolution of human gene order. By first identifying the location of non-redundant cis-eQTLs, we show that regulatory site-sharing is a relatively common phenomenon in the human genome, with over 10% of non-redundant regulatory variants linked to the expression of multiple nearby genes. We show that these shared, local regulatory sites are linked to high levels of chromatin looping between the regulatory sites and their associated genes. In addition, these co-regulated gene modules are found to be strongly conserved across mammalian species, suggesting that shared regulatory sites have played an important role in shaping human gene order. The association of these shared cis-eQTLs with multiple genes means they also appear to be unusually important in understanding the genetics of human phenotypes and pleiotropy, with shared regulatory sites more often linked to multiple human phenotypes than other regulatory variants. This study shows that regulatory site-sharing is likely an underappreciated aspect of gene regulation and has important implications for the understanding of various biological phenomena, including how the two and three dimensional structures of the genome have been shaped and the potential causes of disease pleiotropy outside coding regions.

  12. Cis-Lunar Base Camp

    Science.gov (United States)

    Merrill, Raymond G.; Goodliff, Kandyce E.; Mazanek, Daniel D.; Reeves, John D., Jr.

    2012-01-01

    Historically, when mounting expeditions into uncharted territories, explorers have established strategically positioned base camps to pre-position required equipment and consumables. These base camps are secure, safe positions from which expeditions can depart when conditions are favorable, at which technology and operations can be tested and validated, and facilitate timely access to more robust facilities in the event of an emergency. For human exploration missions into deep space, cis-lunar space is well suited to serve as such a base camp. The outer regions of cis-lunar space, such as the Earth-Moon Lagrange points, lie near the edge of Earth s gravity well, allowing equipment and consumables to be aggregated with easy access to deep space and to the lunar surface, as well as more distant destinations, such as near-Earth Asteroids (NEAs) and Mars and its moons. Several approaches to utilizing a cis-lunar base camp for sustainable human exploration, as well as some possible future applications are identified. The primary objective of the analysis presented in this paper is to identify options, show the macro trends, and provide information that can be used as a basis for more detailed mission development. Compared within are the high-level performance and cost of 15 preliminary cis-lunar exploration campaigns that establish the capability to conduct crewed missions of up to one year in duration, and then aggregate mass in cis-lunar space to facilitate an expedition from Cis-Lunar Base Camp. Launch vehicles, chemical propulsion stages, and electric propulsion stages are discussed and parametric sizing values are used to create architectures of in-space transportation elements that extend the existing in-space supply chain to cis-lunar space. The transportation options to cis-lunar space assessed vary in efficiency by almost 50%; from 0.16 to 0.68 kg of cargo in cis-lunar space for every kilogram of mass in Low Earth Orbit (LEO). For the 15 cases, 5-year campaign

  13. Neutral forces acting on intragenomic variability shape the Escherichia coli regulatory network topology.

    Science.gov (United States)

    Ruths, Troy; Nakhleh, Luay

    2013-05-07

    Cis-regulatory networks (CRNs) play a central role in cellular decision making. Like every other biological system, CRNs undergo evolution, which shapes their properties by a combination of adaptive and nonadaptive evolutionary forces. Teasing apart these forces is an important step toward functional analyses of the different components of CRNs, designing regulatory perturbation experiments, and constructing synthetic networks. Although tests of neutrality and selection based on molecular sequence data exist, no such tests are currently available based on CRNs. In this work, we present a unique genotype model of CRNs that is grounded in a genomic context and demonstrate its use in identifying portions of the CRN with properties explainable by neutral evolutionary forces at the system, subsystem, and operon levels. We leverage our model against experimentally derived data from Escherichia coli. The results of this analysis show statistically significant and substantial neutral trends in properties previously identified as adaptive in origin--degree distribution, clustering coefficient, and motifs--within the E. coli CRN. Our model captures the tightly coupled genome-interactome of an organism and enables analyses of how evolutionary events acting at the genome level, such as mutation, and at the population level, such as genetic drift, give rise to neutral patterns that we can quantify in CRNs.

  14. Thermal degradation kinetics of all-trans and cis-carotenoids in a light-induced model system.

    Science.gov (United States)

    Xiao, Ya-Dong; Huang, Wu-Yang; Li, Da-Jing; Song, Jiang-Feng; Liu, Chun-Quan; Wei, Qiu-Yu; Zhang, Min; Yang, Qiu-Ming

    2018-01-15

    Thermal degradation kinetics of lutein, zeaxanthin, β-cryptoxanthin, β-carotene was studied at 25, 35, and 45°C in a model system. Qualitative and quantitative analyses of all-trans- and cis-carotenoids were conducted using HPLC-DAD-MS technologies. Kinetic and thermodynamic parameters were calculated by non-linear regression. A total of 29 geometrical isomers and four oxidation products were detected, including all-trans-, keto compounds, mono-cis- and di-cis-isomers. Degradations of all-trans-lutein, zeaxanthin, β-cryptoxanthin, and β-carotene were described by a first-order kinetic model, with the order of rate constants as k β -carotene >k β -cryptoxanthin >k lutein >k zeaxanthin . Activation energies of zeaxanthin, lutein, β-cryptoxanthin, and β-carotene were 65.6, 38.9, 33.9, and 8.6kJ/moL, respectively. cis-carotenoids also followed with the first-order kinetic model, but they did not show a defined sequence of degradation rate constants and activation energies at different temperatures. A possible degradation pathway of four carotenoids was identified to better understand the mechanism of carotenoid degradation. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Novel sequence variations in LAMA2 and SGCG genes modulating cis-acting regulatory elements and RNA secondary structure

    Directory of Open Access Journals (Sweden)

    Olfa Siala

    2010-01-01

    Full Text Available In this study, we detected new sequence variations in LAMA2 and SGCG genes in 5 ethnic populations, and analysed their effect on enhancer composition and mRNA structure. PCR amplification and DNA sequencing were performed and followed by bioinformatics analyses using ESEfinder as well as MFOLD software. We found 3 novel sequence variations in the LAMA2 (c.3174+22_23insAT and c.6085 +12delA and SGCG (c.*102A/C genes. These variations were present in 210 tested healthy controls from Tunisian, Moroccan, Algerian, Lebanese and French populations suggesting that they represent novel polymorphisms within LAMA2 and SGCG genes sequences. ESEfinder showed that the c.*102A/C substitution created a new exon splicing enhancer in the 3'UTR of SGCG genes, whereas the c.6085 +12delA deletion was situated in the base pairing region between LAMA2 mRNA and the U1snRNA spliceosomal components. The RNA structure analyses showed that both variations modulated RNA secondary structure. Our results are suggestive of correlations between mRNA folding and the recruitment of spliceosomal components mediating splicing, including SR proteins. The contribution of common sequence variations to mRNA structural and functional diversity will contribute to a better study of gene expression.

  16. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  17. Comparative evaluation of capillary electrophoresis and high-performance liquid chromatography for the separation of cis-cis, cis-trans, and trans-trans isomers of atracurium besylate.

    Science.gov (United States)

    de Moraes, M de L; Polakiewicz, B; Mattua, M F; Tavares, M F

    1998-01-01

    Atracurium besylate is a highly selective nondepolarizing neuromuscular blocking agent routinely used during anesthetic procedures. The commercial presentation of this drug is a mixture of positional isomers, cis-cis, cis-trans, and trans-trans. Reversed-phase high-performance liquid chromatography has been the technique of choice for the analysis of atracurium besylate formulations at the quality control laboratory of Núcleo de Desenvolvimento Cristália (São Paulo, Brazil), a local pharmaceutical company. HPLC analysis is usually conducted under gradient elution using acetonitrile/0.1 M phosphate buffer eluent mixture as mobile phase and an octadecyl silica (ODS)-packed column. The complete elution of the three isomers takes about 1 hr. In this work, an alternative capillary electrophoresis methodology was developed. The complete resolution of all three isomers was accomplished in about 13 min (+20 kV/72 cm, 211 nm direct detection) using a 60-mM phosphate buffer solution (pH 4) containing 20 mM beta-cyclodextrin and 4 M urea. The isomer ratio was found to be 59.1% cis-cis, 35.9% cis-trans, and 5.02% trans-trans (expected ratio: 59:35:6). Laudanosine, a major metabolite of atracurium besylate, was identified in two commercially available formulations, Tracur (Núcleo de Desenvolvimento Cristália) and Tracrium (Glaxo Wellcome, S.A., Rio de Janeiro, Brazil). Its concentration increases considerably during storage of the product, even if the product is stored at low temperatures.

  18. Prediction of transcriptional regulatory sites in the complete genome sequence of Escherichia coli K-12.

    Science.gov (United States)

    Thieffry, D; Salgado, H; Huerta, A M; Collado-Vides, J

    1998-06-01

    As one of the best-characterized free-living organisms, Escherichia coli and its recently completed genomic sequence offer a special opportunity to exploit systematically the variety of regulatory data available in the literature in order to make a comprehensive set of regulatory predictions in the whole genome. The complete genome sequence of E.coli was analyzed for the binding of transcriptional regulators upstream of coding sequences. The biological information contained in RegulonDB (Huerta, A.M. et al., Nucleic Acids Res.,26,55-60, 1998) for 56 different transcriptional proteins was the support to implement a stringent strategy combining string search and weight matrices. We estimate that our search included representatives of 15-25% of the total number of regulatory binding proteins in E.coli. This search was performed on the set of 4288 putative regulatory regions, each 450 bp long. Within the regions with predicted sites, 89% are regulated by one protein and 81% involve only one site. These numbers are reasonably consistent with the distribution of experimental regulatory sites. Regulatory sites are found in 603 regions corresponding to 16% of operon regions and 10% of intra-operonic regions. Additional evidence gives stronger support to some of these predictions, including the position of the site, biological consistency with the function of the downstream gene, as well as genetic evidence for the regulatory interaction. The predictions described here were incorporated into the map presented in the paper describing the complete E.coli genome (Blattner,F.R. et al., Science, 277, 1453-1461, 1997). The complete set of predictions in GenBank format is available at the url: http://www. cifn.unam.mx/Computational_Biology/E.coli-predictions ecoli-reg@cifn.unam.mx, collado@cifn.unam.mx

  19. CoryneRegNet: an ontology-based data warehouse of corynebacterial transcription factors and regulatory networks.

    Science.gov (United States)

    Baumbach, Jan; Brinkrolf, Karina; Czaja, Lisa F; Rahmann, Sven; Tauch, Andreas

    2006-02-14

    The application of DNA microarray technology in post-genomic analysis of bacterial genome sequences has allowed the generation of huge amounts of data related to regulatory networks. This data along with literature-derived knowledge on regulation of gene expression has opened the way for genome-wide reconstruction of transcriptional regulatory networks. These large-scale reconstructions can be converted into in silico models of bacterial cells that allow a systematic analysis of network behavior in response to changing environmental conditions. CoryneRegNet was designed to facilitate the genome-wide reconstruction of transcriptional regulatory networks of corynebacteria relevant in biotechnology and human medicine. During the import and integration process of data derived from experimental studies or literature knowledge CoryneRegNet generates links to genome annotations, to identified transcription factors and to the corresponding cis-regulatory elements. CoryneRegNet is based on a multi-layered, hierarchical and modular concept of transcriptional regulation and was implemented by using the relational database management system MySQL and an ontology-based data structure. Reconstructed regulatory networks can be visualized by using the yFiles JAVA graph library. As an application example of CoryneRegNet, we have reconstructed the global transcriptional regulation of a cellular module involved in SOS and stress response of corynebacteria. CoryneRegNet is an ontology-based data warehouse that allows a pertinent data management of regulatory interactions along with the genome-scale reconstruction of transcriptional regulatory networks. These models can further be combined with metabolic networks to build integrated models of cellular function including both metabolism and its transcriptional regulation.

  20. Ultraconservation identifies a small subset of extremely constrained developmental enhancers

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.; Visel, Axel; Prabhakar, Shyam; Akiyama, Jennifer A.; Shoukry, Malak; Lewis, Keith D.; Holt, Amy; Plajzer-Frick, Ingrid; Afzal, Veena; Rubin, Edward M.; Pennacchio, Len A.

    2007-10-01

    While experimental studies have suggested that non-coding ultraconserved DNA elements are central nodes in the regulatory circuitry that specifies mammalian embryonic development, the possible functional relevance of their>200bp of perfect sequence conservation between human-mouse-rat remains obscure 1,2. Here we have compared the in vivo enhancer activity of a genome-wide set of 231 non-exonic sequences with ultraconserved cores to that of 206 sequences that are under equivalently severe human-rodent constraint (ultra-like), but lack perfect sequence conservation. In transgenic mouse assays, 50percent of the ultraconserved and 50percent of the ultra-like conserved elements reproducibly functioned as tissue-specific enhancers at embryonic day 11.5. In this in vivo assay, we observed that ultraconserved enhancers and constrained non-ultraconserved enhancers targeted expression to a similar spectrum of tissues with a particular enrichment in the developing central nervous system. A human genome-wide comparative screen uncovered ~;;2,600 non-coding elements that evolved under ultra-like human-rodent constraint and are similarly enriched near transcriptional regulators and developmental genes as the much smaller number of ultraconserved elements. These data indicate that ultraconserved elements possessing absolute human-rodent sequence conservation are not distinct from other non-coding elements that are under comparable purifying selection in mammals and suggest they are principal constituents of the cis-regulatory framework of mammalian development.

  1. Meta-analysis of breast cancer microarray studies in conjunction with conserved cis-elements suggest patterns for coordinate regulation

    Directory of Open Access Journals (Sweden)

    Lundberg Cathryn

    2008-01-01

    Full Text Available Abstract Background Gene expression measurements from breast cancer (BrCa tumors are established clinical predictive tools to identify tumor subtypes, identify patients showing poor/good prognosis, and identify patients likely to have disease recurrence. However, diverse breast cancer datasets in conjunction with diagnostic clinical arrays show little overlap in the sets of genes identified. One approach to identify a set of consistently dysregulated candidate genes in these tumors is to employ meta-analysis of multiple independent microarray datasets. This allows one to compare expression data from a diverse collection of breast tumor array datasets generated on either cDNA or oligonucleotide arrays. Results We gathered expression data from 9 published microarray studies examining estrogen receptor positive (ER+ and estrogen receptor negative (ER- BrCa tumor cases from the Oncomine database. We performed a meta-analysis and identified genes that were universally up or down regulated with respect to ER+ versus ER- tumor status. We surveyed both the proximal promoter and 3' untranslated regions (3'UTR of our top-ranking genes in each expression group to test whether common sequence elements may contribute to the observed expression patterns. Utilizing a combination of known transcription factor binding sites (TFBS, evolutionarily conserved mammalian promoter and 3'UTR motifs, and microRNA (miRNA seed sequences, we identified numerous motifs that were disproportionately represented between the two gene classes suggesting a common regulatory network for the observed gene expression patterns. Conclusion Some of the genes we identified distinguish key transcripts previously seen in array studies, while others are newly defined. Many of the genes identified as overexpressed in ER- tumors were previously identified as expression markers for neoplastic transformation in multiple human cancers. Moreover, our motif analysis identified a collection of

  2. Diverse activities of viral cis-acting RNA regulatory elements revealed using multicolor, long-term, single-cell imaging.

    Science.gov (United States)

    Pocock, Ginger M; Zimdars, Laraine L; Yuan, Ming; Eliceiri, Kevin W; Ahlquist, Paul; Sherer, Nathan M

    2017-02-01

    Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include "burst" RNA nuclear export dynamics regulated by HIV-1's Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element-specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. © 2017 Pocock et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  3. Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus.

    Science.gov (United States)

    Pritchard, Victoria L; Viitaniemi, Heidi M; McCairns, R J Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R; Leder, Erica H

    2017-01-05

    Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. Copyright © 2017 Pritchard et al.

  4. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  5. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  6. CisLunar Habitat Internal Architecture Design Criteria

    Science.gov (United States)

    Jones, R.; Kennedy, K.; Howard, R.; Whitmore, M.; Martin, C.; Garate, J.

    2017-01-01

    Lunar Habitat Internal Architecture Study is to become a forcing function to establish a common understanding of CisLunar Phase-1 Habitation Internal Architecture design criteria, processes, and tools. The scope of the CisLunar Habitat Internal Architecture study is to design, develop, demonstrate, and evaluate a Phase-1 CisLunar Habitat common module internal architecture based on design criteria agreed to by NASA, the International Partners, and Commercial Exploration teams. This task is to define the CisLunar Phase-1 Internal Architecture Government Reference Design, assist NASA in becoming a "smart buyer" for Phase-1 Habitat Concepts, and ultimately to derive standards and requirements from the Internal Architecture Design Process. The first step was to define a Habitat Internal Architecture Design Criteria and create a structured philosophy to be used by design teams as a filter by which critical aspects of consideration would be identified for the purpose of organizing and utilizing interior spaces. With design criteria in place, the team will develop a series of iterative internal architecture concept designs which will be assessed by means of an evaluation criteria and process. These assessments will successively drive and refine the design, leading to the combination and down-selection of design concepts. A single refined reference design configuration will be developed into in a medium-to-high fidelity mockup. A multi-day human-in-the-loop mission test will fully evaluate the reference design and validate its configuration. Lessons learned from the design and evaluation will enable the team to identify appropriate standards for Phase-1 CisLunar Habitat Internal Architecture and will enable NASA to develop derived requirements in support of maturing CisLunar Habitation capabilities. This paper will describe the criteria definition process, workshop event, and resulting CisLunar Phase-1 Habitat Internal Architecture Design Criteria.

  7. Identifying structural variants using linked-read sequencing data.

    Science.gov (United States)

    Elyanow, Rebecca; Wu, Hsin-Ta; Raphael, Benjamin J

    2017-11-03

    Structural variation, including large deletions, duplications, inversions, translocations, and other rearrangements, is common in human and cancer genomes. A number of methods have been developed to identify structural variants from Illumina short-read sequencing data. However, reliable identification of structural variants remains challenging because many variants have breakpoints in repetitive regions of the genome and thus are difficult to identify with short reads. The recently developed linked-read sequencing technology from 10X Genomics combines a novel barcoding strategy with Illumina sequencing. This technology labels all reads that originate from a small number (~5-10) DNA molecules ~50Kbp in length with the same molecular barcode. These barcoded reads contain long-range sequence information that is advantageous for identification of structural variants. We present Novel Adjacency Identification with Barcoded Reads (NAIBR), an algorithm to identify structural variants in linked-read sequencing data. NAIBR predicts novel adjacencies in a individual genome resulting from structural variants using a probabilistic model that combines multiple signals in barcoded reads. We show that NAIBR outperforms several existing methods for structural variant identification - including two recent methods that also analyze linked-reads - on simulated sequencing data and 10X whole-genome sequencing data from the NA12878 human genome and the HCC1954 breast cancer cell line. Several of the novel somatic structural variants identified in HCC1954 overlap known cancer genes. Software is available at compbio.cs.brown.edu/software. braphael@princeton.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  8. Identifying TF-MiRNA Regulatory Relationships Using Multiple Features.

    Directory of Open Access Journals (Sweden)

    Mingyu Shao

    Full Text Available MicroRNAs are known to play important roles in the transcriptional and post-transcriptional regulation of gene expression. While intensive research has been conducted to identify miRNAs and their target genes in various genomes, there is only limited knowledge about how microRNAs are regulated. In this study, we construct a pipeline that can infer the regulatory relationships between transcription factors and microRNAs from ChIP-Seq data with high confidence. In particular, after identifying candidate peaks from ChIP-Seq data, we formulate the inference as a PU learning (learning from only positive and unlabeled examples problem. Multiple features including the statistical significance of the peaks, the location of the peaks, the transcription factor binding site motifs, and the evolutionary conservation are derived from peaks for training and prediction. To further improve the accuracy of our inference, we also apply a mean reciprocal rank (MRR-based method to the candidate peaks. We apply our pipeline to infer TF-miRNA regulatory relationships in mouse embryonic stem cells. The experimental results show that our approach provides very specific findings of TF-miRNA regulatory relationships.

  9. Murine homeobox-containing gene, Msx-1: analysis of genomic organization, promoter structure, and potential autoregulatory cis-acting elements.

    Science.gov (United States)

    Kuzuoka, M; Takahashi, T; Guron, C; Raghow, R

    1994-05-01

    Detailed molecular organization of the coding and upstream regulatory regions of the murine homeodomain-containing gene, Msx-1, is reported. The protein-encoding portion of the gene is contained in two exons, 590 and 1214 bp in length, separated by a 2107-bp intron; the homeodomain is located in the second exon. The two-exon organization of the murine Msx-1 gene resembles a number of other homeodomain-containing genes. The 5'-(GTAAGT) and 3'-(CCCTAG) splicing junctions and the mRNA polyadenylation signal (UAUAA) of the murine Msx-1 gene are also characteristic of other vertebrate genes. By nuclease protection and primer extension assays, the start of transcription of the Msx-1 gene was located 256 bp upstream of the first AUG. Computer analysis of the promoter proximal 1280-bp sequence revealed a number of potentially important cis-regulatory sequences; these include the recognition elements for Ap-1, Ap-2, Ap-3, Sp-1, a possible binding site for RAR:RXR, and a number of TCF-1 consensus motifs. Importantly, a perfect reverse complement of (C/G)TTAATTG, which was recently shown to be an optimal binding sequence for the homeodomain of Msx-1 protein (K.M. Catron, N. Iler, and C. Abate (1993) Mol. Cell. Biol. 13:2354-2365), was also located in the murine Msx-1 promoter. Binding of bacterially expressed Msx-1 homeodomain polypeptide to Msx-1-specific oligonucleotide was experimentally demonstrated, raising a distinct possibility of autoregulation of this developmentally regulated gene.

  10. Integrated systems approach identifies risk regulatory pathways and key regulators in coronary artery disease.

    Science.gov (United States)

    Zhang, Yan; Liu, Dianming; Wang, Lihong; Wang, Shuyuan; Yu, Xuexin; Dai, Enyu; Liu, Xinyi; Luo, Shanshun; Jiang, Wei

    2015-12-01

    Coronary artery disease (CAD) is the most common type of heart disease. However, the molecular mechanisms of CAD remain elusive. Regulatory pathways are known to play crucial roles in many pathogenic processes. Thus, inferring risk regulatory pathways is an important step toward elucidating the mechanisms underlying CAD. With advances in high-throughput data, we developed an integrated systems approach to identify CAD risk regulatory pathways and key regulators. Firstly, a CAD-related core subnetwork was identified from a curated transcription factor (TF) and microRNA (miRNA) regulatory network based on a random walk algorithm. Secondly, candidate risk regulatory pathways were extracted from the subnetwork by applying a breadth-first search (BFS) algorithm. Then, risk regulatory pathways were prioritized based on multiple CAD-associated data sources. Finally, we also proposed a new measure to prioritize upstream regulators. We inferred that phosphatase and tensin homolog (PTEN) may be a key regulator in the dysregulation of risk regulatory pathways. This study takes a closer step than the identification of disease subnetworks or modules. From the risk regulatory pathways, we could understand the flow of regulatory information in the initiation and progression of the disease. Our approach helps to uncover its potential etiology. We developed an integrated systems approach to identify risk regulatory pathways. We proposed a new measure to prioritize the key regulators in CAD. PTEN may be a key regulator in dysregulation of the risk regulatory pathways.

  11. 78 FR 44275 - Semiannual Regulatory Agenda

    Science.gov (United States)

    2013-07-23

    ... Rights. National Park Service--Completed Actions Regulation Sequence No. Title Identifier No. 200 Winter.... Timetable: Action Date FR Cite NPRM 07/00/13 Final Action 05/00/14 Regulatory Flexibility Analysis Required...: Action Date FR Cite NPRM 10/00/14 Final Action 10/00/14 Regulatory Flexibility Analysis Required: Yes...

  12. CoryneRegNet: An ontology-based data warehouse of corynebacterial transcription factors and regulatory networks

    Directory of Open Access Journals (Sweden)

    Czaja Lisa F

    2006-02-01

    Full Text Available Abstract Background The application of DNA microarray technology in post-genomic analysis of bacterial genome sequences has allowed the generation of huge amounts of data related to regulatory networks. This data along with literature-derived knowledge on regulation of gene expression has opened the way for genome-wide reconstruction of transcriptional regulatory networks. These large-scale reconstructions can be converted into in silico models of bacterial cells that allow a systematic analysis of network behavior in response to changing environmental conditions. Description CoryneRegNet was designed to facilitate the genome-wide reconstruction of transcriptional regulatory networks of corynebacteria relevant in biotechnology and human medicine. During the import and integration process of data derived from experimental studies or literature knowledge CoryneRegNet generates links to genome annotations, to identified transcription factors and to the corresponding cis-regulatory elements. CoryneRegNet is based on a multi-layered, hierarchical and modular concept of transcriptional regulation and was implemented by using the relational database management system MySQL and an ontology-based data structure. Reconstructed regulatory networks can be visualized by using the yFiles JAVA graph library. As an application example of CoryneRegNet, we have reconstructed the global transcriptional regulation of a cellular module involved in SOS and stress response of corynebacteria. Conclusion CoryneRegNet is an ontology-based data warehouse that allows a pertinent data management of regulatory interactions along with the genome-scale reconstruction of transcriptional regulatory networks. These models can further be combined with metabolic networks to build integrated models of cellular function including both metabolism and its transcriptional regulation.

  13. cis-Apa: a practical linker for the microwave-assisted preparation of cyclic pseudopeptides via RCM cyclative cleavage.

    Science.gov (United States)

    Baron, Alice; Verdié, Pascal; Martinez, Jean; Lamaty, Frédéric

    2011-02-04

    A new linker cis-5-aminopent-3-enoic acid (cis-Apa) was prepared for the synthesis of cyclic pseudopeptides by cyclization-cleavage by using ring-closing methatesis (RCM). We developed a new synthetic pathway for the preparation of the cis-Apa linker that was tested in the cyclization-cleavage process of different RGD peptide sequences. Different macrocyclic peptidomimetics were prepared by using this integrated microwave-assisted method, showing that the readily available cis-Apa amino acid is well adapted as a linker in the cyclization-cleavage process.

  14. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression.

    Science.gov (United States)

    Fairfax, Benjamin P; Humburg, Peter; Makino, Seiko; Naranbhai, Vivek; Wong, Daniel; Lau, Evelyn; Jostins, Luke; Plant, Katharine; Andrews, Robert; McGee, Chris; Knight, Julian C

    2014-03-07

    To systematically investigate the impact of immune stimulation upon regulatory variant activity, we exposed primary monocytes from 432 healthy Europeans to interferon-γ (IFN-γ) or differing durations of lipopolysaccharide and mapped expression quantitative trait loci (eQTLs). More than half of cis-eQTLs identified, involving hundreds of genes and associated pathways, are detected specifically in stimulated monocytes. Induced innate immune activity reveals multiple master regulatory trans-eQTLs including the major histocompatibility complex (MHC), coding variants altering enzyme and receptor function, an IFN-β cytokine network showing temporal specificity, and an interferon regulatory factor 2 (IRF2) transcription factor-modulated network. Induced eQTL are significantly enriched for genome-wide association study loci, identifying context-specific associations to putative causal genes including CARD9, ATM, and IRF8. Thus, applying pathophysiologically relevant immune stimuli assists resolution of functional genetic variants.

  15. Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

    Science.gov (United States)

    Richardson, Dale N.; Wiehe, Thomas

    Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.

  16. The evolutionary capacitor HSP90 buffers the regulatory effects of mammalian endogenous retroviruses.

    Science.gov (United States)

    Hummel, Barbara; Hansen, Erik C; Yoveva, Aneliya; Aprile-Garcia, Fernando; Hussong, Rebecca; Sawarkar, Ritwick

    2017-03-01

    Understanding how genotypes are linked to phenotypes is important in biomedical and evolutionary studies. The chaperone heat-shock protein 90 (HSP90) buffers genetic variation by stabilizing proteins with variant sequences, thereby uncoupling phenotypes from genotypes. Here we report an unexpected role of HSP90 in buffering cis-regulatory variation affecting gene expression. By using the tripartite-motif-containing 28 (TRIM28; also known as KAP1)-mediated epigenetic pathway, HSP90 represses the regulatory influence of endogenous retroviruses (ERVs) on neighboring genes that are critical for mouse development. Our data based on natural variations in the mouse genome show that genes respond to HSP90 inhibition in a manner dependent on their genomic location with regard to strain-specific ERV-insertion sites. The evolutionary-capacitor function of HSP90 may thus have facilitated the exaptation of ERVs as key modifiers of gene expression and morphological diversification. Our findings add a new regulatory layer through which HSP90 uncouples phenotypic outcomes from individual genotypes.

  17. Selective constraints in experimentally defined primate regulatory regions.

    Directory of Open Access Journals (Sweden)

    Daniel J Gaffney

    2008-08-01

    Full Text Available Changes in gene regulation may be important in evolution. However, the evolutionary properties of regulatory mutations are currently poorly understood. This is partly the result of an incomplete annotation of functional regulatory DNA in many species. For example, transcription factor binding sites (TFBSs, a major component of eukaryotic regulatory architecture, are typically short, degenerate, and therefore difficult to differentiate from randomly occurring, nonfunctional sequences. Furthermore, although sites such as TFBSs can be computationally predicted using evolutionary conservation as a criterion, estimates of the true level of selective constraint (defined as the fraction of strongly deleterious mutations occurring at a locus in regulatory regions will, by definition, be upwardly biased in datasets that are a priori evolutionarily conserved. Here we investigate the fitness effects of regulatory mutations using two complementary datasets of human TFBSs that are likely to be relatively free of ascertainment bias with respect to evolutionary conservation but, importantly, are supported by experimental data. The first is a collection of almost >2,100 human TFBSs drawn from the literature in the TRANSFAC database, and the second is derived from several recent high-throughput chromatin immunoprecipitation coupled with genomic microarray (ChIP-chip analyses. We also define a set of putative cis-regulatory modules (pCRMs by spatially clustering multiple TFBSs that regulate the same gene. We find that a relatively high proportion ( approximately 37% of mutations at TFBSs are strongly deleterious, similar to that at a 2-fold degenerate protein-coding site. However, constraint is significantly reduced in human and chimpanzee pCRMS and ChIP-chip sequences, relative to macaques. We estimate that the fraction of regulatory mutations that have been driven to fixation by positive selection in humans is not significantly different from zero. We also find

  18. Evidence for deep regulatory similarities in early developmental programs across highly diverged insects.

    Science.gov (United States)

    Kazemian, Majid; Suryamohan, Kushal; Chen, Jia-Yu; Zhang, Yinan; Samee, Md Abul Hassan; Halfon, Marc S; Sinha, Saurabh

    2014-09-01

    Many genes familiar from Drosophila development, such as the so-called gap, pair-rule, and segment polarity genes, play important roles in the development of other insects and in many cases appear to be deployed in a similar fashion, despite the fact that Drosophila-like "long germband" development is highly derived and confined to a subset of insect families. Whether or not these similarities extend to the regulatory level is unknown. Identification of regulatory regions beyond the well-studied Drosophila has been challenging as even within the Diptera (flies, including mosquitoes) regulatory sequences have diverged past the point of recognition by standard alignment methods. Here, we demonstrate that methods we previously developed for computational cis-regulatory module (CRM) discovery in Drosophila can be used effectively in highly diverged (250-350 Myr) insect species including Anopheles gambiae, Tribolium castaneum, Apis mellifera, and Nasonia vitripennis. In Drosophila, we have successfully used small sets of known CRMs as "training data" to guide the search for other CRMs with related function. We show here that although species-specific CRM training data do not exist, training sets from Drosophila can facilitate CRM discovery in diverged insects. We validate in vivo over a dozen new CRMs, roughly doubling the number of known CRMs in the four non-Drosophila species. Given the growing wealth of Drosophila CRM annotation, these results suggest that extensive regulatory sequence annotation will be possible in newly sequenced insects without recourse to costly and labor-intensive genome-scale experiments. We develop a new method, Regulus, which computes a probabilistic score of similarity based on binding site composition (despite the absence of nucleotide-level sequence alignment), and demonstrate similarity between functionally related CRMs from orthologous loci. Our work represents an important step toward being able to trace the evolutionary history of gene

  19. Evidence for Deep Regulatory Similarities in Early Developmental Programs across Highly Diverged Insects

    Science.gov (United States)

    Zhang, Yinan; Samee, Md. Abul Hassan; Halfon, Marc S.; Sinha, Saurabh

    2014-01-01

    Many genes familiar from Drosophila development, such as the so-called gap, pair-rule, and segment polarity genes, play important roles in the development of other insects and in many cases appear to be deployed in a similar fashion, despite the fact that Drosophila-like “long germband” development is highly derived and confined to a subset of insect families. Whether or not these similarities extend to the regulatory level is unknown. Identification of regulatory regions beyond the well-studied Drosophila has been challenging as even within the Diptera (flies, including mosquitoes) regulatory sequences have diverged past the point of recognition by standard alignment methods. Here, we demonstrate that methods we previously developed for computational cis-regulatory module (CRM) discovery in Drosophila can be used effectively in highly diverged (250–350 Myr) insect species including Anopheles gambiae, Tribolium castaneum, Apis mellifera, and Nasonia vitripennis. In Drosophila, we have successfully used small sets of known CRMs as “training data” to guide the search for other CRMs with related function. We show here that although species-specific CRM training data do not exist, training sets from Drosophila can facilitate CRM discovery in diverged insects. We validate in vivo over a dozen new CRMs, roughly doubling the number of known CRMs in the four non-Drosophila species. Given the growing wealth of Drosophila CRM annotation, these results suggest that extensive regulatory sequence annotation will be possible in newly sequenced insects without recourse to costly and labor-intensive genome-scale experiments. We develop a new method, Regulus, which computes a probabilistic score of similarity based on binding site composition (despite the absence of nucleotide-level sequence alignment), and demonstrate similarity between functionally related CRMs from orthologous loci. Our work represents an important step toward being able to trace the evolutionary

  20. SRD: a Staphylococcus regulatory RNA database.

    Science.gov (United States)

    Sassi, Mohamed; Augagneur, Yoann; Mauro, Tony; Ivain, Lorraine; Chabelskaya, Svetlana; Hallier, Marc; Sallou, Olivier; Felden, Brice

    2015-05-01

    An overflow of regulatory RNAs (sRNAs) was identified in a wide range of bacteria. We designed and implemented a new resource for the hundreds of sRNAs identified in Staphylococci, with primary focus on the human pathogen Staphylococcus aureus. The "Staphylococcal Regulatory RNA Database" (SRD, http://srd.genouest.org/) compiled all published data in a single interface including genetic locations, sequences and other features. SRD proposes novel and simplified identifiers for Staphylococcal regulatory RNAs (srn) based on the sRNA's genetic location in S. aureus strain N315 which served as a reference. From a set of 894 sequences and after an in-depth cleaning, SRD provides a list of 575 srn exempt of redundant sequences. For each sRNA, their experimental support(s) is provided, allowing the user to individually assess their validity and significance. RNA-seq analysis performed on strains N315, NCTC8325, and Newman allowed us to provide further details, upgrade the initial annotation, and identified 159 RNA-seq independent transcribed sRNAs. The lists of 575 and 159 sRNAs sequences were used to predict the number and location of srns in 18 S. aureus strains and 10 other Staphylococci. A comparison of the srn contents within 32 Staphylococcal genomes revealed a poor conservation between species. In addition, sRNA structure predictions obtained with MFold are accessible. A BLAST server and the intaRNA program, which is dedicated to target prediction, were implemented. SRD is the first sRNA database centered on a genus; it is a user-friendly and scalable device with the possibility to submit new sequences that should spread in the literature. © 2015 Sassi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  1. Co-regulation of the atrial natriuretic factor and cardiac myosin light chain-2 genes during alpha-adrenergic stimulation of neonatal rat ventricular cells. Identification of cis sequences within an embryonic and a constitutive contractile protein gene which mediate inducible expression.

    Science.gov (United States)

    Knowlton, K U; Baracchini, E; Ross, R S; Harris, A N; Henderson, S A; Evans, S M; Glembotski, C C; Chien, K R

    1991-04-25

    To study the mechanisms which mediate the transcriptional activation of cardiac genes during alpha adrenergic stimulation, the present study examined the regulated expression of three cardiac genes, a ventricular embryonic gene (atrial natriuretic factor, ANF), a constitutively expressed contractile protein gene (cardiac MLC-2), and a cardiac sodium channel gene. alpha 1-Adrenergic stimulation activates the expression and release of ANF from neonatal ventricular cells. As assessed by RNase protection analyses, treatment with alpha-adrenergic agonists increases the steady-state levels of ANF mRNA by greater than 15-fold. However, a rat cardiac sodium channel gene mRNA is not induced, indicating that alpha-adrenergic stimulation does not lead to an increase in the expression of all cardiac genes. Studies employing a series of rat ANF luciferase and rat MLC-2 luciferase fusion genes identify 315- and 92-base pair cis regulatory sequences within an embryonic gene (ANF) and a constitutively expressed contractile protein gene (MLC-2), respectively, which mediate alpha-adrenergic-inducible gene expression. Transfection of various ANF luciferase reporters into neonatal rat ventricular cells demonstrated that upstream sequences which mediate tissue-specific expression (-3003 to -638) can be segregated from those responsible for inducibility. The lack of inducibility of a cardiac Na+ channel gene, and the segregation of ANF gene sequences which mediate cardiac specific from those which mediate inducible expression, provides further insight into the relationship between muscle-specific and inducible expression during cardiac myocyte hypertrophy. Based on these results, a testable model is proposed for the induction of embryonic cardiac genes and constitutively expressed contractile protein genes and the noninducibility of a subset of cardiac genes during alpha-adrenergic stimulation of neonatal rat ventricular cells.

  2. cis-Acting Complex-Trait-Associated lincRNA Expression Correlates with Modulation of Chromosomal Architecture

    Directory of Open Access Journals (Sweden)

    Jennifer Yihong Tan

    2017-02-01

    Full Text Available Summary: Intergenic long noncoding RNAs (lincRNAs are the largest class of transcripts in the human genome. Although many have recently been linked to complex human traits, the underlying mechanisms for most of these transcripts remain undetermined. We investigated the regulatory roles of a high-confidence and reproducible set of 69 trait-relevant lincRNAs (TR-lincRNAs in human lymphoblastoid cells whose biological relevance is supported by their evolutionary conservation during recent human history and genetic interactions with other trait-associated loci. Their enrichment in enhancer-like chromatin signatures, interactions with nearby trait-relevant protein-coding loci, and preferential location at topologically associated domain (TAD boundaries provide evidence that TR-lincRNAs likely regulate proximal trait-relevant gene expression in cis by modulating local chromosomal architecture. This is consistent with the positive and significant correlation found between TR-lincRNA abundance and intra-TAD DNA-DNA contacts. Our results provide insights into the molecular mode of action by which TR-lincRNAs contribute to complex human traits. : Tan et al. identify and characterize 69 human complex trait/disease-associated lincRNAs in LCLs. They show that these loci are often associated with cis-regulation of gene expression and tend to be localized at TAD boundaries, suggesting that these lincRNAs may influence chromosomal architecture. Keywords: intergenic long noncoding RNA, lincRNA, GWAS, expression quantitative trait loci, eQTL, complex trait and disease, enhancer, cis-regulation, topologically associated domains, TAD

  3. Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

    Science.gov (United States)

    Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas

    2009-03-01

    Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.

  4. Integration analysis of microRNA and mRNA paired expression profiling identifies deregulated microRNA-transcription factor-gene regulatory networks in ovarian endometriosis.

    Science.gov (United States)

    Zhao, Luyang; Gu, Chenglei; Ye, Mingxia; Zhang, Zhe; Li, Li'an; Fan, Wensheng; Meng, Yuanguang

    2018-01-22

    The etiology and pathophysiology of endometriosis remain unclear. Accumulating evidence suggests that aberrant microRNA (miRNA) and transcription factor (TF) expression may be involved in the pathogenesis and development of endometriosis. This study therefore aims to survey the key miRNAs, TFs and genes and further understand the mechanism of endometriosis. Paired expression profiling of miRNA and mRNA in ectopic endometria compared with eutopic endometria were determined by high-throughput sequencing techniques in eight patients with ovarian endometriosis. Binary interactions and circuits among the miRNAs, TFs, and corresponding genes were identified by the Pearson correlation coefficients. miRNA-TF-gene regulatory networks were constructed using bioinformatic methods. Eleven selected miRNAs and TFs were validated by quantitative reverse transcription-polymerase chain reaction in 22 patients. Overall, 107 differentially expressed miRNAs and 6112 differentially expressed mRNAs were identified by comparing the sequencing of the ectopic endometrium group and the eutopic endometrium group. The miRNA-TF-gene regulatory network consists of 22 miRNAs, 12 TFs and 430 corresponding genes. Specifically, some key regulators from the miR-449 and miR-34b/c cluster, miR-200 family, miR-106a-363 cluster, miR-182/183, FOX family, GATA family, and E2F family as well as CEBPA, SOX9 and HNF4A were suggested to play vital regulatory roles in the pathogenesis of endometriosis. Integration analysis of the miRNA and mRNA expression profiles presents a unique insight into the regulatory network of this enigmatic disorder and possibly provides clues regarding replacement therapy for endometriosis.

  5. Distinct forms of the β subunit of GTP-binding regulatory proteins identified by molecular cloning

    International Nuclear Information System (INIS)

    Fong, H.K.W.; Amatruda, T.T. III; Birren, B.W.; Simon, M.I.

    1987-01-01

    Two distinct β subunits of guanine nucleotide-binding regulatory proteins have been identified by cDNA cloning and are referred to as β 1 and β 1 subunits. The bovine transducin β subunit (β 1 ) has been cloned previously. The author now isolated and analyzed cDNA clones that encode the β 2 subunit from bovine adrenal, bovine brain, and a human myeloid leukemia cell line, HL-60. The 340-residue M/sub r/ 37,329 Β 2 protein is 90% identical with β 1 in predicted amino acid sequence, and it is also organized as a series of repetitive homologous segments. The major mRNA that encodes the bovine β 2 subunit is 1.7 kilobases in length. It is expressed at lower levels than β 1 subunit mRNA in all tissues examined. The β 1 and β 2 messages are expressed in cloned human cell lines. Hybridization of cDNA probes to bovine DNA showed that β 1 and β 2 are encoded by separate genes. The amino acid sequences for the bovine and human β 2 subunit are identical, as are the amino acid sequences for the bovine and human β 1 subunit. This evolutionary conservation suggests that the two β subunits have different roles in the signal transduction process

  6. Does positive selection drive transcription factor binding site turnover? A test with Drosophila cis-regulatory modules.

    Directory of Open Access Journals (Sweden)

    Bin Z He

    2011-04-01

    Full Text Available Transcription factor binding site(s (TFBS gain and loss (i.e., turnover is a well-documented feature of cis-regulatory module (CRM evolution, yet little attention has been paid to the evolutionary force(s driving this turnover process. The predominant view, motivated by its widespread occurrence, emphasizes the importance of compensatory mutation and genetic drift. Positive selection, in contrast, although it has been invoked in specific instances of adaptive gene expression evolution, has not been considered as a general alternative to neutral compensatory evolution. In this study we evaluate the two hypotheses by analyzing patterns of single nucleotide polymorphism in the TFBS of well-characterized CRM in two closely related Drosophila species, Drosophila melanogaster and Drosophila simulans. An important feature of the analysis is classification of TFBS mutations according to the direction of their predicted effect on binding affinity, which allows gains and losses to be evaluated independently along the two phylogenetic lineages. The observed patterns of polymorphism and divergence are not compatible with neutral evolution for either class of mutations. Instead, multiple lines of evidence are consistent with contributions of positive selection to TFBS gain and loss as well as purifying selection in its maintenance. In discussion, we propose a model to reconcile the finding of selection driving TFBS turnover with constrained CRM function over long evolutionary time.

  7. Cis-regulatory RNA elements that regulate specialized ribosome activity.

    Science.gov (United States)

    Xue, Shifeng; Barna, Maria

    2015-01-01

    Recent evidence has shown that the ribosome itself can play a highly regulatory role in the specialized translation of specific subpools of mRNAs, in particular at the level of ribosomal proteins (RP). However, the mechanism(s) by which this selection takes place has remained poorly understood. In our recent study, we discovered a combination of unique RNA elements in the 5'UTRs of mRNAs that allows for such control by the ribosome. These mRNAs contain a Translation Inhibitory Element (TIE) that inhibits general cap-dependent translation, and an Internal Ribosome Entry Site (IRES) that relies on a specific RP for activation. The unique combination of an inhibitor of general translation and an activator of specialized translation is key to ribosome-mediated control of gene expression. Here we discuss how these RNA regulatory elements provide a new level of control to protein expression and their implications for gene expression, organismal development and evolution.

  8. TransDetect Identifies a New Regulatory Module Controlling Phosphate Accumulation.

    Science.gov (United States)

    Pal, Sikander; Kisko, Mushtak; Dubos, Christian; Lacombe, Benoit; Berthomieu, Pierre; Krouk, Gabriel; Rouached, Hatem

    2017-10-01

    Identifying transcription factor (TFs) cooperation controlling target gene expression is still an arduous challenge. The accuracy of current methods at genome scale significantly drops with the increase in number of genes, which limits their applicability to more complex genomes, like animals and plants. Here, we developed an algorithm, TransDetect, able to predict TF combinations controlling the expression level of a given gene. TransDetect was used to identify novel TF modules regulating the expression of Arabidopsis ( Arabidopsis thaliana ) phosphate transporter PHO1;H3 comprising MYB15, MYB84, bHLH35, and ICE1. These TFs were confirmed to interact between themselves and with the PHO1;H3 promoter. Phenotypic and genetic analyses of TF mutants enable the organization of these four TFs and PHO1;H3 in a new gene regulatory network controlling phosphate accumulation in zinc-dependent manner. This demonstrates the potential of TransDetect to extract directionality in nondynamic transcriptomes and to provide a blueprint to identify gene regulatory network involved in a given biological process. © 2017 American Society of Plant Biologists. All Rights Reserved.

  9. The evolution of gene expression QTL in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    James Ronald

    2007-08-01

    Full Text Available Understanding the evolutionary forces that influence patterns of gene expression variation will provide insights into the mechanisms of evolutionary change and the molecular basis of phenotypic diversity. To date, studies of gene expression evolution have primarily been made by analyzing how gene expression levels vary within and between species. However, the fundamental unit of heritable variation in transcript abundance is the underlying regulatory allele, and as a result it is necessary to understand gene expression evolution at the level of DNA sequence variation. Here we describe the evolutionary forces shaping patterns of genetic variation for 1206 cis-regulatory QTL identified in a cross between two divergent strains of Saccharomyces cerevisiae. We demonstrate that purifying selection against mildly deleterious alleles is the dominant force governing cis-regulatory evolution in S. cerevisiae and estimate the strength of selection. We also find that essential genes and genes with larger codon bias are subject to slightly stronger cis-regulatory constraint and that positive selection has played a role in the evolution of major trans-acting QTL.

  10. Identification and Functional Analysis of Gene Regulatory Sequences Interacting with Colorectal Tumor Suppressors

    DEFF Research Database (Denmark)

    Dahlgaard, Katja; Troelsen, Jesper

    2018-01-01

    Several tumor suppressors possess gene regulatory activity. Here, we describe how promoter and promoter/enhancer reporter assays can be used to characterize a colorectal tumor suppressor proteins’ gene regulatory activity of possible target genes. In the first part, a bioinformatic approach...... of the quick and efficient In-Fusion cloning method, and how to carry out transient transfections of Caco-2 colon cancer cells with the produced luciferase reporter plasmids using polyethyleneimine (PEI). A plan describing how to set up and carry out the luciferase expression assay is presented. The luciferase...... to identify relevant gene regulatory regions of potential target genes is presented. In the second part, it is demonstrated how to prepare and carry out the functional assay. We explain how to clone the bioinformatically identified gene regulatory regions into luciferase reporter plasmids by the use...

  11. Identifying Cancer Subtypes from miRNA-TF-mRNA Regulatory Networks and Expression Data.

    Directory of Open Access Journals (Sweden)

    Taosheng Xu

    Full Text Available Identifying cancer subtypes is an important component of the personalised medicine framework. An increasing number of computational methods have been developed to identify cancer subtypes. However, existing methods rarely use information from gene regulatory networks to facilitate the subtype identification. It is widely accepted that gene regulatory networks play crucial roles in understanding the mechanisms of diseases. Different cancer subtypes are likely caused by different regulatory mechanisms. Therefore, there are great opportunities for developing methods that can utilise network information in identifying cancer subtypes.In this paper, we propose a method, weighted similarity network fusion (WSNF, to utilise the information in the complex miRNA-TF-mRNA regulatory network in identifying cancer subtypes. We firstly build the regulatory network where the nodes represent the features, i.e. the microRNAs (miRNAs, transcription factors (TFs and messenger RNAs (mRNAs and the edges indicate the interactions between the features. The interactions are retrieved from various interatomic databases. We then use the network information and the expression data of the miRNAs, TFs and mRNAs to calculate the weight of the features, representing the level of importance of the features. The feature weight is then integrated into a network fusion approach to cluster the samples (patients and thus to identify cancer subtypes. We applied our method to the TCGA breast invasive carcinoma (BRCA and glioblastoma multiforme (GBM datasets. The experimental results show that WSNF performs better than the other commonly used computational methods, and the information from miRNA-TF-mRNA regulatory network contributes to the performance improvement. The WSNF method successfully identified five breast cancer subtypes and three GBM subtypes which show significantly different survival patterns. We observed that the expression patterns of the features in some mi

  12. Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    Science.gov (United States)

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966

  13. Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

    Science.gov (United States)

    Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

    2010-04-01

    Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.

  14. Mouse transgenesis identifies conserved functional enhancers and cis-regulatory motif in the vertebrate LIM homeobox gene Lhx2 locus.

    Directory of Open Access Journals (Sweden)

    Alison P Lee

    Full Text Available The vertebrate Lhx2 is a member of the LIM homeobox family of transcription factors. It is essential for the normal development of the forebrain, eye, olfactory system and liver as well for the differentiation of lymphoid cells. However, despite the highly restricted spatio-temporal expression pattern of Lhx2, nothing is known about its transcriptional regulation. In mammals and chicken, Crb2, Dennd1a and Lhx2 constitute a conserved linkage block, while the intervening Dennd1a is lost in the fugu Lhx2 locus. To identify functional enhancers of Lhx2, we predicted conserved noncoding elements (CNEs in the human, mouse and fugu Crb2-Lhx2 loci and assayed their function in transgenic mouse at E11.5. Four of the eight CNE constructs tested functioned as tissue-specific enhancers in specific regions of the central nervous system and the dorsal root ganglia (DRG, recapitulating partial and overlapping expression patterns of Lhx2 and Crb2 genes. There was considerable overlap in the expression domains of the CNEs, which suggests that the CNEs are either redundant enhancers or regulating different genes in the locus. Using a large set of CNEs (810 CNEs associated with transcription factor-encoding genes that express predominantly in the central nervous system, we predicted four over-represented 8-mer motifs that are likely to be associated with expression in the central nervous system. Mutation of one of them in a CNE that drove reporter expression in the neural tube and DRG abolished expression in both domains indicating that this motif is essential for expression in these domains. The failure of the four functional enhancers to recapitulate the complete expression pattern of Lhx2 at E11.5 indicates that there must be other Lhx2 enhancers that are either located outside the region investigated or divergent in mammals and fishes. Other approaches such as sequence comparison between multiple mammals are required to identify and characterize such enhancers.

  15. Identifying Corneal Infections in Formalin-Fixed Specimens Using Next Generation Sequencing.

    Science.gov (United States)

    Li, Zhigang; Breitwieser, Florian P; Lu, Jennifer; Jun, Albert S; Asnaghi, Laura; Salzberg, Steven L; Eberhart, Charles G

    2018-01-01

    We test the ability of next-generation sequencing, combined with computational analysis, to identify a range of organisms causing infectious keratitis. This retrospective study evaluated 16 cases of infectious keratitis and four control corneas in formalin-fixed tissues from the pathology laboratory. Infectious cases also were analyzed in the microbiology laboratory using culture, polymerase chain reaction, and direct staining. Classified sequence reads were analyzed with two different metagenomics classification engines, Kraken and Centrifuge, and visualized using the Pavian software tool. Sequencing generated 20 to 46 million reads per sample. On average, 96% of the reads were classified as human, 0.3% corresponded to known vectors or contaminant sequences, 1.7% represented microbial sequences, and 2.4% could not be classified. The two computational strategies successfully identified the fungal, bacterial, and amoebal pathogens in most patients, including all four bacterial and mycobacterial cases, five of six fungal cases, three of three Acanthamoeba cases, and one of three herpetic keratitis cases. In several cases, additional potential pathogens also were identified. In one case with cytomegalovirus identified by Kraken and Centrifuge, the virus was confirmed by direct testing, while two where Staphylococcus aureus or cytomegalovirus were identified by Centrifuge but not Kraken could not be confirmed. Confirmation was not attempted for an additional three potential pathogens identified by Kraken and 11 identified by Centrifuge. Next generation sequencing combined with computational analysis can identify a wide range of pathogens in formalin-fixed corneal specimens, with potential applications in clinical diagnostics and research.

  16. Systematic identification of regulatory variants associated with cancer risk.

    Science.gov (United States)

    Liu, Song; Liu, Yuwen; Zhang, Qin; Wu, Jiayu; Liang, Junbo; Yu, Shan; Wei, Gong-Hong; White, Kevin P; Wang, Xiaoyue

    2017-10-23

    Most cancer risk-associated single nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) are noncoding and it is challenging to assess their functional impacts. To systematically identify the SNPs that affect gene expression by modulating activities of distal regulatory elements, we adapt the self-transcribing active regulatory region sequencing (STARR-seq) strategy, a high-throughput technique to functionally quantify enhancer activities. From 10,673 SNPs linked with 996 cancer risk-associated SNPs identified in previous GWAS studies, we identify 575 SNPs in the fragments that positively regulate gene expression, and 758 SNPs in the fragments with negative regulatory activities. Among them, 70 variants are regulatory variants for which the two alleles confer different regulatory activities. We analyze in depth two regulatory variants-breast cancer risk SNP rs11055880 and leukemia risk-associated SNP rs12142375-and demonstrate their endogenous regulatory activities on expression of ATF7IP and PDE4B genes, respectively, using a CRISPR-Cas9 approach. By identifying regulatory variants associated with cancer susceptibility and studying their molecular functions, we hope to help the interpretation of GWAS results and provide improved information for cancer risk assessment.

  17. Human developmental enhancers conserved between deuterostomes and protostomes.

    Directory of Open Access Journals (Sweden)

    Shoa L Clarke

    Full Text Available The identification of homologies, whether morphological, molecular, or genetic, is fundamental to our understanding of common biological principles. Homologies bridging the great divide between deuterostomes and protostomes have served as the basis for current models of animal evolution and development. It is now appreciated that these two clades share a common developmental toolkit consisting of conserved transcription factors and signaling pathways. These patterning genes sometimes show common expression patterns and genetic interactions, suggesting the existence of similar or even conserved regulatory apparatus. However, previous studies have found no regulatory sequence conserved between deuterostomes and protostomes. Here we describe the first such enhancers, which we call bilaterian conserved regulatory elements (Bicores. Bicores show conservation of sequence and gene synteny. Sequence conservation of Bicores reflects conserved patterns of transcription factor binding sites. We predict that Bicores act as response elements to signaling pathways, and we show that Bicores are developmental enhancers that drive expression of transcriptional repressors in the vertebrate central nervous system. Although the small number of identified Bicores suggests extensive rewiring of cis-regulation between the protostome and deuterostome clades, additional Bicores may be revealed as our understanding of cis-regulatory logic and sample of bilaterian genomes continue to grow.

  18. 5' Region of the human interleukin 4 gene: structure and potential regulatory elements

    Energy Technology Data Exchange (ETDEWEB)

    Eder, A; Krafft-Czepa, H; Krammer, P H

    1988-01-25

    The lymphokine Interleukin 4 (IL-4) is secreted by antigen or mitogen activated T lymphocytes. IL-4 stimulates activation and differentiation of B lymphocytes and growth of T lymphocytes and mast cells. The authors isolated the human IL-4 gene from a lambda EMBL3 genomic library. As a probe they used a synthetic oligonucleotide spanning position 40 to 79 of the published IL-4 cDNA sequence. The 5' promoter region contains several sequence elements which may have a cis-acting regulatory function for IL-4 gene expression. These elements include a TATA-box, three CCAAT-elements (two are on the non-coding strand) and an octamer motif. A comparison of the 5' flanking region of the human murine IL-4 gene (4) shows that the region between position -306 and +44 is highly conserved (83% homology).

  19. Direct repeat sequences are essential for function of the cis-acting locus of transfer (clt) of Streptomyces phaeochromogenes plasmid pJV1.

    Science.gov (United States)

    Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis

    2003-11-01

    The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.

  20. Characterizing the D2 statistic: word matches in biological sequences.

    Science.gov (United States)

    Forêt, Sylvain; Wilson, Susan R; Burden, Conrad J

    2009-01-01

    Word matches are often used in sequence comparison methods, either as a measure of sequence similarity or in the first search steps of algorithms such as BLAST or BLAT. The D2 statistic is the number of matches of words of k letters between two sequences. Recent advances have been made in the characterization of this statistic and in the approximation of its distribution. Here, these results are extended to the case of approximate word matches. We compute the exact value of the variance of the D2 statistic for the case of a uniform letter distribution, and introduce a method to provide accurate approximations of the variance in the remaining cases. This enables the distribution of D2 to be approximated for typical situations arising in biological research. We apply these results to the identification of cis-regulatory modules, and show that this method detects such sequences with a high accuracy. The ability to approximate the distribution of D2 for both exact and approximate word matches will enable the use of this statistic in a more precise manner for sequence comparison, database searches, and identification of transcription factor binding sites.

  1. An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

    OpenAIRE

    Chang, Tzu-Hao; Huang, Hsi-Yuan; Hsu, Justin Bo-Kai; Weng, Shun-Long; Horng, Jorng-Tzong; Huang, Hsien-Da

    2013-01-01

    Background Functional RNA molecules participate in numerous biological processes, ranging from gene regulation to protein synthesis. Analysis of functional RNA motifs and elements in RNA sequences can obtain useful information for deciphering RNA regulatory mechanisms. Our previous work, RegRNA, is widely used in the identification of regulatory motifs, and this work extends it by incorporating more comprehensive and updated data sources and analytical approaches into a new platform. Methods ...

  2. Ancient Pbx-Hox signatures define hundreds of vertebrate developmental enhancers

    Directory of Open Access Journals (Sweden)

    Parker Hugo J

    2011-12-01

    Full Text Available Abstract Background Gene regulation through cis-regulatory elements plays a crucial role in development and disease. A major aim of the post-genomic era is to be able to read the function of cis-regulatory elements through scrutiny of their DNA sequence. Whilst comparative genomics approaches have identified thousands of putative regulatory elements, our knowledge of their mechanism of action is poor and very little progress has been made in systematically de-coding them. Results Here, we identify ancient functional signatures within vertebrate conserved non-coding elements (CNEs through a combination of phylogenetic footprinting and functional assay, using genomic sequence from the sea lamprey as a reference. We uncover a striking enrichment within vertebrate CNEs for conserved binding-site motifs of the Pbx-Hox hetero-dimer. We further show that these predict reporter gene expression in a segment specific manner in the hindbrain and pharyngeal arches during zebrafish development. Conclusions These findings evoke an evolutionary scenario in which many CNEs evolved early in the vertebrate lineage to co-ordinate Hox-dependent gene-regulatory interactions that pattern the vertebrate head. In a broader context, our evolutionary analyses reveal that CNEs are composed of tightly linked transcription-factor binding-sites (TFBSs, which can be systematically identified through phylogenetic footprinting approaches. By placing a large number of ancient vertebrate CNEs into a developmental context, our findings promise to have a significant impact on efforts toward de-coding gene-regulatory elements that underlie vertebrate development, and will facilitate building general models of regulatory element evolution.

  3. Regulatory Monitoring of Fortified Foods: Identifying Barriers and Good Practices

    Science.gov (United States)

    Rowe, Laura A; Vossenaar, Marieke; Garrett, Greg S

    2015-01-01

    While fortification of staple foods and condiments has gained enormous global traction, poor performance persists throughout many aspects of implementation, most notably around the critical element of regulatory monitoring, which is essential for ensuring foods meet national fortification standards. Where coverage of fortified foods is high, limited nutritional impact of fortification programs largely exists due to regulatory monitoring that insufficiently identifies and holds producers accountable for underfortified products. Based on quality assurance data from 20 national fortification programs in 12 countries, we estimate that less than half of the samples are adequately fortified against relevant national standards. In this paper, we outline key findings from a literature review, key informant interviews with 11 fortification experts, and semi-quantitative surveys with 39 individuals from regulatory agencies and the food fortification industry in 17 countries on the perceived effectiveness of regulatory monitoring systems and barriers to compliance against national fortification standards. Findings highlight that regulatory agencies and industry disagree on the value that enforcement mechanisms have in ensuring compliance against standards. Perceived political risk of enforcement and poorly resourced inspectorate capacity appear to adversely reinforce each other within an environment of unclear legislation to create a major hurdle for improving overall compliance of fortification programs against national standards. Budget constraints affect the ability of regulatory agencies to create a well-trained inspector cadre and improve the detection and enforcement of non-compliant and underfortified products. Recommendations to improve fortification compliance include improving technical capacity; ensuring sustained leadership, accountability, and funding in both the private and the public sectors; and removing political barriers to ensure consistent detection of

  4. Crystal structure of cis-anti-cis-dicyclohexane-18-crown-6 acetonitrile disolvate

    Directory of Open Access Journals (Sweden)

    Alexander Nazarenko

    2015-07-01

    Full Text Available The title compound (systematic name: cis-anti-cis-2,5,8,15,18,21-hexaoxatricyclo[20.4.0.09,14]hexacosane acetonitrile disolvate, C20H36O6·2CH3CN, crystallizes from an acetonitrile solution of dicyclohexane-18-crown-6 on evaporation. The molecule is arranged around a center of symmetry with half the crown ether molecule and one molecule of acetonitrile symmetry independent. All O—C—C—O torsion angles are gauche while all C—O—C—C angles are trans. The sequence of torsion angles is [(tg+t(tg−t]3; the geometry of oxygen atoms is close to pseudo-D3d with three atoms below and three atoms above the mean plane, with an average deviation of ±0.16 (1 Å from the mean plane. This geometry is identical to that observed in metal ion complexes of dicyclohexane-18-crown-6 but differs significantly from the conformation of a free unsolvated molecule. Each acetonitrile molecule connects to a crown ether molecule via two of its methyl group H atoms (C—H...O. Weaker interactions exist between the third H atom of the acetonitrile methyl group and an O atom of a neighbouring crown ether molecule (C—H...O; and between the N atom of the acetonitrile molecule and a H atom of another neighbouring crown ether molecule. All these intermolecular interactions create a three-dimensional network stabilizing the disolvate.

  5. A genomic approach to identify regulatory nodes in the transcriptional network of systemic acquired resistance in plants.

    Directory of Open Access Journals (Sweden)

    Dong Wang

    2006-11-01

    Full Text Available Many biological processes are controlled by intricate networks of transcriptional regulators. With the development of microarray technology, transcriptional changes can be examined at the whole-genome level. However, such analysis often lacks information on the hierarchical relationship between components of a given system. Systemic acquired resistance (SAR is an inducible plant defense response involving a cascade of transcriptional events induced by salicylic acid through the transcription cofactor NPR1. To identify additional regulatory nodes in the SAR network, we performed microarray analysis on Arabidopsis plants expressing the NPR1-GR (glucocorticoid receptor fusion protein. Since nuclear translocation of NPR1-GR requires dexamethasone, we were able to control NPR1-dependent transcription and identify direct transcriptional targets of NPR1. We show that NPR1 directly upregulates the expression of eight WRKY transcription factor genes. This large family of 74 transcription factors has been implicated in various defense responses, but no specific WRKY factor has been placed in the SAR network. Identification of NPR1-regulated WRKY factors allowed us to perform in-depth genetic analysis on a small number of WRKY factors and test well-defined phenotypes of single and double mutants associated with NPR1. Among these WRKY factors we found both positive and negative regulators of SAR. This genomics-directed approach unambiguously positioned five WRKY factors in the complex transcriptional regulatory network of SAR. Our work not only discovered new transcription regulatory components in the signaling network of SAR but also demonstrated that functional studies of large gene families have to take into consideration sequence similarity as well as the expression patterns of the candidates.

  6. Unveiling combinatorial regulation through the combination of ChIP information and in silico cis-regulatory module detection

    Science.gov (United States)

    Sun, Hong; Guns, Tias; Fierro, Ana Carolina; Thorrez, Lieven; Nijssen, Siegfried; Marchal, Kathleen

    2012-01-01

    Computationally retrieving biologically relevant cis-regulatory modules (CRMs) is not straightforward. Because of the large number of candidates and the imperfection of the screening methods, many spurious CRMs are detected that are as high scoring as the biologically true ones. Using ChIP-information allows not only to reduce the regions in which the binding sites of the assayed transcription factor (TF) should be located, but also allows restricting the valid CRMs to those that contain the assayed TF (here referred to as applying CRM detection in a query-based mode). In this study, we show that exploiting ChIP-information in a query-based way makes in silico CRM detection a much more feasible endeavor. To be able to handle the large datasets, the query-based setting and other specificities proper to CRM detection on ChIP-Seq based data, we developed a novel powerful CRM detection method ‘CPModule’. By applying it on a well-studied ChIP-Seq data set involved in self-renewal of mouse embryonic stem cells, we demonstrate how our tool can recover combinatorial regulation of five known TFs that are key in the self-renewal of mouse embryonic stem cells. Additionally, we make a number of new predictions on combinatorial regulation of these five key TFs with other TFs documented in TRANSFAC. PMID:22422841

  7. Implicit sequence learning in deaf children with cochlear implants.

    Science.gov (United States)

    Conway, Christopher M; Pisoni, David B; Anaya, Esperanza M; Karpicke, Jennifer; Henning, Shirley C

    2011-01-01

    Deaf children with cochlear implants (CIs) represent an intriguing opportunity to study neurocognitive plasticity and reorganization when sound is introduced following a period of auditory deprivation early in development. Although it is common to consider deafness as affecting hearing alone, it may be the case that auditory deprivation leads to more global changes in neurocognitive function. In this paper, we investigate implicit sequence learning abilities in deaf children with CIs using a novel task that measured learning through improvement to immediate serial recall for statistically consistent visual sequences. The results demonstrated two key findings. First, the deaf children with CIs showed disturbances in their visual sequence learning abilities relative to the typically developing normal-hearing children. Second, sequence learning was significantly correlated with a standardized measure of language outcome in the CI children. These findings suggest that a period of auditory deprivation has secondary effects related to general sequencing deficits, and that disturbances in sequence learning may at least partially explain why some deaf children still struggle with language following cochlear implantation. © 2010 Blackwell Publishing Ltd.

  8. Integration of ATAC-seq and RNA-seq identifies human alpha cell and beta cell signature genes

    Directory of Open Access Journals (Sweden)

    Amanda M. Ackermann

    2016-03-01

    Conclusions: We have determined the genetic landscape of human α- and β-cells based on chromatin accessibility and transcript levels, which allowed for detection of novel α- and β-cell signature genes not previously known to be expressed in islets. Using fine-mapping of open chromatin, we have identified thousands of potential cis-regulatory elements that operate in an endocrine cell type-specific fashion.

  9. Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

    Science.gov (United States)

    Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

    2018-01-01

    We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation.  Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases.  We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes.  Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.

  10. Exome sequencing identifies ZNF644 mutations in high myopia.

    Directory of Open Access Journals (Sweden)

    Yi Shi

    2011-06-01

    Full Text Available Myopia is the most common ocular disorder worldwide, and high myopia in particular is one of the leading causes of blindness. Genetic factors play a critical role in the development of myopia, especially high myopia. Recently, the exome sequencing approach has been successfully used for the disease gene identification of Mendelian disorders. Here we show a successful application of exome sequencing to identify a gene for an autosomal dominant disorder, and we have identified a gene potentially responsible for high myopia in a monogenic form. We captured exomes of two affected individuals from a Han Chinese family with high myopia and performed sequencing analysis by a second-generation sequencer with a mean coverage of 30× and sufficient depth to call variants at ∼97% of each targeted exome. The shared genetic variants of these two affected individuals in the family being studied were filtered against the 1000 Genomes Project and the dbSNP131 database. A mutation A672G in zinc finger protein 644 isoform 1 (ZNF644 was identified as being related to the phenotype of this family. After we performed sequencing analysis of the exons in the ZNF644 gene in 300 sporadic cases of high myopia, we identified an additional five mutations (I587V, R680G, C699Y, 3'UTR+12 C>G, and 3'UTR+592 G>A in 11 different patients. All these mutations were absent in 600 normal controls. The ZNF644 gene was expressed in human retinal and retinal pigment epithelium (RPE. Given that ZNF644 is predicted to be a transcription factor that may regulate genes involved in eye development, mutation may cause the axial elongation of eyeball found in high myopia patients. Our results suggest that ZNF644 might be a causal gene for high myopia in a monogenic form.

  11. HBVRegDB: Annotation, comparison, detection and visualization of regulatory elements in hepatitis B virus sequences

    Directory of Open Access Journals (Sweden)

    Firth Andrew E

    2007-12-01

    Full Text Available Abstract Background The many Hepadnaviridae sequences available have widely varied functional annotation. The genomes are very compact (~3.2 kb but contain multiple layers of functional regulatory elements in addition to coding regions. Key regions are subject to purifying selection, as mutations in these regions will produce non-functional viruses. Results These genomic sequences have been organized into a structured database to facilitate research at the molecular level. HBVRegDB is a comparative genomic analysis tool with an integrated underlying sequence database. The database contains genomic sequence data from representative viruses. In addition to INSDC and RefSeq annotation, HBVRegDB also contains expert and systematically calculated annotations (e.g. promoters and comparative genome analysis results (e.g. blastn, tblastx. It also contains analyses based on curated HBV alignments. Information about conserved regions – including primary conservation (e.g. CDS-Plotcon and RNA secondary structure predictions (e.g. Alidot – is integrated into the database. A large amount of data is graphically presented using the GBrowse (Generic Genome Browser adapted for analysis of viral genomes. Flexible query access is provided based on any annotated genomic feature. Novel regulatory motifs can be found by analysing the annotated sequences. Conclusion HBVRegDB serves as a knowledge database and as a comparative genomic analysis tool for molecular biologists investigating HBV. It is publicly available and complementary to other viral and HBV focused datasets and tools http://hbvregdb.otago.ac.nz. The availability of multiple and highly annotated sequences of viral genomes in one database combined with comparative analysis tools facilitates detection of novel genomic elements.

  12. Hydrogenation of fluoroarenes: Direct access to all-cis-(multi)fluorinated cycloalkanes.

    Science.gov (United States)

    Wiesenfeldt, Mario P; Nairoukh, Zackaria; Li, Wei; Glorius, Frank

    2017-09-01

    All-c is -multifluorinated cycloalkanes exhibit intriguing electronic properties. In particular, they display extremely high dipole moments perpendicular to the aliphatic ring, making them highly desired motifs in material science. Very few such motifs have been prepared, as their syntheses require multistep sequences from diastereoselectively prefunctionalized precursors. Herein we report a synthetic strategy to access these valuable materials via the rhodium-cyclic (alkyl)(amino)carbene (CAAC)-catalyzed hydrogenation of readily available fluorinated arenes in hexane. This route enables the scalable single-step preparation of an abundance of multisubstituted and multifluorinated cycloalkanes, including all- cis -1,2,3,4,5,6-hexafluorocyclohexane as well as cis-configured fluorinated aliphatic heterocycles. Copyright © 2017, American Association for the Advancement of Science.

  13. Identification of cis-acting elements on positive-strand subgenomic mRNA required for the synthesis of negative-strand counterpart in bovine coronavirus.

    Science.gov (United States)

    Yeh, Po-Yuan; Wu, Hung-Yi

    2014-07-30

    It has been demonstrated that, in addition to genomic RNA, sgmRNA is able to serve as a template for the synthesis of the negative-strand [(-)-strand] complement. However, the cis-acting elements on the positive-strand [(+)-strand] sgmRNA required for (-)-strand sgmRNA synthesis have not yet been systematically identified. In this study, we employed real-time quantitative reverse transcription polymerase chain reaction to analyze the cis-acting elements on bovine coronavirus (BCoV) sgmRNA 7 required for the synthesis of its (-)-strand counterpart by deletion mutagenesis. The major findings are as follows. (1) Deletion of the 5'-terminal leader sequence on sgmRNA 7 decreased the synthesis of the (-)-strand sgmRNA complement. (2) Deletions of the 3' untranslated region (UTR) bulged stem-loop showed no effect on (-)-strand sgmRNA synthesis; however, deletion of the 3' UTR pseudoknot decreased the yield of (-)-strand sgmRNA. (3) Nucleotides positioned from -15 to -34 of the sgmRNA 7 3'-terminal region are required for efficient (-)-strand sgmRNA synthesis. (4) Nucleotide species at the 3'-most position (-1) of sgmRNA 7 is correlated to the efficiency of (-)-strand sgmRNA synthesis. These results together suggest, in principle, that the 5'- and 3'-terminal sequences on sgmRNA 7 harbor cis-acting elements are critical for efficient (-)-strand sgmRNA synthesis in BCoV.

  14. Ant System-Corner Insertion Sequence: An Efficient VLSI Hard Module Placer

    Directory of Open Access Journals (Sweden)

    HOO, C.-S.

    2013-02-01

    Full Text Available Placement is important in VLSI physical design as it determines the time-to-market and chip's reliability. In this paper, a new floorplan representation which couples with Ant System, namely Corner Insertion Sequence (CIS is proposed. Though CIS's search complexity is smaller than the state-of-the-art representation Corner Sequence (CS, CIS adopts a preset boundary on the placement and hence, leading to search bound similar to CS. This enables the previous unutilized corner edges to become viable. Also, the redundancy of CS representation is eliminated in CIS leads to a lower search complexity of CIS. Experimental results on Microelectronics Center of North Carolina (MCNC hard block benchmark circuits show that the proposed algorithm performs comparably in terms of area yet at least two times faster than CS.

  15. Comprehensive meta-analysis of Signal Transducers and Activators of Transcription (STAT genomic binding patterns discerns cell-specific cis-regulatory modules

    Directory of Open Access Journals (Sweden)

    Kang Keunsoo

    2013-01-01

    Full Text Available Abstract Background Cytokine-activated transcription factors from the STAT (Signal Transducers and Activators of Transcription family control common and context-specific genetic programs. It is not clear to what extent cell-specific features determine the binding capacity of seven STAT members and to what degree they share genetic targets. Molecular insight into the biology of STATs was gained from a meta-analysis of 29 available ChIP-seq data sets covering genome-wide occupancy of STATs 1, 3, 4, 5A, 5B and 6 in several cell types. Results We determined that the genomic binding capacity of STATs is primarily defined by the cell type and to a lesser extent by individual family members. For example, the overlap of shared binding sites between STATs 3 and 5 in T cells is greater than that between STAT5 in T cells and non-T cells. Even for the top 1,000 highly enriched STAT binding sites, ~15% of STAT5 binding sites in mouse female liver are shared by other STATs in different cell types while in T cells ~90% of STAT5 binding sites are co-occupied by STAT3, STAT4 and STAT6. In addition, we identified 116 cis-regulatory modules (CRM, which are recognized by all STAT members across cell types defining a common JAK-STAT signature. Lastly, in liver STAT5 binding significantly coincides with binding of the cell-specific transcription factors HNF4A, FOXA1 and FOXA2 and is associated with cell-type specific gene transcription. Conclusions Our results suggest that genomic binding of STATs is primarily determined by the cell type and further specificity is achieved in part by juxtaposed binding of cell-specific transcription factors.

  16. Ionizing radiation sources management in the Commonwealth of Independent States - CIS

    International Nuclear Information System (INIS)

    Iskra, A.; Bufetova, M.

    2006-01-01

    Ionizing radiation sources cover a broad band of power: from powerful NPP reactors and research reactors to portable radioisotope ionizing radiation sources applied in medicine, agriculture, industry and in the energy supply systems of remote facilities. At present, scales and use field of radionuclide sources in the CIS have the tendency to increase. In this connection, the issues of ionizing radiation sources management safety at all stages of their life cycle, from production to treatment, have been of a great importance. The materials on ionizing radiation sources inventory and treatment in the CIS (Russia, Armenia, Belarus, Georgia, Kazakhstan, Kyrgyzstan, Tajikistan and Ukraine) are presented in the report. It is shown that in some republics, there is difficulty in ionizing radiation sources accounting and control system; the national regulatory and legal framework bases regulating activity on radioactive sources use, localization and treatment require update. Many problems are connected with the sources beyond state accounting. The problem of ionizing radiation sources use safety is complicated by the growing activity of various terrorist groups. The opportunity to use ionizing radiation sources with terrorism goals requires the application of defined systems of security and physical protection at all stages of their management. For this purpose a collective, with all CIS countries, organization of radioactive sources accounting and control as well as countermeasures on their illegal transportation and use are necessary. In this connection, the information collection regarding situation with providing of ionizing radiation sources safety, conditions of equipment and storage facilities, radioactive materials accounting and control system in the CIS countries is vitally needed

  17. cDNA cloning, genomic organization and expression analysis during somatic embryogenesis of the translationally controlled tumor protein (TCTP) gene from Japanese larch (Larix leptolepis).

    Science.gov (United States)

    Zhang, Li-Feng; Li, Wan-Feng; Han, Su-Ying; Yang, Wen-Hua; Qi, Li-Wang

    2013-10-15

    A full-length cDNA and genomic sequences of a translationally controlled tumor protein (TCTP) gene were isolated from Japanese larch (Larix leptolepis) and designated LaTCTP. The length of the cDNA was 1, 043 bp and contained a 504 bp open reading frame that encodes a predicted protein of 167 amino acids, characterized by two signature sequences of the TCTP protein family. Analysis of the LaTCTP gene structure indicated four introns and five exons, and it is the largest of all currently known TCTP genes in plants. The 5'-flanking promoter region of LaTCTP was cloned using an improved TAIL-PCR technique. In this region we identified many important potential cis-acting elements, such as a Box-W1 (fungal elicitor responsive element), a CAT-box (cis-acting regulatory element related to meristem expression), a CGTCA-motif (cis-acting regulatory element involved in MeJA-responsiveness), a GT1-motif (light responsive element), a Skn-1-motif (cis-acting regulatory element required for endosperm expression) and a TGA-element (auxin-responsive element), suggesting that expression of LaTCTP is highly regulated. Expression analysis demonstrated ubiquitous localization of LaTCTP mRNA in the roots, stems and needles, high mRNA levels in the embryonal-suspensor mass (ESM), browning embryogenic cultures and mature somatic embryos, and low levels of mRNA at day five during somatic embryogenesis. We suggest that LaTCTP might participate in the regulation of somatic embryo development. These results provide a theoretical basis for understanding the molecular regulatory mechanism of LaTCTP and lay the foundation for artificial regulation of somatic embryogenesis. © 2013.

  18. Exposure of maternal mice to cis-bifenthrin enantioselectively disrupts the transcription of genes related to testosterone synthesis in male offspring.

    Science.gov (United States)

    Jin, Yuanxiang; Wang, Jiangcong; Sun, Xueqing; Ye, Yang; Xu, Minjie; Wang, Jianai; Chen, Shaoping; Fu, Zhengwei

    2013-12-01

    The commercial bifenthrin (BF) contains two cis isomers. In the present study, a dose of 15mg/kg of 1R-cis-BF or 1S-cis-BF was orally administered for 3 weeks to female mice before or during pregnancy. Then, the expression of steroidogenesis related genes which were considered as effective biomarkers of endocrine disruption were analyzed in the male offspring. Maternal exposure to 1S-cis-BF during pregnancy significantly reduced the mRNA levels of peripheral benzodiazepine receptor (PBR) and steroidogenic acute regulatory protein (StAR) in the testes of 3- or 6-week old male offspring. In addition, a significant decrease of cytochrome P450 17α-hydroxysteroid dehydrogenase (P450-17α) was also observed in the testes of 6-week old male offspring when dams were treated with 1S-cis-BF during pregnancy but not before pregnancy. Moreover, the scavenger receptor class B type 1 (SRB1) and cytochrome P450 cholesterol side-chain cleavage enzyme (P450scc) decreased significantly in the testes of 6-week old male offspring when dams were treated with 1S-cis-BF during and before pregnancy. Thus, oral administration of the maternal mice to cis-BF for 3 weeks, particularly during pregnancy, resulted in endocrine disruption in the male offspring, with the 1S-cis-BF causing more significant alterations than the 1R-cis-BF form. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. Sequence analysis of the MYC oncogene involved in the t(8;14)(q24;q11) chromosome translocation in a human leukemia T-cell line indicates that putative regulatory regions are not altered

    International Nuclear Information System (INIS)

    Finver, S.N.; Nishikura, K.; Finger, L.R.; Haluska, F.G.; Finan, J.; Nowell, P.C.; Croce, C.M.

    1988-01-01

    The authors cloned the translocation-associated and homologous normal MYC alleles from SKW-3, a leukemia T-cell line with the t(8; 14)(q24; q11) translocation, and determined the sequence of the MYC oncogene first exon and flanking 5' putative regulatory regions. S1 nuclease protection experiments utilizing a MYC first exon probe demonstrated transcriptional deregulation of the MYC gene associated with the T-cell receptor α locus on the 8q + chromosome of SKW-3 cells. Nucleotide sequence analysis of the translocation-associated (8q +) MYC allele identified a single base substitution within the upstream flanking region; the homologous nontranslocated allele contained an additional substitution and a two-base deletion. None of the deletions or substitutions localized to putative 5' regulatory regions. The MYC first exon sequence was germ line in both alleles. These results demonstrate that alterations within the putative 5' MYC regulatory regions are not necessarily involved in MYC deregulation in T-cell leukemias, and they show that juxtaposition of the T-cell receptor α locus to a germ-line MYC oncogene results in MYC deregulation

  20. Mediation Analysis Demonstrates That Trans-eQTLs Are Often Explained by Cis-Mediation : A Genome-Wide Analysis among 1,800 South Asians

    NARCIS (Netherlands)

    Pierce, Brandon L.; Tong, Lin; Chen, Lin S.; Rahaman, Ronald; Argos, Maria; Jasmine, Farzana; Roy, Shantanu; Paul-Brutus, Rachelle; Westra, Harm-Jan; Franke, Lude; Esko, Tonu; Zaman, Rakibuz; Islam, Tariqul; Rahman, Mahfuzar; Baron, John A.; Kibriya, Muhammad G.; Ahsan, Habibul

    2014-01-01

    A large fraction of human genes are regulated by genetic variation near the transcribed sequence (cis-eQTL, expression quantitative trait locus), and many cis-eQTLs have implications for human disease. Less is known regarding the effects of genetic variation on expression of distant genes

  1. Thermodynamics-based models of transcriptional regulation with gene sequence.

    Science.gov (United States)

    Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing

    2015-12-01

    Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.

  2. Identification of choriogenin cis-regulatory elements and production of estrogen-inducible, liver-specific transgenic Medaka.

    Science.gov (United States)

    Ueno, Tetsuro; Yasumasu, Shigeki; Hayashi, Shinji; Iuchi, Ichiro

    2004-07-01

    Choriogenins (chg-H, chg-L) are precursor proteins of egg envelope of medaka and synthesized in the spawning female liver in response to estrogen. We linked a gene construct chg-L1.5 kb/GFP (a 1.5 kb 5'-upstream region of the chg-L gene fused with a green fluorescence protein (GFP) gene) to another construct emgb/RFP (a cis-regulatory region of embryonic globin gene fused with an RFP gene), injected the double fusion gene construct into 1- or 2-cell-stage embryos, and selected embryos expressing the RFP in erythroid cells. From the embryos, we established two lines of chg-L1.5 kb/GFP-emgb/RFP-transgenic medaka. The 3-month-old spawning females and estradiol-17beta (E2)-exposed males displayed the liver-specific GFP expression. The E2-dependent GFP expression was detected in the differentiating liver of the stage 37-38 embryos. In addition, RT-PCR and whole-mount in situ hybridization showed that the E2-dependent chg expression was found in the liver of the stage 34 embryos of wild medaka, suggesting that such E2-dependency is achieved shortly after differentiation of the liver. Analysis using serial deletion mutants fused with GFP showed that the region -426 to -284 of the chg-L gene or the region -364 to -265 of the chg-H gene had the ability to promote the E2-dependent liver-specific GFP expression of its downstream gene. Further analyses suggested that an estrogen response element (ERE) at -309, an ERE half-site at -330 and a binding site for C/EBP at -363 of the chg-L gene played important roles in its downstream chg-L gene expression. In addition, this transgenic medaka may be useful as one of the test animals for detecting environmental estrogenic steroids.

  3. TSSer: an automated method to identify transcription start sites in prokaryotic genomes from differential RNA sequencing data.

    Science.gov (United States)

    Jorjani, Hadi; Zavolan, Mihaela

    2014-04-01

    Accurate identification of transcription start sites (TSSs) is an essential step in the analysis of transcription regulatory networks. In higher eukaryotes, the capped analysis of gene expression technology enabled comprehensive annotation of TSSs in genomes such as those of mice and humans. In bacteria, an equivalent approach, termed differential RNA sequencing (dRNA-seq), has recently been proposed, but the application of this approach to a large number of genomes is hindered by the paucity of computational analysis methods. With few exceptions, when the method has been used, annotation of TSSs has been largely done manually. In this work, we present a computational method called 'TSSer' that enables the automatic inference of TSSs from dRNA-seq data. The method rests on a probabilistic framework for identifying both genomic positions that are preferentially enriched in the dRNA-seq data as well as preferentially captured relative to neighboring genomic regions. Evaluating our approach for TSS calling on several publicly available datasets, we find that TSSer achieves high consistency with the curated lists of annotated TSSs, but identifies many additional TSSs. Therefore, TSSer can accelerate genome-wide identification of TSSs in bacterial genomes and can aid in further characterization of bacterial transcription regulatory networks. TSSer is freely available under GPL license at http://www.clipz.unibas.ch/TSSer/index.php

  4. Defining the plasticity of transcription factor binding sites by Deconstructing DNA consensus sequences: the PhoP-binding sites among gamma/enterobacteria.

    Directory of Open Access Journals (Sweden)

    Oscar Harari

    2010-07-01

    Full Text Available Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. The detection of the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg(2+ homeostasis in several bacterial species. PhoP is particularly interesting; it is highly conserved in different gamma/enterobacteria, regulating not only ancestral genes but also governing the expression of dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (i.e., termed submotifs using a machine learning method inspired by the "Divide & Conquer" strategy. By partitioning a motif into sub-patterns, computational advantages for classification were produced, resulting in the discovery of new members of a regulon, and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analysis. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not due to the binding sites, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target

  5. Cis-Natural Antisense Transcripts Are Mainly Co-expressed with Their Sense Transcripts and Primarily Related to Energy Metabolic Pathways during Muscle Development.

    Science.gov (United States)

    Zhao, Yunxia; Hou, Ye; Zhao, Changzhi; Liu, Fei; Luan, Yu; Jing, Lu; Li, Xinyun; Zhu, Mengjin; Zhao, Shuhong

    2016-01-01

    Cis-natural antisense transcripts (cis-NATs) are a new class of RNAs identified in various species. However, the biological functions of cis-NATs are largely unknown. In this study, we investigated the transcriptional characteristics and functions of cis-NATs in the muscle tissue of lean Landrace and indigenous fatty Lantang pigs. In total, 3,306 cis-NATs of 2,469 annotated genes were identified in the muscle tissue of pigs. More than 1,300 cis-NATs correlated with their sense genes at the transcriptional level, and approximately 80% of them were co-expressed in the two breeds. Furthermore, over 1,200 differentially expressed cis-NATs were identified during muscle development. Function annotation showed that the cis-NATs participated in muscle development mainly by co-expressing with genes involved in energy metabolic pathways, including citrate cycle (TCA cycle), glycolysis or gluconeogenesis, mitochondrial activation and so on. Moreover, these cis-NATs and their sense genes abruptly increased at the transition from the late fetal stages to the early postnatal stages and then decreased along with muscle development. In conclusion, the cis-NATs in the muscle tissue of pigs were identified and determined to be mainly co-expressed with their sense genes. The co-expressed cis-NATs and their sense gene were primarily related to energy metabolic pathways during muscle development in pigs. Our results offered novel evidence on the roles of cis-NATs during the muscle development of pigs.

  6. Sea urchin neural alpha2 tubulin gene: isolation and promoter analysis.

    Science.gov (United States)

    Costa, S; Ragusa, M A; Drago, G; Casano, C; Alaimo, G; Guida, N; Gianguzza, F

    2004-04-02

    Expression of Talpha2 gene, during sea urchin Paracentrotus lividus development, is spatially and temporally regulated. In order to characterize this gene, we isolated the relevant genomic sequences and scanned the isolated 5'-flanking region in searching for cis-regulatory elements required for proper expression. Gel mobility shift and footprinting assays, as well as reporter gene (CAT and beta-gal) expression assays, were used to address cis-regulatory elements involved in regulation. Here we report that an upstream 5'-flanking fragment of PlTalpha2 gene drives temporal expression of reporter genes congruent with that of endogenous Talpha2 gene. The fragment contains cis-elements able to bind nuclear proteins from the gastrula stage (at which the Talpha2 gene is expressed) whose sequences could be consistent with the consensus sequences for transcription factors present in data bank.

  7. Identification of Cis-Acting Elements on Positive-Strand Subgenomic mRNA Required for the Synthesis of Negative-Strand Counterpart in Bovine Coronavirus

    Directory of Open Access Journals (Sweden)

    Po-Yuan Yeh

    2014-07-01

    Full Text Available It has been demonstrated that, in addition to genomic RNA, sgmRNA is able to serve as a template for the synthesis of the negative-strand [(−-strand] complement. However, the cis-acting elements on the positive-strand [(+-strand] sgmRNA required for (−-strand sgmRNA synthesis have not yet been systematically identified. In this study, we employed real-time quantitative reverse transcription polymerase chain reaction to analyze the cis-acting elements on bovine coronavirus (BCoV sgmRNA 7 required for the synthesis of its (−-strand counterpart by deletion mutagenesis. The major findings are as follows. (1 Deletion of the 5'-terminal leader sequence on sgmRNA 7 decreased the synthesis of the (−-strand sgmRNA complement. (2 Deletions of the 3' untranslated region (UTR bulged stem-loop showed no effect on (−-strand sgmRNA synthesis; however, deletion of the 3' UTR pseudoknot decreased the yield of (−-strand sgmRNA. (3 Nucleotides positioned from −15 to −34 of the sgmRNA 7 3'-terminal region are required for efficient (−-strand sgmRNA synthesis. (4 Nucleotide species at the 3'-most position (−1 of sgmRNA 7 is correlated to the efficiency of (−-strand sgmRNA synthesis. These results together suggest, in principle, that the 5'- and 3'-terminal sequences on sgmRNA 7 harbor cis-acting elements are critical for efficient (−-strand sgmRNA synthesis in BCoV.

  8. The evolutionarily conserved leprecan gene: its regulation by Brachyury and its role in the developing Ciona notochord.

    Science.gov (United States)

    Dunn, Matthew P; Di Gregorio, Anna

    2009-04-15

    In Ciona intestinalis, leprecan was identified as a target of the notochord-specific transcription factor Ciona Brachyury (Ci-Bra) (Takahashi, H., Hotta, K., Erives, A., Di Gregorio, A., Zeller, R.W., Levine, M., Satoh, N., 1999. Brachyury downstream notochord differentiation in the ascidian embryo. Genes Dev. 13, 1519-1523). By screening approximately 14 kb of the Ci-leprecan locus for cis-regulatory activity, we have identified a 581-bp minimal notochord-specific cis-regulatory module (CRM) whose activity depends upon T-box binding sites located at the 3'-end of its sequence. These sites are specifically bound in vitro by a GST-Ci-Bra fusion protein, and mutations that abolish binding in vitro result in loss or decrease of regulatory activity in vivo. Serial deletions of the 581-bp notochord CRM revealed that this sequence is also able to direct expression in muscle cells through the same T-box sites that are utilized by Ci-Bra in the notochord, which are also bound in vitro by the muscle-specific T-box activators Ci-Tbx6b and Ci-Tbx6c. Additionally, we created plasmids aimed to interfere with the function of Ci-leprecan and categorized the resulting phenotypes, which consist of variable dislocations of notochord cells along the anterior-posterior axis. Together, these observations provide mechanistic insights generally applicable to T-box transcription factors and their target sequences, as well as a first set of clues on the function of Leprecan in early chordate development.

  9. Dissection of cis-regulatory element architecture of the rice oleosin gene promoters to assess abscisic acid responsiveness in suspension-cultured rice cells.

    Science.gov (United States)

    Kim, Sol; Lee, Soo-Bin; Han, Chae-Seong; Lim, Mi-Na; Lee, Sung-Eun; Yoon, In Sun; Hwang, Yong-Sic

    2017-08-01

    Oleosins are the most abundant proteins in the monolipid layer surrounding neutral storage lipids that form oil bodies in plants. Several lines of evidence indicate that they are physiologically important for the maintenance of oil body structure and for mobilization of the lipids stored inside. Rice has six oleosin genes in its genome, the expression of all of which was found to be responsive to abscisic acid (ABA) in our examination of mature embryo and aleurone tissues. The 5'-flanking region of OsOle5 was initially characterized for its responsiveness to ABA through a transient expression assay system using the protoplasts from suspension-cultured rice cells. A series of successive deletions and site-directed mutations identified five regions critical for the hormonal induction of its promoter activity. A search for cis-acting elements in these regions deposited in a public database revealed that they contain various promoter elements previously reported to be involved in the ABA response of various genes. A gain-of-function experiment indicated that multiple copies of all five regions were sufficient to provide the minimal promoter with a distinct ABA responsiveness. Comparative sequence analysis of the short, but still ABA-responsive, promoters of OsOle genes revealed no common modular architecture shared by them, indicating that various distinct promoter elements and independent trans-acting factors are involved in the ABA responsiveness of rice oleosin multigenes. Copyright © 2017 Elsevier GmbH. All rights reserved.

  10. Absence of mutation at the 5'-upstream promoter region of the TPM4 gene from cardiac mutant axolotl (Ambystoma mexicanum).

    Science.gov (United States)

    Denz, Christopher R; Zhang, Chi; Jia, Pingping; Du, Jianfeng; Huang, Xupei; Dube, Syamalima; Thomas, Anish; Poiesz, Bernard J; Dube, Dipak K

    2011-09-01

    Tropomyosins are a family of actin-binding proteins that show cell-specific diversity by a combination of multiple genes and alternative RNA splicing. Of the 4 different tropomyosin genes, TPM4 plays a pivotal role in myofibrillogenesis as well as cardiac contractility in amphibians. In this study, we amplified and sequenced the upstream regulatory region of the TPM4 gene from both normal and mutant axolotl hearts. To identify the cis-elements that are essential for the expression of the TPM4, we created various deletion mutants of the TPM4 promoter DNA, inserted the deleted segments into PGL3 vector, and performed promoter-reporter assay using luciferase as the reporter gene. Comparison of sequences of the promoter region of the TPM4 gene from normal and mutant axolotl revealed no mutations in the promoter sequence of the mutant TPM4 gene. CArG box elements that are generally involved in controlling the expression of several other muscle-specific gene promoters were not found in the upstream regulatory region of the TPM4 gene. In deletion experiments, loss of activity of the reporter gene was noted upon deletion which was then restored upon further deletion suggesting the presence of both positive and negative cis-elements in the upstream regulatory region of the TPM4 gene. We believe that this is the first axolotl promoter that has ever been cloned and studied with clear evidence that it functions in mammalian cell lines. Although striated muscle-specific cis-acting elements are absent from the promoter region of TPM4 gene, our results suggest the presence of positive and negative cis-elements in the promoter region, which in conjunction with positive and negative trans-elements may be involved in regulating the expression of TPM4 gene in a tissue-specific manner.

  11. cis and trans requirements for the selective packaging of adenovirus type 5 DNA.

    OpenAIRE

    Gräble, M; Hearing, P

    1992-01-01

    Polar packaging of adenovirus DNA into virions is dependent on the presence of cis-acting sequences at the left end of the viral genome. Our previous analyses demonstrated that the adenovirus type 5 (Ad5) packaging domain (nucleotides 194 to 358) is composed of at least five elements that are functionally redundant. A repeated sequence, termed the A repeat, was associated with packaging function. Here we report a more detailed analysis of the requirements for the selective packaging of Ad5 DN...

  12. Computational modeling identifies key gene regulatory interactions underlying phenobarbital-mediated tumor promotion

    Science.gov (United States)

    Luisier, Raphaëlle; Unterberger, Elif B.; Goodman, Jay I.; Schwarz, Michael; Moggs, Jonathan; Terranova, Rémi; van Nimwegen, Erik

    2014-01-01

    Gene regulatory interactions underlying the early stages of non-genotoxic carcinogenesis are poorly understood. Here, we have identified key candidate regulators of phenobarbital (PB)-mediated mouse liver tumorigenesis, a well-characterized model of non-genotoxic carcinogenesis, by applying a new computational modeling approach to a comprehensive collection of in vivo gene expression studies. We have combined our previously developed motif activity response analysis (MARA), which models gene expression patterns in terms of computationally predicted transcription factor binding sites with singular value decomposition (SVD) of the inferred motif activities, to disentangle the roles that different transcriptional regulators play in specific biological pathways of tumor promotion. Furthermore, transgenic mouse models enabled us to identify which of these regulatory activities was downstream of constitutive androstane receptor and β-catenin signaling, both crucial components of PB-mediated liver tumorigenesis. We propose novel roles for E2F and ZFP161 in PB-mediated hepatocyte proliferation and suggest that PB-mediated suppression of ESR1 activity contributes to the development of a tumor-prone environment. Our study shows that combining MARA with SVD allows for automated identification of independent transcription regulatory programs within a complex in vivo tissue environment and provides novel mechanistic insights into PB-mediated hepatocarcinogenesis. PMID:24464994

  13. Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies

    Science.gov (United States)

    2014-01-01

    Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878

  14. 77 FR 7968 - Semiannual Regulatory Agenda

    Science.gov (United States)

    2012-02-13

    ... Regulation Sequence No. Title Identifier No. 392 Non-Federal Oil and Gas 1024-AD78 Rights. National Park.... Timetable: Action Date FR Cite NPRM 07/00/12 Regulatory Flexibility Analysis Required: Yes. Agency Contact... anaconda, and Beni anaconda. Timetable: Action Date FR Cite ANPRM 01/31/08 73 FR 5784 ANPRM Comment Period...

  15. Recognition of cis-acting sequences in RNA 3 of Prunus necrotic ringspot virus by the replicase of Alfalfa mosaic virus.

    Science.gov (United States)

    Aparicio, F; Sánchez-Navarro, J A; Olsthoorn, R C; Pallás, V; Bol, J F

    2001-04-01

    Alfalfa mosaic virus (AMV) and Prunus necrotic ringspot virus (PNRSV) belong to the genera ALFAMOVIRUS: and ILARVIRUS:, respectively, of the family BROMOVIRIDAE: Initiation of infection by AMV and PNRSV requires binding of a few molecules of coat protein (CP) to the 3' termini of the inoculum RNAs and the CPs of the two viruses are interchangeable in this early step of the replication cycle. CIS:-acting sequences in PNRSV RNA 3 that are recognized by the AMV replicase were studied in in vitro replicase assays and by inoculation of AMV-PNRSV RNA 3 chimeras to tobacco plants and protoplasts transformed with the AMV replicase genes (P12 plants). The results showed that the AMV replicase recognized the promoter for minus-strand RNA synthesis in PNRSV RNA 3 but not the promoter for plus-strand RNA synthesis. A chimeric RNA with PNRSV movement protein and CP genes accumulated in tobacco, which is a non-host for PNRSV.

  16. Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models

    DEFF Research Database (Denmark)

    Wong, Wendy S W; Nielsen, Rasmus

    2007-01-01

    MOTIVATION: Finding the regulatory modules for transcription factors binding is an important step in elucidating the complex molecular mechanisms underlying regulation of gene expression. There are numerous methods available for solving this problem, however, very few of them take advantage of th...

  17. Ancient Transposable Elements Transformed the Uterine Regulatory Landscape and Transcriptome during the Evolution of Mammalian Pregnancy

    Directory of Open Access Journals (Sweden)

    Vincent J. Lynch

    2015-02-01

    Full Text Available A major challenge in biology is determining how evolutionarily novel characters originate; however, mechanistic explanations for the origin of new characters are almost completely unknown. The evolution of pregnancy is an excellent system in which to study the origin of novelties because mammals preserve stages in the transition from egg laying to live birth. To determine the molecular bases of this transition, we characterized the pregnant/gravid uterine transcriptome from tetrapods to trace the evolutionary history of uterine gene expression. We show that thousands of genes evolved endometrial expression during the origins of mammalian pregnancy, including genes that mediate maternal-fetal communication and immunotolerance. Furthermore, thousands of cis-regulatory elements that mediate decidualization and cell-type identity in decidualized stromal cells are derived from ancient mammalian transposable elements (TEs. Our results indicate that one of the defining mammalian novelties evolved from DNA sequences derived from ancient mammalian TEs co-opted into hormone-responsive regulatory elements distributed throughout the genome.

  18. Human polyomavirus JCV late leader peptide region contains important regulatory elements

    International Nuclear Information System (INIS)

    Akan, Ilhan; Sariyer, Ilker Kudret; Biffi, Renato; Palermo, Victoria; Woolridge, Stefanie; White, Martyn K.; Amini, Shohreh; Khalili, Kamel; Safak, Mahmut

    2006-01-01

    Transcription is a complex process that relies on the cooperative interaction between sequence-specific factors and the basal transcription machinery. The strength of a promoter depends on upstream or downstream cis-acting DNA elements, which bind transcription factors. In this study, we investigated whether DNA elements located downstream of the JCV late promoter, encompassing the late leader peptide region, which encodes agnoprotein, play regulatory roles in the JCV lytic cycle. For this purpose, the entire coding region of the leader peptide was deleted and the functional consequences of this deletion were analyzed. We found that viral gene expression and replication were drastically reduced. Gene expression also decreased from a leader peptide point mutant but to a lesser extent. This suggested that the leader peptide region of JCV might contain critical cis-acting DNA elements to which transcription factors bind and regulate viral gene expression and replication. We analyzed the entire coding region of the late leader peptide by a footprinting assay and identified three major regions (region I, II and III) that were protected by nuclear proteins. Further investigation of the first two protected regions by band shift assays revealed a new band that appeared in new infection cycles, suggesting that viral infection induces new factors that interact with the late leader peptide region of JCV. Analysis of the effect of the leader peptide region on the promoter activity of JCV by transfection assays demonstrated that this region has a positive and negative effect on the large T antigen (LT-Ag)-mediated activation of the viral early and late promoters, respectively. Furthermore, a partial deletion analysis of the leader peptide region encompassing the protected regions I and II demonstrated a significant down-regulation of viral gene expression and replication. More importantly, these results were similar to that obtained from a complete deletion of the late leader

  19. Identifying time-delayed gene regulatory networks via an evolvable hierarchical recurrent neural network.

    Science.gov (United States)

    Kordmahalleh, Mina Moradi; Sefidmazgi, Mohammad Gorji; Harrison, Scott H; Homaifar, Abdollah

    2017-01-01

    The modeling of genetic interactions within a cell is crucial for a basic understanding of physiology and for applied areas such as drug design. Interactions in gene regulatory networks (GRNs) include effects of transcription factors, repressors, small metabolites, and microRNA species. In addition, the effects of regulatory interactions are not always simultaneous, but can occur after a finite time delay, or as a combined outcome of simultaneous and time delayed interactions. Powerful biotechnologies have been rapidly and successfully measuring levels of genetic expression to illuminate different states of biological systems. This has led to an ensuing challenge to improve the identification of specific regulatory mechanisms through regulatory network reconstructions. Solutions to this challenge will ultimately help to spur forward efforts based on the usage of regulatory network reconstructions in systems biology applications. We have developed a hierarchical recurrent neural network (HRNN) that identifies time-delayed gene interactions using time-course data. A customized genetic algorithm (GA) was used to optimize hierarchical connectivity of regulatory genes and a target gene. The proposed design provides a non-fully connected network with the flexibility of using recurrent connections inside the network. These features and the non-linearity of the HRNN facilitate the process of identifying temporal patterns of a GRN. Our HRNN method was implemented with the Python language. It was first evaluated on simulated data representing linear and nonlinear time-delayed gene-gene interaction models across a range of network sizes and variances of noise. We then further demonstrated the capability of our method in reconstructing GRNs of the Saccharomyces cerevisiae synthetic network for in vivo benchmarking of reverse-engineering and modeling approaches (IRMA). We compared the performance of our method to TD-ARACNE, HCC-CLINDE, TSNI and ebdbNet across different network

  20. Diversity of antisense and other non-coding RNAs in Archaea revealed by comparative small RNA sequencing in four Pyrobaculum species

    Directory of Open Access Journals (Sweden)

    David L Bernick

    2012-07-01

    Full Text Available A great diversity of small, non-coding RNA molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs in archaea is limited. We employed RNA-seq to identify novel small RNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense small RNAs encoded opposite to key regulatory (ferric uptake regulator, metabolic (triose-phosphate isomerase, and core transcriptional apparatus genes (transcription factor B. We also found a large increase in the number of conserved C/D box small RNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these small RNAs indicates they are relatively recent, stable adaptations.

  1. Production of recombinant AAV vectors encoding insulin-like growth factor I is enhanced by interaction among AAV rep regulatory sequences

    Directory of Open Access Journals (Sweden)

    Dilley Robert

    2009-01-01

    Full Text Available Abstract Background Adeno-associated virus (AAV vectors are promising tools for gene therapy. Currently, their potential is limited by difficulties in producing high vector yields with which to generate transgene protein product. AAV vector production depends in part upon the replication (Rep proteins required for viral replication. We tested the hypothesis that mutations in the start codon and upstream regulatory elements of Rep78/68 in AAV helper plasmids can regulate recombinant AAV (rAAV vector production. We further tested whether the resulting rAAV vector preparation augments the production of the potentially therapeutic transgene, insulin-like growth factor I (IGF-I. Results We constructed a series of AAV helper plasmids containing different Rep78/68 start codon in combination with different gene regulatory sequences. rAAV vectors carrying the human IGF-I gene were prepared with these vectors and the vector preparations used to transduce HT1080 target cells. We found that the substitution of ATG by ACG in the Rep78/68 start codon in an AAV helper plasmid (pAAV-RC eliminated Rep78/68 translation, rAAV and IGF-I production. Replacement of the heterologous sequence upstream of Rep78/68 in pAAV-RC with the AAV2 endogenous p5 promoter restored translational activity to the ACG mutant, and restored rAAV and IGF-I production. Insertion of the AAV2 p19 promoter sequence into pAAV-RC in front of the heterologous sequence also enabled ACG to function as a start codon for Rep78/68 translation. The data further indicate that the function of the AAV helper construct (pAAV-RC, that is in current widespread use for rAAV production, may be improved by replacement of its AAV2 unrelated heterologous sequence with the native AAV2 p5 promoter. Conclusion Taken together, the data demonstrate an interplay between the start codon and upstream regulatory sequences in the regulation of Rep78/68 and indicate that selective mutations in Rep78/68 regulatory elements

  2. A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome.

    Science.gov (United States)

    Keel, B N; Nonneman, D J; Rohrer, G A

    2017-08-01

    Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  3. Exome Sequencing Fails to Identify the Genetic Cause of Aicardi Syndrome.

    Science.gov (United States)

    Lund, Caroline; Striano, Pasquale; Sorte, Hanne Sørmo; Parisi, Pasquale; Iacomino, Michele; Sheng, Ying; Vigeland, Magnus D; Øye, Anne-Marte; Møller, Rikke Steensbjerre; Selmer, Kaja K; Zara, Federico

    2016-09-01

    Aicardi syndrome (AS) is a well-characterized neurodevelopmental disorder with an unknown etiology. In this study, we performed whole-exome sequencing in 11 female patients with the diagnosis of AS, in order to identify the disease-causing gene. In particular, we focused on detecting variants in the X chromosome, including the analysis of variants with a low number of sequencing reads, in case of somatic mosaicism. For 2 of the patients, we also sequenced the exome of the parents to search for de novo mutations. We did not identify any genetic variants likely to be damaging. Only one single missense variant was identified by the de novo analyses of the 2 trios, and this was considered benign. The failure to identify a disease gene in this study may be due to technical limitations of our study design, including the possibility that the genetic aberration leading to AS is situated in a non-exonic region or that the mutation is somatic and not detectable by our approach. Alternatively, it is possible that AS is genetically heterogeneous and that 11 patients are not sufficient to reveal the causative genes. Future studies of AS should consider designs where also non-exonic regions are explored and apply a sequencing depth so that also low-grade somatic mosaicism can be detected.

  4. Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

    Directory of Open Access Journals (Sweden)

    Masfique Mehedi

    Full Text Available Ebolavirus (EBOV, the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.

  5. Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

    Science.gov (United States)

    Mehedi, Masfique; Hoenen, Thomas; Robertson, Shelly; Ricklefs, Stacy; Dolan, Michael A; Taylor, Travis; Falzarano, Darryl; Ebihara, Hideki; Porcella, Stephen F; Feldmann, Heinz

    2013-01-01

    Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.

  6. Cis-trans photoisomerization of abscisic acid

    International Nuclear Information System (INIS)

    Brabham, D.E.; Biggs, R.H.

    1981-01-01

    An important regulator of numerous physiological processes in higher plants is abscisic acid (ABA), which is photoisomerized from the more biologically active cis isomer to the nearly inactive trans isomer by natural sunlight. It is possible that this photoisomerization is a UV control mechanism in functions regulated by ABA. The quantum yields of both the cis to trans and trans to cis photoisomerizations were measured under various conditions of pH and oxygen concentration at room temperature. The yield for photoisomerization of cis-ABA ranged from 0.25 at pH 3.0 to 0.11 at pH 7.0. Oxygen partially quenched the process. The quantum yield varied only slightly with wavelength. The quantum yield of photolysis of cis-ABA was reported for pH 3.0 as 0.06. This yield also varied slightly with wavelength and was relatively insensitive to oxygen. This relatively high yield explains the loss of potency of ABA during UV irradiation. Phosphorescence of cis- and trans-ABA was observed in methanol at 77 K. Onset of the emission was at 350 nm. The emission spectra were the same for both isomers. From these results a mechanism of UV action on plants based on the photoisomerization of the inactive trans-ABA to the biologically active cis isomer is proposed. (author)

  7. Galactosemia: A strategy to identify new biochemical phenotypes and molecular genotypes

    Energy Technology Data Exchange (ETDEWEB)

    Elsas, L.J.; Langley, S.; Steele, E.; Evinger, J.; Brown, A.; Singh, R.; Fernhoff, P.; Hjelm, L.N.; Dembure, P.P.; Fridovich-Keil, J.L. [Emory Univ. School of Medicine, Atlanta, GA (United States)

    1995-03-01

    We describe a stratagem for identifying new mutations in the galactose-1-phosphate uridyl transferase (GALT) gene. GALT enzyme activity and isoforms were defined in erythrocytes from probands and their first-degree relatives. If the biochemical phenotypes segregated in an autosomal recesssive pattern, we screened for common mutations by using multiplex PCR and restriction endonuclease digestions. If common mutant alleles were not present, the 11 exons of the GALT gene were amplified by PCR, and variations from the normal nucleotide sequences were identified by SSCP. The suspected region(s) was then analyzed by direct DNA sequencing. We identified 86 mutant GALT alleles that reduced erythrocyte GALT activity. Seventy-five of these GALT genomes had abnormal SSCP patterns, of which 41 were sequenced, yielding 12 new and 21 previously reported, rare mutations. Among the novel group of 12 new mutations, an unusual biochemical phenotype was found in a family whose newborn proband has classical galactosemia. He had inherited two mutations in cis (N314D-E204K) from his father, whose GALT activity was near normal, and an additional GALT mutation in the splice-acceptor site of intron C (IVSC) from his mother. The substitution of a positively charged E204K mutation created a unique isoform-banding pattern. An asymptomatic sister`s GALT genes carries three mutations (E203K-N314D/N314D) with eight distinct isoform bands. Surprisingly, her erythrocytes have normal GALT activity. We conclude that the synergism of pedigree, biochemical, SSCP, and direct GALT gene analyses is an efficient protocol for identifying new mutations and speculate that E203K and N314D codon changes produce intra-allelic complementation when in cis. 40 refs., 4 figs., 3 tabs.

  8. Evaluating cis-2,6-Dimethylpiperidide (cis-DMP) as a Base Component in Lithium-Mediated Zincation Chemistry

    Science.gov (United States)

    Armstrong, David R; Garden, Jennifer A; Kennedy, Alan R; Leenhouts, Sarah M; Mulvey, Robert E; O'Keefe, Philip; O'Hara, Charles T; Steven, Alan

    2013-01-01

    Most recent advances in metallation chemistry have centred on the bulky secondary amide 2,2,6,6-tetramethylpiperidide (TMP) within mixed metal, often ate, compositions. However, the precursor amine TMP(H) is rather expensive so a cheaper substitute would be welcome. Thus this study was aimed towards developing cheaper non-TMP based mixed-metal bases and, as cis-2,6-dimethylpiperidide (cis-DMP) was chosen as the alternative amide, developing cis-DMP zincate chemistry which has received meagre attention compared to that of its methyl-rich counterpart TMP. A new lithium diethylzincate, [(TMEDA)LiZn(cis-DMP)Et2] (TMEDA=N,N,N′,N′-tetramethylethylenediamine) has been synthesised by co-complexation of Li(cis-DMP), Et2Zn and TMEDA, and characterised by NMR (including DOSY) spectroscopy and X-ray crystallography, which revealed a dinuclear contact ion pair arrangement. By using N,N-diisopropylbenzamide as a test aromatic substrate, the deprotonative reactivity of [(TMEDA)LiZn(cis-DMP)Et2] has been probed and contrasted with that of the known but previously uninvestigated di-tert-butylzincate, [(TMEDA)LiZn(cis-DMP)tBu2]. The former was found to be the superior base (for example, producing the ortho-deuteriated product in respective yields of 78 % and 48 % following D2O quenching of zincated benzamide intermediates). An 88 % yield of 2-iodo-N,N-diisopropylbenzamide was obtained on reaction of two equivalents of the diethylzincate with the benzamide followed by iodination. Comparisons are also drawn using 1,1,1,3,3,3-hexamethyldisilazide (HMDS), diisopropylamide and TMP as the amide component in the lithium amide, Et2Zn and TMEDA system. Under certain conditions, the cis-DMP base system was found to give improved results in comparison to HMDS and diisopropylamide (DA), and comparable results to a TMP system. Two novel complexes isolated from reactions of the di-tert-butylzincate and crystallographically characterised, namely the pre-metallation complex [{(iPr)2N(Ph)C=O}LiZn(cis

  9. 77 FR 28467 - Identifying and Reducing Regulatory Burdens

    Science.gov (United States)

    2012-05-14

    ... online wherever practicable. Sec. 3. Setting Priorities. In implementing and improving their... regulatory priorities, to promote public participation in retrospective review, to modernize our regulatory..., agencies shall give priority, consistent with law, to those initiatives that will produce significant...

  10. Regulatory effects of cotranscriptional RNA structure formation and transitions.

    Science.gov (United States)

    Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2016-09-01

    RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.

  11. Integration of ATAC-seq and RNA-seq identifies human alpha cell and beta cell signature genes.

    Science.gov (United States)

    Ackermann, Amanda M; Wang, Zhiping; Schug, Jonathan; Naji, Ali; Kaestner, Klaus H

    2016-03-01

    Although glucagon-secreting α-cells and insulin-secreting β-cells have opposing functions in regulating plasma glucose levels, the two cell types share a common developmental origin and exhibit overlapping transcriptomes and epigenomes. Notably, destruction of β-cells can stimulate repopulation via transdifferentiation of α-cells, at least in mice, suggesting plasticity between these cell fates. Furthermore, dysfunction of both α- and β-cells contributes to the pathophysiology of type 1 and type 2 diabetes, and β-cell de-differentiation has been proposed to contribute to type 2 diabetes. Our objective was to delineate the molecular properties that maintain islet cell type specification yet allow for cellular plasticity. We hypothesized that correlating cell type-specific transcriptomes with an atlas of open chromatin will identify novel genes and transcriptional regulatory elements such as enhancers involved in α- and β-cell specification and plasticity. We sorted human α- and β-cells and performed the "Assay for Transposase-Accessible Chromatin with high throughput sequencing" (ATAC-seq) and mRNA-seq, followed by integrative analysis to identify cell type-selective gene regulatory regions. We identified numerous transcripts with either α-cell- or β-cell-selective expression and discovered the cell type-selective open chromatin regions that correlate with these gene activation patterns. We confirmed cell type-selective expression on the protein level for two of the top hits from our screen. The "group specific protein" (GC; or vitamin D binding protein) was restricted to α-cells, while CHODL (chondrolectin) immunoreactivity was only present in β-cells. Furthermore, α-cell- and β-cell-selective ATAC-seq peaks were identified to overlap with known binding sites for islet transcription factors, as well as with single nucleotide polymorphisms (SNPs) previously identified as risk loci for type 2 diabetes. We have determined the genetic landscape of

  12. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Structural Elucidation of cis / trans Dicaffeoylquinic Acid Photoisomerization Using Ion Mobility Spectrometry-Mass Spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    Zheng, Xueyun; Renslow, Ryan S.; Makola, Mpho M.; Webb, Ian K.; Deng, Liulin; Thomas, Dennis G.; Govind, Niranjan; Ibrahim, Yehia M.; Kabanda, Mwadham M.; Dubery, Ian A.; Heyman, Heino M.; Smith, Richard D.; Madala, Ntakadzeni E.; Baker, Erin S.

    2017-03-15

    Due to the recently uncovered health benefits and anti-HIV activities of dicaffeoylquinic acids (diCQAs), understanding their structures and functions is of great interest for drug discovery efforts. DiCQAs are analytically challenging to identify and quantify since they commonly exist as a diverse mixture of positional and geometric (cis/trans) isomers. In this work, we utilized ion mobility spectrometry coupled with mass spectrometry to separate the various isomers before and after UV irradiation. The experimental collision cross sections were then compared with theoretical structures to differentiate and identify the diCQA isomers. Our analyses found that naturally the diCQAs existed predominantly as trans/trans isomers, but after 3 h of UV irradiation, cis/cis, cis/trans, trans/cis, and trans/trans isomers were all present in the mixture. This is the first report of successful differentiation of cis/trans diCQA isomers individually, which shows the great promise of IMS coupled with theoretical calculations for determining the structure and activity relationships of different isomers in drug discovery studies.

  14. Cis-urocanic acid, a sunlight-induced immunosuppressive factor, activates immune suppression via the 5-HT2A receptor

    Science.gov (United States)

    Walterscheid, Jeffrey P.; Nghiem, Dat X.; Kazimi, Nasser; Nutt, Leta K.; McConkey, David J.; Norval, Mary; Ullrich, Stephen E.

    2006-01-01

    Exposure to UV radiation induces skin cancer and suppresses the immune response. To induce immune suppression, the electromagnetic energy of UV radiation must be absorbed by an epidermal photoreceptor and converted into a biologically recognizable signal. Two photoreceptors have been recognized: DNA and trans-urocanic acid (UCA). Trans-UCA is normally found in the outermost layer of skin and isomerizes to the cis isomer upon exposure to UV radiation. Although UCA was identified as a UV photoreceptor years ago, and many have documented its ability to induce immune suppression, its exact mode of action remains elusive. Particularly vexing has been the identity of the molecular pathway by which cis-UCA mediates immune suppression. Here we provide evidence that cis-UCA binds to the serotonin [5-hydroxytryptamine (5-HT)] receptor with relatively high affinity (Kd = 4.6 nM). Anti-cis-UCA antibody precipitates radiolabeled 5-HT, and the binding is inhibited by excess 5-HT and/or excess cis-UCA. Similarly, anti-5-HT antibody precipitates radiolabeled cis-UCA, and the binding is inhibited by excess 5-HT or excess cis-UCA. Calcium mobilization was activated when a mouse fibroblast line, stably transfected with the human 5-HT2A receptor, was treated with cis-UCA. Cis-UCA-induced calcium mobilization was blocked with a selective 5-HT2A receptor antagonist. UV- and cis-UCA-induced immune suppression was blocked by antiserotonin antibodies or by treating the mice with 5-HT2A receptor antagonists. Our findings identify cis-UCA as a serotonin receptor ligand and indicate that the immunosuppressive effects of cis-UCA and UV radiation are mediated by activation of the 5-HT2A receptor. PMID:17085585

  15. In silico analysis, mapping of regulatory elements and corresponding dna-protein interaction in polyphenol oxidase gene promoter from different rice varieties

    International Nuclear Information System (INIS)

    Mahmood, T.; Rehman, M.; Aziz, E.

    2015-01-01

    Polyphenol oxidase (PPO) is an important enzyme that has positive impact regarding plant resistance against different biotic and abiotic stresses. In the present study PPO promoter from six different rice varieties was amplified and then analyzed for cis- and trans-acting elements. The study revealed a total of 79 different cis-acting regulatory elements including 11 elements restricted to only one or other variety. Among six varieties Pakhal-Basmati had highest number (5) of these elements, whereas C-622 and Rachna-Basmati have no such sequences. Rachna-Basmati, IR-36-Basmati and Kashmir- Basmati had 1, 2 and 3 unique elements, respectively. Different elementsrelated to pathogen, salt and water stresses were found, which may be helpful in controlling PPO activity according to changing environment. Moreover, HADDOCK was used to understand molecular mechanism of PPO regulation and it was found that DNA-protein interactions are stabilized by many potential hydrogen bonds. Adenine and arginine were the most reactive residues in DNA and proteins respectively.Structural comparison of different protein-DNA complexes show that even a highly conserved transcriptional factor can adopt different conformations when they contact a different DNA binding sequence, however their stable interactions depend on the number of hydrogen bonds formed and distance. (author)

  16. 10 points about buying C.I.S

    International Nuclear Information System (INIS)

    Anon.

    1993-01-01

    On October 16, 1992, the U.S. Department of Commerce (DOC) settled the antidumping case against the CIS republics by imposing price and volume quotas on CIS uranium imported into the United States. Bound by a suspension agreement, each of the six uranium-producing CIS republics is responsible for restricting the flow of imports to the US-either directly or indirectly. (As the NUKEM Market Report went to press, the Ukraine government notified the DOC of its intent not to terminate the suspension agreement.) This action is to prevent undercutting price levels in the US domestic uranium markets. What follows are ten points about everything you should know about importing uranium from the uranium-producing CIS republics- Kazakhstan, Kyrgyzstan, Russian Federation, Tajikistan, Ukraine and Uzbekistan. Newcomers to the CIS scene should follow this simple roadmap and be aware of the issues they face as importers in terms of Commerce/Customs requirements and documentation and where to get them, when to buy the material and how to transport it, how to deal effectively with CIS exporters, and how to avoid unnecessary complications when buying CIS

  17. HIV-1 envelope sequence-based diversity measures for identifying recent infections.

    Directory of Open Access Journals (Sweden)

    Alexis Kafando

    Full Text Available Identifying recent HIV-1 infections is crucial for monitoring HIV-1 incidence and optimizing public health prevention efforts. To identify recent HIV-1 infections, we evaluated and compared the performance of 4 sequence-based diversity measures including percent diversity, percent complexity, Shannon entropy and number of haplotypes targeting 13 genetic segments within the env gene of HIV-1. A total of 597 diagnostic samples obtained in 2013 and 2015 from recently and chronically HIV-1 infected individuals were selected. From the selected samples, 249 (134 from recent versus 115 from chronic infections env coding regions, including V1-C5 of gp120 and the gp41 ectodomain of HIV-1, were successfully amplified and sequenced by next generation sequencing (NGS using the Illumina MiSeq platform. The ability of the four sequence-based diversity measures to correctly identify recent HIV infections was evaluated using the frequency distribution curves, median and interquartile range and area under the curve (AUC of the receiver operating characteristic (ROC. Comparing the median and interquartile range and evaluating the frequency distribution curves associated with the 4 sequence-based diversity measures, we observed that the percent diversity, number of haplotypes and Shannon entropy demonstrated significant potential to discriminate recent from chronic infections (p<0.0001. Using the AUC of ROC analysis, only the Shannon entropy measure within three HIV-1 env segments could accurately identify recent infections at a satisfactory level. The env segments were gp120 C2_1 (AUC = 0.806, gp120 C2_3 (AUC = 0.805 and gp120 V3 (AUC = 0.812. Our results clearly indicate that the Shannon entropy measure represents a useful tool for predicting HIV-1 infection recency.

  18. Elucidating the Small Regulatory RNA Repertoire of the Sea Anemone Anemonia viridis Based on Whole Genome and Small RNA Sequencing.

    Science.gov (United States)

    Urbarova, Ilona; Patel, Hardip; Forêt, Sylvain; Karlsen, Bård Ove; Jørgensen, Tor Erik; Hall-Spencer, Jason M; Johansen, Steinar D

    2018-02-01

    Cnidarians harbor a variety of small regulatory RNAs that include microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), but detailed information is limited. Here, we report the identification and expression of novel miRNAs and putative piRNAs, as well as their genomic loci, in the symbiotic sea anemone Anemonia viridis. We generated a draft assembly of the A. viridis genome with putative size of 313 Mb that appeared to be composed of about 36% repeats, including known transposable elements. We detected approximately equal fractions of DNA transposons and retrotransposons. Deep sequencing of small RNA libraries constructed from A. viridis adults sampled at a natural CO2 gradient off Vulcano Island, Italy, identified 70 distinct miRNAs. Eight were homologous to previously reported miRNAs in cnidarians, whereas 62 appeared novel. Nine miRNAs were recognized as differentially expressed along the natural seawater pH gradient. We found a highly abundant and diverse population of piRNAs, with a substantial fraction showing ping-pong signatures. We identified nearly 22% putative piRNAs potentially targeting transposable elements within the A. viridis genome. The A. viridis genome appeared similar in size to that of other hexacorals with a very high divergence of transposable elements resembling that of the sea anemone genus Exaiptasia. The genome encodes and expresses a high number of small regulatory RNAs, which include novel miRNAs and piRNAs. Differentially expressed small RNAs along the seawater pH gradient indicated regulatory gene responses to environmental stressors. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Common integration sites of published datasets identified using a graph-based framework

    Directory of Open Access Journals (Sweden)

    Alessandro Vasciaveo

    2016-01-01

    Full Text Available With next-generation sequencing, the genomic data available for the characterization of integration sites (IS has dramatically increased. At present, in a single experiment, several thousand viral integration genome targets can be investigated to define genomic hot spots. In a previous article, we renovated a formal CIS analysis based on a rigid fixed window demarcation into a more stretchy definition grounded on graphs. Here, we present a selection of supporting data related to the graph-based framework (GBF from our previous article, in which a collection of common integration sites (CIS was identified on six published datasets. In this work, we will focus on two datasets, ISRTCGD and ISHIV, which have been previously discussed. Moreover, we show in more detail the workflow design that originates the datasets.

  20. cis-Bifenthrin enantioselectively induces hepatic oxidative stress in mice.

    Science.gov (United States)

    Jin, Yuanxiang; Wang, Jiangcong; Pan, Xiuhong; Wang, Linggang; Fu, Zhengwei

    2013-09-01

    Bifenthrin (BF), as a chiral synthetic pyrethroid, is widely used to control field and household pests. In China, the commercial cis-BF contained two enantiomers including 1R-cis-BF and 1S-cis-BF. However, the difference in oxidative stress induced by the two enantiomers in mice still remains unclear. In the present study, 4 week-old adolescent male ICR mice were orally administered cis-BF, 1R-cis-BF or 1S-cis-BF daily for 2, 4 and 6 weeks at doses of 5 mg/kg/day, respectively. We found that the hepatic reactive oxygen species (ROS) levels, as well as the malondialdehyde (MDA) and glutathione (GSH) content both in the serum and liver increased significantly in the 4 or 6 weeks 1S-cis-BF treated groups. The activities of superoxide dismutase (SOD) and catalase (CAT) also changed significantly in the serum and liver of 1S-cis-BF treated mice. More importantly, the significant differences in MDA content and CAT activity both in the serum and liver, and the activities of total antioxidant capacity (T-AOC) and SOD in serum were also observed between the 1S-cis-BF and 1R-cis-BF treated groups. Moreover, the transcription of oxidative stress response related genes including Sod1, Cat and heme oxygenase-1(Ho-1) in the liver of 1S-cis-BF treated groups were also significant higher than those in 1R-cis-BF treated group. Thus, it was concluded that cis-BF induced hepatic oxidative stress in an enantiomer specific manner in mice when exposed during the puberty, and that 1S-cis-BF showed much more toxic in hepatic oxidative stress than 1R-cis-BF. Copyright © 2013 Elsevier Inc. All rights reserved.

  1. Extended exome sequencing identifies BACH2 as a novel major risk locus for Addison's disease.

    Science.gov (United States)

    Eriksson, D; Bianchi, M; Landegren, N; Nordin, J; Dalin, F; Mathioudaki, A; Eriksson, G N; Hultin-Rosenberg, L; Dahlqvist, J; Zetterqvist, H; Karlsson, Å; Hallgren, Å; Farias, F H G; Murén, E; Ahlgren, K M; Lobell, A; Andersson, G; Tandre, K; Dahlqvist, S R; Söderkvist, P; Rönnblom, L; Hulting, A-L; Wahlberg, J; Ekwall, O; Dahlqvist, P; Meadows, J R S; Bensing, S; Lindblad-Toh, K; Kämpe, O; Pielberg, G R

    2016-12-01

    Autoimmune disease is one of the leading causes of morbidity and mortality worldwide. In Addison's disease, the adrenal glands are targeted by destructive autoimmunity. Despite being the most common cause of primary adrenal failure, little is known about its aetiology. To understand the genetic background of Addison's disease, we utilized the extensively characterized patients of the Swedish Addison Registry. We developed an extended exome capture array comprising a selected set of 1853 genes and their potential regulatory elements, for the purpose of sequencing 479 patients with Addison's disease and 1394 controls. We identified BACH2 (rs62408233-A, OR = 2.01 (1.71-2.37), P = 1.66 × 10 -15 , MAF 0.46/0.29 in cases/controls) as a novel gene associated with Addison's disease development. We also confirmed the previously known associations with the HLA complex. Whilst BACH2 has been previously reported to associate with organ-specific autoimmune diseases co-inherited with Addison's disease, we have identified BACH2 as a major risk locus in Addison's disease, independent of concomitant autoimmune diseases. Our results may enable future research towards preventive disease treatment. © 2016 The Authors. Journal of Internal Medicine published by John Wiley & Sons Ltd on behalf of Association for Publication of The Journal of Internal Medicine.

  2. Evolution of Enzymatic Activities in the Enolase Superfamily: Stereochemically Distinct Mechanisms in Two Families of cis,cis-Muconate Lactonizing Enzymes

    Energy Technology Data Exchange (ETDEWEB)

    Sakai, A.; Fedorov, A; Fedorov, E; Schnoes, A; Glasner, M; Burley, S; Babbitt, P; Almo, S; Gerlt, J

    2009-01-01

    The mechanistically diverse enolase superfamily is a paradigm for elucidating Nature's strategies for divergent evolution of enzyme function. Each of the different reactions catalyzed by members of the superfamily is initiated by abstraction of the a-proton of a carboxylate substrate that is coordinated to an essential Mg2+. The muconate lactonizing enzyme (MLE) from Pseudomonas putida, a member of a family that catalyzes the syn-cycloisomerization of cis,cis-muconate to (4S)-muconolactone in the e-ketoadipate pathway, has provided critical insights into the structural bases for evolution of function within the superfamily. A second, divergent family of homologous MLEs that catalyzes anti-cycloisomerization has been identified. Structures of members of both families liganded with the common (4S)-muconolactone product (syn, Pseudomonas fluorescens, gi 70731221; anti, Mycobacterium smegmatis, gi 118470554) document that the conserved Lys at the end of the second e-strand in the (e/a)7e-barrel domain serves as the acid catalyst in both reactions. The different stereochemical courses (syn and anti) result from different structural strategies for determining substrate specificity: although the distal carboxylate group of the cis,cis-muconate substrate attacks the same face of the proximal double bond, opposite faces of the resulting enolate anion intermediate are presented to the conserved Lys acid catalyst. The discovery of two families of homologous, but stereochemically distinct, MLEs likely provides an example of 'pseudoconvergent' evolution of the same function from different homologous progenitors within the enolase superfamily, in which different spatial arrangements of active site functional groups and substrate specificity determinants support catalysis of the same reaction.

  3. Evolution of Enzymatic Activities in the Enolase Superfamily: Stereochemically Distinct Mechanisms in Two Families of cis,cis-Muconate Lactonizing Enzymes†

    Science.gov (United States)

    Sakai, Ayano; Fedorov, Alexander A.; Fedorov, Elena V.; Schnoes, Alexandra M.; Glasner, Margaret E.; Brown, Shoshana; Rutter, Marc E.; Bain, Kevin; Chang, Shawn; Gheyi, Tarun; Sauder, J. Michael; Burley, Stephen K.; Babbitt, Patricia C.; Almo, Steven C.; Gerlt, John A.

    2009-01-01

    The mechanistically diverse enolase superfamily is a paradigm for elucidating Nature’s strategies for divergent evolution of enzyme function. Each of the different reactions catalyzed by members of the superfamily is initiated by abstraction of the α-proton of a carboxylate substrate that is coordinated to an essential Mg2+. The muconate lactonizing enzyme (MLE) from Pseudomonas putida, a member of a family that catalyzes the syn-cycloisomerization of cis,cis-muconate to (4S)-muconolactone in the β-ketoadipate pathway, has provided critical insights into the structural bases for evolution of function within the superfamily. A second, divergent family of homologues MLEs that catalyzes anti-cycloisomerization has been identified. Structures of members of both families liganded with the common (4S)-muconolactone product (syn, Pseudomonas fluorescens, GI:70731221; anti, Mycobacterium smegmatis, GI:118470554) document that the conserved Lys at the end of the second β-strand in the (β/α)7β-barrel domain serves as the acid catalyst in both reactions. The different stereochemical courses (syn and anti) result from different structural strategies for determining substrate specificity: although the distal carboxylate group of the cis,cis-muconate substrate attacks the same face of the proximal double bond, opposite faces of the resulting enolate anion intermediate are presented to the conserved Lys acid catalyst. The discovery of two families of homologous, but stereochemically distinct, MLEs likely provides an example of “pseudoconvergent” evolution of the same function from different homologous progenitors within the enolase superfamily, in which different spatial arrangements of active site functional groups and substrate specificity determinants support catalysis of the same reaction. PMID:19220063

  4. Identifying recombinants in human and primate immunodeficiency virus sequence alignments using quartet scanning

    Directory of Open Access Journals (Sweden)

    Martin Darren P

    2009-04-01

    Full Text Available Abstract Background Recombination has a profound impact on the evolution of viruses, but characterizing recombination patterns in molecular sequences remains a challenging endeavor. Despite its importance in molecular evolutionary studies, identifying the sequences that exhibit such patterns has received comparatively less attention in the recombination detection framework. Here, we extend a quartet-mapping based recombination detection method to enable identification of recombinant sequences without prior specifications of either query and reference sequences. Through simulations we evaluate different recombinant identification statistics and significance tests. We compare the quartet approach with triplet-based methods that employ additional heuristic tests to identify parental and recombinant sequences. Results Analysis of phylogenetic simulations reveal that identifying the descendents of relatively old recombination events is a challenging task for all methods available, and that quartet scanning performs relatively well compared to the triplet based methods. The use of quartet scanning is further demonstrated by analyzing both well-established and putative HIV-1 recombinant strains. In agreement with recent findings, we provide evidence that the presumed circulating recombinant CRF02_AG is a 'pure' lineage, whereas the presumed parental lineage subtype G has a recombinant origin. We also demonstrate HIV-1 intrasubtype recombination, confirm the hybrid origin of SIV in chimpanzees and further disentangle the recombinant history of SIV lineages in a primate immunodeficiency virus data set. Conclusion Quartet scanning makes a valuable addition to triplet-based methods for identifying recombinant sequences without prior specifications of either query and reference sequences. The new method is available in the VisRD v.3.0 package http://www.cmp.uea.ac.uk/~vlm/visrd.

  5. Evolutionary Novelty in a Butterfly Wing Pattern through Enhancer Shuffling

    Science.gov (United States)

    Pardo-Diaz, Carolina; Hanly, Joseph J.; Martin, Simon H.; Mallet, James; Dasmahapatra, Kanchon K.; Salazar, Camilo; Joron, Mathieu; Nadeau, Nicola; McMillan, W. Owen; Jiggins, Chris D.

    2016-01-01

    An important goal in evolutionary biology is to understand the genetic changes underlying novel morphological structures. We investigated the origins of a complex wing pattern found among Amazonian Heliconius butterflies. Genome sequence data from 142 individuals across 17 species identified narrow regions associated with two distinct red colour pattern elements, dennis and ray. We hypothesise that these modules in non-coding sequence represent distinct cis-regulatory loci that control expression of the transcription factor optix, which in turn controls red pattern variation across Heliconius. Phylogenetic analysis of the two elements demonstrated that they have distinct evolutionary histories and that novel adaptive morphological variation was created by shuffling these cis-regulatory modules through recombination between divergent lineages. In addition, recombination of modules into different combinations within species further contributes to diversity. Analysis of the timing of diversification in these two regions supports the hypothesis of introgression moving regulatory modules between species, rather than shared ancestral variation. The dennis phenotype introgressed into Heliconius melpomene at about the same time that ray originated in this group, while ray introgressed back into H. elevatus much more recently. We show that shuffling of existing enhancer elements both within and between species provides a mechanism for rapid diversification and generation of novel morphological combinations during adaptive radiation. PMID:26771987

  6. Identifying transposon insertions and their effects from RNA-sequencing data.

    Science.gov (United States)

    de Ruiter, Julian R; Kas, Sjors M; Schut, Eva; Adams, David J; Koudijs, Marco J; Wessels, Lodewyk F A; Jonkers, Jos

    2017-07-07

    Insertional mutagenesis using engineered transposons is a potent forward genetic screening technique used to identify cancer genes in mouse model systems. In the analysis of these screens, transposon insertion sites are typically identified by targeted DNA-sequencing and subsequently assigned to predicted target genes using heuristics. As such, these approaches provide no direct evidence that insertions actually affect their predicted targets or how transcripts of these genes are affected. To address this, we developed IM-Fusion, an approach that identifies insertion sites from gene-transposon fusions in standard single- and paired-end RNA-sequencing data. We demonstrate IM-Fusion on two separate transposon screens of 123 mammary tumors and 20 B-cell acute lymphoblastic leukemias, respectively. We show that IM-Fusion accurately identifies transposon insertions and their true target genes. Furthermore, by combining the identified insertion sites with expression quantification, we show that we can determine the effect of a transposon insertion on its target gene(s) and prioritize insertions that have a significant effect on expression. We expect that IM-Fusion will significantly enhance the accuracy of cancer gene discovery in forward genetic screens and provide initial insight into the biological effects of insertions on candidate cancer genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. n-Alkane and clofibrate, a peroxisome proliferator, activate transcription of ALK2 gene encoding cytochrome P450alk2 through distinct cis-acting promoter elements in Candida maltosa

    International Nuclear Information System (INIS)

    Kogure, Takahisa; Takagi, Masamichi; Ohta, Akinori

    2005-01-01

    The ALK2 gene, encoding one of the n-alkane-hydroxylating cytochromes P450 in Candida maltosa, is induced by n-alkanes and a peroxisome proliferator, clofibrate. Deletion analysis of this gene's promoter revealed two cis-acting elements-an n-alkane-responsive element (ARE2) and a clofibrate-responsive element (CRE2)-that partly overlap in sequence but have distinct functions. ARE2-mediated activation responded to n-alkanes but not to clofibrate and was repressed by glucose. CRE2-mediated activation responded to polyunsaturated fatty acids and steroid hormones as well as to peroxisome proliferators but not to n-alkanes, and it was not repressed by glucose. Both elements mediated activation by oleic acid. Mutational analysis demonstrated that three CCG sequences in CRE2 were critical to the activation by clofibrate as well as to the in vitro binding of a specific protein to this element. These findings suggest that ALK2 is induced by peroxisome proliferators and steroid hormones through a specific CRE2-mediated regulatory mechanism

  8. Identification of absolute geometries of cis and trans molecular isomers by Coulomb Explosion Imaging.

    Science.gov (United States)

    Ablikim, Utuq; Bomme, Cédric; Xiong, Hui; Savelyev, Evgeny; Obaid, Razib; Kaderiya, Balram; Augustin, Sven; Schnorr, Kirsten; Dumitriu, Ileana; Osipov, Timur; Bilodeau, René; Kilcoyne, David; Kumarappan, Vinod; Rudenko, Artem; Berrah, Nora; Rolles, Daniel

    2016-12-02

    An experimental route to identify and separate geometric isomers by means of coincident Coulomb explosion imaging is presented, allowing isomer-resolved photoionization studies on isomerically mixed samples. We demonstrate the technique on cis/trans 1,2-dibromoethene (C 2 H 2 Br 2 ). The momentum correlation between the bromine ions in a three-body fragmentation process induced by bromine 3d inner-shell photoionization is used to identify the cis and trans structures of the isomers. The experimentally determined momentum correlations and the isomer-resolved fragment-ion kinetic energies are matched closely by a classical Coulomb explosion model.

  9. Dilemmas of compliance in the CIS

    International Nuclear Information System (INIS)

    Vorobyev, A.

    1996-01-01

    The objective of this paper is to examine some of the difficulties faced by Russia and other Common Independent States (CIS) in the field of compliance with disarmament treaties and non-proliferation regimes, as well as ways and means, particularly with regard to the legal framework, designed to overcome these difficulties. Naturally, the fate and pace of overcoming the existing problems will depend only partially on development of CIS States. A large variety of international factors and the general security will be essential for progress in resolving disarmament and arms control issues in the CIS

  10. Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens.

    Science.gov (United States)

    Glinsky, Gennadi V

    2016-09-19

    Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of

  11. Cross-comparison of the genome sequences from human, chimpanzee, Neanderthal and a Denisovan hominin identifies novel potentially compensated mutations

    Directory of Open Access Journals (Sweden)

    Zhang Guojie

    2011-07-01

    Full Text Available Abstract The recent publication of the draft genome sequences of the Neanderthal and a ~50,000-year-old archaic hominin from Denisova Cave in southern Siberia has ushered in a new age in molecular archaeology. We previously cross-compared the human, chimpanzee and Neanderthal genome sequences with respect to a set of disease-causing/disease-associated missense and regulatory mutations (Human Gene Mutation Database and succeeded in identifying genetic variants which, although apparently pathogenic in humans, may represent a 'compensated' wild-type state in at least one of the other two species. Here, in an attempt to identify further 'potentially compensated mutations' (PCMs of interest, we have compared our dataset of disease-causing/disease-associated mutations with their corresponding nucleotide positions in the Denisovan hominin, Neanderthal and chimpanzee genomes. Of the 15 human putatively disease-causing mutations that were found to be compensated in chimpanzee, Denisovan or Neanderthal, only a solitary F5 variant (Val1736Met was specific to the Denisovan. In humans, this missense mutation is associated with activated protein C resistance and an increased risk of thromboembolism and recurrent miscarriage. It is unclear at this juncture whether this variant was indeed a PCM in the Denisovan or whether it could instead have been associated with disease in this ancient hominin.

  12. Identification of Bacterial Small RNAs by RNA Sequencing

    DEFF Research Database (Denmark)

    Gómez Lozano, María; Marvig, Rasmus Lykke; Molin, Søren

    2014-01-01

    sequencing (RNA-seq) is described that involves the preparation and analysis of three different sequencing libraries. As a signifi cant number of unique sRNAs are identifi ed in each library, the libraries can be used either alone or in combination to increase the number of sRNAs identifi ed. The approach......Small regulatory RNAs (sRNAs) in bacteria are known to modulate gene expression and control a variety of processes including metabolic reactions, stress responses, and pathogenesis in response to environmental signals. A method to identify bacterial sRNAs on a genome-wide scale based on RNA...... may be applied to identify sRNAs in any bacterium under different growth and stress conditions....

  13. Up front in the CIS

    International Nuclear Information System (INIS)

    Grey, C.A.

    1994-01-01

    A picture is drawn of the current supply side of the front-end fuel cycle production capacities in the CIS. Uranium production has been steadily declining, as in the West. Market realities have been reflected in local costs of production since the break-up of the former Soviet Union and some uneconomic mines have been closed. In terms of actual production, Kazakhstan, Russia and Uzbekistan, remain among the top five uranium producers in the world. Western government action has been taken to restrict the market access for natural uranium from the CIS. Reactors in the CIS continue to be supplied with fabricated fuel solely by Russian, though Western fuel fabricators have reduced Russian supplies to Eastern Europe. Russia's current dominance in conversion and enrichment services in both the CIS and Eastern Europe is likely to continue as long as the present surplus low enriched uranium stocks last and surplus production capacity exists. Market penetration in the West has been limited by government action but Russia in 1993 still held about 20% of the world's conversion market and nearly 19% of the enrichment market. (6 figures, 2 tables, 4 references) (UK)

  14. The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line

    DEFF Research Database (Denmark)

    Suzuki, Harukazu; Forrest, Alistair R R; van Nimwegen, Erik

    2009-01-01

    , we identified the key transcription regulators, their time-dependent activities and target genes. Systematic siRNA knockdown of 52 transcription factors confirmed the roles of individual factors in the regulatory network. Our results indicate that cellular states are constrained by complex networks......Using deep sequencing (deepCAGE), the FANTOM4 study measured the genome-wide dynamics of transcription-start-site usage in the human monocytic cell line THP-1 throughout a time course of growth arrest and differentiation. Modeling the expression dynamics in terms of predicted cis-regulatory sites...... involving both positive and negative regulatory interactions among substantial numbers of transcription factors and that no single transcription factor is both necessary and sufficient to drive the differentiation process....

  15. Thermal decomposition and isomerization of cis-permethrin and beta-cypermethrin in the solid phase.

    Science.gov (United States)

    González Audino, Paola; Licastro, Susana A; Zerba, Eduardo

    2002-02-01

    The stability to heart of cis-permethrin and beta-cypermethrin in the solid phase was studied and the decomposition products identified. Samples heated at 210 degrees C in an oven in the dark showed that, in the absence of potassium chlorate (the salt present in smoke-generating formulations of these pyrethroids), cis-permethrin was not isomerized, although in the presence of that salt, decomposition was greater and thermal isomerization occurred. Other salts of the type KXO3 or NaXO3, with X being halogen or nitrogen, also led to a considerable thermal isomerization. Heating the insecticides in solution in the presence of potassium chlorate did not produce isomerization in any of the solvents assayed. Salt-catalysed thermal cis-trans isomerization was also found for other pyrethroids derived from permethrinic or deltamethrinic acid but not for those derived from chrysanthemic acid. The main thermal degradation processes of cis-permethrin and beta-cypermethrin decomposition when potassium chlorate was present were cyclopropane isomerization, ester cleavage and subsequent oxidation of the resulting products. Permethrinic acid, 3-phenoxybenzyle chloride, alcohol, aldehyde and acid were identified in both cases, as well as 3-phenoxybenzyl cyanide from beta-cypermethrin. A similar decomposition pattern occurred after combustion of pyrethroid fumigant formulations.

  16. Alternative approaches for identifying acute systemic toxicity: Moving from research to regulatory testing.

    Science.gov (United States)

    Hamm, Jon; Sullivan, Kristie; Clippinger, Amy J; Strickland, Judy; Bell, Shannon; Bhhatarai, Barun; Blaauboer, Bas; Casey, Warren; Dorman, David; Forsby, Anna; Garcia-Reyero, Natàlia; Gehen, Sean; Graepel, Rabea; Hotchkiss, Jon; Lowit, Anna; Matheson, Joanna; Reaves, Elissa; Scarano, Louis; Sprankle, Catherine; Tunkel, Jay; Wilson, Dan; Xia, Menghang; Zhu, Hao; Allen, David

    2017-06-01

    Acute systemic toxicity testing provides the basis for hazard labeling and risk management of chemicals. A number of international efforts have been directed at identifying non-animal alternatives for in vivo acute systemic toxicity tests. A September 2015 workshop, Alternative Approaches for Identifying Acute Systemic Toxicity: Moving from Research to Regulatory Testing, reviewed the state-of-the-science of non-animal alternatives for this testing and explored ways to facilitate implementation of alternatives. Workshop attendees included representatives from international regulatory agencies, academia, nongovernmental organizations, and industry. Resources identified as necessary for meaningful progress in implementing alternatives included compiling and making available high-quality reference data, training on use and interpretation of in vitro and in silico approaches, and global harmonization of testing requirements. Attendees particularly noted the need to characterize variability in reference data to evaluate new approaches. They also noted the importance of understanding the mechanisms of acute toxicity, which could be facilitated by the development of adverse outcome pathways. Workshop breakout groups explored different approaches to reducing or replacing animal use for acute toxicity testing, with each group crafting a roadmap and strategy to accomplish near-term progress. The workshop steering committee has organized efforts to implement the recommendations of the workshop participants. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Cis-acting elements in the promoter region of the human aldolase C gene.

    Science.gov (United States)

    Buono, P; de Conciliis, L; Olivetta, E; Izzo, P; Salvatore, F

    1993-08-16

    We investigated the cis-acting sequences involved in the expression of the human aldolase C gene by transient transfections into human neuroblastoma cells (SKNBE). We demonstrate that 420 bp of the 5'-flanking DNA direct at high efficiency the transcription of the CAT reporter gene. A deletion between -420 bp and -164 bp causes a 60% decrease of CAT activity. Gel shift and DNase I footprinting analyses revealed four protected elements: A, B, C and D. Competition analyses indicate that Sp1 or factors sharing a similar sequence specificity bind to elements A and B, but not to elements C and D. Sequence analysis shows a half palindromic ERE motif (GGTCA), in elements B and D. Region D binds a transactivating factor which appears also essential to stabilize the initiation complex.

  18. A Sequence and Structure Based Method to Predict Putative Substrates, Functions and Regulatory Networks of Endo Proteases

    Science.gov (United States)

    Venkatraman, Prasanna; Balakrishnan, Satish; Rao, Shashidhar; Hooda, Yogesh; Pol, Suyog

    2009-01-01

    Background Proteases play a central role in cellular homeostasis and are responsible for the spatio- temporal regulation of function. Many putative proteases have been recently identified through genomic approaches, leading to a surge in global profiling attempts to characterize their function. Through such efforts and others it has become evident that many proteases play non-traditional roles. Accordingly, the number and the variety of the substrate repertoire of proteases are expected to be much larger than previously assumed. In line with such global profiling attempts, we present here a method for the prediction of natural substrates of endo proteases (human proteases used as an example) by employing short peptide sequences as specificity determinants. Methodology/Principal Findings Our method incorporates specificity determinants unique to individual enzymes and physiologically relevant dual filters namely, solvent accessible surface area-a parameter dependent on protein three-dimensional structure and subcellular localization. By incorporating such hitherto unused principles in prediction methods, a novel ligand docking strategy to mimic substrate binding at the active site of the enzyme, and GO functions, we identify and perform subjective validation on putative substrates of matriptase and highlight new functions of the enzyme. Using relative solvent accessibility to rank order we show how new protease regulatory networks and enzyme cascades can be created. Conclusion We believe that our physiologically relevant computational approach would be a very useful complementary method in the current day attempts to profile proteases (endo proteases in particular) and their substrates. In addition, by using functional annotations, we have demonstrated how normal and unknown functions of a protease can be envisaged. We have developed a network which can be integrated to create a proteolytic world. This network can in turn be extended to integrate other regulatory

  19. Characterization of noncoding regulatory DNA in the human genome.

    Science.gov (United States)

    Elkon, Ran; Agami, Reuven

    2017-08-08

    Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.

  20. The value of bladder mapping and prostatic urethra biopsies for detection of carcinoma in situ (CIS).

    Science.gov (United States)

    Gudjónsson, Sigurdur; Bläckberg, Mats; Chebil, Gunilla; Jahnson, Staffan; Olsson, Hans; Bendahl, Pär-Ola; Månsson, Wiking; Liedberg, Fredrik

    2012-07-01

    It is well known that CIS is a major risk factor for muscle-invasive bladder cancer and that this entity can be difficult to diagnose. Taking cold-cup mapping biopsies from different areas of the bladder (BMAP) is commonly used in patients at risk of harbouring CIS. The diagnostic accuracy of this approach has not been assessed until now. By using the CIS found in the cystoprostatectomy specimen as an indicator of the true occurrence of CIS and comparing that with the findings of BMAP, it is clear that the sensitivity of BMAP to detect CIS when present is low and that negative findings should be considered unreliable. To assess the value of bladder mapping and prostatic urethra biopsies for detection of urothelial carcinoma in situ (CIS). CIS of the urinary bladder is a flat high-grade lesion of the mucosa associated with a significant risk of progression to muscle-invasive disease. CIS is difficult to identify on cystoscopy, and definite diagnosis requires histopathology. Traditionally, if CIS is suspected, multiple cold-cup biopsies are taken from the bladder mucosa, and resection biopsies are obtained from the prostatic urethra in males. This approach is often called bladder mapping (BMAP). The accuracy of BMAP as a diagnostic tool is not known. Male patients with bladder cancer scheduled for cystectomy underwent cold-cup bladder biopsies (sidewalls, posterior wall, dome, trigone), and resection biopsies were taken from the prostatic urethra. After cystectomy, the surgical specimen was investigated in a standardised manner and subsequently compared with the BMAP biopsies for the presence of CIS. The histopathology reports of 162 patients were analysed. CIS was detected in 46% of the cystoprostatectomy specimens, and multiple (≥2) CIS lesions were found in 30%. BMAP (cold-cup bladder biopsies + resection biopsies from the prostatic urethra) provided sensitivity of 51% for any CIS, and 55% for multiple CIS lesions. The cold-cup biopsies for CIS in the bladder

  1. Repertoire of bovine miRNA and miRNA-like small regulatory RNAs expressed upon viral infection.

    Directory of Open Access Journals (Sweden)

    Evgeny A Glazov

    Full Text Available MicroRNA (miRNA and other types of small regulatory RNAs play a crucial role in the regulation of gene expression in eukaryotes. Several distinct classes of small regulatory RNAs have been discovered in recent years. To extend the repertoire of small RNAs characterized in mammals and to examine relationship between host miRNA expression and viral infection we used Illumina's ultrahigh throughput sequencing approach. We sequenced three small RNA libraries prepared from cell line derived from the adult bovine kidney under normal conditions and upon infection of the cell line with Bovine herpesvirus 1. We used a bioinformatics approach to distinguish authentic mature miRNA sequences from other classes of small RNAs and short RNA fragments represented in the sequencing data. Using this approach we detected 219 out of 356 known bovine miRNAs and 115 respective miRNA* sequences. In addition we identified five new bovine orthologs of known mammalian miRNAs and discovered 268 new cow miRNAs many of which are not identifiable in other mammalian genomes and thus might be specific to the ruminant lineage. In addition we found seven new bovine mirtron candidates. We also discovered 10 small nucleolar RNA (snoRNA loci that give rise to small RNA with possible miRNA-like function. Results presented in this study extend our knowledge of the biology and evolution of small regulatory RNAs in mammals and illuminate mechanisms of small RNA biogenesis and function. New miRNA sequences and the original sequencing data have been submitted to miRNA repository (miRBase and NCBI GEO archive respectively. We envisage that these resources will facilitate functional annotation of the bovine genome and promote further functional and comparative genomics studies of small regulatory RNA in mammals.

  2. A SNP in the HTT promoter alters NF-κB binding and is a bidirectional genetic modifier of Huntington disease

    DEFF Research Database (Denmark)

    Bečanović, Kristina; Nørremølle, Anne; Neal, Scott J

    2015-01-01

    Cis-regulatory variants that alter gene expression can modify disease expressivity, but none have previously been identified in Huntington disease (HD). Here we provide in vivo evidence in HD patients that cis-regulatory variants in the HTT promoter are bidirectional modifiers of HD age of onset....

  3. The Evolution of Lineage-Specific Regulatory Activities in the Human Embryonic Limb

    OpenAIRE

    Cotney, Justin; Leng, Jing; Yin, Jun; Reilly, Steven K.; DeMare, Laura E.; Emera, Deena; Ayoub, Albert E.; Rakic, Pasko; Noonan, James P.

    2013-01-01

    The evolution of human anatomical features likely involved changes in gene regulation during development. However, the nature and extent of human-specific developmental regulatory functions remain unknown. We obtained a genome-wide view of cis-regulatory evolution in human embryonic tissues by comparing the histone modification H3K27ac, which provides a quantitative readout of promoter and enhancer activity, during human, rhesus, and mouse limb development. Based on increased H3K27ac, we find...

  4. Natural history bycatch: a pipeline for identifying metagenomic sequences in RADseq data

    Directory of Open Access Journals (Sweden)

    Iris Holmes

    2018-04-01

    Full Text Available Background Reduced representation genomic datasets are increasingly becoming available from a variety of organisms. These datasets do not target specific genes, and so may contain sequences from parasites and other organisms present in the target tissue sample. In this paper, we demonstrate that (1 RADseq datasets can be used for exploratory analysis of tissue-specific metagenomes, and (2 tissue collections house complete metagenomic communities, which can be investigated and quantified by a variety of techniques. Methods We present an exploratory method for mining metagenomic “bycatch” sequences from a range of host tissue types. We use a combination of the pyRAD assembly pipeline, NCBI’s blastn software, and custom R scripts to isolate metagenomic sequences from RADseq type datasets. Results When we focus on sequences that align with existing references in NCBI’s GenBank, we find that between three and five percent of identifiable double-digest restriction site associated DNA (ddRAD sequences from host tissue samples are from phyla to contain known blood parasites. In addition to tissue samples, we examine ddRAD sequences from metagenomic DNA extracted snake and lizard hind-gut samples. We find that the sequences recovered from these samples match with expected bacterial and eukaryotic gut microbiome phyla. Discussion Our results suggest that (1 museum tissue banks originally collected for host DNA archiving are also preserving valuable parasite and microbiome communities, (2 that publicly available RADseq datasets may include metagenomic sequences that could be explored, and (3 that restriction site approaches are a useful exploratory technique to identify microbiome lineages that could be missed by primer-based approaches.

  5. Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

    Science.gov (United States)

    Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

    2012-02-01

    The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.

  6. Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

    Science.gov (United States)

    Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

    2009-01-01

    Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary

  7. A Survey of 6,300 Genomic Fragments for cis-Regulatory Activity in the Imaginal Discs of Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Aurélie Jory

    2012-10-01

    Full Text Available Over 6,000 fragments from the genome of Drosophila melanogaster were analyzed for their ability to drive expression of GAL4 reporter genes in the third-instar larval imaginal discs. About 1,200 reporter genes drove expression in the eye, antenna, leg, wing, haltere, or genital imaginal discs. The patterns ranged from large regions to individual cells. About 75% of the active fragments drove expression in multiple discs; 20% were expressed in ventral, but not dorsal, discs (legs, genital, and antenna, whereas ∼23% were expressed in dorsal but not ventral discs (wing, haltere, and eye. Several patterns, for example, within the leg chordotonal organ, appeared a surprisingly large number of times. Unbiased searches for DNA sequence motifs suggest candidate transcription factors that may regulate enhancers with shared activities. Together, these expression patterns provide a valuable resource to the community and offer a broad overview of how transcriptional regulatory information is distributed in the Drosophila genome.

  8. Massive contribution of transposable elements to mammalian regulatory sequences.

    Science.gov (United States)

    Rayan, Nirmala Arul; Del Rosario, Ricardo C H; Prabhakar, Shyam

    2016-09-01

    Barbara McClintock discovered the existence of transposable elements (TEs) in the late 1940s and initially proposed that they contributed to the gene regulatory program of higher organisms. This controversial idea gained acceptance only much later in the 1990s, when the first examples of TE-derived promoter sequences were uncovered. It is now known that half of the human genome is recognizably derived from TEs. It is thus important to understand the scope and nature of their contribution to gene regulation. Here, we provide a timeline of major discoveries in this area and discuss how transposons have revolutionized our understanding of mammalian genomes, with a special emphasis on the massive contribution of TEs to primate evolution. Our analysis of primate-specific functional elements supports a simple model for the rate at which new functional elements arise in unique and TE-derived DNA. Finally, we discuss some of the challenges and unresolved questions in the field, which need to be addressed in order to fully characterize the impact of TEs on gene regulation, evolution and disease processes. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Prioritization of gene regulatory interactions from large-scale modules in yeast

    Directory of Open Access Journals (Sweden)

    Bringas Ricardo

    2008-01-01

    Full Text Available Abstract Background The identification of groups of co-regulated genes and their transcription factors, called transcriptional modules, has been a focus of many studies about biological systems. While methods have been developed to derive numerous modules from genome-wide data, individual links between regulatory proteins and target genes still need experimental verification. In this work, we aim to prioritize regulator-target links within transcriptional modules based on three types of large-scale data sources. Results Starting with putative transcriptional modules from ChIP-chip data, we first derive modules in which target genes show both expression and function coherence. The most reliable regulatory links between transcription factors and target genes are established by identifying intersection of target genes in coherent modules for each enriched functional category. Using a combination of genome-wide yeast data in normal growth conditions and two different reference datasets, we show that our method predicts regulatory interactions with significantly higher predictive power than ChIP-chip binding data alone. A comparison with results from other studies highlights that our approach provides a reliable and complementary set of regulatory interactions. Based on our results, we can also identify functionally interacting target genes, for instance, a group of co-regulated proteins related to cell wall synthesis. Furthermore, we report novel conserved binding sites of a glycoprotein-encoding gene, CIS3, regulated by Swi6-Swi4 and Ndd1-Fkh2-Mcm1 complexes. Conclusion We provide a simple method to prioritize individual TF-gene interactions from large-scale transcriptional modules. In comparison with other published works, we predict a complementary set of regulatory interactions which yields a similar or higher prediction accuracy at the expense of sensitivity. Therefore, our method can serve as an alternative approach to prioritization for

  10. Optimization of Pseudomonas putida KT2440 as host for the production of cis, cis-muconate from benzoate

    NARCIS (Netherlands)

    Duuren, van J.B.J.H.

    2011-01-01

    Optimization of Pseudomonas putida KT2440 as host for the production of cis, cis-muconate

    from benzoate P. putida KT2440 was used as biocatalyst given its versatile and energetically robust metabolism.

    Therefore, a mutant was generated and a process developed based on which a

  11. Distinct cis regulatory elements govern the expression of TAG1 in embryonic sensory ganglia and spinal cord.

    Directory of Open Access Journals (Sweden)

    Yoav Hadas

    Full Text Available Cell fate commitment of spinal progenitor neurons is initiated by long-range, midline-derived, morphogens that regulate an array of transcription factors that, in turn, act sequentially or in parallel to control neuronal differentiation. Included among these are transcription factors that regulate the expression of receptors for guidance cues, thereby determining axonal trajectories. The Ig/FNIII superfamily molecules TAG1/Axonin1/CNTN2 (TAG1 and Neurofascin (Nfasc are co-expressed in numerous neuronal cell types in the CNS and PNS - for example motor, DRG and interneurons - both promote neurite outgrowth and both are required for the architecture and function of nodes of Ranvier. The genes encoding TAG1 and Nfasc are adjacent in the genome, an arrangement which is evolutionarily conserved. To study the transcriptional network that governs TAG1 and Nfasc expression in spinal motor and commissural neurons, we set out to identify cis elements that regulate their expression. Two evolutionarily conserved DNA modules, one located between the Nfasc and TAG1 genes and the second directly 5' to the first exon and encompassing the first intron of TAG1, were identified that direct complementary expression to the CNS and PNS, respectively, of the embryonic hindbrain and spinal cord. Sequential deletions and point mutations of the CNS enhancer element revealed a 130bp element containing three conserved E-boxes required for motor neuron expression. In combination, these two elements appear to recapitulate a major part of the pattern of TAG1 expression in the embryonic nervous system.

  12. The Non-Coding Regulatory RNA Revolution in Archaea

    Directory of Open Access Journals (Sweden)

    Diego Rivera Gelsinger

    2018-03-01

    Full Text Available Small non-coding RNAs (sRNAs are ubiquitously found in the three domains of life playing large-scale roles in gene regulation, transposable element silencing and defense against foreign elements. While a substantial body of experimental work has been done to uncover function of sRNAs in Bacteria and Eukarya, the functional roles of sRNAs in Archaea are still poorly understood. Recently, high throughput studies using RNA-sequencing revealed that sRNAs are broadly expressed in the Archaea, comprising thousands of transcripts within the transcriptome during non-challenged and stressed conditions. Antisense sRNAs, which overlap a portion of a gene on the opposite strand (cis-acting, are the most abundantly expressed non-coding RNAs and they can be classified based on their binding patterns to mRNAs (3′ untranslated region (UTR, 5′ UTR, CDS-binding. These antisense sRNAs target many genes and pathways, suggesting extensive roles in gene regulation. Intergenic sRNAs are less abundantly expressed and their targets are difficult to find because of a lack of complete overlap between sRNAs and target mRNAs (trans-acting. While many sRNAs have been validated experimentally, a regulatory role has only been reported for very few of them. Further work is needed to elucidate sRNA-RNA binding mechanisms, the molecular determinants of sRNA-mediated regulation, whether protein components are involved and how sRNAs integrate with complex regulatory networks.

  13. Exome sequencing identifies SUCO mutations in mesial temporal lobe epilepsy.

    Science.gov (United States)

    Sha, Zhiqiang; Sha, Longze; Li, Wenting; Dou, Wanchen; Shen, Yan; Wu, Liwen; Xu, Qi

    2015-03-30

    Mesial temporal lobe epilepsy (mTLE) is the main type and most common medically intractable form of epilepsy. Severity of disease-based stratified samples may help identify new disease-associated mutant genes. We analyzed mRNA expression profiles from patient hippocampal tissue. Three of the seven patients had severe mTLE with generalized-onset convulsions and consciousness loss that occurred over many years. We found that compared with other groups, patients with severe mTLE were classified into a distinct group. Whole-exome sequencing and Sanger sequencing validation in all seven patients identified three novel SUN domain-containing ossification factor (SUCO) mutations in severely affected patients. Furthermore, SUCO knock down significantly reduced dendritic length in vitro. Our results indicate that mTLE defects may affect neuronal development, and suggest that neurons have abnormal development due to lack of SUCO, which may be a generalized-onset epilepsy-related gene. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  14. Nominal Anchors in the CIS

    OpenAIRE

    Peter M Keller; Thomas J Richardson

    2003-01-01

    Monetary policy has become increasingly important in the countries of the Commonwealth of Independent States (CIS) as fiscal adjustment and structural reforms have taken root. Inflation has been brought down to relatively low levels in almost all of these countries, raising the question of what should be the appropriate nominal anchor at this stage. Formally, almost all CIS countries have floating exchange rate regimes, yet in practice they manage their exchange rates very heavily, perhaps be...

  15. Suppressor mutations identify amino acids in PAA-1/PR65 that facilitate regulatory RSA-1/B″ subunit targeting of PP2A to centrosomes in C. elegans.

    Science.gov (United States)

    Lange, Karen I; Heinrichs, Jeffrey; Cheung, Karen; Srayko, Martin

    2013-01-15

    Protein phosphorylation and dephosphorylation is a key mechanism for the spatial and temporal regulation of many essential developmental processes and is especially prominent during mitosis. The multi-subunit protein phosphatase 2A (PP2A) enzyme plays an important, yet poorly characterized role in dephosphorylating proteins during mitosis. PP2As are heterotrimeric complexes comprising a catalytic, structural, and regulatory subunit. Regulatory subunits are mutually exclusive and determine subcellular localization and substrate specificity of PP2A. At least 3 different classes of regulatory subunits exist (termed B, B', B″) but there is no obvious similarity in primary sequence between these classes. Therefore, it is not known how these diverse regulatory subunits interact with the same holoenzyme to facilitate specific PP2A functions in vivo. The B″ family of regulatory subunits is the least understood because these proteins lack conserved structural domains. RSA-1 (regulator of spindle assembly) is a regulatory B″ subunit required for mitotic spindle assembly in Caenorhabditis elegans. In order to address how B″ subunits interact with the PP2A core enzyme, we focused on a conditional allele, rsa-1(or598ts), and determined that this mutation specifically disrupts the protein interaction between RSA-1 and the PP2A structural subunit, PAA-1. Through genetic screening, we identified a putative interface on the PAA-1 structural subunit that interacts with a defined region of RSA-1/B″. In the context of previously published results, these data propose a mechanism of how different PP2A B-regulatory subunit families can bind the same holoenzyme in a mutually exclusive manner, to perform specific tasks in vivo.

  16. Suppressor mutations identify amino acids in PAA-1/PR65 that facilitate regulatory RSA-1/B″ subunit targeting of PP2A to centrosomes in C. elegans

    Directory of Open Access Journals (Sweden)

    Karen I. Lange

    2012-11-01

    Protein phosphorylation and dephosphorylation is a key mechanism for the spatial and temporal regulation of many essential developmental processes and is especially prominent during mitosis. The multi-subunit protein phosphatase 2A (PP2A enzyme plays an important, yet poorly characterized role in dephosphorylating proteins during mitosis. PP2As are heterotrimeric complexes comprising a catalytic, structural, and regulatory subunit. Regulatory subunits are mutually exclusive and determine subcellular localization and substrate specificity of PP2A. At least 3 different classes of regulatory subunits exist (termed B, B′, B″ but there is no obvious similarity in primary sequence between these classes. Therefore, it is not known how these diverse regulatory subunits interact with the same holoenzyme to facilitate specific PP2A functions in vivo. The B″ family of regulatory subunits is the least understood because these proteins lack conserved structural domains. RSA-1 (regulator of spindle assembly is a regulatory B″ subunit required for mitotic spindle assembly in Caenorhabditis elegans. In order to address how B″ subunits interact with the PP2A core enzyme, we focused on a conditional allele, rsa-1(or598ts, and determined that this mutation specifically disrupts the protein interaction between RSA-1 and the PP2A structural subunit, PAA-1. Through genetic screening, we identified a putative interface on the PAA-1 structural subunit that interacts with a defined region of RSA-1/B″. In the context of previously published results, these data propose a mechanism of how different PP2A B-regulatory subunit families can bind the same holoenzyme in a mutually exclusive manner, to perform specific tasks in vivo.

  17. Development and utilization of complementary communication channels for treatment decision making and survivorship issues among cancer patients: The CIS Research Consortium Experience.

    Science.gov (United States)

    Fleisher, Linda; Wen, Kuang Yi; Miller, Suzanne M; Diefenbach, Michael; Stanton, Annette L; Ropka, Mary; Morra, Marion; Raich, Peter C

    2015-11-01

    Cancer patients and survivors are assuming active roles in decision-making and digital patient support tools are widely used to facilitate patient engagement. As part of Cancer Information Service Research Consortium's randomized controlled trials focused on the efficacy of eHealth interventions to promote informed treatment decision-making for newly diagnosed prostate and breast cancer patients, and post-treatment breast cancer, we conducted a rigorous process evaluation to examine the actual use of and perceived benefits of two complementary communication channels -- print and eHealth interventions. The three Virtual Cancer Information Service (V-CIS) interventions were developed through a rigorous developmental process, guided by self-regulatory theory, informed decision-making frameworks, and health communications best practices. Control arm participants received NCI print materials; experimental arm participants received the additional V-CIS patient support tool. Actual usage data from the web-based V-CIS was also obtained and reported. Print materials were highly used by all groups. About 60% of the experimental group reported using the V-CIS. Those who did use the V-CIS rated it highly on improvements in knowledge, patient-provider communication and decision-making. The findings show that how patients actually use eHealth interventions either singularly or within the context of other communication channels is complex. Integrating rigorous best practices and theoretical foundations is essential and multiple communication approaches should be considered to support patient preferences.

  18. Studies on radiosensitization of Escherichia coli cells by cis-platinum complexes

    International Nuclear Information System (INIS)

    Zimbrick, J.D.; Sukrochana, A.; Richmond, R.C.

    1979-01-01

    We recently reported that the antitumor drug cis-Pt(NH 3 ) 2 Cl 2 (cis-DDP) produces significant radiosensitization of anoxic E coli C cells. We have extended these studies to three other platinum drugs, all of which have been shown to be more effective antitumor drugs than cis-DDP. The drugs are: cis-dichloro bis(ethylene imine) Pt(II) (cis-DEP); cis-dichlorobicyclopentylamine Pt(II) (cis-PAD); and Pt-thymine blue (cis-PTB). Survival curve studies indicate that these drugs all produce greater anoxic radiosensitization of E coli C than cis-DDP at concentrations which are less toxic to the cells than similar concentrations of cis-DDP. If the cells are treated with any one of these drugs for two hours and then washed to remove the drug before irradiation, no detectable radiosensitization is found. We conclude that these drugs have the potential for being useful agents in combined modality therapy and that they warrant further study in mammalian systems

  19. Enhancing regulatory effectiveness by improving the process for identifying and resolving generic issues

    International Nuclear Information System (INIS)

    Vander Molen, Harold J.

    2001-01-01

    The Generic Issues Program first began formally in response to a Commission directive in October of 1976. In 1983, it became one of the first programs to make successful use of probabilistic risk information to aid in regulatory decision-making. In the 16 years since the program became quantitative, 836 issues have been processed. Of these, 106 reactor safety issues were prioritized as requiring further evaluation to determine the final resolution. Approximately a dozen generic issues remain unresolved. Although there is far less reactor licensing activity than in the 1970s, new issues continue to be identified from research and operational experience. These issues often involve complex and controversial questions of safety and regulation, and an efficient and effective means of addressing these issues is essential for regulatory effectiveness. Issues that involve a significant safety question require swift, effective, enforceable, and cost-effective regulatory actions. Issues that are of little safety significance must be quickly shown to be so and dismissed in an expeditious manner so as to avoid unnecessary expenditure of limited resources and to reduce regulatory uncertainty. Additionally, in the time since the generic issue program began, probabilistic risk assessment techniques have advanced significantly while agency resources have continued to diminish. Accordingly, the paper discusses the steps that have been taken to enhance the effectiveness and efficiency of the generic issue resolution process. Additionally, four resolved issues are discussed, along with key elements of a proposed new procedure for resolving potential generic issues

  20. Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

    Science.gov (United States)

    Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

    2002-07-01

    Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.

  1. Molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer myostatin gene

    Directory of Open Access Journals (Sweden)

    Smith-Keune Carolyn

    2008-02-01

    Full Text Available Abstract Background Myostatin (MSTN is a member of the transforming growth factor-β superfamily that negatively regulates growth of skeletal muscle tissue. The gene encoding for the MSTN peptide is a consolidate candidate for the enhancement of productivity in terrestrial livestock. This gene potentially represents an important target for growth improvement of cultured finfish. Results Here we report molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer MSTN-1 gene. The barramundi MSTN-1 was encoded by three exons 379, 371 and 381 bp in length and translated into a 376-amino acid peptide. Intron 1 and 2 were 412 and 819 bp in length and presented typical GT...AG splicing sites. The upstream region contained cis-regulatory elements such as TATA-box and E-boxes. A first assessment of sequence variability suggested that higher mutation rates are found in the 5' flanking region with several SNP's present in this species. A putative micro RNA target site has also been observed in the 3'UTR (untranslated region and is highly conserved across teleost fish. The deduced amino acid sequence was conserved across vertebrates and exhibited characteristic conserved putative functional residues including a cleavage motif of proteolysis (RXXR, nine cysteines and two glycosilation sites. A qualitative analysis of the barramundi MSTN-1 expression pattern revealed that, in adult fish, transcripts are differentially expressed in various tissues other than skeletal muscles including gill, heart, kidney, intestine, liver, spleen, eye, gonad and brain. Conclusion Our findings provide valuable insights such as sequence variation and genomic information which will aid the further investigation of the barramundi MSTN-1 gene in association with growth. The finding for the first time in finfish MSTN of a miRNA target site in the 3'UTR provides an opportunity for the identification of regulatory mutations on the

  2. A review of recent analyses of the Canadian Incidence Study of Reported Child Abuse and Neglect (CIS

    Directory of Open Access Journals (Sweden)

    D. Potter

    2015-01-01

    Full Text Available Introduction: The objective of this analysis is to identify, assess the quality and summarize the findings of peer-reviewed articles that used data from the Canadian Incidence Study of Reported Child Abuse and Neglect (CIS published since November 2011 and data from provincial oversamples of the CIS as well as to illustrate evolving uses of these datasets. Methods: Articles were identified from the Public Health Agency of Canada's data request records tracking access to CIS data and publications produced from that data. At least two raters independently reviewed and appraised the quality of each article. Results: A total of 32 articles were included. Common strengths of articles included clearly stated research aims, appropriate control variables and analyses, sufficient sample sizes, appropriate conclusions and relevance to practice or policy. Common problem areas of articles included unclear definitions for variables and inclusion criteria of cases. Articles frequently measured the associations between maltreatment, child, caregiver, household and agency/referral characteristics and investigative outcomes such as opening cases for ongoing services and placement. Conclusion: Articles using CIS data were rated positively on most quality indicators. Researchers have recently focussed on inadequately studied categories of maltreatment (exposure to intimate partner violence [IPV], neglect and emotional maltreatment and examined factors specific to First Nations children. Data from the CIS oversamples have been underutilized. The use of multivariate analysis techniques has increased.

  3. ChromaSig: a probabilistic approach to finding common chromatin signatures in the human genome.

    Directory of Open Access Journals (Sweden)

    Gary Hon

    2008-10-01

    Full Text Available Computational methods to identify functional genomic elements using genetic information have been very successful in determining gene structure and in identifying a handful of cis-regulatory elements. But the vast majority of regulatory elements have yet to be discovered, and it has become increasingly apparent that their discovery will not come from using genetic information alone. Recently, high-throughput technologies have enabled the creation of information-rich epigenetic maps, most notably for histone modifications. However, tools that search for functional elements using this epigenetic information have been lacking. Here, we describe an unsupervised learning method called ChromaSig to find, in an unbiased fashion, commonly occurring chromatin signatures in both tiling microarray and sequencing data. Applying this algorithm to nine chromatin marks across a 1% sampling of the human genome in HeLa cells, we recover eight clusters of distinct chromatin signatures, five of which correspond to known patterns associated with transcriptional promoters and enhancers. Interestingly, we observe that the distinct chromatin signatures found at enhancers mark distinct functional classes of enhancers in terms of transcription factor and coactivator binding. In addition, we identify three clusters of novel chromatin signatures that contain evolutionarily conserved sequences and potential cis-regulatory elements. Applying ChromaSig to a panel of 21 chromatin marks mapped genomewide by ChIP-Seq reveals 16 classes of genomic elements marked by distinct chromatin signatures. Interestingly, four classes containing enrichment for repressive histone modifications appear to be locally heterochromatic sites and are enriched in quickly evolving regions of the genome. The utility of this approach in uncovering novel, functionally significant genomic elements will aid future efforts of genome annotation via chromatin modifications.

  4. High-Throughput Sequencing Identifies MicroRNAs from Posterior Intestine of Loach (Misgurnus anguillicaudatus) and Their Response to Intestinal Air-Breathing Inhibition.

    Science.gov (United States)

    Huang, Songqian; Cao, Xiaojuan; Tian, Xianchang; Wang, Weimin

    2016-01-01

    MicroRNAs (miRNAs) exert important roles in animal growth, immunity, and development, and regulate gene expression at the post-transcriptional level. Knowledges about the diversities of miRNAs and their roles in accessory air-breathing organs (ABOs) of fish remain unknown. In this work, we used high-throughput sequencing to identify known and novel miRNAs from the posterior intestine, an important ABO, in loach (Misgurnus anguillicaudatus) under normal and intestinal air-breathing inhibited conditions. A total of 204 known and 84 novel miRNAs were identified, while 47 miRNAs were differentially expressed between the two small RNA libraries (i.e. between the normal and intestinal air-breathing inhibited group). Potential miRNA target genes were predicted by combining our transcriptome data of the posterior intestine of the loach under the same conditions, and then annotated using COG, GO, KEGG, Swissprot and Nr databases. The regulatory networks of miRNAs and their target genes were analyzed. The abundances of nine known miRNAs were validated by qRT-PCR. The relative expression profiles of six known miRNAs and their eight corresponding target genes, and two novel potential miRNAs were also detected. Histological characteristics of the posterior intestines in both normal and air-breathing inhibited group were further analyzed. This study contributes to our understanding on the functions and molecular regulatory mechanisms of miRNAs in accessory air-breathing organs of fish.

  5. High-Throughput Sequencing Identifies MicroRNAs from Posterior Intestine of Loach (Misgurnus anguillicaudatus and Their Response to Intestinal Air-Breathing Inhibition.

    Directory of Open Access Journals (Sweden)

    Songqian Huang

    Full Text Available MicroRNAs (miRNAs exert important roles in animal growth, immunity, and development, and regulate gene expression at the post-transcriptional level. Knowledges about the diversities of miRNAs and their roles in accessory air-breathing organs (ABOs of fish remain unknown. In this work, we used high-throughput sequencing to identify known and novel miRNAs from the posterior intestine, an important ABO, in loach (Misgurnus anguillicaudatus under normal and intestinal air-breathing inhibited conditions. A total of 204 known and 84 novel miRNAs were identified, while 47 miRNAs were differentially expressed between the two small RNA libraries (i.e. between the normal and intestinal air-breathing inhibited group. Potential miRNA target genes were predicted by combining our transcriptome data of the posterior intestine of the loach under the same conditions, and then annotated using COG, GO, KEGG, Swissprot and Nr databases. The regulatory networks of miRNAs and their target genes were analyzed. The abundances of nine known miRNAs were validated by qRT-PCR. The relative expression profiles of six known miRNAs and their eight corresponding target genes, and two novel potential miRNAs were also detected. Histological characteristics of the posterior intestines in both normal and air-breathing inhibited group were further analyzed. This study contributes to our understanding on the functions and molecular regulatory mechanisms of miRNAs in accessory air-breathing organs of fish.

  6. Circular RNA Profiling and Bioinformatic Modeling Identify Its Regulatory Role in Hepatic Steatosis.

    Science.gov (United States)

    Guo, Xing-Ya; He, Chong-Xin; Wang, Yu-Qin; Sun, Chao; Li, Guang-Ming; Su, Qing; Pan, Qin; Fan, Jian-Gao

    2017-01-01

    Circular RNAs (circRNAs) exhibit a wide range of physiological and pathological activities. To uncover their role in hepatic steatosis, we investigated the expression profile of circRNAs in HepG2-based hepatic steatosis induced by high-fat stimulation. Differentially expressed circRNAs were subjected to validation using QPCR and functional analyses using principal component analysis, hierarchical clustering, target prediction, gene ontology (GO), and pathway annotation, respectively. Bioinformatic integration established the circRNA-miRNA-mRNA regulatory network so as to identify the mechanisms underlying circRNAs' metabolic effect. Here we reported that hepatic steatosis was associated with a total of 357 circRNAs. Enrichment of transcription-related GOs, especially GO: 0006355, GO: 004589, GO: 0045944, GO: 0045892, and GO: 0000122, demonstrated their specific actions in transcriptional regulation. Lipin 1 (LPIN1) was recognized to mediate the transcriptional regulatory effect of circRNAs on metabolic pathways. circRNA-miRNA-mRNA network further identified the signaling cascade of circRNA_021412/miR-1972/LPIN1, which was characterized by decreased level of circRNA_021412 and miR-1972-based inhibition of LPIN1. LPIN1-induced downregulation of long chain acyl-CoA synthetases (ACSLs) expression finally resulted in the hepatosteatosis. These findings identify circRNAs to be important regulators of hepatic steatosis. Transcription-dependent modulation of metabolic pathways may underlie their effects, partially by the circRNA_021412/miR-1972/LPIN1 signaling.

  7. The PAZAR database of gene regulatory information coupled to the ORCA toolkit for the study of regulatory sequences

    Science.gov (United States)

    Portales-Casamar, Elodie; Arenillas, David; Lim, Jonathan; Swanson, Magdalena I.; Jiang, Steven; McCallum, Anthony; Kirov, Stefan; Wasserman, Wyeth W.

    2009-01-01

    The PAZAR database unites independently created and maintained data collections of transcription factor and regulatory sequence annotation. The flexible PAZAR schema permits the representation of diverse information derived from experiments ranging from biochemical protein–DNA binding to cellular reporter gene assays. Data collections can be made available to the public, or restricted to specific system users. The data ‘boutiques’ within the shopping-mall-inspired system facilitate the analysis of genomics data and the creation of predictive models of gene regulation. Since its initial release, PAZAR has grown in terms of data, features and through the addition of an associated package of software tools called the ORCA toolkit (ORCAtk). ORCAtk allows users to rapidly develop analyses based on the information stored in the PAZAR system. PAZAR is available at http://www.pazar.info. ORCAtk can be accessed through convenient buttons located in the PAZAR pages or via our website at http://www.cisreg.ca/ORCAtk. PMID:18971253

  8. Deep sequencing-based identification of small regulatory RNAs in Synechocystis sp. PCC 6803.

    Directory of Open Access Journals (Sweden)

    Wen Xu

    Full Text Available Synechocystis sp. PCC 6803 is a genetically tractable model organism for photosynthesis research. The genome of Synechocystis sp. PCC 6803 consists of a circular chromosome and seven plasmids. The importance of small regulatory RNAs (sRNAs as mediators of a number of cellular processes in bacteria has begun to be recognized. However, little is known regarding sRNAs in Synechocystis sp. PCC 6803. To provide a comprehensive overview of sRNAs in this model organism, the sRNAs of Synechocystis sp. PCC 6803 were analyzed using deep sequencing, and 7,951,189 reads were obtained. High quality mapping reads (6,127,890 were mapped onto the genome and assembled into 16,192 transcribed regions (clusters based on read overlap. A total number of 5211 putative sRNAs were revealed from the genome and the 4 megaplasmids, and 27 of these molecules, including four from plasmids, were confirmed by RT-PCR. In addition, possible target genes regulated by all of the putative sRNAs identified in this study were predicted by IntaRNA and analyzed for functional categorization and biological pathways, which provided evidence that sRNAs are indeed involved in many different metabolic pathways, including basic metabolic pathways, such as glycolysis/gluconeogenesis, the citrate cycle, fatty acid metabolism and adaptations to environmentally stress-induced changes. The information from this study provides a valuable reservoir for understanding the sRNA-mediated regulation of the complex physiology and metabolic processes of cyanobacteria.

  9. SSR_pipeline: a bioinformatic infrastructure for identifying microsatellites from paired-end Illumina high-throughput DNA sequencing data

    Science.gov (United States)

    Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.

    2013-01-01

    SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).

  10. SSR_pipeline: a bioinformatic infrastructure for identifying microsatellites from paired-end Illumina high-throughput DNA sequencing data.

    Science.gov (United States)

    Miller, Mark P; Knaus, Brian J; Mullins, Thomas D; Haig, Susan M

    2013-01-01

    SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25 bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).

  11. Genetic mapping and exome sequencing identify variants associated with five novel diseases.

    Directory of Open Access Journals (Sweden)

    Erik G Puffenberger

    Full Text Available The Clinic for Special Children (CSC has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain children. Among the Plain people, we have used single nucleotide polymorphism (SNP microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb that contain many genes (mean = 79. For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data.

  12. A novel cis-acting element required for DNA damage-inducible expression of yeast DIN7

    International Nuclear Information System (INIS)

    Yoshitani, Ayako; Yoshida, Minoru; Ling Feng

    2008-01-01

    Din7 is a DNA damage-inducible mitochondrial nuclease that modulates the stability of mitochondrial DNA (mtDNA) in Saccharomyces cerevisiae. How DIN7 gene expression is regulated, however, has remained largely unclear. Using promoter sequence alignment, we found a highly conserved 19-bp sequence in the promoter regions of DIN7 and NTG1, which encodes an oxidative stress-inducible base-excision-repair enzyme. Deletion of the 19-bp sequence markedly reduced the hydroxyurea (HU)-enhanced DIN7 promoter activity. In addition, nuclear fractions prepared from HU-treated cells were used in in vitro band shift assays to reveal the presence of currently unidentified trans-acting factor(s) that preferentially bound to the 19-bp region. These results suggest that the 19-bp sequence is a novel cis-acting element that is required for the regulation of DIN7 expression in response to HU-induced DNA damage

  13. Predicting effects of noncoding variants with deep learning-based sequence model.

    Science.gov (United States)

    Zhou, Jian; Troyanskaya, Olga G

    2015-10-01

    Identifying functional effects of noncoding variants is a major challenge in human genetics. To predict the noncoding-variant effects de novo from sequence, we developed a deep learning-based algorithmic framework, DeepSEA (http://deepsea.princeton.edu/), that directly learns a regulatory sequence code from large-scale chromatin-profiling data, enabling prediction of chromatin effects of sequence alterations with single-nucleotide sensitivity. We further used this capability to improve prioritization of functional variants including expression quantitative trait loci (eQTLs) and disease-associated variants.

  14. Mutation of miRNA target sequences during human evolution

    DEFF Research Database (Denmark)

    Gardner, Paul P; Vinther, Jeppe

    2008-01-01

    It has long-been hypothesized that changes in non-protein-coding genes and the regulatory sequences controlling expression could undergo positive selection. Here we identify 402 putative microRNA (miRNA) target sequences that have been mutated specifically in the human lineage and show that genes...... containing such deletions are more highly expressed than their mouse orthologs. Our findings indicate that some miRNA target mutations are fixed by positive selection and might have been involved in the evolution of human-specific traits....

  15. SPIRE, a modular pipeline for eQTL analysis of RNA-Seq data, reveals a regulatory hotspot controlling miRNA expression in C. elegans.

    Science.gov (United States)

    Kel, Ivan; Chang, Zisong; Galluccio, Nadia; Romeo, Margherita; Beretta, Stefano; Diomede, Luisa; Mezzelani, Alessandra; Milanesi, Luciano; Dieterich, Christoph; Merelli, Ivan

    2016-10-18

    The interpretation of genome-wide association study is difficult, as it is hard to understand how polymorphisms can affect gene regulation, in particular for trans-regulatory elements located far from their controlling gene. Using RNA or protein expression data as phenotypes, it is possible to correlate their variations with specific genotypes. This technique is usually referred to as expression Quantitative Trait Loci (eQTLs) analysis and only few packages exist for the integration of genotype patterns and expression profiles. In particular, tools are needed for the analysis of next-generation sequencing (NGS) data on a genome-wide scale, which is essential to identify eQTLs able to control a large number of genes (hotspots). Here we present SPIRE (Software for Polymorphism Identification Regulating Expression), a generic, modular and functionally highly flexible pipeline for eQTL processing. SPIRE integrates different univariate and multivariate approaches for eQTL analysis, paying particular attention to the scalability of the procedure in order to support cis- as well as trans-mapping, thus allowing the identification of hotspots in NGS data. In particular, we demonstrated how SPIRE can handle big association study datasets, reproducing published results and improving the identification of trans-eQTLs. Furthermore, we employed the pipeline to analyse novel data concerning the genotypes of two different C. elegans strains (N2 and Hawaii) and related miRNA expression data, obtained using RNA-Seq. A miRNA regulatory hotspot was identified in chromosome 1, overlapping the transcription factor grh-1, known to be involved in the early phases of embryonic development of C. elegans. In a follow-up qPCR experiment we were able to verify most of the predicted eQTLs, as well as to show, for a novel miRNA, a significant difference in the sequences of the two analysed strains of C. elegans. SPIRE is publicly available as open source software at , together with some example

  16. 77 FR 7960 - Unified Agenda of Federal Regulatory and Deregulatory Actions

    Science.gov (United States)

    2012-02-13

    ... Sequence No. Title Identifier No. 377 Claims Procedures Under 1625-AA03 the Oil Pollution Act of 1990 (USCG... Regulatory Flexibility Analysis Required: Yes. Agency Contact: Jeremy F. Olson, Senior Procurement Analyst... Procedures Under the Oil Pollution Act of 1990 (USCG-2004- 17697) Legal Authority: 33 U.S.C. 2713 and 2714...

  17. Unleashing the genome of Brassica rapa

    Directory of Open Access Journals (Sweden)

    Haibao eTang

    2012-07-01

    Full Text Available The completion and release of the Brassica rapa genome is of great benefit to researchers of the Brassicas, Arabidopsis, and genome evolution. While its lineage is closely related to the model organism Arabidopsis thaliana, the Brassicas experienced a whole genome triplication subsequent to their divergence. This event contemporaneously created three copies of its ancestral genome, which had diploidized through the process of homeologous gene loss known as fractionation. By the fractionation of homeologous gene content and genetic regulatory binding sites, Brassica’s genome is well placed to use comparative genomic techniques to identify syntenic regions, homeologous gene duplications, and putative regulatory sequences. Here, we use the comparative genomics platform CoGe to perform several different genomic analyses with which to study structural changes of its genome and dynamics of various genetic elements. Starting with whole genome comparisons, the Brassica paleohexaploidy is characterized, syntenic regions with Arabidopsis thaliana are identified, and the TOC1 gene in the circadian rhythm pathway from Arabidopsis thaliana is used to find duplicated orthologs in Brassica rapa. These TOC1 genes are further analyzed to identify conserved noncoding sequences that contain cis-acting regulatory elements and promoter sequences previously implicated in circadian rhythmicity. Each 'cookbook style' analysis includes a step-by-step walkthrough with links to CoGe to quickly reproduce each step of the analytical process.

  18. Separation and identification of beta-carotene and its cis isomers by high pressure liquid chromatography (HPLC); Separacion e identificacion del beta-caroteno y sus isomeros cis por cromatografia liquida de alta resolucion (HPLC)

    Energy Technology Data Exchange (ETDEWEB)

    Carrillo de Padilla, F [Universidad Central de Venezuela (UCV), Facultad de Farmacia, Catedra de Analisis de Alimentos, Caracas (Venezuela)

    1996-07-01

    The separation and identification by HPLC of the cis isomers of beta-carotene was studied. A 1.26 mg/ml beta-carotene solution previously isomerized with iodine as a catalyst, was eluted with 2% acetone in hexane, from a Ca(OH)2 chromatographic column in three bands. The fractions were identified by spectrophotometry and the retention times of 2.05, 2.4 and 2.8 min for the 13 cis, all-trans, and 9 cis beta-carotene isomers, determined by HPLC, with 1% acetone in hexane as Mobil phase. 22.13 mg % of all-trans beta-carotene were found in a sample of canned carrots. It is recommended the analyses of a greater number of samples, the determination of the method's sensitivity, reproducibility, and the use of a standard of reference of a response factor for calculations.

  19. In situ detection of a heat-shock regulatory element binding protein using a soluble short synthetic enhancer sequence

    Energy Technology Data Exchange (ETDEWEB)

    Harel-Bellan, A; Brini, A T; Farrar, W L [National Cancer Institute, Frederick, MD (USA); Ferris, D K [Program Resources, Inc., Frederick, MD (USA); Robin, P [Institut Gustave Roussy, Villejuif (France)

    1989-06-12

    In various studies, enhancer binding proteins have been successfully absorbed out by competing sequences inserted into plasmids, resulting in the inhibition of the plasmid expression. Theoretically, such a result could be achieved using synthetic enhancer sequences not inserted into plasmids. In this study, a double stranded DNA sequence corresponding to the human heat shock regulatory element was chemically synthesized. By in vitro retardation assays, the synthetic sequence was shown to bind specifically a protein in extracts from the human T cell line Jurkat. When the synthetic enhancer was electroporated into Jurkat cells, not only the enhancer was shown to remain undegraded into the cells for up to 2 days, but also its was shown to bind intracellularly a protein. The binding was specific and was modulated upon heat shock. Furthermore, the binding protein was shown to be of the expected molecular weight by UV crosslinking. However, when the synthetic enhancer element was co-electroporated with an HSP 70-CAT reporter construct, the expression of the reporter plasmid was consistently enhanced in the presence of the exogenous synthetic enhancer.

  20. Stepwise encapsulation and controlled two-stage release system for cis-Diamminediiodoplatinum.

    Science.gov (United States)

    Chen, Yun; Li, Qian; Wu, Qingsheng

    2014-01-01

    cis-Diamminediiodoplatinum (cis-DIDP) is a cisplatin-like anticancer drug with higher anticancer activity, but lower stability and price than cisplatin. In this study, a cis-DIDP carrier system based on micro-sized stearic acid was prepared by an emulsion solvent evaporation method. The maximum drug loading capacity of cis-DIDP-loaded solid lipid nanoparticles was 22.03%, and their encapsulation efficiency was 97.24%. In vitro drug release in phosphate-buffered saline (pH =7.4) at 37.5°C exhibited a unique two-stage process, which could prove beneficial for patients with tumors and malignancies. MTT (3-[4,5-dimethylthiazol-2-yl]-2, 5-diphenyltetrazolium bromide) assay results showed that cis-DIDP released from cis-DIDP-loaded solid lipid nanoparticles had better inhibition activity than cis-DIDP that had not been loaded.

  1. Fringe proteins modulate Notch-ligand cis and trans interactions to specify signaling states.

    Science.gov (United States)

    LeBon, Lauren; Lee, Tom V; Sprinzak, David; Jafar-Nejad, Hamed; Elowitz, Michael B

    2014-09-25

    The Notch signaling pathway consists of multiple types of receptors and ligands, whose interactions can be tuned by Fringe glycosyltransferases. A major challenge is to determine how these components control the specificity and directionality of Notch signaling in developmental contexts. Here, we analyzed same-cell (cis) Notch-ligand interactions for Notch1, Dll1, and Jag1, and their dependence on Fringe protein expression in mammalian cells. We found that Dll1 and Jag1 can cis-inhibit Notch1, and Fringe proteins modulate these interactions in a way that parallels their effects on trans interactions. Fringe similarly modulated Notch-ligand cis interactions during Drosophila development. Based on these and previously identified interactions, we show how the design of the Notch signaling pathway leads to a restricted repertoire of signaling states that promote heterotypic signaling between distinct cell types, providing insight into the design principles of the Notch signaling system, and the specific developmental process of Drosophila dorsal-ventral boundary formation.

  2. The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

    Science.gov (United States)

    Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

    2016-01-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095

  3. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons.

    Science.gov (United States)

    Braasch, Ingo; Gehrke, Andrew R; Smith, Jeramiah J; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M; Campbell, Michael S; Barrell, Daniel; Martin, Kyle J; Mulley, John F; Ravi, Vydianathan; Lee, Alison P; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E G; Sun, Yi; Hertel, Jana; Beam, Michael J; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H; Litman, Gary W; Litman, Ronda T; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F; Wang, Han; Taylor, John S; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M J; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T; Venkatesh, Byrappa; Holland, Peter W H; Guiguen, Yann; Bobe, Julien; Shubin, Neil H; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H

    2016-04-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before teleost genome duplication (TGD). The slowly evolving gar genome has conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization and development (mediated, for example, by Hox, ParaHox and microRNA genes). Numerous conserved noncoding elements (CNEs; often cis regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles for such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses showed that the sums of expression domains and expression levels for duplicated teleost genes often approximate the patterns and levels of expression for gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes and the function of human regulatory sequences.

  4. Characterization of STAT5B phosphorylation correlating with expression of cytokine-inducible SH2-containing protein (CIS).

    Science.gov (United States)

    Cooper, John C; Boustead, Jared N; Yu, Chao-Lan

    2006-06-01

    Cytokine-inducible SH2-containing protein (CIS) is the first identified member of genes encoding for the suppressor of cytokine signaling (SOCS). CIS is also a well-known target gene of signal transducer and activator of transcription 5 (STAT5) pathways, providing normal negative feedback control of signaling by cytokines and growth factors. Three other SOCS genes, SOCS1, SOCS2, and SOCS3, can be silenced by DNA hypermethylation in human cancers, suggesting a potential mechanism for constitutive STAT activation. However, it is not known whether CIS expression is similarly perturbed in tumor cells. We report here the absence of CIS expression in T lymphoma LSTRA that overexpresses the Lck protein tyrosine kinase and exhibits elevated STAT5 activity. Pervanadate-induced CIS expression and STAT5 binding to the CIS promoter in vivo over a short time course implies that mechanisms other than DNA hypermethylation may contribute to defective CIS expression in LSTRA cells. Comparison with cytokine-dependent BaF3 cells stimulated with interleukin-3 (IL-3) further reveals that CIS induction correlates with specific STAT5b post-translational modifications. It exhibits as the slowest migrating form through SDS-polyacrylamide gel electrophoresis (SDS-PAGE) analysis. This distinctly modified STAT5b is the predominant form that binds to the consensus STAT5 sites in the CIS promoter and accumulates in the nucleus. In vitro phosphatase assays and phosphoamino acid analysis suggest the involvement of phosphorylation on residues other than the highly conserved tyrosine and serine sites in this distinct STAT5b mobility shift. All together, our results provide a novel link between incomplete STAT5b phosphorylation and defective SOCS gene expression in cancer cells.

  5. Inverted repeats in the promoter as an autoregulatory sequence for TcrX in Mycobacterium tuberculosis

    International Nuclear Information System (INIS)

    Bhattacharya, Monolekha; Das, Amit Kumar

    2011-01-01

    Highlights: ► The regulatory sequences recognized by TcrX have been identified. ► The regulatory region comprises of inverted repeats segregated by 30 bp region. ► The mode of binding of TcrX with regulatory sequence is unique. ► In silico TcrX–DNA docked model binds one of the inverted repeats. ► Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has not been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by ∼30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.

  6. Systematic identification and characterization of regulatory elements derived from human endogenous retroviruses.

    Directory of Open Access Journals (Sweden)

    Jumpei Ito

    2017-07-01

    Full Text Available Human endogenous retroviruses (HERVs and other long terminal repeat (LTR-type retrotransposons (HERV/LTRs have regulatory elements that possibly influence the transcription of host genes. We systematically identified and characterized these regulatory elements based on publicly available datasets of ChIP-Seq of 97 transcription factors (TFs provided by ENCODE and Roadmap Epigenomics projects. We determined transcription factor-binding sites (TFBSs using the ChIP-Seq datasets and identified TFBSs observed on HERV/LTR sequences (HERV-TFBSs. Overall, 794,972 HERV-TFBSs were identified. Subsequently, we identified "HERV/LTR-shared regulatory element (HSRE," defined as a TF-binding motif in HERV-TFBSs, shared within a substantial fraction of a HERV/LTR type. HSREs could be an indication that the regulatory elements of HERV/LTRs are present before their insertions. We identified 2,201 HSREs, comprising specific associations of 354 HERV/LTRs and 84 TFs. Clustering analysis showed that HERV/LTRs can be grouped according to the TF binding patterns; HERV/LTR groups bounded to pluripotent TFs (e.g., SOX2, POU5F1, and NANOG, embryonic endoderm/mesendoderm TFs (e.g., GATA4/6, SOX17, and FOXA1/2, hematopoietic TFs (e.g., SPI1 (PU1, GATA1/2, and TAL1, and CTCF were identified. Regulatory elements of HERV/LTRs tended to locate nearby and/or interact three-dimensionally with the genes involved in immune responses, indicating that the regulatory elements play an important role in controlling the immune regulatory network. Further, we demonstrated subgroup-specific TF binding within LTR7, LTR5B, and LTR5_Hs, indicating that gains or losses of the regulatory elements occurred during genomic invasions of the HERV/LTRs. Finally, we constructed dbHERV-REs, an interactive database of HERV/LTR regulatory elements (http://herv-tfbs.com/. This study provides fundamental information in understanding the impact of HERV/LTRs on host transcription, and offers insights into

  7. Exome Sequencing Fails to Identify the Genetic Cause of Aicardi Syndrome

    DEFF Research Database (Denmark)

    Lund, Caroline; Striano, Pasquale; Sorte, Hanne Sørmo

    2016-01-01

    Aicardi syndrome (AS) is a well-characterized neurodevelopmental disorder with an unknown etiology. In this study, we performed whole-exome sequencing in 11 female patients with the diagnosis of AS, in order to identify the disease-causing gene. In particular, we focused on detecting variants in ...

  8. SeqAnt: A web service to rapidly identify and annotate DNA sequence variations

    Directory of Open Access Journals (Sweden)

    Patel Viren

    2010-09-01

    Full Text Available Abstract Background The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. Results SeqAnt (Sequence Annotator is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. Conclusion SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.

  9. Unique Trichomonas vaginalis gene sequences identified in multinational regions of Northwest China.

    Science.gov (United States)

    Liu, Jun; Feng, Meng; Wang, Xiaolan; Fu, Yongfeng; Ma, Cailing; Cheng, Xunjia

    2017-07-24

    Trichomonas vaginalis (T. vaginalis) is a flagellated protozoan parasite that infects humans worldwide. This study determined the sequence of the 18S ribosomal RNA gene of T. vaginalis infecting both females and males in Xinjiang, China. Samples from 73 females and 28 males were collected and confirmed for infection with T. vaginalis, a total of 110 sequences were identified when the T. vaginalis 18S ribosomal RNA gene was sequenced. These sequences were used to prepare a phylogenetic network. The rooted network comprised three large clades and several independent branches. Most of the Xinjiang sequences were in one group. Preliminary results suggest that Xinjiang T. vaginalis isolates might be genetically unique, as indicated by the sequence of their 18S ribosomal RNA gene. Low migration rate of local people in this province may contribute to a genetic conservativeness of T. vaginalis. The unique genetic feature of our isolates may suggest a different clinical presentation of trichomoniasis, including metronidazole susceptibility, T. vaginalis virus or Mycoplasma co-infection characteristics. The transmission and evolution of Xinjiang T. vaginalis is of interest and should be studied further. More attention should be given to T. vaginalis infection in both females and males in Xinjiang.

  10. Relative Stability of cis- and trans-Hydrindanones

    Directory of Open Access Journals (Sweden)

    Motoo Tori

    2015-01-01

    Full Text Available The relative stabilities of several cis- and trans-hydrindanones were compared using both isomerization experiments and MM2 calculations. The generally believed rule that cis-hydrindanones are more stable than trans-isomers is applicable, but is not always true. This review introduces examples, mainly from studies in our laboratory, to explain these facts.

  11. SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.

    Directory of Open Access Journals (Sweden)

    Steven N Hart

    Full Text Available BACKGROUND: Structural variation (SV represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. RESULTS: We developed and validated SoftSearch using real and synthetic datasets. SoftSearch's key features are 1 not requiring secondary (or exhaustive primary alignment, 2 portability into established sequencing workflows, and 3 is applicable to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.. SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. CONCLUSIONS: We show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance.

  12. Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

    Science.gov (United States)

    Hong, Jungeui; Gresham, David

    2017-11-01

    Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.

  13. Overlapping positive and negative regulatory domains of the human β-interferon gene

    International Nuclear Information System (INIS)

    Goodbourn, S.; Maniatis, T.

    1988-01-01

    Virus of poly(I) x poly(C) induction of human β-interferon gene expression requires a 40-base-pair DNA sequence designated the interferon gene regulatory element (IRE). Previous studies have shown that the IRE contains both positive and negative regulatory DNA sequences. To localize these sequences and study their interactions, the authors have examined the effects of a large number of single-base mutations within the IRE on β-interferon gene regulation. They find that the IRE consists of two genetically separable positive regulatory domains and an overlapping negative control sequence. They propose that the β-interferon gene is switched off in uninduced cells by a repressor that blocks the interaction between one of the two positive regulatory sequences and a specific transcription factor. Induction would then lead to inactivation or displacement of the repressor and binding of transcription factors to both positive regulatory domains

  14. The Community Intercomparison Suite (CIS)

    Science.gov (United States)

    Watson-Parris, Duncan; Schutgens, Nick; Cook, Nick; Kipling, Zak; Kershaw, Phil; Gryspeerdt, Ed; Lawrence, Bryan; Stier, Philip

    2017-04-01

    Earth observations (both remote and in-situ) create vast amounts of data providing invaluable constraints for the climate science community. Efficient exploitation of these complex and highly heterogeneous datasets has been limited however by the lack of suitable software tools, particularly for comparison of gridded and ungridded data, thus reducing scientific productivity. CIS (http://cistools.net) is an open-source, command line tool and Python library which allows the straight-forward quantitative analysis, intercomparison and visualisation of remote sensing, in-situ and model data. The CIS can read gridded and ungridded remote sensing, in-situ and model data - and many other data sources 'out-of-the-box', such as ESA Aerosol and Cloud CCI product, MODIS, Cloud CCI, Cloudsat, AERONET. Perhaps most importantly however CIS also employs a modular plugin architecture to allow for the reading of limitless different data types. Users are able to write their own plugins for reading the data sources which they are familiar with, and share them within the community, allowing all to benefit from their expertise. To enable the intercomparison of this data the CIS provides a number of operations including: the aggregation of ungridded and gridded datasets to coarser representations using a number of different built in averaging kernels; the subsetting of data to reduce its extent or dimensionality; the co-location of two distinct datasets onto a single set of co-ordinates; the visualisation of the input or output data through a number of different plots and graphs; the evaluation of arbitrary mathematical expressions against any number of datasets; and a number of other supporting functions such as a statistical comparison of two co-located datasets. These operations can be performed efficiently on local machines or large computing clusters - and is already available on the JASMIN computing facility. A case-study using the GASSP collection of in-situ aerosol observations

  15. Regulatory Roles for Long ncRNA and mRNA

    International Nuclear Information System (INIS)

    Karapetyan, Armen R.; Buiting, Coen; Kuiper, Renske A.; Coolen, Marcel W.

    2013-01-01

    Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs) are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2) with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research

  16. Regulatory Roles for Long ncRNA and mRNA

    Energy Technology Data Exchange (ETDEWEB)

    Karapetyan, Armen R.; Buiting, Coen; Kuiper, Renske A.; Coolen, Marcel W., E-mail: M.Coolen@gen.umcn.nl [Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences (NCMLS), Radboud University Nijmegen Medical Centre, P.O. Box 9101, Nijmegen 6500 HB (Netherlands)

    2013-04-26

    Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs) are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2) with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research.

  17. Whole-exome sequencing identifies USH2A mutations in a pseudo-dominant Usher syndrome family.

    Science.gov (United States)

    Zheng, Sui-Lian; Zhang, Hong-Liang; Lin, Zhen-Lang; Kang, Qian-Yan

    2015-10-01

    Usher syndrome (USH) is an autosomal recessive (AR) multi-sensory degenerative disorder leading to deaf-blindness. USH is clinically subdivided into three subclasses, and 10 genes have been identified thus far. Clinical and genetic heterogeneities in USH make a precise diagnosis difficult. A dominant‑like USH family in successive generations was identified, and the present study aimed to determine the genetic predisposition of this family. Whole‑exome sequencing was performed in two affected patients and an unaffected relative. Systematic data were analyzed by bioinformatic analysis to remove the candidate mutations via step‑wise filtering. Direct Sanger sequencing and co‑segregation analysis were performed in the pedigree. One novel and two known mutations in the USH2A gene were identified, and were further confirmed by direct sequencing and co‑segregation analysis. The affected mother carried compound mutations in the USH2A gene, while the unaffected father carried a heterozygous mutation. The present study demonstrates that whole‑exome sequencing is a robust approach for the molecular diagnosis of disorders with high levels of genetic heterogeneity.

  18. Conserved Transcriptional Regulatory Programs Underlying Rice and Barley Germination

    Science.gov (United States)

    Lin, Li; Tian, Shulan; Kaeppler, Shawn; Liu, Zongrang; An, Yong-Qiang (Charles)

    2014-01-01

    Germination is a biological process important to plant development and agricultural production. Barley and rice diverged 50 million years ago, but share a similar germination process. To gain insight into the conservation of their underlying gene regulatory programs, we compared transcriptomes of barley and rice at start, middle and end points of germination, and revealed that germination regulated barley and rice genes (BRs) diverged significantly in expression patterns and/or protein sequences. However, BRs with higher protein sequence similarity tended to have more conserved expression patterns. We identified and characterized 316 sets of conserved barley and rice genes (cBRs) with high similarity in both protein sequences and expression patterns, and provided a comprehensive depiction of the transcriptional regulatory program conserved in barley and rice germination at gene, pathway and systems levels. The cBRs encoded proteins involved in a variety of biological pathways and had a wide range of expression patterns. The cBRs encoding key regulatory components in signaling pathways often had diverse expression patterns. Early germination up-regulation of cell wall metabolic pathway and peroxidases, and late germination up-regulation of chromatin structure and remodeling pathways were conserved in both barley and rice. Protein sequence and expression pattern of a gene change quickly if it is not subjected to a functional constraint. Preserving germination-regulated expression patterns and protein sequences of those cBRs for 50 million years strongly suggests that the cBRs are functionally significant and equivalent in germination, and contribute to the ancient characteristics of germination preserved in barley and rice. The functional significance and equivalence of the cBR genes predicted here can serve as a foundation to further characterize their biological functions and facilitate bridging rice and barley germination research with greater confidence. PMID

  19. Optimization of Pseudomonas putida KT2440 as host for the production of cis, cis-muconate from benzoate

    OpenAIRE

    Duuren, van, J.B.J.H.

    2011-01-01

    Optimization of Pseudomonas putida KT2440 as host for the production of cis, cis-muconate from benzoate P. putida KT2440 was used as biocatalyst given its versatile and energetically robust metabolism. Therefore, a mutant was generated and a process developed based on which a life cycle assessment (LCA) was performed. Additionally, the growth related parameters were experimentally obtained to constrain the metabolic model iJP815 further. The mutant Pseudomonas putida KT2440-JD1 was deri...

  20. An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data.

    Science.gov (United States)

    Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E; Greenwood, Alex D

    2015-11-24

    Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.

  1. An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data

    Science.gov (United States)

    Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E.; Greenwood, Alex D.

    2015-01-01

    Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals. PMID:26610552

  2. Genome-wide identification of regulatory elements and reconstruction of gene regulatory networks of the green alga Chlamydomonas reinhardtii under carbon deprivation.

    Directory of Open Access Journals (Sweden)

    Flavia Vischi Winck

    Full Text Available The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1 gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF and transcription regulator (TR genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO 2 response regulator 1 and Lcr2 (Low-CO2 response regulator 2, may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome

  3. RegTransBase - A Database Of Regulatory Sequences and Interactionsin a Wide Range of Prokaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kazakov, Alexei E.; Cipriano, Michael J.; Novichkov, Pavel S.; Minovitsky, Simon; Vinogradov, Dmitry V.; Arkin, Adam; Mironov, AndreyA.; Gelfand, Mikhail S.; Dubchak, Inna

    2006-07-01

    RegTransBase, a manually curated database of regulatoryinteractions in prokaryotes, captures the knowledge in publishedscientific literature using a controlled vocabulary. Although a number ofdatabases describing interactions between regulatory proteins and theirbinding sites are currently being maintained, they focus mostly on themodel organisms Escherichia coli and Bacillus subtilis, or are entirelycomputationally derived. RegTransBase describes a large number ofregulatory interactions reported in many organisms and contains varioustypes of experimental data, in particular: the activation or repressionof transcription by an identified direct regulator; determining thetranscriptional regulatory function of a protein (or RNA) directlybinding to DNA (RNA); mapping or prediction of binding site for aregulatory protein; characterization of regulatory mutations. Currently,the RegTransBase content is derived from about 3000 relevant articlesdescribing over 7000 experiments in relation to 128 microbes. It containsdata on the regulation of about 7500 genes and evidence for 6500interactions with 650 regulators. RegTransBase also contains manuallycreated position weight matrices (PWM) that can be used to identifycandidate regulatory sites in over 60 species. RegTransBase is availableat http://regtransbase.lbl.gov.

  4. Structural and mutational analyses of cis-acting sequences in the 5'-untranslated region of satellite RNA of bamboo mosaic potexvirus

    International Nuclear Information System (INIS)

    Annamalai, Padmanaban; Hsu, Y.-H.; Liu, Y.-P.; Tsai, C.-H.; Lin, N.-S.

    2003-01-01

    The satellite RNA of Bamboo mosaic virus (satBaMV) contains on open reading frame for a 20-kDa protein that is flanked by a 5'-untranslated region (UTR) of 159 nucleotides (nt) and a 3'-UTR of 129 nt. A secondary structure was predicted for the 5'-UTR of satBaMV RNA, which folds into a large stem-loop (LSL) and a small stem-loop. Enzymatic probing confirmed the existence of LSL (nt 8-138) in the 5'-UTR. The essential cis-acting sequences in the 5'-UTR required for satBaMV RNA replication were determined by deletion and substitution mutagenesis. Their replication efficiencies were analyzed in Nicotiana benthamiana protoplasts and Chenopodium quinoa plants coinoculated with helper BaMV RNA. All deletion mutants abolished the replication of satBaMV RNA, whereas mutations introduced in most of the loop regions and stems showed either no replication or a decreased replication efficiency. Mutations that affected the positive-strand satBaMV RNA accumulation also affected the accumulation of negative-strand RNA; however, the accumulation of genomic and subgenomic RNAs of BaMV were not affected. Moreover, covariation analyses of natural satBaMV variants provide substantial evidence that the secondary structure in the 5'-UTR of satBaMV is necessary for efficient replication

  5. Evidence for multiple major histocompatibility class II X-box binding proteins.

    OpenAIRE

    Celada, A; Maki, R

    1989-01-01

    The X box is a loosely conserved DNA sequence that is located upstream of all major histocompatibility class II genes and is one of the cis-acting regulatory elements. Despite the similarity between all X-box sequences, each promoter-proximal X box in the mouse appears to bind a separate nuclear factor.

  6. Alternative splicing events identified in human embryonic stem cells and neural progenitors.

    Directory of Open Access Journals (Sweden)

    Gene W Yeo

    2007-10-01

    Full Text Available Human embryonic stem cells (hESCs and neural progenitor (NP cells are excellent models for recapitulating early neuronal development in vitro, and are key to establishing strategies for the treatment of degenerative disorders. While much effort had been undertaken to analyze transcriptional and epigenetic differences during the transition of hESC to NP, very little work has been performed to understand post-transcriptional changes during neuronal differentiation. Alternative RNA splicing (AS, a major form of post-transcriptional gene regulation, is important in mammalian development and neuronal function. Human ESC, hESC-derived NP, and human central nervous system stem cells were compared using Affymetrix exon arrays. We introduced an outlier detection approach, REAP (Regression-based Exon Array Protocol, to identify 1,737 internal exons that are predicted to undergo AS in NP compared to hESC. Experimental validation of REAP-predicted AS events indicated a threshold-dependent sensitivity ranging from 56% to 69%, at a specificity of 77% to 96%. REAP predictions significantly overlapped sets of alternative events identified using expressed sequence tags and evolutionarily conserved AS events. Our results also reveal that focusing on differentially expressed genes between hESC and NP will overlook 14% of potential AS genes. In addition, we found that REAP predictions are enriched in genes encoding serine/threonine kinase and helicase activities. An example is a REAP-predicted alternative exon in the SLK (serine/threonine kinase 2 gene that is differentially included in hESC, but skipped in NP as well as in other differentiated tissues. Lastly, comparative sequence analysis revealed conserved intronic cis-regulatory elements such as the FOX1/2 binding site GCAUG as being proximal to candidate AS exons, suggesting that FOX1/2 may participate in the regulation of AS in NP and hESC. In summary, a new methodology for exon array analysis was introduced

  7. Genetic architecture for susceptibility to gout in the KARE cohort study.

    Science.gov (United States)

    Shin, Jimin; Kim, Younyoung; Kong, Minyoung; Lee, Chaeyoung

    2012-06-01

    This study aimed to identify functional associations of cis-regulatory regions with gout susceptibility using data resulted from a genome-wide association study (GWAS), and to show a genetic architecture for gout with interaction effects among genes within each of the identified functions. The GWAS was conducted with 8314 control subjects and 520 patients with gout in the Korea Association REsource cohort. However, genetic associations with any individual nucleotide variants were not discovered by Bonferroni multiple testing in the GWAS (P>1.42 × 10(-7)). Genomic regions enrichment analysis was employed to identify functional associations of cis-regulatory regions. This analysis revealed several biological processes associated with gout susceptibility, and they were quite different from those with serum uric acid level. Epistasis for susceptibility to gout was estimated using entropy decomposition with selected genes within each biological process identified by the genomic regions enrichment analysis. Some epistases among nucleotide sequence variants for gout susceptibility were found to be larger than their individual effects. This study provided the first evidence that genetic factors for gout susceptibility greatly differed from those for serum uric acid level, which may suggest that research endeavors for identifying genetic factors for gout susceptibility should not be heavily dependent on pathogenesis of uric acid. Interaction effects between genes should be examined to explain a large portion of phenotypic variability for gout susceptibility.

  8. Mediation Analysis Demonstrates That Trans-eQTLs Are Often Explained by Cis-Mediation: A Genome-Wide Analysis among 1,800 South Asians

    Science.gov (United States)

    Pierce, Brandon L.; Tong, Lin; Chen, Lin S.; Rahaman, Ronald; Argos, Maria; Jasmine, Farzana; Roy, Shantanu; Paul-Brutus, Rachelle; Westra, Harm-Jan; Franke, Lude; Esko, Tonu; Zaman, Rakibuz; Islam, Tariqul; Rahman, Mahfuzar; Baron, John A.; Kibriya, Muhammad G.; Ahsan, Habibul

    2014-01-01

    A large fraction of human genes are regulated by genetic variation near the transcribed sequence (cis-eQTL, expression quantitative trait locus), and many cis-eQTLs have implications for human disease. Less is known regarding the effects of genetic variation on expression of distant genes (trans-eQTLs) and their biological mechanisms. In this work, we use genome-wide data on SNPs and array-based expression measures from mononuclear cells obtained from a population-based cohort of 1,799 Bangladeshi individuals to characterize cis- and trans-eQTLs and determine if observed trans-eQTL associations are mediated by expression of transcripts in cis with the SNPs showing trans-association, using Sobel tests of mediation. We observed 434 independent trans-eQTL associations at a false-discovery rate of 0.05, and 189 of these trans-eQTLs were also cis-eQTLs (enrichment Pmediator based on Sobel Pmediation signals in two European cohorts, and while only 7 trans-eQTL associations were present in one or both cohorts, 6 showed evidence of cis-mediation. Analyses of simulated data show that complete mediation will be observed as partial mediation in the presence of mediator measurement error or imperfect LD between measured and causal variants. Our data demonstrates that trans-associations can become significantly stronger or switch directions after adjusting for a potential mediator. Using simulated data, we demonstrate that this phenomenon is expected in the presence of strong cis-trans confounding and when the measured cis-transcript is correlated with the true (unmeasured) mediator. In conclusion, by applying mediation analysis to eQTL data, we show that a substantial fraction of observed trans-eQTL associations can be explained by cis-mediation. Future studies should focus on understanding the mechanisms underlying widespread cis-mediation and their relevance to disease biology, as well as using mediation analysis to improve eQTL discovery. PMID:25474530

  9. Chiral synthesis of (Z)-3-cis-6,7-cis-9,10-diepoxyhenicosenes, sex pheromone components of the satin moth, Leucoma salicis.

    Science.gov (United States)

    Wimalaratne, Priyantha D C; Slessor, Keith N

    2004-06-01

    All four isomers of (Z)-3-cis-6,7-cis-9,10-diepoxyhenicosenes, 1-4, have been synthesized using D-xylose as the chirally pure starting material. D-Xylose was first converted to 2-deoxy-4,5-O-isopropylidene-3-t-butyldimethylsilyl-D-threopentose 11, via several steps of selective protection, dehydroxylation, and deprotection. Wittig coupling of 11 with nonyltriphenylphosphonium bromide followed by hydrogenation and acid catalyzed deprotection of hydroxyl groups yielded the chiral (2R,3R)-1,2,3-triol, 14, which was used as the precursor for the C-8 to C-21 unit of the (Z)-3-cis-6,7-cis-9,10-diepoxyhenicosenes. Selective tosylation of 14 followed by stereospecific cyclization yielded (2R,3R)-1,2-epoxytetradecan-3-ol, 16, which was then divergently converted to the t-butyldimethylsilyl ether 17 and tosylate 22, respectively. Establishment of the C-5 through C-7 unit of the target molecules was accomplished via regiospecific coupling of 17 with 1-t-butyldimethylsiloxy-2-propyne to form 18. Stepwise transformation of 18 via the formation of tosylate 19, desilylation, and stereospecific cyclization to form epoxy alcohol 20, followed by P2-Ni reduction yielded a key intermediate, allylic epoxy alcohol (Z)-2-(5S,6R)-cis-5,6-epoxyheptadecen-1-ol, 21. Similarly, the coupling of 22 with 1-t-butyldimethylsiloxy-2-propyne yielded 23, which was stereospecifically cyclized to form 24. Desilylation and P2-Ni reduction of 24 gave the antipodal intermediate, (Z)-2-(5R,6S)-cis-5,6-epoxyheptadecen-1-ol, 26. Asymmetric epoxidation of antipodes 21 and 26 with (L)- or (D)-diethyl tartrates resulted in the formation of diepoxy alcohols 27 and 29 from 21, and 33 and 31 from 26, respectively. Tosylation of these diepoxy alcohols followed by coupling with lithium dibutenyl cuprate yielded the four stereoisomers of (Z)-3-cis-6,7-cis-9,10-diepoxyhenicosenes, 1-4. Analysis of the retention characteristics of these materials revealed that one or both of the S*,R*,S*,R* stereoisomers comprise the

  10. Identifying driver mutations in sequenced cancer genomes

    DEFF Research Database (Denmark)

    Raphael, Benjamin J; Dobson, Jason R; Oesper, Layla

    2014-01-01

    High-throughput DNA sequencing is revolutionizing the study of cancer and enabling the measurement of the somatic mutations that drive cancer development. However, the resulting sequencing datasets are large and complex, obscuring the clinically important mutations in a background of errors, nois...... patterns of mutual exclusivity. These techniques, coupled with advances in high-throughput DNA sequencing, are enabling precision medicine approaches to the diagnosis and treatment of cancer....

  11. Deep sequencing of uveal melanoma identifies a recurrent mutation in PLCB4

    DEFF Research Database (Denmark)

    Johansson, Peter; Aoude, Lauren G; Wadt, Karin

    2016-01-01

    Next generation sequencing of uveal melanoma (UM) samples has identified a number of recurrent oncogenic or loss-of-function mutations in key driver genes including: GNAQ, GNA11, EIF1AX, SF3B1 and BAP1. To search for additional driver mutations in this tumor type we carried out whole......, instead, a BRCA mutation signature predominated. In addition to mutations in the known UM driver genes, we found a recurrent mutation in PLCB4 (c.G1888T, p.D630Y, NM_000933), which was validated using Sanger sequencing. The identical mutation was also found in published UM sequence data (1 of 56 tumors......-genome or whole-exome sequencing of 28 tumors or primary cell lines. These samples have a low mutation burden, with a mean of 10.6 protein changing mutations per sample (range 0 to 53). As expected for these sun-shielded melanomas the mutation spectrum was not consistent with an ultraviolet radiation signature...

  12. Sequencing illustrates the transcriptional response of Legionella pneumophila during infection and identifies seventy novel small non-coding RNAs.

    LENUS (Irish Health Repository)

    Weissenmayer, Barbara A

    2011-01-01

    Second generation sequencing has prompted a number of groups to re-interrogate the transcriptomes of several bacterial and archaeal species. One of the central findings has been the identification of complex networks of small non-coding RNAs that play central roles in transcriptional regulation in all growth conditions and for the pathogen\\'s interaction with and survival within host cells. Legionella pneumophila is a gram-negative facultative intracellular human pathogen with a distinct biphasic lifestyle. One of its primary environmental hosts in the free-living amoeba Acanthamoeba castellanii and its infection by L. pneumophila mimics that seen in human macrophages. Here we present analysis of strand specific sequencing of the transcriptional response of L. pneumophila during exponential and post-exponential broth growth and during the replicative and transmissive phase of infection inside A. castellanii. We extend previous microarray based studies as well as uncovering evidence of a complex regulatory architecture underpinned by numerous non-coding RNAs. Over seventy new non-coding RNAs could be identified; many of them appear to be strain specific and in configurations not previously reported. We discover a family of non-coding RNAs preferentially expressed during infection conditions and identify a second copy of 6S RNA in L. pneumophila. We show that the newly discovered putative 6S RNA as well as a number of other non-coding RNAs show evidence for antisense transcription. The nature and extent of the non-coding RNAs and their expression patterns suggests that these may well play central roles in the regulation of Legionella spp. specific traits and offer clues as to how L. pneumophila adapts to its intracellular niche. The expression profiles outlined in the study have been deposited into Genbank\\'s Gene Expression Omnibus (GEO) database under the series accession GSE27232.

  13. Identification and characterization of promoters and cis-regulatory elements of genes involved in secondary metabolites production in hop (Humulus lupulus. L)

    Czech Academy of Sciences Publication Activity Database

    Duraisamy, Ganesh Selvaraj; Mishra, Ajay Kumar; Kocábek, Tomáš; Matoušek, Jaroslav

    2016-01-01

    Roč. 84, October (2016), s. 346-352 ISSN 1476-9271 R&D Projects: GA ČR GA13-03037S Institutional support: RVO:60077344 Keywords : Cis-acting elements * Gene regulation * Humulus lupulus Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 1.331, year: 2016

  14. The Comparison of Biochemical and Sequencing 16S rDNA Gene Methods to Identify Nontuberculous Mycobacteria

    Directory of Open Access Journals (Sweden)

    Shafipour1, M.

    2014-11-01

    Full Text Available The identification of Mycobacteria in the species level has great medical importance. Biochemical tests are laborious and time-consuming, so new techniques could be used to identify the species. This research aimed to the comparison of biochemical and sequencing 16S rDNA gene methods to identify nontuberculous Mycobacteria in patients suspected to tuberculosis in Golestan province which is the most prevalent region of tuberculosis in Iran. Among 3336 patients suspected to tuberculosis referred to hospitals and health care centres in Golestan province during 2010-2011, 319 (9.56% culture positive cases were collected. Identification of species by using biochemical tests was done. On the samples recognized as nontuberculous Mycobacteria, after DNA extraction by boiling, 16S rDNA PCR was done and their sequencing were identified by NCBI BLAST. Of the 319 positive samples in Golestan Province, 300 cases were M.tuberculosis and 19 cases (5.01% were identified as nontuberculous Mycobacteria by biochemical tests. 15 out of 19 nontuberculous Mycobacteria were identified by PCR and sequencing method as similar by biochemical methods (similarity rate: 78.9%. But after PCR, 1 case known as M.simiae by biochemical test was identified as M. lentiflavum and 3 other cases were identified as Nocardia. Biochemical methods corresponded to the 16S rDNA PCR and sequencing in 78.9% of cases. However, in identification of M. lentiflavum and Nocaria sp. the molecular method is better than biochemical methods.

  15. Combined removal of a BTEX, TCE, and cis-DCE mixture using Pseudomonas sp. immobilized on scrap tyres.

    Science.gov (United States)

    Lu, Qihong; de Toledo, Renata Alves; Xie, Fei; Li, Junhui; Shim, Hojae

    2015-09-01

    The simultaneous aerobic removal of a mixture of benzene, toluene, ethylbenzene, and o,m,p-xylene (BTEX); cis-dichloroethylene (cis-DCE); and trichloroethylene (TCE) from the artificially contaminated water using an indigenous bacterial isolate identified as Pseudomonas plecoglossicida immobilized on waste scrap tyres was investigated. Suspended and immobilized conditions were compared for the removal of these volatile organic compounds. For the immobilized system, toluene, benzene, and ethylbenzene were completely removed, while the highest removal efficiencies of 99.0 ± 0.1, 96.8 ± 0.3, 73.6 ± 2.5, and 61.6 ± 0.9% were obtained for o-xylene, m,p-xylene, TCE, and cis-DCE, respectively. The sorption kinetics of contaminants towards tyre surface was also evaluated, and the sorption capacity generally followed the order of toluene > benzene > m,p-xylene > o-xylene > ethylbenzene > TCE > cis-DCE. Scrap tyres showed a good capability for the simultaneous sorption and bioremoval of BTEX/cis-DCE/TCE mixture, implying a promising waste material for the removal of contaminant mixture from industrial wastewater or contaminated groundwater.

  16. Effect of toluene concentration and hydrogen peroxide on Pseudomonas plecoglossicida cometabolizing mixture of cis-DCE and TCE in soil slurry.

    Science.gov (United States)

    Li, Junhui; Lu, Qihong; de Toledo, Renata Alves; Lu, Ying; Shim, Hojae

    2015-12-01

    An indigenous Pseudomonas sp., isolated from the regional contaminated soil and identified as P. plecoglossicida, was evaluated for its aerobic cometabolic removal of cis-1,2-dichloroethylene (cis-DCE) and trichloroethylene (TCE) using toluene as growth substrate in a laboratory-scale soil slurry. The aerobic simultaneous bioremoval of the cis-DCE/TCE/toluene mixture was studied under different conditions. Results showed that an increase in toluene concentration level from 300 to 900 mg/kg prolonged the lag phase for the bacterial growth, while the bioremoval extent for cis-DCE, TCE, and toluene declined as the initial toluene concentration increased. In addition, the cometabolic bioremoval of cis-DCE and TCE was inhibited by the presence of hydrogen peroxide as the additional oxygen source, while the bioremoval of toluene (900 mg/kg) was enhanced after 9 days of incubation. The subsequent addition of toluene did not improve the cometabolic bioremoval of cis-DCE and TCE. The obtained results would help to enhance the applicability of bioremediation technology to the mixed waste contaminated sites.

  17. Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

    Science.gov (United States)

    Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

    2014-01-01

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775

  18. An Evolutionarily Young Polar Bear (Ursus maritimus Endogenous Retrovirus Identified from Next Generation Sequence Data

    Directory of Open Access Journals (Sweden)

    Kyriakos Tsangaras

    2015-11-01

    Full Text Available Transcriptome analysis of polar bear (Ursus maritimus tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV. Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos and black bear (Ursus americanus but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.

  19. Transcription Factors Bind Thousands of Active and InactiveRegions in the Drosophila Blastoderm

    Energy Technology Data Exchange (ETDEWEB)

    Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard; Nix, David; Pollard, Daniel A.; Iyer, Venky N.; Hechmer, Aaron; Simirenko, Lisa; Stapleton, Mark; Luengo Hendriks, Cris L.; Chu, Hou Cheng; Ogawa, Nobuo; Inwood, William; Sementchenko, Victor; Beaton, Amy; Weiszmann, Richard; Celniker, Susan E.; Knowles, David W.; Gingeras, Tom; Speed, Terence P.; Eisen, Michael B.; Biggin, Mark D.

    2008-01-10

    Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early

  20. Functional promoter upstream p53 regulatory sequence of IGFBP3 that is silenced by tumor specific methylation

    International Nuclear Information System (INIS)

    Hanafusa, Tadashi; Shinji, Toshiyuki; Shiraha, Hidenori; Nouso, Kazuhiro; Iwasaki, Yoshiaki; Yumoto, Eichiro; Ono, Toshiro; Koide, Norio

    2005-01-01

    Insulin-like growth factor binding protein (IGFBP)-3 functions as a carrier of insulin-like growth factors (IGFs) in circulation and a mediator of the growth suppression signal in cells. There are two reported p53 regulatory regions in the IGFBP3 gene; one upstream of the promoter and one intronic. We previously reported a hot spot of promoter hypermethylation of IGFBP-3 in human hepatocellular carcinomas and derivative cell lines. As the hot spot locates at the putative upstream p53 consensus sequences, these p53 consensus sequences are really functional is a question to be answered. In this study, we examined the p53 consensus sequences upstream of the IGFBP-3 promoter for the p53 induced expression of IGFBP-3. Deletion, mutagenesis, and methylation constructs of IGFBP-3 promoter were assessed in the human hepatoblastoma cell line HepG2 for promoter activity. Deletions and mutations of these sequences completely abolished the expression of IGFBP-3 in the presence of p53 overexpression. In vitro methylation of these p53 consensus sequences also suppressed IGFBP-3 expression. In contrast, the expression of IGFBP-3 was not affected in the absence of p53 overexpression. Further, we observed by electrophoresis mobility shift assay that p53 binding to the promoter region was diminished when methylated. From these observations, we conclude that four out of eleven p53 consensus sequences upstream of the IGFBP-3 promoter are essential for the p53 induced expression of IGFBP-3, and hypermethylation of these sequences selectively suppresses p53 induced IGFBP-3 expression in HepG2 cells

  1. Monoclonal antibodies to DNA modified with cis- or trans-diamminedichloroplatinum(II)

    International Nuclear Information System (INIS)

    Sundquist, W.I.; Lippard, S.J.; Stollar, B.D.

    1987-01-01

    Murine monoclonal antibodies that bind selectively to adducts formed on DNA by the antitumor drug cis-diamminedichloroplatinum(II), cis-DDP, or to the chemothrapeutically inactive trans isomer trans-DDP were elicited by immunization with calf thymus DNA modified with either cis- or trans-DDP at ratios of bound platinum per nucleotide, (D/N)/sub b/, of 0.06-0.08. The binding of two monoclonal antibodies to cis-DDP-modified DNA was competitively inhibited in an enzyme-linked immunosorbent assay (ELISA) by 4-6 nM concentrations of cis-DDP bound to DNA. Adducts formed by cis-DDP on other synthetic DNA polymers did not inhibit antibody binding to cis-DDP-DNA. The biologically active compounds [Pt(en)Cl 2 ], [Pt(dach)Cl 2 ], and [Pt(NH 3 ) 2 (cbdca)] (carboplatin) all formed antibody-detectable adducts on DNA, whereas the inactive platinum complexes trans-DDP and [Pt(dien)Cl]Cl (dien, diethylenetriamine) did not. The monoclonal antibodies therefore recognize a bifunctional Pt-DNA adduct with cis stereochemistry in which platinum is coordinated by two adjacent guanines or, to a lesser degree, by adjacent adenine and guanine. A monoclonal antibody raised against trans-DDP-DNA was competitively inhibited in an ELISA by 40 nM trans-DDP bound to DNA. This antibody crossreacted with unmodified, denatured DNA. The recognition of cis- or trans-DDP-modified DNAs by monoclonal antibodies thus parallels the known modes of DNA binding of these compounds and may correlate with their biological activities

  2. Stepwise encapsulation and controlled two-stage release system for cis-Diamminediiodoplatinum

    Directory of Open Access Journals (Sweden)

    Chen Y

    2014-06-01

    Full Text Available Yun Chen,1,* Qian Li,1,2,* Qingsheng Wu1 1Department of Chemistry, Key Laboratory of Yangtze River Water Environment, Ministry of Education, Tongji University, Shanghai; 2Shanghai Institute of Quality Inspection and Technical Research, Shanghai, People’s Republic of China *These authors contributed equally to this work Abstract: cis-Diamminediiodoplatinum (cis-DIDP is a cisplatin-like anticancer drug with higher anticancer activity, but lower stability and price than cisplatin. In this study, a cis-DIDP carrier system based on micro-sized stearic acid was prepared by an emulsion solvent evaporation method. The maximum drug loading capacity of cis-DIDP-loaded solid lipid nanoparticles was 22.03%, and their encapsulation efficiency was 97.24%. In vitro drug release in phosphate-buffered saline (pH =7.4 at 37.5°C exhibited a unique two-stage process, which could prove beneficial for patients with tumors and malignancies. MTT (3-[4,5-dimethylthiazol-2-yl]-2, 5-diphenyltetrazolium bromide assay results showed that cis-DIDP released from cis-DIDP-loaded solid lipid nanoparticles had better inhibition activity than cis-DIDP that had not been loaded. Keywords: stearic acid, emulsion solvent evaporation method, drug delivery, cis-DIDP, in vitro

  3. Identification of two evolutionarily conserved 5' cis-elements involved in regulating spatiotemporal expression of Nolz-1 during mouse embryogenesis.

    Directory of Open Access Journals (Sweden)

    Sunny Li-Yun Chang

    Full Text Available Proper development of vertebrate embryos depends not only on the crucial funtions of key evolutionarily conserved transcriptional regulators, but also on the precisely spatiotemporal expression of these transcriptional regulators. The mouse Nolz-1/Znf503/Zfp503 gene is a mammalian member of the conserved zinc-finger containing NET family. The expression pattern of Nolz-1 in mouse embryos is highly correlated with that of its homologues in different species. To study the spatiotemporal regulation of Nolz-1, we first identified two evolutionarily conserved cis-elements, UREA and UREB, in 5' upstream regions of mouse Nolz-1 locus. We then generated UREA-LacZ and UREB-LacZ transgenic reporter mice to characterize the putative enhancer activity of UREA and UREB. The results indicated that both UREA and UREB contained tissue-specific enhancer activity for directing LacZ expression in selective tissue organs during mouse embryogensis. UREA directed LacZ expression preferentially in selective regions of developing central nervous system, including the forebrain, hindbrain and spinal cord, whereas UREB directed LacZ expression mainly in other developing tissue organs such as the Nolz-1 expressing branchial arches and its derivatives, the apical ectodermal ridge of limb buds and the urogenital tissues. Both UREA and UREB directed strong LacZ expression in the lateral plate mesoderm where endogenous Nolz-1 was also expressed. Despite that the LacZ expression pattern did not full recapitulated the endogenous Nolz-1 expression and some mismatched expression patterns were observed, co-expression of LacZ and Nolz-1 did occur in many cells of selective tissue organs, such as in the ventrolateral cortex and ventral spinal cord of UREA-LacZ embryos, and the urogenital tubes of UREB-LacZ embryos. Taken together, our study suggests that UREA and UREB may function as evolutionarily conserved cis-regulatory elements that coordinate with other cis-elements to regulate

  4. Connections between Transcription Downstream of Genes and cis-SAGe Chimeric RNA.

    Science.gov (United States)

    Chwalenia, Katarzyna; Qin, Fujun; Singh, Sandeep; Tangtrongstittikul, Panjapon; Li, Hui

    2017-11-22

    cis-Splicing between adjacent genes (cis-SAGe) is being recognized as one way to produce chimeric fusion RNAs. However, its detail mechanism is not clear. Recent study revealed induction of transcriptions downstream of genes (DoGs) under osmotic stress. Here, we investigated the influence of osmotic stress on cis-SAGe chimeric RNAs and their connection to DoGs. We found,the absence of induction of at least some cis-SAGe fusions and/or their corresponding DoGs at early time point(s). In fact, these DoGs and their cis-SAGe fusions are inversely correlated. This negative correlation was changed to positive at a later time point. These results suggest a direct competition between the two categories of transcripts when total pool of readthrough transcripts is limited at an early time point. At a later time point, DoGs and corresponding cis-SAGe fusions are both induced, indicating that total readthrough transcripts become more abundant. Finally, we observed overall enhancement of cis-SAGe chimeric RNAs in KCl-treated samples by RNA-Seq analysis.

  5. Enantioselective disruption of the endocrine system by Cis-Bifenthrin in the male mice.

    Science.gov (United States)

    Jin, Yuanxiang; Wang, Jiangcong; Pan, Xiuhong; Miao, Wenyu; Lin, Xiaojian; Wang, Linggang; Fu, Zhengwei

    2015-07-01

    Bifenthrin (BF), as a chiral pyrethroid, is widely used to control field and household pests in China. At present, the commercial BF is a mixed compound containing cis isomers (cis-BF) including two enantiomers of 1R-cis-BF and 1S-cis-BF. In the present study, the two individual cis-BF enantiomers were separated by a preparative supercritical fluid chromatography. Then, four week-old adolescent male ICR mice were orally administered 1R-cis-BF and 1S-cis-BF separately daily for 3 weeks at doses of 0, 7.5 and 15 mg/kg/day, respectively. Results showed that the transcription status of some genes involved in cholesterol synthesis and transport as well as testosterone (T) synthesis in the testes were influenced by cis-BF enantiomers. Especially, we observed that the transcription status of key genes on the pathway of T synthesis including cytochrome P450 cholesterol side-chain cleavage enzyme (P450scc) and cytochrome P450 17α-hydroxysteroid dehydrogenase (P45017α)) were selectively altered in the testis of mice when treated with 1S-cis-BF, suggesting that it is the possible reason to explain why the lower serum T concentration in 1S-cis-BF treated group. Taken together, it concluded that both of the cis-BF enantiomers have the endocrine disruption activities, while 1S-cis-BF was higher than 1R-cis-BF in mice when exposed during the puberty. The data was helpful to understand the toxicity of cis-BF in mammals under enantiomeric level. © 2014 Wiley Periodicals, Inc.

  6. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Directory of Open Access Journals (Sweden)

    Amanda Malvessi Cattani

    Full Text Available Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  7. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Science.gov (United States)

    Cattani, Amanda Malvessi; Siqueira, Franciele Maboni; Guedes, Rafael Lucas Muniz; Schrank, Irene Silveira

    2016-01-01

    Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  8. The Analysis of A Frequent TMPRSS3 Allele Containing P.V116M and P.V291L in A Cis Configuration among Deaf Koreans

    Directory of Open Access Journals (Sweden)

    Ah Reum Kim

    2017-10-01

    Full Text Available We performed targeted re-sequencing to identify the genetic etiology of early-onset postlingual deafness and encountered a frequent TMPRSS3 allele harboring two variants in a cis configuration. We aimed to evaluate the pathogenicity of the allele. Among 88 cochlear implantees with autosomal recessive non-syndromic hearing loss, subjects with GJB2 and SLC26A4 mutations were excluded. Thirty-one probands manifesting early-onset postlingual deafness were sorted. Through targeted re-sequencing, we detected two families with a TMPRSS3 mutant allele containing p.V116M and p.V291L in a cis configuration, p.[p.V116M; p.V291L]. A minor allele frequency was calculated and proteolytic activity was measured. A p.[p.V116M; p.V291L] allele demonstrated a significantly higher frequency compared to normal controls and merited attention due to its high frequency (4.84%, 3/62. The first family showed a novel deleterious splice site variant—c.783-1G>A—in a trans allele, while the other showed homozygosity. The progression to deafness was noted within the first decade, suggesting DFNB10. The proteolytic activity was significantly reduced, confirming the severe pathogenicity. This frequent mutant allele significantly contributes to early-onset postlingual deafness in Koreans. For clinical implication and proper auditory rehabilitation, it is important to pay attention to this allele with a severe pathogenic potential.

  9. Functional Interaction of the Adenovirus IVa2 Protein with Adenovirus Type 5 Packaging Sequences

    OpenAIRE

    Ostapchuk, Philomena; Yang, Jihong; Auffarth, Ece; Hearing, Patrick

    2005-01-01

    Adenovirus type 5 (Ad5) DNA packaging is initiated in a polar fashion from the left end of the genome. The packaging process is dependent on the cis-acting packaging domain located between nucleotides 230 and 380. Seven AT-rich repeats that direct packaging have been identified within this domain. A1, A2, A5, and A6 are the most important repeats functionally and share a bipartite sequence motif. Several lines of evidence suggest that there is a limiting trans-acting factor(s) that plays a ro...

  10. Identification of Predictive Cis-Regulatory Elements Using a Discriminative Objective Function and a Dynamic Search Space.

    Directory of Open Access Journals (Sweden)

    Rahul Karnik

    Full Text Available The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs.

  11. The efficacy of 9-cis retinoic acid in experimental models of cancer.

    Science.gov (United States)

    Gottardis, M M; Lamph, W W; Shalinsky, D R; Wellstein, A; Heyman, R A

    1996-01-01

    9-cis retinoic acid (9-cis RA) is a retinoid receptor pan-agonist that binds with high affinity to both retinoic acid receptors (RARs) and retinoid X receptors (RXRs). Using a variety of in vivo and in vitro cancer models, we present experimental data that 9-cis RA has activity as a potential chemotherapeutic agent. Treatment of the human promyelocytic leukemia cell line HL-60 with 9-cis RA decreases cell proliferation, increases cell differentiation, and increases apoptosis. Induction of apoptosis correlates with an increase in tissue transglutaminase (type II) activity. In vivo, 9-cis RA induces complete tumor regression of an early passage human lip squamous cell carcinoma xenograft. Finally, 9-cis RA inhibits the anchorage-independent growth of the human breast cancer cell lines MCF-7 and LY2 (an antiestrogen-resistant MCF-7 variant). Transient co-transfection assays indicate that 9-cis RA inhibits estrogen receptor transcription of an ERE-tk-LUC reporter through RAR or RXR receptors. These data suggest that retinoid receptors can antagonize estrogen-dependent transcription and provides one possible mechanism for the inhibition of cell growth by 9-cis RA in breast cancer cell lines. In summary, these findings present evidence that 9-cis RA has a wide range of activities in human cancer models.

  12. Functional brain activation differences in stuttering identified with a rapid fMRI sequence

    Science.gov (United States)

    Kraft, Shelly Jo; Choo, Ai Leen; Sharma, Harish; Ambrose, Nicoline G.

    2011-01-01

    The purpose of this study was to investigate whether brain activity related to the presence of stuttering can be identified with rapid functional MRI (fMRI) sequences that involved overt and covert speech processing tasks. The long-term goal is to develop sensitive fMRI approaches with developmentally appropriate tasks to identify deviant speech motor and auditory brain activity in children who stutter closer to the age at which recovery from stuttering is documented. Rapid sequences may be preferred for individuals or populations who do not tolerate long scanning sessions. In this report, we document the application of a picture naming and phoneme monitoring task in three minute fMRI sequences with adults who stutter (AWS). If relevant brain differences are found in AWS with these approaches that conform to previous reports, then these approaches can be extended to younger populations. Pairwise contrasts of brain BOLD activity between AWS and normally fluent adults indicated the AWS showed higher BOLD activity in the right inferior frontal gyrus (IFG), right temporal lobe and sensorimotor cortices during picture naming and and higher activity in the right IFG during phoneme monitoring. The right lateralized pattern of BOLD activity together with higher activity in sensorimotor cortices is consistent with previous reports, which indicates rapid fMRI sequences can be considered for investigating stuttering in younger participants. PMID:22133409

  13. Local effect of enhancer of zeste-like reveals cooperation of epigenetic and cis-acting determinants for zygotic genome rearrangements.

    Directory of Open Access Journals (Sweden)

    Maoussi Lhuillier-Akakpo

    2014-09-01

    Full Text Available In the ciliate Paramecium tetraurelia, differentiation of the somatic nucleus from the zygotic nucleus is characterized by massive and reproducible deletion of transposable elements and of 45,000 short, dispersed, single-copy sequences. A specific class of small RNAs produced by the germline during meiosis, the scnRNAs, are involved in the epigenetic regulation of DNA deletion but the underlying mechanisms are poorly understood. Here, we show that trimethylation of histone H3 (H3K27me3 and H3K9me3 displays a dynamic nuclear localization that is altered when the endonuclease required for DNA elimination is depleted. We identified the putative histone methyltransferase Ezl1 necessary for H3K27me3 and H3K9me3 establishment and show that it is required for correct genome rearrangements. Genome-wide analyses show that scnRNA-mediated H3 trimethylation is necessary for the elimination of long, repeated germline DNA, while single copy sequences display differential sensitivity to depletion of proteins involved in the scnRNA pathway, Ezl1- a putative histone methyltransferase and Dcl5- a protein required for iesRNA biogenesis. Our study reveals cis-acting determinants, such as DNA length, also contribute to the definition of germline sequences to delete. We further show that precise excision of single copy DNA elements, as short as 26 bp, requires Ezl1, suggesting that development specific H3K27me3 and H3K9me3 ensure specific demarcation of very short germline sequences from the adjacent somatic sequences.

  14. Assessment of regulatory effectiveness. Peer discussions on regulatory practices

    International Nuclear Information System (INIS)

    1999-09-01

    This report arises from the seventh series of peer discussions on regulatory practices entitled 'Assessment of Regulatory Effectiveness'. The term 'regulatory effectiveness' covers the quality of the work and level of performance of a regulatory body. In this sense, regulatory effectiveness applies to regulatory body activities aimed at preventing safety degradation and ensuring that an acceptable level of safety is being maintained by the regulated operating organizations. In addition, regulatory effectiveness encompasses the promotion of safety improvements, the timely and cost effective performance of regulatory functions in a manner which ensures the confidence of the operating organizations, the general public and the government, and striving for continuous improvements to performance. Senior regulators from 22 Member States participated in two peer group discussions during March and May 1999. The discussions were focused on the elements of an effective regulatory body, possible indicators of regulatory effectiveness and its assessment. This report presents the outcome of these meetings and recommendations of good practices identified by senior regulators, which do not necessarily reflect those of the governments of the nominating Member States, the organizations they belong to, or the International Atomic Energy Agency. In order to protect people and the environment from hazards associated with nuclear facilities, the main objective of a nuclear regulatory body is to ensure that a high level of safety in the nuclear activities under its jurisdiction is achieved, maintained and within the control of operating organizations. Even if it is possible to directly judge objective safety levels at nuclear facilities, such safety levels would not provide an exclusive indicator of regulatory effectiveness. The way the regulatory body ensures the safety of workers and the public and the way it discharges its responsibilities also determine its effectiveness. Hence the

  15. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice

    Directory of Open Access Journals (Sweden)

    Shuchi eSmita

    2015-12-01

    Full Text Available MYB transcription factor (TF is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by top down and guide gene approaches. More than 50% of OsMYBs were strongly correlated under fifty experimental conditions with 51 hub genes via top down approach. Further, clusters were identified using Markov Clustering (MCL. To maximize the clustering performance, parameter evaluation of the MCL inflation score (I was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by guide gene approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought

  16. [Separation and identification of beta-carotene and its cis isomers by high pressure liquid chromatography (HPLC)].

    Science.gov (United States)

    Carrillo de Padilla, F

    1996-06-01

    The separation and identification by HPLC of the cis isomers of beta-carotene was studied. A 1.26 mg/ml beta-carotene solution previously isomerized with iodine as a catalyst, was eluted with 2% acetone in hexane, from a Ca(OH)2 chromatographic column in three bands. The fractions were identified by spectrophotometry and the retention times of 2.05, 2.4 and 2.8 min for the 13 cis, all-trans, and 9 cis beta-carotene isomers, determined by HPLC, with 1% acetone in hexane as movil phase. 22.13 mg % of all-trans beta-carotene were found in a sample of canned carrots. It is recommended the analyses of a greater number of samples, the determination of the method's sensitivity, reproductibility, and the use of a standard of reference of a response factor for calculations.

  17. Separation and identification of beta-carotene and its cis isomers by high pressure liquid chromatography (HPLC)

    International Nuclear Information System (INIS)

    Carrillo de Padilla, F.

    1996-01-01

    The separation and identification by HPLC of the cis isomers of beta-carotene was studied. A 1.26 mg/ml beta-carotene solution previously isomerized with iodine as a catalyst, was eluted with 2% acetone in hexane, from a Ca(OH)2 chromatographic column in three bands. The fractions were identified by spectrophotometry and the retention times of 2.05, 2.4 and 2.8 min for the 13 cis, all-trans, and 9 cis beta-carotene isomers, determined by HPLC, with 1% acetone in hexane as Mobil phase. 22.13 mg % of all-trans beta-carotene were found in a sample of canned carrots. It is recommended the analyses of a greater number of samples, the determination of the method's sensitivity, reproducibility, and the use of a standard of reference of a response factor for calculations

  18. Effects of cis-9, trans-11 and trans-10, cis-12 conjugated linoleic acid (CLA) isomers on immune function in healthy men

    NARCIS (Netherlands)

    Albers, R.; Wielen, R.P.J. van der; Brink, E.J.; Hendriks, H.F.J.; Dorovska-Taran, V.N.; Mohede, I.C.M.

    2003-01-01

    Objectives: To study the effects of two different mixtures of the main conjugated linoleic acid (CLA) isomers cis-9, trans-11 CLA and trans-10, cis-12 CLA on human immune function. Design: Double-blind, randomized, parallel, reference-controlled intervention study. Subjects and intervention:

  19. Kynurenine 3-Monooxygenase Gene Associated With Nicotine Initiation and Addiction: Analysis of Novel Regulatory Features at 5′ and 3′-Regions

    Directory of Open Access Journals (Sweden)

    Hassan A. Aziz

    2018-06-01

    Full Text Available Tobacco smoking is widespread behavior in Qatar and worldwide and is considered one of the major preventable causes of ill health and death. Nicotine is part of tobacco smoke that causes numerous health risks and is incredibly addictive; it binds to the α7 nicotinic acetylcholine receptor (α7nAChR in the brain. Recent studies showed α7nAChR involvement in the initiation and addiction of smoking. Kynurenic acid (KA, a significant tryptophan metabolite, is an antagonist of α7nAChR. Inhibition of kynurenine 3-monooxygenase enzyme encoded by KMO enhances the KA levels. Modulating KMO gene expression could be a useful tactic for the treatment of tobacco initiation and dependence. Since KMO regulation is still poorly understood, we aimed to investigate the 5′ and 3′-regulatory factors of KMO gene to advance our knowledge to modulate KMO gene expression. In this study, bioinformatics methods were used to identify the regulatory sequences associated with expression of KMO. The displayed differential expression of KMO mRNA in the same tissue and different tissues suggested the specific usage of the KMO multiple alternative promoters. Eleven KMO alternative promoters identified at 5′-regulatory region contain TATA-Box, lack CpG Island (CGI and showed dinucleotide base-stacking energy values specific to transcription factor binding sites (TFBSs. The structural features of regulatory sequences can influence the transcription process and cell type-specific expression. The uncharacterized LOC105373233 locus coding for non-coding RNA (ncRNA located on the reverse strand in a convergent manner at the 3′-side of KMO locus. The two genes likely expressed by a promoter that lacks TATA-Box harbor CGI and two TFBSs linked to the bidirectional transcription, the NRF1, and ZNF14 motifs. We identified two types of microRNA (miR in the uncharacterized LOC105373233 ncRNA, which are like hsa-miR-5096 and hsa-miR-1285-3p and can target the miR recognition

  20. Regulatory Roles for Long ncRNA and mRNA

    Directory of Open Access Journals (Sweden)

    Marcel W. Coolen

    2013-04-01

    Full Text Available Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2 with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research.

  1. Elucidating MicroRNA Regulatory Networks Using Transcriptional, Post-transcriptional, and Histone Modification Measurements

    Directory of Open Access Journals (Sweden)

    Sara J.C. Gosline

    2016-01-01

    Full Text Available MicroRNAs (miRNAs regulate diverse biological processes by repressing mRNAs, but their modest effects on direct targets, together with their participation in larger regulatory networks, make it challenging to delineate miRNA-mediated effects. Here, we describe an approach to characterizing miRNA-regulatory networks by systematically profiling transcriptional, post-transcriptional and epigenetic activity in a pair of isogenic murine fibroblast cell lines with and without Dicer expression. By RNA sequencing (RNA-seq and CLIP (crosslinking followed by immunoprecipitation sequencing (CLIP-seq, we found that most of the changes induced by global miRNA loss occur at the level of transcription. We then introduced a network modeling approach that integrated these data with epigenetic data to identify specific miRNA-regulated transcription factors that explain the impact of miRNA perturbation on gene expression. In total, we demonstrate that combining multiple genome-wide datasets spanning diverse regulatory modes enables accurate delineation of the downstream miRNA-regulated transcriptional network and establishes a model for studying similar networks in other systems.

  2. Characterisation of mutations of the phosphoinositide-3-kinase regulatory subunit, PIK3R2, in perisylvian polymicrogyria: a next-generation sequencing study.

    Science.gov (United States)

    Mirzaa, Ghayda M; Conti, Valerio; Timms, Andrew E; Smyser, Christopher D; Ahmed, Sarah; Carter, Melissa; Barnett, Sarah; Hufnagel, Robert B; Goldstein, Amy; Narumi-Kishimoto, Yoko; Olds, Carissa; Collins, Sarah; Johnston, Kathreen; Deleuze, Jean-François; Nitschké, Patrick; Friend, Kathryn; Harris, Catharine; Goetsch, Allison; Martin, Beth; Boyle, Evan August; Parrini, Elena; Mei, Davide; Tattini, Lorenzo; Slavotinek, Anne; Blair, Ed; Barnett, Christopher; Shendure, Jay; Chelly, Jamel; Dobyns, William B; Guerrini, Renzo

    2015-12-01

    Bilateral perisylvian polymicrogyria (BPP), the most common form of regional polymicrogyria, causes the congenital bilateral perisylvian syndrome, featuring oromotor dysfunction, cognitive impairment, and epilepsy. The causes of BPP are heterogeneous, but only a few genetic causes have been reported. The aim of this study was to identify additional genetic causes of BPP and characterise their frequency in this population. Children (aged ≤18 years) with polymicrogyria were enrolled into our research programme from July, 1980, to October, 2015, at two centres (Florence, Italy, and Seattle, WA, USA). We obtained samples (blood and saliva) throughout this period at both centres and did whole-exome sequencing on DNA from eight trios (two parents and one affected child) with BPP in 2014. After the identification of mosaic PIK3R2 mutations in two of these eight children, we performed targeted screening of PIK3R2 by two methods in a cohort of 118 children with BPP. First, we performed targeted sequencing of the entire PIK3R2 gene by single molecule molecular inversion probes (smMIPs) on 38 patients with BPP with normal to large head size. Second, we did amplicon sequencing of the recurrent PIK3R2 mutation (Gly373Arg) in 80 children with various types of polymicrogyria including BPP. One additional patient had clinical whole-exome sequencing done independently, and was included in this study because of the phenotypic similarity to our cohort. We identified a mosaic mutation (Gly373Arg) in a regulatory subunit of the PI3K-AKT-mTOR pathway, PIK3R2, in two children with BPP. Of the 38 patients with BPP and normal to large head size who underwent targeted next-generation sequencing by smMIPs, we identified constitutional and mosaic PIK3R2 mutations in 17 additional children. In parallel, one patient had the recurrent PIK3R2 mutation identified by clinical whole-exome sequencing. Seven of these 20 patients had BPP alone, and 13 had BPP in association with features of the

  3. Identification of unique cis-element pattern on simulated microgravity treated Arabidopsis by in silico and gene expression

    Science.gov (United States)

    Soh, Hyuncheol; Choi, Yongsang; Lee, Taek-Kyun; Yeo, Up-Dong; Han, Kyeongsik; Auh, Chungkyun; Lee, Sukchan

    2012-08-01

    Arabidopsis gene expression microarray (44 K) was used to detect genes highly induced under simulated microgravity stress (SMS). Ten SMS-inducible genes were selected from the microarray data and these 10 genes were found to be abundantly expressed in 3-week-old plants. Nine out of the 10 SMS-inducible genes were also expressed in response to the three abiotic stresses of drought, touch, and wounding in 3-week-old Arabidopsis plants respectively. However, WRKY46 was elevated only in response to SMS. Six other WRKY genes did not respond to SMS. To clarify the characteristics of the genes expressed at high levels in response to SMS, 20 cis-elements in the promoters of the 40 selected genes including the 10 SMS-inducible genes, the 6 WRKY genes, and abiotic stress-inducible genes were analyzed and their spatial positions on each promoter were determined. Four cis-elements (M/T-G-T-P from MYB1AT or TATABOX5, GT1CONSENSUS, TATABOX5, and POLASIG1) showed a unique spatial arrangement in most SMS-inducible genes including WRKY46. Therefore the M/T-G-T-P cis-element patterns identified in the promoter of WRKY46 may play important roles in regulating gene expression in response to SMS. The presences of the cis-element patterns suggest that the order or spatial positioning of certain groups of cis-elements is more important than the existence or numbers of specific cis-elements. Taken together, our data indicate that WRKY46 is a novel SMS inducible transcription factor and the unique spatial arrangement of cis-elements shown in WRKY46 promoter may play an important role for its response to SMS.

  4. Effects of pyrethroid pesticide cis-bifenthrin on lipogenesis in hepatic cell line.

    Science.gov (United States)

    Xiang, Dandan; Chu, Tianyi; Li, Meng; Wang, Qiangwei; Zhu, Guonian

    2018-06-01

    Mounting evidence suggests there is a link between exposure to synthetic pyrethroids (SPs) and the development of obesity. The information presented in this study suggests that cis-bifenthrin (cis-BF) could activate pregnane X receptor (PXR) mediated pathway and lead to the lipid accumulation of human hepatoma (HepG2) cells. Cells were incubated in the control or different concentrations of cis-BF for 24 h. The 1 × 10 -7  M and 1 × 10 -6  M cis-BF exposure were found to induce cellular triglyceride (TG) accumulation significantly. This phenomenon was further supported by Oil Red O Staining assay. The cis-BF exposure caused upregulation of PXR gene and protein. Correspondingly, we also observed the increased expression of downstream genes involved in lipid formation and the inhibition of the expression of β-oxidation. As chiral pesticide,cis-BF was further conformed to behave enantioselectivity in the lipid metabolism. Rather than 1R-cis-BF, HepG2 cells incubated with 1S-cis-BF exhibited a significant TG accumulation. 1S-cis-BF also showed a higher binding level, of which the KD value was 9.184 × 10 -8  M in the SPR assay, compared with 1R-cis-BF (3.463 × 10 -6  M). In addition, the molecular docking simulation analyses correlated well with the KD values measured by the SPR, indicating that 1S-cis-BF showed a better binding affinity with PXR. The results in this study also elucidates the differences between the two enantiomers of pyrethroid-induced toxicity in lipid metabolism of non-target organism. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. Somatic mosaicism of a CDKL5 mutation identified by next-generation sequencing.

    Science.gov (United States)

    Kato, Takeshi; Morisada, Naoya; Nagase, Hiroaki; Nishiyama, Masahiro; Toyoshima, Daisaku; Nakagawa, Taku; Maruyama, Azusa; Fu, Xue Jun; Nozu, Kandai; Wada, Hiroko; Takada, Satoshi; Iijima, Kazumoto

    2015-10-01

    CDKL5-related encephalopathy is an X-linked dominantly inherited disorder that is characterized by early infantile epileptic encephalopathy or atypical Rett syndrome. We describe a 5-year-old Japanese boy with intractable epilepsy, severe developmental delay, and Rett syndrome-like features. Onset was at 2 months, when his electroencephalogram showed sporadic single poly spikes and diffuse irregular poly spikes. We conducted a genetic analysis using an Illumina® TruSight™ One sequencing panel on a next-generation sequencer. We identified two epilepsy-associated single nucleotide variants in our case: CDKL5 p.Ala40Val and KCNQ2 p.Glu515Asp. CDKL5 p.Ala40Val has been previously reported to be responsible for early infantile epileptic encephalopathy. In our case, the CDKL5 heterozygous mutation showed somatic mosaicism because the boy's karyotype was 46,XY. The KCNQ2 variant p.Glu515Asp is known to cause benign familial neonatal seizures-1, and this variant showed paternal inheritance. Although we believe that the somatic mosaic CDKL5 mutation is mainly responsible for the neurological phenotype in the patient, the KCNQ2 variant might have some neurological effect. Genetic analysis by next-generation sequencing is capable of identifying multiple variants in a patient. Copyright © 2015 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.

  6. Enzymatic study on AtCCD4 and AtCCD7 and their potential to form acyclic regulatory metabolites

    KAUST Repository

    Bruno, Mark

    2016-09-29

    The Arabidopsis carotenoid cleavage dioxygenase 4 (AtCCD4) is a negative regulator of the carotenoid content of seeds and has recently been suggested as a candidate for the generation of retrograde signals that are thought to derive from the cleavage of poly-cis-configured carotene desaturation intermediates. In this work, we investigated the activity of AtCCD4 in vitro and used dynamic modeling to determine its substrate preference. Our results document strict regional specificity for cleavage at the C9–C10 double bond in carotenoids and apocarotenoids, with preference for carotenoid substrates and an obstructing effect on hydroxyl functions, and demonstrate the specificity for all-trans-configured carotenes and xanthophylls. AtCCD4 cleaved substrates with at least one ionone ring and did not convert acyclic carotene desaturation intermediates, independent of their isomeric states. These results do not support a direct involvement of AtCCD4 in generating the supposed regulatory metabolites. In contrast, the strigolactone biosynthetic enzyme AtCCD7 converted 9-cis-configured acyclic carotenes, such as 9-cis-ζ-carotene, 9\\'-cis-neurosporene, and 9-cis-lycopene, yielding 9-cis-configured products and indicating that AtCCD7, rather than AtCCD4, is the candidate for forming acyclic retrograde signals.

  7. Enzymatic study on AtCCD4 and AtCCD7 and their potential to form acyclic regulatory metabolites

    KAUST Repository

    Bruno, Mark; Koschmieder, Julian; Wuest, Florian; Schaub, Patrick; Fehling-Kaschek, Mirjam; Timmer, Jens; Beyer, Peter; Al-Babili, Salim

    2016-01-01

    The Arabidopsis carotenoid cleavage dioxygenase 4 (AtCCD4) is a negative regulator of the carotenoid content of seeds and has recently been suggested as a candidate for the generation of retrograde signals that are thought to derive from the cleavage of poly-cis-configured carotene desaturation intermediates. In this work, we investigated the activity of AtCCD4 in vitro and used dynamic modeling to determine its substrate preference. Our results document strict regional specificity for cleavage at the C9–C10 double bond in carotenoids and apocarotenoids, with preference for carotenoid substrates and an obstructing effect on hydroxyl functions, and demonstrate the specificity for all-trans-configured carotenes and xanthophylls. AtCCD4 cleaved substrates with at least one ionone ring and did not convert acyclic carotene desaturation intermediates, independent of their isomeric states. These results do not support a direct involvement of AtCCD4 in generating the supposed regulatory metabolites. In contrast, the strigolactone biosynthetic enzyme AtCCD7 converted 9-cis-configured acyclic carotenes, such as 9-cis-ζ-carotene, 9'-cis-neurosporene, and 9-cis-lycopene, yielding 9-cis-configured products and indicating that AtCCD7, rather than AtCCD4, is the candidate for forming acyclic retrograde signals.

  8. Enzymatic study on AtCCD4 and AtCCD7 and their potential to form acyclic regulatory metabolites

    Science.gov (United States)

    Bruno, Mark; Koschmieder, Julian; Wuest, Florian; Schaub, Patrick; Fehling-Kaschek, Mirjam; Timmer, Jens; Beyer, Peter; Al-Babili, Salim

    2016-01-01

    The Arabidopsis carotenoid cleavage dioxygenase 4 (AtCCD4) is a negative regulator of the carotenoid content of seeds and has recently been suggested as a candidate for the generation of retrograde signals that are thought to derive from the cleavage of poly-cis-configured carotene desaturation intermediates. In this work, we investigated the activity of AtCCD4 in vitro and used dynamic modeling to determine its substrate preference. Our results document strict regional specificity for cleavage at the C9–C10 double bond in carotenoids and apocarotenoids, with preference for carotenoid substrates and an obstructing effect on hydroxyl functions, and demonstrate the specificity for all-trans-configured carotenes and xanthophylls. AtCCD4 cleaved substrates with at least one ionone ring and did not convert acyclic carotene desaturation intermediates, independent of their isomeric states. These results do not support a direct involvement of AtCCD4 in generating the supposed regulatory metabolites. In contrast, the strigolactone biosynthetic enzyme AtCCD7 converted 9-cis-configured acyclic carotenes, such as 9-cis-ζ-carotene, 9'-cis-neurosporene, and 9-cis-lycopene, yielding 9-cis-configured products and indicating that AtCCD7, rather than AtCCD4, is the candidate for forming acyclic retrograde signals. PMID:27811075

  9. U.S./CIS eye joint nuclear rocket venture

    Science.gov (United States)

    Clark, John S.; Mcilwain, Melvin C.; Smetanikov, Vladimir; D'Yakov, Evgenij K.; Pavshuk, Vladimir A.

    1993-01-01

    An account is given of the significance for U.S. spacecraft development of a nuclear thermal rocket (NTR) reactor concept that has been developed in the (formerly Soviet) Commonwealth of Independent States (CIS). The CIS NTR reactor employs a hydrogen-cooled zirconium hydride moderator and ternary carbide fuels; the comparatively cool operating temperatures associated with this design promise overall robustness.

  10. Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency

    Directory of Open Access Journals (Sweden)

    Yeh Cheng-Yu

    2009-12-01

    Full Text Available Abstract Background Prostate cancer is a world wide leading cancer and it is characterized by its aggressive metastasis. According to the clinical heterogeneity, prostate cancer displays different stages and grades related to the aggressive metastasis disease. Although numerous studies used microarray analysis and traditional clustering method to identify the individual genes during the disease processes, the important gene regulations remain unclear. We present a computational method for inferring genetic regulatory networks from micorarray data automatically with transcription factor analysis and conditional independence testing to explore the potential significant gene regulatory networks that are correlated with cancer, tumor grade and stage in the prostate cancer. Results To deal with missing values in microarray data, we used a K-nearest-neighbors (KNN algorithm to determine the precise expression values. We applied web services technology to wrap the bioinformatics toolkits and databases to automatically extract the promoter regions of DNA sequences and predicted the transcription factors that regulate the gene expressions. We adopt the microarray datasets consists of 62 primary tumors, 41 normal prostate tissues from Stanford Microarray Database (SMD as a target dataset to evaluate our method. The predicted results showed that the possible biomarker genes related to cancer and denoted the androgen functions and processes may be in the development of the prostate cancer and promote the cell death in cell cycle. Our predicted results showed that sub-networks of genes SREBF1, STAT6 and PBX1 are strongly related to a high extent while ETS transcription factors ELK1, JUN and EGR2 are related to a low extent. Gene SLC22A3 may explain clinically the differentiation associated with the high grade cancer compared with low grade cancer. Enhancer of Zeste Homolg 2 (EZH2 regulated by RUNX1 and STAT3 is correlated to the pathological stage

  11. Identifying significant genetic regulatory networks in the prostate cancer from microarray data based on transcription factor analysis and conditional independency.

    Science.gov (United States)

    Yeh, Hsiang-Yuan; Cheng, Shih-Wu; Lin, Yu-Chun; Yeh, Cheng-Yu; Lin, Shih-Fang; Soo, Von-Wun

    2009-12-21

    Prostate cancer is a world wide leading cancer and it is characterized by its aggressive metastasis. According to the clinical heterogeneity, prostate cancer displays different stages and grades related to the aggressive metastasis disease. Although numerous studies used microarray analysis and traditional clustering method to identify the individual genes during the disease processes, the important gene regulations remain unclear. We present a computational method for inferring genetic regulatory networks from micorarray data automatically with transcription factor analysis and conditional independence testing to explore the potential significant gene regulatory networks that are correlated with cancer, tumor grade and stage in the prostate cancer. To deal with missing values in microarray data, we used a K-nearest-neighbors (KNN) algorithm to determine the precise expression values. We applied web services technology to wrap the bioinformatics toolkits and databases to automatically extract the promoter regions of DNA sequences and predicted the transcription factors that regulate the gene expressions. We adopt the microarray datasets consists of 62 primary tumors, 41 normal prostate tissues from Stanford Microarray Database (SMD) as a target dataset to evaluate our method. The predicted results showed that the possible biomarker genes related to cancer and denoted the androgen functions and processes may be in the development of the prostate cancer and promote the cell death in cell cycle. Our predicted results showed that sub-networks of genes SREBF1, STAT6 and PBX1 are strongly related to a high extent while ETS transcription factors ELK1, JUN and EGR2 are related to a low extent. Gene SLC22A3 may explain clinically the differentiation associated with the high grade cancer compared with low grade cancer. Enhancer of Zeste Homolg 2 (EZH2) regulated by RUNX1 and STAT3 is correlated to the pathological stage. We provide a computational framework to reconstruct

  12. Genome-wide comparative analysis reveals human-mouse regulatory landscape and evolution.

    Science.gov (United States)

    Denas, Olgert; Sandstrom, Richard; Cheng, Yong; Beal, Kathryn; Herrero, Javier; Hardison, Ross C; Taylor, James

    2015-02-14

    Because species-specific gene expression is driven by species-specific regulation, understanding the relationship between sequence and function of the regulatory regions in different species will help elucidate how differences among species arise. Despite active experimental and computational research, relationships among sequence, conservation, and function are still poorly understood. We compared transcription factor occupied segments (TFos) for 116 human and 35 mouse TFs in 546 human and 125 mouse cell types and tissues from the Human and the Mouse ENCODE projects. We based the map between human and mouse TFos on a one-to-one nucleotide cross-species mapper, bnMapper, that utilizes whole genome alignments (WGA). Our analysis shows that TFos are under evolutionary constraint, but a substantial portion (25.1% of mouse and 25.85% of human on average) of the TFos does not have a homologous sequence on the other species; this portion varies among cell types and TFs. Furthermore, 47.67% and 57.01% of the homologous TFos sequence shows binding activity on the other species for human and mouse respectively. However, 79.87% and 69.22% is repurposed such that it binds the same TF in different cells or different TFs in the same cells. Remarkably, within the set of repurposed TFos, the corresponding genome regions in the other species are preferred locations of novel TFos. These events suggest exaptation of some functional regulatory sequences into new function. Despite TFos repurposing, we did not find substantial changes in their predicted target genes, suggesting that CRMs buffer evolutionary events allowing little or no change in the TFos - target gene associations. Thus, the small portion of TFos with strictly conserved occupancy underestimates the degree of conservation of regulatory interactions. We mapped regulatory sequences from an extensive number of TFs and cell types between human and mouse using WGA. A comparative analysis of this correspondence unveiled the

  13. Sequence-Based Introgression Mapping Identifies Candidate White Mold Tolerance Genes in Common Bean

    Directory of Open Access Journals (Sweden)

    Sujan Mamidi

    2016-07-01

    Full Text Available White mold, caused by the necrotrophic fungus (Lib. de Bary, is a major disease of common bean ( L.. WM7.1 and WM8.3 are two quantitative trait loci (QTL with major effects on tolerance to the pathogen. Advanced backcross populations segregating individually for either of the two QTL, and a recombinant inbred (RI population segregating for both QTL were used to fine map and confirm the genetic location of the QTL. The QTL intervals were physically mapped using the reference common bean genome sequence, and the physical intervals for each QTL were further confirmed by sequence-based introgression mapping. Using whole-genome sequence data from susceptible and tolerant DNA pools, introgressed regions were identified as those with significantly higher numbers of single-nucleotide polymorphisms (SNPs relative to the whole genome. By combining the QTL and SNP data, WM7.1 was located to a 660-kb region that contained 41 gene models on the proximal end of chromosome Pv07, while the WM8.3 introgression was narrowed to a 1.36-Mb region containing 70 gene models. The most polymorphic candidate gene in the WM7.1 region encodes a BEACH-domain protein associated with apoptosis. Within the WM8.3 interval, a receptor-like protein with the potential to recognize pathogen effectors was the most polymorphic gene. The use of gene and sequence-based mapping identified two candidate genes whose putative functions are consistent with the current model of pathogenicity.

  14. Maps of open chromatin highlight cell type-restricted patterns of regulatory sequence variation at hematological trait loci

    NARCIS (Netherlands)

    Paul, D.S.; Albers, C.A.; Rendon, A.; Voss, K.; Stephens, J.; Akkerman, J.W.; Algra, A.; Al-Hussani, A.; Allayee, H.; Anni, F.; Asselbergs, F.W.; Attwood, A.; Balkau, B.; Bandinelli, S.; Bastardot, F.; Basu, S.; Baumeister, S.E.; Beckmann, J.; Benyamin, B.; Biino, G.; Bis, J.C.; Bomba, L.; Bonnefond, A.; Boomsma, D.I.; Bradley, J.R.; Cambien, F.; Ciullo, M.; Cookson, W.O.; Cucca, F.; Cvejic, A.; d'Adamo, A.P.; Danesh, J.; Danjou, F.; Das, D.; Davies, G.; de Bakker, P.I.; de Boer, R.A.; de Geus, E.J.C.; Deary, I.J.; Dedoussis, G.V.; Dimitriou, M.; Dina, C.; Döring, A.; Elling, U.; Ellinghaus, D.; Elliott, P.; Engström, G.; Erdmann, J.; Esko, T.; Evans, D.M.; Eyjolfsson, G.I.; Falchi, M.; Feng, W.W.; Ferreira, M.A.; Ferrucci, L.; Fischer, K.; Folsom, A.R.; Fortina, P.; Franke, A.; Franke, L.; Frazer, I.H.; Froguel, P.; Galanello, R.; Ganesh, S.; Garner, S.F.; Gasparini, P.; Genser, B.; Gibson, Q.D.; Gieger, C.; Girotto, G.; Glazer, N.L.; Gögele, M.; Goodall, A.H.; Greinacher, A.; Gudbjartsson, D.F.; Hammond, C.J.; Harris, S.E.; Hartiala, J.; Hartikainen, A.L.; Hazen, S.L.; Heckbert, S.R.; Hedblad, B.; Hengstenberg, C.; Hersch, M.; Hicks, A.A.; Holm, H.; Hottenga, J.J.; Illig, T.; Järvelin, M.R.; Jolley, J.; Jupe, S.; Kähönen, M.; Kamatani, N.; Kanoni, S.; Kema, I.P.; Kemp, J.P.; Khadake, J.; Khaw, K.T.; Kleber, M.E.; Kooner, J.S.; Kovacs, P.; Kühnel, B.; Kyrtsonis, M.C.; Labrune, Y.; Lagou, V.; Langenberg, C.; Lehtimäki, T.; Li, X.; Liang, L.; Lloyd-Jones, H.; Loos, R.J.; Lopez, L.M.; Lumley, T.; Lyytikäinen, L.P.; Maerz, W.; Mägi, R.; Mangino, M.; Martin, N.G.; Maschio, A.; Mateo Leach, I.; McKnight, B.; Meacham, S.; Medland, S.E.; Meisinger, C.; Melander, O.; Memari, Y.; Metspalu, A.; Miller, K.; Mitchell, B.D.; Moffatt, M.F.; Montgomery, G.W.; Moore, C.; Murgia, F.; Nakamura, Y.; Nauck, M.; Navis, G.; Nolte, I.M.; Nöthlings, U.; Nutile, T.; Okada, Y.; Olafsson, I.; Onundarson, P.T.; O'Reilly, P.F.; Parracciani, D.; Parsa, A.; Penninger, J.M.; Penninx, B.W.J.H.; Pirastu, M.; Pirastu, N.; Pistis, G.; Porcu, E.; Portas, L.; Porteous, D.J.; Pouta, A.; Pramstaller, P.P.; Prokopenko, I.; Psaty, B.M.; Pullat, J.; Radhakrishnan, A.; Raitakari, O.; Ramirez-Solis, R.; Ried, J.S.; Ring, S.M.; Robino, A.; Rotter, J.I.; Ruggiero, D.; Ruokonen, A.; Sala, C.; Saluments, A.; Samani, N.J.; Sambrook, J.; Sanna, S.; Schlessinger, D.; Schmidt, C.O.; Schreiber, S; Schunkert, H.; Scott, J.; Sehmi, J.; Serbanovic-Canic, J.; Shin, S.Y.; Shuldiner, A.R.; Sladek, R.; Smit, J.H.; Smith, G.D.; Smith, J.G.; Smith, N.L.; Snieder, H.; Sorice, R.; Spector, T.D.; Starr, J.M.; Stefansson, K.; Stemple, D.; Stumvoll, M.; Sulem, P.; Takahashi, A.; Tan, S.T.; Tanaka, T.; Tang, C.; Tang, W.; Tang, W.H.; Taylor, K.; Tenesa, A.; Teumer, A.; Thein, S.; Thorsteinsdottir, U.; Toniolo, D.; Tönjes, A.; Traglia, M.; Uda, M.; Ulivi, S.; van der Schoot, E.; van Gilst, W.H.; van Pelt, L.J.; van Veldhuisen, D.J.; Verweij, N.; Visscher, P.M.; Völker, U.; Vollenweider, P.; Wareham, N.J.; Wernisch, L.; Westra, H.J.; Whitfield, J.B.; Wichmann, H.E.; Wiggins, K.L.; Willemsen, G.; Winkelmann, B.R.; Wirnsberger, G.; Wolffenbuttel, B.H.; Yang, J.; Yang, T.P.; Zhang, J.H.; Zhao, J.H.; Zitting, P.; Zwaginga, JJ; van der Harst, P.; Chambers, J.C.; Soranzo, N.; Ouwehand, W.H.; Deloukas, P.

    2013-01-01

    Nearly three-quarters of the 143 genetic signals associated with platelet and erythrocyte phenotypes identified by metaanalyses of genome-wide association (GWA) studies are located at non-protein-coding regions. Here, we assessed the role of candidate regulatory variants associated with cell

  15. The pineapple genome and the evolution of CAM photosynthesis.

    Science.gov (United States)

    Ming, Ray; VanBuren, Robert; Wai, Ching Man; Tang, Haibao; Schatz, Michael C; Bowers, John E; Lyons, Eric; Wang, Ming-Li; Chen, Jung; Biggers, Eric; Zhang, Jisen; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Zhang, Jian; Ye, Zhangyao; Miao, Chenyong; Lin, Zhicong; Wang, Hao; Zhou, Hongye; Yim, Won C; Priest, Henry D; Zheng, Chunfang; Woodhouse, Margaret; Edger, Patrick P; Guyot, Romain; Guo, Hao-Bo; Guo, Hong; Zheng, Guangyong; Singh, Ratnesh; Sharma, Anupma; Min, Xiangjia; Zheng, Yun; Lee, Hayan; Gurtowski, James; Sedlazeck, Fritz J; Harkess, Alex; McKain, Michael R; Liao, Zhenyang; Fang, Jingping; Liu, Juan; Zhang, Xiaodan; Zhang, Qing; Hu, Weichang; Qin, Yuan; Wang, Kai; Chen, Li-Yu; Shirley, Neil; Lin, Yann-Rong; Liu, Li-Yu; Hernandez, Alvaro G; Wright, Chris L; Bulone, Vincent; Tuskan, Gerald A; Heath, Katy; Zee, Francis; Moore, Paul H; Sunkar, Ramanjulu; Leebens-Mack, James H; Mockler, Todd; Bennetzen, Jeffrey L; Freeling, Michael; Sankoff, David; Paterson, Andrew H; Zhu, Xinguang; Yang, Xiaohan; Smith, J Andrew C; Cushman, John C; Paull, Robert E; Yu, Qingyi

    2015-12-01

    Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues. CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.

  16. Mass spectrometry for identification of proteins that specifically bind to a distal enhancer of the Oct4 gene

    Science.gov (United States)

    Bakhmet, E. I.; Nazarov, I. B.; Artamonova, T. O.; Khodorkovsky, M. A.; Tomilin, A. N.

    2017-11-01

    Transcription factor Oct4 is a marker of pluripotent stem cells and has a significant role in their self-renewal. Oct4 gene is controlled by three cis-regulatory elements - proximal promoter, proximal enhancer and distal enhancer. All of these elements are targets for binding of regulatory proteins. Distal enhancer is in our research focus because of its activity in early stages of embryonic development. There are two main sequences called site 2A and site 2B that are presented in distal enhancer. For this moment proteins which bind to a site 2A (CCCCTCCCCCC) remain unknown. Using combination of in vitro method electrophoretic mobility shift assay (EMSA) and mass spectromery we identified several candidates that can regulate Oct4 gene expression through site 2A.

  17. Prdm1a and miR-499 act sequentially to restrict Sox6 activity to the fast-twitch muscle lineage in the zebrafish embryo.

    Science.gov (United States)

    Wang, XinGang; Ono, Yosuke; Tan, Swee Chuan; Chai, Ruth JinFen; Parkin, Caroline; Ingham, Philip W

    2011-10-01

    Sox6 has been proposed to play a conserved role in vertebrate skeletal muscle fibre type specification. In zebrafish, sox6 transcription is repressed in slow-twitch progenitors by the Prdm1a transcription factor. Here we identify sox6 cis-regulatory sequences that drive fast-twitch-specific expression in a Prdm1a-dependent manner. We show that sox6 transcription subsequently becomes derepressed in slow-twitch fibres, whereas Sox6 protein remains restricted to fast-twitch fibres. We find that translational repression of sox6 is mediated by miR-499, the slow-twitch-specific expression of which is in turn controlled by Prdm1a, forming a regulatory loop that initiates and maintains the slow-twitch muscle lineage.

  18. RNA-Mediated cis Regulation in Acinetobacter baumannii Modulates Stress-Induced Phenotypic Variation.

    Science.gov (United States)

    Ching, Carly; Gozzi, Kevin; Heinemann, Björn; Chai, Yunrong; Godoy, Veronica G

    2017-06-01

    In the nosocomial opportunistic pathogen Acinetobacter baumannii , RecA-dependent mutagenesis, which causes antibiotic resistance acquisition, is linked to the DNA damage response (DDR). Notably, unlike the Escherichia coli paradigm, recA and DDR gene expression in A. baumannii is bimodal. Namely, there is phenotypic variation upon DNA damage, which may provide a bet-hedging strategy for survival. Thus, understanding recA gene regulation is key to elucidate the yet unknown DDR regulation in A. baumannii Here, we identify a structured 5' untranslated region (UTR) in the recA transcript which serves as a cis -regulatory element. We show that a predicted stem-loop structure in this 5' UTR affects mRNA half-life and underlies bimodal gene expression and thus phenotypic variation in response to ciprofloxacin treatment. We furthermore show that the stem-loop structure of the recA 5' UTR influences intracellular RecA protein levels and, in vivo , impairing the formation of the stem-loop structure of the recA 5' UTR lowers cell survival of UV treatment and decreases rifampin resistance acquisition from DNA damage-induced mutagenesis. We hypothesize that the 5' UTR allows for stable recA transcripts during stress, including antibiotic treatment, enabling cells to maintain suitable RecA levels for survival. This innovative strategy to regulate the DDR in A. baumannii may contribute to its success as a pathogen. IMPORTANCE Acinetobacter baumannii is an opportunistic pathogen quickly gaining antibiotic resistances. Mutagenesis and antibiotic resistance acquisition are linked to the DNA damage response (DDR). However, how the DDR is regulated in A. baumannii remains unknown, since unlike most bacteria, A. baumannii does not follow the regulation of the Escherichia coli paradigm. In this study, we have started to uncover the mechanisms regulating the novel A. baumannii DDR. We have found that a cis -acting 5' UTR regulates recA transcript stability, RecA protein levels, and DNA

  19. Two estrogen response element sequences near the PCNA gene are not responsible for its estrogen-enhanced expression in MCF7 cells.

    Science.gov (United States)

    Wang, Cheng; Yu, Jie; Kallen, Caleb B

    2008-01-01

    The proliferating cell nuclear antigen (PCNA) is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE) sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2) enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2. Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays. We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.

  20. Cis By Trans

    OpenAIRE

    Rodovalho,Amara Moira

    2017-01-01

    Cis, trans: above all, metaphors. Cisjordan, region skirting the Jordan River. Cisplatin, Uruguay’s ancient name, region occupying one of the banks of the Prata River. Trans- Amazonian, that which crosses the Amazon; transatlantic, that which crosses the Atlantic. Cisalpine, transalpine. The geometric isomerism of Organic Chemistry, where “cis” are atoms that, when molecules are divided in half, remain on the same side, and “trans” those remaining on opposite sides. Ev...

  1. A novel radiation responsive cis-acting element regulates gene induction and mediates tissue injury

    International Nuclear Information System (INIS)

    Hallahan, Dennis E.; Virudachalam, Subbulakshmi; Kuchibahtla, Jaya

    1997-01-01

    containing binding domains for the transcription factors AP-1 and Ets. This DNA sequence (TGCCTCAGTTTCCC) is similar to antioxidant responsive element. X-ray- mediated transcriptional activation of the 5' regulatory region of ICAM-1 required the antioxidant responsive element (ARE). Electrophoretic mobility shift analysis of nuclear proteins from irradiated endothelial cells incubated with the ARE binding domain (5'-GCTGCTGCCTCAGTTTCCC-3') showed increased protein-DNA complexes at 60 and 120 minutes after irradiation. Conclusions: 1) ICAM induction in irradiated tissue occurs in the microvascular endothelium. 2) ICAM expression contributes to the pathogenesis of radiation-mediated tissue injury and the ICAM knockout serves as a model for the study of the pathogenesis of tissue injury. 3) ICAM expression is regulated by a novel radiation-inducible cis-acting element that has homology to previously identified antioxidant responsive elements

  2. Identification of lignin genes and regulatory sequences involved in secondary cell wall formation in Acacia auriculiformis and Acacia mangium via de novo transcriptome sequencing

    Directory of Open Access Journals (Sweden)

    Cannon Charles H

    2011-07-01

    Full Text Available Abstract Background Acacia auriculiformis × Acacia mangium hybrids are commercially important trees for the timber and pulp industry in Southeast Asia. Increasing pulp yield while reducing pulping costs are major objectives of tree breeding programs. The general monolignol biosynthesis and secondary cell wall formation pathways are well-characterized but genes in these pathways are poorly characterized in Acacia hybrids. RNA-seq on short-read platforms is a rapid approach for obtaining comprehensive transcriptomic data and to discover informative sequence variants. Results We sequenced transcriptomes of A. auriculiformis and A. mangium from non-normalized cDNA libraries synthesized from pooled young stem and inner bark tissues using paired-end libraries and a single lane of an Illumina GAII machine. De novo assembly produced a total of 42,217 and 35,759 contigs with an average length of 496 bp and 498 bp for A. auriculiformis and A. mangium respectively. The assemblies of A. auriculiformis and A. mangium had a total length of 21,022,649 bp and 17,838,260 bp, respectively, with the largest contig 15,262 bp long. We detected all ten monolignol biosynthetic genes using Blastx and further analysis revealed 18 lignin isoforms for each species. We also identified five contigs homologous to R2R3-MYB proteins in other plant species that are involved in transcriptional regulation of secondary cell wall formation and lignin deposition. We searched the contigs against public microRNA database and predicted the stem-loop structures of six highly conserved microRNA families (miR319, miR396, miR160, miR172, miR162 and miR168 and one legume-specific family (miR2086. Three microRNA target genes were predicted to be involved in wood formation and flavonoid biosynthesis. By using the assemblies as a reference, we discovered 16,648 and 9,335 high quality putative Single Nucleotide Polymorphisms (SNPs in the transcriptomes of A. auriculiformis and A. mangium

  3. Functional characterization of tobacco transcription factor TGA2.1

    DEFF Research Database (Denmark)

    Kegler, C.; Lenk, I.; Krawczyk, S.

    2004-01-01

    Activation sequence-1 (as-1)-like regulatory cis elements mediate transcriptional activation in response to increased levels of plant signalling molecules auxin and salicylic acid (SA). Our earlier work has shown that tobacco cellular as-1-binding complex SARP (salicylic acid responsive protein...

  4. Identification of microRNAs from Eugenia uniflora by high-throughput sequencing and bioinformatics analysis.

    Science.gov (United States)

    Guzman, Frank; Almerão, Mauricio P; Körbes, Ana P; Loss-Morais, Guilherme; Margis, Rogerio

    2012-01-01

    microRNAs or miRNAs are small non-coding regulatory RNAs that play important functions in the regulation of gene expression at the post-transcriptional level by targeting mRNAs for degradation or inhibiting protein translation. Eugenia uniflora is a plant native to tropical America with pharmacological and ecological importance, and there have been no previous studies concerning its gene expression and regulation. To date, no miRNAs have been reported in Myrtaceae species. Small RNA and RNA-seq libraries were constructed to identify miRNAs and pre-miRNAs in Eugenia uniflora. Solexa technology was used to perform high throughput sequencing of the library, and the data obtained were analyzed using bioinformatics tools. From 14,489,131 small RNA clean reads, we obtained 1,852,722 mature miRNA sequences representing 45 conserved families that have been identified in other plant species. Further analysis using contigs assembled from RNA-seq allowed the prediction of secondary structures of 25 known and 17 novel pre-miRNAs. The expression of twenty-seven identified miRNAs was also validated using RT-PCR assays. Potential targets were predicted for the most abundant mature miRNAs in the identified pre-miRNAs based on sequence homology. This study is the first large scale identification of miRNAs and their potential targets from a species of the Myrtaceae family without genomic sequence resources. Our study provides more information about the evolutionary conservation of the regulatory network of miRNAs in plants and highlights species-specific miRNAs.

  5. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia

    Science.gov (United States)

    Puente, Xose S.; Pinyol, Magda; Quesada, Víctor; Conde, Laura; Ordóñez, Gonzalo R.; Villamor, Neus; Escaramis, Georgia; Jares, Pedro; Beà, Sílvia; González-Díaz, Marcos; Bassaganyas, Laia; Baumann, Tycho; Juan, Manel; López-Guerra, Mónica; Colomer, Dolors; Tubío, José M. C.; López, Cristina; Navarro, Alba; Tornador, Cristian; Aymerich, Marta; Rozman, María; Hernández, Jesús M.; Puente, Diana A.; Freije, José M. P.; Velasco, Gloria; Gutiérrez-Fernández, Ana; Costa, Dolors; Carrió, Anna; Guijarro, Sara; Enjuanes, Anna; Hernández, Lluís; Yagüe, Jordi; Nicolás, Pilar; Romeo-Casabona, Carlos M.; Himmelbauer, Heinz; Castillo, Ester; Dohm, Juliane C.; de Sanjosé, Silvia; Piris, Miguel A.; de Alava, Enrique; Miguel, Jesús San; Royo, Romina; Gelpí, Josep L.; Torrents, David; Orozco, Modesto; Pisano, David G.; Valencia, Alfonso; Guigó, Roderic; Bayés, Mónica; Heath, Simon; Gut, Marta; Klatt, Peter; Marshall, John; Raine, Keiran; Stebbings, Lucy A.; Futreal, P. Andrew; Stratton, Michael R.; Campbell, Peter J.; Gut, Ivo; López-Guillermo, Armando; Estivill, Xavier; Montserrat, Emili; López-Otín, Carlos; Campo, Elías

    2012-01-01

    Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution1,2. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes3,4. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer. PMID:21642962

  6. Genomic Aberrations in Crizotinib Resistant Lung Adenocarcinoma Samples Identified by Transcriptome Sequencing.

    Directory of Open Access Journals (Sweden)

    Ali Saber

    Full Text Available ALK-break positive non-small cell lung cancer (NSCLC patients initially respond to crizotinib, but resistance occurs inevitably. In this study we aimed to identify fusion genes in crizotinib resistant tumor samples. Re-biopsies of three patients were subjected to paired-end RNA sequencing to identify fusion genes using deFuse and EricScript. The IGV browser was used to determine presence of known resistance-associated mutations. Sanger sequencing was used to validate fusion genes and digital droplet PCR to validate mutations. ALK fusion genes were detected in all three patients with EML4 being the fusion partner. One patient had no additional fusion genes. Another patient had one additional fusion gene, but without a predicted open reading frame (ORF. The third patient had three additional fusion genes, of which two were derived from the same chromosomal region as the EML4-ALK. A predicted ORF was identified only in the CLIP4-VSNL1 fusion product. The fusion genes validated in the post-treatment sample were also present in the biopsy before crizotinib. ALK mutations (p.C1156Y and p.G1269A detected in the re-biopsies of two patients, were not detected in pre-treatment biopsies. In conclusion, fusion genes identified in our study are unlikely to be involved in crizotinib resistance based on presence in pre-treatment biopsies. The detection of ALK mutations in post-treatment tumor samples of two patients underlines their role in crizotinib resistance.

  7. Whole-Exome Sequencing Identifies One De Novo Variant in the FGD6 Gene in a Thai Family with Autism Spectrum Disorder

    Directory of Open Access Journals (Sweden)

    Chuphong Thongnak

    2018-01-01

    Full Text Available Autism spectrum disorder (ASD has a strong genetic basis, although the genetics of autism is complex and it is unclear. Genetic testing such as microarray or sequencing was widely used to identify autism markers, but they are unsuccessful in several cases. The objective of this study is to identify causative variants of autism in two Thai families by using whole-exome sequencing technique. Whole-exome sequencing was performed with autism-affected children from two unrelated families. Each sample was sequenced on SOLiD 5500xl Genetic Analyzer system followed by combined bioinformatics pipeline including annotation and filtering process to identify candidate variants. Candidate variants were validated, and the segregation study with other family members was performed using Sanger sequencing. This study identified a possible causative variant for ASD, c.2951G>A, in the FGD6 gene. We demonstrated the potential for ASD genetic variants associated with ASD using whole-exome sequencing and a bioinformatics filtering procedure. These techniques could be useful in identifying possible causative ASD variants, especially in cases in which variants cannot be identified by other techniques.

  8. Plasticity and innovation of regulatory mechanisms underlying seed oil content mediated by duplicated genes in the palaeopolyploid soybean.

    Science.gov (United States)

    Zhang, Dajian; Zhao, Meixia; Li, Shuai; Sun, Lianjun; Wang, Weidong; Cai, Chunmei; Dierking, Emily C; Ma, Jianxin

    2017-06-01

    Many plants have undergone whole genome duplication (WGD). However, how regulatory networks underlying a particular trait are reshaped in polyploids has not been experimentally investigated. Here we show that the regulatory pathways modulating seed oil content, which involve WRINKLED1 (WRI1), LEAFY COTYLEDON1 (LEC1), and LEC2 in Arabidopsis, have been modified in the palaeopolyploid soybean. Such modifications include functional reduction of GmWRI1b of the GmWRI1a/GmWRI1b homoeologous pair relevant to WRI1, complementary non-allelic dosage effects of the GmLEC1a/GmLEC1b homoeologous pair relevant to LEC1, pseudogenization of the singleton GmLEC2 relevant to LEC2, and the rise of the LEC2-like function of GmABI3b, contrasting to its homoeolog GmABI3a, which maintains the ABSCISIC ACID INSENSITIVE 3 (ABI3)-like function in modulating seed maturation and dormancy. The function of GmABI3b in modulating seed oil biosynthesis was fulfilled by direct binding to a RY (CATGCA) cis-regulatory element in the GmWRI1a promoter, which was absent in the GmWRI1b promoter, resulting in reduction of the GmWRI1b expression. Nevertheless, the three regulators each exhibited similar intensities of purifying selection to their respective duplicates since these pairs were formed by a WGD event that is proposed to have occurred approximately 13 million years ago (mya), suggesting that the differentiation in spatiotemporal expression between the duplicated genes is more likely to be the outcome of neutral variation in regulatory sequences. This study thus exemplifies the plasticity, dynamics, and novelty of regulatory networks mediated by WGD. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  9. Communications and Information Sharing (CIS) Laboratory

    Data.gov (United States)

    Federal Laboratory Consortium — TheCommunications and Information Sharing (CIS) Laboratory is a Public Safety interoperable communications technology laboratory with analog and digital radios, and...

  10. Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Directory of Open Access Journals (Sweden)

    Sakaki Yoshiyuki

    2004-02-01

    Full Text Available Abstract Background Gene expression is regulated mainly by transcription factors (TFs that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS using position weight matrices (PWMs that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. Also unclear is the identification of TFs that have TFBS concentrated in promoters and to what level this occurs. This study hopes to answer some of these questions. Results We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group occupied 11 percent. After reducing the effect of CpG islands (CGI against the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster, we identified two PWM groups including those strongly correlated with CGI and those not correlated with CGI. Conclusion Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1 those that show TFBS clustered in promoters associated with CGI, and (2 those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in

  11. Deep sequencing of Salmonella RNA associated with heterologous Hfq proteins in vivo reveals small RNAs as a major target class and identifies RNA processing phenotypes.

    Science.gov (United States)

    Sittka, Alexandra; Sharma, Cynthia M; Rolle, Katarzyna; Vogel, Jörg

    2009-01-01

    The bacterial Sm-like protein, Hfq, is a key factor for the stability and function of small non-coding RNAs (sRNAs) in Escherichia coli. Homologues of this protein have been predicted in many distantly related organisms yet their functional conservation as sRNA-binding proteins has not entirely been clear. To address this, we expressed in Salmonella the Hfq proteins of two eubacteria (Neisseria meningitides, Aquifex aeolicus) and an archaeon (Methanocaldococcus jannaschii), and analyzed the associated RNA by deep sequencing. This in vivo approach identified endogenous Salmonella sRNAs as a major target of the foreign Hfq proteins. New Salmonella sRNA species were also identified, and some of these accumulated specifically in the presence of a foreign Hfq protein. In addition, we observed specific RNA processing defects, e.g., suppression of precursor processing of SraH sRNA by Methanocaldococcus Hfq, or aberrant accumulation of extracytoplasmic target mRNAs of the Salmonella GcvB, MicA or RybB sRNAs. Taken together, our study provides evidence of a conserved inherent sRNA-binding property of Hfq, which may facilitate the lateral transmission of regulatory sRNAs among distantly related species. It also suggests that the expression of heterologous RNA-binding proteins combined with deep sequencing analysis of RNA ligands can be used as a molecular tool to dissect individual steps of RNA metabolism in vivo.

  12. Effect of Oxygen on Verbenone Conversion From cis-Verbenol by Gut Facultative Anaerobes of Dendroctonus valens

    Directory of Open Access Journals (Sweden)

    Qingjie Cao

    2018-03-01

    Full Text Available Since its introduction from North America, Dendroctonus valens LeConte has become a destructive forest pest in China. Although gut aerobic bacteria have been investigated and some are implicated in beetle pheromone production, little is known about the abundance and significance of facultative anaerobic bacteria in beetle gut, especially with regards to effects of oxygen on their role in pheromone production. In this study, we isolated and identified gut bacteria of D. valens adults in an anaerobic environment, and further compared their ability to convert cis-verbenol into verbenone (a multi-functional pheromone of D. valens under different O2 concentrations. Pantoea conspicua, Enterobacter xiangfangensis, Staphylococcus warneri were the most frequently isolated species among the total of 10 species identified from beetle gut in anaerobic conditions. Among all isolated species, nine were capable of cis-verbenol to verbenone conversion, and the conversion efficiency increased with increased oxygen concentration. This O2-mediated conversion of cis-verbenol to verbenone suggests that gut facultative anaerobes of D. valens might play an important role in the frass, where there is higher exposure to oxygen, hence the higher verbenone production. This claim is further supported by distinctly differential oxygen concentrations between gut and frass of D. valens females.

  13. Molecular defects identified by whole exome sequencing in a child with Fanconi anemia.

    Science.gov (United States)

    Zheng, Zhaojing; Geng, Juan; Yao, Ru-En; Li, Caihua; Ying, Daming; Shen, Yongnian; Ying, Lei; Yu, Yongguo; Fu, Qihua

    2013-11-10

    Fanconi anemia is a rare genetic disease characterized by bone marrow failure, multiple congenital malformations, and an increased susceptibility to malignancy. At least 15 genes have been identified that are involved in the pathogenesis of Fanconi anemia. However, it is still a challenge to assign the complementation group and to characterize the molecular defects in patients with Fanconi anemia. In the current study, whole exome sequencing was used to identify the affected gene(s) in a boy with Fanconi anemia. A recurring, non-synonymous mutation was found (c.3971C>T, p.P1324L) as well as a novel frameshift mutation (c.989_995del, p.H330LfsX2) in FANCA gene. Our results indicate that whole exome sequencing may be useful in clinical settings for rapid identification of disease-causing mutations in rare genetic disorders such as Fanconi anemia. © 2013 Elsevier B.V. All rights reserved.

  14. Predicting transcription factor binding sites using local over-representation and comparative genomics

    Directory of Open Access Journals (Sweden)

    Touzet Hélène

    2006-08-01

    Full Text Available Abstract Background Identifying cis-regulatory elements is crucial to understanding gene expression, which highlights the importance of the computational detection of overrepresented transcription factor binding sites (TFBSs in coexpressed or coregulated genes. However, this is a challenging problem, especially when considering higher eukaryotic organisms. Results We have developed a method, named TFM-Explorer, that searches for locally overrepresented TFBSs in a set of coregulated genes, which are modeled by profiles provided by a database of position weight matrices. The novelty of the method is that it takes advantage of spatial conservation in the sequence and supports multiple species. The efficiency of the underlying algorithm and its robustness to noise allow weak regulatory signals to be detected in large heterogeneous data sets. Conclusion TFM-Explorer provides an efficient way to predict TFBS overrepresentation in related sequences. Promising results were obtained in a variety of examples in human, mouse, and rat genomes. The software is publicly available at http://bioinfo.lifl.fr/TFM-Explorer.

  15. EIA for mining projects in the CIS

    Energy Technology Data Exchange (ETDEWEB)

    Coppin, N.J.; Wheeler, P. [Wardell Armstrong, Newcastle under Lyme (United Kingdom)

    1996-12-31

    This paper examines the Environmental Impact Assessment (EIA) requirements and procedures encountered during work on gold and coal mining projects in Kazakhstan, Uzbekistan and Mongolia. Observations on the implementation of former-Soviet inspired EIA in the Commonwealth of Independent States (CIS), and the differences with North American and European requirements and procedures are highlighted, particularly where these indicate lessons for the West. The main implications for mining companies considering or developing projects in the CIS are discussed, particularly the procedures that have to be followed for environmental permitting. 2 figs.

  16. Quantitative analysis of polycomb response elements (PREs at identical genomic locations distinguishes contributions of PRE sequence and genomic environment

    Directory of Open Access Journals (Sweden)

    Okulski Helena

    2011-03-01

    Full Text Available Abstract Background Polycomb/Trithorax response elements (PREs are cis-regulatory elements essential for the regulation of several hundred developmentally important genes. However, the precise sequence requirements for PRE function are not fully understood, and it is also unclear whether these elements all function in a similar manner. Drosophila PRE reporter assays typically rely on random integration by P-element insertion, but PREs are extremely sensitive to genomic position. Results We adapted the ΦC31 site-specific integration tool to enable systematic quantitative comparison of PREs and sequence variants at identical genomic locations. In this adaptation, a miniwhite (mw reporter in combination with eye-pigment analysis gives a quantitative readout of PRE function. We compared the Hox PRE Frontabdominal-7 (Fab-7 with a PRE from the vestigial (vg gene at four landing sites. The analysis revealed that the Fab-7 and vg PREs have fundamentally different properties, both in terms of their interaction with the genomic environment at each site and their inherent silencing abilities. Furthermore, we used the ΦC31 tool to examine the effect of deletions and mutations in the vg PRE, identifying a 106 bp region containing a previously predicted motif (GTGT that is essential for silencing. Conclusions This analysis showed that different PREs have quantifiably different properties, and that changes in as few as four base pairs have profound effects on PRE function, thus illustrating the power and sensitivity of ΦC31 site-specific integration as a tool for the rapid and quantitative dissection of elements of PRE design.

  17. Impaired RNA splicing of 5'-regulatory sequences of the astroglial glutamate transporter EAAT2 in human astrocytoma

    NARCIS (Netherlands)

    Münch, C.; Penndorf, A.; Schwalenstöcker, B.; Troost, D.; Ludolph, A. C.; Ince, P.; Meyer, T.

    2001-01-01

    A loss of the glutamate transporter EAAT2 has been reported in the neoplastic transformation of astrocytic cells and astrocytoma. The RNA expression of EAAT2 and five 5'-regulatory splice variants was investigated to identify alterations of the post-transcriptional EAAT2 gene regulation in human

  18. MetaPhinder-Identifying Bacteriophage Sequences in Metagenomic Data Sets

    DEFF Research Database (Denmark)

    Jurtz, Vanessa Isabell; Villarroel, Julia; Lund, Ole

    2016-01-01

    genome structure of many bacteriophages. The method is demonstrated to outperform both BLAST methods based on single hits and methods based on k-mer comparisons. MetaPhinder is available as a web service at the Center for Genomic Epidemiology https://cge.cbs.dtu.dk/services/MetaPhinder/, while the source...... and understand them. Here we present MetaPhinder, a method to identify assembled genomic fragments (i.e. contigs) of phage origin in metage-nomic data sets. The method is based on a comparison to a database of whole genome bacteriophage sequences, integrating hits to multiple genomes to accomodate for the mosaic...... code can be downloaded from https://bitbucket.org/genomicepidemiology/metaphinder or https://github.com/vanessajurtz/MetaPhinder....

  19. Exome sequencing identifies three novel candidate genes implicated in intellectual disability.

    Directory of Open Access Journals (Sweden)

    Zehra Agha

    Full Text Available Intellectual disability (ID is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K-specific methyltransferase 2B (KMT2B, zinc finger protein 589 (ZNF589, as well as hedgehog acyltransferase (HHAT with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID.

  20. Identification and Characterization of 5′ Untranslated Regions (5′UTRs in Zymomonas mobilis as Regulatory Biological Parts

    Directory of Open Access Journals (Sweden)

    Seung Hee Cho

    2017-12-01

    Full Text Available Regulatory RNA regions within a transcript, particularly in the 5′ untranslated region (5′UTR, have been shown in a variety of organisms to control the expression levels of these mRNAs in response to various metabolites or environmental conditions. Considering the unique tolerance of Zymomonas mobilis to ethanol and the growing interest in engineering microbial strains with enhanced tolerance to industrial inhibitors, we searched natural cis-regulatory regions in this microorganism using transcriptomic data and bioinformatics analysis. Potential regulatory 5′UTRs were identified and filtered based on length, gene function, relative gene counts, and conservation in other organisms. An in vivo fluorescence-based screening system was developed to confirm the responsiveness of 36 5′UTR candidates to ethanol, acetate, and xylose stresses. UTR_ZMO0347 (5′UTR of gene ZMO0347 encoding the RNA binding protein Hfq was found to down-regulate downstream gene expression under ethanol stress. Genomic deletion of UTR_ZMO0347 led to a general decrease of hfq expression at the transcript level and increased sensitivity for observed changes in Hfq expression at the protein level. The role of UTR_ZMO0347 and other 5′UTRs gives us insight into the regulatory network of Z. mobilis in response to stress and unlocks new strategies for engineering robust industrial strains as well as for harvesting novel responsive regulatory biological parts for controllable gene expression platforms in this organism.

  1. Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

    Science.gov (United States)

    Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

    2014-04-08

    The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.

  2. ECONOMIC AND LEGAL ASPECTS OF CREATION OF AN INTERNATIONAL FINANCIAL CENTER AS IMPORTANT CIS AND EVRASES COUNTRIES INTERGATION FACTOR

    Directory of Open Access Journals (Sweden)

    V. S. Balabanov

    2011-01-01

    Full Text Available The process of forming in Moscow a regional (to be in future transformed into a global international Single Economic Space (SES financial center should become for the Commonwealth of Independent States (CIS and Euroasian Economic Community (EvrAsES countries an universal integration instrument to be used to create their common economic and commercial space. The international center along with SES national financial centers will form an internationally competitive polycentric financial network with single institutional (regulatory, law, customs, etc.agreements. A mechanism should be formed to attract countries outside Customs Union to participate in creation of the international financial center.

  3. Novel expressed sequences identified in a model of androgen independent prostate cancer

    Directory of Open Access Journals (Sweden)

    Jones Steven JM

    2007-01-01

    Full Text Available Abstract Background Prostate cancer is the most frequently diagnosed cancer in American men, and few effective treatment options are available to patients who develop hormone-refractory prostate cancer. The molecular changes that occur to allow prostate cells to proliferate in the absence of androgens are not fully understood. Results Subtractive hybridization experiments performed with samples from an in vivo model of hormonal progression identified 25 expressed sequences representing novel human transcripts. Intriguingly, these 25 sequences have small open-reading frames and are not highly conserved through evolution, suggesting many of these novel expressed sequences may be derived from untranslated regions of novel transcripts or from non-coding transcripts. Examination of a large metalibrary of human Serial Analysis of Gene Expression (SAGE tags demonstrated that only three of these novel sequences had been previously detected. RT-PCR experiments confirmed that the 6 sequences tested were expressed in specific human tissues, as well as in clinical samples of prostate cancer. Further RT-PCR experiments for five of these fragments indicated they originated from large untranslated regions of unannotated transcripts. Conclusion This study underlines the value of using complementary techniques in the annotation of the human genome. The tissue-specific expression of 4 of the 6 clones tested indicates the expression of these novel transcripts is tightly regulated, and future work will determine the possible role(s these novel transcripts may play in the progression of prostate cancer.

  4. Mechanically induced cis to trans reisomerization of azobenzene

    Science.gov (United States)

    Turansky, Robert; Konopka, Martin; Stich, Ivan; Marx, Dominik

    2007-03-01

    Using density functional techniques we study mechanochemistry of the azobenzene molecule. Azobenzene is an optically switchable molecule. Laser light is normally used to achieve molecular switching between the cis and trans isomers. We use mechanochemistry to achieve the switching. Thiolate-gold bond can used to exert mechanical energy on the molecule bonded between two gold electrodes in static AFM apparatus. Our model consists of two realistic gold electrodes bridged by dithioazobenzene. We find that pulling the transisomer leads just to formation of gold nanowires and mechanical breakage of the electrodes. However, mechanochemistry with modest applied forces leads to cis trans reisomerization via rotation mechanism. Contrary, use of simple constraints instead of realistic gold electrodes, leads to cis trans reisomerization, albeit with significantly larger applied forces and via inversion mechanism. Important experimental and theoretical ramifications of these simulations will be discussed.

  5. The effect of cis-diammine dichloro platinum(II) on radiation injury in the rat bowel

    International Nuclear Information System (INIS)

    Lee, Kyung Ja; Rhee, Chung Sik

    1995-01-01

    This experimental study was performed for evaluate the effects of cis-diamminedichloroplatinum(II) (cis-DDP) on the radiation injury of rat bowel by histopathologic changes. Rats were exposed to entire abdomen by a single doses of X-ray(6-10 Gy) without or with cis-DDP(2.5mg/kg). Rats were divided into 3 groups such as radiation alone, cis-DDP alone and combined group. In combined group, cis-DDP was given 30 minutes before or immediately after irradiation. Cis-DDP induced the inflammatory cell infiltrations with focal necrosis of the mucosa in the small bowel and no abnormal change in the large bowel. In radiation alone group, mucosal necrosis, submucosal fibrosis and muscular necrosis were prominent changes in small bowel and submucosal fibrosis in the large bowel. The submucosal fibrosis in the small bowel was appeared in 10 Gy of radiation alone group and 8 Gy of cis-DDP infusion after radiation and 6 Gy of cis-DDP infusion before radiation of combined group. In the large bowel, submucosal fibrosis was noted in 8 Gy of radiation alone group 8 Gy of cis-DDP infusion after radiation and 6 Gy of cis-DDP infusion before radiation of combined group. In the small bowel, the enhancement ratio was 1.67 in a group of cis-DDP infusion before radiation and 1.25 in group of cis-DDP infusion after radiation as the end point was the submucosal fibrosis. In the large bowel, the enhancement ratio was 1.33 in a group of cis-DDP infusion before radiation and 1.0 in a group of cis-DDP infusion after radiation as the end point was the submucosal fibrosis. This study suggested that cis-DDP enhance the radiation effect in the small and large bowel especially when cis-DDP was infused before radiation

  6. Multiple cis-acting elements involved in up-regulation of a cytochrome P450 gene conferring resistance to deltamethrin in smal brown planthopper, Laodelphax striatellus (Fallén).

    Science.gov (United States)

    Pu, Jian; Sun, Haina; Wang, Jinda; Wu, Min; Wang, Kangxu; Denholm, Ian; Han, Zhaojun

    2016-11-01

    As well as arising from single point mutations in binding sites or detoxifying enzymes, it is likely that insecticide resistance mechanisms are frequently controlled by multiple genetic factors, resulting in resistance being inherited as a quantitative trait. However, empirical evidence for this is still rare. Here we analyse the causes of up-regulation of CYP6FU1, a monoxygenase implicated in resistance to deltamethrin in the rice pest Laodelphax striatellus. The 5'-flanking region of this gene was cloned and sequenced from individuals of a susceptible and a resistant strain. A luminescent reporter assay was used to evaluate different 5'-flanking regions and their fragments for promoter activity. Mutations enhancing promoter activity in various fragments were characterized, singly and in combination, by site mutation recovery. Nucleotide diversity in flanking sequences was greatly reduced in deltamethrin-resistant insects compared to susceptible ones. Phylogenetic sequence analysis found that CYP6FU1 had five different types of 5'-flanking region. All five types were present in a susceptible strain but only a single type showing the highest promoter activity was present in a resistant strain. Four cis-acting elements were identified whose influence on up-regulation was much more pronounced in combination than when present singly. Of these, two were new transcription factor (TF) binding sites produced by mutations, another one was also a new TF binding site alternated from an existing one, and the fourth was a unique transcription start site. These results demonstrate that multiple cis-acting elements are involved in up-regulating CYP6FU1 to generate a resistance phenotype. Copyright © 2016 Elsevier Ltd. All rights reserved.

  7. Oxidation reactions of derivatives of cis-octalins promoted by thallium trinitrate (TTN); Reacoes de oxidacao de cis-octalinas promovidas por trinitrato de talio (TTN)

    Energy Technology Data Exchange (ETDEWEB)

    Ferraz, Helena M.C.; Carneiro, Vania M.T.; Vieira, Tiago O.; Silva Junior, Luiz F. [Universidade de Sao Paulo (USP), SP (Brazil). Inst. de Quimica]. E-mail: luizfsjr@iq.usp.br

    2008-07-01

    The reaction of ten cis-octalins and cis-octalones with thallium trinitrate (TTN) leads to different products, depending mainly on the substitution pattern of the substrate. Functionalized cis-hydrindanes were obtained from the reaction of 1,2,3,4,4a,5,8,8a-octahydro- 4a-methylnaphthalene and of 1,2,3,4,4a,5,8,8a-octahydro-4a,7-dimethylnaphthalene with TTN in acetonitrile, whereas a cyclic ether was formed treating 1,2,3,4,4a,5,8,8a-octahydro-6,8a-dimethylnaphthalene-1-ol with TTN in trimethylorthoformate (TMOF). (author)

  8. X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

    OpenAIRE

    Hu, H.; Haas, S.A.; Chelly, J.; Van Esch, H.; Raynaud, M.; de Brouwer, A.P.M.; Weinert, S.; Froyen, G.; Frints, S.G.M.; Laumonnier, F.; Zemojtel, T.; Love, M.I.; Richard, H.; Emde, A.K.; Bienek, M.

    2016-01-01

    X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of ...

  9. A Novel Prosthetic Joint Infection Pathogen, Mycoplasma salivarium, Identified by Metagenomic Shotgun Sequencing.

    Science.gov (United States)

    Thoendel, Matthew; Jeraldo, Patricio; Greenwood-Quaintance, Kerryl E; Chia, Nicholas; Abdel, Matthew P; Steckelberg, James M; Osmon, Douglas R; Patel, Robin

    2017-07-15

    Defining the microbial etiology of culture-negative prosthetic joint infection (PJI) can be challenging. Metagenomic shotgun sequencing is a new tool to identify organisms undetected by conventional methods. We present a case where metagenomics was used to identify Mycoplasma salivarium as a novel PJI pathogen in a patient with hypogammaglobulinemia. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: journals.permissions@oup.com.

  10. Transcriptional regulation by competing transcription factor modules.

    Directory of Open Access Journals (Sweden)

    Rutger Hermsen

    2006-12-01

    Full Text Available Gene regulatory networks lie at the heart of cellular computation. In these networks, intracellular and extracellular signals are integrated by transcription factors, which control the expression of transcription units by binding to cis-regulatory regions on the DNA. The designs of both eukaryotic and prokaryotic cis-regulatory regions are usually highly complex. They frequently consist of both repetitive and overlapping transcription factor binding sites. To unravel the design principles of these promoter architectures, we have designed in silico prokaryotic transcriptional logic gates with predefined input-output relations using an evolutionary algorithm. The resulting cis-regulatory designs are often composed of modules that consist of tandem arrays of binding sites to which the transcription factors bind cooperatively. Moreover, these modules often overlap with each other, leading to competition between them. Our analysis thus identifies a new signal integration motif that is based upon the interplay between intramodular cooperativity and intermodular competition. We show that this signal integration mechanism drastically enhances the capacity of cis-regulatory domains to integrate signals. Our results provide a possible explanation for the complexity of promoter architectures and could be used for the rational design of synthetic gene circuits.

  11. Dcode.org anthology of comparative genomic tools.

    Science.gov (United States)

    Loots, Gabriela G; Ovcharenko, Ivan

    2005-07-01

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the non-coding encryption of gene regulation across genomes. To facilitate the practical application of comparative sequence analysis to genetics and genomics, we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools, zPicture and Mulan; a phylogenetic shadowing tool, eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools, rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, Creme 2.0; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here, we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ website.

  12. The Complexity of Posttranscriptional Small RNA Regulatory Networks Revealed by In Silico Analysis of Gossypium arboreum L. Leaf, Flower and Boll Small Regulatory RNAs.

    Directory of Open Access Journals (Sweden)

    Hongtao Hu

    Full Text Available MicroRNAs (miRNAs and secondary small interfering RNAs (principally phased siRNAs or trans-acting siRNAs are two distinct subfamilies of small RNAs (sRNAs that are emerging as key regulators of posttranscriptional gene expression in plants. Both miRNAs and secondary-siRNAs (sec-siRNAs are processed from longer RNA precursors by DICER-LIKE proteins (DCLs. Gossypium arboreum L., also known as tree cotton or Asian cotton, is a diploid, possibly ancestral relative of tetraploid Gossypium hirsutum L., the predominant type of commercially grown cotton worldwide known as upland cotton. To understand the biological significance of these gene regulators in G. arboreum, a bioinformatics analysis was performed on G. arboreum small RNAs produced from G. arboreum leaf, flower, and boll tissues. Consequently, 263 miRNAs derived from 353 precursors, including 155 conserved miRNAs (cs-miRNAs and 108 novel lineage-specific miRNAs (ls-miRNAs. Along with miRNAs, 2,033 miRNA variants (isomiRNAs were identified as well. Those isomiRNAs with variation at the 3'-miRNA end were expressed at the highest levels, compared to other types of variants. In addition, 755 pha-siRNAs derived 319 pha-siRNA gene transcripts (PGTs were identified, and the potential pha-siRNA initiators were predicted. Also, 2,251 non-phased siRNAs were found as well, of which 1,088 appeared to be produced by so-called cis- or trans-cleavage of the PGTs observed at positions differing from pha-siRNAs. Of those sRNAs, 148 miRNAs/isomiRNAs and 274 phased/non-phased siRNAs were differentially expressed in one or more pairs of tissues examined. Target analysis revealed that target genes for both miRNAs and pha-siRNAs are involved a broad range of metabolic and enzymatic activities. We demonstrate that secondary siRNA production could result from initial cleavage of precursors by both miRNAs or isomiRNAs, and that subsequently produced phased and unphased siRNAs could result that also serve as triggers

  13. The Complexity of Posttranscriptional Small RNA Regulatory Networks Revealed by In Silico Analysis of Gossypium arboreum L. Leaf, Flower and Boll Small Regulatory RNAs.

    Science.gov (United States)

    Hu, Hongtao; Rashotte, Aaron M; Singh, Narendra K; Weaver, David B; Goertzen, Leslie R; Singh, Shree R; Locy, Robert D

    2015-01-01

    MicroRNAs (miRNAs) and secondary small interfering RNAs (principally phased siRNAs or trans-acting siRNAs) are two distinct subfamilies of small RNAs (sRNAs) that are emerging as key regulators of posttranscriptional gene expression in plants. Both miRNAs and secondary-siRNAs (sec-siRNAs) are processed from longer RNA precursors by DICER-LIKE proteins (DCLs). Gossypium arboreum L., also known as tree cotton or Asian cotton, is a diploid, possibly ancestral relative of tetraploid Gossypium hirsutum L., the predominant type of commercially grown cotton worldwide known as upland cotton. To understand the biological significance of these gene regulators in G. arboreum, a bioinformatics analysis was performed on G. arboreum small RNAs produced from G. arboreum leaf, flower, and boll tissues. Consequently, 263 miRNAs derived from 353 precursors, including 155 conserved miRNAs (cs-miRNAs) and 108 novel lineage-specific miRNAs (ls-miRNAs). Along with miRNAs, 2,033 miRNA variants (isomiRNAs) were identified as well. Those isomiRNAs with variation at the 3'-miRNA end were expressed at the highest levels, compared to other types of variants. In addition, 755 pha-siRNAs derived 319 pha-siRNA gene transcripts (PGTs) were identified, and the potential pha-siRNA initiators were predicted. Also, 2,251 non-phased siRNAs were found as well, of which 1,088 appeared to be produced by so-called cis- or trans-cleavage of the PGTs observed at positions differing from pha-siRNAs. Of those sRNAs, 148 miRNAs/isomiRNAs and 274 phased/non-phased siRNAs were differentially expressed in one or more pairs of tissues examined. Target analysis revealed that target genes for both miRNAs and pha-siRNAs are involved a broad range of metabolic and enzymatic activities. We demonstrate that secondary siRNA production could result from initial cleavage of precursors by both miRNAs or isomiRNAs, and that subsequently produced phased and unphased siRNAs could result that also serve as triggers of a second

  14. Knockdown of platinum-induced growth differentiation factor 15 abrogates p27-mediated tumor growth delay in the chemoresistant ovarian cancer model A2780cis

    International Nuclear Information System (INIS)

    Meier, Julia C; Haendler, Bernard; Seidel, Henrik; Groth, Philip; Adams, Robert; Ziegelbauer, Karl; Kreft, Bertolt; Beckmann, Georg; Sommer, Anette; Kopitz, Charlotte

    2015-01-01

    Molecular mechanisms underlying the development of resistance to platinum-based treatment in patients with ovarian cancer remain poorly understood. This is mainly due to the lack of appropriate in vivo models allowing the identification of resistance-related factors. In this study, we used human whole-genome microarrays and linear model analysis to identify potential resistance-related genes by comparing the expression profiles of the parental human ovarian cancer model A2780 and its platinum-resistant variant A2780cis before and after carboplatin treatment in vivo. Growth differentiation factor 15 (GDF15) was identified as one of five potential resistance-related genes in the A2780cis tumor model. Although A2780-bearing mice showed a strong carboplatin-induced increase of GDF15 plasma levels, the basal higher GDF15 plasma levels of A2780cis-bearing mice showed no further increase after short-term or long-term carboplatin treatment. This correlated with a decreased DNA damage response, enhanced AKT survival signaling and abrogated cell cycle arrest in the carboplatin-treated A2780cis tumors. Furthermore, knockdown of GDF15 in A2780cis cells did not alter cell proliferation but enhanced cell migration and colony size in vitro. Interestingly, in vivo knockdown of GDF15 in the A2780cis model led to a basal-enhanced tumor growth, but increased sensitivity to carboplatin treatment as compared to the control-transduced A2780cis tumors. This was associated with larger necrotic areas, a lobular tumor structure and increased p53 and p16 expression of the carboplatin-treated shGDF15-A2780cis tumors. Furthermore, shRNA-mediated GDF15 knockdown abrogated p27 expression as compared to control-transduced A2780cis tumors. In conclusion, these data show that GDF15 may contribute to carboplatin resistance by suppressing tumor growth through p27. These data show that GDF15 might serve as a novel treatment target in women with platinum-resistant ovarian cancer

  15. Differential transcriptome profiling of chilling stress response between shoots and rhizomes of Oryza longistaminata using RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Ting Zhang

    Full Text Available Rice (Oryza sativa is very sensitive to chilling stress at seedling and reproductive stages, whereas wild rice, O. longistaminata, tolerates non-freezing cold temperatures and has overwintering ability. Elucidating the molecular mechanisms of chilling tolerance (CT in O. longistaminata should thus provide a basis for rice CT improvement through molecular breeding. In this study, high-throughput RNA sequencing was performed to profile global transcriptome alterations and crucial genes involved in response to long-term low temperature in O. longistaminata shoots and rhizomes subjected to 7 days of chilling stress. A total of 605 and 403 genes were respectively identified as up- and down-regulated in O. longistaminata under 7 days of chilling stress, with 354 and 371 differentially expressed genes (DEGs found exclusively in shoots and rhizomes, respectively. GO enrichment and KEGG pathway analyses revealed that multiple transcriptional regulatory pathways were enriched in commonly induced genes in both tissues; in contrast, only the photosynthesis pathway was prevalent in genes uniquely induced in shoots, whereas several key metabolic pathways and the programmed cell death process were enriched in genes induced only in rhizomes. Further analysis of these tissue-specific DEGs showed that the CBF/DREB1 regulon and other transcription factors (TFs, including AP2/EREBPs, MYBs, and WRKYs, were synergistically involved in transcriptional regulation of chilling stress response in shoots. Different sets of TFs, such as OsERF922, OsNAC9, OsWRKY25, and WRKY74, and eight genes encoding antioxidant enzymes were exclusively activated in rhizomes under long-term low-temperature treatment. Furthermore, several cis-regulatory elements, including the ICE1-binding site, the GATA element for phytochrome regulation, and the W-box for WRKY binding, were highly abundant in both tissues, confirming the involvement of multiple regulatory genes and complex networks in the

  16. Characterization of the bovine pregnancy-associated glycoprotein gene family – analysis of gene sequences, regulatory regions within the promoter and expression of selected genes

    Directory of Open Access Journals (Sweden)

    Walker Angela M

    2009-04-01

    Full Text Available Abstract Background The Pregnancy-associated glycoproteins (PAGs belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown that the PAG family segregates into 'ancient' and 'modern' groupings. Along with sequence differences between family members, there are clear distinctions in their spatio-temporal distribution and in their relative level of expression. In this report, 1 we performed an in silico analysis of the bovine genome to further characterize the PAG gene family, 2 we scrutinized proximal promoter sequences of the PAG genes to evaluate the evolution pressures operating on them and to identify putative regulatory regions, 3 we determined relative transcript abundance of selected PAGs during pregnancy and, 4 we performed preliminary characterization of the putative regulatory elements for one of the candidate PAGs, bovine (bo PAG-2. Results From our analysis of the bovine genome, we identified 18 distinct PAG genes and 14 pseudogenes. We observed that the first 500 base pairs upstream of the translational start site contained multiple regions that are conserved among all boPAGs. However, a preponderance of conserved regions, that harbor recognition sites for putative transcriptional factors (TFs, were found to be unique to the modern boPAG grouping, but not the ancient boPAGs. We gathered evidence by means of Q-PCR and screening of EST databases to show that boPAG-2 is the most abundant of all boPAG transcripts. Finally, we provided preliminary evidence for the role of ETS- and DDVL-related TFs in the regulation of the boPAG-2 gene. Conclusion PAGs represent a relatively large gene family in the bovine genome. The proximal promoter regions of these genes display differences in putative TF binding sites, likely contributing to observed

  17. Cis-Lunar Reusable In-Space Transportation Architecture for the Evolvable Mars Campaign

    Science.gov (United States)

    McVay, Eric S.; Jones, Christopher A.; Merrill, Raymond G.

    2016-01-01

    Human exploration missions to Mars or other destinations in the solar system require large quantities of propellant to enable the transportation of required elements from Earth's sphere of influence to Mars. Current and proposed launch vehicles are incapable of launching all of the requisite mass on a single vehicle; hence, multiple launches and in-space aggregation are required to perform a Mars mission. This study examines the potential of reusable chemical propulsion stages based in cis-lunar space to meet the transportation objectives of the Evolvable Mars Campaign and identifies cis-lunar propellant supply requirements. These stages could be supplied with fuel and oxidizer delivered to cis-lunar space, either launched from Earth or other inner solar system sources such as the Moon or near Earth asteroids. The effects of uncertainty in the model parameters are evaluated through sensitivity analysis of key parameters including the liquid propellant combination, inert mass fraction of the vehicle, change in velocity margin, and change in payload masses. The outcomes of this research include a description of the transportation elements, the architecture that they enable, and an option for a campaign that meets the objectives of the Evolvable Mars Campaign. This provides a more complete understanding of the propellant requirements, as a function of time, that must be delivered to cis-lunar space. Over the selected sensitivity ranges for the current payload and schedule requirements of the 2016 point of departure of the Evolvable Mars Campaign destination systems, the resulting propellant delivery quantities are between 34 and 61 tonnes per year of hydrogen and oxygen propellant, or between 53 and 76 tonnes per year of methane and oxygen propellant, or between 74 and 92 tonnes per year of hypergolic propellant. These estimates can guide future propellant manufacture and/or delivery architectural analysis.

  18. Natural selection in a population of Drosophila melanogaster explained by changes in gene expression caused by sequence variation in core promoter regions.

    Science.gov (United States)

    Sato, Mitsuhiko P; Makino, Takashi; Kawata, Masakado

    2016-02-09

    Understanding the evolutionary forces that influence variation in gene regulatory regions in natural populations is an important challenge for evolutionary biology because natural selection for such variations could promote adaptive phenotypic evolution. Recently, whole-genome sequence analyses have identified regulatory regions subject to natural selection. However, these studies could not identify the relationship between sequence variation in the detected regions and change in gene expression levels. We analyzed sequence variations in core promoter regions, which are critical regions for gene regulation in higher eukaryotes, in a natural population of Drosophila melanogaster, and identified core promoter sequence variations associated with differences in gene expression levels subjected to natural selection. Among the core promoter regions whose sequence variation could change transcription factor binding sites and explain differences in expression levels, three core promoter regions were detected as candidates associated with purifying selection or selective sweep and seven as candidates associated with balancing selection, excluding the possibility of linkage between these regions and core promoter regions. CHKov1, which confers resistance to the sigma virus and related insecticides, was identified as core promoter regions that has been subject to selective sweep, although it could not be denied that selection for variation in core promoter regions was due to linked single nucleotide polymorphisms in the regulatory region outside core promoter regions. Nucleotide changes in core promoter regions of CHKov1 caused the loss of two basal transcription factor binding sites and acquisition of one transcription factor binding site, resulting in decreased gene expression levels. Of nine core promoter regions regions associated with balancing selection, brat, and CG9044 are associated with neuromuscular junction development, and Nmda1 are associated with learning

  19. Regulatory sequence of cupin family gene

    Science.gov (United States)

    Hood, Elizabeth; Teoh, Thomas

    2017-07-25

    This invention is in the field of plant biology and agriculture and relates to novel seed specific promoter regions. The present invention further provide methods of producing proteins and other products of interest and methods of controlling expression of nucleic acid sequences of interest using the seed specific promoter regions.

  20. Defining adolescent common mental disorders using electronic primary care data: a comparison with outcomes measured using the CIS-R.

    Science.gov (United States)

    Cornish, Rosie P; John, Ann; Boyd, Andy; Tilling, Kate; Macleod, John

    2016-12-01

    To compare the prevalence of common mental disorders (CMDs) derived from data held in primary care records with that measured using the revised Clinical Interview Schedule (CIS-R) in order to assess the potential robustness of findings based only on routinely collected data. Comparison study using linkage between the Avon Longitudinal Study of Parents and Children (ALSPAC) and electronic primary care records. We studied 1562 adolescents who had completed the CIS-R in ALSPAC at age 17-18 years and had linkage established to their primary care records. Outcome measures from ALSPAC were whether or not an individual met International Classification of Diseases-10 criteria for a diagnosis of (1) a CMD or, specifically, (2) depression. Lists of Read codes corresponding to diagnoses, symptoms and treatments were used to create 12 definitions of CMD and depression alone using the primary care data. We calculated sensitivities and specificities of these, using CIS-R definitions as the reference standard. Sensitivities ranged from 5.2% to 24.3% for depression and from 3.8% to 19.2% for CMD. The specificities of all definitions were above 98% for depression and above 96% for CMD.For both outcomes, the definition that included current diagnosis, treatment or symptoms identified the highest proportion of CIS-R cases. Most individuals meeting case definitions for CMD based on primary care data also met CIS-R case definitions. Conversely many individuals identified as cases using the CIS-R had no evidence of CMD in their clinical records. This suggests that clinical databases are likely to yield underestimates of the burden of CMD in the population. However, clinical records appear to yield valid diagnoses which may be useful for studying risk factors and consequences of CMD. The greatest epidemiological value may be obtained when information is available from survey and clinical records. Published by the BMJ Publishing Group Limited. For permission to use (where not already