WorldWideScience

Sample records for genome-wide transcriptional modelling

  1. Computational modelling of genome-wide [corrected] transcription assembly networks using a fluidics analogy.

    Directory of Open Access Journals (Sweden)

    Yousry Y Azmy

    Full Text Available Understanding how a myriad of transcription regulators work to modulate mRNA output at thousands of genes remains a fundamental challenge in molecular biology. Here we develop a computational tool to aid in assessing the plausibility of gene regulatory models derived from genome-wide expression profiling of cells mutant for transcription regulators. mRNA output is modelled as fluid flow in a pipe lattice, with assembly of the transcription machinery represented by the effect of valves. Transcriptional regulators are represented as external pressure heads that determine flow rate. Modelling mutations in regulatory proteins is achieved by adjusting valves' on/off settings. The topology of the lattice is designed by the experimentalist to resemble the expected interconnection between the modelled agents and their influence on mRNA expression. Users can compare multiple lattice configurations so as to find the one that minimizes the error with experimental data. This computational model provides a means to test the plausibility of transcription regulation models derived from large genomic data sets.

  2. Quantitative models of the mechanisms that control genome-wide patterns of transcription factor binding during early Drosophila development.

    Directory of Open Access Journals (Sweden)

    Tommy Kaplan

    2011-02-01

    Full Text Available Transcription factors that drive complex patterns of gene expression during animal development bind to thousands of genomic regions, with quantitative differences in binding across bound regions mediating their activity. While we now have tools to characterize the DNA affinities of these proteins and to precisely measure their genome-wide distribution in vivo, our understanding of the forces that determine where, when, and to what extent they bind remains primitive. Here we use a thermodynamic model of transcription factor binding to evaluate the contribution of different biophysical forces to the binding of five regulators of early embryonic anterior-posterior patterning in Drosophila melanogaster. Predictions based on DNA sequence and in vitro protein-DNA affinities alone achieve a correlation of ∼0.4 with experimental measurements of in vivo binding. Incorporating cooperativity and competition among the five factors, and accounting for spatial patterning by modeling binding in every nucleus independently, had little effect on prediction accuracy. A major source of error was the prediction of binding events that do not occur in vivo, which we hypothesized reflected reduced accessibility of chromatin. To test this, we incorporated experimental measurements of genome-wide DNA accessibility into our model, effectively restricting predicted binding to regions of open chromatin. This dramatically improved our predictions to a correlation of 0.6-0.9 for various factors across known target genes. Finally, we used our model to quantify the roles of DNA sequence, accessibility, and binding competition and cooperativity. Our results show that, in regions of open chromatin, binding can be predicted almost exclusively by the sequence specificity of individual factors, with a minimal role for protein interactions. We suggest that a combination of experimentally determined chromatin accessibility data and simple computational models of transcription

  3. Genome-wide transcriptional comparison of MPP+ treated human neuroblastoma cells with the state space model

    Directory of Open Access Journals (Sweden)

    Jin Hwan Do

    2015-10-01

    Full Text Available This study compared a parkinsonian neurotoxin 1-methyl-4-phenylpyridinium (MPP+ response in two distinct phenotypes of human neuroblastoma cell lines: neuronal N-type SH-SY5Y cells and flat substrate-adherent S-type SH-EP cells. SH-SY5Y and SH-EP cells shared only 14% of their own MPP+ response genes, and their gene ontology (GO analysis revealed significant endoplasmic reticulum (ER stress by misfolded proteins. Gene modules, which are groups of transcriptionally co-expressed genes with similar biological functions, were identified for SH-SY5Y and SH-EP cells by using time-series microarray data with the state space model (SSM. All modules of SH-SY5Y and SH-EP cells showed strong positive auto-regulation that was often mediated via signal molecules and may cause bi-stability. Interactions in gene levels were calculated by using SSM parameters obtained in the process of module identification. Gene networks that were constructed from the gene interaction matrix showed different hub genes with high node degrees between SH-SY5Y and SH-EP cells. That is, key hub genes of SH-SY5Y cells were DCN, HIST1H2BK, and C5orf40, whereas those of SH-EP cells were MSH6, RBCK1, MTHFD2, ZNF26, CTH, and CARS. These results suggest that inhibition of the mitochondrial complex I by MPP+ might induce different downstream processes that are cell type dependent.

  4. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    Science.gov (United States)

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  5. Genome-wide transcriptional reprogramming under drought stress

    KAUST Repository

    Chen, Hao

    2012-01-01

    Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.

  6. Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.

    Directory of Open Access Journals (Sweden)

    Jing Chen

    Full Text Available Identifying transcription factors (TF involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding "peaks" within 2 Mb window around a gene's transcription start site (TSS, and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1. We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis

  7. Genome-wide transcription analyses in rice using tiling microarrays

    DEFF Research Database (Denmark)

    Li, Lei; Wang, Xiangfeng; Stolc, Viktor;

    2006-01-01

    Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species....... We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions...... activity between duplicated segments of the genome. Collectively, our results provide the first whole-genome transcription map useful for further understanding the rice genome. Udgivelsesdato: 2006-Jan...

  8. Genome-wide transcription responses to synchrotron microbeam radiotherapy.

    Science.gov (United States)

    Sprung, Carl N; Yang, Yuqing; Forrester, Helen B; Li, Jason; Zaitseva, Marina; Cann, Leonie; Restall, Tina; Anderson, Robin L; Crosbie, Jeffrey C; Rogers, Peter A W

    2012-10-01

    The majority of cancer patients achieve benefit from radiotherapy. A significant limitation of radiotherapy is its relatively low therapeutic index, defined as the maximum radiation dose that causes acceptable normal tissue damage to the minimum dose required to achieve tumor control. Recently, a new radiotherapy modality using synchrotron-generated X-ray microbeam radiotherapy has been demonstrated in animal models to ablate tumors with concurrent sparing of normal tissue. Very little work has been undertaken into the cellular and molecular mechanisms that differentiate microbeam radiotherapy from broad beam. The purpose of this study was to investigate and compare the whole genome transcriptional response of in vivo microbeam radiotherapy versus broad beam irradiated tumors. We hypothesized that gene expression changes after microbeam radiotherapy are different from those seen after broad beam. We found that in EMT6.5 tumors at 4-48 h postirradiation, microbeam radiotherapy differentially regulates a number of genes, including major histocompatibility complex (MHC) class II antigen gene family members, and other immunity-related genes including Ciita, Ifng, Cxcl1, Cxcl9, Indo and Ubd when compared to broad beam. Our findings demonstrate molecular differences in the tumor response to microbeam versus broad beam irradiation and these differences provide insight into the underlying mechanisms of microbeam radiotherapy and broad beam.

  9. Genome-Wide Chromosomal Targets of Oncogenic Transcription Factors

    Science.gov (United States)

    2008-04-01

    Wang JP, Widom J (2005) Improved alignment of nucleosome DNA sequences using a mixture model. Nucleic Acids Res 33: 6743–6755. 6. Ioshikhes IP, Albert I...EMBO J 24: 533–542. 26. Anderson JD, Widom J (2000) Sequence and position-dependence of the equilibrium accessibility of nucleosomal DNA target sites

  10. Genome-wide location analysis reveals a role for Sub1 in RNA polymerase III transcription

    Science.gov (United States)

    Tavenet, Arounie; Suleau, Audrey; Dubreuil, Géraldine; Ferrari, Roberto; Ducrot, Cécile; Michaut, Magali; Aude, Jean-Christophe; Dieci, Giorgio; Lefebvre, Olivier; Conesa, Christine; Acker, Joël

    2009-01-01

    Human PC4 and the yeast ortholog Sub1 have multiple functions in RNA polymerase II transcription. Genome-wide mapping revealed that Sub1 is present on Pol III-transcribed genes. Sub1 was found to interact with components of the Pol III transcription system and to stimulate the initiation and reinitiation steps in a system reconstituted with all recombinant factors. Sub1 was required for optimal Pol III gene transcription in exponentially growing cells. PMID:19706510

  11. Genome-wide transcriptional response of a Saccharomyces cerevisiae strain with an altered redox metabolism

    DEFF Research Database (Denmark)

    Bro, Christoffer; Regenberg, Birgitte; Nielsen, Jens

    2004-01-01

    The genome-wide transcriptional response of a Saccharomyces cerevisiae strain deleted in GDH1 that encodes a NADP(+)-dependent glutamate dehydrogenase was compared to a wild-type strain under anaerobic steady-state conditions. The GDH1-deleted strain has a significantly reduced NADPH requirement...

  12. Genome-wide analysis of light- and temperature-entrained circadian transcripts in Caenorhabditis elegans.

    Directory of Open Access Journals (Sweden)

    Alexander M van der Linden

    Full Text Available Most organisms have an endogenous circadian clock that is synchronized to environmental signals such as light and temperature. Although circadian rhythms have been described in the nematode Caenorhabditis elegans at the behavioral level, these rhythms appear to be relatively non-robust. Moreover, in contrast to other animal models, no circadian transcriptional rhythms have been identified. Thus, whether this organism contains a bona fide circadian clock remains an open question. Here we use genome-wide expression profiling experiments to identify light- and temperature-entrained oscillating transcripts in C. elegans. These transcripts exhibit rhythmic expression with temperature-compensated 24-h periods. In addition, their expression is sustained under constant conditions, suggesting that they are under circadian regulation. Light and temperature cycles strongly drive gene expression and appear to entrain largely nonoverlapping gene sets. We show that mutations in a cyclic nucleotide-gated channel required for sensory transduction abolish both light- and temperature-entrained gene expression, implying that environmental cues act cell nonautonomously to entrain circadian rhythms. Together, these findings demonstrate circadian-regulated transcriptional rhythms in C. elegans and suggest that further analyses in this organism will provide new information about the evolution and function of this biological clock.

  13. Acute Genome-wide effects of Rosiglitazone on PPARγ transcriptional networks in Adipocytes

    DEFF Research Database (Denmark)

    Haakonsson, Anders Kristian; Madsen, Maria Stahl; Nielsen, Ronni

    2013-01-01

    Peroxisome proliferator-activated receptor γ (PPARγ) is a master regulator of adipocyte differentiation, and genome-wide studies indicate that it is involved in the induction of most adipocyte genes. Here we report, for the first time, the acute effects of the synthetic PPARγ agonist rosiglitazone...... on the transcriptional network of PPARγ in adipocytes. Treatment with rosiglitazone for 1 hour leads to acute transcriptional activation as well as repression of a number of genes as determined by genome-wide RNA polymerase II occupancy. Unlike what has been shown for many other nuclear receptors, agonist treatment does...... not lead to major changes in the occurrence of PPARγ binding sites. However, rosiglitazone promotes PPARγ occupancy at many preexisting sites, and this is paralleled by increased occupancy of the mediator subunit MED1. The increase in PPARγ and MED1 binding is correlated with an increase in transcription...

  14. High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints.

    Directory of Open Access Journals (Sweden)

    Yuchun Guo

    Full Text Available An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal key aspects of this genomic regulatory language. To discover novel transcription factor spatial binding constraints in vivo, we developed a new integrative computational method, genome wide event finding and motif discovery (GEM. GEM resolves ChIP data into explanatory motifs and binding events at high spatial resolution by linking binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. GEM analysis of 63 transcription factors in 214 ENCODE human ChIP-Seq experiments recovers more known factor motifs than other contemporary methods, and discovers six new motifs for factors with unknown binding specificity. GEM's adaptive learning of binding-event read distributions allows it to further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of closely spaced binding events of the same factor. In a systematic analysis of in vivo sequence-specific transcription factor binding using GEM, we have found hundreds of spatial binding constraints between factors. GEM found 37 examples of factor binding constraints in mouse ES cells, including strong distance-specific constraints between Klf4 and other key regulatory factors. In human ENCODE data, GEM found 390 examples of spatially constrained pair-wise binding, including such novel pairs as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4A/FOXA1. The discovery of new factor-factor spatial constraints in ChIP data is significant because it proposes testable models for regulatory factor interactions that will help elucidate genome function and the

  15. High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints.

    Science.gov (United States)

    Guo, Yuchun; Mahony, Shaun; Gifford, David K

    2012-01-01

    An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal key aspects of this genomic regulatory language. To discover novel transcription factor spatial binding constraints in vivo, we developed a new integrative computational method, genome wide event finding and motif discovery (GEM). GEM resolves ChIP data into explanatory motifs and binding events at high spatial resolution by linking binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. GEM analysis of 63 transcription factors in 214 ENCODE human ChIP-Seq experiments recovers more known factor motifs than other contemporary methods, and discovers six new motifs for factors with unknown binding specificity. GEM's adaptive learning of binding-event read distributions allows it to further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of closely spaced binding events of the same factor. In a systematic analysis of in vivo sequence-specific transcription factor binding using GEM, we have found hundreds of spatial binding constraints between factors. GEM found 37 examples of factor binding constraints in mouse ES cells, including strong distance-specific constraints between Klf4 and other key regulatory factors. In human ENCODE data, GEM found 390 examples of spatially constrained pair-wise binding, including such novel pairs as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4A/FOXA1. The discovery of new factor-factor spatial constraints in ChIP data is significant because it proposes testable models for regulatory factor interactions that will help elucidate genome function and the implementation of combinatorial

  16. Genome-wide identification, classification and analysis of heat shock transcription factor family in maize

    Directory of Open Access Journals (Sweden)

    Zhu Su-Wen

    2011-01-01

    Full Text Available Abstract Background Heat shock response in eukaryotes is transcriptionally regulated by conserved heat shock transcription factors (Hsfs. Hsf genes are represented by a large multigene family in plants and investigation of the Hsf gene family will serve to elucidate the mechanisms by which plants respond to stress. In recent years, reports of genome-wide structural and evolutionary analysis of the entire Hsf gene family have been generated in two model plant systems, Arabidopsis and rice. Maize, an important cereal crop, has represented a model plant for genetics and evolutionary research. Although some Hsf genes have been characterized in maize, analysis of the entire Hsf gene family were not completed following Maize (B73 Genome Sequencing Project. Results A genome-wide analysis was carried out in the present study to identify all Hsfs maize genes. Due to the availability of complete maize genome sequences, 25 nonredundant Hsf genes, named ZmHsfs were identified. Chromosomal location, protein domain and motif organization of ZmHsfs were analyzed in maize genome. The phylogenetic relationships, gene duplications and expression profiles of ZmHsf genes were also presented in this study. Twenty-five ZmHsfs were classified into three major classes (class A, B, and C according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 10 subclasses. Moreover, phylogenetic analysis indicated that the orthologs from the three species (maize, Arabidopsis and rice were distributed in all three classes, it also revealed diverse Hsf gene family expression patterns in classes and subclasses. Chromosomal/segmental duplications played a key role in Hsf gene family expansion in maize by investigation of gene duplication events. Furthermore, the transcripts of 25 ZmHsf genes were detected in the leaves by heat shock using quantitative real-time PCR. The result demonstrated that ZmHsf genes exhibit different

  17. Genome wide transcriptional response of Saccharomyces cerevisiae to stress-induced perturbations

    Directory of Open Access Journals (Sweden)

    Hilal eTaymaz-Nikerel

    2016-02-01

    Full Text Available Cells respond to environmental and/or genetic perturbations in order to survive and proliferate. Characterization of the changes after various stimuli at different -omics levels is crucial to comprehend the adaptation of cells to changing conditions. Genome wide quantification and analysis of transcript levels, the genes affected by perturbations, extends our understanding of cellular metabolism by pointing out the mechanisms that play role in sensing the stress caused by those perturbations and related signaling pathways, and in this way guides us to achieve endeavors such as rational engineering of cells or interpretation of disease mechanisms. Saccharomyces cerevisiae as a model system has been studied in response to different perturbations and corresponding transcriptional profiles were followed either statically or/and dynamically, short- and long- term. This review focuses on response of yeast cells to diverse stress inducing perturbations including nutritional changes, ionic stress, salt stress, oxidative stress, osmotic shock, as well as to genetic interventions such as deletion and over-expression of genes. It is aimed to conclude on common regulatory phenomena that allow yeast to organize its transcriptomic response after any perturbation under different external conditions.

  18. Genome-wide transcriptional profiling reveals molecular signatures of secondary xylem differentiation in Populus tomentosa.

    Science.gov (United States)

    Yang, X H; Li, X G; Li, B L; Zhang, D Q

    2014-11-11

    Wood formation occurs via cell division, primary cell wall and secondary wall formation, and programmed cell death in the vascular cambium. Transcriptional profiling of secondary xylem differentiation is essential for understanding the molecular mechanisms underlying wood formation. Differential gene expression in secondary xylem differentiation of Populus has been previously investigated using cDNA microarray analysis. However, little is known about the molecular mechanisms from a genome-wide perspective. In this study, the Affymetrix poplar genome chips containing 61,413 probes were used to investigate the changes in the transcriptome during secondary xylem differentiation in Chinese white poplar (Populus tomentosa). Two xylem tissues (newly formed and lignified) were sampled for genome-wide transcriptional profiling. In total, 6843 genes (~11%) were identified with differential expression in the two xylem tissues. Many genes involved in cell division, primary wall modification, and cellulose synthesis were preferentially expressed in the newly formed xylem. In contrast, many genes, including 4-coumarate:cinnamate-4-hydroxylase (C4H), 4-coumarate:CoA ligase (4CL), cinnamyl alcohol dehydrogenase (CAD), and caffeoyl CoA 3-O-methyltransferase (CCoAOMT), associated with lignin biosynthesis were more transcribed in the lignified xylem. The two xylem tissues also showed differential expression of genes related to various hormones; thus, the secondary xylem differentiation could be regulated by hormone signaling. Furthermore, many transcription factor genes were preferentially expressed in the lignified xylem, suggesting that wood lignification involves extensive transcription regulation. The genome-wide transcriptional profiling of secondary xylem differentiation could provide additional insights into the molecular basis of wood formation in poplar species.

  19. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli

    OpenAIRE

    Thomas Esquerré; Marie Bouvier; Catherine Turlan; Carpousis, Agamemnon J.; Laurence Girbal; Muriel Cocaign-Bousquet

    2016-01-01

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype...

  20. Genome-Wide Identification and Evolutionary Analysis of the Animal Specific ETS Transcription Factor Family

    OpenAIRE

    Wang, Zhipeng; Zhang, Qin

    2009-01-01

    The ETS proteins are a family of transcription factors (TFs) that regulate a variety of biological processes. We made genome-wide analyses to explore the classification of the ETS gene family. We identified 207 ETS genes which encode 321 ETS TFs from ten animal species. Of the 321 ETS TFs, 155 contain only an ETS domain, about 50% contain a ETS_PEA3_N or a SAM_PNT domain in addition to an ETS domain, the rest (only four) contain a second ETS domain or a second ETS_PEA3_N domain or an another ...

  1. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  2. Genome-wide transcription profile of field- and laboratory-selected dichlorodiphenyltrichloroethane (DDT)-resistant Drosophila

    OpenAIRE

    2004-01-01

    Genome-wide microarray analysis (Affymetrix array) was used (i) to determine whether only one gene, the cytochrome P450 enzyme Cyp6g1, is differentially transcribed in dichlorodiphenyltrichloroethane (DDT)-resistant vs. -susceptible Drosophila; and (ii) to profile common genes differentially transcribed across a DDT-resistant field isolate [Rst(2)DDTWisconsin] and a laboratory DDT-selected population [Rst(2)DDT91-R]. Statistical analysis (ANOVA model) identified 158 probe sets that were diffe...

  3. Genome-wide transcriptional effects of the anti-cancer agent camptothecin.

    Directory of Open Access Journals (Sweden)

    Artur Veloso

    Full Text Available The anti-cancer drug camptothecin inhibits replication and transcription by trapping DNA topoisomerase I (Top1 covalently to DNA in a "cleavable complex". To examine the effects of camptothecin on RNA synthesis genome-wide we used Bru-Seq and show that camptothecin treatment primarily affected transcription elongation. We also observed that camptothecin increased RNA reads past transcription termination sites as well as at enhancer elements. Following removal of camptothecin, transcription spread as a wave from the 5'-end of genes with no recovery of transcription apparent from RNA polymerases stalled in the body of genes. As a result, camptothecin preferentially inhibited the expression of large genes such as proto-oncogenes, and anti-apoptotic genes while smaller ribosomal protein genes, pro-apoptotic genes and p53 target genes showed relative higher expression. Cockayne syndrome group B fibroblasts (CS-B, which are defective in transcription-coupled repair (TCR, showed an RNA synthesis recovery profile similar to normal fibroblasts suggesting that TCR is not involved in the repair of or RNA synthesis recovery from transcription-blocking Top1 lesions. These findings of the effects of camptothecin on transcription have important implications for its anti-cancer activities and may aid in the design of improved combinatorial treatments involving Top1 poisons.

  4. Genome-wide analysis of a Wnt1-regulated transcriptional network implicates neurodegenerative pathways.

    Science.gov (United States)

    Wexler, Eric M; Rosen, Ezra; Lu, Daning; Osborn, Gregory E; Martin, Elizabeth; Raybould, Helen; Geschwind, Daniel H

    2011-10-04

    Wnt proteins are critical to mammalian brain development and function. The canonical Wnt signaling pathway involves the stabilization and nuclear translocation of β-catenin; however, Wnt also signals through alternative, noncanonical pathways. To gain a systems-level, genome-wide view of Wnt signaling, we analyzed Wnt1-stimulated changes in gene expression by transcriptional microarray analysis in cultured human neural progenitor (hNP) cells at multiple time points over a 72-hour time course. We observed a widespread oscillatory-like pattern of changes in gene expression, involving components of both the canonical and the noncanonical Wnt signaling pathways. A higher-order, systems-level analysis that combined independent component analysis, waveform analysis, and mutual information-based network construction revealed effects on pathways related to cell death and neurodegenerative disease. Wnt effectors were tightly clustered with presenilin1 (PSEN1) and granulin (GRN), which cause dominantly inherited forms of Alzheimer's disease and frontotemporal dementia (FTD), respectively. We further explored a potential link between Wnt1 and GRN and found that Wnt1 decreased GRN expression by hNPs. Conversely, GRN knockdown increased WNT1 expression, demonstrating that Wnt and GRN reciprocally regulate each other. Finally, we provided in vivo validation of the in vitro findings by analyzing gene expression data from individuals with FTD. These unbiased and genome-wide analyses provide evidence for a connection between Wnt signaling and the transcriptional regulation of neurodegenerative disease genes.

  5. Dry and wet approaches for genome-wide functional annotation of conventional and unconventional transcriptional activators

    Directory of Open Access Journals (Sweden)

    Elisabetta Levati

    2016-01-01

    Full Text Available Transcription factors (TFs are master gene products that regulate gene expression in response to a variety of stimuli. They interact with DNA in a sequence-specific manner using a variety of DNA-binding domain (DBD modules. This allows to properly position their second domain, called “effector domain”, to directly or indirectly recruit positively or negatively acting co-regulators including chromatin modifiers, thus modulating preinitiation complex formation as well as transcription elongation. At variance with the DBDs, which are comprised of well-defined and easily recognizable DNA binding motifs, effector domains are usually much less conserved and thus considerably more difficult to predict. Also not so easy to identify are the DNA-binding sites of TFs, especially on a genome-wide basis and in the case of overlapping binding regions. Another emerging issue, with many potential regulatory implications, is that of so-called “moonlighting” transcription factors, i.e., proteins with an annotated function unrelated to transcription and lacking any recognizable DBD or effector domain, that play a role in gene regulation as their second job. Starting from bioinformatic and experimental high-throughput tools for an unbiased, genome-wide identification and functional characterization of TFs (especially transcriptional activators, we describe both established (and usually well affordable as well as newly developed platforms for DNA-binding site identification. Selected combinations of these search tools, some of which rely on next-generation sequencing approaches, allow delineating the entire repertoire of TFs and unconventional regulators encoded by the any sequenced genome.

  6. Genome-wide transcriptional profiling of human glioblastoma cells in response to ITE treatment.

    Science.gov (United States)

    Kang, Bo; Zhou, Yanwen; Zheng, Min; Wang, Ying-Jie

    2015-09-01

    A ligand-activated transcription factor aryl hydrocarbon receptor (AhR) is recently revealed to play a key role in embryogenesis and tumorigenesis (Feng et al. [1], Safe et al. [2]) and 2-(1'H-indole-3'-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE) (Song et al. [3]) is an endogenous AhR ligand that possesses anti-tumor activity. In order to gain insights into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of the following three groups of cells: the human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO) and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data deposited into the Gene Expression Omnibus (GEO) under the accession code of GSE67986.

  7. Genome-Wide Analysis and Molecular Characterization of Heat Shock Transcription Factor Family in Glycine max

    Institute of Scientific and Technical Information of China (English)

    Eunsook Chung; Kyoung-Mi Kim; Jai-Heon Lee

    2013-01-01

    Heat shock transcription factors (Hsfs) play an essential role on the increased tolerance against heat stress by regulating the expression of heat-responsive genes.In this study,a genome-wide analysis was performed to identify all of the soybean (Glycine max) GmHsfgenes based on the latest soybean genome sequence.Chromosomal location,protein domain,motif organization,and phylogenetic relationships of 26 non-redundant GmHsf genes were analyzed compared with AtHsfs (Arabidopsis thaliana Hsfs).According to their structural features,the predicted members were divided into the previously defined classes A-C,as described for AtHsfs.Transcript levels and subcellular localization of five GmHsfs responsive to abiotic stresses were analyzed by real-time RT-PCR.These results provide a fundamental clue for understanding the complexity of the soybean GmHsfgene family and cloning the functional genes in future studies.

  8. Genome-wide transcriptional reorganization associated with senescence-to-immortality switch during human hepatocellular carcinogenesis.

    Science.gov (United States)

    Yildiz, Gokhan; Arslan-Ergul, Ayca; Bagislar, Sevgi; Konu, Ozlen; Yuzugullu, Haluk; Gursoy-Yuzugullu, Ozge; Ozturk, Nuri; Ozen, Cigdem; Ozdag, Hilal; Erdal, Esra; Karademir, Sedat; Sagol, Ozgul; Mizrak, Dilsa; Bozkaya, Hakan; Ilk, Hakki Gokhan; Ilk, Ozlem; Bilen, Biter; Cetin-Atalay, Rengul; Akar, Nejat; Ozturk, Mehmet

    2013-01-01

    Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become "immortal") by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC) development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15-gene

  9. Acute genome-wide effects of rosiglitazone on PPARγ transcriptional networks in adipocytes.

    Science.gov (United States)

    Haakonsson, Anders Kristian; Stahl Madsen, Maria; Nielsen, Ronni; Sandelin, Albin; Mandrup, Susanne

    2013-09-01

    Peroxisome proliferator-activated receptor γ (PPARγ) is a master regulator of adipocyte differentiation, and genome-wide studies indicate that it is involved in the induction of most adipocyte genes. Here we report, for the first time, the acute effects of the synthetic PPARγ agonist rosiglitazone on the transcriptional network of PPARγ in adipocytes. Treatment with rosiglitazone for 1 hour leads to acute transcriptional activation as well as repression of a number of genes as determined by genome-wide RNA polymerase II occupancy. Unlike what has been shown for many other nuclear receptors, agonist treatment does not lead to major changes in the occurrence of PPARγ binding sites. However, rosiglitazone promotes PPARγ occupancy at many preexisting sites, and this is paralleled by increased occupancy of the mediator subunit MED1. The increase in PPARγ and MED1 binding is correlated with an increase in transcription of nearby genes, indicating that rosiglitazone, in addition to activating the receptor, also promotes its association with DNA, and that this is causally linked to recruitment of mediator and activation of genes. Notably, both rosiglitazone-activated and -repressed genes are induced during adipogenesis. However, rosiglitazone-activated genes are markedly more associated with PPARγ than repressed genes and are highly dependent on PPARγ for expression in adipocytes. By contrast, repressed genes are associated with the other key adipocyte transcription factor CCAAT-enhancer binding proteinα (C/EBPα), and their expression is more dependent on C/EBPα. This suggests that the relative occupancies of PPARγ and C/EBPα are critical for whether genes will be induced or repressed by PPARγ agonist.

  10. Genome-wide analysis of alternative transcripts in human breast cancer

    Science.gov (United States)

    Wen, Ji; Toomer, Kevin H.

    2016-01-01

    Transcript variants play a critical role in diversifying gene expression. Alternative splicing is a major mechanism for generating transcript variants. A number of genes have been implicated in breast cancer pathogenesis with their aberrant expression of alternative transcripts. In this study, we performed genome-wide analyses of transcript variant expression in breast cancer. With RNA-Seq data from 105 patients, we characterized the transcriptome of breast tumors, by pairwise comparison of gene expression in the breast tumor versus matched healthy tissue from each patient. We identified 2839 genes, ~10 % of protein-coding genes in the human genome, that had differential expression of transcript variants between tumors and healthy tissues. The validity of the computational analysis was confirmed by quantitative RT-PCR assessment of transcript variant expression from four top candidate genes. The alternative transcript profiling led to classification of breast cancer into two subgroups and yielded a novel molecular signature that could be prognostic of patients’ tumor burden and survival. We uncovered nine splicing factors (FOX2, MBNL1, QKI, PTBP1, ELAVL1, HNRNPC, KHDRBS1, SFRS2, and TIAR) that were involved in aberrant splicing in breast cancer. Network analyses for the coordinative patterns of transcript variant expression identified twelve “hub” genes that differentiated the cancerous and normal transcriptomes. Dysregulated expression of alternative transcripts may reveal novel biomarkers for tumor development. It may also suggest new therapeutic targets, such as the “hub” genes identified through the network analyses of transcript variant expression, or splicing factors implicated in the formation of the tumor transcriptome. PMID:25913416

  11. Genome-wide transcriptional response of a Saccharomyces cerevisiae strain with an altered redox metabolism.

    Science.gov (United States)

    Bro, Christoffer; Regenberg, Birgitte; Nielsen, Jens

    2004-02-05

    The genome-wide transcriptional response of a Saccharomyces cerevisiae strain deleted in GDH1 that encodes a NADP(+)-dependent glutamate dehydrogenase was compared to a wild-type strain under anaerobic steady-state conditions. The GDH1-deleted strain has a significantly reduced NADPH requirement, and therefore, an altered redox metabolism. Identification of genes with significantly changed expression using a t-test and a Bonferroni correction yielded only 16 transcripts when accepting two false-positives, and 7 of these were Open Reading Frames (ORFs) with unknown function. Among the 16 transcripts the only one with a direct link to redox metabolism was GND1, encoding phosphogluconate dehydrogenase. To extract additional information we analyzed the transcription data for a gene subset consisting of all known genes encoding metabolic enzymes that use NAD(+) or NADP(+). The subset was analyzed for genes with significantly changed expression again with a t-test and correction for multiple testing. This approach was found to enrich the analysis since GND1, ZWF1 and ALD6, encoding the most important enzymes for regeneration of NADPH under anaerobic conditions, were down-regulated together with eight other genes encoding NADP(H)-dependent enzymes. This indicates a possible common redox-dependent regulation of these genes. Furthermore, we showed that it might be necessary to analyze the expression of a subset of genes to extract all available information from global transcription analysis.

  12. Genome-wide transcriptional responses to a lipid hydroperoxide: adaptation occurs without induction of oxidant defenses.

    Science.gov (United States)

    Alic, Nazif; Felder, Thomas; Temple, Mark D; Gloeckner, Christian; Higgins, Vincent J; Briza, Peter; Dawes, Ian W

    2004-07-01

    Free radicals can initiate the oxidation of polyunsaturated fatty acids in cells through the process of lipid peroxidation. The genome-wide transcriptional changes in Saccharomyces cerevisiae after treatment with the toxic lipid peroxidation product linoleic acid hydroperoxide (LoaOOH) were identified. High-dose treatment led to a switch in transcription from biosynthetic to protective functions. This response encompassed a set of genes stimulated predominantly by LoaOOH, and not by other oxidants or heat shock, which contained components of the pleiotropic drug resistance system. The dose dependence of the transcriptional response revealed that large and widespread changes occur only in response to higher doses. Pretreatment of cells with sublethal doses of LoaOOH induces resistance to an otherwise lethal dose through the process of adaptation. Adaptive doses elicited a more subtle transcriptional response affecting metabolic functions, including an increase in the capacity for detoxification and downregulation of the rate of protein synthesis. Surprisingly, the cellular response to adaptive doses did not include induction of oxidative-stress defense enzymes nor of transcripts involved in general cellular defense systems.

  13. Genome wide transcriptional profile analysis of Vitis amurensis and Vitis vinifera in response to cold stress.

    Science.gov (United States)

    Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P; Li, Shaohua

    2013-01-01

    Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the whole genome wide modification of transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amruensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amruensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicate that V. amurensis has fewer differential expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate

  14. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis

    Directory of Open Access Journals (Sweden)

    Katiyar Amit

    2012-10-01

    Full Text Available Abstract Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more number of introns in the middle as compared with C- and N-terminal regions of the predicted genes. Intronless MYB-genes are highly conserved both in rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of 1kb upstream region (5′ to translation initiation codon of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several members of MYBs are up-regulated by various abiotic stresses both in rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and

  15. Genome-wide analysis of the MYB transcription factor superfamily in soybean

    Directory of Open Access Journals (Sweden)

    Du Hai

    2012-07-01

    Full Text Available Abstract Background The MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max, including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis. Results A total of 244 R2R3-MYB genes were identified and further classified into 48 subfamilies based on a phylogenetic comparative analysis with their putative orthologs, showed both gene loss and duplication events. The phylogenetic analysis showed that most characterized MYB genes with similar functions are clustered in the same subfamily, together with the identification of orthologs by synteny analysis, functional conservation among subgroups of MYB genes was strongly indicated. The phylogenetic relationships of each subgroup of MYB genes were well supported by the highly conserved intron/exon structures and motifs outside the MYB domain. Synonymous nucleotide substitution (dN/dS analysis showed that the soybean MYB DNA-binding domain is under strong negative selection. The chromosome distribution pattern strongly indicated that genome-wide segmental and tandem duplication contribute to the expansion of soybean MYB genes. In addition, we found that ~ 4% of soybean R2R3-MYB genes had undergone alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Comparative expression profile analysis of R2R3-MYB genes in soybean and Arabidopsis revealed that MYB genes play conserved and various roles in plants, which is indicative of a divergence in

  16. Drosophila genome-wide RNAi screen identifies multiple regulators of HIF-dependent transcription in hypoxia.

    Directory of Open Access Journals (Sweden)

    Andrés Dekanty

    2010-06-01

    Full Text Available Hypoxia-inducible factors (HIFs are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1 gene, a central element of the microRNA (miRNA translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF-dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF-related pathologies, including heart attack, cancer, and stroke.

  17. Drosophila genome-wide RNAi screen identifies multiple regulators of HIF-dependent transcription in hypoxia.

    Directory of Open Access Journals (Sweden)

    Andrés Dekanty

    2010-06-01

    Full Text Available Hypoxia-inducible factors (HIFs are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1 gene, a central element of the microRNA (miRNA translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF-dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF-related pathologies, including heart attack, cancer, and stroke.

  18. Genome-wide transcriptional analysis of genes associated with acute desiccation stress in Anopheles gambiae.

    Directory of Open Access Journals (Sweden)

    Mei-Hui Wang

    Full Text Available Malaria transmission in sub-Saharan Africa varies seasonally in intensity. Outbreaks of malaria occur after the beginning of the rainy season, whereas, during the dry season, reports of the disease are less frequent. Anopheles gambiae mosquitoes, the main malaria vector, are observed all year long but their densities are low during the dry season that generally lasts several months. Aestivation, seasonal migration, and local adaptation have been suggested as mechanisms that enable mosquito populations to persist through the dry season. Studies of chromosomal inversions have shown that inversions 2La, 2Rb, 2Rc, 2Rd, and 2Ru are associated with various physiological changes that confer aridity resistance. However, little is known about how phenotypic plasticity responds to seasonally dry conditions. This study examined the effects of desiccation stress on transcriptional regulation in An. gambiae. We exposed female An. gambiae G3 mosquitoes to acute desiccation and conducted a genome-wide analysis of their transcriptomes using the Affymetrix Plasmodium/Anopheles Genome Array. The transcription of 248 genes (1.7% of all transcripts was significantly affected in all experimental conditions, including 96 with increased expression and 152 with decreased expression. In general, the data indicate a reduction in the metabolic rate of mosquitoes exposed to desiccation. Transcripts accumulated at higher levels during desiccation are associated with oxygen radical detoxification, DNA repair and stress responses. The proportion of transcripts within 2La and 2Rs (2Rb, 2Rc, 2Rd, and 2Ru (67/248, or 27% is similar to the percentage of transcripts located within these inversions (31%. These data may be useful in efforts to elucidate the role of chromosomal inversions in aridity tolerance. The scope of application of the anopheline genome demonstrates that examining transcriptional activity in relation to genotypic adaptations greatly expands the number of

  19. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii.

    Science.gov (United States)

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong

    2014-10-16

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.

  20. Genome wide analysis of stress responsive WRKY transcription factors in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Shaiq Sultan

    2016-04-01

    Full Text Available WRKY transcription factors are a class of DNA-binding proteins that bind with a specific sequence C/TTGACT/C known as W-Box found in promoters of genes which are regulated by these WRKYs. From previous studies, 43 different stress responsive WRKY transcription factors in Arabidopsis thaliana, identified and then categorized in three groups viz., abiotic, biotic and both of these stresses. A comprehensive genome wide analysis including chromosomal localization, gene structure analysis, multiple sequence alignment, phylogenetic analysis and promoter analysis of these WRKY genes was carried out in this study to determine the functional homology in Arabidopsis. This analysis led to the classification of these WRKY family members into 3 major groups and subgroups and showed evolutionary relationship among these groups on the base of their functional WRKY domain, chromosomal localization and intron/exon structure. The proposed groups of these stress responsive WRKY genes and annotation based on their position on chromosomes can also be explored to determine their functional homology in other plant species in relation to different stresses. The result of the present study provides indispensable genomic information for the stress responsive WRKY transcription factors in Arabidopsis and will pave the way to explain the precise role of various AtWRKYs in plant growth and development under stressed conditions.

  1. Genome-wide Analysis of Plant-specific Dof Transcription Factor Family in Tomato

    Institute of Scientific and Technical Information of China (English)

    Xiaofeng Cai; Yuyang Zhang; Chanjuan Zhang; Tingyan Zhang; Tixu Hu; Jie Ye; Junhong Zhang

    2013-01-01

    The Dof (DNA binding with One Finger) family encoding single zinc finger proteins has been known as a family of plant-specific transcription factors.These transcription factors are involved in a variety of functions of importance for different biological processes in plants.In the current study,we identified 34 Dof family genes in tomato (Solanum lycopersicum L.),distributed on 11 chromosomes.A complete overview of SIDof genes in tomato is presented,including the gene structures,chromosome locations,phylogeny,protein motifs and evolution pattern.Phylogenetic analysis of 34 SlDof proteins resulted in four classes constituting six clusters.In addition,a comparative analysis between these genes in tomato,Arabidopsis (Arabidopsis thaliana L.) and rice (Oryza sativa L.) was also performed.The tomato Dof family expansion has been dated to recent duplication events,and segmental duplication is predominant for the SlDof genes.Furthermore,the SlDof genes displayed differential expression either in their transcript abundance or in their expression patterns under normal growth conditions.This is the first step towards genome-wide analyses of the Dof genes in tomato.Our study provides a very useful reference for cloning and functional analysis of the members of this gene family in tomato and other species.

  2. Genome-wide chromatin occupancy analysis reveals a role for ASH2 in transcriptional pausing.

    Science.gov (United States)

    Pérez-Lluch, Sílvia; Blanco, Enrique; Carbonell, Albert; Raha, Debasish; Snyder, Michael; Serras, Florenci; Corominas, Montserrat

    2011-06-01

    An important mechanism for gene regulation involves chromatin changes via histone modification. One such modification is histone H3 lysine 4 trimethylation (H3K4me3), which requires histone methyltranferase complexes (HMT) containing the trithorax-group (trxG) protein ASH2. Mutations in ash2 cause a variety of pattern formation defects in the Drosophila wing. We have identified genome-wide binding of ASH2 in wing imaginal discs using chromatin immunoprecipitation combined with sequencing (ChIP-Seq). Our results show that genes with functions in development and transcriptional regulation are activated by ASH2 via H3K4 trimethylation in nearby nucleosomes. We have characterized the occupancy of phosphorylated forms of RNA Polymerase II and histone marks associated with activation and repression of transcription. ASH2 occupancy correlates with phosphorylated forms of RNA Polymerase II and histone activating marks in expressed genes. Additionally, RNA Polymerase II phosphorylation on serine 5 and H3K4me3 are reduced in ash2 mutants in comparison to wild-type flies. Finally, we have identified specific motifs associated with ASH2 binding in genes that are differentially expressed in ash2 mutants. Our data suggest that recruitment of the ASH2-containing HMT complexes is context specific and points to a function of ASH2 and H3K4me3 in transcriptional pausing control.

  3. Genome-Wide Transcriptional Regulation Mediated by Biochemically Distinct SWI/SNF Complexes.

    Directory of Open Access Journals (Sweden)

    Jesse R Raab

    2015-12-01

    Full Text Available Multiple positions within the SWI/SNF chromatin remodeling complex can be filled by mutually exclusive subunits. Inclusion or exclusion of these proteins defines many unique forms of SWI/SNF and has profound functional consequences. Often this complex is studied as a single entity within a particular cell type and we understand little about the functional relationship between these biochemically distinct forms of the remodeling complex. Here we examine the functional relationships among three complex-specific ARID (AT-Rich Interacting Domain subunits using genome-wide chromatin immunoprecipitation, transcriptome analysis, and transcription factor binding maps. We find widespread overlap in transcriptional regulation and the genomic binding of distinct SWI/SNF complexes. ARID1B and ARID2 participate in wide-spread cooperation to repress hundreds of genes. Additionally, we find numerous examples of competition between ARID1A and another ARID, and validate that gene expression changes following loss of one ARID are dependent on the function of an alternative ARID. These distinct regulatory modalities are correlated with differential occupancy by transcription factors. Together, these data suggest that distinct SWI/SNF complexes dictate gene-specific transcription through functional interactions between the different forms of the SWI/SNF complex and associated co-factors. Most genes regulated by SWI/SNF are controlled by multiple biochemically distinct forms of the complex, and the overall expression of a gene is the product of the interaction between these different SWI/SNF complexes. The three mutually exclusive ARID family members are among the most frequently mutated chromatin regulators in cancer, and understanding the functional interactions and their role in transcriptional regulation provides an important foundation to understand their role in cancer.

  4. Genome-Wide Transcriptional Regulation Mediated by Biochemically Distinct SWI/SNF Complexes

    Science.gov (United States)

    Raab, Jesse R.; Resnick, Samuel; Magnuson, Terry

    2015-01-01

    Multiple positions within the SWI/SNF chromatin remodeling complex can be filled by mutually exclusive subunits. Inclusion or exclusion of these proteins defines many unique forms of SWI/SNF and has profound functional consequences. Often this complex is studied as a single entity within a particular cell type and we understand little about the functional relationship between these biochemically distinct forms of the remodeling complex. Here we examine the functional relationships among three complex-specific ARID (AT-Rich Interacting Domain) subunits using genome-wide chromatin immunoprecipitation, transcriptome analysis, and transcription factor binding maps. We find widespread overlap in transcriptional regulation and the genomic binding of distinct SWI/SNF complexes. ARID1B and ARID2 participate in wide-spread cooperation to repress hundreds of genes. Additionally, we find numerous examples of competition between ARID1A and another ARID, and validate that gene expression changes following loss of one ARID are dependent on the function of an alternative ARID. These distinct regulatory modalities are correlated with differential occupancy by transcription factors. Together, these data suggest that distinct SWI/SNF complexes dictate gene-specific transcription through functional interactions between the different forms of the SWI/SNF complex and associated co-factors. Most genes regulated by SWI/SNF are controlled by multiple biochemically distinct forms of the complex, and the overall expression of a gene is the product of the interaction between these different SWI/SNF complexes. The three mutually exclusive ARID family members are among the most frequently mutated chromatin regulators in cancer, and understanding the functional interactions and their role in transcriptional regulation provides an important foundation to understand their role in cancer. PMID:26716708

  5. Genome-wide transcript profiling reveals novel breast cancer-associated intronic sense RNAs.

    Science.gov (United States)

    Kim, Sang Woo; Fishilevich, Elane; Arango-Argoty, Gustavo; Lin, Yuefeng; Liu, Guodong; Li, Zhihua; Monaghan, A Paula; Nichols, Mark; John, Bino

    2015-01-01

    Non-coding RNAs (ncRNAs) play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT), in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer.

  6. Genome-wide transcript profiling reveals novel breast cancer-associated intronic sense RNAs.

    Directory of Open Access Journals (Sweden)

    Sang Woo Kim

    Full Text Available Non-coding RNAs (ncRNAs play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT, in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer.

  7. Genome-Wide Dissection of the Heat Shock Transcription Factor Family Genes in Arachis.

    Science.gov (United States)

    Wang, Pengfei; Song, Hui; Li, Changsheng; Li, Pengcheng; Li, Aiqin; Guan, Hongshan; Hou, Lei; Wang, Xingjun

    2017-01-01

    Heat shock transcription factors (Hsfs) are important transcription factors (TFs) in protecting plants from damages caused by various stresses. The released whole genome sequences of wild peanuts make it possible for genome-wide analysis of Hsfs in peanut. In this study, a total of 16 and 17 Hsf genes were identified from Arachis duranensis and A. ipaensis, respectively. We identified 16 orthologous Hsf gene pairs in both peanut species; however HsfXs was only identified from A. ipaensis. Orthologous pairs between two wild peanut species were highly syntenic. Based on phylogenetic relationship, peanut Hsfs were divided into groups A, B, and C. Selection pressure analysis showed that group B Hsf genes mainly underwent positive selection and group A Hsfs were affected by purifying selection. Small scale segmental and tandem duplication may play important roles in the evolution of these genes. Cis-elements, such as ABRE, DRE, and HSE, were found in the promoters of most Arachis Hsf genes. Five AdHsfs and two AiHsfs contained fungal elicitor responsive elements suggesting their involvement in response to fungi infection. These genes were differentially expressed in cultivated peanut under abiotic stress and Aspergillus flavus infection. AhHsf2 and AhHsf14 were significantly up-regulated after inoculation with A. flavus suggesting their possible role in fungal resistance.

  8. Genome-wide Transcription Factor Gene Prediction and their Expressional Tissue-Specificities in Maize

    Institute of Scientific and Technical Information of China (English)

    Yi Jiang; Biao Zeng; Hainan Zhao; Mei Zhang; Shaojun Xie; Jinsheng Lai

    2012-01-01

    Transcription factors (TFs) are important regulators of gene expression.To better understand TFencoding genes in maize (Zea mays L.),a genome-wide TF prediction was performed using the updated B73 reference genome.A total of 2 298 TF genes were identified,which can be classified into 56 families.The largest family,known as the MYB superfamily,comprises 322 MYB and MYB-related TF genes.The expression patterns of 2014 (87.64%) TF genes were examined using RNA-seq data,which resulted in the identification of a subset of TFs that are specifically expressed in particular tissues (including root,shoot,leaf,ear,tassel and kernel).Similarly,98 kernel-specific TF genes were further analyzed,and it was observed that 29 of the kernel-specific genes were preferentially expressed in the early kernel developmental stage,while 69 of the genes were expressed in the late kernel developmental stage.Identification of these TFs,particularly the tissue-specific ones,provides important information for the understanding of development and transcriptional regulation of maize.

  9. The CHR site: definition and genome-wide identification of a cell cycle transcriptional element.

    Science.gov (United States)

    Müller, Gerd A; Wintsche, Axel; Stangner, Konstanze; Prohaska, Sonja J; Stadler, Peter F; Engeland, Kurt

    2014-01-01

    The cell cycle genes homology region (CHR) has been identified as a DNA element with an important role in transcriptional regulation of late cell cycle genes. It has been shown that such genes are controlled by DREAM, MMB and FOXM1-MuvB and that these protein complexes can contact DNA via CHR sites. However, it has not been elucidated which sequence variations of the canonical CHR are functional and how frequent CHR-based regulation is utilized in mammalian genomes. Here, we define the spectrum of functional CHR elements. As the basis for a computational meta-analysis, we identify new CHR sequences and compile phylogenetic motif conservation as well as genome-wide protein-DNA binding and gene expression data. We identify CHR elements in most late cell cycle genes binding DREAM, MMB, or FOXM1-MuvB. In contrast, Myb- and forkhead-binding sites are underrepresented in both early and late cell cycle genes. Our findings support a general mechanism: sequential binding of DREAM, MMB and FOXM1-MuvB complexes to late cell cycle genes requires CHR elements. Taken together, we define the group of CHR-regulated genes in mammalian genomes and provide evidence that the CHR is the central promoter element in transcriptional regulation of late cell cycle genes by DREAM, MMB and FOXM1-MuvB. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Genome-wide identification of the regulatory targets of a transcription factor using biochemical characterization and computational genomic analysis

    Directory of Open Access Journals (Sweden)

    Jolly Emmitt R

    2005-11-01

    Full Text Available Abstract Background A major challenge in computational genomics is the development of methodologies that allow accurate genome-wide prediction of the regulatory targets of a transcription factor. We present a method for target identification that combines experimental characterization of binding requirements with computational genomic analysis. Results Our method identified potential target genes of the transcription factor Ndt80, a key transcriptional regulator involved in yeast sporulation, using the combined information of binding affinity, positional distribution, and conservation of the binding sites across multiple species. We have also developed a mathematical approach to compute the false positive rate and the total number of targets in the genome based on the multiple selection criteria. Conclusion We have shown that combining biochemical characterization and computational genomic analysis leads to accurate identification of the genome-wide targets of a transcription factor. The method can be extended to other transcription factors and can complement other genomic approaches to transcriptional regulation.

  11. Genome-wide identification and characterization of transcription start sites and promoters in the tunicate Ciona intestinalis.

    Science.gov (United States)

    Yokomori, Rui; Shimai, Kotaro; Nishitsuji, Koki; Suzuki, Yutaka; Kusakabe, Takehiro G; Nakai, Kenta

    2016-01-01

    The tunicate Ciona intestinalis, an invertebrate chordate, has recently emerged as a powerful model organism for gene regulation analysis. However, few studies have been conducted to identify and characterize its transcription start sites (TSSs) and promoters at the genome-wide level. Here, using TSS-seq, we identified TSSs at the genome-wide scale and characterized promoters in C. intestinalis. Specifically, we identified TSS clusters (TSCs), high-density regions of TSS-seq tags, each of which appears to originate from an identical promoter. TSCs were found not only at known TSSs but also in other regions, suggesting the existence of many unknown transcription units in the genome. We also identified candidate promoters of 79 ribosomal protein (RP) genes, each of which had the major TSS in a polypyrimidine tract and showed a sharp TSS distribution like human RP gene promoters. Ciona RP gene promoters, however, did not appear to have typical TATA boxes, unlike human RP gene promoters. In Ciona non-RP promoters, two pyrimidine-purine dinucleotides, CA and TA, were frequently used as TSSs. Despite the absence of CpG islands, Ciona TATA-less promoters showed low expression specificity like CpG-associated human TATA-less promoters. By using TSS-seq, we also predicted trans-spliced gene TSSs and found that their downstream regions had higher G+T content than those of non-trans-spliced gene TSSs. Furthermore, we identified many putative alternative promoters, some of which were regulated in a tissue-specific manner. Our results provide valuable information about TSSs and promoter characteristics in C. intestinalis and will be helpful in future analysis of transcriptional regulation in chordates.

  12. Genome-wide analysis of the homeobox C6 transcriptional network in prostate cancer.

    Science.gov (United States)

    McCabe, Colleen D; Spyropoulos, Demetri D; Martin, David; Moreno, Carlos S

    2008-03-15

    Homeobox transcription factors are developmentally regulated genes that play crucial roles in tissue patterning. Homeobox C6 (HOXC6) is overexpressed in prostate cancers and correlated with cancer progression, but the downstream targets of HOXC6 are largely unknown. We have performed genome-wide localization analysis to identify promoters bound by HOXC6 in prostate cancer cells. This analysis identified 468 reproducibly bound promoters whose associated genes are involved in functions such as cell proliferation and apoptosis. We have complemented these data with expression profiling of prostates from mice with homozygous disruption of the Hoxc6 gene to identify 31 direct regulatory target genes of HOXC6. We show that HOXC6 directly regulates expression of bone morphogenic protein 7, fibroblast growth factor receptor 2, insulin-like growth factor binding protein 3, and platelet-derived growth factor receptor alpha (PDGFRA) in prostate cells and indirectly influences the Notch and Wnt signaling pathways in vivo. We further show that inhibition of PDGFRA reduces proliferation of prostate cancer cells, and that overexpression of HOXC6 can overcome the effects of PDGFRA inhibition. HOXC6 regulates genes with both oncogenic and tumor suppressor activities as well as several genes such as CD44 that are important for prostate branching morphogenesis and metastasis to the bone microenvironment.

  13. Genome-wide transcriptional and physiological responses of Bradyrhizobium japonicum to paraquat-mediated oxidative stress.

    Science.gov (United States)

    Donati, Andrew J; Jeon, Jeong-Min; Sangurdekar, Dipen; So, Jae-Seong; Chang, Woo-Suk

    2011-06-01

    The rhizobial bacterium Bradyrhizobium japonicum functions as a nitrogen-fixing symbiont of the soybean plant (Glycine max). Plants are capable of producing an oxidative burst, a rapid proliferation of reactive oxygen species (ROS), as a defense mechanism against pathogenic and symbiotic bacteria. Therefore, B. japonicum must be able to resist such a defense mechanism to initiate nodulation. In this study, paraquat, a known superoxide radical-inducing agent, was used to investigate this response. Genome-wide transcriptional profiles were created for both prolonged exposure (PE) and fulminant shock (FS) conditions. These profiles revealed that 190 and 86 genes were up- and downregulated for the former condition, and that 299 and 105 genes were up- and downregulated for the latter condition, respectively (>2.0-fold; P ROS scavenging enzymes, such as superoxide dismutase and catalase, were not detected, suggesting constitutive expression of those genes by endogenous ROS. Various physiological tests, including exopolysaccharide (EPS), cellular protein, and motility characterization, were performed to corroborate the gene expression data. The results suggest that B. japonicum responds to tolerable oxidative stress during PE through enhanced motility, increased translational activity, and EPS production, in addition to the expression of genes involved in global stress responses, such as chaperones and sigma factors.

  14. Genome-wide analysis of LXRα activation reveals new transcriptional networks in human atherosclerotic foam cells.

    Science.gov (United States)

    Feldmann, Radmila; Fischer, Cornelius; Kodelja, Vitam; Behrens, Sarah; Haas, Stefan; Vingron, Martin; Timmermann, Bernd; Geikowski, Anne; Sauer, Sascha

    2013-04-01

    Increased physiological levels of oxysterols are major risk factors for developing atherosclerosis and cardiovascular disease. Lipid-loaded macrophages, termed foam cells, are important during the early development of atherosclerotic plaques. To pursue the hypothesis that ligand-based modulation of the nuclear receptor LXRα is crucial for cell homeostasis during atherosclerotic processes, we analysed genome-wide the action of LXRα in foam cells and macrophages. By integrating chromatin immunoprecipitation-sequencing (ChIP-seq) and gene expression profile analyses, we generated a highly stringent set of 186 LXRα target genes. Treatment with the nanomolar-binding ligand T0901317 and subsequent auto-regulatory LXRα activation resulted in sequence-dependent sharpening of the genome-binding patterns of LXRα. LXRα-binding loci that correlated with differential gene expression revealed 32 novel target genes with potential beneficial effects, which in part explained the implications of disease-associated genetic variation data. These observations identified highly integrated LXRα ligand-dependent transcriptional networks, including the APOE/C1/C4/C2-gene cluster, which contribute to the reversal of cholesterol efflux and the dampening of inflammation processes in foam cells to prevent atherogenesis.

  15. Genome-wide profiling of transcription factor binding and epigenetic marks in adipocytes by ChIP-seq

    DEFF Research Database (Denmark)

    Nielsen, Ronni; Mandrup, Susanne

    2014-01-01

    The recent advances in high-throughput sequencing combined with various other technologies have allowed detailed and genome-wide insight into the transcriptional networks that control adipogenesis. Chromatin immunoprecipitation (ChIP) combined with high-throughput sequencing (ChIP-seq) is one...

  16. Elucidation of the role of Grr1p in glucose sensing by Saccharomyces cerevisiae through genome-wide transcription analysis

    DEFF Research Database (Denmark)

    Westergaard, Steen Lund; Bro, Christoffer; Olsson, Lisbeth

    2004-01-01

    The role of Grr1p in glucose sensing in Saccharomyces cerevisiae was elucidated through genome-wide transcription analysis. From triplicate analysis of a strain with deletion of the GRR1-gene from the genome and an isogenic reference strain, 68 genes were identified to have significantly altered...

  17. Genome-Wide Mapping of Binding Sites Reveals Multiple Biological Functions of the Transcription Factor Cst6p in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Liu, Guodong; Bergenholm, David; Nielsen, Jens

    2016-01-01

    of Cst6p to its target promoters is condition dependent and explain the mechanism for the retarded growth of the CST6 deletion mutant on ethanol. Furthermore, we demonstrate that Cst6p is a new member of a stress-responsive transcriptional regulatory network. These results provide deeper understanding......In the model eukaryote Saccharomyces cerevisiae, the transcription factor Cst6p has been reported to play important roles in several biological processes. However, the genome-wide targets of Cst6p and its physiological functions remain unknown. Here, we mapped the genome-wide binding sites of Cst6p...... decreased expression of NCE103, encoding a carbonic anhydrase, which is a direct target of Cst6p. The target genes of Cst6p have a large overlap with those of stress-responsive transcription factors, such as Sko1p and Skn7p. In addition, a CST6 deletion mutant growing on ethanol shows hypersensitivity...

  18. A genome-wide longitudinal transcriptome analysis of the aging model Podospora anserina.

    Directory of Open Access Journals (Sweden)

    Oliver Philipp

    Full Text Available Aging of biological systems is controlled by various processes which have a potential impact on gene expression. Here we report a genome-wide transcriptome analysis of the fungal aging model Podospora anserina. Total RNA of three individuals of defined age were pooled and analyzed by SuperSAGE (serial analysis of gene expression. A bioinformatics analysis identified different molecular pathways to be affected during aging. While the abundance of transcripts linked to ribosomes and to the proteasome quality control system were found to decrease during aging, those associated with autophagy increase, suggesting that autophagy may act as a compensatory quality control pathway. Transcript profiles associated with the energy metabolism including mitochondrial functions were identified to fluctuate during aging. Comparison of wild-type transcripts, which are continuously down-regulated during aging, with those down-regulated in the long-lived, copper-uptake mutant grisea, validated the relevance of age-related changes in cellular copper metabolism. Overall, we (i present a unique age-related data set of a longitudinal study of the experimental aging model P. anserina which represents a reference resource for future investigations in a variety of organisms, (ii suggest autophagy to be a key quality control pathway that becomes active once other pathways fail, and (iii present testable predictions for subsequent experimental investigations.

  19. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    We introduce a new type of probabilistic sequence model, that model the sequential composition of reading frames of genes in a genome. Our approach extends gene finders with a model of the sequential composition of genes at the genome-level -- effectively producing a sequential genome annotation...... and are evaluated by the effect on prediction performance. Since bacterial gene finding to a large extent is a solved problem it forms an ideal proving ground for evaluating the explicit modeling of larger scale gene sequence composition of genomes. We conclude that the sequential composition of gene reading frames...... as output. The model can be used to obtain the most probable genome annotation based on a combination of i: a gene finder score of each gene candidate and ii: the sequence of the reading frames of gene candidates through a genome. The model --- as well as a higher order variant --- is developed and tested...

  20. Genome-wide transcriptional response of the archaeon Thermococcus gammatolerans to cadmium.

    Directory of Open Access Journals (Sweden)

    Arnaud Lagorce

    Full Text Available Thermococcus gammatolerans, the most radioresistant archaeon known to date, is an anaerobic and hyperthermophilic sulfur-reducing organism living in deep-sea hydrothermal vents. Knowledge of mechanisms underlying archaeal metal tolerance in such metal-rich ecosystem is still poorly documented. We showed that T. gammatolerans exhibits high resistance to cadmium (Cd, cobalt (Co and zinc (Zn, a weaker tolerance to nickel (Ni, copper (Cu and arsenate (AsO(4 and that cells exposed to 1 mM Cd exhibit a cellular Cd concentration of 67 µM. A time-dependent transcriptomic analysis using microarrays was performed at a non-toxic (100 µM and a toxic (1 mM Cd dose. The reliability of microarray data was strengthened by real time RT-PCR validations. Altogether, 114 Cd responsive genes were revealed and a substantial subset of genes is related to metal homeostasis, drug detoxification, re-oxidization of cofactors and ATP production. This first genome-wide expression profiling study of archaeal cells challenged with Cd showed that T. gammatolerans withstands induced stress through pathways observed in both prokaryotes and eukaryotes but also through new and original strategies. T. gammatolerans cells challenged with 1 mM Cd basically promote: 1 the induction of several transporter/permease encoding genes, probably to detoxify the cell; 2 the upregulation of Fe transporters encoding genes to likely compensate Cd damages in iron-containing proteins; 3 the induction of membrane-bound hydrogenase (Mbh and membrane-bound hydrogenlyase (Mhy2 subunits encoding genes involved in recycling reduced cofactors and/or in proton translocation for energy production. By contrast to other organisms, redox homeostasis genes appear constitutively expressed and only a few genes encoding DNA repair proteins are regulated. We compared the expression of 27 Cd responsive genes in other stress conditions (Zn, Ni, heat shock, γ-rays, and showed that the Cd transcriptional pattern is

  1. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    using the probabilistic logic programming language and machine learning system PRISM - a fast and efficient model prototyping environment, using bacterial gene finding performance as a benchmark of signal strength. The model is used to prune a set of gene predictions from an underlying gene finder...

  2. Genome-Wide Targets Regulated by the OsMADS1 Transcription Factor Reveals Its DNA Recognition Properties1[OPEN

    Science.gov (United States)

    Khanday, Imtiyaz; Das, Sanjukta; Chongloi, Grace L; Vijayraghavan, Usha

    2016-01-01

    OsMADS1 controls rice (Oryza sativa) floral fate and organ development. Yet, its genome-wide targets and the mechanisms underlying its role as a transcription regulator controlling developmental gene expression are unknown. We identify 3112 gene-associated OsMADS1-bound sites in the floret genome. These occur in the vicinity of transcription start sites, within gene bodies, and in intergenic regions. Majority of the bound DNA contained CArG motif variants or, in several cases, only A-tracts. Sequences flanking the binding peak had a higher AT nucleotide content, implying that broader DNA structural features may define in planta binding. Sequences for binding by other transcription factor families like MYC, AP2/ERF, bZIP, etc. are enriched in OsMADS1-bound DNAs. Target genes implicated in transcription, chromatin remodeling, cellular processes, and hormone metabolism were enriched. Combining expression data from OsMADS1 knockdown florets with these DNA binding data, a snapshot of a gene regulatory network was deduced where targets, such as AP2/ERF and bHLH transcription factors and chromatin remodelers form nodes. We show that the expression status of these nodal factors can be altered by inducing the OsMADS1-GR fusion protein and present a model for a regulatory cascade where the direct targets of OsMADS1, OsbHLH108/SPT, OsERF034, and OsHSF24, in turn control genes such as OsMADS32 and OsYABBY5. This cascade, with other similar relationships, cumulatively contributes to floral organ development. Overall, OsMADS1 binds to several regulatory genes and, probably in combination with other factors, controls a gene regulatory network that ensures rice floret development. PMID:27457124

  3. The Csr system regulates genome-wide mRNA stability and transcription and thus gene expression in Escherichia coli.

    Science.gov (United States)

    Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J; Girbal, Laurence; Cocaign-Bousquet, Muriel

    2016-04-26

    Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation.

  4. Genome-wide systematic characterization of the bZIP transcriptional factor family in tomato (Solanum lycopersicum L.).

    Science.gov (United States)

    Li, Dayong; Fu, Fuyou; Zhang, Huijuan; Song, Fengming

    2015-10-12

    Transcription factors of the basic leucine zipper (bZIP) family represent exclusively in eukaryotes and have been shown to regulate diverse biological processes in plant growth and development as well as in abiotic and biotic stress responses. However, little is known about the bZIP family in tomato (Solanum lycopersicum L.). The SlbZIP genes were identified using local BLAST and hidden Markov model profile searches. The phylogenetic trees, conserved motifs and gene structures were generated by MEGA6.06, MEME tool and gene Structure Display Server, respectively. The syntenic block diagrams were generated by the Circos software. The transcriptional gene expression profiles were obtained using Genevestigator tool and quantitative RT-PCR. In the present study, we carried out a genome-wide identification and systematic analyses of 69 SlbZIP genes that distributes unevenly on the tomato chromosomes. This family can be divided into 9 groups according to the phylogenetic relationship among the SlbZIP proteins. Six kinds of intron patterns (a-f) within the basic and hinge regions are defined. The additional conserved motifs and their presence of the group specificity were also identified. Further, we predicted the DNA-binding patterns and the dimerization property on the basis of the characteristic features in the basic and hinge regions and the leucine zipper, respectively, which supports our classification greatly and helps to classify 24 distinct subfamilies. Within the SlbZIP family, a total of 40 SlbZIP genes are located in the segmental duplicate regions in the tomato genome, suggesting that the segment chromosomal duplications contribute greatly to the expansion of the tomato SlbZIP family. Expression profiling analyses of 59 SlbZIP genes using quantitative RT-PCR and publicly available microarray data indicate that the tomato SlbZIP genes have distinct and diverse expression patterns in different tissues and developmental stages and many of the tomato bZIP genes

  5. Genome-Wide Association between Transcription Factor Expression and Chromatin Accessibility Reveals Regulators of Chromatin Accessibility

    Science.gov (United States)

    Rueedi, Rico

    2017-01-01

    To better understand genome regulation, it is important to uncover the role of transcription factors in the process of chromatin structure establishment and maintenance. Here we present a data-driven approach to systematically characterise transcription factors that are relevant for this process. Our method uses a linear mixed modelling approach to combine datasets of transcription factor binding motif enrichments in open chromatin and gene expression across the same set of cell lines. Applying this approach to the ENCODE dataset, we confirm already known and imply numerous novel transcription factors that play a role in the establishment or maintenance of open chromatin. In particular, our approach rediscovers many factors that have been annotated as pioneer factors. PMID:28118358

  6. Genome-wide effects of selenium and translational uncoupling on transcription in the termite gut symbiont Treponema primitia.

    Science.gov (United States)

    Matson, Eric G; Rosenthal, Adam Z; Zhang, Xinning; Leadbetter, Jared R

    2013-11-12

    When prokaryotic cells acquire mutations, encounter translation-inhibiting substances, or experience adverse environmental conditions that limit their ability to synthesize proteins, transcription can become uncoupled from translation. Such uncoupling is known to suppress transcription of protein-encoding genes in bacteria. Here we show that the trace element selenium controls transcription of the gene for the selenocysteine-utilizing enzyme formate dehydrogenase (fdhFSec) through a translation-coupled mechanism in the termite gut symbiont Treponema primitia, a member of the bacterial phylum Spirochaetes. We also evaluated changes in genome-wide transcriptional patterns caused by selenium limitation and by generally uncoupling translation from transcription via antibiotic-mediated inhibition of protein synthesis. We observed that inhibiting protein synthesis in T. primitia influences transcriptional patterns in unexpected ways. In addition to suppressing transcription of certain genes, the expected consequence of inhibiting protein synthesis, we found numerous examples in which transcription of genes and operons is truncated far downstream from putative promoters, is unchanged, or is even stimulated overall. These results indicate that gene regulation in bacteria allows for specific post-initiation transcriptional responses during periods of limited protein synthesis, which may depend both on translational coupling and on unclassified intrinsic elements of protein-encoding genes. A large body of literature demonstrates that the coupling of transcription and translation is a general and essential method by which bacteria regulate gene expression levels. However, the potential role of noncanonical amino acids in regulating transcriptional output via translational control remains, for the most part, undefined. Furthermore, the genome-wide transcriptional state in response to translational decoupling is not well quantified. The results presented here suggest that the

  7. Software engineering the mixed model for genome-wide association studies on large samples

    Science.gov (United States)

    Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample siz...

  8. Genome-wide analysis of basic leucine zipper transcription factor families in Arabidopsis thaliana, Oryza saliva and Populus trichocarpa

    Institute of Scientific and Technical Information of China (English)

    JI Qian; ZHANG Liang-sheng; WANG Yi-fei; WANG Jian

    2009-01-01

    The basic leucine zipper (bZIP) transcription factors form a large gene family that is important in pathogen defense, light and stress signaling, etc. The Completed whole genome sequences of model plants Arabidopsis (Arabidopsis thaliana), rice (Oryza saliva) and poplar (Populus trichocarpa) constitute a valuable resource for genome-wide analysis and genomic comparative analysis, as they are representatives of the two major evolutionary lineages within the angiosperms: the monocotyledons and the dicotyledons. In this study, bioinformatics analysis identified 74, 89 and 88 bZIP genes respectively in Arabidopsis, rice and poplar. Moreover, a comprehensive overview of this gene family is presented, including the gene structure, phylogeny, chromosome distribution, conserved motifs. As a result, the plant bZIPs were organized into 10 subfamilies on basis of phylogenetic relationship. Gene duplication events during the family evolution history were also investigated. And it was further concluded that chromosomal/segmental duplication might have played a key role in gene expansion of bZIP gene family.

  9. Genome-wide mapping of conserved microRNAs and their host transcripts in Tribolium castaneum

    Institute of Scientific and Technical Information of China (English)

    Qibin Luo; Qing Zhou; Xiaomin Yu; Hongbin Lin; Songnian Hu; Jun Yu

    2008-01-01

    MicroRNAs (miRNAs) are endogenous 22-nt RNAs, which play important regulatory roles by post-transcriptional gene silencing. A computational strategy has been developed for the identification of conserved miRNAs based on features of known metazoan miRNAs in red flour beetle (Tribolium castaneum), which is regarded as one of the major laboratory models of arthropods. Among 118 putative miRNAs, 47% and 53% of the predicted miRNAs from the red flour beetle are harbored by known protein-coding genes (intronic) and genes located outside (intergenic miRNA), respectively. There are 31 intronic miRNAs in the same transcriptional orientation as the host genes, which may share RNA polymerase Ⅱ and spliceosomal machinery with their host genes for their biogenesis. A hypothetical feedback model has been proposed based on the analysis of the relationship between intronic miRNAs and their host genes in the development of red flour beetle.

  10. Genome-wide DNA methylation patterns and transcription analysis in sheep muscle.

    Directory of Open Access Journals (Sweden)

    Christine Couldrey

    Full Text Available DNA methylation plays a central role in regulating many aspects of growth and development in mammals through regulating gene expression. The development of next generation sequencing technologies have paved the way for genome-wide, high resolution analysis of DNA methylation landscapes using methodology known as reduced representation bisulfite sequencing (RRBS. While RRBS has proven to be effective in understanding DNA methylation landscapes in humans, mice, and rats, to date, few studies have utilised this powerful method for investigating DNA methylation in agricultural animals. Here we describe the utilisation of RRBS to investigate DNA methylation in sheep Longissimus dorsi muscles. RRBS analysis of ∼1% of the genome from Longissimus dorsi muscles provided data of suitably high precision and accuracy for DNA methylation analysis, at all levels of resolution from genome-wide to individual nucleotides. Combining RRBS data with mRNAseq data allowed the sheep Longissimus dorsi muscle methylome to be compared with methylomes from other species. While some species differences were identified, many similarities were observed between DNA methylation patterns in sheep and other more commonly studied species. The RRBS data presented here highlights the complexity of epigenetic regulation of genes. However, the similarities observed across species are promising, in that knowledge gained from epigenetic studies in human and mice may be applied, with caution, to agricultural species. The ability to accurately measure DNA methylation in agricultural animals will contribute an additional layer of information to the genetic analyses currently being used to maximise production gains in these species.

  11. Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data

    Directory of Open Access Journals (Sweden)

    Kim Seon-Young

    2006-07-01

    Full Text Available Abstract Background A complete understanding of the regulatory mechanisms of gene expression is the next important issue of genomics. Many bioinformaticians have developed methods and algorithms for predicting transcriptional regulatory mechanisms from sequence, gene expression, and binding data. However, most of these studies involved the use of yeast which has much simpler regulatory networks than human and has many genome wide binding data and gene expression data under diverse conditions. Studies of genome wide transcriptional networks of human genomes currently lag behind those of yeast. Results We report herein a new method that combines gene expression data analysis with promoter analysis to infer transcriptional regulatory elements of human genes. The Z scores from the application of gene set analysis with gene sets of transcription factor binding sites (TFBSs were successfully used to represent the activity of TFBSs in a given microarray data set. A significant correlation between the Z scores of gene sets of TFBSs and individual genes across multiple conditions permitted successful identification of many known human transcriptional regulatory elements of genes as well as the prediction of numerous putative TFBSs of many genes which will constitute a good starting point for further experiments. Using Z scores of gene sets of TFBSs produced better predictions than the use of mRNA levels of a transcription factor itself, suggesting that the Z scores of gene sets of TFBSs better represent diverse mechanisms for changing the activity of transcription factors in the cell. In addition, cis-regulatory modules, combinations of co-acting TFBSs, were readily identified by our analysis. Conclusion By a strategic combination of gene set level analysis of gene expression data sets and promoter analysis, we were able to identify and predict many transcriptional regulatory elements of human genes. We conclude that this approach will aid in decoding

  12. Epigenomic model of cardiac enhancers with application to genome wide association studies.

    Science.gov (United States)

    Sahu, Avinash Das; Aniba, Radhouane; Chang, Yen-Pei Christy; Hannenhalli, Sridhar

    2013-01-01

    Mammalian gene regulation is often mediated by distal enhancer elements, in particular, for tissue specific and developmental genes. Computational identification of enhancers is difficult because they do not exhibit clear location preference relative to their target gene and also because they lack clearly distinguishing genomic features. This represents a major challenge in deciphering transcriptional regulation. Recent ChIP-seq based genome-wide investigation of epigenomic modifications have revealed that enhancers are often enriched for certain epigenomic marks. Here we utilize the epigenomic data in human heart tissue along with validated human heart enhancers to develop a Support Vector Machine (SVM) model of cardiac enhancers. Cross-validation classification accuracy of our model was 84% and 92% on positive and negative sets respectively with ROC AUC = 0.92. More importantly, while P300 binding has been used as gold standard for enhancers, our model can distinguish P300-bound validated enhancers from other P300-bound regions that failed to exhibit enhancer activity in transgenic mouse. While GWAS studies reveal polymorphic regions associated with certain phenotypes, they do not immediately provide causality. Next, we hypothesized that genomic regions containing a GWAS SNP associated with a cardiac phenotype might contain another SNP in a cardiac enhancer, which presumably mediates the phenotype. Starting with a comprehensive set of SNPs associated with cardiac phenotypes in GWAS studies, we scored other SNPs in LD with the GWAS SNP according to its probability of being an enhancer and choose one with best score in the LD as enhancer. We found that our predicted enhancers are enriched for known cardiac transcriptional regulator motifs and are likely to regulate the nearby gene. Importantly, these tendencies are more favorable for the predicted enhancers compared with an approach that uses P300 binding as a marker of enhancer activity.

  13. Genome-wide Mapping of Transcriptional Start Sites Defines an Extensive Leaderless Transcriptome in Mycobacterium tuberculosis

    Directory of Open Access Journals (Sweden)

    Teresa Cortes

    2013-11-01

    Full Text Available Deciphering physiological changes that mediate transition of Mycobacterium tuberculosis between replicating and nonreplicating states is essential to understanding how the pathogen can persist in an individual host for decades. We have combined RNA sequencing (RNA-seq of 5′ triphosphate-enriched libraries with regular RNA-seq to characterize the architecture and expression of M. tuberculosis promoters. We identified over 4,000 transcriptional start sites (TSSs. Strikingly, for 26% of the genes with a primary TSS, the site of transcriptional initiation overlapped with the annotated start codon, generating leaderless transcripts lacking a 5′ UTR and, hence, the Shine-Dalgarno sequence commonly used to initiate ribosomal engagement in eubacteria. Genes encoding proteins with active growth functions were markedly depleted from the leaderless transcriptome, and there was a significant increase in the overall representation of leaderless mRNAs in a starvation model of growth arrest. The high percentage of leaderless genes may have particular importance in the physiology of nonreplicating M. tuberculosis.

  14. Genome-Wide Analysis of the Role of Global Transcriptional Regulator GntR1 in Corynebacterium glutamicum

    OpenAIRE

    Tanaka, Yuya; Takemoto, Norihiko; Ito, Terukazu; Teramoto, Haruhiko; Yukawa, Hideaki; Inui, Masayuki

    2014-01-01

    The transcriptional regulator GntR1 downregulates the genes for gluconate catabolism and pentose phosphate pathway in Corynebacterium glutamicum. Gluconate lowers the DNA binding affinity of GntR1, which is probably the mechanism of gluconate-dependent induction of these genes. In addition, GntR1 positively regulates ptsG, a gene encoding a major glucose transporter, and pck, a gene encoding phosphoenolpyruvate carboxykinase. Here, we searched for the new target of GntR1 on a genome-wide scal...

  15. Genome-wide analysis of growth phase-dependent translational and transcriptional regulation in halophilic archaea

    Directory of Open Access Journals (Sweden)

    Raddatz Günter

    2007-11-01

    Full Text Available Abstract Background Differential expression of genes can be regulated on many different levels. Most global studies of gene regulation concentrate on transcript level regulation, and very few global analyses of differential translational efficiencies exist. The studies have revealed that in Saccharomyces cerevisiae, Arabidopsis thaliana, and human cell lines translational regulation plays a significant role. Additional species have not been investigated yet. Particularly, until now no global study of translational control with any prokaryotic species was available. Results A global analysis of translational control was performed with two haloarchaeal model species, Halobacterium salinarum and Haloferax volcanii. To identify differentially regulated genes, exponentially growing and stationary phase cells were compared. More than 20% of H. salinarum transcripts are translated with non-average efficiencies. By far the largest group is comprised of genes that are translated with above-average efficiency specifically in exponential phase, including genes for many ribosomal proteins, RNA polymerase subunits, enzymes, and chemotaxis proteins. Translation of 1% of all genes is specifically repressed in either of the two growth phases. For comparison, DNA microarrays were also used to identify differential transcriptional regulation in H. salinarum, and 17% of all genes were found to have non-average transcript levels in exponential versus stationary phase. In H. volcanii, 12% of all genes are translated with non-average efficiencies. The overlap with H. salinarum is negligible. In contrast to H. salinarum, 4.6% of genes have non-average translational efficiency in both growth phases, and thus they might be regulated by other stimuli than growth phase. Conclusion For the first time in any prokaryotic species it was shown that a significant fraction of genes is under differential translational control. Groups of genes with different regulatory patterns

  16. Genome-Wide Effects of Selenium and Translational Uncoupling on Transcription in the Termite Gut Symbiont Treponema primitia

    Science.gov (United States)

    Matson, Eric G.; Rosenthal, Adam Z.; Zhang, Xinning; Leadbetter, Jared R.

    2013-01-01

    ABSTRACT When prokaryotic cells acquire mutations, encounter translation-inhibiting substances, or experience adverse environmental conditions that limit their ability to synthesize proteins, transcription can become uncoupled from translation. Such uncoupling is known to suppress transcription of protein-encoding genes in bacteria. Here we show that the trace element selenium controls transcription of the gene for the selenocysteine-utilizing enzyme formate dehydrogenase (fdhFSec) through a translation-coupled mechanism in the termite gut symbiont Treponema primitia, a member of the bacterial phylum Spirochaetes. We also evaluated changes in genome-wide transcriptional patterns caused by selenium limitation and by generally uncoupling translation from transcription via antibiotic-mediated inhibition of protein synthesis. We observed that inhibiting protein synthesis in T. primitia influences transcriptional patterns in unexpected ways. In addition to suppressing transcription of certain genes, the expected consequence of inhibiting protein synthesis, we found numerous examples in which transcription of genes and operons is truncated far downstream from putative promoters, is unchanged, or is even stimulated overall. These results indicate that gene regulation in bacteria allows for specific post-initiation transcriptional responses during periods of limited protein synthesis, which may depend both on translational coupling and on unclassified intrinsic elements of protein-encoding genes. PMID:24222491

  17. Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types

    DEFF Research Database (Denmark)

    Ecker, Simone; Chen, Lu; Pancaldi, Vera

    2017-01-01

    Background: A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. Results: We apply a novel analytical approach to measure and compare transcriptional and epigenetic v...

  18. Rho-dependent transcription termination is essential to prevent excessive genome-wide R-loops in Escherichia coli.

    Science.gov (United States)

    Leela, J Krishna; Syeda, Aisha H; Anupama, K; Gowrishankar, J

    2013-01-02

    Two pathways of transcription termination, factor-independent and -dependent, exist in bacteria. The latter pathway operates on nascent transcripts that are not simultaneously translated and requires factors Rho, NusG, and NusA, each of which is essential for viability of WT Escherichia coli. NusG and NusA are also involved in antitermination of transcription at the ribosomal RNA operons, as well as in regulating the rates of transcription elongation of all genes. We have used a bisulfite-sensitivity assay to demonstrate genome-wide increase in the occurrence of RNA-DNA hybrids (R-loops), including from antisense and read-through transcripts, in a nusG missense mutant defective for Rho-dependent termination. Lethality associated with complete deficiency of Rho and NusG (but not NusA) was rescued by ectopic expression of an R-loop-helicase UvsW, especially so on defined growth media. Our results suggest that factor-dependent transcription termination subserves a surveillance function to prevent translation-uncoupled transcription from generating R-loops, which would block replication fork progression and therefore be lethal, and that NusA performs additional essential functions as well in E. coli. Prevention of R-loop-mediated transcription-replication conflicts by cotranscriptional protein engagement of nascent RNA is emerging as a unifying theme among both prokaryotes and eukaryotes.

  19. Genome-wide transcriptional response of Silurana (Xenopus tropicalis to infection with the deadly chytrid fungus.

    Directory of Open Access Journals (Sweden)

    Erica Bree Rosenblum

    Full Text Available Emerging infectious diseases are of great concern for both wildlife and humans. Several highly virulent fungal pathogens have recently been discovered in natural populations, highlighting the need for a better understanding of fungal-vertebrate host-pathogen interactions. Because most fungal pathogens are not fatal in the absence of other predisposing conditions, host-pathogen dynamics for deadly fungal pathogens are of particular interest. The chytrid fungus Batrachochytrium dendrobatidis (hereafter Bd infects hundreds of species of frogs in the wild. It is found worldwide and is a significant contributor to the current global amphibian decline. However, the mechanism by which Bd causes death in amphibians, and the response of the host to Bd infection, remain largely unknown. Here we use whole-genome microarrays to monitor the transcriptional responses to Bd infection in the model frog species, Silurana (Xenopus tropicalis, which is susceptible to chytridiomycosis. To elucidate the immune response to Bd and evaluate the physiological effects of chytridiomycosis, we measured gene expression changes in several tissues (liver, skin, spleen following exposure to Bd. We detected a strong transcriptional response for genes involved in physiological processes that can help explain some clinical symptoms of chytridiomycosis at the organismal level. However, we detected surprisingly little evidence of an immune response to Bd exposure, suggesting that this susceptible species may not be mounting efficient innate and adaptive immune responses against Bd. The weak immune response may be partially explained by the thermal conditions of the experiment, which were optimal for Bd growth. However, many immune genes exhibited decreased expression in Bd-exposed frogs compared to control frogs, suggesting a more complex effect of Bd on the immune system than simple temperature-mediated immune suppression. This study generates important baseline data for ongoing

  20. Genome-wide transcriptional changes induced by phagocytosis or growth on bacteria in Dictyostelium

    Directory of Open Access Journals (Sweden)

    Peracino Barbara

    2008-06-01

    Full Text Available Abstract Background Phagocytosis plays a major role in the defense of higher organisms against microbial infection and provides also the basis for antigen processing in the immune response. Cells of the model organism Dictyostelium are professional phagocytes that exploit phagocytosis of bacteria as the preferred way to ingest food, besides killing pathogens. We have investigated Dictyostelium differential gene expression during phagocytosis of non-pathogenic bacteria, using DNA microarrays, in order to identify molecular functions and novel genes involved in phagocytosis. Results The gene expression profiles of cells incubated for a brief time with bacteria were compared with cells either incubated in axenic medium or growing on bacteria. Transcriptional changes during exponential growth in axenic medium or on bacteria were also compared. We recognized 443 and 59 genes that are differentially regulated by phagocytosis or by the different growth conditions (growth on bacteria vs. axenic medium, respectively, and 102 genes regulated by both processes. Roughly one third of the genes are up-regulated compared to macropinocytosis and axenic growth. Functional annotation of differentially regulated genes with different tools revealed that phagocytosis induces profound changes in carbohydrate, aminoacid and lipid metabolism, and in cytoskeletal components. Genes regulating translation and mitochondrial biogenesis are mostly up-regulated. Genes involved in sterol biosynthesis are selectively up-regulated, suggesting a shift in membrane lipid composition linked to phagocytosis. Very few changes were detected in genes required for vesicle fission/fusion, indicating that the intracellular traffic machinery is mostly in common between phagocytosis and macropinocytosis. A few putative receptors, including GPCR family 3 proteins, scaffolding and adhesion proteins, components of signal transduction and transcription factors have been identified, which could

  1. Genome-wide transcriptional response of silkworm (Bombyx mori) to infection by the microsporidian Nosema bombycis.

    Science.gov (United States)

    Ma, Zhengang; Li, Chunfeng; Pan, Guoqing; Li, Zhihong; Han, Bing; Xu, Jinshan; Lan, Xiqian; Chen, Jie; Yang, Donglin; Chen, Quanmei; Sang, Qi; Ji, Xiaocun; Li, Tian; Long, Mengxian; Zhou, Zeyang

    2013-01-01

    Microsporidia have attracted much attention because they infect a variety of species ranging from protists to mammals, including immunocompromised patients with AIDS or cancer. Aside from the study on Nosema ceranae, few works have focused on elucidating the mechanism in host response to microsporidia infection. Nosema bombycis is a pathogen of silkworm pébrine that causes great economic losses to the silkworm industry. Detailed understanding of the host (Bombyx mori) response to infection by N. bombycis is helpful for prevention of this disease. A genome-wide survey of the gene expression profile at 2, 4, 6 and 8 days post-infection by N. bombycis was performed and results showed that 64, 244, 1,328, 1,887 genes were induced, respectively. Up to 124 genes, which are involved in basal metabolism pathways, were modulated. Notably, B. mori genes that play a role in juvenile hormone synthesis and metabolism pathways were induced, suggesting that the host may accumulate JH as a response to infection. Interestingly, N. bombycis can inhibit the silkworm serine protease cascade melanization pathway in hemolymph, which may be due to the secretion of serpins in the microsporidia. N. bombycis also induced up-regulation of several cellular immune factors, in which CTL11 has been suggested to be involved in both spore recognition and immune signal transduction. Microarray and real-time PCR analysis indicated the activation of silkworm Toll and JAK/STAT pathways. The notable up-regulation of antimicrobial peptides, including gloverins, lebocins and moricins, strongly indicated that antimicrobial peptide defense mechanisms were triggered to resist the invasive microsporidia. An analysis of N. bombycis-specific response factors suggested their important roles in anti-microsporidia defense. Overall, this study primarily provides insight into the potential molecular mechanisms for the host-parasite interaction between B. mori and N. bombycis and may provide a foundation for

  2. Genome-wide assembly and analysis of alternative transcripts in mouse

    Science.gov (United States)

    Sharov, Alexei A.; Dudekula, Dawood B.; Ko, Minoru S.H.

    2005-01-01

    To build a mouse gene index with the most comprehensive coverage of alternative transcription/splicing (ATS), we developed an algorithm and a fully automated computational pipeline for transcript assembly from expressed sequences aligned to the genome. We identified 191,946 genomic loci, which included 27,497 protein-coding genes and 11,906 additional gene candidates (e.g., nonprotein-coding, but multiexon). Comparison of the resulting gene index with TIGR, UniGene, DoTS, and ESTGenes databases revealed that it had a greater number of transcripts, a greater average number of exons and introns with proper splicing sites per gene, and longer ORFs. The 27,497 protein-coding genes had 77,138 transcripts, i.e., 2.8 transcripts per gene on average. Close examination of transcripts led to a combinatorial table of 23 types of ATS units, only nine of which were previously described, i.e., 14 types of alternative splicing, seven types of alternative starts, and two types of alternative termination. The 47%, 18%, and 14% of 20,323 multiexon protein-coding genes with proper splice sites had alternative splicings, alternative starts, and alternative terminations, respectively. The gene index with the comprehensive ATS will provide a useful platform for analyzing the nature and mechanism of ATS, as well as for designing the accurate exon-based DNA microarrays. PMID:15867436

  3. Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas.

    Science.gov (United States)

    Dong, Chen; Hu, Huigang; Xie, Jianghui

    2016-12-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.

  4. Genome-wide transcription profile of endothelial cells after cardiac transplantation in the rat.

    Science.gov (United States)

    Mikalsen, B; Fosby, B; Wang, J; Hammarström, C; Bjaerke, H; Lundström, M; Kasprzycka, M; Scott, H; Line, P-D; Haraldsen, G

    2010-07-01

    Transcriptome analyses of organ transplants have until now usually focused on whole tissue samples containing activation profiles from different cell populations. Here, we enriched endothelial cells from rat cardiac allografts and isografts, establishing their activation profile at baseline and on days 2, 3 and 4 after transplantation. Modulated transcripts were assigned to three categories based on their regulation profile in allografts and isografts. Categories A and B contained the majority of transcripts and showed similar regulation in both graft types, appearing to represent responses to surgical trauma. By contrast, category C contained transcripts that were partly allograft-specific and to a large extent associated with interferon-gamma-responsiveness. Several transcripts were verified by immunohistochemical analysis of graft lesions, among them the matricellular protein periostin, which was one of the most highly upregulated transcripts but has not been associated with transplantation previously. In conclusion, the majority of the differentially expressed genes in graft endothelial cells are affected by the transplantation procedure whereas relatively few are associated with allograft rejection.

  5. Genome-wide dynamic transcriptional profiling in clostridium beijerinckii NCIMB 8052 using single-nucleotide resolution RNA-Seq

    Directory of Open Access Journals (Sweden)

    Wang Yi

    2012-03-01

    Full Text Available Abstract Background Clostridium beijerinckii is a prominent solvent-producing microbe that has great potential for biofuel and chemical industries. Although transcriptional analysis is essential to understand gene functions and regulation and thus elucidate proper strategies for further strain improvement, limited information is available on the genome-wide transcriptional analysis for C. beijerinckii. Results The genome-wide transcriptional dynamics of C. beijerinckii NCIMB 8052 over a batch fermentation process was investigated using high-throughput RNA-Seq technology. The gene expression profiles indicated that the glycolysis genes were highly expressed throughout the fermentation, with comparatively more active expression during acidogenesis phase. The expression of acid formation genes was down-regulated at the onset of solvent formation, in accordance with the metabolic pathway shift from acidogenesis to solventogenesis. The acetone formation gene (adc, as a part of the sol operon, exhibited highly-coordinated expression with the other sol genes. Out of the > 20 genes encoding alcohol dehydrogenase in C. beijerinckii, Cbei_1722 and Cbei_2181 were highly up-regulated at the onset of solventogenesis, corresponding to their key roles in primary alcohol production. Most sporulation genes in C. beijerinckii 8052 demonstrated similar temporal expression patterns to those observed in B. subtilis and C. acetobutylicum, while sporulation sigma factor genes sigE and sigG exhibited accelerated and stronger expression in C. beijerinckii 8052, which is consistent with the more rapid forespore and endspore development in this strain. Global expression patterns for specific gene functional classes were examined using self-organizing map analysis. The genes associated with specific functional classes demonstrated global expression profiles corresponding to the cell physiological variation and metabolic pathway switch. Conclusions The results from this

  6. Genome-wide assembly and analysis of alternative transcripts in mouse

    OpenAIRE

    Sharov, Alexei A; Dudekula, Dawood B.; Minoru S.H. Ko

    2005-01-01

    To build a mouse gene index with the most comprehensive coverage of alternative transcription/splicing (ATS), we developed an algorithm and a fully automated computational pipeline for transcript assembly from expressed sequences aligned to the genome. We identified 191,946 genomic loci, which included 27,497 protein-coding genes and 11,906 additional gene candidates (e.g., nonprotein-coding, but multiexon). Comparison of the resulting gene index with TIGR, UniGene, DoTS, and ESTGenes databas...

  7. Genome-Wide Transcriptional Responses to Carbon Starvation in Nongrowing Lactococcus lactis

    NARCIS (Netherlands)

    Ercan,O.; Wels, M.; Smid. E.J.; Kleerebezem, M.

    2015-01-01

    This paper describes the transcriptional adaptations of nongrowing, retentostat cultures of Lactococcus lactis to starvation. Near-zero-growth cultures (μ = 0.0001 h−1) obtained by extended retentostat cultivation were exposed to starvation by termination of the medium supply for 24 h, followed by a

  8. Genome-wide transcriptional responses to carbon starvation in nongrowing Lactococcus lactis

    NARCIS (Netherlands)

    Ercan, O.; Wels, M.; Smid, E.J.; Kleerebezem, M.

    2015-01-01

    This paper describes the transcriptional adaptations of nongrowing, retentostat cultures of Lactococcus lactis to starvation. Near-zero-growth cultures (µ = 0.0001 h-1) obtained by extended retentostat cultivation were exposed to starvation by termination of the medium supply for 24 h, followed by a

  9. An efficient hierarchical generalized linear mixed model for pathway analysis of genome-wide association studies.

    Science.gov (United States)

    Wang, Lily; Jia, Peilin; Wolfinger, Russell D; Chen, Xi; Grayson, Britney L; Aune, Thomas M; Zhao, Zhongming

    2011-03-01

    In genome-wide association studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains to be challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene's association with disease varies about the gene group mean, thus belongs to the class of mixed effects models. The proposed model is computationally efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and linkage disequilibrium (LD). Via simulated and real GWAS data, we showed our model improved power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC Type 1 Diabetes (T1D) dataset, we demonstrated mixed model analysis identified meaningful biological processes that agreed well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS. The software code for mixed models analysis is freely available at http://biostat.mc.vanderbilt.edu/LilyWang.

  10. An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

    Science.gov (United States)

    Mercier, Eloi; Droit, Arnaud; Li, Leping; Robertson, Gordon; Zhang, Xuekui; Gottardo, Raphael

    2011-02-16

    ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

  11. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi

    Directory of Open Access Journals (Sweden)

    Caiqin eLi

    2015-07-01

    Full Text Available The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn. causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2,730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1,867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors, protein ubiquitination, ROS response, calcium signal transduction and cell wall modification. These genes could be clustered into 4 groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to

  12. Genome-Wide Transcriptional Profiling of the Response of Staphylococcus aureus to Cryptotanshinone

    Directory of Open Access Journals (Sweden)

    Haihua Feng

    2009-01-01

    Full Text Available Staphylococcus aureus (S. aureus strains with multiple antibiotic resistances are increasingly widespread, and new agents are required for the treatment of S. aureus. Cryptotanshinone (CT, a major tanshinone of medicinal plant Salvia miltiorrhiza Bunge, demonstrated effective in vitro antibacterial activity against all 21 S. aureus strains tested in this experiment. Affymetrix GeneChips were utilized to determine the global transcriptional response of S. aureus ATCC 25923 to treatment with subinhibitory concentrations of CT. Transcriptome profiling indicated that the antibacterial action of CT may be associated with its action as active oxygen radical generator; S. aureus undergoes an oxygen-limiting state upon exposure to CT.

  13. Histone deacetylase inhibition modulates histone acetylation at gene promoter regions and affects genome-wide gene transcription in Schistosoma mansoni

    Science.gov (United States)

    Anderson, Letícia; Gomes, Monete Rajão; daSilva, Lucas Ferreira; Pereira, Adriana da Silva Andrade; Mourão, Marina M.; Romier, Christophe; Pierce, Raymond

    2017-01-01

    Background Schistosomiasis is a parasitic disease infecting hundreds of millions of people worldwide. Treatment depends on a single drug, praziquantel, which kills the Schistosoma spp. parasite only at the adult stage. HDAC inhibitors (HDACi) such as Trichostatin A (TSA) induce parasite mortality in vitro (schistosomula and adult worms), however the downstream effects of histone hyperacetylation on the parasite are not known. Methodology/Principal findings TSA treatment of adult worms in vitro increased histone acetylation at H3K9ac and H3K14ac, which are transcription activation marks, not affecting the unrelated transcription repression mark H3K27me3. We investigated the effect of TSA HDACi on schistosomula gene expression at three different time points, finding a marked genome-wide change in the transcriptome profile. Gene transcription activity was correlated with changes on the chromatin acetylation mark at gene promoter regions. Moreover, combining expression data with ChIP-Seq public data for schistosomula, we found that differentially expressed genes having the H3K4me3 mark at their promoter region in general showed transcription activation upon HDACi treatment, compared with those without the mark, which showed transcription down-regulation. Affected genes are enriched for DNA replication processes, most of them being up-regulated. Twenty out of 22 genes encoding proteins involved in reducing reactive oxygen species accumulation were down-regulated. Dozens of genes encoding proteins with histone reader motifs were changed, including SmEED from the PRC2 complex. We targeted SmEZH2 methyltransferase PRC2 component with a new EZH2 inhibitor (GSK343) and showed a synergistic effect with TSA, significantly increasing schistosomula mortality. Conclusions/Significance Genome-wide gene expression analyses have identified important pathways and cellular functions that were affected and may explain the schistosomicidal effect of TSA HDACi. The change in expression

  14. Blood Genome-Wide Transcriptional Profiles of HER2 Negative Breast Cancers Patients

    Directory of Open Access Journals (Sweden)

    Ovidiu Balacescu

    2016-01-01

    Full Text Available Tumors act systemically to sustain cancer progression, affecting the physiological processes in the host and triggering responses in the blood circulating cells. In this study, we explored blood transcriptional patterns of patients with two subtypes of HER2 negative breast cancers, with different prognosis and therapeutic outcome. Peripheral blood samples from seven healthy female donors and 29 women with breast cancer including 14 triple-negative breast cancers and 15 hormone-dependent breast cancers were evaluated by microarray. We also evaluated the stroma in primary tumors. Transcriptional analysis revealed distinct molecular signatures in the blood of HER2− breast cancer patients according to ER/PR status. Our data showed the implication of immune signaling in both breast cancer subtypes with an enrichment of these processes in the blood of TNBC patients. We observed a significant alteration of “chemokine signaling,” “IL-8 signaling,” and “communication between innate and adaptive immune cells” pathways in the blood of TNBC patients correlated with an increased inflammation and necrosis in their primary tumors. Overall, our data indicate that the presence of triple-negative breast cancer is associated with an enrichment of altered systemic immune-related pathways, suggesting that immunotherapy could possibly be synergistic to the chemotherapy, to improve the clinical outcome of these patients.

  15. Genomic-wide transcriptional profiling in primary myoblasts reveals Runx1-regulated genes in muscle regeneration

    Directory of Open Access Journals (Sweden)

    Kfir Baruch Umansky

    2015-12-01

    Full Text Available In response to muscle damage the muscle adult stem cells are activated and differentiate into myoblasts that regenerate the damaged tissue. We have recently showed that following myopathic damage the level of the Runx1 transcription factor (TF is elevated and that during muscle regeneration this TF regulates the balance between myoblast proliferation and differentiation (Umansky et al.. We employed Runx1-dependent gene expression, Chromatin Immunoprecipitation sequencing (ChIP-seq, Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq and histone H3K4me1/H3K27ac modification analyses to identify a subset of Runx1-regulated genes that are co-occupied by the TFs MyoD and c-Jun and are involved in muscle regeneration (Umansky et al.. The data is available at the GEO database under the superseries accession number GSE56131.

  16. Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection

    Directory of Open Access Journals (Sweden)

    Cho Won

    2012-05-01

    Full Text Available Abstract Background Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21 is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessary for preventing damage caused by F. graminearum. Therefore, we investigated important cellular regulatory processes in a host containing FgV1-DK21 as compared to an uninfected parent using a transcriptional approach. Results Using a 3′-tiling microarray covering all known F. graminearum genes, we carried out genome-wide expression analyses of F. graminearum at two different time points. At the early point of growth of an infected strain as compared to an uninfected strain, genes associated with protein synthesis, including ribosome assembly, nucleolus, and ribosomal RNA processing, were significantly up-regulated. In addition, genes required for transcription and signal transduction, including fungal-specific transcription factors and cAMP signaling, respectively, were actively up-regulated. In contrast, genes involved in various metabolic pathways, particularly in producing carboxylic acids, aromatic amino acids, nitrogen compounds, and polyamines, showed dramatic down-regulation at the early time point. Moreover, genes associated with transport systems localizing to transmembranes were down-regulated at both time points. Conclusion This is the first report of global change in the prominent cellular pathways in the Fusarium host containing FgV1-DK21. The significant increase in transcripts for transcription and translation machinery in fungal host cells seems to be related to virus replication. In addition, significant down-regulation of genes required for metabolism and transporting systems in a fungal host containing the virus appears to be related to the host defense mechanism and fungal virulence. Taken together

  17. GenoGAM: genome-wide generalized additive models for ChIP-Seq analysis.

    Science.gov (United States)

    Stricker, Georg; Engelhardt, Alexander; Schulz, Daniel; Schmid, Matthias; Tresch, Achim; Gagneur, Julien

    2017-08-01

    Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used approach to study protein-DNA interactions. Often, the quantities of interest are the differential occupancies relative to controls, between genetic backgrounds, treatments, or combinations thereof. Current methods for differential occupancy of ChIP-Seq data rely however on binning or sliding window techniques, for which the choice of the window and bin sizes are subjective. Here, we present GenoGAM (Genome-wide Generalized Additive Model), which brings the well-established and flexible generalized additive models framework to genomic applications using a data parallelism strategy. We model ChIP-Seq read count frequencies as products of smooth functions along chromosomes. Smoothing parameters are objectively estimated from the data by cross-validation, eliminating ad hoc binning and windowing needed by current approaches. GenoGAM provides base-level and region-level significance testing for full factorial designs. Application to a ChIP-Seq dataset in yeast showed increased sensitivity over existing differential occupancy methods while controlling for type I error rate. By analyzing a set of DNA methylation data and illustrating an extension to a peak caller, we further demonstrate the potential of GenoGAM as a generic statistical modeling tool for genome-wide assays. Software is available from Bioconductor: https://www.bioconductor.org/packages/release/bioc/html/GenoGAM.html . gagneur@in.tum.de. Supplementary information is available at Bioinformatics online.

  18. Genome-wide association study of handedness excludes simple genetic models

    Science.gov (United States)

    Armour, J AL; Davison, A; McManus, I C

    2014-01-01

    Handedness is a human behavioural phenotype that appears to be congenital, and is often assumed to be inherited, but for which the developmental origin and underlying causation(s) have been elusive. Models of the genetic basis of variation in handedness have been proposed that fit different features of the observed resemblance between relatives, but none has been decisively tested or a corresponding causative locus identified. In this study, we applied data from well-characterised individuals studied at the London Twin Research Unit. Analysis of genome-wide SNP data from 3940 twins failed to identify any locus associated with handedness at a genome-wide level of significance. The most straightforward interpretation of our analyses is that they exclude the simplest formulations of the ‘right-shift' model of Annett and the ‘dextral/chance' model of McManus, although more complex modifications of those models are still compatible with our observations. For polygenic effects, our study is inadequately powered to reliably detect alleles with effect sizes corresponding to an odds ratio of 1.2, but should have good power to detect effects at an odds ratio of 2 or more. PMID:24065183

  19. Genome-wide mapping of transcription factor binding reveals developmental process integration and a fresh look at evolutionary dynamics.

    Science.gov (United States)

    Yant, Levi

    2012-02-01

    How does evolution forge adaptive responses? Are many changes required or few? Just how complex are the transcriptional networks that control development? Diverse questions like these are being newly addressed by next-generation sequencing-based techniques. Facilitating a mechanistic understanding, these approaches reveal the direct in vivo interactions between transcription factors and their physical targets, combined with genome-scale readouts to comprehensively map adaptive gene regulatory networks (GRNs). Here I focus on pioneering work from the last 3 years that has leveraged these data to investigate diverse aspects of GRN circuitry controlling the reproductive transition in plants. These approaches have revealed surprising new functions for long-investigated key players in developmental programs and laid bare the basis for pleiotropy in many others, suggesting widespread process integration at the transcriptional level. Evolutionary questions begged by the recent deluge of GRN mapping data are being assessed anew, both by emerging work outside Arabidopsis thaliana and novel analyses within. These studies have swiftly exposed the distinctive power and adaptability of genome-wide GRN mapping and illustrate that this unique data type holds tremendous promise for plant biology.

  20. Genome-wide DNA binding pattern of the homeodomain transcription factor Sine oculis (So in the developing eye of Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Barbara Jusiak

    2014-12-01

    Full Text Available The eye of the fruit fly Drosophila melanogaster provides a highly tractable genetic model system for the study of animal development, and many genes that regulate Drosophila eye formation have homologs implicated in human development and disease. Among these is the homeobox gene sine oculis (so, which encodes a homeodomain transcription factor (TF that is both necessary for eye development and sufficient to reprogram a subset of cells outside the normal eye field toward an eye fate. We have performed a genome-wide analysis of So binding to DNA prepared from developing Drosophila eye tissue in order to identify candidate direct targets of So-mediated transcriptional regulation, as described in our recent article [20]. The data are available from NCBI Gene Expression Omnibus (GEO with the accession number GSE52943. Here we describe the methods, data analysis, and quality control of our So ChIP-seq dataset.

  1. Genome-wide characterization of JASMONATE-ZIM DOMAIN transcription repressors in wheat (Triticum aestivum L.).

    Science.gov (United States)

    Wang, Yukun; Qiao, Linyi; Bai, Jianfang; Wang, Peng; Duan, Wenjing; Yuan, Shaohua; Yuan, Guoliang; Zhang, Fengting; Zhang, Liping; Zhao, Changping

    2017-02-13

    The JASMONATE-ZIM DOMAIN (JAZ) repressor family proteins are jasmonate co-receptors and transcriptional repressor in jasmonic acid (JA) signaling pathway, and they play important roles in regulating the growth and development of plants. Recently, more and more researches on JAZ gene family are reported in many plants. Although the genome sequencing of common wheat (Triticum aestivum L.) and its relatives is complete, our knowledge about this gene family remains vacant. Fourteen JAZ genes were identified in the wheat genome. Structural analysis revealed that the TaJAZ proteins in wheat were as conserved as those in other plants, but had structural characteristics. By phylogenetic analysis, all JAZ proteins from wheat and other plants were clustered into 11 sub-groups (G1-G11), and TaJAZ proteins shared a high degree of similarity with some JAZ proteins from Aegliops tauschii, Brachypodium distachyon and Oryza sativa. The Ka/Ks ratios of TaJAZ genes ranged from 0.0016 to 0.6973, suggesting that the TaJAZ family had undergone purifying selection in wheat. Gene expression patterns obtained by quantitative real-time PCR (qRT-PCR) revealed differential temporal and spatial regulation of TaJAZ genes under multifarious abiotic stress treatments of high salinity, drought, cold and phytohormone. Among these, TaJAZ7, 8 and 12 were specifically expressed in the anther tissues of the thermosensitive genic male sterile (TGMS) wheat line BS366 and normal control wheat line Jing411. Compared with the gene expression patterns in the normal wheat line Jing411, TaJAZ7, 8 and 12 had different expression patterns in abnormally dehiscent anthers of BS366 at the heading stage 6, suggesting that specific up- or down-regulation of these genes might be associated with the abnormal anther dehiscence in TGMS wheat line. This study analyzed the size and composition of the JAZ gene family in wheat, and investigated stress responsive and differential tissue-specific expression profiles of each

  2. Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli.

    Directory of Open Access Journals (Sweden)

    Alfredo Mendoza-Vargas

    Full Text Available Despite almost 40 years of molecular genetics research in Escherichia coli a major fraction of its Transcription Start Sites (TSSs are still unknown, limiting therefore our understanding of the regulatory circuits that control gene expression in this model organism. RegulonDB (http://regulondb.ccg.unam.mx/ is aimed at integrating the genetic regulatory network of E. coli K12 as an entirely bioinformatic project up till now. In this work, we extended its aims by generating experimental data at a genome scale on TSSs, promoters and regulatory regions. We implemented a modified 5' RACE protocol and an unbiased High Throughput Pyrosequencing Strategy (HTPS that allowed us to map more than 1700 TSSs with high precision. From this collection, about 230 corresponded to previously reported TSSs, which helped us to benchmark both our methodologies and the accuracy of the previous mapping experiments. The other ca 1500 TSSs mapped belong to about 1000 different genes, many of them with no assigned function. We identified promoter sequences and type of sigma factors that control the expression of about 80% of these genes. As expected, the housekeeping sigma(70 was the most common type of promoter, followed by sigma(38. The majority of the putative TSSs were located between 20 to 40 nucleotides from the translational start site. Putative regulatory binding sites for transcription factors were detected upstream of many TSSs. For a few transcripts, riboswitches and small RNAs were found. Several genes also had additional TSSs within the coding region. Unexpectedly, the HTPS experiments revealed extensive antisense transcription, probably for regulatory functions. The new information in RegulonDB, now with more than 2400 experimentally determined TSSs, strengthens the accuracy of promoter prediction, operon structure, and regulatory networks and provides valuable new information that will facilitate the understanding from a global perspective the complex and

  3. Genome-wide profiling of H3K56 acetylation and transcription factor binding sites in human adipocytes.

    Directory of Open Access Journals (Sweden)

    Kinyui Alice Lo

    Full Text Available The growing epidemic of obesity and metabolic diseases calls for a better understanding of adipocyte biology. The regulation of transcription in adipocytes is particularly important, as it is a target for several therapeutic approaches. Transcriptional outcomes are influenced by both histone modifications and transcription factor binding. Although the epigenetic states and binding sites of several important transcription factors have been profiled in the mouse 3T3-L1 cell line, such data are lacking in human adipocytes. In this study, we identified H3K56 acetylation sites in human adipocytes derived from mesenchymal stem cells. H3K56 is acetylated by CBP and p300, and deacetylated by SIRT1, all are proteins with important roles in diabetes and insulin signaling. We found that while almost half of the genome shows signs of H3K56 acetylation, the highest level of H3K56 acetylation is associated with transcription factors and proteins in the adipokine signaling and Type II Diabetes pathways. In order to discover the transcription factors that recruit acetyltransferases and deacetylases to sites of H3K56 acetylation, we analyzed DNA sequences near H3K56 acetylated regions and found that the E2F recognition sequence was enriched. Using chromatin immunoprecipitation followed by high-throughput sequencing, we confirmed that genes bound by E2F4, as well as those by HSF-1 and C/EBPα, have higher than expected levels of H3K56 acetylation, and that the transcription factor binding sites and acetylation sites are often adjacent but rarely overlap. We also discovered a significant difference between bound targets of C/EBPα in 3T3-L1 and human adipocytes, highlighting the need to construct species-specific epigenetic and transcription factor binding site maps. This is the first genome-wide profile of H3K56 acetylation, E2F4, C/EBPα and HSF-1 binding in human adipocytes, and will serve as an important resource for better understanding adipocyte

  4. Genome-wide analysis of the human p53 transcriptional network unveils a lncRNA tumour suppressor signature.

    Science.gov (United States)

    Sánchez, Yolanda; Segura, Victor; Marín-Béjar, Oskar; Athie, Alejandro; Marchese, Francesco P; González, Jovanna; Bujanda, Luis; Guo, Shuling; Matheu, Ander; Huarte, Maite

    2014-12-19

    Despite the inarguable relevance of p53 in cancer, genome-wide studies relating endogenous p53 activity to the expression of lncRNAs in human cells are still missing. Here, by integrating RNA-seq with p53 ChIP-seq analyses of a human cancer cell line under DNA damage, we define a high-confidence set of 18 lncRNAs that are p53 transcriptional targets. We demonstrate that two of the p53-regulated lncRNAs are required for the efficient binding of p53 to some of its target genes, modulating the p53 transcriptional network and contributing to apoptosis induction by DNA damage. We also show that the expression of p53-lncRNAs is lowered in colorectal cancer samples, constituting a tumour suppressor signature with high diagnostic power. Thus, p53-regulated lncRNAs establish a positive regulatory feedback loop that enhances p53 tumour suppressor activity. Furthermore, the signature defined by p53-regulated lncRNAs supports their potential use in the clinic as biomarkers and therapeutic targets.

  5. An Empirical Bayes Mixture Model for Effect Size Distributions in Genome-Wide Association Studies

    DEFF Research Database (Denmark)

    Thompson, Wesley K.; Wang, Yunpeng; Schork, Andrew J.

    2015-01-01

    Characterizing the distribution of effects from genome-wide genotyping data is crucial for understanding important aspects of the genetic architecture of complex traits, such as number or proportion of non-null loci, average proportion of phenotypic variance explained per non-null effect, power...... for discovery, and polygenic risk prediction. To this end, previous work has used effect-size models based on various distributions, including the normal and normal mixture distributions, among others. In this paper we propose a scale mixture of two normals model for effect size distributions of genome...... of variance explained by genotyped SNPs, CD and SZ have a broadly dissimilar genetic architecture, due to differing mean effect size and proportion of non-null loci....

  6. Genome-wide identification and expression analysis of the ClTCP transcription factors in Citrullus lanatus.

    Science.gov (United States)

    Shi, Pibiao; Guy, Kateta Malangisha; Wu, Weifang; Fang, Bingsheng; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan

    2016-04-12

    The plant-specific TCP transcription factor family, which is involved in the regulation of cell growth and proliferation, performs diverse functions in multiple aspects of plant growth and development. However, no comprehensive analysis of the TCP family in watermelon (Citrullus lanatus) has been undertaken previously. A total of 27 watermelon TCP encoding genes distributed on nine chromosomes were identified. Phylogenetic analysis clustered the genes into 11 distinct subgroups. Furthermore, phylogenetic and structural analyses distinguished two homology classes within the ClTCP family, designated Class I and Class II. The Class II genes were differentiated into two subclasses, the CIN subclass and the CYC/TB1 subclass. The expression patterns of all members were determined by semi-quantitative PCR. The functions of two ClTCP genes, ClTCP14a and ClTCP15, in regulating plant height were confirmed by ectopic expression in Arabidopsis wild-type and ortholog mutants. This study represents the first genome-wide analysis of the watermelon TCP gene family, which provides valuable information for understanding the classification and functions of the TCP genes in watermelon.

  7. Genome-wide Identification of TCP Family Transcription Factors from Populus euphratica and Their Involvement in Leaf Shape Regulation.

    Science.gov (United States)

    Ma, Xiaodong; Ma, Jianchao; Fan, Di; Li, Chaofeng; Jiang, Yuanzhong; Luo, Keming

    2016-09-08

    Higher plants have been shown to experience a juvenile vegetative phase, an adult vegetative phase, and a reproductive phase during its postembryonic development and distinct lateral organ morphologies have been observed at the different development stages. Populus euphratica, commonly known as a desert poplar, has developed heteromorphic leaves during its development. The TCP family genes encode a group of plant-specific transcription factors involved in several aspects of plant development. In particular, TCPs have been shown to influence leaf size and shape in many herbaceous plants. However, whether these functions are conserved in woody plants remains unknown. In the present study, we carried out genome-wide identification of TCP genes in P. euphratica and P. trichocarpa, and 33 and 36 genes encoding putative TCP proteins were found, respectively. Phylogenetic analysis of the poplar TCPs together with Arabidopsis TCPs indicated a biased expansion of the TCP gene family via segmental duplications. In addition, our results have also shown a correlation between different expression patterns of several P. euphratica TCP genes and leaf shape variations, indicating their involvement in the regulation of leaf shape development.

  8. Genome-Wide Transcriptional and Physiological Responses of Bradyrhizobium japonicum to Paraquat-Mediated Oxidative Stress▿†

    Science.gov (United States)

    Donati, Andrew J.; Jeon, Jeong-Min; Sangurdekar, Dipen; So, Jae-Seong; Chang, Woo-Suk

    2011-01-01

    The rhizobial bacterium Bradyrhizobium japonicum functions as a nitrogen-fixing symbiont of the soybean plant (Glycine max). Plants are capable of producing an oxidative burst, a rapid proliferation of reactive oxygen species (ROS), as a defense mechanism against pathogenic and symbiotic bacteria. Therefore, B. japonicum must be able to resist such a defense mechanism to initiate nodulation. In this study, paraquat, a known superoxide radical-inducing agent, was used to investigate this response. Genome-wide transcriptional profiles were created for both prolonged exposure (PE) and fulminant shock (FS) conditions. These profiles revealed that 190 and 86 genes were up- and downregulated for the former condition, and that 299 and 105 genes were up- and downregulated for the latter condition, respectively (>2.0-fold; P ROS scavenging enzymes, such as superoxide dismutase and catalase, were not detected, suggesting constitutive expression of those genes by endogenous ROS. Various physiological tests, including exopolysaccharide (EPS), cellular protein, and motility characterization, were performed to corroborate the gene expression data. The results suggest that B. japonicum responds to tolerable oxidative stress during PE through enhanced motility, increased translational activity, and EPS production, in addition to the expression of genes involved in global stress responses, such as chaperones and sigma factors. PMID:21498770

  9. Genome-wide identification and characterization of Notch transcription complex-binding sequence-paired sites in leukemia cells.

    Science.gov (United States)

    Severson, Eric; Arnett, Kelly L; Wang, Hongfang; Zang, Chongzhi; Taing, Len; Liu, Hudan; Pear, Warren S; Shirley Liu, X; Blacklow, Stephen C; Aster, Jon C

    2017-05-02

    Notch transcription complexes (NTCs) drive target gene expression by binding to two distinct types of genomic response elements, NTC monomer-binding sites and sequence-paired sites (SPSs) that bind NTC dimers. SPSs are conserved and have been linked to the Notch responsiveness of a few genes. To assess the overall contribution of SPSs to Notch-dependent gene regulation, we determined the DNA sequence requirements for NTC dimerization using a fluorescence resonance energy transfer (FRET) assay and applied insights from these in vitro studies to Notch-"addicted" T cell acute lymphoblastic leukemia (T-ALL) cells. We found that SPSs contributed to the regulation of about a third of direct Notch target genes. Although originally described in promoters, SPSs are present mainly in long-range enhancers, including an enhancer containing a newly described SPS that regulates HES5 expression. Our work provides a general method for identifying SPSs in genome-wide data sets and highlights the widespread role of NTC dimerization in Notch-transformed leukemia cells. Copyright © 2017, American Association for the Advancement of Science.

  10. Genome-wide location analysis reveals distinct transcriptional circuitry by paralogous regulators Foxa1 and Foxa2.

    Science.gov (United States)

    Bochkis, Irina M; Schug, Jonathan; Ye, Diana Z; Kurinna, Svitlana; Stratton, Sabrina A; Barton, Michelle C; Kaestner, Klaus H

    2012-01-01

    Gene duplication is a powerful driver of evolution. Newly duplicated genes acquire new roles that are relevant to fitness, or they will be lost over time. A potential path to functional relevance is mutation of the coding sequence leading to the acquisition of novel biochemical properties, as analyzed here for the highly homologous paralogs Foxa1 and Foxa2 transcriptional regulators. We determine by genome-wide location analysis (ChIP-Seq) that, although Foxa1 and Foxa2 share a large fraction of binding sites in the liver, each protein also occupies distinct regulatory elements in vivo. Foxa1-only sites are enriched for p53 binding sites and are frequently found near genes important to cell cycle regulation, while Foxa2-restricted sites show only a limited match to the forkhead consensus and are found in genes involved in steroid and lipid metabolism. Thus, Foxa1 and Foxa2, while redundant during development, have evolved divergent roles in the adult liver, ensuring the maintenance of both genes during evolution.

  11. Genome-wide location analysis reveals distinct transcriptional circuitry by paralogous regulators Foxa1 and Foxa2.

    Directory of Open Access Journals (Sweden)

    Irina M Bochkis

    Full Text Available Gene duplication is a powerful driver of evolution. Newly duplicated genes acquire new roles that are relevant to fitness, or they will be lost over time. A potential path to functional relevance is mutation of the coding sequence leading to the acquisition of novel biochemical properties, as analyzed here for the highly homologous paralogs Foxa1 and Foxa2 transcriptional regulators. We determine by genome-wide location analysis (ChIP-Seq that, although Foxa1 and Foxa2 share a large fraction of binding sites in the liver, each protein also occupies distinct regulatory elements in vivo. Foxa1-only sites are enriched for p53 binding sites and are frequently found near genes important to cell cycle regulation, while Foxa2-restricted sites show only a limited match to the forkhead consensus and are found in genes involved in steroid and lipid metabolism. Thus, Foxa1 and Foxa2, while redundant during development, have evolved divergent roles in the adult liver, ensuring the maintenance of both genes during evolution.

  12. Identification of novel candidate genes involved in mineralization of dental enamel by genome-wide transcript profiling.

    Science.gov (United States)

    Lacruz, Rodrigo S; Smith, Charles E; Bringas, Pablo; Chen, Yi-Bu; Smith, Susan M; Snead, Malcolm L; Kurtz, Ira; Hacia, Joseph G; Hubbard, Michael J; Paine, Michael L

    2012-05-01

    The gene repertoire regulating vertebrate biomineralization is poorly understood. Dental enamel, the most highly mineralized tissue in mammals, differs from other calcifying systems in that the formative cells (ameloblasts) lack remodeling activity and largely degrade and resorb the initial extracellular matrix. Enamel mineralization requires that ameloblasts undergo a profound functional switch from matrix-secreting to maturational (calcium transport, protein resorption) roles as mineralization progresses. During the maturation stage, extracellular pH decreases markedly, placing high demands on ameloblasts to regulate acidic environments present around the growing hydroxyapatite crystals. To identify the genetic events driving enamel mineralization, we conducted genome-wide transcript profiling of the developing enamel organ from rat incisors and highlight over 300 genes differentially expressed during maturation. Using multiple bioinformatics analyses, we identified groups of maturation-associated genes whose functions are linked to key mineralization processes including pH regulation, calcium handling, and matrix turnover. Subsequent qPCR and Western blot analyses revealed that a number of solute carrier (SLC) gene family members were up-regulated during maturation, including the novel protein Slc24a4 involved in calcium handling as well as other proteins of similar function (Stim1). By providing the first global overview of the cellular machinery required for enamel maturation, this study provide a strong foundation for improving basic understanding of biomineralization and its practical applications in healthcare. Copyright © 2011 Wiley Periodicals, Inc.

  13. Heat shock transcription factors in banana: genome-wide characterization and expression profile analysis during development and stress response

    Science.gov (United States)

    Wei, Yunxie; Hu, Wei; Xia, Feiyu; Zeng, Hongqiu; Li, Xiaolin; Yan, Yu; He, Chaozu; Shi, Haitao

    2016-01-01

    Banana (Musa acuminata) is one of the most popular fresh fruits. However, the rapid spread of fungal pathogen Fusarium oxysporum f. sp. cubense (Foc) in tropical areas severely affected banana growth and production. Thus, it is very important to identify candidate genes involved in banana response to abiotic stress and pathogen infection, as well as the molecular mechanism and possible utilization for genetic breeding. Heat stress transcription factors (Hsfs) are widely known for their common involvement in various abiotic stresses and plant-pathogen interaction. However, no MaHsf has been identified in banana, as well as its possible role. In this study, genome-wide identification and further analyses of evolution, gene structure and conserved motifs showed closer relationship of them in every subgroup. The comprehensive expression profiles of MaHsfs revealed the tissue- and developmental stage-specific or dependent, as well as abiotic and biotic stress-responsive expressions of them. The common regulation of several MaHsfs by abiotic and biotic stress indicated the possible roles of them in plant stress responses. Taken together, this study extended our understanding of MaHsf gene family and identified some candidate MaHsfs with specific expression profiles, which may be used as potential candidates for genetic breeding in banana. PMID:27857174

  14. The genome-wide binding profile of the Sulfolobus solfataricus transcription factor Ss-LrpB shows binding events beyond direct transcription regulation.

    Science.gov (United States)

    Nguyen-Duc, Trong; van Oeffelen, Liesbeth; Song, Ningning; Hassanzadeh-Ghassabeh, Gholamreza; Muyldermans, Serge; Charlier, Daniel; Peeters, Eveline

    2013-11-25

    Gene regulatory processes are largely resulting from binding of transcription factors to specific genomic targets. Leucine-responsive Regulatory Protein (Lrp) is a prevalent transcription factor family in prokaryotes, however, little information is available on biological functions of these proteins in archaea. Here, we study genome-wide binding of the Lrp-like transcription factor Ss-LrpB from Sulfolobus solfataricus. Chromatin immunoprecipitation in combination with DNA microarray analysis (ChIP-chip) has revealed that Ss-LrpB interacts with 36 additional loci besides the four previously identified local targets. Only a subset of the newly identified binding targets, concentrated in a highly variable IS-dense genomic region, is also bound in vitro by pure Ss-LrpB. There is no clear relationship between the in vitro measured DNA-binding specificity of Ss-LrpB and the in vivo association suggesting a limited permissivity of the crenarchaeal chromatin for transcription factor binding. Of 37 identified binding regions, 29 are co-bound by LysM, another Lrp-like transcription factor in S. solfataricus. Comparative gene expression analysis in an Ss-lrpB mutant strain shows no significant Ss-LrpB-mediated regulation for most targeted genes, with exception of the CRISPR B cluster, which is activated by Ss-LrpB through binding to a specific motif in the leader region. The genome-wide binding profile presented here implies that Ss-LrpB is associated at additional genomic binding sites besides the local gene targets, but acts as a specific transcription regulator in the tested growth conditions. Moreover, we have provided evidence that two Lrp-like transcription factors in S. solfataricus, Ss-LrpB and LysM, interact in vivo.

  15. Decoding genome-wide GadEWX-transcriptional regulatory networks reveals multifaceted cellular responses to acid stress in Escherichia coli

    DEFF Research Database (Denmark)

    Seo, Sang Woo; Kim, Donghyuk; O'Brien, Edward J.;

    2015-01-01

    The regulators GadE, GadW and GadX (which we refer to as GadEWX) play a critical role in the transcriptional regulation of the glutamate-dependent acid resistance (GDAR) system in Escherichia coli K-12 MG1655. However, the genome-wide regulatory role of GadEWX is still unknown. Here we comprehens...

  16. Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.).

    Science.gov (United States)

    Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh

    2013-08-01

    NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.

  17. SiRNA sequence model: redesign algorithm based on available genome-wide libraries.

    Science.gov (United States)

    Kozak, Karol

    2013-12-01

    The evolution of RNA interference (RNAi) and the development of technologies exploiting its biology have enabled scientists to rapidly examine the consequences of depleting a particular gene product in cells. Design tools have been developed based on experimental data to increase the knockdown efficiency of siRNAs. Not all siRNAs that are developed to a given target mRNA are equally effective. Currently available design algorithms take an accession, identify conserved regions among their transcript space, find accessible regions within the mRNA, design all possible siRNAs for these regions, filter them based on multi-scores thresholds, and then perform off-target filtration. These different criteria are used by commercial suppliers to produce siRNA genome-wide libraries for different organisms. In this article, we analyze existing siRNA design algorithms and evaluate weight of design parameters for libraries produced in the last decade. We proved that not all essential parameters are currently applied by siRNA vendors. Based on our evaluation results, we were able to suggest an siRNA sequence pattern. The findings in our study can be useful for commercial vendors improving the design of RNAi constructs, by addressing both the issue of potency and the issue of specificity.

  18. Software engineering the mixed model for genome-wide association studies on large samples.

    Science.gov (United States)

    Zhang, Zhiwu; Buckler, Edward S; Casstevens, Terry M; Bradbury, Peter J

    2009-11-01

    Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample size and number of markers used for GWAS is increasing dramatically, resulting in greater statistical power to detect those associations. The use of mixed models with increasingly large data sets depends on the availability of software for analyzing those models. While multiple software packages implement the mixed model method, no single package provides the best combination of fast computation, ability to handle large samples, flexible modeling and ease of use. Key elements of association analysis with mixed models are reviewed, including modeling phenotype-genotype associations using mixed models, population stratification, kinship and its estimation, variance component estimation, use of best linear unbiased predictors or residuals in place of raw phenotype, improving efficiency and software-user interaction. The available software packages are evaluated, and suggestions made for future software development.

  19. MYB Transcription Factors in Chinese Pear (Pyrus bretschneideri Rehd.: Genome-Wide Identification, Classification and Expression Profiling during Fruit Development

    Directory of Open Access Journals (Sweden)

    Yun Peng eCao

    2016-04-01

    Full Text Available The MYB family is one of the largest families of transcription factors in plants. Although some MYBs have been reported to play roles in secondary metabolism, no comprehensive study of the MYB family in Chinese pear (Pyrus bretschneideri Rehd. has been reported. In the present study, we performed genome-wide analysis of MYB genes in Chinese pear, designated as PbMYBs, including analyses of their phylogenic relationships, structures, chromosomal locations, promoter regions, GO annotations and collinearity. A total of 129 PbMYB genes were identified in the pear genome and were divided into 31 subgroups based on phylogenetic analysis. These PbMYBs were unevenly distributed among 16 chromosomes (total of 17 chromosomes. The occurrence of gene duplication events indicated that whole-genome duplication and segmental duplication likely played key roles in expansion of the PbMYB gene family. Ka/Ks analysis suggested that the duplicated PbMYBs mainly experienced purifying selection with restrictive functional divergence after the duplication events. Interspecies microsynteny analysis revealed maximum orthology between pear and peach, followed by plum and strawberry. Subsequently, the expression patterns of 20 PbMYB genes that may be involved in lignin biosynthesis according to their phylogenetic relationships were examined throughout fruit development. Among the twenty genes examined, PbMYB25 and PbMYB52 exhibited expression patterns consistent with the typical variations in the lignin content previously reported. Moreover, sub-cellular localization analysis revealed that two proteins PbMYB25 and PbMYB52 were localized to the nucleus. All together, PbMYB25 and PbMYB52 were inferred to be candidate genes involved in the regulation of lignin biosynthesis during the development of pear fruit. This study provides useful information for further functional analysis of the MYB gene family in pear.

  20. Genome-Wide Identification and Characterization of BrrTCP Transcription Factors in Brassica rapa ssp. rapa

    Directory of Open Access Journals (Sweden)

    Jiancan Du

    2017-09-01

    Full Text Available The teosinte branched1/cycloidea/proliferating cell factor (TCP gene family is a plant-specific transcription factor that participates in the control of plant development by regulating cell proliferation. However, no report is currently available about this gene family in turnips (Brassica rapa ssp. rapa. In this study, a genome-wide analysis of TCP genes was performed in turnips. Thirty-nine TCP genes in turnip genome were identified and distributed on 10 chromosomes. Phylogenetic analysis clearly showed that the family was classified as two clades: class I and class II. Gene structure and conserved motif analysis showed that the same clade genes have similar gene structures and conserved motifs. The expression profiles of 39 TCP genes were determined through quantitative real-time PCR. Most CIN-type BrrTCP genes were highly expressed in leaf. The members of CYC/TB1 subclade are highly expressed in flower bud and weakly expressed in root. By contrast, class I clade showed more widespread but less tissue-specific expression patterns. Yeast two-hybrid data show that BrrTCP proteins preferentially formed heterodimers. The function of BrrTCP2 was confirmed through ectopic expression of BrrTCP2 in wild-type and loss-of-function ortholog mutant of Arabidopsis. Overexpression of BrrTCP2 in wild-type Arabidopsis resulted in the diminished leaf size. Overexpression of BrrTCP2 in triple mutants of tcp2/4/10 restored the leaf phenotype of tcp2/4/10 to the phenotype of wild type. The comprehensive analysis of turnip TCP gene family provided the foundation to further study the roles of TCP genes in turnips.

  1. Improved genome-wide mapping of uncapped and cleaved transcripts in eukaryotes--GMUCT 2.0.

    Science.gov (United States)

    Willmann, Matthew R; Berkowitz, Nathan D; Gregory, Brian D

    2014-05-01

    The advent of high-throughput sequencing has led to an explosion of studies into the diversity, expression, processing, and lifespan of RNAs. Recently, three different high-throughput sequencing-based methods have been developed to specifically study RNAs that are in the process of being degraded. All three methods-genome-wide mapping of uncapped and cleaved transcripts (GMUCT), parallel analysis of RNA ends (PARE), and degradome sequencing-take advantage of the fact that Illumina sequencing libraries use T4 RNA ligase 1 to ligate an adapter to the 5' end of RNAs that have a free 5'-monophosphate. This condition for T4 RNA ligase 1 substrates means that mature mRNAs are not substrates of the enzyme because they have a 5'-cap moiety. As a result, these sequencing libraries are specifically made up of clones of decapped or degrading mRNAs resulting from 5'-to-3' or nonsense-mediated decay (NMD) and the 3' fragment of cleaved microRNA (miRNA) and small interfering RNA (siRNA) target RNAs. Here, we present a massively streamlined protocol for GMUCT that takes 2-3days, can be initiated with as little as 5μg of starting total RNA, and involves only one gel size-selection step. We show that the resulting datasets are similar to those produced using the previous GMUCT and PARE protocols. In total, our results suggest that this method will be the preferable approach for future studies of RNA degradation intermediates and small RNA-mediated cleavage in eukaryotic transcriptomes.

  2. Genome-wide RNA polymerase II profiles and RNA accumulation reveal kinetics of transcription and associated epigenetic changes during diurnal cycles.

    Directory of Open Access Journals (Sweden)

    Gwendal Le Martelot

    Full Text Available Interactions of cell-autonomous circadian oscillators with diurnal cycles govern the temporal compartmentalization of cell physiology in mammals. To understand the transcriptional and epigenetic basis of diurnal rhythms in mouse liver genome-wide, we generated temporal DNA occupancy profiles by RNA polymerase II (Pol II as well as profiles of the histone modifications H3K4me3 and H3K36me3. We used these data to quantify the relationships of phases and amplitudes between different marks. We found that rhythmic Pol II recruitment at promoters rather than rhythmic transition from paused to productive elongation underlies diurnal gene transcription, a conclusion further supported by modeling. Moreover, Pol II occupancy preceded mRNA accumulation by 3 hours, consistent with mRNA half-lives. Both methylation marks showed that the epigenetic landscape is highly dynamic and globally remodeled during the 24-hour cycle. While promoters of transcribed genes had tri-methylated H3K4 even at their trough activity times, tri-methylation levels reached their peak, on average, 1 hour after Pol II. Meanwhile, rhythms in tri-methylation of H3K36 lagged transcription by 3 hours. Finally, modeling profiles of Pol II occupancy and mRNA accumulation identified three classes of genes: one showing rhythmicity both in transcriptional and mRNA accumulation, a second class with rhythmic transcription but flat mRNA levels, and a third with constant transcription but rhythmic mRNAs. The latter class emphasizes widespread temporally gated posttranscriptional regulation in the mouse liver.

  3. Genome-wide identification of transcriptional targets of RORA reveals direct regulation of multiple genes associated with autism spectrum disorder.

    Science.gov (United States)

    Sarachana, Tewarit; Hu, Valerie W

    2013-05-22

    We have recently identified the nuclear hormone receptor RORA (retinoic acid-related orphan receptor-alpha) as a novel candidate gene for autism spectrum disorder (ASD). Our independent cohort studies have consistently demonstrated the reduction of RORA transcript and/or protein levels in blood-derived lymphoblasts as well as in the postmortem prefrontal cortex and cerebellum of individuals with ASD. Moreover, we have also shown that RORA has the potential to be under negative and positive regulation by androgen and estrogen, respectively, suggesting the possibility that RORA may contribute to the male bias of ASD. However, little is known about transcriptional targets of this nuclear receptor, particularly in humans. Here we identify transcriptional targets of RORA in human neuronal cells on a genome-wide level using chromatin immunoprecipitation (ChIP) with an anti-RORA antibody followed by whole-genome promoter array (chip) analysis. Selected potential targets of RORA were then validated by an independent ChIP followed by quantitative PCR analysis. To further demonstrate that reduced RORA expression results in reduced transcription of RORA targets, we determined the expression levels of the selected transcriptional targets in RORA-deficient human neuronal cells, as well as in postmortem brain tissues from individuals with ASD who exhibit reduced RORA expression. The ChIP-on-chip analysis reveals that RORA1, a major isoform of RORA protein in human brain, can be recruited to as many as 2,764 genomic locations corresponding to promoter regions of 2,544 genes across the human genome. Gene ontology analysis of this dataset of genes that are potentially directly regulated by RORA1 reveals statistically significant enrichment in biological functions negatively impacted in individuals with ASD, including neuronal differentiation, adhesion and survival, synaptogenesis, synaptic transmission and plasticity, and axonogenesis, as well as higher level functions such as

  4. Modeling the cumulative genetic risk for multiple sclerosis from genome-wide association data.

    Science.gov (United States)

    Wang, Joanne H; Pappas, Derek; De Jager, Philip L; Pelletier, Daniel; de Bakker, Paul Iw; Kappos, Ludwig; Polman, Chris H; Chibnik, Lori B; Hafler, David A; Matthews, Paul M; Hauser, Stephen L; Baranzini, Sergio E; Oksenberg, Jorge R

    2011-01-18

    Multiple sclerosis (MS) is the most common cause of chronic neurologic disability beginning in early to middle adult life. Results from recent genome-wide association studies (GWAS) have substantially lengthened the list of disease loci and provide convincing evidence supporting a multifactorial and polygenic model of inheritance. Nevertheless, the knowledge of MS genetics remains incomplete, with many risk alleles still to be revealed. We used a discovery GWAS dataset (8,844 samples, 2,124 cases and 6,720 controls) and a multi-step logistic regression protocol to identify novel genetic associations. The emerging genetic profile included 350 independent markers and was used to calculate and estimate the cumulative genetic risk in an independent validation dataset (3,606 samples). Analysis of covariance (ANCOVA) was implemented to compare clinical characteristics of individuals with various degrees of genetic risk. Gene ontology and pathway enrichment analysis was done using the DAVID functional annotation tool, the GO Tree Machine, and the Pathway-Express profiling tool. In the discovery dataset, the median cumulative genetic risk (P-Hat) was 0.903 and 0.007 in the case and control groups, respectively, together with 79.9% classification sensitivity and 95.8% specificity. The identified profile shows a significant enrichment of genes involved in the immune response, cell adhesion, cell communication/signaling, nervous system development, and neuronal signaling, including ionotropic glutamate receptors, which have been implicated in the pathological mechanism driving neurodegeneration. In the validation dataset, the median cumulative genetic risk was 0.59 and 0.32 in the case and control groups, respectively, with classification sensitivity 62.3% and specificity 75.9%. No differences in disease progression or T2-lesion volumes were observed among four levels of predicted genetic risk groups (high, medium, low, misclassified). On the other hand, a significant

  5. Genome-wide transcriptional profiling reveals microRNA-correlated genes and biological processes in human lymphoblastoid cell lines.

    Directory of Open Access Journals (Sweden)

    Liang Wang

    Full Text Available BACKGROUND: Expression level of many genes shows abundant natural variation in human populations. The variations in gene expression are believed to contribute to phenotypic differences. Emerging evidence has shown that microRNAs (miRNAs are one of the key regulators of gene expression. However, past studies have focused on the miRNA target genes and used loss- or gain-of-function approach that may not reflect natural association between miRNA and mRNAs. METHODOLOGY/PRINCIPAL FINDINGS: To examine miRNA regulatory effect on global gene expression under endogenous condition, we performed pair-wise correlation coefficient analysis on expression levels of 366 miRNAs and 14,174 messenger RNAs (mRNAs in 90 immortalized lymphoblastoid cell lines, and observed significant correlations between the two species of RNA transcripts. We identified a total of 7,207 significantly correlated miRNA-mRNA pairs (false discovery rate q<0.01. Of those, 4,085 pairs showed positive correlations while 3,122 pairs showed negative correlations. Gene ontology analyses on the miRNA-correlated genes revealed significant enrichments in several biological processes related to cell cycle, cell communication and signal transduction. Individually, each of three miRNAs (miR-331, -98 and -33b demonstrated significant correlation with the genes in cell cycle-related biological processes, which is consistent with important role of miRNAs in cell cycle regulation. CONCLUSIONS/SIGNIFICANCE: This study demonstrates feasibility of using naturally expressed transcript profiles to identify endogenous correlation between miRNA and miRNA. By applying this genome-wide approach, we have identified thousands of miRNA-correlated genes and revealed potential role of miRNAs in several important cellular functions. The study results along with accompanying data sets will provide a wealth of high-throughput data to further evaluate the miRNA-regulated genes and eventually in phenotypic variations of

  6. Incorporating biological pathways via a Markov random field model in genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Min Chen

    2011-04-01

    Full Text Available Genome-wide association studies (GWAS examine a large number of markers across the genome to identify associations between genetic variants and disease. Most published studies examine only single markers, which may be less informative than considering multiple markers and multiple genes jointly because genes may interact with each other to affect disease risk. Much knowledge has been accumulated in the literature on biological pathways and interactions. It is conceivable that appropriate incorporation of such prior knowledge may improve the likelihood of making genuine discoveries. Although a number of methods have been developed recently to prioritize genes using prior biological knowledge, such as pathways, most methods treat genes in a specific pathway as an exchangeable set without considering the topological structure of a pathway. However, how genes are related with each other in a pathway may be very informative to identify association signals. To make use of the connectivity information among genes in a pathway in GWAS analysis, we propose a Markov Random Field (MRF model to incorporate pathway topology for association analysis. We show that the conditional distribution of our MRF model takes on a simple logistic regression form, and we propose an iterated conditional modes algorithm as well as a decision theoretic approach for statistical inference of each gene's association with disease. Simulation studies show that our proposed framework is more effective to identify genes associated with disease than a single gene-based method. We also illustrate the usefulness of our approach through its applications to a real data example.

  7. Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

    Science.gov (United States)

    Lewis, Cecil M

    2010-02-01

    This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.

  8. Genome-wide transcriptional analysis of Drosophila larvae infected by entomopathogenic nematodes shows involvement of complement, recognition and extracellular matrix proteins.

    Science.gov (United States)

    Arefin, Badrul; Kucerova, Lucie; Dobes, Pavel; Markus, Robert; Strnad, Hynek; Wang, Zhi; Hyrsl, Pavel; Zurovec, Michal; Theopold, Ulrich

    2014-01-01

    Heterorhabditis bacteriophora is an entomopathogenic nematode (EPN) which infects its host by accessing the hemolymph where it releases endosymbiotic bacteria of the species Photorhabdus luminescens. We performed a genome-wide transcriptional analysis of the Drosophila response to EPN infection at the time point at which the nematodes reached the hemolymph either via the cuticle or the gut and the bacteria had started to multiply. Many of the most strongly induced genes have been implicated in immune responses in other infection models. Mapping of the complete set of differentially regulated genes showed the hallmarks of a wound response, but also identified a large fraction of EPN-specific transcripts. Several genes identified by transcriptome profiling or their homologues play protective roles during nematode infections. Genes that positively contribute to controlling nematobacterial infections encode: a homolog of thioester-containing complement protein 3, a basement membrane component (glutactin), a recognition protein (GNBP-like 3) and possibly several small peptides. Of note is that several of these genes have not previously been implicated in immune responses.

  9. Meta-analysis of genome-wide association from genomic prediction models

    Science.gov (United States)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  10. Precise Identification of Genome-Wide Transcription Start Sites in Bacteria by 5'-Rapid Amplification of cDNA Ends (5'-RACE).

    Science.gov (United States)

    Matteau, Dominick; Rodrigue, Sébastien

    2015-01-01

    Transcription start sites are commonly used to locate promoter elements in bacterial genomes. TSS were previously studied one gene at a time, often through 5'-rapid amplification of cDNA ends (5'-RACE). This technique has now been adapted for high-throughput sequencing and can be used to precisely identify TSS in a genome-wide fashion for practically any bacterium, which greatly contributes to our understanding of gene regulatory networks in microorganisms.

  11. First WNK4-hypokalemia animal model identified by genome-wide association in Burmese cats.

    Directory of Open Access Journals (Sweden)

    Barbara Gandolfi

    Full Text Available Burmese is an old and popular cat breed, however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K(+], most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome wide case-control study using the illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T, leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K⁺ regulation and the cat represents the first animal model for WNK4-associated hypokalemia.

  12. First WNK4-hypokalemia animal model identified by genome-wide association in Burmese cats.

    Science.gov (United States)

    Gandolfi, Barbara; Gruffydd-Jones, Timothy J; Malik, Richard; Cortes, Alejandro; Jones, Boyd R; Helps, Chris R; Prinzenberg, Eva M; Erhardt, George; Lyons, Leslie A

    2012-01-01

    Burmese is an old and popular cat breed, however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K(+)]), most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome wide case-control study using the illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T), leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K⁺ regulation and the cat represents the first animal model for WNK4-associated hypokalemia.

  13. Genome-wide analysis of auxin response factor gene family members in medicinal model plant Salvia miltiorrhiza

    Directory of Open Access Journals (Sweden)

    Zhichao Xu

    2016-06-01

    Full Text Available Auxin response factors (ARFs can function as transcriptional activators or repressors to regulate the expression of auxin response genes by specifically binding to auxin response elements (AuxREs during plant development. Based on a genome-wide strategy using the medicinal model plant Salvia miltiorrhiza, 25 S. miltiorrhiza ARF (SmARF gene family members in four classes (class Ia, IIa, IIb and III were comprehensively analyzed to identify characteristics including gene structures, conserved domains, phylogenetic relationships and expression patterns. In a hybrid analysis of the phylogenetic tree, microRNA targets, and expression patterns of SmARFs in different organs, root tissues, and methyl jasmonate or indole-3-acetic acid treatment conditions, we screened for candidate SmARFs involved in various developmental processes of S. miltiorrhiza. Based on this analysis, we predicted that SmARF25, SmARF7, SmARF16 and SmARF20 are involved in flower, leaf, stem and root development, respectively. With the further insight into the targets of miR160 and miR167, specific SmARF genes in S. miltiorrhiza might encode products that participate in biological processes as described for ARF genes in Arabidopsis. Our results provide a foundation for understanding the molecular basis and regulatory mechanisms of SmARFs in S. miltiorrhiza.

  14. Growth on mannitol-rich media elicits a genome-wide transcriptional response in Burkholderia multivorans that impacts on multiple virulence traits in an exopolysaccharide-independent manner.

    Science.gov (United States)

    Denman, Carmen C; Robinson, Matthew T; Sass, Andrea M; Mahenthiralingam, Eshwar; Brown, Alan R

    2014-01-01

    In common with other members of the Burkholderia cepacia complex (BCC), Burkholderia multivorans is capable of producing exopolysaccharide (EPS) when grown on certain mannitol-rich media. The significance of the resulting mucoid phenotype and the genome-wide response to mannitol has never been characterized despite its clinical relevance following the approval of a dried-powder preparation of mannitol as an inhaled osmolyte therapy for cystic fibrosis (CF) patients. In the present study we defined the transcriptional response of B. multivorans ATCC 17616, a model genome-sequenced strain of environmental origin, to growth on mannitol-rich yeast extract media (MYEM). EPS-dependent and -independent impact of MYEM on virulence-associated traits was assessed in both strain ATCC 17616 and the CF isolate B. multivorans C1576. Our studies revealed a significant transcriptional response to MYEM encompassing approximately 23 % of predicted genes within the genome. Strikingly, this transcriptional response identified that EPS induction occurs in ATCC 17616 without the upregulation of the bce-I and bce-II EPS gene clusters, despite their pivotal role in EPS biosynthesis. Of approximately 20 differentially expressed putative virulence factors, 16 exhibited upregulation including flagella, ornibactin, oxidative stress proteins and phospholipases. MYEM-grown B. multivorans also exhibited enhanced motility, biofilm formation and epithelial cell invasion. In contrast to these potential virulence enhancements, MYEM-grown B. multivorans C1576 showed attenuated virulence in the Galleria mellonella infection model. All of the observed phenotypic responses occurred independently of EPS production, highlighting the profound impact that mannitol-based growth has on the physiology and virulence of B. multivorans.

  15. Diagnosis of ulcerative colitis before onset of inflammation by multivariate modeling of genome-wide gene expression data

    DEFF Research Database (Denmark)

    Olsen, Jørgen; Gerds, Thomas A; Seidelin, Jakob B

    2009-01-01

    biopsies from 78 patients were included. A diagnostic model was derived with the random forest method based on 71 biopsies from 60 patients. The model-internal out-of-bag performance measure yielded perfect classification. Furthermore, the model was validated in independent 18 noninflamed biopsies from 18...... of random forest modeling of genome-wide gene expression data for distinguishing quiescent and active UC colonic mucosa versus control and CD colonic mucosa.(Inflamm Bowel Dis 2009)....

  16. A genome-wide screen for spatially restricted expression patterns identifies transcription factors that regulate glial development

    NARCIS (Netherlands)

    Fu, H.; Cai, J.; Clevers, H.; Fast, E.; Gray, S.; Greenberg, R.; Jain, M.K.; Ma, Q.; Qiu, M.; Rowitch, D.H.; Taylor, C.; Stiles, C.D.

    2009-01-01

    Forward genetic screens in genetically accessible invertebrate organisms such as Drosophila melanogaster have shed light on transcription factors that specify formation of neurons in the vertebrate CNS. However, invertebrate models have, to date, been uninformative with respect to genes that specify

  17. A brassinosteroid transcriptional network revealed by genome-wide identification of BESI target genes in Arabidopsis thaliana.

    Science.gov (United States)

    Yu, Xiaofei; Li, Lei; Zola, Jaroslaw; Aluru, Maneesha; Ye, Huaxun; Foudree, Andrew; Guo, Hongqing; Anderson, Sarah; Aluru, Srinivas; Liu, Peng; Rodermel, Steve; Yin, Yanhai

    2011-02-01

    Brassinosteroids (BRs) are important regulators for plant growth and development. BRs signal to control the activities of the BES1 and BZR1 family transcription factors. The transcriptional network through which BES1 and BZR regulate large number of target genes is mostly unknown. By combining chromatin immunoprecipitation coupled with Arabidopsis tiling arrays (ChIP-chip) and gene expression studies, we have identified 1609 putative BES1 target genes, 404 of which are regulated by BRs and/or in gain-of-function bes1-D mutant. BES1 targets contribute to BR responses and interactions with other hormonal or light signaling pathways. Computational modeling of gene expression data using Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) reveals that BES1-targeted transcriptional factors form a gene regulatory network (GRN). Mutants of many genes in the network displayed defects in BR responses. Moreover, we found that BES1 functions to inhibit chloroplast development by repressing the expression of GLK1 and GLK2 transcription factors, confirming a hypothesis generated from the GRN. Our results thus provide a global view of BR regulated gene expression and a GRN that guides future studies in understanding BR-regulated plant growth.

  18. Heterosis in early maize ear inflorescence development: a genome-wide transcription analysis for two maize inbred lines and their hybrid.

    Science.gov (United States)

    Ding, Haiping; Qin, Cheng; Luo, Xirong; Li, Lujiang; Chen, Zhe; Liu, Hongjun; Gao, Jian; Lin, Haijian; Shen, Yaou; Zhao, Maojun; Lübberstedt, Thomas; Zhang, Zhiming; Pan, Guangtang

    2014-08-11

    Heterosis, or hybrid vigor, contributes to superior agronomic performance of hybrids compared to their inbred parents. Despite its importance, little is known about the genetic and molecular basis of heterosis. Early maize ear inflorescences formation affects grain yield, and are thus an excellent model for molecular mechanisms involved in heterosis. To determine the parental contributions and their regulation during maize ear-development-genesis, we analyzed genome-wide digital gene expression profiles in two maize elite inbred lines (B73 and Mo17) and their F1 hybrid using deep sequencing technology. Our analysis revealed 17,128 genes expressed in these three genotypes and 22,789 genes expressed collectively in the present study. Approximately 38% of the genes were differentially expressed in early maize ear inflorescences from heterotic cross, including many transcription factor genes and some presence/absence variations (PAVs) genes, and exhibited multiple modes of gene action. These different genes showing differential expression patterns were mainly enriched in five cellular component categories (organelle, cell, cell part, organelle part and macromolecular complex), five molecular function categories (structural molecule activity, binding, transporter activity, nucleic acid binding transcription factor activity and catalytic activity), and eight biological process categories (cellular process, metabolic process, biological regulation, regulation of biological process, establishment of localization, cellular component organization or biogenesis, response to stimulus and localization). Additionally, a significant number of genes were expressed in only one inbred line or absent in both inbred lines. Comparison of the differences of modes of gene action between previous studies and the present study revealed only a small number of different genes had the same modes of gene action in both maize seedlings and ear inflorescences. This might be an indication that in

  19. Heterosis in Early Maize Ear Inflorescence Development: A Genome-Wide Transcription Analysis for Two Maize Inbred Lines and Their Hybrid

    Directory of Open Access Journals (Sweden)

    Haiping Ding

    2014-08-01

    Full Text Available Heterosis, or hybrid vigor, contributes to superior agronomic performance of hybrids compared to their inbred parents. Despite its importance, little is known about the genetic and molecular basis of heterosis. Early maize ear inflorescences formation affects grain yield, and are thus an excellent model for molecular mechanisms involved in heterosis. To determine the parental contributions and their regulation during maize ear-development-genesis, we analyzed genome-wide digital gene expression profiles in two maize elite inbred lines (B73 and Mo17 and their F1 hybrid using deep sequencing technology. Our analysis revealed 17,128 genes expressed in these three genotypes and 22,789 genes expressed collectively in the present study. Approximately 38% of the genes were differentially expressed in early maize ear inflorescences from heterotic cross, including many transcription factor genes and some presence/absence variations (PAVs genes, and exhibited multiple modes of gene action. These different genes showing differential expression patterns were mainly enriched in five cellular component categories (organelle, cell, cell part, organelle part and macromolecular complex, five molecular function categories (structural molecule activity, binding, transporter activity, nucleic acid binding transcription factor activity and catalytic activity, and eight biological process categories (cellular process, metabolic process, biological regulation, regulation of biological process, establishment of localization, cellular component organization or biogenesis, response to stimulus and localization. Additionally, a significant number of genes were expressed in only one inbred line or absent in both inbred lines. Comparison of the differences of modes of gene action between previous studies and the present study revealed only a small number of different genes had the same modes of gene action in both maize seedlings and ear inflorescences. This might be an

  20. Genome-wide profiling of AP-1-regulated transcription provides insights into the invasiveness of triple-negative breast cancer.

    Science.gov (United States)

    Zhao, Chunyan; Qiao, Yichun; Jonsson, Philip; Wang, Jian; Xu, Li; Rouhi, Pegah; Sinha, Indranil; Cao, Yihai; Williams, Cecilia; Dahlman-Wright, Karin

    2014-07-15

    Triple-negative breast cancer (TNBC) is an aggressive clinical subtype accounting for up to 20% of all breast cancers, but its malignant determinants remain largely undefined. Here, we show that in TNBC the overexpression of Fra-1, a component of the transcription factor AP-1, offers prognostic potential. Fra-1 depletion or its heterodimeric partner c-Jun inhibits the proliferative and invasive phenotypes of TNBC cells in vitro. Similarly, RNAi-mediated attenuation of Fra-1 or c-Jun reduced cellular invasion in vivo in a zebrafish tumor xenograft model. Exploring the AP-1 cistrome and the AP-1-regulated transcriptome, we obtained insights into the transcriptional regulatory networks of AP-1 in TNBC cells. Among the direct targets identified for Fra-1/c-Jun involved in proliferation, adhesion, and cell-cell contact, we found that AP-1 repressed the expression of E-cadherin by transcriptional upregulation of ZEB2 to stimulate cell invasion. Overall, this work illuminates the pathways through which TNBC cells acquire invasive and proliferative properties.

  1. Spatially Resolved Genome-wide Transcriptional Profiling Identifies BMP Signaling as Essential Regulator of Zebrafish Cardiomyocyte Regeneration.

    Science.gov (United States)

    Wu, Chi-Chung; Kruse, Fabian; Vasudevarao, Mohankrishna Dalvoy; Junker, Jan Philipp; Zebrowski, David C; Fischer, Kristin; Noël, Emily S; Grün, Dominic; Berezikov, Eugene; Engel, Felix B; van Oudenaarden, Alexander; Weidinger, Gilbert; Bakkers, Jeroen

    2016-01-11

    In contrast to mammals, zebrafish regenerate heart injuries via proliferation of cardiomyocytes located near the wound border. To identify regulators of cardiomyocyte proliferation, we used spatially resolved RNA sequencing (tomo-seq) and generated a high-resolution genome-wide atlas of gene expression in the regenerating zebrafish heart. Interestingly, we identified two wound border zones with distinct expression profiles, including the re-expression of embryonic cardiac genes and targets of bone morphogenetic protein (BMP) signaling. Endogenous BMP signaling has been reported to be detrimental to mammalian cardiac repair. In contrast, we find that genetic or chemical inhibition of BMP signaling in zebrafish reduces cardiomyocyte dedifferentiation and proliferation, ultimately compromising myocardial regeneration, while bmp2b overexpression is sufficient to enhance it. Our results provide a resource for further studies on the molecular regulation of cardiac regeneration and reveal intriguing differential cellular responses of cardiomyocytes to a conserved signaling pathway in regenerative versus non-regenerative hearts.

  2. Genome-wide Reconstruction of OxyR and SoxRS Transcriptional Regulatory Networks under Oxidative Stress in Escherichia coli K-12 MG1655

    Directory of Open Access Journals (Sweden)

    Sang Woo Seo

    2015-08-01

    Full Text Available Three transcription factors (TFs, OxyR, SoxR, and SoxS, play a critical role in transcriptional regulation of the defense system for oxidative stress in bacteria. However, their full genome-wide regulatory potential is unknown. Here, we perform a genome-scale reconstruction of the OxyR, SoxR, and SoxS regulons in Escherichia coli K-12 MG1655. Integrative data analysis reveals that a total of 68 genes in 51 transcription units (TUs belong to these regulons. Among them, 48 genes showed more than 2-fold changes in expression level under single-TF-knockout conditions. This reconstruction expands the genome-wide roles of these factors to include direct activation of genes related to amino acid biosynthesis (methionine and aromatic amino acids, cell wall synthesis (lipid A biosynthesis and peptidoglycan growth, and divalent metal ion transport (Mn2+, Zn2+, and Mg2+. Investigating the co-regulation of these genes with other stress-response TFs reveals that they are independently regulated by stress-specific TFs.

  3. Uncovering the genome-wide transcriptional responses of the filamentous fungus Aspergillus niger to lignocellulose using RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Stéphane Delmas

    Full Text Available A key challenge in the production of second generation biofuels is the conversion of lignocellulosic substrates into fermentable sugars. Enzymes, particularly those from fungi, are a central part of this process, and many have been isolated and characterised. However, relatively little is known of how fungi respond to lignocellulose and produce the enzymes necessary for dis-assembly of plant biomass. We studied the physiological response of the fungus Aspergillus niger when exposed to wheat straw as a model lignocellulosic substrate. Using RNA sequencing we showed that, 24 hours after exposure to straw, gene expression of known and presumptive plant cell wall-degrading enzymes represents a huge investment for the cells (about 20% of the total mRNA. Our results also uncovered new esterases and surface interacting proteins that might form part of the fungal arsenal of enzymes for the degradation of plant biomass. Using transcription factor deletion mutants (xlnR and creA to study the response to both lignocellulosic substrates and low carbon source concentrations, we showed that a subset of genes coding for degradative enzymes is induced by starvation. Our data support a model whereby this subset of enzymes plays a scouting role under starvation conditions, testing for available complex polysaccharides and liberating inducing sugars, that triggers the subsequent induction of the majority of hydrolases. We also showed that antisense transcripts are abundant and that their expression can be regulated by growth conditions.

  4. Genome-wide Functional Analysis of CREB/Long-Term Memory-Dependent Transcription Reveals Distinct Basal and Memory Gene Expression Programs

    Science.gov (United States)

    Lakhina, Vanisha; Arey, Rachel N.; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T.

    2014-01-01

    SUMMARY Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components. PMID:25611510

  5. Genome-wide functional analysis of CREB/long-term memory-dependent transcription reveals distinct basal and memory gene expression programs.

    Science.gov (United States)

    Lakhina, Vanisha; Arey, Rachel N; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T

    2015-01-21

    Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components.

  6. Genome-Wide Transcriptional Start Site Mapping and sRNA Identification in the Pathogen Leptospira interrogans

    Science.gov (United States)

    Zhukova, Anna; Fernandes, Luis Guilherme; Hugon, Perrine; Pappas, Christopher J.; Sismeiro, Odile; Coppée, Jean-Yves; Becavin, Christophe; Malabat, Christophe; Eshghi, Azad; Zhang, Jun-Jie; Yang, Frank X.; Picardeau, Mathieu

    2017-01-01

    Leptospira are emerging zoonotic pathogens transmitted from animals to humans typically through contaminated environmental sources of water and soil. Regulatory pathways of pathogenic Leptospira spp. underlying the adaptive response to different hosts and environmental conditions remains elusive. In this study, we provide the first global Transcriptional Start Site (TSS) map of a Leptospira species. RNA was obtained from the pathogen Leptospira interrogans grown at 30°C (optimal in vitro temperature) and 37°C (host temperature) and selectively enriched for 5′ ends of native transcripts. A total of 2865 and 2866 primary TSS (pTSS) were predicted in the genome of L. interrogans at 30 and 37°C, respectively. The majority of the pTSSs were located between 0 and 10 nucleotides from the translational start site, suggesting that leaderless transcripts are a common feature of the leptospiral translational landscape. Comparative differential RNA-sequencing (dRNA-seq) analysis revealed conservation of most pTSS at 30 and 37°C. Promoter prediction algorithms allow the identification of the binding sites of the alternative sigma factor sigma 54. However, other motifs were not identified indicating that Leptospira consensus promoter sequences are inherently different from the Escherichia coli model. RNA sequencing also identified 277 and 226 putative small regulatory RNAs (sRNAs) at 30 and 37°C, respectively, including eight validated sRNAs by Northern blots. These results provide the first global view of TSS and the repertoire of sRNAs in L. interrogans. These data will establish a foundation for future experimental work on gene regulation under various environmental conditions including those in the host. PMID:28154810

  7. Genome-wide expression profiling of five mouse models identifies similarities and differences with human psoriasis.

    Directory of Open Access Journals (Sweden)

    William R Swindell

    Full Text Available Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1. While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis.

  8. Hypersensitive photic responses and intact genome-wide transcriptional control without the KaiC phosphorylation cycle in the Synechococcus circadian system.

    Science.gov (United States)

    Umetani, Miki; Hosokawa, Norimune; Kitayama, Yohko; Iwasaki, Hideo

    2014-02-01

    Cyanobacteria are unique organisms with remarkably stable circadian oscillations. These are controlled by a network architecture that comprises two regulatory factors: posttranslational oscillation (PTO) and a transcription/translation feedback loop (TTFL). The clock proteins KaiA, KaiB, and KaiC are essential for the circadian rhythm of the unicellular species Synechococcus elongatus PCC 7942. Temperature-compensated autonomous cycling of KaiC phosphorylation has been proposed as the primary oscillator mechanism that maintains the circadian clock, even in the dark, and it controls genome-wide gene expression rhythms under continuous-light conditions (LL). However, the kaiC(EE) mutation (where "EE" represents the amino acid changes Ser431Glu and Thr432Glu), where phosphorylation cycling does not occur in vivo, has a damped but clear kaiBC expression rhythm with a long period. This suggests that there must be coupling between the robust PTO and the "slave" unstable TTFL. Here, we found that the kaiC(EE) mutant strain in LL was hypersensitive to the dark acclimation required for phase shifting. Twenty-three percent of the genes in the kaiC(EE) mutant strain exhibited genome-wide transcriptional rhythms with a period of 48 h in LL. The circadian phase distribution was also conserved significantly in most of the wild-type and kaiC(EE) mutant strain cycling genes, which suggests that the output mechanism was not damaged severely even in the absence of KaiC phosphorylation cycles. These results strongly suggest that the KaiC phosphorylation cycle is not essential for generating the genome-wide rhythm under light conditions, whereas it is important for appropriate circadian timing in the light and dark.

  9. Local Genealogies in a Linear Mixed Model for Genome-wide Association Mapping in Complex Pedigreed Populations

    DEFF Research Database (Denmark)

    Sahana, Goutam; Mailund, Thomas; Lund, Mogens Sandø

    2011-01-01

    be extended to incorporate other effects in a straightforward and rigorous fashion. Here, we present a complementary approach, called ‘GENMIX (genealogy based mixed model)’ which combines advantages from two powerful GWAS methods: genealogy-based haplotype grouping and MMA. Subjects and Methods: We validated......Introduction: The state-of-the-art for dealing with multiple levels of relationship among the samples in genome-wide association studies (GWAS) is unified mixed model analysis (MMA). This approach is very flexible, can be applied to both family-based and population-based samples, and can...

  10. No DNA damage response and negligible genome-wide transcriptional changes in human embryonic stem cells exposed to terahertz radiation.

    Science.gov (United States)

    Bogomazova, A N; Vassina, E M; Goryachkovskaya, T N; Popik, V M; Sokolov, A S; Kolchanov, N A; Lagarkova, M A; Kiselev, S L; Peltek, S E

    2015-01-13

    Terahertz (THz) radiation was proposed recently for use in various applications, including medical imaging and security scanners. However, there are concerns regarding the possible biological effects of non-ionising electromagnetic radiation in the THz range on cells. Human embryonic stem cells (hESCs) are extremely sensitive to environmental stimuli, and we therefore utilised this cell model to investigate the non-thermal effects of THz irradiation. We studied DNA damage and transcriptome responses in hESCs exposed to narrow-band THz radiation (2.3 THz) under strict temperature control. The transcription of approximately 1% of genes was subtly increased following THz irradiation. Functional annotation enrichment analysis of differentially expressed genes revealed 15 functional classes, which were mostly related to mitochondria. Terahertz irradiation did not induce the formation of γH2AX foci or structural chromosomal aberrations in hESCs. We did not observe any effect on the mitotic index or morphology of the hESCs following THz exposure.

  11. Genome-Wide Identification and Characterization of bZIP Transcription Factors in Brassica oleracea under Cold Stress

    OpenAIRE

    Hwang, Indeok; Manoharan, Ranjith Kumar; Kang, Jong-Goo; Chung, Mi-Young; Kim, Young-Wook; Nou, Ill-Sup

    2016-01-01

    Cabbages (Brassica oleracea L.) are an important vegetable crop around world, and cold temperature is among the most significant abiotic stresses causing agricultural losses, especially in cabbage crops. Plant bZIP transcription factors play diverse roles in biotic/abiotic stress responses. In this study, 119 putative BolbZIP transcription factors were identified using amino acid sequences from several bZIP domain consensus sequences. The BolbZIP members were classified into 63 categories bas...

  12. Genome-Wide Binding Analysis of the Transcription Activator IDEAL PLANT ARCHITECTURE1 Reveals a Complex Network Regulating Rice Plant Architecture[W

    Science.gov (United States)

    Lu, Zefu; Yu, Hong; Xiong, Guosheng; Wang, Jing; Jiao, Yongqing; Liu, Guifu; Jing, Yanhui; Meng, Xiangbing; Hu, Xingming; Qian, Qian; Fu, Xiangdong; Wang, Yonghong; Li, Jiayang

    2013-01-01

    IDEAL PLANT ARCHITECTURE1 (IPA1) is critical in regulating rice (Oryza sativa) plant architecture and substantially enhances grain yield. To elucidate its molecular basis, we first confirmed IPA1 as a functional transcription activator and then identified 1067 and 2185 genes associated with IPA1 binding sites in shoot apices and young panicles, respectively, through chromatin immunoprecipitation sequencing assays. The SQUAMOSA PROMOTER BINDING PROTEIN-box direct binding core motif GTAC was highly enriched in IPA1 binding peaks; interestingly, a previously uncharacterized indirect binding motif TGGGCC/T was found to be significantly enriched through the interaction of IPA1 with proliferating cell nuclear antigen PROMOTER BINDING FACTOR1 or PROMOTER BINDING FACTOR2. Genome-wide expression profiling by RNA sequencing revealed IPA1 roles in diverse pathways. Moreover, our results demonstrated that IPA1 could directly bind to the promoter of rice TEOSINTE BRANCHED1, a negative regulator of tiller bud outgrowth, to suppress rice tillering, and directly and positively regulate DENSE AND ERECT PANICLE1, an important gene regulating panicle architecture, to influence plant height and panicle length. The elucidation of target genes of IPA1 genome-wide will contribute to understanding the molecular mechanisms underlying plant architecture and to facilitating the breeding of elite varieties with ideal plant architecture. PMID:24170127

  13. Genome-wide binding analysis of the transcription activator ideal plant architecture1 reveals a complex network regulating rice plant architecture.

    Science.gov (United States)

    Lu, Zefu; Yu, Hong; Xiong, Guosheng; Wang, Jing; Jiao, Yongqing; Liu, Guifu; Jing, Yanhui; Meng, Xiangbing; Hu, Xingming; Qian, Qian; Fu, Xiangdong; Wang, Yonghong; Li, Jiayang

    2013-10-01

    Ideal plant architecture1 (IPA1) is critical in regulating rice (Oryza sativa) plant architecture and substantially enhances grain yield. To elucidate its molecular basis, we first confirmed IPA1 as a functional transcription activator and then identified 1067 and 2185 genes associated with IPA1 binding sites in shoot apices and young panicles, respectively, through chromatin immunoprecipitation sequencing assays. The Squamosa promoter binding protein-box direct binding core motif GTAC was highly enriched in IPA1 binding peaks; interestingly, a previously uncharacterized indirect binding motif TGGGCC/T was found to be significantly enriched through the interaction of IPA1 with proliferating cell nuclear antigen promoter binding factor1 or promoter binding factor2. Genome-wide expression profiling by RNA sequencing revealed IPA1 roles in diverse pathways. Moreover, our results demonstrated that IPA1 could directly bind to the promoter of rice teosinte branched1, a negative regulator of tiller bud outgrowth, to suppress rice tillering, and directly and positively regulate dense and erect panicle1, an important gene regulating panicle architecture, to influence plant height and panicle length. The elucidation of target genes of IPA1 genome-wide will contribute to understanding the molecular mechanisms underlying plant architecture and to facilitating the breeding of elite varieties with ideal plant architecture.

  14. A foundation for provitamin A biofortification of maize: genome-wide association and genomic prediction models of carotenoid levels.

    Science.gov (United States)

    Owens, Brenda F; Lipka, Alexander E; Magallanes-Lundback, Maria; Tiede, Tyler; Diepenbrock, Christine H; Kandianis, Catherine B; Kim, Eunha; Cepela, Jason; Mateos-Hernandez, Maria; Buell, C Robin; Buckler, Edward S; DellaPenna, Dean; Gore, Michael A; Rocheford, Torbert

    2014-12-01

    Efforts are underway for development of crops with improved levels of provitamin A carotenoids to help combat dietary vitamin A deficiency. As a global staple crop with considerable variation in kernel carotenoid composition, maize (Zea mays L.) could have a widespread impact. We performed a genome-wide association study (GWAS) of quantified seed carotenoids across a panel of maize inbreds ranging from light yellow to dark orange in grain color to identify some of the key genes controlling maize grain carotenoid composition. Significant associations at the genome-wide level were detected within the coding regions of zep1 and lut1, carotenoid biosynthetic genes not previously shown to impact grain carotenoid composition in association studies, as well as within previously associated lcyE and crtRB1 genes. We leveraged existing biochemical and genomic information to identify 58 a priori candidate genes relevant to the biosynthesis and retention of carotenoids in maize to test in a pathway-level analysis. This revealed dxs2 and lut5, genes not previously associated with kernel carotenoids. In genomic prediction models, use of markers that targeted a small set of quantitative trait loci associated with carotenoid levels in prior linkage studies were as effective as genome-wide markers for predicting carotenoid traits. Based on GWAS, pathway-level analysis, and genomic prediction studies, we outline a flexible strategy involving use of a small number of genes that can be selected for rapid conversion of elite white grain germplasm, with minimal amounts of carotenoids, to orange grain versions containing high levels of provitamin A.

  15. Genome-wide prediction, display and refinement of binding sites with information theory-based models

    Directory of Open Access Journals (Sweden)

    Leeder J Steven

    2003-09-01

    . Conclusions Delila-Genome was used to scan the human genome sequence with information weight matrices of transcription factor binding sites, including PXR/RXRα, AHR and NF-κB p50/p65, and matrices for RNA binding sites including splice donor, acceptor, and SC35 recognition sites. Comparisons of genome scans with the original and refined PXR/RXRα information weight matrices indicate that the refined model more accurately predicts the strengths of known binding sites and is more sensitive for detection of novel binding sites.

  16. Genome-wide analysis of human global and transcription-coupled excision repair of UV damage at single-nucleotide resolution.

    Science.gov (United States)

    Hu, Jinchuan; Adar, Sheera; Selby, Christopher P; Lieb, Jason D; Sancar, Aziz

    2015-05-01

    We developed a method for genome-wide mapping of DNA excision repair named XR-seq (excision repair sequencing). Human nucleotide excision repair generates two incisions surrounding the site of damage, creating an ∼30-mer. In XR-seq, this fragment is isolated and subjected to high-throughput sequencing. We used XR-seq to produce stranded, nucleotide-resolution maps of repair of two UV-induced DNA damages in human cells: cyclobutane pyrimidine dimers (CPDs) and (6-4) pyrimidine-pyrimidone photoproducts [(6-4)PPs]. In wild-type cells, CPD repair was highly associated with transcription, specifically with the template strand. Experiments in cells defective in either transcription-coupled excision repair or general excision repair isolated the contribution of each pathway to the overall repair pattern and showed that transcription-coupled repair of both photoproducts occurs exclusively on the template strand. XR-seq maps capture transcription-coupled repair at sites of divergent gene promoters and bidirectional enhancer RNA (eRNA) production at enhancers. XR-seq data also uncovered the repair characteristics and novel sequence preferences of CPDs and (6-4)PPs. XR-seq and the resulting repair maps will facilitate studies of the effects of genomic location, chromatin context, transcription, and replication on DNA repair in human cells.

  17. Genome-wide mapping of transcription start sites yields novel insights into the primary transcriptome of Pseudomonas putida

    DEFF Research Database (Denmark)

    D'Arrigo, Isotta; Bojanovic, Klara; Yang, Xiaochen

    2016-01-01

    was examined using an in vivo assay with GFP-fusion vectors and shown to function via a translational repression mechanism. Furthermore, 56 novel intergenic small RNAs and 8 putative actuaton transcripts were detected, as well as 8 novel open reading frames (ORFs). This study illustrates how global mapping...... elements of P. putida strain KT2440. A total of 7937 putative transcription start sites (TSSs) were identified, where over two-thirds were located either on the opposite strand or internal to annotated genes. For TSSs associated with mRNAs, sequence analysis revealed a clear Shine–Dalgarno sequence...... but a lack of conserved overrepresented promoter motifs. These TSSs defined approximately 50 leaderless transcripts and an abundance of mRNAs with long leader regions of which 18 contain RNA regulatory elements from the Rfam database. The thiamine pyrophosphate riboswitch upstream of the thiC gene...

  18. A mixed model reduces spurious genetic associations produced by population stratification in genome-wide association studies.

    Science.gov (United States)

    Shin, Jimin; Lee, Chaeyoung

    2015-04-01

    Population stratification can produce spurious genetic associations in genome-wide association studies (GWASs). Mixed model methodology has been regarded useful for correcting population stratification. This study explored statistical power and false discovery rate (FDR) with the data simulated for dichotomous traits. Empirical FDRs and powers were estimated using fixed models with and without genomic control and using mixed models with and without reflecting loci linked to the candidate marker in genetic relationships. Population stratification with admixture degree ranged from 1% to 10% resulted in inflated FDRs from the fixed model analysis without genomic control and decreased power from the fixed model analysis with genomic control (Ppopulation stratification could not change FDR and power estimates from the mixed model analyses (P>0.05). We suggest that the mixed model methodology was useful to reduce spurious genetic associations produced by population stratification in GWAS, even with a high degree of admixture (10%). Copyright © 2015 Elsevier Inc. All rights reserved.

  19. RNA-Seq analysis of stuA mutants in Fusarium verticillioides indicates dramatic genomic wide transcriptional reprogramming

    Science.gov (United States)

    StuA, first discovered in Aspergillus nidulans and a member of the APSES class of transcription factors, regulates several essential developmental stages in fungi such as virulence, sporulation and toxin production in phytopathogenic fungi. Fusarium verticillioides (Fv), a maize phytopathogen, produ...

  20. The Role of α-CTD in the Genome-Wide Transcriptional Regulation of the Bacillus subtilis Cells.

    Directory of Open Access Journals (Sweden)

    Satohiko Murayama

    Full Text Available The amino acid sequence of the RNA polymerase (RNAP α-subunit is well conserved throughout the Eubacteria. Its C-terminal domain (α-CTD is important for the transcriptional regulation of specific promoters in both Escherichia coli and Bacillus subtilis, through interactions with transcription factors and/or a DNA element called the "UP element". However, there is only limited information regarding the α-CTD regulated genes in B. subtilis and the importance of this subunit in the transcriptional regulation of B. subtilis. Here, we established strains and the growth conditions in which the α-subunit of RNAP was replaced with a C-terminally truncated version. Transcriptomic and ChAP-chip analyses revealed that α-CTD deficiency reduced the transcription and RNAP binding of genes related to the utilization of secondary carbon sources, transition state responses, and ribosome synthesis. In E. coli, it is known that α-CTD also contributes to the expression of genes related to the utilization of secondary carbon sources and ribosome synthesis. Our results suggest that the biological importance of α-CTD is conserved in B. subtilis and E. coli, but that its specific roles have diversified between these two bacteria.

  1. Genome-wide transcriptional analysis of salinity stressed japonica and indica rice genotypes during panicle initiation stage.

    Science.gov (United States)

    Walia, Harkamal; Wilson, Clyde; Zeng, Linghe; Ismail, Abdelbagi M; Condamine, Pascal; Close, Timothy J

    2007-03-01

    Rice yield is most sensitive to salinity stress imposed during the panicle initiation (PI) stage. In this study, we have focused on physiological and transcriptional responses of four rice genotypes exposed to salinity stress during PI. The genotypes selected included a pair of indicas (IR63731 and IR29) and a pair of japonica (Agami and M103) rice subspecies with contrasting salt tolerance. Physiological characterization showed that tolerant genotypes maintained a much lower shoot Na+ concentration relative to sensitive genotypes under salinity stress. Global gene expression analysis revealed a strikingly large number of genes which are induced by salinity stress in sensitive genotypes, IR29 and M103 relative to tolerant lines. We found 19 probe sets to be commonly induced in all four genotypes. We found several salinity modulated, ion homeostasis related genes from our analysis. We also studied the expression of SKC1, a cation transporter reported by others as a major source of variation in salt tolerance in rice. The transcript abundance of SKC1 did not change in response to salinity stress at PI stage in the shoot tissue of all four genotypes. However, we found the transcript abundance of SKC1 to be significantly higher in tolerant japonica Agami relative to sensitive japonica M103 under control and stressed conditions during PI stage.

  2. Genome-Wide Mapping of Collier In Vivo Binding Sites Highlights Its Hierarchical Position in Different Transcription Regulatory Networks.

    Directory of Open Access Journals (Sweden)

    Mathilde de Taffin

    Full Text Available Collier, the single Drosophila COE (Collier/EBF/Olf-1 transcription factor, is required in several developmental processes, including head patterning and specification of muscle and neuron identity during embryogenesis. To identify direct Collier (Col targets in different cell types, we used ChIP-seq to map Col binding sites throughout the genome, at mid-embryogenesis. In vivo Col binding peaks were associated to 415 potential direct target genes. Gene Ontology analysis revealed a strong enrichment in proteins with DNA binding and/or transcription-regulatory properties. Characterization of a selection of candidates, using transgenic CRM-reporter assays, identified direct Col targets in dorso-lateral somatic muscles and specific neuron types in the central nervous system. These data brought new evidence that Col direct control of the expression of the transcription regulators apterous and eyes-absent (eya is critical to specifying neuronal identities. They also showed that cross-regulation between col and eya in muscle progenitor cells is required for specification of muscle identity, revealing a new parallel between the myogenic regulatory networks operating in Drosophila and vertebrates. Col regulation of eya, both in specific muscle and neuronal lineages, may illustrate one mechanism behind the evolutionary diversification of Col biological roles.

  3. Genome-wide analysis and expression profiling of the ERF transcription factor family in potato (Solanum tuberosum L.).

    Science.gov (United States)

    Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia

    2015-04-01

    The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.

  4. Genome-wide mapping of human DNA-replication origins: levels of transcription at ORC1 sites regulate origin selection and replication timing.

    Science.gov (United States)

    Dellino, Gaetano Ivan; Cittaro, Davide; Piccioni, Rossana; Luzi, Lucilla; Banfi, Stefania; Segalla, Simona; Cesaroni, Matteo; Mendoza-Maldonado, Ramiro; Giacca, Mauro; Pelicci, Pier Giuseppe

    2013-01-01

    We report the genome-wide mapping of ORC1 binding sites in mammals, by chromatin immunoprecipitation and parallel sequencing (ChIP-seq). ORC1 binding sites in HeLa cells were validated as active DNA replication origins (ORIs) using Repli-seq, a method that allows identification of ORI-containing regions by parallel sequencing of temporally ordered replicating DNA. ORC1 sites were universally associated with transcription start sites (TSSs) of coding or noncoding RNAs (ncRNAs). Transcription levels at the ORC1 sites directly correlated with replication timing, suggesting the existence of two classes of ORIs: those associated with moderate/high transcription levels (≥1 RNA copy/cell), firing in early S and mapping to the TSSs of coding RNAs; and those associated with low transcription levels (<1 RNA copy/cell), firing throughout the entire S and mapping to TSSs of ncRNAs. These findings are compatible with a scenario whereby TSS expression levels influence the efficiency of ORC1 recruitment at G(1) and the probability of firing during S.

  5. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

    Science.gov (United States)

    Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-03-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

  6. Identification of regulatory network topological units coordinating the genome-wide transcriptional response to glucose in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Gosset Guillermo

    2007-06-01

    Full Text Available Abstract Background Glucose is the preferred carbon and energy source for Escherichia coli. A complex regulatory network coordinates gene expression, transport and enzyme activities in response to the presence of this sugar. To determine the extent of the cellular response to glucose, we applied an approach combining global transcriptome and regulatory network analyses. Results Transcriptome data from isogenic wild type and crp- strains grown in Luria-Bertani medium (LB or LB + 4 g/L glucose (LB+G were analyzed to identify differentially transcribed genes. We detected 180 and 200 genes displaying increased and reduced relative transcript levels in the presence of glucose, respectively. The observed expression pattern in LB was consistent with a gluconeogenic metabolic state including active transport and interconversion of small molecules and macromolecules, induction of protease-encoding genes and a partial heat shock response. In LB+G, catabolic repression was detected for transport and metabolic interconversion activities. We also detected an increased capacity for de novo synthesis of nucleotides, amino acids and proteins. Cluster analysis of a subset of genes revealed that CRP mediates catabolite repression for most of the genes displaying reduced transcript levels in LB+G, whereas Fis participates in the upregulation of genes under this condition. An analysis of the regulatory network, in terms of topological functional units, revealed 8 interconnected modules which again exposed the importance of Fis and CRP as directly responsible for the coordinated response of the cell. This effect was also seen with other not extensively connected transcription factors such as FruR and PdhR, which showed a consistent response considering media composition. Conclusion This work allowed the identification of eight interconnected regulatory network modules that includes CRP, Fis and other transcriptional factors that respond directly or indirectly to the

  7. Genome-wide Analysis of BP1 Transcriptional Targets in Breast Cancer Cell Line Hs578T

    Directory of Open Access Journals (Sweden)

    Yongchun Song, Chengxue Dang, Yebo Fu, Yi Lian, Jenny Hottel, Xuelan Li, Tim McCaffrey, Sidney W. Fu

    2009-01-01

    Full Text Available Homeobox genes are known to be critically important in tumor development and progression. The BP1 (Beta Protein 1 gene, an isoform of DLX4, belongs to the Distal-less (DLX subfamily of homeobox genes and encodes a homeodomain-containing transcription factor. Our studies have shown that the BP1 gene was overexpressed in 81% of primary breast cancer and its expression was closely correlated with the progression of breast cancer. However, the exact role of BP1 in breast has yet to be elucidated. Therefore, it is important to explore the potential transcriptional targets of BP1 via whole genome-scale screening. In this study, we used the chromatin immunoprecipitation on chip (ChIP-on-chip and gene expression microarray assays to identify candidate target genes and gene networks, which are directly regulated by BP1 in ER negative (ER- breast cancer cells. After rigorous bioinformatic and statistical analysis for both ChIP-on-chip and expression microarray gene lists, 18 overlapping genes were noted and verified. Those potential target genes are involved in a variety of tumorigenic pathways, which sheds light on the functional mechanisms of BP1 in breast cancer development and progression.

  8. Genome-Wide Transcriptional Profile Analysis of Prunus persica in Response to Low Sink Demand after Fruit Removal.

    Science.gov (United States)

    Duan, Wei; Xu, Hongguo; Liu, Guotian; Fan, Peige; Liang, Zhenchang; Li, Shaohua

    2016-01-01

    Prunus persica fruits were removed from 1-year-old shoots to analysis photosynthesis, chlorophyll fluorescence and genes changes in leaves to low sink demand caused by fruit removal (-fruit) during the final stage of rapid fruit growth. A decline in net photosynthesis rate was observed, accompanied with a decrease in stomatal conductance. The intercellular CO2 concentrations and leaf temperature increased as compared with a normal fruit load (+fruit). Moreover, low sink demand significantly inhibited the donor side and the reaction center of photosystem II. 382 genes in leaf with an absolute fold change ≥1 change in expression level, representing 116 up- and 266 down-regulated genes except for unknown transcripts. Among these, 25 genes for photosynthesis were down-regulated, 69 stress and 19 redox related genes up-regulated under the low sink demand. These studies revealed high leaf temperature may result in a decline of net photosynthesis rate through down-regulation in photosynthetic related genes and up-regulation in redox and stress related genes, especially heat shock proteins genes. The complex changes in genes at the transcriptional level under low sink demand provided useful starting points for in-depth analyses of source-sink relationship in P. persica.

  9. Genome-wide gene regulation of biosynthesis and energy generation by a novel transcriptional repressor in Geobacter species.

    Science.gov (United States)

    Ueki, Toshiyuki; Lovley, Derek R

    2010-01-01

    Geobacter species play important roles in bioremediation of contaminated environments and in electricity production from waste organic matter in microbial fuel cells. To better understand physiology of Geobacter species, expression and function of citrate synthase, a key enzyme in the TCA cycle that is important for organic acid oxidation in Geobacter species, was investigated. Geobacter sulfurreducens did not require citrate synthase for growth with hydrogen as the electron donor and fumarate as the electron acceptor. Expression of the citrate synthase gene, gltA, was repressed by a transcription factor under this growth condition. Functional and comparative genomics approaches, coupled with genetic and biochemical assays, identified a novel transcription factor termed HgtR that acts as a repressor for gltA. Further analysis revealed that HgtR is a global regulator for genes involved in biosynthesis and energy generation in Geobacter species. The hgtR gene was essential for growth with hydrogen, during which hgtR expression was induced. These findings provide important new insights into the mechanisms by which Geobacter species regulate their central metabolism under different environmental conditions.

  10. Identification of differentially expressed genes associated with flower color in peach using genome-wide transcriptional analysis.

    Science.gov (United States)

    Zhou, Y; Wu, X X; Zhang, Z; Gao, Z H

    2015-05-11

    Flower color is an important trait of the ornamental peach (Prunus persica L.). However, the mechanism responsible for the different colors that appear in the same genotype remains unclear. In this study, red samples showed higher anthocyanins content (0.122 ± 0.009 mg/g), which was significantly different from that in white samples (0.066 ± 0.010 mg/g). Similarly to carotenoids content, red extract (0.058 ± 0.004 mg/L) was significantly higher in white extract (0.015 ± 0.004 mg/L). We estimated gene expression using Illumina sequencing technology in libraries from white and red flower buds. A total of 3,599,960 and 3,464,141 tags were sequenced from the 2 libraries, respectively. Moreover, we identified 106 significantly differentially expressed genes between the 2 libraries. Among these, 78 and 28 represented transcripts with a higher or lower abundance of more than 2-fold than in the white flower library, respectively. GO annotation indicated that highly ranked genes were involved in the pigment biosynthetic process. Expression patterns of 11 genes were verified using quantitative reverse transcription-polymerase chain reaction assays. The results suggest that hydroxycinnamoyl-coenzyme A shikimate/quinate hydroxycinnamoyltransferase, 2-oxoglutarate-dependent dioxygenase, isoflavone reductase, riboflavin kinase, zeta-carotene desaturase, and ATP binding cassette transporter may be associated with the flower color formation. Our results may be useful for scientists focusing on Prunus persica floral development and biotechnology.

  11. Genome-wide identification of transcriptional start sites in the plant pathogen Pseudomonas syringae pv. tomato str. DC3000.

    Directory of Open Access Journals (Sweden)

    Melanie J Filiatrault

    Full Text Available RNA-Seq has provided valuable insights into global gene expression in a wide variety of organisms. Using a modified RNA-Seq approach and Illumina's high-throughput sequencing technology, we globally identified 5'-ends of transcripts for the plant pathogen Pseudomonas syringae pv. tomato str. DC3000. A substantial fraction of 5'-ends obtained by this method were consistent with results obtained using global RNA-Seq and 5'RACE. As expected, many 5'-ends were positioned a short distance upstream of annotated genes. We also captured 5'-ends within intergenic regions, providing evidence for the expression of un-annotated genes and non-coding RNAs, and detected numerous examples of antisense transcription, suggesting additional levels of complexity in gene regulation in DC3000. Importantly, targeted searches for sequence patterns in the vicinity of 5'-ends revealed over 1200 putative promoters and other regulatory motifs, establishing a broad foundation for future investigations of regulation at the genomic and single gene levels.

  12. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Yong Guo

    Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max. In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  13. ChIP on chip and ChIP-Seq assays: genome-wide analysis of transcription factor binding and histone modifications.

    Science.gov (United States)

    Pillai, Smitha; Chellappan, Srikumar P

    2015-01-01

    Deregulation of transcriptional activity of many genes has been causatively linked to human diseases including cancer. Altered patterns of gene expression in normal and cancer cells are the result of inappropriate expression of transcription factors and chromatin modifying proteins. Chromatin immunoprecipitation assay is a well-established tool for investigating the interactions between regulatory proteins and DNA at distinct stages of gene activation. ChIP coupled with DNA microarrays, known as ChIP on chip, or sequencing of DNA associated with the factors (ChIP-Seq) allow us to determine the entire spectrum of in vivo DNA binding sites for a given protein. This has been of immense value because ChIP on chip assays and ChIP-Seq experiments can provide a snapshot of the transcriptional regulatory mechanisms on a genome-wide scale. This chapter outlines the general strategies used to carry out ChIP-chip assays to study the differential recruitment of regulatory molecules based on the studies conducted in our lab as well as other published protocols; these can be easily modified to a ChIP-Seq analysis.

  14. Genome-wide association of coagulation properties, curd firmness modeling, protein percentage, and acidity in milk from Brown Swiss cows.

    Science.gov (United States)

    Dadousis, C; Biffani, S; Cipolat-Gotet, C; Nicolazzi, E L; Rossoni, A; Santus, E; Bittante, G; Cecchinato, A

    2016-05-01

    Cheese production is increasing in many countries, and a desire toward genetic selection for milk coagulation properties in dairy cattle breeding exists. However, measurements of individual cheesemaking properties are hampered by high costs and labor, whereas traditional single-point milk coagulation properties (MCP) are sometimes criticized. Nevertheless, new modeling of the entire curd firmness and syneresis process (CFt equation) offers new insight into the cheesemaking process. Moreover, identification of genomic regions regulating milk cheesemaking properties might enhance direct selection of individuals in breeding programs based on cheese ability rather than related milk components. Therefore, the objective of this study was to perform genome-wide association studies to identify genomic regions linked to traditional MCP and new CFt parameters, milk acidity (pH), and milk protein percentage. Milk and DNA samples from 1,043 Italian Brown Swiss cows were used. Milk pH and 3 MCP traits were grouped together to represent the MCP set. Four CFt equation parameters, 2 derived traits, and protein percentage were considered as the second group of traits (CFt set). Animals were genotyped with the Illumina SNP50 BeadChip v.2 (Illumina Inc., San Diego, CA). Multitrait animal models were used to estimate variance components. For genome-wide association studies, the genome-wide association using mixed model and regression-genomic control approach was used. In total, 106 significant marker traits associations and 66 single nucleotide polymorphisms were identified on 12 chromosomes (1, 6, 9, 11, 13, 15, 16, 19, 20, 23, 26, and 28). Sharp peaks were detected at 84 to 88 Mbp on Bos taurus autosome (BTA) 6, with a peak at 87.4 Mbp in the region harboring the casein genes. Evidence of quantitative trait loci at 82.6 and 88.4 Mbp on the same chromosome was found. All chromosomes but BTA6, BTA11, and BTA28 were associated with only one trait. Only BTA6 was in common between MCP

  15. Genome-wide Reconstruction of OxyR and SoxRS Transcriptional Regulatory Networks under Oxidative Stress in Escherichia coli K-12 MG1655

    DEFF Research Database (Denmark)

    Seo, Sang Woo; Kim, Donghyuk; Szubin, Richard;

    2015-01-01

    Three transcription factors (TFs), OxyR, SoxR, and SoxS, play a critical role in transcriptional regulation of the defense system for oxidative stress in bacteria. However, their full genome-wide regulatory potential is unknown. Here, we perform a genome-scale reconstruction of the OxyR, SoxR, an...

  16. Genome-wide analysis of the R2R3-MYB transcription factor gene family in sweet orange (Citrus sinensis).

    Science.gov (United States)

    Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang

    2014-10-01

    MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.

  17. Genome-wide analysis of Dof transcription factors reveals functional characteristics during development and response to biotic stresses in pepper

    Science.gov (United States)

    Kang, Won-Hee; Kim, Seungill; Lee, Hyun-Ah; Choi, Doil; Yeom, Seon-In

    2016-01-01

    The DNA-binding with one zinc finger proteins (Dofs) are a plant-specific family of transcription factors. The Dofs are involved in a variety of biological processes such as phytohormone production, seed development, and environmental adaptation. Dofs have been previously identified in several plants, but not in pepper. We identified 33 putative Dof genes in pepper (CaDofs). To gain an overview of the CaDofs, we analyzed phylogenetic relationships, protein motifs, and evolutionary history. We divided the 33 CaDofs, containing 25 motifs, into four major groups distributed on eight chromosomes. We discovered an expansion of the CaDofs dated to a recent duplication event. Segmental duplication that occurred before the speciation of the Solanaceae lineages was predominant among the CaDofs. The global gene-expression profiling of the CaDofs by RNA-seq analysis showed distinct temporal and pathogen-specific variation during development and response to biotic stresses (two TMV strains, PepMoV, and Phytophthora capsici), suggesting functional diversity among the CaDofs. These results will provide the useful clues into the responses of Dofs in biotic stresses and promote a better understanding of their multiple function in pepper and other species. PMID:27653666

  18. Genome wide transcription profiling of the effects of overexpression of Spc1 and its kinase dead mutant in Schizosaccharomyces pombe

    Directory of Open Access Journals (Sweden)

    Madhurima Paul

    2015-12-01

    Full Text Available The Mitogen Activated Protein Kinase Spc1 (p38 homolog is a major player in stress responses of the unicellular fission yeast Schizosaccharomyces pombe. This pathway is therefore also known as the SAPK or Stress Activated Protein Kinase pathway. Spc1 is a known activator of transcription factors that control gene expression in response to extracellular stimuli and is also known to interact with the translation machinery [1–8]. Spc1 has also been implicated in cell cycle regulation and meiosis in S. pombe [1,2,9,10]. Given its documented role in modulating gene expression, we performed a microarray based identification of genes whose expression in unperturbed cells (absence of stress stimuli is dependent on Spc1. For this we overexpressed Spc1 in S. pombe. Additionally we also overexpressed Spc1K49R (a kinase dead mutant of Spc1 to understand the contribution of Spc1's kinase activity towards the observed gene expression changes. The microarray data are available at NCBI's Gene Expression Omnibus (GEO Series (accession number GSE73618. Here we report the annotation of the genes whose expression get altered by Spc1/Spc1K49R overexpression and also provide details related to sample processing and statistical analysis of our microarray data.

  19. Genome-wide identification, classification, and analysis of heat shock transcription factor family in Chinese cabbage (Brassica rapa pekinensis).

    Science.gov (United States)

    Huang, X Y; Tao, P; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-03-27

    Chinese cabbage (Brassica rapa ssp. pekinensis) is one of the most important vegetable crops grown worldwide, and various methods exist for selection, propagation, and cultivation. The entire Chinese cabbage genome has been sequenced, and the heat shock transcription factor family (Hsfs) has been found to play a central role in plant growth and development and in the response to biotic and abiotic stress conditions, particularly in acquired thermotolerance. We analyzed heat tolerance mechanisms in Chinese cabbage. In this study, 30 Hsfs were identified from the Chinese cabbage genome database. The classification, phylogenetic reconstruction, chromosome distribution, conserved motifs, expression analysis, and interaction networks of the Hsfs were predicted and analyzed. Thirty BrHsfs were classified into 3 major classes (class A, B, and C) according to their structural characteristics and phylogenetic comparisons, and class A was further subdivided into 8 subclasses. Distribution mapping results showed that Hsf genes were located on 10 Chinese cabbage chromosomes. The expression profile indicated that Hsfs play differential roles in 5 organs in Chinese cabbage, and likely participate in the development of underground parts and regulation of reproductive growth. An orthologous gene interaction network was constructed, and included MBF1C, ROF1, TBP2, CDC2, and HSP70 5 genes, which are closely related to heat stress. Our results contribute to the understanding of the complexity of Hsfs in Chinese cabbage and provide a basis for further functional gene research.

  20. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    Directory of Open Access Journals (Sweden)

    Xiao-Jin Hou

    Full Text Available MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB. Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus.

  1. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    Science.gov (United States)

    Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2014-01-01

    MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus.

  2. Genome-Wide Expression Profiling of Five Mouse Models Identifies Similarities and Differences with Human Psoriasis

    NARCIS (Netherlands)

    Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.

    2011-01-01

    Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the huma

  3. Genome-wide expression profiling of five mouse models identifies similarities and differences with human psoriasis

    NARCIS (Netherlands)

    W.R. Swindell (William R.); A. Johnston (Andrew); S. Carbajal (Steve); G. Han (Gangwen); C.T. Wohn (Christopher); J. Lu (Jun); X. Xing (Xianying); R.P. Nair (Rajan P.); J.J. Voorhees (John); J.T. Elder (James); X.J. Wang (Xian Jiang); S. Sano (Shigetoshi); E.P. Prens (Errol); J. DiGiovanni (John); M.R. Pittelkow (Mark R.); N.L. Ward (Nicole); J.E. Gudjonsson (Johann Eli)

    2011-01-01

    textabstractDevelopment of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features

  4. Genome-wide expression profiling of five mouse models identifies similarities and differences with human psoriasis

    NARCIS (Netherlands)

    W.R. Swindell (William R.); A. Johnston (Andrew); S. Carbajal (Steve); G. Han (Gangwen); C.T. Wohn (Christopher); J. Lu (Jun); X. Xing (Xianying); R.P. Nair (Rajan P.); J.J. Voorhees (John); J.T. Elder (James); X.J. Wang (Xian Jiang); S. Sano (Shigetoshi); E.P. Prens (Errol); J. DiGiovanni (John); M.R. Pittelkow (Mark R.); N.L. Ward (Nicole); J.E. Gudjonsson (Johann Eli)

    2011-01-01

    textabstractDevelopment of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features

  5. Genome-wide Expression Analysis and Metabolite Profiling Elucidate Transcriptional Regulation of Flavonoid Biosynthesis and Modulation under Abiotic Stresses in Banana.

    Science.gov (United States)

    Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H; Trivedi, Prabodh K

    2016-08-19

    Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana.

  6. Genome-wide Expansion and Expression Divergence of the Basic Leucine Zipper Transcription Factors in Higher Plants with an Emphasis on Sorghum

    Institute of Scientific and Technical Information of China (English)

    Jizhou Wang; Junxia Zhou; Baolan Zhang; Jeevanandam Vanitha; Srinivasan Ramachandran; Shu-Ye Jiang

    2011-01-01

    Plant bZIP transcription factors play crucial roles in multiple biological processes. However,little is known about the sorghum bZIP gene family although the sorghum genome has been completely sequenced. In this study,we have carried out a genome-wide identification and characterization of this gene family in sorghum.Our data show that the genome encodes at least 92 bZIP transcription factors. These bZIP genes have been expanded mainly by segmental duplication. Such an expansion mechanism has also been observed in rice,arabidopsis and many other plant organisms,suggesting a common expansion mode of this gene family in plants. Further investigation shows that most of the bZIP members have been present in the most recent common ancestor of sorghum and rice and the major expansion would occur before the sorghum-rice split era. Although these bZIP genes have been duplicated with a long history,they exhibited limited functional divergence as shown by nonsynonymous substitutions (Ka)/synonymous substitutions (Ks) analyses. Their retention was mainly due to the high percentages of expression divergence. Our data also showed that this gene family might play a role in multiple developmental stages and tissues and might be regarded as important regulators of various abiotic stresses and sugar signaling.

  7. Genome-wide Expression Analysis and Metabolite Profiling Elucidate Transcriptional Regulation of Flavonoid Biosynthesis and Modulation under Abiotic Stresses in Banana

    Science.gov (United States)

    Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H.; Trivedi, Prabodh K.

    2016-01-01

    Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana. PMID:27539368

  8. Genome-wide integration on transcription factors, histone acetylation and gene expression reveals genes co-regulated by histone modification patterns.

    Directory of Open Access Journals (Sweden)

    Yayoi Natsume-Kitatani

    Full Text Available N-terminal tails of H2A, H2B, H3 and H4 histone families are subjected to posttranslational modifications that take part in transcriptional regulation mechanisms, such as transcription factor binding and gene expression. Regulation mechanisms under control of histone modification are important but remain largely unclear, despite of emerging datasets for comprehensive analysis of histone modification. In this paper, we focus on what we call genetic harmonious units (GHUs, which are co-occurring patterns among transcription factor binding, gene expression and histone modification. We present the first genome-wide approach that captures GHUs by combining ChIP-chip with microarray datasets from Saccharomyces cerevisiae. Our approach employs noise-robust soft clustering to select patterns which share the same preferences in transcription factor-binding, histone modification and gene expression, which are all currently implied to be closely correlated. The detected patterns are a well-studied acetylation of lysine 16 of H4 in glucose depletion as well as co-acetylation of five lysine residues of H3 with H4 Lys12 and H2A Lys7 responsible for ribosome biogenesis. Furthermore, our method further suggested the recognition of acetylated H4 Lys16 being crucial to histone acetyltransferase ESA1, whose essential role is still under controversy, from a microarray dataset on ESA1 and its bypass suppressor mutants. These results demonstrate that our approach allows us to provide clearer principles behind gene regulation mechanisms under histone modifications and detect GHUs further by applying to other microarray and ChIP-chip datasets. The source code of our method, which was implemented in MATLAB (http://www.mathworks.com/, is available from the supporting page for this paper: http://www.bic.kyoto-u.ac.jp/pathway/natsume/hm_detector.htm.

  9. FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.

    Directory of Open Access Journals (Sweden)

    Shouheng Tuo

    Full Text Available Two-locus model is a typical significant disease model to be identified in genome-wide association study (GWAS. Due to intensive computational burden and diversity of disease models, existing methods have drawbacks on low detection power, high computation cost, and preference for some types of disease models.In this study, two scoring functions (Bayesian network based K2-score and Gini-score are used for characterizing two SNP locus as a candidate model, the two criteria are adopted simultaneously for improving identification power and tackling the preference problem to disease models. Harmony search algorithm (HSA is improved for quickly finding the most likely candidate models among all two-locus models, in which a local search algorithm with two-dimensional tabu table is presented to avoid repeatedly evaluating some disease models that have strong marginal effect. Finally G-test statistic is used to further test the candidate models.We investigate our method named FHSA-SED on 82 simulated datasets and a real AMD dataset, and compare it with two typical methods (MACOED and CSE which have been developed recently based on swarm intelligent search algorithm. The results of simulation experiments indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, evaluation times, sensitivity (TPR, specificity (SPC, positive predictive value (PPV and accuracy (ACC. Our method has identified two SNPs (rs3775652 and rs10511467 that may be also associated with disease in AMD dataset.

  10. A genome-wide survey on basic helix-loop-helix transcription factors in rat and mouse.

    Science.gov (United States)

    Zheng, Xiaodong; Zheng, X; Wang, Yong; Wang, Y; Yao, Qin; Yao, Q; Yang, Zhe; Yang, Z; Chen, Keping; Chen, K

    2009-04-01

    The basic helix-loop-helix (bHLH) proteins play essential roles in a wide range of developmental processes in higher organisms. bHLH family members have been identified in over 20 organisms, including nematode, fruit fly, and human. Our study identified 114 rat and 14 additional mouse bHLH members in rat and mouse genomes, respectively. Phylogenetic analyses revealed that both rat and mouse had 49, 26, 15, 4, 12, and 4 bHLH members in groups A, B, C, D, E, and F, respectively. Only the rat Mxi1 gene has two copies in the genome. All other rat bHLH genes and all mouse bHLH genes are single-copy genes. The chromosomal distribution pattern of mouse, rat, and human bHLH genes suggests the emergence of some bHLH genes through gene duplication, which probably happened at least before the divergence of vertebrates from invertebrates. The present study provides useful information for future studies using rat as a model animal for mammalian development.

  11. Genome-wide analysis of simple sequence repeats in the model medicinal mushroom Ganoderma lucidum.

    Science.gov (United States)

    Qian, Jun; Xu, Haibin; Song, Jingyuan; Xu, Jiang; Zhu, Yingjie; Chen, Shilin

    2013-01-10

    Simple sequence repeats (SSRs) or microsatellites are one of the most popular sources of genetic markers and play a significant role in gene function and genome organization. We identified SSRs in the genome of Ganoderma lucidum and analyzed their frequency and distribution in different genomic regions. We also compared the SSRs in G. lucidum with six other Agaricomycetes genomes: Coprinopsis cinerea, Laccaria bicolor, Phanerochaete chrysosporium, Postia placenta, Schizophyllum commune and Serpula lacrymans. Based on our search criteria, the total number of SSRs found ranged from 1206 to 6104 and covered from 0.04% to 0.15% of the fungal genomes. The SSR abundance was not correlated with the genome size, and mono- to tri-nucleotide repeats outnumbered other SSR categories in all of the species examined. In G. lucidum, a repertoire of 2674 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. The highest SSR relative abundance was found in introns (108 SSRs/Mb), followed by intergenic regions (84 SSRs/Mb). A total of 684 SSRs were found in the protein-coding sequences (CDSs) of 588 gene models, with 81.4% of them being tri- or hexa-nucleotides. After scanning for InterPro domains, 280 of these genes were successfully annotated, and 215 of them could be assigned to Gene Ontology (GO) terms. SSRs were also identified in 28 bioactive compound synthesis-related gene models, including one 3-hydroxy-3-methylglutaryl-CoA reductase (HMGR), three polysaccharide biosynthesis genes and 24 cytochrome P450 monooxygenases (CYPs). Primers were designed for the identified SSR loci, providing the basis for the future development of SSR markers of this medicinal fungus.

  12. Combining Genome-Wide Information with a Functional Structural Plant Model to Simulate 1-Year-Old Apple Tree Architecture

    Science.gov (United States)

    Migault, Vincent; Pallas, Benoît; Costes, Evelyne

    2017-01-01

    In crops, optimizing target traits in breeding programs can be fostered by selecting appropriate combinations of architectural traits which determine light interception and carbon acquisition. In apple tree, architectural traits were observed to be under genetic control. However, architectural traits also result from many organogenetic and morphological processes interacting with the environment. The present study aimed at combining a FSPM built for apple tree, MAppleT, with genetic determinisms of architectural traits, previously described in a bi-parental population. We focused on parameters related to organogenesis (phyllochron and immediate branching) and morphogenesis processes (internode length and leaf area) during the first year of tree growth. Two independent datasets collected in 2004 and 2007 on 116 genotypes, issued from a ‘Starkrimson’ × ‘Granny Smith’ cross, were used. The phyllochron was estimated as a function of thermal time and sylleptic branching was modeled subsequently depending on phyllochron. From a genetic map built with SNPs, marker effects were estimated on four MAppleT parameters with rrBLUP, using 2007 data. These effects were then considered in MAppleT to simulate tree development in the two climatic conditions. The genome wide prediction model gave consistent estimations of parameter values with correlation coefficients between observed values and estimated values from SNP markers ranging from 0.79 to 0.96. However, the accuracy of the prediction model following cross validation schemas was lower. Three integrative traits (the number of leaves, trunk length, and number of sylleptic laterals) were considered for validating MAppleT simulations. In 2007 climatic conditions, simulated values were close to observations, highlighting the correct simulation of genetic variability. However, in 2004 conditions which were not used for model calibration, the simulations differed from observations. This study demonstrates the possibility

  13. The SWR1 histone replacement complex causes genetic instability and genome-wide transcription misregulation in the absence of H2A.Z.

    Science.gov (United States)

    Morillo-Huesca, Macarena; Clemente-Ruiz, Marta; Andújar, Eloísa; Prado, Félix

    2010-08-12

    The SWR1 complex replaces the canonical histone H2A with the variant H2A.Z (Htz1 in yeast) at specific chromatin regions. This dynamic alteration in nucleosome structure provides a molecular mechanism to regulate transcription, gene silencing, chromosome segregation and DNA repair. Here we show that genetic instability, sensitivity to drugs impairing different cellular processes and genome-wide transcriptional misregulation in htz1Delta can be partially or totally suppressed if SWR1 is not formed (swr1Delta), if it forms but cannot bind to chromatin (swc2Delta) or if it binds to chromatin but lacks histone replacement activity (swc5Delta and the ATPase-dead swr1-K727G). These results suggest that in htz1Delta the nucleosome remodelling activity of SWR1 affects chromatin integrity because of an attempt to replace H2A with Htz1 in the absence of the latter. This would impair transcription and, either directly or indirectly, other cellular processes. Specifically, we show that in htz1Delta, the SWR1 complex causes an accumulation of recombinogenic DNA damage by a mechanism dependent on phosphorylation of H2A at Ser129, a modification that occurs in response to DNA damage, suggesting that the SWR1 complex impairs the repair of spontaneous DNA damage in htz1Delta. In addition, SWR1 causes DSBs sensitivity in htz1Delta; consistently, in the absence of Htz1 the SWR1 complex bound near an endonuclease HO-induced DSB at the mating-type (MAT) locus impairs DSB-induced checkpoint activation. Our results support a stepwise mechanism for the replacement of H2A with Htz1 and demonstrate that a tight control of this mechanism is essential to regulate chromatin dynamics but also to prevent the deleterious consequences of an incomplete nucleosome remodelling.

  14. The SWR1 histone replacement complex causes genetic instability and genome-wide transcription misregulation in the absence of H2A.Z.

    Directory of Open Access Journals (Sweden)

    Macarena Morillo-Huesca

    Full Text Available The SWR1 complex replaces the canonical histone H2A with the variant H2A.Z (Htz1 in yeast at specific chromatin regions. This dynamic alteration in nucleosome structure provides a molecular mechanism to regulate transcription, gene silencing, chromosome segregation and DNA repair. Here we show that genetic instability, sensitivity to drugs impairing different cellular processes and genome-wide transcriptional misregulation in htz1Delta can be partially or totally suppressed if SWR1 is not formed (swr1Delta, if it forms but cannot bind to chromatin (swc2Delta or if it binds to chromatin but lacks histone replacement activity (swc5Delta and the ATPase-dead swr1-K727G. These results suggest that in htz1Delta the nucleosome remodelling activity of SWR1 affects chromatin integrity because of an attempt to replace H2A with Htz1 in the absence of the latter. This would impair transcription and, either directly or indirectly, other cellular processes. Specifically, we show that in htz1Delta, the SWR1 complex causes an accumulation of recombinogenic DNA damage by a mechanism dependent on phosphorylation of H2A at Ser129, a modification that occurs in response to DNA damage, suggesting that the SWR1 complex impairs the repair of spontaneous DNA damage in htz1Delta. In addition, SWR1 causes DSBs sensitivity in htz1Delta; consistently, in the absence of Htz1 the SWR1 complex bound near an endonuclease HO-induced DSB at the mating-type (MAT locus impairs DSB-induced checkpoint activation. Our results support a stepwise mechanism for the replacement of H2A with Htz1 and demonstrate that a tight control of this mechanism is essential to regulate chromatin dynamics but also to prevent the deleterious consequences of an incomplete nucleosome remodelling.

  15. Combined genome-wide expression profiling and targeted RNA interference in primary mouse macrophages reveals perturbation of transcriptional networks associated with interferon signalling

    Directory of Open Access Journals (Sweden)

    Craigon Marie

    2009-08-01

    Full Text Available Abstract Background Interferons (IFNs are potent antiviral cytokines capable of reprogramming the macrophage phenotype through the induction of interferon-stimulated genes (ISGs. Here we have used targeted RNA interference to suppress the expression of a number of key genes associated with IFN signalling in murine macrophages prior to stimulation with interferon-gamma. Genome-wide changes in transcript abundance caused by siRNA activity were measured using exon-level microarrays in the presence or absence of IFNγ. Results Transfection of murine bone-marrow derived macrophages (BMDMs with a non-targeting (control siRNA and 11 sequence-specific siRNAs was performed using a cationic lipid transfection reagent (Lipofectamine2000 prior to stimulation with IFNγ. Total RNA was harvested from cells and gene expression measured on Affymetrix GeneChip Mouse Exon 1.0 ST Arrays. Network-based analysis of these data revealed six siRNAs to cause a marked shift in the macrophage transcriptome in the presence or absence IFNγ. These six siRNAs targeted the Ifnb1, Irf3, Irf5, Stat1, Stat2 and Nfkb2 transcripts. The perturbation of the transcriptome by the six siRNAs was highly similar in each case and affected the expression of over 600 downstream transcripts. Regulated transcripts were clustered based on co-expression into five major groups corresponding to transcriptional networks associated with the type I and II IFN response, cell cycle regulation, and NF-KB signalling. In addition we have observed a significant non-specific immune stimulation of cells transfected with siRNA using Lipofectamine2000, suggesting use of this reagent in BMDMs, even at low concentrations, is enough to induce a type I IFN response. Conclusion Our results provide evidence that the type I IFN response in murine BMDMs is dependent on Ifnb1, Irf3, Irf5, Stat1, Stat2 and Nfkb2, and that siRNAs targeted to these genes results in perturbation of key transcriptional networks associated

  16. Genome-wide cooperation by HAT Gcn5, remodeler SWI/SNF, and chaperone Ydj1 in promoter nucleosome eviction and transcriptional activation.

    Science.gov (United States)

    Qiu, Hongfang; Chereji, Răzvan V; Hu, Cuihua; Cole, Hope A; Rawal, Yashpal; Clark, David J; Hinnebusch, Alan G

    2016-02-01

    Chaperones, nucleosome remodeling complexes, and histone acetyltransferases have been implicated in nucleosome disassembly at promoters of particular yeast genes, but whether these cofactors function ubiquitously, as well as the impact of nucleosome eviction on transcription genome-wide, is poorly understood. We used chromatin immunoprecipitation of histone H3 and RNA polymerase II (Pol II) in mutants lacking single or multiple cofactors to address these issues for about 200 genes belonging to the Gcn4 transcriptome, of which about 70 exhibit marked reductions in H3 promoter occupancy on induction by amino acid starvation. Examining four target genes in a panel of mutants indicated that SWI/SNF, Gcn5, the Hsp70 cochaperone Ydj1, and chromatin-associated factor Yta7 are required downstream from Gcn4 binding, whereas Asf1/Rtt109, Nap1, RSC, and H2AZ are dispensable for robust H3 eviction in otherwise wild-type cells. Using ChIP-seq to interrogate all 70 exemplar genes in single, double, and triple mutants implicated Gcn5, Snf2, and Ydj1 in H3 eviction at most, but not all, Gcn4 target promoters, with Gcn5 generally playing the greatest role and Ydj1 the least. Remarkably, these three cofactors cooperate similarly in H3 eviction at virtually all yeast promoters. Defective H3 eviction in cofactor mutants was coupled with reduced Pol II occupancies for the Gcn4 transcriptome and the most highly expressed uninduced genes, but the relative Pol II levels at most genes were unaffected or even elevated. These findings indicate that nucleosome eviction is crucial for robust transcription of highly expressed genes but that other steps in gene activation are more rate-limiting for most other yeast genes.

  17. A genome-wide identification and analysis of the basic helix-loop-helix transcription factors in the ponerine ant, Harpegnathos saltator

    Directory of Open Access Journals (Sweden)

    Liu Ake

    2012-08-01

    Full Text Available Abstract Background The basic helix-loop-helix (bHLH transcription factors and their homologs form a superfamily that plays essential roles in transcriptional networks of multiple developmental processes. bHLH family members have been identified in over 20 organisms, including fruit fly, zebrafish, human and mouse. Result In this study, we conducted a genome-wide survey for bHLH sequences, and identified 57 bHLH sequences encoded in complete genome sequence of the ponerine ant, Harpegnathos saltator. Phylogenetic analysis of the bHLH domain sequences classified these genes into 38 bHLH families with 23, 14, 10, 1, 8 and 1 members in group A, B, C, D, E and F, respectively. The number of PabHLHs (ponerine ant bHLHs with introns is higher than many other insect species, and they are found to have introns with average lengths only inferior to those of pea aphid. In addition, two H. saltator bHLHs named PaCrp1 and PaSide locate on two separate contigs in the genome. Conclusions A putative full set of PabHLH genes is comparable with other insect species and genes encoding Oligo, MyoRb and Figα were not found in genomes of all insect species of which bHLH family members have been identified. Moreover, in-family phylogenetic analyses indicate that the PabHLH genes are more closely related with Apis mellifera than others. The present study will serve as a solid foundation for further investigations into the structure and function of bHLH proteins in the regulation of H. saltator development.

  18. iTAK: A program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators and protein kinases

    Science.gov (United States)

    Transcription factors (TFs) are proteins that regulate the expression of target genes by binding to specific elements in their regulatory regions. Transcriptional regulators (TRs) also regulate the expression of target genes; however, they operate indirectly via interaction with the basal transcript...

  19. Genetic model selection in genome-wide association studies: robust methods and the use of meta-analysis.

    Science.gov (United States)

    Bagos, Pantelis G

    2013-06-01

    In genetic association studies (GAS) as well as in genome-wide association studies (GWAS), the mode of inheritance (dominant, additive and recessive) is usually not known a priori. Assuming an incorrect mode of inheritance may lead to substantial loss of power, whereas on the other hand, testing all possible models may result in an increased type I error rate. The situation is even more complicated in the meta-analysis of GAS or GWAS, in which individual studies are synthesized to derive an overall estimate. Meta-analysis increases the power to detect weak genotype effects, but heterogeneity and incompatibility between the included studies complicate things further. In this review, we present a comprehensive summary of the statistical methods used for robust analysis and genetic model selection in GAS and GWAS. We then discuss the application of such methods in the context of meta-analysis. We describe the theoretical properties of the various methods and the foundations on which they are based. We also present the available software implementations of the described methods. Finally, since only few of the available robust methods have been applied in the meta-analysis setting, we present some simple extensions that allow robust meta-analysis of GAS and GWAS. Possible extensions and proposals for future work are also discussed.

  20. DOMINO: development of informative molecular markers for phylogenetic and genome-wide population genetic studies in non-model organisms.

    Science.gov (United States)

    Frías-López, Cristina; Sánchez-Herrero, José F; Guirao-Rico, Sara; Mora, Elisa; Arnedo, Miquel A; Sánchez-Gracia, Alejandro; Rozas, Julio

    2016-12-15

    The development of molecular markers is one of the most important challenges in phylogenetic and genome wide population genetics studies, especially in studies with non-model organisms. A highly promising approach for obtaining suitable markers is the utilization of genomic partitioning strategies for the simultaneous discovery and genotyping of a large number of markers. Unfortunately, not all markers obtained from these strategies provide enough information for solving multiple evolutionary questions at a reasonable taxonomic resolution. We have developed Development Of Molecular markers In Non-model Organisms (DOMINO), a bioinformatics tool for informative marker development from both next generation sequencing (NGS) data and pre-computed sequence alignments. The application implements popular NGS tools with new utilities in a highly versatile pipeline specifically designed to discover or select personalized markers at different levels of taxonomic resolution. These markers can be directly used to study the taxa surveyed for their design, utilized for further downstream PCR amplification in a broader set taxonomic scope, or exploited as suitable templates to bait design for target DNA enrichment techniques. We conducted an exhaustive evaluation of the performance of DOMINO via computer simulations and illustrate its utility to find informative markers in an empirical dataset. DOMINO is freely available from www.ub.edu/softevol/domino CONTACT: elsanchez@ub.edu or jrozas@ub.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Genome-wide association study of the five-factor model of personality in young Korean women.

    Science.gov (United States)

    Kim, Han-Na; Roh, Seung-Ju; Sung, Yeon Ah; Chung, Hye Won; Lee, Jong-Young; Cho, Juhee; Shin, Hocheol; Kim, Hyung-Lae

    2013-10-01

    Personality is a determinant of behavior and lifestyle associated with health and human diseases. Although personality is known to be a heritable trait, its polygenic nature has made the identification of genetic variants elusive. We performed a genome-wide association study on 1089 Korean women aged 18-40 years whose personality traits were measured with the Revised NEO Personality Inventory for the five-factor model of personality. To reduce environmental factors that may influence personality traits, this study was restricted to young adult women. In the discovery phase, we identified variants of PTPRD (protein tyrosine phosphatase, receptor type D) that associated this gene with the Openness domain. Other genes that were previously reported to be associated with neurological phenotypes were also associated with personality traits. In particular, DRD1 and OR1A2 were linked to Neuroticism, NKAIN2 with Extraversion, HTR5A with Openness and DRD3 with Agreeableness. Data from our replication study of 2090 subjects confirmed the association between OR1A2 and Neuroticism. We first identified and confirmed a novel region on OR1A2 associated with Neuroticism [corrected]. Candidate genes for psychiatric disorders were also enriched. These findings contribute to our understanding of the genetic architecture of personality traits and provide critical clues to the neurobiological mechanisms that influence them.

  2. A genome-wide CNV analysis of schizophrenia reveals a potential role for a multiple-hit model.

    Science.gov (United States)

    Rudd, Danielle S; Axelsen, Michael; Epping, Eric A; Andreasen, Nancy C; Wassink, Thomas H

    2014-12-01

    Schizophrenia is a chronic and severe psychiatric disorder that is highly heritable. While both common and rare genetic variants contribute to disease risk, many questions still remain about disease etiology. We performed a genome-wide analysis of copy number variants (CNVs) in 166 schizophrenia subjects and 52 psychiatrically healthy controls. First, overall CNV characteristics were compared between cases and controls. The only statistically significant finding was that deletions comprised a greater proportion of CNVs in cases. High interest CNVs were then identified as conservative using the following filtering criteria: (i) known deleterious CNVs; (ii) CNVs > 1 Mb that were novel (not found in a database of control individuals); and (iii) CNVs 1 Mb) or with multiple conservative CNVs. Two case individuals with the highest burden of conservative CNVs also share a recurrent 15q11.2 BP1-2 deletion, indicating a role for a potential multiple-hit CNV model for schizophrenia. In total, we report three 15q11.2 BP1-2 deletion individuals with schizophrenia, adding to a growing body of evidence that this CNV is involved in disease etiology.

  3. Genome-wide transcriptional analysis of apoptosis-related genes and pathways regulated by H2AX in lung cancer A549 cells.

    Science.gov (United States)

    Lu, Chengrong; Xiong, Min; Luo, Yuan; Li, Jing; Zhang, Yanjun; Dong, Yaqiong; Zhu, Yanjun; Niu, Tianhui; Wang, Zhe; Duan, Lianning

    2013-09-01

    Histone H2AX is a novel tumor suppressor protein and plays an important role in apoptosis of cancer cells. However, the role of H2AX in lung cancer cells is unclear. The detailed mechanism and epigenetic regulation by H2AX remain elusive in cancer cells. We showed that H2AX was involved in apoptosis of lung cancer A549 cells as in other tumor cells. Knockdown of H2AX strongly suppressed apoptosis of A549 cells. We clarified the molecular mechanisms of apoptosis regulated by H2AX based on genome-wide transcriptional analysis. Microarray data analysis demonstrated that H2AX knockdown in A549 cells affected expression of 3,461 genes, including upregulation of 1,435 and downregulation of 2,026. These differentially expressed genes were subjected to bioinformatic analysis for exploring biological processes regulated by H2AX in lung cancer cells. Gene ontology analysis showed that H2AX affected expression of many genes, through which, many important functions including response to stimuli, gene expression, and apoptosis were involved in apoptotic regulation of lung cancer cells. Pathway analysis identified the mitogen-activated protein kinase signaling pathway and apoptosis as the most important pathways targeted by H2AX. Signal transduction pathway networks analysis and chromatin immunoprecipitation assay showed that two core genes, NFKB1 and JUN, were involved in apoptosis regulated by H2AX in lung cancer cells. Taken together, these data provide compelling clues for further exploration of H2AX function in cancer cells.

  4. Genome-wide analysis and identification of stress-responsive genes of the NAM-ATAF1,2-CUC2 transcription factor family in apple.

    Science.gov (United States)

    Su, Hongyan; Zhang, Shizhong; Yuan, Xiaowei; Chen, Changtian; Wang, Xiao-Fei; Hao, Yu-Jin

    2013-10-01

    NAC (NAM, ATAF1,2, and CUC2) proteins constitute one of the largest families of plant-specific transcription factors. To date, little is known about the NAC genes in the apple (Malus domestica). In this study, a total of 180 NAC genes were identified in the apple genome and were phylogenetically clustered into six groups (I-VI) with the NAC genes from Arabidopsis and rice. The predicted apple NAC genes were distributed across all of 17 chromosomes at various densities. Additionally, the gene structure and motif compositions of the apple NAC genes were analyzed. Moreover, the expression of 29 selected apple NAC genes was analyzed in different tissues and under different abiotic stress conditions. All of the selected genes, with the exception of four genes, were expressed in at least one of the tissues tested, which indicates that the NAC genes are involved in various aspects of the physiological and developmental processes of the apple. Encouragingly, 17 of the selected genes were found to respond to one or more of the abiotic stress treatments, and these 17 genes included not only the expected 7 genes that were clustered with the well-known stress-related marker genes in group IV but also 10 genes located in other subgroups, none of which contains members that have been reported to be stress-related. To the best of our knowledge, this report describes the first genome-wide analysis of the apple NAC gene family, and the results should provide valuable information for understanding the classification and putative functions of this family.

  5. Local genealogies in a linear mixed model for genome-wide association mapping in complex pedigreed populations.

    Directory of Open Access Journals (Sweden)

    Goutam Sahana

    Full Text Available INTRODUCTION: The state-of-the-art for dealing with multiple levels of relationship among the samples in genome-wide association studies (GWAS is unified mixed model analysis (MMA. This approach is very flexible, can be applied to both family-based and population-based samples, and can be extended to incorporate other effects in a straightforward and rigorous fashion. Here, we present a complementary approach, called 'GENMIX (genealogy based mixed model' which combines advantages from two powerful GWAS methods: genealogy-based haplotype grouping and MMA. SUBJECTS AND METHODS: We validated GENMIX using genotyping data of Danish Jersey cattle and simulated phenotype and compared to the MMA. We simulated scenarios for three levels of heritability (0.21, 0.34, and 0.64, seven levels of MAF (0.05, 0.10, 0.15, 0.20, 0.25, 0.35, and 0.45 and five levels of QTL effect (0.1, 0.2, 0.5, 0.7 and 1.0 in phenotypic standard deviation unit. Each of these 105 possible combinations (3 h(2 x 7 MAF x 5 effects of scenarios was replicated 25 times. RESULTS: GENMIX provides a better ranking of markers close to the causative locus' location. GENMIX outperformed MMA when the QTL effect was small and the MAF at the QTL was low. In scenarios where MAF was high or the QTL affecting the trait had a large effect both GENMIX and MMA performed similarly. CONCLUSION: In discovery studies, where high-ranking markers are identified and later examined in validation studies, we therefore expect GENMIX to enrich candidates brought to follow-up studies with true positives over false positives more than the MMA would.

  6. Northern glacial refugia and altitudinal niche divergence shape genome-wide differentiation in the emerging plant model Arabidopsis arenosa.

    Science.gov (United States)

    Kolář, Filip; Fuxová, Gabriela; Záveská, Eliška; Nagano, Atsushi J; Hyklová, Lucie; Lučanová, Magdalena; Kudoh, Hiroshi; Marhold, Karol

    2016-08-01

    Quaternary climatic oscillations profoundly impacted temperate biodiversity. For many diverse yet undersampled areas, however, the consequences of this impact are still poorly known. In Europe, particular uncertainty surrounds the role of Balkans, a major hotspot of European diversity, in postglacial recolonization of more northerly areas, and the Carpathians, a debatable candidate for a northern 'cryptic' glacial refugium. Using genome-wide SNPs and microsatellites, we examined how the interplay of historical processes and niche shifts structured genetic diversity of diploid Arabidopsis arenosa, a little-known member of the plant model genus that occupies a wide niche range from sea level to alpine peaks across eastern temperate Europe. While the northern Balkans hosted one isolated endemic lineage, most of the genetic diversity was concentrated further north in the Pannonian Basin and the Carpathians, where it likely survived the last glaciation in northern refugia. Finally, a distinct postglacial environment in northern Europe was colonized by populations of admixed origin from the two Carpathian lineages. Niche differentiation along altitude-related bioclimatic gradients was the main trend in the phylogeny of A. arenosa. The most prominent niche shifts, however, characterized genetically only slightly divergent populations that expanded into narrowly defined alpine and northern coastal postglacial environments. Our study highlights the role of eastern central European mountains not only as refugia for unique temperate diversity but also sources for postglacial expansion into novel high-altitude and high-latitude niches. Knowledge of distinct genetic substructure of diploid A. arenosa also opens new opportunities for follow-up studies of this emerging model of evolutionary biology.

  7. Genome wide transcriptional analysis of resting and IL2 activated human natural killer cells: gene expression signatures indicative of novel molecular signaling pathways

    Directory of Open Access Journals (Sweden)

    Schmitz Alexander

    2007-07-01

    Full Text Available Abstract Background Human natural killer (NK cells are the key contributors of innate immune response and the effector functions of these cells are enhanced by cytokines such as interleukine 2 (IL2. We utilized genome-wide transcriptional profiling to identify gene expression signatures and pathways in resting and IL2 activated NK cell isolated from peripheral blood of healthy donors. Results Gene expression profiling of resting NK cells showed high expression of a number of cytotoxic factors, cytokines, chemokines and inhibitory and activating surface NK receptors. Resting NK cells expressed many genes associated with cellular quiescence and also appeared to have an active TGFβ (TGFB1 signaling pathway. IL2 stimulation induced rapid downregulation of quiescence associated genes and upregulation of genes associated with cell cycle progression and proliferation. Numerous genes that may enhance immune function and responsiveness including activating receptors (DNAM1, KLRC1 and KLRC3, death receptor ligand (TNFSF6 (FASL and TRAIL, chemokine receptors (CX3CR1, CCR5 and CCR7, interleukin receptors (IL2RG, IL18RAB and IL27RA and members of secretory pathways (DEGS1, FKBP11, SSR3, SEC61G and SLC3A2 were upregulated. The expression profile suggested PI3K/AKT activation and NF-κB activation through multiple pathways (TLR/IL1R, TNF receptor induced and TCR-like possibly involving BCL10. Activation of NFAT signaling was supported by increased expression of many pathway members and downstream target genes. The transcription factor GATA3 was expressed in resting cells while T-BET was upregulated on activation concurrent with the change in cytokine expression profile. The importance of NK cells in innate immune response was also reflected by late increased expression of inflammatory chemotactic factors and receptors and molecules involved in adhesion and lymphocyte trafficking or migration. Conclusion This analysis allowed us to identify genes implicated in

  8. Modelling human regulatory variation in mouse: finding the function in genome-wide association studies and whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Jean-François Schmouth

    Full Text Available An increasing body of literature from genome-wide association studies and human whole-genome sequencing highlights the identification of large numbers of candidate regulatory variants of potential therapeutic interest in numerous diseases. Our relatively poor understanding of the functions of non-coding genomic sequence, and the slow and laborious process of experimental validation of the functional significance of human regulatory variants, limits our ability to fully benefit from this information in our efforts to comprehend human disease. Humanized mouse models (HuMMs, in which human genes are introduced into the mouse, suggest an approach to this problem. In the past, HuMMs have been used successfully to study human disease variants; e.g., the complex genetic condition arising from Down syndrome, common monogenic disorders such as Huntington disease and β-thalassemia, and cancer susceptibility genes such as BRCA1. In this commentary, we highlight a novel method for high-throughput single-copy site-specific generation of HuMMs entitled High-throughput Human Genes on the X Chromosome (HuGX. This method can be applied to most human genes for which a bacterial artificial chromosome (BAC construct can be derived and a mouse-null allele exists. This strategy comprises (1 the use of recombineering technology to create a human variant-harbouring BAC, (2 knock-in of this BAC into the mouse genome using Hprt docking technology, and (3 allele comparison by interspecies complementation. We demonstrate the throughput of the HuGX method by generating a series of seven different alleles for the human NR2E1 gene at Hprt. In future challenges, we consider the current limitations of experimental approaches and call for a concerted effort by the genetics community, for both human and mouse, to solve the challenge of the functional analysis of human regulatory variation.

  9. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses

    Directory of Open Access Journals (Sweden)

    Lisette J. A. Kogelman

    2014-07-01

    Full Text Available Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome investigations focusing on single genetic variants have achieved limited success, and the importance of including genetic interactions is becoming evident. Here, the aim was to perform an integrative genomic analysis in an F2 pig resource population that was constructed with an aim to maximize genetic variation of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation of haplotype blocks. We built Weighted Interaction SNP Hub (WISH and differentially wired (DW networks using genotypic correlations amongst obesity-associated SNPs resulting from GWA analysis. GWA results and SNP modules detected by WISH and DW analyses were further investigated by functional enrichment analyses. The functional annotation of SNPs revealed several genes associated with obesity, e.g. NPC2 and OR4D10. Moreover, gene enrichment analyses identified several significantly associated pathways, over and above the GWA study results, that may influence obesity and obesity related diseases, e.g. metabolic processes. WISH networks based on genotypic correlations allowed further identification of various gene ontology terms and pathways related to obesity and related traits, which were not identified by the GWA study. In conclusion, this is the first study to develop a (genetic obesity index and employ systems genetics in a porcine model to provide important insights into the complex genetic architecture associated with obesity and many biological pathways

  10. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network, and pathway analyses.

    Science.gov (United States)

    Kogelman, Lisette J A; Pant, Sameer D; Fredholm, Merete; Kadarmideen, Haja N

    2014-01-01

    Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome investigations focusing on single genetic variants have achieved limited success, and the importance of including genetic interactions is becoming evident. Here, the aim was to perform an integrative genomic analysis in an F2 pig resource population that was constructed with an aim to maximize genetic variation of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation of haplotype blocks. We built Weighted Interaction SNP Hub (WISH) and differentially wired (DW) networks using genotypic correlations amongst obesity-associated SNPs resulting from GWA analysis. GWA results and SNP modules detected by WISH and DW analyses were further investigated by functional enrichment analyses. The functional annotation of SNPs revealed several genes associated with obesity, e.g., NPC2 and OR4D10. Moreover, gene enrichment analyses identified several significantly associated pathways, over and above the GWA study results, that may influence obesity and obesity related diseases, e.g., metabolic processes. WISH networks based on genotypic correlations allowed further identification of various gene ontology terms and pathways related to obesity and related traits, which were not identified by the GWA study. In conclusion, this is the first study to develop a (genetic) obesity index and employ systems genetics in a porcine model to provide important insights into the complex genetic architecture associated with obesity and many biological pathways that underlie

  11. Genome-wide identification of enhancer elements.

    Science.gov (United States)

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems.

  12. Large-scale integration of small molecule-induced genome-wide transcriptional responses, Kinome-wide binding affinities and cell-growth inhibition profiles reveal global trends characterizing systems-level drug action

    Directory of Open Access Journals (Sweden)

    Dusica eVidovic

    2014-09-01

    Full Text Available The Library of Integrated Network-based Cellular Signatures (LINCS project is a large-scale coordinated effort to build a comprehensive systems biology reference resource. The goals of the program include the generation of a very large multidimensional data matrix and informatics and computational tools to integrate, analyze, and make the data readily accessible. LINCS data include genome-wide transcriptional signatures, biochemical protein binding profiles, cellular phenotypic response profiles and various other datasets for a wide range of cell model systems and molecular and genetic perturbations. Here we present a partial survey of this data facilitated by data standards and in particular a robust compound standardization workflow; we integrated several types of LINCS signatures and analyzed the results with a focus on mechanism of action and chemical compounds. We illustrate how kinase targets can be related to disease models and relevant drugs. We identified some fundamental trends that appear to link Kinome binding profiles and transcriptional signatures to chemical information and biochemical binding profiles to transcriptional responses independent of chemical similarity. To fill gaps in the datasets we developed and applied predictive models. The results can be interpreted at the systems level as demonstrated based on a large number of signaling pathways. We can identify clear global relationships, suggesting robustness of cellular responses to chemical perturbation. Overall, the results suggest that chemical similarity is a useful measure at the systems level, which would support phenotypic drug optimization efforts. With this study we demonstrate the potential of such integrated analysis approaches and suggest prioritizing further experiments to fill the gaps in the current data.

  13. Large-scale integration of small molecule-induced genome-wide transcriptional responses, Kinome-wide binding affinities and cell-growth inhibition profiles reveal global trends characterizing systems-level drug action.

    Science.gov (United States)

    Vidović, Dušica; Koleti, Amar; Schürer, Stephan C

    2014-01-01

    The Library of Integrated Network-based Cellular Signatures (LINCS) project is a large-scale coordinated effort to build a comprehensive systems biology reference resource. The goals of the program include the generation of a very large multidimensional data matrix and informatics and computational tools to integrate, analyze, and make the data readily accessible. LINCS data include genome-wide transcriptional signatures, biochemical protein binding profiles, cellular phenotypic response profiles and various other datasets for a wide range of cell model systems and molecular and genetic perturbations. Here we present a partial survey of this data facilitated by data standards and in particular a robust compound standardization workflow; we integrated several types of LINCS signatures and analyzed the results with a focus on mechanism of action (MoA) and chemical compounds. We illustrate how kinase targets can be related to disease models and relevant drugs. We identified some fundamental trends that appear to link Kinome binding profiles and transcriptional signatures to chemical information and biochemical binding profiles to transcriptional responses independent of chemical similarity. To fill gaps in the datasets we developed and applied predictive models. The results can be interpreted at the systems level as demonstrated based on a large number of signaling pathways. We can identify clear global relationships, suggesting robustness of cellular responses to chemical perturbation. Overall, the results suggest that chemical similarity is a useful measure at the systems level, which would support phenotypic drug optimization efforts. With this study we demonstrate the potential of such integrated analysis approaches and suggest prioritizing further experiments to fill the gaps in the current data.

  14. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Pant, Sameer Dinkar; Fredholm, Merete;

    2014-01-01

    Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome...... of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation...

  15. Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network and pathway analyses

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Pant, Sameer Dinkar; Fredholm, Merete

    2014-01-01

    Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome...... of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation...

  16. The effect of Nipped-B-like (Nipbl) haploinsufficiency on genome-wide cohesin binding and target gene expression: modeling Cornelia de Lange syndrome.

    Science.gov (United States)

    Newkirk, Daniel A; Chen, Yen-Yun; Chien, Richard; Zeng, Weihua; Biesinger, Jacob; Flowers, Ebony; Kawauchi, Shimako; Santos, Rosaysela; Calof, Anne L; Lander, Arthur D; Xie, Xiaohui; Yokomori, Kyoko

    2017-01-01

    Cornelia de Lange syndrome (CdLS) is a multisystem developmental disorder frequently associated with heterozygous loss-of-function mutations of Nipped-B-like (NIPBL), the human homolog of Drosophila Nipped-B. NIPBL loads cohesin onto chromatin. Cohesin mediates sister chromatid cohesion important for mitosis but is also increasingly recognized as a regulator of gene expression. In CdLS patient cells and animal models, expression changes of multiple genes with little or no sister chromatid cohesion defect suggests that disruption of gene regulation underlies this disorder. However, the effect of NIPBL haploinsufficiency on cohesin binding, and how this relates to the clinical presentation of CdLS, has not been fully investigated. Nipbl haploinsufficiency causes CdLS-like phenotype in mice. We examined genome-wide cohesin binding and its relationship to gene expression using mouse embryonic fibroblasts (MEFs) from Nipbl+/- mice that recapitulate the CdLS phenotype. We found a global decrease in cohesin binding, including at CCCTC-binding factor (CTCF) binding sites and repeat regions. Cohesin-bound genes were found to be enriched for histone H3 lysine 4 trimethylation (H3K4me3) at their promoters; were disproportionately downregulated in Nipbl mutant MEFs; and displayed evidence of reduced promoter-enhancer interaction. The results suggest that gene activation is the primary cohesin function sensitive to Nipbl reduction. Over 50% of significantly dysregulated transcripts in mutant MEFs come from cohesin target genes, including genes involved in adipogenesis that have been implicated in contributing to the CdLS phenotype. Decreased cohesin binding at the gene regions is directly linked to disease-specific expression changes. Taken together, our Nipbl haploinsufficiency model allows us to analyze the dosage effect of cohesin loading on CdLS development.

  17. Genome-wide metabolic model to improve understanding of CD4(+) T cell metabolism, immunometabolism and application in drug design.

    Science.gov (United States)

    Han, Feifei; Li, Gonghua; Dai, Shaoxing; Huang, Jingfei

    2016-02-01

    CD4(+) T cells play a critical role in adaptive immunity and have been well studied in past decades. However, the systematic metabolism features are less clear. Here, we reconstructed the genome-wide metabolic network of naïve CD4(+) T cells, CD4T1670, by integrating transcriptome and metabolism data. We performed simulations for three critical metabolic subsystems (carbohydrate metabolism, fatty acid metabolism and glutaminolysis). The results were consistent with most experimental observations. Furthermore, we found that depletion of either glucose or glutamine did not significantly affect ATP production and biomass, but dramatically unbalanced the metabolic network and increased the release of some inflammation or anti-inflammation related factors, such as lysophosphatidylcholine, leukotriene and hyaluronan. Genome-wide single gene knockout analysis showed that acetyl-CoA carboxylase 1 (ACC1) was essential for T cell activation. We further investigated the role of immunometabolic genes in metabolic network stability, and found that over 25% of them were essential. The results also showed that although PTEN is a well-studied proliferation inhibitor, it was essential for maintaining the stability of CD4 metabolic networks. Finally, we applied CD4T1670 to evaluate the side-effects of certain drugs in preclinical experiments. These results suggested that CD4T1670 would be useful in understanding CD4(+) T cells and drug design systematically.

  18. RNA-seq in the tetraploid Xenopus laevis enables genome-wide insight in a classic developmental biology model organism.

    Science.gov (United States)

    Amin, Nirav M; Tandon, Panna; Osborne Nishimura, Erin; Conlon, Frank L

    2014-04-01

    Advances in sequencing technology have significantly advanced the landscape of developmental biology research. The dissection of genetic networks in model and non-model organisms has been greatly enhanced with high-throughput sequencing technologies. RNA-seq has revolutionized the ability to perform developmental biology research in organisms without a published genome sequence. Here, we describe a protocol for developmental biologists to perform RNA-seq on dissected tissue or whole embryos. We start with the isolation of RNA and generation of sequencing libraries. We further show how to interpret and analyze the large amount of sequencing data that is generated in RNA-seq. We explore the abilities to examine differential expression, gene duplication, transcript assembly, alternative splicing and SNP discovery. For the purposes of this article, we use Xenopus laevis as the model organism to discuss uses of RNA-seq in an organism without a fully annotated genome sequence. Copyright © 2013. Published by Elsevier Inc.

  19. Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles.

    Science.gov (United States)

    Wang, Su; Zang, Chongzhi; Xiao, Tengfei; Fan, Jingyu; Mei, Shenglin; Qin, Qian; Wu, Qiu; Li, Xujuan; Xu, Kexin; He, Housheng Hansen; Brown, Myles; Meyer, Clifford A; Liu, X Shirley

    2016-10-01

    Model-based analysis of regulation of gene expression (MARGE) is a framework for interpreting the relationship between the H3K27ac chromatin environment and differentially expressed gene sets. The framework has three main functions: MARGE-potential, MARGE-express, and MARGE-cistrome. MARGE-potential defines a regulatory potential (RP) for each gene as the sum of H3K27ac ChIP-seq signals weighted by a function of genomic distance from the transcription start site. The MARGE framework includes a compendium of RPs derived from 365 human and 267 mouse H3K27ac ChIP-seq data sets. Relative RPs, scaled using this compendium, are superior to superenhancers in predicting BET (bromodomain and extraterminal domain) -inhibitor repressed genes. MARGE-express, which uses logistic regression to retrieve relevant H3K27ac profiles from the compendium to accurately model a query set of differentially expressed genes, was tested on 671 diverse gene sets from MSigDB. MARGE-cistrome adopts a novel semisupervised learning approach to identify cis-regulatory elements regulating a gene set. MARGE-cistrome exploits information from H3K27ac signal at DNase I hypersensitive sites identified from published human and mouse DNase-seq data. We tested the framework on newly generated RNA-seq and H3K27ac ChIP-seq profiles upon siRNA silencing of multiple transcriptional and epigenetic regulators in a prostate cancer cell line, LNCaP-abl. MARGE-cistrome can predict the binding sites of silenced transcription factors without matched H3K27ac ChIP-seq data. Even when the matching H3K27ac ChIP-seq profiles are available, MARGE leverages public H3K27ac profiles to enhance these data. This study demonstrates the advantage of integrating a large compendium of historical epigenetic data for genomic studies of transcriptional regulation. © 2016 Wang et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Genome-wide transcriptional profiling of the purple sulfur bacterium Allochromatium vinosum DSM 180T during growth on different reduced sulfur compounds.

    Science.gov (United States)

    Weissgerber, Thomas; Dobler, Nadine; Polen, Tino; Latus, Jeanette; Stockdreher, Yvonne; Dahl, Christiane

    2013-09-01

    The purple sulfur bacterium Allochromatium vinosum DSM 180(T) is one of the best-studied sulfur-oxidizing anoxygenic phototrophic bacteria, and it has been developed into a model organism for laboratory-based studies of oxidative sulfur metabolism. Here, we took advantage of the organism's high metabolic versatility and performed whole-genome transcriptional profiling to investigate the response of A. vinosum cells upon exposure to sulfide, thiosulfate, elemental sulfur, or sulfite compared to photoorganoheterotrophic growth on malate. Differential expression of 1,178 genes was observed, corresponding to 30% of the A. vinosum genome. Relative transcription of 551 genes increased significantly during growth on one of the different sulfur sources, while the relative transcript abundance of 627 genes decreased. A significant number of genes that revealed strongly enhanced relative transcription levels have documented sulfur metabolism-related functions. Among these are the dsr genes, including dsrAB for dissimilatory sulfite reductase, and the sgp genes for the proteins of the sulfur globule envelope, thus confirming former results. In addition, we identified new genes encoding proteins with appropriate subcellular localization and properties to participate in oxidative dissimilatory sulfur metabolism. Those four genes for hypothetical proteins that exhibited the strongest increases of mRNA levels on sulfide and elemental sulfur, respectively, were chosen for inactivation and phenotypic analyses of the respective mutant strains. This approach verified the importance of the encoded proteins for sulfur globule formation during the oxidation of sulfide and thiosulfate and thereby also documented the suitability of comparative transcriptomics for the identification of new sulfur-related genes in anoxygenic phototrophic sulfur bacteria.

  1. Genome-wide analysis of bHLH transcription factor and involvement in the infection by yellow leaf curl virus in tomato (Solanum lycopersicum).

    Science.gov (United States)

    Wang, Jinyan; Hu, Zhongze; Zhao, Tongmin; Yang, Yuwen; Chen, Tianzi; Yang, Mali; Yu, Wengui; Zhang, Baolong

    2015-02-05

    The basic helix-loop-helix (bHLH) proteins are a superfamily of transcription factors that can bind to specific DNA target sites. They have been well characterized in model plants such as Arabidopsis and rice and have been shown to be important regulatory components in many different biological processes. However, no systemic analysis of the bHLH transcription factor family has yet been reported in tomatoes. Tomato yellow leaf curl virus (TYLCV) threatens tomato production worldwide by causing leaf yellowing, leaf curling, plant stunting and flower abscission. A total of 152 bHLH transcription factors were identified from the entire tomato genome. Phylogenetic analysis of bHLH domain sequences from Arabidopsis and tomato facilitated classification of these genes into 26 subfamilies. The evolutionary and possible functional relationships revealed during this analysis are supported by other criteria, including the chromosomal distribution of these genes, the conservation of motifs and exon/intron structural patterns, and the predicted DNA binding activities within subfamilies. Distribution mapping results showed bHLH genes were localized on the 12 tomato chromosomes. Among the 152 bHLH genes from the tomato genome, 96 bHLH genes were detected in the TYLCV-susceptible and resistant tomato breeding line before (0 dpi) and after TYLCV (357 dpi) infection. As anticipated, gene ontology (GO) analysis indicated that most bHLH genes are related to the regulation of macromolecule metabolic processes and gene expression. Only four bHLH genes were differentially expressed between 0 and 357 dpi. Virus-induced gene silencing (VIGS) of one bHLH genes SlybHLH131 in resistant lines can lead to the cell death. In the present study, 152 bHLH transcription factor genes were identified. One of which bHLH genes, SlybHLH131, was found to be involved in the TYLCV infection through qRT-PCR expression analysis and VIGS validation. The isolation and identification of these bHLH transcription

  2. Disruption of Transcriptional Coactivator Sub1 Leads to Genome-Wide Re-distribution of Clustered Mutations Induced by APOBEC in Active Yeast Genes.

    Science.gov (United States)

    Lada, Artem G; Kliver, Sergei F; Dhar, Alok; Polev, Dmitrii E; Masharsky, Alexey E; Rogozin, Igor B; Pavlov, Youri I

    2015-05-01

    Mutations in genomes of species are frequently distributed non-randomly, resulting in mutation clusters, including recently discovered kataegis in tumors. DNA editing deaminases play the prominent role in the etiology of these mutations. To gain insight into the enigmatic mechanisms of localized hypermutagenesis that lead to cluster formation, we analyzed the mutational single nucleotide variations (SNV) data obtained by whole-genome sequencing of drug-resistant mutants induced in yeast diploids by AID/APOBEC deaminase and base analog 6-HAP. Deaminase from sea lamprey, PmCDA1, induced robust clusters, while 6-HAP induced a few weak ones. We found that PmCDA1, AID, and APOBEC1 deaminases preferentially mutate the beginning of the actively transcribed genes. Inactivation of transcription initiation factor Sub1 strongly reduced deaminase-induced can1 mutation frequency, but, surprisingly, did not decrease the total SNV load in genomes. However, the SNVs in the genomes of the sub1 clones were re-distributed, and the effect of mutation clustering in the regions of transcription initiation was even more pronounced. At the same time, the mutation density in the protein-coding regions was reduced, resulting in the decrease of phenotypically detected mutants. We propose that the induction of clustered mutations by deaminases involves: a) the exposure of ssDNA strands during transcription and loss of protection of ssDNA due to the depletion of ssDNA-binding proteins, such as Sub1, and b) attainment of conditions favorable for APOBEC action in subpopulation of cells, leading to enzymatic deamination within the currently expressed genes. This model is applicable to both the initial and the later stages of oncogenic transformation and explains variations in the distribution of mutations and kataegis events in different tumor cells.

  3. Genome-wide gene expression profiling of stress response in a spinal cord clip compression injury model

    Science.gov (United States)

    2013-01-01

    Background The aneurysm clip impact-compression model of spinal cord injury (SCI) is a standard injury model in animals that closely mimics the primary mechanism of most human injuries: acute impact and persisting compression. Its histo-pathological and behavioural outcomes are extensively similar to human SCI. To understand the distinct molecular events underlying this injury model we analyzed global mRNA abundance changes during the acute, subacute and chronic stages of a moderate to severe injury to the rat spinal cord. Results Time-series expression analyses resulted in clustering of the majority of deregulated transcripts into eight statistically significant expression profiles. Systematic application of Gene Ontology (GO) enrichment pathway analysis allowed inference of biological processes participating in SCI pathology. Temporal analysis identified events specific to and common between acute, subacute and chronic time-points. Processes common to all phases of injury include blood coagulation, cellular extravasation, leukocyte cell-cell adhesion, the integrin-mediated signaling pathway, cytokine production and secretion, neutrophil chemotaxis, phagocytosis, response to hypoxia and reactive oxygen species, angiogenesis, apoptosis, inflammatory processes and ossification. Importantly, various elements of adaptive and induced innate immune responses span, not only the acute and subacute phases, but also persist throughout the chronic phase of SCI. Induced innate responses, such as Toll-like receptor signaling, are more active during the acute phase but persist throughout the chronic phase. However, adaptive immune response processes such as B and T cell activation, proliferation, and migration, T cell differentiation, B and T cell receptor-mediated signaling, and B cell- and immunoglobulin-mediated immune response become more significant during the chronic phase. Conclusions This analysis showed that, surprisingly, the diverse series of molecular events that

  4. Integrated genome-wide analysis of transcription factor occupancy, RNA polymerase II binding and steady-state RNA levels identify differentially regulated functional gene classes

    NARCIS (Netherlands)

    Mokry, M.; Hatzis, P.; Schuijers, J.; Lansu, N.; Ruzius, F.P.; Clevers, H.; Cuppen, E.

    2012-01-01

    Routine methods for assaying steady-state mRNA levels such as RNA-seq and micro-arrays are commonly used as readouts to study the role of transcription factors (TFs) in gene expression regulation. However, cellular RNA levels do not solely depend on activity of TFs and subsequent transcription by

  5. Integrated genome-wide analysis of transcription factor occupancy, RNA polymerase II binding and steady-state RNA levels identify differentially regulated functional gene classes

    NARCIS (Netherlands)

    Mokry, Michal; Hatzis, Pantelis; Schuijers, Jurian; Lansu, Nico; Ruzius, Frans-Paul; Clevers, Hans; Cuppen, Edwin

    2012-01-01

    Routine methods for assaying steady-state mRNA levels such as RNA-seq and micro-arrays are commonly used as readouts to study the role of transcription factors (TFs) in gene expression regulation. However, cellular RNA levels do not solely depend on activity of TFs and subsequent transcription by RN

  6. Genome-wide Screening of Regulators of Catalase Expression: ROLE OF A TRANSCRIPTION COMPLEX AND HISTONE AND tRNA MODIFICATION COMPLEXES ON ADAPTATION TO STRESS.

    Science.gov (United States)

    García, Patricia; Encinar Del Dedo, Javier; Ayté, José; Hidalgo, Elena

    2016-01-01

    In response to environmental cues, the mitogen-activated protein kinase Sty1-driven signaling cascade activates hundreds of genes to induce a robust anti-stress cellular response in fission yeast. Thus, upon stress imposition Sty1 transiently accumulates in the nucleus where it up-regulates transcription through the Atf1 transcription factor. Several regulators of transcription and translation have been identified as important to mount an integral response to oxidative stress, such as the Spt-Ada-Gcn5-acetyl transferase or Elongator complexes, respectively. With the aim of identifying new regulators of this massive gene expression program, we have used a GFP-based protein reporter and screened a fission yeast deletion collection using flow cytometry. We find that the levels of catalase fused to GFP, both before and after a threat of peroxides, are altered in hundreds of strains lacking components of chromatin modifiers, transcription complexes, and modulators of translation. Thus, the transcription elongation complex Paf1, the histone methylase Set1-COMPASS, and the translation-related Trm112 dimers are all involved in full expression of Ctt1-GFP and in wild-type tolerance to peroxides.

  7. Genome-wide identification, classification and functional analyses of the bHLH transcription factor family in the pig, Sus scrofa.

    Science.gov (United States)

    Liu, Wuyi

    2015-08-01

    The basic helix-loop-helix (bHLH) transcription factors are one of the largest families of gene regulatory proteins and play crucial roles in genetic, developmental and physiological processes in eukaryotes. Here, we conducted a survey of the Sus scrofa genome and identified 109 putative bHLH transcription factor members belonging to super-groups A, B, C, D, E, and F, respectively, while four members were orphan genes. We identified 6 most significantly enriched KEGG pathways and 116 most significant GO annotation categories. Further comprehensive surveys in human genome and other 12 medical databases identified 72 significantly enriched biological pathways with these 113 pig bHLH transcription factors. From the functional protein association network analysis 93 hub proteins were identified and 55 hub proteins created a tight network or a functional module within their protein families. Especially, there were 20 hub proteins found highly connected in the functional interaction network. The present study deepens our understanding and provided insights into the evolution and functional aspects of animal bHLH proteins and should serve as a solid foundation for further for analyses of specific bHLH transcription factors in the pig and other mammals.

  8. Genome-wide identification of citrus ATP-citrate lyase genes and their transcript analysis in fruits reveals their possible role in citrate utilization.

    Science.gov (United States)

    Hu, Xiao-Mei; Shi, Cai-Yun; Liu, Xiao; Jin, Long-Fei; Liu, Yong-Zhong; Peng, Shu-Ang

    2015-02-01

    ATP-citrate lyase (ACL, EC4.1.3.8) catalyzes citrate to oxaloacetate and acetyl-CoA in the cell cytosol, and has important roles in normal plant growth and in the biosynthesis of some secondary metabolites. We identified three ACL genes, CitACLα1, CitACLα2, and CitACLβ1, in the citrus genome database. Both CitACLα1 and CitACLα2 encode putative ACL α subunits with 82.5 % amino acid identity, whereas CitACLβ1 encodes a putative ACL β subunit. Gene structure analysis showed that CitACLα1 and CitACLα2 had 12 exons and 11 introns, and CitACLβ1 had 16 exons and 15 introns. CitACLα1 and CitACLβ1 were predominantly expressed in flower, and CitACLα2 was predominantly expressed in stem and fibrous roots. As fruits ripen, the transcript levels of CitACLα1, CitACLβ1, and/or CitACLα2 in cultivars 'Niuher' and 'Owari' increased, accompanied by significant decreases in citrate content, while their transcript levels decreased significantly in 'Egan No. 1' and 'Iyokan', although citrate content also decreased. In 'HB pummelo', in which acid content increased as fruit ripened, and in acid-free pummelo, transcript levels of CitACLα2, CitACLβ1, and/or CitACLα1 increased. Moreover, mild drought stress and ABA treatment significantly increased citrate contents in fruits. Transcript levels of the three genes were significantly reduced by mild drought stress, and the transcript level of only CitACLβ1 was significantly reduced by ABA treatment. Taken together, these data indicate that the effects of ACL on citrate use during fruit ripening depends on the cultivar, and the reduction in ACL gene expression may be attributed to citrate increases under mild drought stress or ABA treatment.

  9. A transcription factor map as revealed by a genome-wide gene expression analysis of whole-blood mRNA transcriptome in multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Carlos Riveros

    Full Text Available BACKGROUND: Several lines of evidence suggest that transcription factors are involved in the pathogenesis of Multiple Sclerosis (MS but complete mapping of the whole network has been elusive. One of the reasons is that there are several clinical subtypes of MS and transcription factors that may be involved in one subtype may not be in others. We investigate the possibility that this network could be mapped using microarray technologies and contemporary bioinformatics methods on a dataset derived from whole blood in 99 untreated MS patients (36 Relapse Remitting MS, 43 Primary Progressive MS, and 20 Secondary Progressive MS and 45 age-matched healthy controls. METHODOLOGY/PRINCIPAL FINDINGS: We have used two different analytical methodologies: a non-standard differential expression analysis and a differential co-expression analysis, which have converged on a significant number of regulatory motifs that are statistically overrepresented in genes that are either differentially expressed (or differentially co-expressed in cases and controls (e.g., V$KROX_Q6, p-value <3.31E-6; V$CREBP1_Q2, p-value <9.93E-6, V$YY1_02, p-value <1.65E-5. CONCLUSIONS/SIGNIFICANCE: Our analysis uncovered a network of transcription factors that potentially dysregulate several genes in MS or one or more of its disease subtypes. The most significant transcription factor motifs were for the Early Growth Response EGR/KROX family, ATF2, YY1 (Yin and Yang 1, E2F-1/DP-1 and E2F-4/DP-2 heterodimers, SOX5, and CREB and ATF families. These transcription factors are involved in early T-lymphocyte specification and commitment as well as in oligodendrocyte dedifferentiation and development, both pathways that have significant biological plausibility in MS causation.

  10. Genome-wide mapping of DNA methylation in chicken.

    Directory of Open Access Journals (Sweden)

    Qinghe Li

    Full Text Available Cytosine DNA methylation is an important epigenetic modification termed as the fifth base that functions in diverse processes. Till now, the genome-wide DNA methylation maps of many organisms has been reported, such as human, Arabidopsis, rice and silkworm, but the methylation pattern of bird remains rarely studied. Here we show the genome-wide DNA methylation map of bird, using the chicken as a model organism and an immunocapturing approach followed by high-throughput sequencing. In both of the red jungle fowl and the avian broiler, DNA methylation was described separately for the liver and muscle tissue. Generally, chicken displays analogous methylation pattern with that of animals and plants. DNA methylation is enriched in the gene body regions and the repetitive sequences, and depleted in the transcription start site (TSS and the transcription termination site (TTS. Most of the CpG islands in the chicken genome are kept in unmethylated state. Promoter methylation is negatively correlated with the gene expression level, indicating its suppressive role in regulating gene transcription. This work contributes to our understanding of epigenetics in birds.

  11. Genome-wide transcriptional profiling of peripheral blood leukocytes from cattle infected with Mycobacterium bovis reveals suppression of host immune genes

    OpenAIRE

    O'Farrelly, Cliona; Hokamp, Karsten

    2011-01-01

    PUBLISHED Background: Mycobacterium bovis is the causative agent of bovine tuberculosis (BTB), a pathological infection with significant economic impact. Recent studies have highlighted the role of functional genomics to better understand the molecular mechanisms governing the host immune response to M. bovis infection. Furthermore, these studies may enable the identification of novel transcriptional markers of BTB that can augment current diagnostic tests and surveillance programmes. I...

  12. Genome-wide profiling of Hfq-binding RNAs uncovers extensive post-transcriptional rewiring of major stress response and symbiotic regulons in Sinorhizobium meliloti.

    Science.gov (United States)

    Torres-Quesada, Omar; Reinkensmeier, Jan; Schlüter, Jan-Philip; Robledo, Marta; Peregrina, Alexandra; Giegerich, Robert; Toro, Nicolás; Becker, Anke; Jiménez-Zurdo, Jose I

    2014-01-01

    The RNA chaperone Hfq is a global post-transcriptional regulator in bacteria. Here, we used RNAseq to analyze RNA populations from the legume symbiont Sinorhizobium meliloti that were co-immunoprecipitated (CoIP-RNA) with a FLAG-tagged Hfq in five growth/stress conditions. Hfq-bound transcripts (1315) were largely identified in stressed bacteria and derived from small RNAs (sRNAs), both trans-encoded (6.4%) and antisense (asRNAs; 6.3%), and mRNAs (86%). Pull-down with Hfq recovered a small proportion of annotated S. meliloti sRNAs (14% of trans-sRNAs and 2% of asRNAs) suggesting a discrete impact of this protein in sRNA pathways. Nonetheless, Hfq selectively stabilized CoIP-enriched sRNAs, anticipating that these interactions are functionally significant. Transcription of 26 Hfq-bound sRNAs was predicted to occur from promoters recognized by the major stress σ factors σ(E2) or σ(H1/2). Recovery rates of sRNAs in each of the CoIP-RNA libraries suggest a large impact of Hfq-assisted riboregulation in S. meliloti osmoadaptation. Hfq directly targeted 18% of the predicted S. meliloti mRNAs, which encode functionally diverse proteins involved in transport and metabolism, σ(E2)-dependent stress responses, quorum sensing, flagella biosynthesis, ribosome, and membrane assembly or symbiotic nitrogen fixation. Canonical targeting of the 5' regions of two of the ABC transporter mRNAs by the homologous Hfq-binding AbcR1 and AbcR2 sRNAs leading to inhibition of protein synthesis was confirmed in vivo. We therefore provide a comprehensive resource for the systems-level deciphering of hitherto unexplored S. meliloti stress and symbiotic post-transcriptional regulons and the identification of Hfq-dependent sRNA-mRNA regulatory pairs.

  13. Genome-wide profiling of Hfq-binding RNAs uncovers extensive post-transcriptional rewiring of major stress response and symbiotic regulons in Sinorhizobium meliloti

    Science.gov (United States)

    Torres-Quesada, Omar; Reinkensmeier, Jan; Schlüter, Jan-Philip; Robledo, Marta; Peregrina, Alexandra; Giegerich, Robert; Toro, Nicolás; Becker, Anke; Jiménez-Zurdo, Jose I

    2014-01-01

    The RNA chaperone Hfq is a global post-transcriptional regulator in bacteria. Here, we used RNAseq to analyze RNA populations from the legume symbiont Sinorhizobium meliloti that were co-immunoprecipitated (CoIP-RNA) with a FLAG-tagged Hfq in five growth/stress conditions. Hfq-bound transcripts (1315) were largely identified in stressed bacteria and derived from small RNAs (sRNAs), both trans-encoded (6.4%) and antisense (asRNAs; 6.3%), and mRNAs (86%). Pull-down with Hfq recovered a small proportion of annotated S. meliloti sRNAs (14% of trans-sRNAs and 2% of asRNAs) suggesting a discrete impact of this protein in sRNA pathways. Nonetheless, Hfq selectively stabilized CoIP-enriched sRNAs, anticipating that these interactions are functionally significant. Transcription of 26 Hfq-bound sRNAs was predicted to occur from promoters recognized by the major stress σ factors σE2 or σH1/2. Recovery rates of sRNAs in each of the CoIP–RNA libraries suggest a large impact of Hfq-assisted riboregulation in S. meliloti osmoadaptation. Hfq directly targeted 18% of the predicted S. meliloti mRNAs, which encode functionally diverse proteins involved in transport and metabolism, σE2-dependent stress responses, quorum sensing, flagella biosynthesis, ribosome, and membrane assembly or symbiotic nitrogen fixation. Canonical targeting of the 5′ regions of two of the ABC transporter mRNAs by the homologous Hfq-binding AbcR1 and AbcR2 sRNAs leading to inhibition of protein synthesis was confirmed in vivo. We therefore provide a comprehensive resource for the systems-level deciphering of hitherto unexplored S. meliloti stress and symbiotic post-transcriptional regulons and the identification of Hfq-dependent sRNA–mRNA regulatory pairs. PMID:24786641

  14. Genome-Wide Identification of the Transcription Factors Involved in Citrus Fruit Ripening from the Transcriptomes of a Late-Ripening Sweet Orange Mutant and Its Wild Type.

    Science.gov (United States)

    Wu, Juxun; Fu, Lili; Yi, Hualin

    2016-01-01

    Fruit ripening is a genetically programmed process. Transcription factors (TFs) play key roles in plant development and ripening by temporarily and spatially regulating the transcription of their target genes. In this study, a total of 159 TFs were identified from a spontaneous late-ripening mutant 'Fengwan' (C. sinensis L. Osbeck) sweet orange (MT) and its wild-type counterpart ('Fengjie 72-1', WT) along the ripening period via the Transcription Factor Prediction of PlantTFDB 3.0. Fifty-two differentially expressed TFs were identified between MT and WT; 92 and 120 differentially expressed TFs were identified in WT and MT, respectively. The Venn diagram analysis showed that 16 differentially expressed TFs were identified between MT and WT and during the ripening of WT and MT. These TFs were primarily assigned to the families of C2H2, Dof, bHLH, ERF, MYB, NAC and LBD. Particularly, the number of TFs of the ERF family was the greatest between MT and WT. According to the results of the WGCNA analysis, a weighted correlation network analysis tool, several important TFs correlated to abscisic acid (ABA), citric acid, fructose, glucose and sucrose were identified, such as RD26, NTT, GATA7 and MYB21/62/77. Hierarchical cluster analysis and the expression analysis conducted at five fruit ripening stages further validated the pivotal TFs that potentially function during orange fruit development and ripening.

  15. Genome-wide transcriptional changes and defence-related chemical profiling of rice in response to infestation by the rice striped stem borer Chilo suppressalis.

    Science.gov (United States)

    Zhou, Guoxin; Wang, Xia; Yan, Feng; Wang, Xia; Li, Ran; Cheng, Jiaan; Lou, Yonggen

    2011-09-01

    How rice defends itself against pathogen infection is well documented, but little is known about how it defends itself against herbivore attack. We measured changes in the transcriptome and chemical profile of rice when the plant is infested by the striped stem borer (SSB) Chilo suppressalis. Infestation by SSBs resulted in changes in the expression levels of 4545 rice genes; this number accounts for about 8% of the genome and is made up of 18 functional groups with broad functions. The largest group comprised genes involved in metabolism, followed by cellular transport, transcription and cellular signaling. Infestation by SSBs modulated many genes responsible for the biosynthesis of plant hormones and plant signaling. Jasmonic acid (JA), salicylic acid (SA) and ethylene were the major hormones that shaped the SSB-induced defence responses of rice. Many secondary signal transduction components, such as those involved in Ca²⁺ signaling and G-protein signaling, receptor and non-receptor protein kinases, and transcription factors were involved in the SSB-induced responses of rice. Photosynthesis and ATP synthesis from photophosphorylation were restricted by SSB feeding. In addition, SSB infestation induced the accumulation of defence compounds, including trypsin proteinase inhibitors (TrypPIs) and volatile organic compounds. These results demonstrate that SSB-induced defences required rice to reconfigure a wide variety of its metabolic, physiological and biochemical processes.

  16. Genome-wide transcriptional profiling and metabolic analysis uncover multiple molecular responses of the grass species Lolium perenne under low-intensity xenobiotic stress

    Directory of Open Access Journals (Sweden)

    Anne-Antonella eSerra

    2015-12-01

    Full Text Available Lolium perenne, which is a major component of pastures, lawns, and grass strips, can be exposed to xenobiotic stresses due to diffuse and residual contaminations of soil. L. perenne was recently shown to undergo metabolic adjustments in response to sub-toxic levels of xenobiotics. To gain insight in such chemical stress responses, a de novo transcriptome analysis was carried out on leaves from plants subjected at the root level to low levels of xenobiotics, glyphosate, tebuconazole, and a combination of the two, leading to no adverse physiological effect. Chemical treatments influenced significantly the relative proportions of functional categories and of transcripts related to carbohydrate processes, to signalling, to protein-kinase cascades, as Serine/Threonine-protein kinases, to transcriptional regulations, to responses to abiotic or biotic stimuli and to responses to phytohormones. Transcriptomics-based expressions of genes encoding different types of SNF1 (sucrose non-fermenting 1-related kinases involved in sugar and stress signalling or encoding key metabolic enzymes were in line with specific qRT-PCR analysis or with the important metabolic and regulatory changes revealed by metabolomic analysis. The effects of pesticide treatments on metabolites and gene expression strongly suggest that pesticides at low levels, as single molecule or as mixture, affect cell signalling and functioning even in the absence of major physiological impact. This global analysis of L. perenne therefore highlighted the interactions between molecular regulation of responses to xenobiotics, and also carbohydrate dynamics, energy dysfunction, phytohormones and calcium signalling.

  17. Dominant Genetic Variation and Missing Heritability for Human Complex Traits: Insights from Twin versus Genome-wide Common SNP Models.

    Science.gov (United States)

    Chen, Xu; Kuja-Halkola, Ralf; Rahman, Iffat; Arpegård, Johannes; Viktorin, Alexander; Karlsson, Robert; Hägg, Sara; Svensson, Per; Pedersen, Nancy L; Magnusson, Patrik K E

    2015-11-05

    In order to further illuminate the potential role of dominant genetic variation in the "missing heritability" debate, we investigated the additive (narrow-sense heritability, h(2)) and dominant (δ(2)) genetic variance for 18 human complex traits. Within the same study base (10,682 Swedish twins), we calculated and compared the estimates from classic twin-based structural equation model with SNP-based genomic-relatedness-matrix restricted maximum likelihood [GREML(d)] method. Contributions of δ(2) were evident for 14 traits in twin models (average δ(2)twin = 0.25, range 0.14-0.49), two of which also displayed significant δ(2) in the GREMLd analyses (triglycerides δ(2)SNP = 0.28 and waist circumference δ(2)SNP = 0.19). On average, the proportion of h(2)SNP/h(2)twin was 70% for ADE-fitted traits (for which the best-fitting model included additive and dominant genetic and unique environmental components) and 31% for AE-fitted traits (for which the best-fitting model included additive genetic and unique environmental components). Independent evidence for contribution from shared environment, also in ADE-fitted traits, was obtained from self-reported within-pair contact frequency and age at separation. We conclude that despite the fact that additive genetics appear to constitute the bulk of genetic influences for most complex traits, dominant genetic variation might often be masked by shared environment in twin and family studies and might therefore have a more prominent role than what family-based estimates often suggest. The risk of erroneously attributing all inherited genetic influences (additive and dominant) to the h(2) in too-small twin studies might also lead to exaggerated "missing heritability" (the proportion of h(2) that remains unexplained by SNPs). Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  18. Genome-wide survey of alternative splicing in the grass Brachypodium distachyon: a emerging model biosystem for plant functional genomics.

    Science.gov (United States)

    Sablok, Gaurav; Gupta, P K; Baek, Jong-Min; Vazquez, Franck; Min, Xiang Jia

    2011-03-01

    A draft sequence of the genome of Brachypodium distachyon, the emerging grass model, was recently released. This represents a unique opportunity to determine its functional diversity compared to the genomes of other model species. Using homology mapping of assembled expressed sequence tags with chromosome scale pseudomolecules, we identified 128 alternative splicing events in B. distachyon. Our study identified that retention of introns is the major type of alternative splicing events (53%) in this plant and highlights the prevalence of splicing site recognition for definition of introns in plants. We have analyzed the compositional profiles of exon-intron junctions by base-pairing nucleotides with U1 snRNA which serves as a model for describing the possibility of sequence conservation. The alternative splicing isoforms identified in this study are novel and represent one of the potentially biologically significant means by which B. distachyon controls the function of its genes. Our observations serve as a basis to understand alternative splicing events of cereal crops with more complex genomes, like wheat or barley.

  19. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis).

    Science.gov (United States)

    Wu, Huili; Lv, Hao; Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15-23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo.

  20. Integrative genome-wide expression analysis bears evidence of estrogen receptor-independent transcription in heregulin-stimulated MCF-7 cells.

    Directory of Open Access Journals (Sweden)

    Takeshi Nagashima

    Full Text Available Heregulin beta-1 (HRG is an extracellular ligand that activates mitogen-activated protein kinase (MAPK and phosphatidylinositol-3-OH kinase (PI3K/Akt signaling pathways through ErbB receptors. MAPK and Akt have been shown to phosphorylate the estrogen receptor (ER at Ser-118 and Ser-167, respectively, thereby mimicking the effects of estrogenic activity such as estrogen responsive element (ERE-dependent transcription. In the current study, integrative analysis was performed using two tiling array platforms, comprising histone H3 lysine 9 (H3K9 acetylation and RNA mapping, together with array comparative genomic hybridization (CGH analysis in an effort to identify HRG-regulated genes in ER-positive MCF-7 breast cancer cells. Through application of various threshold settings, 333 (326 up-regulated and 7 down-regulated HRG-regulated genes were detected. Prediction of upstream transcription factors (TFs and pathway analysis indicated that 21% of HRG-induced gene regulation may be controlled by the MAPK cascade, while only 0.6% of the gene expression is controlled by ERE. A comparison with previously reported estrogen (E2-regulated gene expression data revealed that only 12 common genes were identified between the 333 HRG-regulated (3.6% and 239 E2-regulated (5.0% gene groups. However, with respect to enriched upstream TFs, 4 common TFs were identified in the 14 HRG-regulated (28.6% and 13 E2-regulated (30.8% gene groups. These results indicated that while E2 and HRG may induce common TFs, the regulatory mechanisms that govern HRG- and E2-induced gene expression differ.

  1. Genome-wide prediction and validation of sigma70 promoters in Lactobacillus plantarum WCFS1.

    Directory of Open Access Journals (Sweden)

    Tilman J Todt

    Full Text Available BACKGROUND: In prokaryotes, sigma factors are essential for directing the transcription machinery towards promoters. Various sigma factors have been described that recognize, and bind to specific DNA sequence motifs in promoter sequences. The canonical sigma factor σ(70 is commonly involved in transcription of the cell's housekeeping genes, which is mediated by the conserved σ(70 promoter sequence motifs. In this study the σ(70-promoter sequences in Lactobacillus plantarum WCFS1 were predicted using a genome-wide analysis. The accuracy of the transcriptionally-active part of this promoter prediction was subsequently evaluated by correlating locations of predicted promoters with transcription start sites inferred from the 5'-ends of transcripts detected by high-resolution tiling array transcriptome datasets. RESULTS: To identify σ(70-related promoter sequences, we performed a genome-wide sequence motif scan of the L. plantarum WCFS1 genome focussing on the regions upstream of protein-encoding genes. We obtained several highly conserved motifs including those resembling the conserved σ(70-promoter consensus. Position weight matrices-based models of the recovered σ(70-promoter sequence motif were employed to identify 3874 motifs with significant similarity (p-value<10(-4 to the model-motif in the L. plantarum genome. Genome-wide transcript information deduced from whole genome tiling-array transcriptome datasets, was used to infer transcription start sites (TSSs from the 5'-end of transcripts. By this procedure, 1167 putative TSSs were identified that were used to corroborate the transcriptionally active fraction of these predicted promoters. In total, 568 predicted promoters were found in proximity (≤ 40 nucleotides of the putative TSSs, showing a highly significant co-occurrence of predicted promoter and TSS (p-value<10(-263. CONCLUSIONS: High-resolution tiling arrays provide a suitable source to infer TSSs at a genome-wide level, and

  2. Genome-wide fitness analyses of the foodborne pathogen Campylobacter jejuni in in vitro and in vivo models

    DEFF Research Database (Denmark)

    de Vries, Stefan P. W.; Gupta, Srishti; Baig, Abiyad

    2017-01-01

    Campylobacter is the most common cause of foodborne bacterial illness worldwide. Faecal contamination of meat, especially chicken, during processing represents a key route of transmission to humans. There is a lack of insight into the mechanisms driving C. jejuni growth and survival within hosts...... and the environment. Here, we report a detailed analysis of C. jejuni fitness across models reflecting stages in its life cycle. Transposon (Tn) gene-inactivation libraries were generated in three C. jejuni strains and the impact on fitness during chicken colonisation, survival in houseflies and under nutrient......-rich and -poor conditions at 4 degrees C and infection of human gut epithelial cells was assessed by Tn-insertion site sequencing (Tn-seq). A total of 331 homologous gene clusters were essential for fitness during in vitro growth in three C. jejuni strains, revealing that a large part of its genome is dedicated...

  3. A Genetic Predictive Model for Canine Hip Dysplasia: Integration of Genome Wide Association Study (GWAS) and Candidate Gene Approaches

    Science.gov (United States)

    Bartolomé, Nerea; Segarra, Sergi; Artieda, Marta; Francino, Olga; Sánchez, Elisenda; Szczypiorska, Magdalena; Casellas, Joaquim; Tejedor, Diego; Cerdeira, Joaquín; Martínez, Antonio; Velasco, Alfonso; Sánchez, Armand

    2015-01-01

    Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B) and case (D/E). C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85) and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors’ opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds. PMID:25874693

  4. A genetic predictive model for canine hip dysplasia: integration of Genome Wide Association Study (GWAS and candidate gene approaches.

    Directory of Open Access Journals (Sweden)

    Nerea Bartolomé

    Full Text Available Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B and case (D/E. C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85 and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors' opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds.

  5. A genetic predictive model for canine hip dysplasia: integration of Genome Wide Association Study (GWAS) and candidate gene approaches.

    Science.gov (United States)

    Bartolomé, Nerea; Segarra, Sergi; Artieda, Marta; Francino, Olga; Sánchez, Elisenda; Szczypiorska, Magdalena; Casellas, Joaquim; Tejedor, Diego; Cerdeira, Joaquín; Martínez, Antonio; Velasco, Alfonso; Sánchez, Armand

    2015-01-01

    Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B) and case (D/E). C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85) and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors' opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds.

  6. Genome-Wide Analysis of the AP2/ERF Transcription Factors Family and the Expression Patterns of DREB Genes in Moso Bamboo (Phyllostachys edulis)

    Science.gov (United States)

    Li, Long; Liu, Jun; Mu, Shaohua; Li, Xueping; Gao, Jian

    2015-01-01

    The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis) is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA) and the division time of AP2/ERF family genes between rice and moso bamboo was 15–23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo. PMID:25985202

  7. Genome-wide mRNA expression analysis of hepatic adaptation to high-fat diets reveals switch from an inflammatory to steatotic transcriptional program.

    Directory of Open Access Journals (Sweden)

    Marijana Radonjic

    Full Text Available BACKGROUND: Excessive exposure to dietary fats is an important factor in the initiation of obesity and metabolic syndrome associated pathologies. The cellular processes associated with the onset and progression of diet-induced metabolic syndrome are insufficiently understood. PRINCIPAL FINDINGS: To identify the mechanisms underlying the pathological changes associated with short and long-term exposure to excess dietary fat, hepatic gene expression of ApoE3Leiden mice fed chow and two types of high-fat (HF diets was monitored using microarrays during a 16-week period. A functional characterization of 1663 HF-responsive genes reveals perturbations in lipid, cholesterol and oxidative metabolism, immune and inflammatory responses and stress-related pathways. The major changes in gene expression take place during the early (day 3 and late (week 12 phases of HF feeding. This is also associated with characteristic opposite regulation of many HF-affected pathways between these two phases. The most prominent switch occurs in the expression of inflammatory/immune pathways (early activation, late repression and lipogenic/adipogenic pathways (early repression, late activation. Transcriptional network analysis identifies NF-kappaB, NEMO, Akt, PPARgamma and SREBP1 as the key controllers of these processes and suggests that direct regulatory interactions between these factors may govern the transition from early (stressed, inflammatory to late (pathological, steatotic hepatic adaptation to HF feeding. This transition observed by hepatic gene expression analysis is confirmed by expression of inflammatory proteins in plasma and the late increase in hepatic triglyceride content. In addition, the genes most predictive of fat accumulation in liver during 16-week high-fat feeding period are uncovered by regression analysis of hepatic gene expression and triglyceride levels. CONCLUSIONS: The transition from an inflammatory to a steatotic transcriptional program

  8. Genome-wide analysis of intraspecific DNA polymorphism in 'Micro-Tom', a model cultivar of tomato (Solanum lycopersicum).

    Science.gov (United States)

    Kobayashi, Masaaki; Nagasaki, Hideki; Garcia, Virginie; Just, Daniel; Bres, Cécile; Mauxion, Jean-Philippe; Le Paslier, Marie-Christine; Brunel, Dominique; Suda, Kunihiro; Minakuchi, Yohei; Toyoda, Atsushi; Fujiyama, Asao; Toyoshima, Hiromi; Suzuki, Takayuki; Igarashi, Kaori; Rothan, Christophe; Kaminuma, Eli; Nakamura, Yasukazu; Yano, Kentaro; Aoki, Koh

    2014-02-01

    Tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. The genome sequencing of the tomato cultivar 'Heinz 1706' was recently completed. To accelerate the progress of tomato genomics studies, systematic bioresources, such as mutagenized lines and full-length cDNA libraries, have been established for the cultivar 'Micro-Tom'. However, these resources cannot be utilized to their full potential without the completion of the genome sequencing of 'Micro-Tom'. We undertook the genome sequencing of 'Micro-Tom' and here report the identification of single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) between 'Micro-Tom' and 'Heinz 1706'. The analysis demonstrated the presence of 1.23 million SNPs and 0.19 million indels between the two cultivars. The density of SNPs and indels was high in chromosomes 2, 5 and 11, but was low in chromosomes 6, 8 and 10. Three known mutations of 'Micro-Tom' were localized on chromosomal regions where the density of SNPs and indels was low, which was consistent with the fact that these mutations were relatively new and introgressed into 'Micro-Tom' during the breeding of this cultivar. We also report SNP analysis for two 'Micro-Tom' varieties that have been maintained independently in Japan and France, both of which have served as standard lines for 'Micro-Tom' mutant collections. Approximately 28,000 SNPs were identified between these two 'Micro-Tom' lines. These results provide high-resolution DNA polymorphic information on 'Micro-Tom' and represent a valuable contribution to the 'Micro-Tom'-based genomics resources.

  9. Genome-wide analysis of the WRKY transcription factor gene family in Gossypium raimondii and the expression of orthologs in cultivated tetraploid cotton

    Directory of Open Access Journals (Sweden)

    Caiping Cai

    2014-04-01

    Full Text Available WRKY proteins are members of a family of transcription factors in higher plants that function in plant responses to various physiological processes. We identified 120 candidate WRKY genes from Gossypium raimondii with corresponding expressed sequence tags in at least one of four cotton species, Gossypium hirsutum, Gossypium barbadense, Gossypium arboreum, and G. raimondii. These WRKY members were anchored on 13 chromosomes in G. raimondii with uneven distribution. Phylogenetic analysis showed that WRKY candidate genes can be classified into three groups, with 20 members in group I, 88 in group II, and 12 in group III. The 88 genes in group II were further classified into five subgroups, groups IIa–e, containing 7, 16, 37, 15, and 13 members, respectively. We characterized diversity in amino acid residues in the WRKY domain and/or other zinc finger motif regions in the WRKY proteins. The expression patterns of WRKY genes revealed their important roles in diverse functions in cotton developmental stages of vegetative and reproductive growth and stress response. Structural and expression analyses show that WRKY proteins are a class of important regulators of growth and development and play key roles in response to stresses in cotton.

  10. Genome-wide analysis of the WRKY transcription factor gene family in Gossypium raimondii and the expression of orthologs in cultivated tetraploid cotton

    Institute of Scientific and Technical Information of China (English)

    Caiping; Cai; Erli; Niu; Hao; Du; Liang; Zhao; Yue; Feng; Wangzhen; Guo

    2014-01-01

    WRKY proteins are members of a family of transcription factors in higher plants that function in plant responses to various physiological processes.We identified 120 candidate WRKY genes from Gossypium raimondii with corresponding expressed sequence tags in at least one of four cotton species,Gossypium hirsutum,Gossypium barbadense,Gossypium arboreum,and G.raimondii.These WRKY members were anchored on 13 chromosomes in G.raimondii with uneven distribution.Phylogenetic analysis showed that WRKY candidate genes can be classified into three groups,with 20 members in group I,88 in group II,and 12 in group III.The88 genes in group II were further classified into five subgroups,groups IIa–e,containing 7,16,37,15,and 13 members,respectively.We characterized diversity in amino acid residues in the WRKY domain and/or other zinc finger motif regions in the WRKY proteins.The expression patterns of WRKY genes revealed their important roles in diverse functions in cotton developmental stages of vegetative and reproductive growth and stress response.Structural and expression analyses show that WRKY proteins are a class of important regulators of growth and development and play key roles in response to stresses in cotton.

  11. Genome-wide patterns of Arabidopsis gene expression in nature.

    Directory of Open Access Journals (Sweden)

    Christina L Richards

    Full Text Available Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg correlate to temperature and precipitation occurrence in the field. The largest PC(veg axes included thermoregulatory genes while the second major PC(veg was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.

  12. Genome-wide analysis of brain and gonad transcripts reveals changes of key sex reversal-related genes expression and signaling pathways in three stages of Monopterus albus

    Science.gov (United States)

    Hu, Qing; Guo, Wei; Li, Dapeng

    2017-01-01

    Background The natural sex reversal severely affects the sex ratio and thus decreases the productivity of the rice field eel (Monopterus albus). How to understand and manipulate this process is one of the major issues for the rice field eel stocking. So far the genomics and transcriptomics data available for this species are still scarce. Here we provide a comprehensive study of transcriptomes of brain and gonad tissue in three sex stages (female, intersex and male) from the rice field eel to investigate changes in transcriptional level during the sex reversal process. Results Approximately 195 thousand unigenes were generated and over 44.4 thousand were functionally annotated. Comparative study between stages provided multiple differentially expressed genes in brain and gonad tissue. Overall 4668 genes were found to be of unequal abundance between gonad tissues, far more than that of the brain tissues (59 genes). These genes were enriched in several different signaling pathways. A number of 231 genes were found with different levels in gonad in each stage, with several reproduction-related genes included. A total of 19 candidate genes that could be most related to sex reversal were screened out, part of these genes’ expression patterns were validated by RT-qPCR. The expression of spef2, maats1, spag6 and dmc1 were abundant in testis, but was barely detected in females, while the 17β-hsd12, zpsbp3, gal3 and foxn5 were only expressed in ovary. Conclusion This study investigated the complexity of brain and gonad transcriptomes in three sex stages of the rice field eel. Integrated analysis of different gene expression and changes in signaling pathways, such as PI3K-Akt pathway, provided crucial data for further study of sex transformation mechanisms. PMID:28319194

  13. Modeling of environmental effects in genome-wide association studies identifies SLC2A2 and HP as novel loci influencing serum cholesterol levels.

    Directory of Open Access Journals (Sweden)

    Wilmar Igl

    2010-01-01

    Full Text Available Genome-wide association studies (GWAS have identified 38 larger genetic regions affecting classical blood lipid levels without adjusting for important environmental influences. We modeled diet and physical activity in a GWAS in order to identify novel loci affecting total cholesterol, LDL cholesterol, HDL cholesterol, and triglyceride levels. The Swedish (SE EUROSPAN cohort (N(SE = 656 was screened for candidate genes and the non-Swedish (NS EUROSPAN cohorts (N(NS = 3,282 were used for replication. In total, 3 SNPs were associated in the Swedish sample and were replicated in the non-Swedish cohorts. While SNP rs1532624 was a replication of the previously published association between CETP and HDL cholesterol, the other two were novel findings. For the latter SNPs, the p-value for association was substantially improved by inclusion of environmental covariates: SNP rs5400 (p(SE,unadjusted = 3.6 x 10(-5, p(SE,adjusted = 2.2 x 10(-6, p(NS,unadjusted = 0.047 in the SLC2A2 (Glucose transporter type 2 and rs2000999 (p(SE,unadjusted = 1.1 x 10(-3, p(SE,adjusted = 3.8 x 10(-4, p(NS,unadjusted = 0.035 in the HP gene (Haptoglobin-related protein precursor. Both showed evidence of association with total cholesterol. These results demonstrate that inclusion of important environmental factors in the analysis model can reveal new genetic susceptibility loci.

  14. Genome-wide ENU mutagenesis in combination with high density SNP analysis and exome sequencing provides rapid identification of novel mouse models of developmental disease.

    Directory of Open Access Journals (Sweden)

    Georgina Caruana

    Full Text Available BACKGROUND: Mice harbouring gene mutations that cause phenotypic abnormalities during organogenesis are invaluable tools for linking gene function to normal development and human disorders. To generate mouse models harbouring novel alleles that are involved in organogenesis we conducted a phenotype-driven, genome-wide mutagenesis screen in mice using the mutagen N-ethyl-N-nitrosourea (ENU. METHODOLOGY/PRINCIPAL FINDINGS: ENU was injected into male C57BL/6 mice and the mutations transmitted through the germ-line. ENU-induced mutations were bred to homozygosity and G3 embryos screened at embryonic day (E 13.5 and E18.5 for abnormalities in limb and craniofacial structures, skin, blood, vasculature, lungs, gut, kidneys, ureters and gonads. From 52 pedigrees screened 15 were detected with anomalies in one or more of the structures/organs screened. Using single nucleotide polymorphism (SNP-based linkage analysis in conjunction with candidate gene or next-generation sequencing (NGS we identified novel recessive alleles for Fras1, Ift140 and Lig1. CONCLUSIONS/SIGNIFICANCE: In this study we have generated mouse models in which the anomalies closely mimic those seen in human disorders. The association between novel mutant alleles and phenotypes will lead to a better understanding of gene function in normal development and establish how their dysfunction causes human anomalies and disease.

  15. Genome-Wide Transcriptional Profiling to Elucidate Key Candidates Involved in Bud Burst and Rattling Growth in a Subtropical Bamboo (Dendrocalamus hamiltonii)

    Science.gov (United States)

    Bhandawat, Abhishek; Singh, Gagandeep; Seth, Romit; Singh, Pradeep; Sharma, Ram K.

    2017-01-01

    Bamboo, one of the fastest growing plants, can be a promising model system to understand growth. The study provides an insight into the complex interplay between environmental signaling and cellular machineries governing initiation and persistence of growth in a subtropical bamboo (Dendrocalamus hamiltonii). Phenological and spatio-temporal transcriptome analysis of rhizome and shoot during the major vegetative developmental transitions of D. hamiltonii was performed to dissect factors governing growth. Our work signifies the role of environmental cues, predominantly rainfall, decreasing day length, and high humidity for activating dormant bud to produce new shoot, possibly through complex molecular interactions among phosphatidylinositol, calcium signaling pathways, phytohormones, circadian rhythm, and humidity responses. We found the coordinated regulation of auxin, cytokinin, brassinosteroid signaling and cell cycle modulators; facilitating cell proliferation, cell expansion, and cell wall biogenesis supporting persistent growth of emerging shoot. Putative master regulators among these candidates were identified using predetermined Arabidopsis thaliana protein-protein interaction network. We got clues that the growth signaling begins far back in rhizome even before it emerges out as new shoot. Putative growth candidates identified in our study can serve in devising strategies to engineer bamboos and timber trees with enhanced growth and biomass potentials. PMID:28123391

  16. Impacts of population structure and analytical models in genome-wide association studies of complex traits in forest trees: a case study in Eucalyptus globulus.

    Directory of Open Access Journals (Sweden)

    Eduardo P Cappa

    Full Text Available The promise of association genetics to identify genes or genomic regions controlling complex traits has generated a flurry of interest. Such phenotype-genotype associations could be useful to accelerate tree breeding cycles, increase precision and selection intensity for late expressing, low heritability traits. However, the prospects of association genetics in highly heterozygous undomesticated forest trees can be severely impacted by the presence of cryptic population and pedigree structure. To investigate how to better account for this, we compared the GLM and five combinations of the Unified Mixed Model ( UMM on data of a low-density genome-wide association study for growth and wood property traits carried out in a Eucalyptus globulus population (n = 303 with 7,680 Diversity Array Technology (DArT markers. Model comparisons were based on the degree of deviation from the uniform distribution and estimates of the mean square differences between the observed and expected p-values of all significant marker-trait associations detected. Our analysis revealed the presence of population and family structure. There was not a single best model for all traits. Striking differences in detection power and accuracy were observed among the different models especially when population structure was not accounted for. The UMM method was the best and produced superior results when compared to GLM for all traits. Following stringent correction for false discoveries, 18 marker-trait associations were detected, 16 for tree diameter growth and two for lignin monomer composition (S:G ratio, a key wood property trait. The two DArT markers associated with S:G ratio on chromosome 10, physically map within 1 Mbp of the ferulate 5-hydroxylase (F5H gene, providing a putative independent validation of this marker-trait association. This study details the merit of collectively integrate population structure and relatedness in association analyses in undomesticated, highly

  17. Genome-Wide Mapping of Uncapped and Cleaved Transcripts Reveals a Role for the Nuclear mRNA Cap-Binding Complex in Cotranslational RNA Decay in Arabidopsis[OPEN

    Science.gov (United States)

    Willmann, Matthew R.

    2016-01-01

    RNA turnover is necessary for controlling proper mRNA levels posttranscriptionally. In general, RNA degradation is via exoribonucleases that degrade RNA either from the 5′ end to the 3′ end, such as XRN4, or in the opposite direction by the multisubunit exosome complex. Here, we use genome-wide mapping of uncapped and cleaved transcripts to reveal the global landscape of cotranslational mRNA decay in the Arabidopsis thaliana transcriptome. We found that this process leaves a clear three nucleotide periodicity in open reading frames. This pattern of cotranslational degradation is especially evident near the ends of open reading frames, where we observe accumulation of cleavage events focused 16 to 17 nucleotides upstream of the stop codon because of ribosomal pausing during translation termination. Following treatment of Arabidopsis plants with the translation inhibitor cycloheximide, cleavage events accumulate 13 to 14 nucleotides upstream of the start codon where initiating ribosomes have been stalled with these sequences in their P site. Further analysis in xrn4 mutant plants indicates that cotranslational RNA decay is XRN4 dependent. Additionally, studies in plants lacking CAP BINDING PROTEIN80/ABA HYPERSENSITIVE1, the largest subunit of the nuclear mRNA cap binding complex, reveal a role for this protein in cotranslational decay. In total, our results demonstrate the global prevalence and features of cotranslational RNA decay in a plant transcriptome. PMID:27758893

  18. Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data.

    Science.gov (United States)

    Shi, Jianxin; Park, Ju-Hyun; Duan, Jubao; Berndt, Sonja T; Moy, Winton; Yu, Kai; Song, Lei; Wheeler, William; Hua, Xing; Silverman, Debra; Garcia-Closas, Montserrat; Hsiung, Chao Agnes; Figueroa, Jonine D; Cortessis, Victoria K; Malats, Núria; Karagas, Margaret R; Vineis, Paolo; Chang, I-Shou; Lin, Dongxin; Zhou, Baosen; Seow, Adeline; Matsuo, Keitaro; Hong, Yun-Chul; Caporaso, Neil E; Wolpin, Brian; Jacobs, Eric; Petersen, Gloria M; Klein, Alison P; Li, Donghui; Risch, Harvey; Sanders, Alan R; Hsu, Li; Schoen, Robert E; Brenner, Hermann; Stolzenberg-Solomon, Rachael; Gejman, Pablo; Lan, Qing; Rothman, Nathaniel; Amundadottir, Laufey T; Landi, Maria Teresa; Levinson, Douglas F; Chanock, Stephen J; Chatterjee, Nilanjan

    2016-12-01

    Recent heritability analyses have indicated that genome-wide association studies (GWAS) have the potential to improve genetic risk prediction for complex diseases based on polygenic risk score (PRS), a simple modelling technique that can be implemented using summary-level data from the discovery samples. We herein propose modifications to improve the performance of PRS. We introduce threshold-dependent winner's-curse adjustments for marginal association coefficients that are used to weight the single-nucleotide polymorphisms (SNPs) in PRS. Further, as a way to incorporate external functional/annotation knowledge that could identify subsets of SNPs highly enriched for associations, we propose variable thresholds for SNPs selection. We applied our methods to GWAS summary-level data of 14 complex diseases. Across all diseases, a simple winner's curse correction uniformly led to enhancement of performance of the models, whereas incorporation of functional SNPs was beneficial only for selected diseases. Compared to the standard PRS algorithm, the proposed methods in combination led to notable gain in efficiency (25-50% increase in the prediction R2) for 5 of 14 diseases. As an example, for GWAS of type 2 diabetes, winner's curse correction improved prediction R2 from 2.29% based on the standard PRS to 3.10% (P = 0.0017) and incorporating functional annotation data further improved R2 to 3.53% (P = 2×10-5). Our simulation studies illustrate why differential treatment of certain categories of functional SNPs, even when shown to be highly enriched for GWAS-heritability, does not lead to proportionate improvement in genetic risk-prediction because of non-uniform linkage disequilibrium structure.

  19. Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data.

    Directory of Open Access Journals (Sweden)

    Jianxin Shi

    2016-12-01

    Full Text Available Recent heritability analyses have indicated that genome-wide association studies (GWAS have the potential to improve genetic risk prediction for complex diseases based on polygenic risk score (PRS, a simple modelling technique that can be implemented using summary-level data from the discovery samples. We herein propose modifications to improve the performance of PRS. We introduce threshold-dependent winner's-curse adjustments for marginal association coefficients that are used to weight the single-nucleotide polymorphisms (SNPs in PRS. Further, as a way to incorporate external functional/annotation knowledge that could identify subsets of SNPs highly enriched for associations, we propose variable thresholds for SNPs selection. We applied our methods to GWAS summary-level data of 14 complex diseases. Across all diseases, a simple winner's curse correction uniformly led to enhancement of performance of the models, whereas incorporation of functional SNPs was beneficial only for selected diseases. Compared to the standard PRS algorithm, the proposed methods in combination led to notable gain in efficiency (25-50% increase in the prediction R2 for 5 of 14 diseases. As an example, for GWAS of type 2 diabetes, winner's curse correction improved prediction R2 from 2.29% based on the standard PRS to 3.10% (P = 0.0017 and incorporating functional annotation data further improved R2 to 3.53% (P = 2×10-5. Our simulation studies illustrate why differential treatment of certain categories of functional SNPs, even when shown to be highly enriched for GWAS-heritability, does not lead to proportionate improvement in genetic risk-prediction because of non-uniform linkage disequilibrium structure.

  20. Genome-Wide Chromatin Immunoprecipitation Sequencing Analysis Shows that WhiB Is a Transcription Factor That Cocontrols Its Regulon with WhiA To Initiate Developmental Cell Division in Streptomyces

    Directory of Open Access Journals (Sweden)

    Matthew J. Bush

    2016-04-01

    Full Text Available WhiB is the founding member of a family of proteins (the WhiB-like [Wbl] family that carry a [4Fe-4S] iron-sulfur cluster and play key roles in diverse aspects of the biology of actinomycetes, including pathogenesis, antibiotic resistance, and the control of development. In Streptomyces, WhiB is essential for the process of developmentally controlled cell division that leads to sporulation. The biochemical function of Wbl proteins has been controversial; here, we set out to determine unambiguously if WhiB functions as a transcription factor using chromatin immunoprecipitation sequencing (ChIP-seq in Streptomyces venezuelae. In the first demonstration of in vivo genome-wide Wbl binding, we showed that WhiB regulates the expression of key genes required for sporulation by binding upstream of ~240 transcription units. Strikingly, the WhiB regulon is identical to the previously characterized WhiA regulon, providing an explanation for the identical phenotypes of whiA and whiB mutants. Using ChIP-seq, we demonstrated that in vivo DNA binding by WhiA depends on WhiB and vice versa, showing that WhiA and WhiB function cooperatively to control expression of a common set of WhiAB target genes. Finally, we show that mutation of the cysteine residues that coordinate the [4Fe-4S] cluster in WhiB prevents DNA binding by both WhiB and WhiA in vivo.

  1. Genome wide association studies using a new nonparametric model reveal the genetic architecture of 17 agronomic traits in an enlarged maize association panel.

    Science.gov (United States)

    Yang, Ning; Lu, Yanli; Yang, Xiaohong; Huang, Juan; Zhou, Yang; Ali, Farhan; Wen, Weiwei; Liu, Jie; Li, Jiansheng; Yan, Jianbing

    2014-09-01

    Association mapping is a powerful approach for dissecting the genetic architecture of complex quantitative traits using high-density SNP markers in maize. Here, we expanded our association panel size from 368 to 513 inbred lines with 0.5 million high quality SNPs using a two-step data-imputation method which combines identity by descent (IBD) based projection and k-nearest neighbor (KNN) algorithm. Genome-wide association studies (GWAS) were carried out for 17 agronomic traits with a panel of 513 inbred lines applying both mixed linear model (MLM) and a new method, the Anderson-Darling (A-D) test. Ten loci for five traits were identified using the MLM method at the Bonferroni-corrected threshold -log10 (P) >5.74 (α=1). Many loci ranging from one to 34 loci (107 loci for plant height) were identified for 17 traits using the A-D test at the Bonferroni-corrected threshold -log10 (P) >7.05 (α=0.05) using 556809 SNPs. Many known loci and new candidate loci were only observed by the A-D test, a few of which were also detected in independent linkage analysis. This study indicates that combining IBD based projection and KNN algorithm is an efficient imputation method for inferring large missing genotype segments. In addition, we showed that the A-D test is a useful complement for GWAS analysis of complex quantitative traits. Especially for traits with abnormal phenotype distribution, controlled by moderate effect loci or rare variations, the A-D test balances false positives and statistical power. The candidate SNPs and associated genes also provide a rich resource for maize genetics and breeding.

  2. Genome wide association studies using a new nonparametric model reveal the genetic architecture of 17 agronomic traits in an enlarged maize association panel.

    Directory of Open Access Journals (Sweden)

    Ning Yang

    2014-09-01

    Full Text Available Association mapping is a powerful approach for dissecting the genetic architecture of complex quantitative traits using high-density SNP markers in maize. Here, we expanded our association panel size from 368 to 513 inbred lines with 0.5 million high quality SNPs using a two-step data-imputation method which combines identity by descent (IBD based projection and k-nearest neighbor (KNN algorithm. Genome-wide association studies (GWAS were carried out for 17 agronomic traits with a panel of 513 inbred lines applying both mixed linear model (MLM and a new method, the Anderson-Darling (A-D test. Ten loci for five traits were identified using the MLM method at the Bonferroni-corrected threshold -log10 (P >5.74 (α=1. Many loci ranging from one to 34 loci (107 loci for plant height were identified for 17 traits using the A-D test at the Bonferroni-corrected threshold -log10 (P >7.05 (α=0.05 using 556809 SNPs. Many known loci and new candidate loci were only observed by the A-D test, a few of which were also detected in independent linkage analysis. This study indicates that combining IBD based projection and KNN algorithm is an efficient imputation method for inferring large missing genotype segments. In addition, we showed that the A-D test is a useful complement for GWAS analysis of complex quantitative traits. Especially for traits with abnormal phenotype distribution, controlled by moderate effect loci or rare variations, the A-D test balances false positives and statistical power. The candidate SNPs and associated genes also provide a rich resource for maize genetics and breeding.

  3. Genome-wide gene expression analysis of anguillid herpesvirus 1

    NARCIS (Netherlands)

    Beurden, van S.J.; Peeters, B.P.H.; Rottier, P.J.M.; Davison, A.A.; Engelsma, M.Y.

    2013-01-01

    Background Whereas temporal gene expression in mammalian herpesviruses has been studied extensively, little is known about gene expression in fish herpesviruses. Here we report a genome-wide transcription analysis of a fish herpesvirus, anguillid herpesvirus 1, in cell culture, studied during the

  4. Identification of Transcribed Enhancers by Genome-Wide Chromatin Immunoprecipitation Sequencing.

    Science.gov (United States)

    Blinka, Steven; Reimer, Michael H; Pulakanti, Kirthi; Pinello, Luca; Yuan, Guo-Cheng; Rao, Sridhar

    2017-01-01

    Recent work has shown that RNA polymerase II-mediated transcription at distal cis-regulatory elements serves as a mark of highly active enhancers. Production of noncoding RNAs at enhancers, termed eRNAs, correlates with higher expression of genes that the enhancer interacts with; hence, eRNAs provide a new tool to model gene activity in normal and disease tissues. Moreover, this unique class of noncoding RNA has diverse roles in transcriptional regulation. Transcribed enhancers can be identified by a common signature of epigenetic marks by overlaying a series of genome-wide chromatin immunoprecipitation and RNA sequencing datasets. A computational approach to filter non-enhancer elements and other classes of noncoding RNAs is essential to not cloud downstream analysis. Here we present a protocol that combines wet and dry bench methods to accurately identify transcribed enhancers genome-wide as well as an experimental procedure to validate these datasets.

  5. Genome-wide expression profiling of complex regional pain syndrome.

    Directory of Open Access Journals (Sweden)

    Eun-Heui Jin

    Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.

  6. Mammalian RNA polymerase II core promoters: insights from genome-wide studies

    DEFF Research Database (Denmark)

    Sandelin, Albin; Carninci, Piero; Lenhard, Boris

    2007-01-01

    The identification and characterization of mammalian core promoters and transcription start sites is a prerequisite to understanding how RNA polymerase II transcription is controlled. New experimental technologies have enabled genome-wide discovery and characterization of core promoters, revealin...

  7. Genome-wide gene pathway analysis of psychotic illness symptom dimensions based on a new schizophrenia-specific model of the OPCRIT.

    Science.gov (United States)

    Docherty, Anna R; Bigdeli, T Bernard; Edwards, Alexis C; Bacanu, Silviu; Lee, Donghyung; Neale, Michael C; Wormley, Brandon K; Walsh, Dermot; O'Neill, F Anthony; Riley, Brien P; Kendler, Kenneth S; Fanous, Ayman H

    2015-05-01

    Empirically derived phenotypic measurements have the potential to enhance gene-finding efforts in schizophrenia. Previous research based on factor analyses of symptoms has typically included schizoaffective cases. Deriving factor loadings from analysis of only narrowly defined schizophrenia cases could yield more sensitive factor scores for gene pathway and gene ontology analyses. Using an Irish family sample, this study 1) factor analyzed clinician-rated Operational Criteria Checklist items in cases with schizophrenia only, 2) scored the full sample based on these factor loadings, and 3) implemented genome-wide association, gene-based, and gene-pathway analysis of these SCZ-based symptom factors (final N=507). Three factors emerged from the analysis of the schizophrenia cases: a manic, a depressive, and a positive symptom factor. In gene-based analyses of these factors, multiple genes had qschizophrenia.

  8. Functional annotations of diabetes nephropathy susceptibility loci through analysis of genome-wide renal gene expression in rat models of diabetes mellitus

    DEFF Research Database (Denmark)

    Hu, Yaomin; Kaisaki, Pamela J; Argoud, Karène

    2009-01-01

    of spontaneous (genetically determined) mild hyperglycaemia and insulin resistance (Goto-Kakizaki-GK) and experimentally induced severe hyperglycaemia (Wistar-Kyoto-WKY rats injected with streptozotocin [STZ]). RESULTS: Different patterns of transcription regulation in the two rat models of diabetes likely...... number of protein coding sequences of unknown function which can be considered as functional and, when they map to DN loci, positional candidates for DN. Further expression analysis of rat orthologs of human DN positional candidate genes provided functional annotations of known and novel genes...

  9. Genome-wide association study identifies five new schizophrenia loci.

    LENUS (Irish Health Repository)

    Ripke, Stephan

    2011-10-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).

  10. Power analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Klein Robert J

    2007-08-01

    Full Text Available Abstract Background Genome-wide association studies are a promising new tool for deciphering the genetics of complex diseases. To choose the proper sample size and genotyping platform for such studies, power calculations that take into account genetic model, tag SNP selection, and the population of interest are required. Results The power of genome-wide association studies can be computed using a set of tag SNPs and a large number of genotyped SNPs in a representative population, such as available through the HapMap project. As expected, power increases with increasing sample size and effect size. Power also depends on the tag SNPs selected. In some cases, more power is obtained by genotyping more individuals at fewer SNPs than fewer individuals at more SNPs. Conclusion Genome-wide association studies should be designed thoughtfully, with the choice of genotyping platform and sample size being determined from careful power calculations.

  11. Diversity of eukaryotic DNA replication origins revealed by genome-wide analysis of chromatin structure.

    Directory of Open Access Journals (Sweden)

    Nicolas M Berbenetz

    2010-09-01

    Full Text Available Eukaryotic DNA replication origins differ both in their efficiency and in the characteristic time during S phase when they become active. The biological basis for these differences remains unknown, but they could be a consequence of chromatin structure. The availability of genome-wide maps of nucleosome positions has led to an explosion of information about how nucleosomes are assembled at transcription start sites, but no similar maps exist for DNA replication origins. Here we combine high-resolution genome-wide nucleosome maps with comprehensive annotations of DNA replication origins to identify patterns of nucleosome occupancy at eukaryotic replication origins. On average, replication origins contain a nucleosome depleted region centered next to the ACS element, flanked on both sides by arrays of well-positioned nucleosomes. Our analysis identified DNA sequence properties that correlate with nucleosome occupancy at replication origins genome-wide and that are correlated with the nucleosome-depleted region. Clustering analysis of all annotated replication origins revealed a surprising diversity of nucleosome occupancy patterns. We provide evidence that the origin recognition complex, which binds to the origin, acts as a barrier element to position and phase nucleosomes on both sides of the origin. Finally, analysis of chromatin reconstituted in vitro reveals that origins are inherently nucleosome depleted. Together our data provide a comprehensive, genome-wide view of chromatin structure at replication origins and suggest a model of nucleosome positioning at replication origins in which the underlying sequence occludes nucleosomes to permit binding of the origin recognition complex, which then (likely in concert with nucleosome modifiers and remodelers positions nucleosomes adjacent to the origin to promote replication origin function.

  12. Inbreeding in genome-wide selection

    NARCIS (Netherlands)

    Daetwyler, H.D.; Villanueva, B.; Bijma, P.; Woolliams, J.A.

    2007-01-01

    Traditional selection methods, such as sib and best linear unbiased prediction (BLUP) selection, which increased genetic gain by increasing accuracy of evaluation have also led to an increased rate of inbreeding per generation (¿FG). This is not necessarily the case with genome-wide selection, which

  13. The importance of context to the genetic architecture of diabetes-related traits is revealed in a genome-wide scan of a LG/J × SM/J murine model.

    Science.gov (United States)

    Lawson, Heather A; Lee, Arthur; Fawcett, Gloria L; Wang, Bing; Pletscher, L Susan; Maxwell, Taylor J; Ehrich, Thomas H; Kenney-Hunt, Jane P; Wolf, Jason B; Semenkovich, Clay F; Cheverud, James M

    2011-04-01

    Variations in diabetic phenotypes are caused by complex interactions of genetic effects, environmental factors, and the interplay between the two. We tease apart these complex interactions by examining genome-wide genetic and epigenetic effects on diabetes-related traits among different sex, diet, and sex-by-diet cohorts in a Mus musculus model. We conducted a genome-wide scan for quantitative trait loci that affect serum glucose and insulin levels and response to glucose stress in an F(16) Advanced Intercross Line of the LG/J and SM/J intercross (Wustl:LG,SM-G16). Half of each sibship was fed a high-fat diet and half was fed a relatively low-fat diet. Context-dependent genetic (additive and dominance) and epigenetic (parent-of-origin imprinting) effects were characterized by partitioning animals into sex, diet, and sex-by-diet cohorts. We found that different cohorts often have unique genetic effects at the same loci, and that genetic signals can be masked or erroneously assigned to specific cohorts if they are not considered individually. Our data demonstrate that the effects of genes on complex trait variation are highly context-dependent and that the same genomic sequence can affect traits differently depending on an individual's sex and/or dietary environment. Our results have important implications for studies of complex traits in humans.

  14. Genome-wide inference of regulatory networks in Streptomyces coelicolor

    Directory of Open Access Journals (Sweden)

    Takano Eriko

    2010-10-01

    Full Text Available Abstract Background The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. Results In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Conclusions Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification.

  15. Genome-Wide Prediction of C. elegans Genetic Interactions

    OpenAIRE

    Zhong, Weiwei; Sternberg, Paul W.

    2006-01-01

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms—Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster—and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understandin...

  16. Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle

    Directory of Open Access Journals (Sweden)

    Dalrymple Brian P

    2011-01-01

    Full Text Available Abstract Background Gene regulation by transcription factors (TF is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. Results We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs. We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. Conclusion The pivotal implication of our research is two-fold: (1 there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2 this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.

  17. Genome-wide patterns of nucleotide polymorphism in domesticated rice

    DEFF Research Database (Denmark)

    Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection...

  18. Genome-Wide Identification, 3D Modeling, Expression and Enzymatic Activity Analysis of Cell Wall Invertase Gene Family from Cassava (Manihot esculenta Crantz

    Directory of Open Access Journals (Sweden)

    Yuan Yao

    2014-04-01

    Full Text Available The cell wall invertases play a crucial role on the sucrose metabolism in plant source and sink organs. In this research, six cell wall invertase genes (MeCWINV1-6 were cloned from cassava. All the MeCWINVs contain a putative signal peptide with a predicted extracellular location. The overall predicted structures of the MeCWINV1-6 are similar to AtcwINV1. Their N-terminus domain forms a β-propeller module and three conserved sequence domains (NDPNG, RDP and WECP(VD, in which the catalytic residues are situated in these domains; while the C-terminus domain consists of a β-sandwich module. The predicted structure of Pro residue from the WECPD (MeCWINV1, 2, 5, and 6, and Val residue from the WECVD (MeCWINV3 and 4 are different. The activity of MeCWINV1 and 3 were higher than other MeCWINVs in leaves and tubers, which suggested that sucrose was mainly catalyzed by the MeCWINV1 and 3 in the apoplastic space of cassava source and sink organs. The transcriptional levels of all the MeCWINVs and their enzymatic activity were lower in tubers than in leaves at all the stages during the cassava tuber development. It suggested that the major role of the MeCWINVs was on the regulation of carbon exportation from source leaves, and the ratio of sucrose to hexose in the apoplasts; the role of these enzymes on the sucrose unloading to tuber was weaker.

  19. Genome-wide identification, 3D modeling, expression and enzymatic activity analysis of cell wall invertase gene family from cassava (Manihot esculenta Crantz).

    Science.gov (United States)

    Yao, Yuan; Geng, Meng-Ting; Wu, Xiao-Hui; Liu, Jiao; Li, Rui-Mei; Hu, Xin-Wen; Guo, Jian-Chun

    2014-04-28

    The cell wall invertases play a crucial role on the sucrose metabolism in plant source and sink organs. In this research, six cell wall invertase genes (MeCWINV1-6) were cloned from cassava. All the MeCWINVs contain a putative signal peptide with a predicted extracellular location. The overall predicted structures of the MeCWINV1-6 are similar to AtcwINV1. Their N-terminus domain forms a β-propeller module and three conserved sequence domains (NDPNG, RDP and WECP(V)D), in which the catalytic residues are situated in these domains; while the C-terminus domain consists of a β-sandwich module. The predicted structure of Pro residue from the WECPD (MeCWINV1, 2, 5, and 6), and Val residue from the WECVD (MeCWINV3 and 4) are different. The activity of MeCWINV1 and 3 were higher than other MeCWINVs in leaves and tubers, which suggested that sucrose was mainly catalyzed by the MeCWINV1 and 3 in the apoplastic space of cassava source and sink organs. The transcriptional levels of all the MeCWINVs and their enzymatic activity were lower in tubers than in leaves at all the stages during the cassava tuber development. It suggested that the major role of the MeCWINVs was on the regulation of carbon exportation from source leaves, and the ratio of sucrose to hexose in the apoplasts; the role of these enzymes on the sucrose unloading to tuber was weaker.

  20. Genome-wide analysis of poly(A) site selection in Schizosaccharomyces pombe

    KAUST Repository

    Schlackow, M.

    2013-10-23

    Polyadenylation of pre-mRNAs, a critical step in eukaryotic gene expression, is mediated by cis elements collectively called the polyadenylation signal. Genome-wide analysis of such polyadenylation signals was missing in fission yeast, even though it is an important model organism. We demonstrate that the canonical AATAAA motif is the most frequent and functional polyadenylation signal in Schizosaccharomyces pombe. Using analysis of RNA-Seq data sets from cells grown under various physiological conditions, we identify 3\\' UTRs for nearly 90% of the yeast genes. Heterogeneity of cleavage sites is common, as is alternative polyadenylation within and between conditions. We validated the computationally identified sequence elements likely to promote polyadenylation by functional assays, including qRT-PCR and 3\\'RACE analysis. The biological importance of the AATAAA motif is underlined by functional analysis of the genes containing it. Furthermore, it has been shown that convergent genes require trans elements, like cohesin for efficient transcription termination. Here we show that convergent genes lacking cohesin (on chromosome 2) are generally associated with longer overlapping mRNA transcripts. Our bioinformatic and experimental genome-wide results are summarized and can be accessed and customized in a user-friendly database Pomb(A).

  1. VIGoR: Variational Bayesian Inference for Genome-Wide Regression

    Directory of Open Access Journals (Sweden)

    Akio Onogi

    2016-04-01

    Full Text Available Genome-wide regression using a number of genome-wide markers as predictors is now widely used for genome-wide association mapping and genomic prediction. We developed novel software for genome-wide regression which we named VIGoR (variational Bayesian inference for genome-wide regression. Variational Bayesian inference is computationally much faster than widely used Markov chain Monte Carlo algorithms. VIGoR implements seven regression methods, and is provided as a command line program package for Linux/Mac, and as a cross-platform R package. In addition to model fitting, cross-validation and hyperparameter tuning using cross-validation can be automatically performed by modifying a single argument. VIGoR is available at https://github.com/Onogi/VIGoR. The R package is also available at https://cran.r-project.org/web/packages/VIGoR/index.html.

  2. Profiling genome-wide DNA methylation.

    Science.gov (United States)

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  3. Genome-wide binding and transcriptome analysis of human farnesoid X receptor in primary human hepatocytes.

    Directory of Open Access Journals (Sweden)

    Le Zhan

    Full Text Available Farnesoid X receptor (FXR, NR1H4 is a ligand-activated transcription factor, belonging to the nuclear receptor superfamily. FXR is highly expressed in the liver and is essential in regulating bile acid homeostasis. FXR deficiency is implicated in numerous liver diseases and mice with modulation of FXR have been used as animal models to study liver physiology and pathology. We have reported genome-wide binding of FXR in mice by chromatin immunoprecipitation - deep sequencing (ChIP-seq, with results indicating that FXR may be involved in regulating diverse pathways in liver. However, limited information exists for the functions of human FXR and the suitability of using murine models to study human FXR functions.In the current study, we performed ChIP-seq in primary human hepatocytes (PHHs treated with a synthetic FXR agonist, GW4064 or DMSO control. In parallel, RNA deep sequencing (RNA-seq and RNA microarray were performed for GW4064 or control treated PHHs and wild type mouse livers, respectively.ChIP-seq showed similar profiles of genome-wide FXR binding in humans and mice in terms of motif analysis and pathway prediction. However, RNA-seq and microarray showed more different transcriptome profiles between PHHs and mouse livers upon GW4064 treatment.In summary, we have established genome-wide human FXR binding and transcriptome profiles. These results will aid in determining the human FXR functions, as well as judging to what level the mouse models could be used to study human FXR functions.

  4. Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots

    Science.gov (United States)

    Transcription initiation, essential to gene expression regulation, involves recruitment of basal transcription factors to the core promoter elements (CPEs). The distribution of currently known CPEs across plant genomes is largely unknown. This is the first large scale genome-wide report on the compu...

  5. Genome-Wide Approaches to Drosophila Heart Development

    Directory of Open Access Journals (Sweden)

    Manfred Frasch

    2016-05-01

    Full Text Available The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level.

  6. Genome-wide transcriptional analysis of grapevine berry ripening reveals a set of genes similarly modulated during three seasons and the occurrence of an oxidative burst at vèraison

    Directory of Open Access Journals (Sweden)

    Dal Ri Antonio

    2007-11-01

    Full Text Available Abstract Background Grapevine (Vitis species is among the most important fruit crops in terms of cultivated area and economic impact. Despite this relevance, little is known about the transcriptional changes and the regulatory circuits underlying the biochemical and physical changes occurring during berry development. Results Fruit ripening in the non-climacteric crop species Vitis vinifera L. has been investigated at the transcriptional level by the use of the Affymetrix Vitis GeneChip® which contains approximately 14,500 unigenes. Gene expression data obtained from berries sampled before and after véraison in three growing years, were analyzed to identify genes specifically involved in fruit ripening and to investigate seasonal influences on the process. From these analyses a core set of 1477 genes was found which was similarly modulated in all seasons. We were able to separate ripening specific isoforms within gene families and to identify ripening related genes which appeared strongly regulated also by the seasonal weather conditions. Transcripts annotation by Gene Ontology vocabulary revealed five overrepresented functional categories of which cell wall organization and biogenesis, carbohydrate and secondary metabolisms and stress response were specifically induced during the ripening phase, while photosynthesis was strongly repressed. About 19% of the core gene set was characterized by genes involved in regulatory processes, such as transcription factors and transcripts related to hormonal metabolism and signal transduction. Auxin, ethylene and light emerged as the main stimuli influencing berry development. In addition, an oxidative burst, previously not detected in grapevine, characterized by rapid accumulation of H2O2 starting from véraison and by the modulation of many ROS scavenging enzymes, was observed. Conclusion The time-course gene expression analysis of grapevine berry development has identified the occurrence of two well

  7. Genome-wide patterns of nucleotide polymorphism in domesticated rice.

    Directory of Open Access Journals (Sweden)

    Ana L Caicedo

    2007-09-01

    Full Text Available Domesticated Asian rice (Oryza sativa is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i selectively neutral population bottleneck model, (ii bottleneck plus migration model, (iii multiple selective sweeps model, and (iv bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation.

  8. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    cells are capable of regulating their gene expression, so that each cell can only express a particular set of genes yielding limited numbers of proteins with specialized functions. Therefore a rigid control of differential gene expression is necessary for cellular diversity. On the other hand, aberrant...... gene regulation will disrupt the cell’s fundamental processes, which in turn can cause disease. Hence, understanding gene regulation is essential for deciphering the code of life. Along with the development of high throughput sequencing (HTS) technology and the subsequent large-scale data analysis......, genome-wide assays have increased our understanding of gene regulation significantly. This thesis describes the integration and analysis of HTS data across different important aspects of gene regulation. Gene expression can be regulated at different stages when the genetic information is passed from gene...

  9. High-Performance Mixed Models Based Genome-Wide Association Analysis with omicABEL software [v1; ref status: indexed, http://f1000r.es/40b

    Directory of Open Access Journals (Sweden)

    Diego Fabregat-Traver

    2014-08-01

    Full Text Available To raise the power of genome-wide association studies (GWAS and avoid false-positive results in structured populations, one can rely on mixed model based tests. When large samples are used, and when multiple traits are to be studied in the ’omics’ context, this approach becomes computationally challenging. Here we consider the problem of mixed-model based GWAS for arbitrary number of traits, and demonstrate that for the analysis of single-trait and multiple-trait scenarios different computational algorithms are optimal. We implement these optimal algorithms in a high-performance computing framework that uses state-of-the-art linear algebra kernels, incorporates optimizations, and avoids redundant computations, increasing throughput while reducing memory usage and energy consumption. We show that, compared to existing libraries, our algorithms and software achieve considerable speed-ups. The OmicABEL software described in this manuscript is available under the GNU GPL v. 3 license as part of the GenABEL project for statistical genomics at http: //www.genabel.org/packages/OmicABEL.

  10. Physiological, biochemical, and genome-wide transcriptional analysis reveals that elevated CO2 mitigates the impact of combined heat wave and drought stress in Arabidopsis thaliana at multiple organizational levels.

    Science.gov (United States)

    Zinta, Gaurav; AbdElgawad, Hamada; Domagalska, Malgorzata A; Vergauwen, Lucia; Knapen, Dries; Nijs, Ivan; Janssens, Ivan A; Beemster, Gerrit T S; Asard, Han

    2014-12-01

    Climate changes increasingly threaten plant growth and productivity. Such changes are complex and involve multiple environmental factors, including rising CO2 levels and climate extreme events. As the molecular and physiological mechanisms underlying plant responses to realistic future climate extreme conditions are still poorly understood, a multiple organizational level analysis (i.e. eco-physiological, biochemical, and transcriptional) was performed, using Arabidopsis exposed to incremental heat wave and water deficit under ambient and elevated CO2 . The climate extreme resulted in biomass reduction, photosynthesis inhibition, and considerable increases in stress parameters. Photosynthesis was a major target as demonstrated at the physiological and transcriptional levels. In contrast, the climate extreme treatment induced a protective effect on oxidative membrane damage, most likely as a result of strongly increased lipophilic antioxidants and membrane-protecting enzymes. Elevated CO2 significantly mitigated the negative impact of a combined heat and drought, as apparent in biomass reduction, photosynthesis inhibition, chlorophyll fluorescence decline, H2 O2 production, and protein oxidation. Analysis of enzymatic and molecular antioxidants revealed that the stress-mitigating CO2 effect operates through up-regulation of antioxidant defense metabolism, as well as by reduced photorespiration resulting in lowered oxidative pressure. Therefore, exposure to future climate extreme episodes will negatively impact plant growth and production, but elevated CO2 is likely to mitigate this effect.

  11. Sequence-based prediction of single nucleosome positioning and genome-wide nucleosome occupancy.

    Science.gov (United States)

    van der Heijden, Thijn; van Vugt, Joke J F A; Logie, Colin; van Noort, John

    2012-09-18

    Nucleosome positioning dictates eukaryotic DNA compaction and access. To predict nucleosome positions in a statistical mechanics model, we exploited the knowledge that nucleosomes favor DNA sequences with specific periodically occurring dinucleotides. Our model is the first to capture both dyad position within a few base pairs, and free binding energy within 2 k(B)T, for all the known nucleosome positioning sequences. By applying Percus's equation to the derived energy landscape, we isolate sequence effects on genome-wide nucleosome occupancy from other factors that may influence nucleosome positioning. For both in vitro and in vivo systems, three parameters suffice to predict nucleosome occupancy with correlation coefficients of respectively 0.74 and 0.66. As predicted, we find the largest deviations in vivo around transcription start sites. This relatively simple algorithm can be used to guide future studies on the influence of DNA sequence on chromatin organization.

  12. A genome-wide screen for Schizosaccharomyces pombe deletion mutants that affect telomere length

    Institute of Scientific and Technical Information of China (English)

    Ning-Ning Liu; Tian Xu Han; Li-Lin Du; Jin-Qiu Zhou

    2010-01-01

    @@ Dear Editor, Both the fission yeast Schizosaccharomyces pombe and the budding yeast Saccharomyces cerevisiae are popular model organisms, and studies using these models have provided many informative clues for solving fundamental biological questions [1], such as DNA replication,cell cycle regulation and gene transcription. Since the completion of genome sequencing of these fungi [2, 3],systematic genetic modification, e.g. gene deletion, has become possible, and genome-wide phenotypic screening for gene function has been widely carried out. For example, Askree et al. and Gatbonton et al. examined the telomere-length change in about 4 800 non-essential gene deletion mutants of S. cerevisiae, and found that about 250 genes are involved in telomere-length regulation.

  13. Genome-wide prediction of C. elegans genetic interactions.

    Science.gov (United States)

    Zhong, Weiwei; Sternberg, Paul W

    2006-03-10

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms-Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster-and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understanding of gene functions. We experimentally tested the predicted interactions for two human disease-related genes and identified 14 new modifiers.

  14. Genome-wide approaches to understanding behaviour in Drosophila melanogaster.

    Science.gov (United States)

    Neville, Megan; Goodwin, Stephen F

    2012-09-01

    Understanding how an organism exhibits specific behaviours remains a major and important biological question. Studying behaviour in a simple model organism like the fruit fly Drosophila melanogaster has the advantages of advanced molecular genetics approaches along with well-defined anatomy and physiology. With advancements in functional genomic technologies, researchers are now attempting to uncover genes and pathways involved in complex behaviours on a genome-wide scale. A systems-level network approach, which will include genomic approaches, to study behaviour will be key to understanding the regulation and modulation of behaviours and the importance of context in regulating them.

  15. Genome-wide measurement of RNA folding energies.

    Science.gov (United States)

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  16. Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Thomas Julie

    2010-02-01

    Full Text Available Abstract Background Genome-wide computational analysis of alternative splicing (AS in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs. Results Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy. Conclusions The extent of AS in Chlamydomonas that we observed is much

  17. Evaluating genome-wide DNA methylation changes in mice by Methylation Specific Digital Karyotyping

    Directory of Open Access Journals (Sweden)

    Maruoka Shuichiro

    2008-12-01

    Full Text Available Abstract Background The study of genome-wide DNA methylation changes has become more accessible with the development of various array-based technologies though when studying species other than human the choice of applications are limited and not always within reach. In this study, we adapted and tested the applicability of Methylation Specific Digital Karyotyping (MSDK, a non-array based method, for the prospective analysis of epigenetic changes after perinatal nutritional modifications in a mouse model of allergic airway disease. MSDK is a sequenced based method that allows a comprehensive and unbiased methylation profiling. The method generates 21 base pairs long sequence tags derived from specific locations in the genome. The resulting tag frequencies determine in a quantitative manner the methylation level of the corresponding loci. Results Genomic DNA from whole lung was isolated and subjected to MSDK analysis using the methylation-sensitive enzyme Not I as the mapping enzyme and Nla III as the fragmenting enzyme. In a pair wise comparison of the generated mouse MSDK libraries we identified 158 loci that are significantly differentially methylated (P-value = 0.05 after perinatal dietary changes in our mouse model. Quantitative methylation specific PCR and sequence analysis of bisulfate modified genomic DNA confirmed changes in methylation at specific loci. Differences in genomic MSDK tag counts for a selected set of genes, correlated well with changes in transcription levels as measured by real-time PCR. Furthermore serial analysis of gene expression profiling demonstrated a dramatic difference in expressed transcripts in mice exposed to perinatal nutritional changes. Conclusion The genome-wide methylation survey applied in this study allowed for an unbiased methylation profiling revealing subtle changes in DNA methylation in mice maternally exposed to dietary changes in methyl-donor content. The MSDK method is applicable for mouse models

  18. Genome-wide analysis correlates Ayurveda Prakriti.

    Science.gov (United States)

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K; Prasanna, B V; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S; Dedge, Amrish P; Bharadwaj, Ramachandra; Gangadharan, G G; Nair, Sreekumaran; Gopinath, Puthiya M; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-10-29

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as "Prakriti". To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10(-5)) were significantly different between Prakritis, without any confounding effect of stratification, after 10(6) permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India's traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine.

  19. Genome-Wide Analysis of Protein and mRNA Copy Numbers in Single Escherichia coli Cells with Single-Molecule Sensitivity.

    Science.gov (United States)

    Taniguchi, Yuichi

    2015-01-01

    Single-cell proteomic and transcriptomic analysis is an emerging approach for providing quantitative and comprehensive characterization of gene functions in individual cells. This analysis, however, is often hampered by insufficient sensitivity for detecting low copy gene expression products such as transcription factors and regulators. Here I describe a method for the quantitative genome-wide analysis of single-cell protein and mRNA copy numbers with single molecule sensitivity for the model organism Escherichia coli.

  20. Genome-wide identification of KANADI1 target genes.

    Directory of Open Access Journals (Sweden)

    Paz Merelo

    Full Text Available Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV. A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown.

  1. Genome wide selection in Citrus breeding.

    Science.gov (United States)

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  2. Genome-wide mapping of DNA strand breaks.

    Directory of Open Access Journals (Sweden)

    Frédéric Leduc

    Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  3. Genome-wide identification of direct HBx genomic targets

    KAUST Repository

    Guerrieri, Francesca

    2017-02-17

    Background The Hepatitis B Virus (HBV) HBx regulatory protein is required for HBV replication and involved in HBV-related carcinogenesis. HBx interacts with chromatin modifying enzymes and transcription factors to modulate histone post-translational modifications and to regulate viral cccDNA transcription and cellular gene expression. Aiming to identify genes and non-coding RNAs (ncRNAs) directly targeted by HBx, we performed a chromatin immunoprecipitation sequencing (ChIP-Seq) to analyse HBV recruitment on host cell chromatin in cells replicating HBV. Results ChIP-Seq high throughput sequencing of HBx-bound fragments was used to obtain a high-resolution, unbiased, mapping of HBx binding sites across the genome in HBV replicating cells. Protein-coding genes and ncRNAs involved in cell metabolism, chromatin dynamics and cancer were enriched among HBx targets together with genes/ncRNAs known to modulate HBV replication. The direct transcriptional activation of genes/miRNAs that potentiate endocytosis (Ras-related in brain (RAB) GTPase family) and autophagy (autophagy related (ATG) genes, beclin-1, miR-33a) and the transcriptional repression of microRNAs (miR-138, miR-224, miR-576, miR-596) that directly target the HBV pgRNA and would inhibit HBV replication, contribute to HBx-mediated increase of HBV replication. Conclusions Our ChIP-Seq analysis of HBx genome wide chromatin recruitment defined the repertoire of genes and ncRNAs directly targeted by HBx and led to the identification of new mechanisms by which HBx positively regulates cccDNA transcription and HBV replication.

  4. Genome-wide microarray analysis of tomato roots showed defined responses to iron deficiency

    Directory of Open Access Journals (Sweden)

    Zamboni Anita

    2012-03-01

    Full Text Available Abstract Background Plants react to iron deficiency stress adopting different kind of adaptive responses. Tomato, a Strategy I plant, improves iron uptake through acidification of rhizosphere, reduction of Fe3+ to Fe2+ and transport of Fe2+ into the cells. Large-scale transcriptional analyses of roots under iron deficiency are only available for a very limited number of plant species with particular emphasis for Arabidopsis thaliana. Regarding tomato, an interesting model species for Strategy I plants and an economically important crop, physiological responses to Fe-deficiency have been thoroughly described and molecular analyses have provided evidence for genes involved in iron uptake mechanisms and their regulation. However, no detailed transcriptome analysis has been described so far. Results A genome-wide transcriptional analysis, performed with a chip that allows to monitor the expression of more than 25,000 tomato transcripts, identified 97 differentially expressed transcripts by comparing roots of Fe-deficient and Fe-sufficient tomato plants. These transcripts are related to the physiological responses of tomato roots to the nutrient stress resulting in an improved iron uptake, including regulatory aspects, translocation, root morphological modification and adaptation in primary metabolic pathways, such as glycolysis and TCA cycle. Other genes play a role in flavonoid biosynthesis and hormonal metabolism. Conclusions The transcriptional characterization confirmed the presence of the previously described mechanisms to adapt to iron starvation in tomato, but also allowed to identify other genes potentially playing a role in this process, thus opening new research perspectives to improve the knowledge on the tomato root response to the nutrient deficiency.

  5. PPARgamma in adipocyte differentiation and metabolism--novel insights from genome-wide studies.

    Science.gov (United States)

    Siersbaek, Rasmus; Nielsen, Ronni; Mandrup, Susanne

    2010-08-04

    Adipocyte differentiation is controlled by a tightly regulated transcriptional cascade in which PPARgamma and members of the C/EBP family are key players. Here we review the roles of PPARgamma and C/EBPs in adipocyte differentiation with emphasis on the recently published genome-wide binding profiles for PPARgamma and C/EBPalpha. Interestingly, these analyses show that PPARgamma and C/EBPalpha binding sites are associated with most genes that are induced during adipogenesis suggesting direct activation of many more adipocyte genes than previously anticipated. Furthermore, an extensive overlap between the C/EBPalpha and PPARgamma cistromes indicate a hitherto unrecognized direct crosstalk between these transcription factors. As more genome-wide data emerge in the future, this crosstalk will likely be found to include several other adipogenic transcription factors. Copyright (c) 2010 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  6. Genome-wide association study of serum selenium concentrations

    DEFF Research Database (Denmark)

    Gong, Jian; Hsu, Li; Harrison, Tabitha

    2013-01-01

    Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated...... this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening and the Women’s Health Initiative (WHI). We...... tested association between 2,474,333 single nucleotide polymorphisms (SNPs) and serum selenium concentrations using linear regression models. In the first stage (PLCO) 41 SNPs clustered in 15 regions had p

  7. Genome-Wide Analysis of Human MicroRNA Stability

    Directory of Open Access Journals (Sweden)

    Yang Li

    2013-01-01

    Full Text Available Increasing studies have shown that microRNA (miRNA stability plays important roles in physiology. However, the global picture of miRNA stability remains largely unknown. Here, we had analyzed genome-wide miRNA stability across 10 diverse cell types using miRNA arrays. We found that miRNA stability shows high dynamics and diversity both within individual cells and across cell types. Strikingly, we observed a negative correlation between miRNA stability and miRNA expression level, which is different from current findings on other biological molecules such as proteins and mRNAs that show positive and not negative correlations between stability and expression level. This finding indicates that miRNA has a distinct action mode, which we called “rapid production, rapid turnover; slow production, slow turnover.” This mode further suggests that high expression miRNAs normally degrade fast and may endow the cell with special properties that facilitate cellular status-transition. Moreover, we revealed that the stability of miRNAs is affected by cohorts of factors that include miRNA targets, transcription factors, nucleotide content, evolution, associated disease, and environmental factors. Together, our results provided an extensive description of the global landscape, dynamics, and distinct mode of human miRNA stability, which provide help in investigating their functions in physiology and pathophysiology.

  8. Genome-wide transcriptome analysis of 150 cell samples†

    Science.gov (United States)

    Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G.; Davis, Ronald W.; Toner, Mehmet

    2013-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples. PMID:20023796

  9. Genome-wide transcriptome analysis of 150 cell samples.

    Science.gov (United States)

    Irimia, Daniel; Mindrinos, Michael; Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G; Davis, Ronald W; Toner, Mehmet

    2009-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples.

  10. Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots.

    Directory of Open Access Journals (Sweden)

    Sunita Kumari

    Full Text Available Transcription initiation, essential to gene expression regulation, involves recruitment of basal transcription factors to the core promoter elements (CPEs. The distribution of currently known CPEs across plant genomes is largely unknown. This is the first large scale genome-wide report on the computational prediction of CPEs across eight plant genomes to help better understand the transcription initiation complex assembly. The distribution of thirteen known CPEs across four monocots (Brachypodium distachyon, Oryza sativa ssp. japonica, Sorghum bicolor, Zea mays and four dicots (Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera, Glycine max reveals the structural organization of the core promoter in relation to the TATA-box as well as with respect to other CPEs. The distribution of known CPE motifs with respect to transcription start site (TSS exhibited positional conservation within monocots and dicots with slight differences across all eight genomes. Further, a more refined subset of annotated genes based on orthologs of the model monocot (O. sativa ssp. japonica and dicot (A. thaliana genomes supported the positional distribution of these thirteen known CPEs. DNA free energy profiles provided evidence that the structural properties of promoter regions are distinctly different from that of the non-regulatory genome sequence. It also showed that monocot core promoters have lower DNA free energy than dicot core promoters. The comparison of monocot and dicot promoter sequences highlights both the similarities and differences in the core promoter architecture irrespective of the species-specific nucleotide bias. This study will be useful for future work related to genome annotation projects and can inspire research efforts aimed to better understand regulatory mechanisms of transcription.

  11. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    IP-seq and small RNA-seq, we delineated the landscape of the promoters with bidirectional transcriptions that yield steady-state RNA in only one directions (Paper III). A subsequent motif analysis enabled us to uncover specific DNA signals – early polyA sites – that make RNA on the reverse strand sensitive...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V......). Gene enrichment analysis on the detected NMD substrates revealed an unappreciated NMD-based regulatory mechanism of the genes hosting multiple intronic snoRNAs, which can facilitate differential expression of individual snoRNAs from a single host gene locus. Finally, supported by RNA-seq and small RNA-seq...

  12. Insights into the role of DNA methylation in diatoms by genome-wide profiling in Phaeodactylum tricornutum.

    Science.gov (United States)

    Veluchamy, Alaguraj; Lin, Xin; Maumus, Florian; Rivarola, Maximo; Bhavsar, Jaysheel; Creasy, Todd; O'Brien, Kimberly; Sengamalay, Naomi A; Tallon, Luke J; Smith, Andrew D; Rayko, Edda; Ahmed, Ikhlak; Le Crom, Stéphane; Farrant, Gregory K; Sgro, Jean-Yves; Olson, Sue A; Bondurant, Sandra Splinter; Allen, Andrew E; Allen, Andrew; Rabinowicz, Pablo D; Sussman, Michael R; Bowler, Chris; Tirichine, Leïla

    2013-01-01

    DNA cytosine methylation is a widely conserved epigenetic mark in eukaryotes that appears to have critical roles in the regulation of genome structure and transcription. Genome-wide methylation maps have so far only been established from the supergroups Archaeplastida and Unikont. Here we report the first whole-genome methylome from a stramenopile, the marine model diatom Phaeodactylum tricornutum. Around 6% of the genome is intermittently methylated in a mosaic pattern. We find extensive methylation in transposable elements. We also detect methylation in over 320 genes. Extensive gene methylation correlates strongly with transcriptional silencing and differential expression under specific conditions. By contrast, we find that genes with partial methylation tend to be constitutively expressed. These patterns contrast with those found previously in other eukaryotes. By going beyond plants, animals and fungi, this stramenopile methylome adds significantly to our understanding of the evolution of DNA methylation in eukaryotes.

  13. Genome-wide association study of clinical dimensions of schizophrenia

    DEFF Research Database (Denmark)

    Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H;

    2012-01-01

    Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....

  14. Cancer genetic association studies in the genome-wide age

    OpenAIRE

    Savage, Sharon A

    2008-01-01

    Genome-wide association studies of hundreds of thousands of SNPs have led to a deluge of studies of genetic variation in cancer and other common diseases. Large case–control and cohort studies have identified novel SNPs as markers of cancer risk. Genome-wide association study SNP data have also advanced understanding of population-specific genetic variation. While studies of risk profiles, combinations of SNPs that may increase cancer risk, are not yet clinically applicable, future, large-sca...

  15. Genome-wide polymorphisms show unexpected targets of natural selection

    OpenAIRE

    Pespeni, Melissa H.; Garfield, David A.; Manier, Mollie K; Palumbi, Stephen R.

    2011-01-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species...

  16. Genome-wide association study of multiplex schizophrenia pedigrees

    DEFF Research Database (Denmark)

    Levinson, Douglas F; Shi, Jianxin; Wang, Kai

    2012-01-01

    The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....

  17. Genome-Wide Prediction of DNA Methylation Using DNA Composition and Sequence Complexity in Human

    Science.gov (United States)

    Wu, Chengchao; Yao, Shixin; Li, Xinghao; Chen, Chujia; Hu, Xuehai

    2017-01-01

    DNA methylation plays a significant role in transcriptional regulation by repressing activity. Change of the DNA methylation level is an important factor affecting the expression of target genes and downstream phenotypes. Because current experimental technologies can only assay a small proportion of CpG sites in the human genome, it is urgent to develop reliable computational models for predicting genome-wide DNA methylation. Here, we proposed a novel algorithm that accurately extracted sequence complexity features (seven features) and developed a support-vector-machine-based prediction model with integration of the reported DNA composition features (trinucleotide frequency and GC content, 65 features) by utilizing the methylation profiles of embryonic stem cells in human. The prediction results from 22 human chromosomes with size-varied windows showed that the 600-bp window achieved the best average accuracy of 94.7%. Moreover, comparisons with two existing methods further showed the superiority of our model, and cross-species predictions on mouse data also demonstrated that our model has certain generalization ability. Finally, a statistical test of the experimental data and the predicted data on functional regions annotated by ChromHMM found that six out of 10 regions were consistent, which implies reliable prediction of unassayed CpG sites. Accordingly, we believe that our novel model will be useful and reliable in predicting DNA methylation. PMID:28212312

  18. Genome-wide dynamics of a bacterial response to antibiotics that target the cell envelope

    Directory of Open Access Journals (Sweden)

    Tran Ngat

    2011-05-01

    Full Text Available Abstract Background A decline in the discovery of new antibacterial drugs, coupled with a persistent rise in the occurrence of drug-resistant bacteria, has highlighted antibiotics as a diminishing resource. The future development of new drugs with novel antibacterial activities requires a detailed understanding of adaptive responses to existing compounds. This study uses Streptomyces coelicolor A3(2 as a model system to determine the genome-wide transcriptional response following exposure to three antibiotics (vancomycin, moenomycin A and bacitracin that target distinct stages of cell wall biosynthesis. Results A generalised response to all three antibiotics was identified which involves activation of transcription of the cell envelope stress sigma factor σE, together with elements of the stringent response, and of the heat, osmotic and oxidative stress regulons. Attenuation of this system by deletion of genes encoding the osmotic stress sigma factor σB or the ppGpp synthetase RelA reduced resistance to both vancomycin and bacitracin. Many antibiotic-specific transcriptional changes were identified, representing cellular processes potentially important for tolerance to each antibiotic. Sensitivity studies using mutants constructed on the basis of the transcriptome profiling confirmed a role for several such genes in antibiotic resistance, validating the usefulness of the approach. Conclusions Antibiotic inhibition of bacterial cell wall biosynthesis induces both common and compound-specific transcriptional responses. Both can be exploited to increase antibiotic susceptibility. Regulatory networks known to govern responses to environmental and nutritional stresses are also at the core of the common antibiotic response, and likely help cells survive until any specific resistance mechanisms are fully functional.

  19. Genome-wide Association of Yorkie with Chromatin and Chromatin-Remodeling Complexes

    Directory of Open Access Journals (Sweden)

    Hyangyee Oh

    2013-02-01

    Full Text Available The Hippo pathway regulates growth through the transcriptional coactivator Yorkie, but how Yorkie promotes transcription remains poorly understood. We address this by characterizing Yorkie’s association with chromatin and by identifying nuclear partners that effect transcriptional activation. Coimmunoprecipitation and mass spectrometry identify GAGA factor (GAF, the Brahma complex, and the Mediator complex as Yorkie-associated nuclear protein complexes. All three are required for Yorkie’s transcriptional activation of downstream genes, and GAF and the Brahma complex subunit Moira interact directly with Yorkie. Genome-wide chromatin-binding experiments identify thousands of Yorkie sites, most of which are associated with elevated transcription, based on genome-wide analysis of messenger RNA and histone H3K4Me3 modification. Chromatin binding also supports extensive functional overlap between Yorkie and GAF. Our studies suggest a widespread role for Yorkie as a regulator of transcription and identify recruitment of the chromatin-modifying GAF protein and BRM complex as a molecular mechanism for transcriptional activation by Yorkie.

  20. Thermodynamic Model of Transcription Elongation

    Science.gov (United States)

    Tadigotla, Vasisht; O'Maoileidigh, Daibhid; Sengupta, Anirvan; Epshtein, Vitaly; Ebright, Richard; Nudler, Evgeny; Ruckenstein, Andrei

    2006-03-01

    We present a statistical mechanics approach to the prediction of backtracked pauses in prokaryotic transcription elongation derived from structural models of the transcription elongation complex (TEC). Our algorithm is based on the thermodynamic stability of TEC along the DNA template calculated from the sequence dependent free-energy of DNA-DNA, DNA-RNA and RNA-RNA base pairing associated with (a) the translocation and size fluctuations of the transcription bubble; (b) the changes in the DNA-RNA hybrid; and (c) the changes in the RNA folding free-energy. The calculations involve no adjustable parameters apart from a cutoff used to discriminate paused from non-paused complexes. When applied to 100 experimental pauses in transcription elongation by E. coli RNA polymerase on ten DNA templates the approach produces highly statistically significant results. Transcription elongation is an inherently kinetic process and a simplified kinetic model with the same predictive power is presented separately.

  1. ReplicationDomain: a visualization tool and comparative database for genome-wide replication timing data

    Directory of Open Access Journals (Sweden)

    Yokochi Tomoki

    2008-12-01

    Full Text Available Abstract Background Eukaryotic DNA replication is regulated at the level of large chromosomal domains (0.5–5 megabases in mammals within which replicons are activated relatively synchronously. These domains replicate in a specific temporal order during S-phase and our genome-wide analyses of replication timing have demonstrated that this temporal order of domain replication is a stable property of specific cell types. Results We have developed ReplicationDomain http://www.replicationdomain.org as a web-based database for analysis of genome-wide replication timing maps (replication profiles from various cell lines and species. This database also provides comparative information of transcriptional expression and is configured to display any genome-wide property (for instance, ChIP-Chip or ChIP-Seq data via an interactive web interface. Our published microarray data sets are publicly available. Users may graphically display these data sets for a selected genomic region and download the data displayed as text files, or alternatively, download complete genome-wide data sets. Furthermore, we have implemented a user registration system that allows registered users to upload their own data sets. Upon uploading, registered users may choose to: (1 view their data sets privately without sharing; (2 share with other registered users; or (3 make their published or "in press" data sets publicly available, which can fulfill journal and funding agencies' requirements for data sharing. Conclusion ReplicationDomain is a novel and powerful tool to facilitate the comparative visualization of replication timing in various cell types as well as other genome-wide chromatin features and is considerably faster and more convenient than existing browsers when viewing multi-megabase segments of chromosomes. Furthermore, the data upload function with the option of private viewing or sharing of data sets between registered users should be a valuable resource for the

  2. Genome-wide promoter methylome of small renal masses.

    Directory of Open Access Journals (Sweden)

    Ilsiya Ibragimova

    Full Text Available The majority of renal cell carcinoma (RCC is now incidentally detected and presents as small renal masses (SRMs defined as ≤ 4 cm in size. SRMs are heterogeneous comprising several histological types of RCC each with different biology and behavior, and benign tumors mainly oncocytoma. The varied prognosis of the different types of renal tumor has implications for management options. A key epigenetic alteration involved in the initiation and progression of cancer is aberrant methylation in the promoter region of a gene. The hypermethylation is associated with transcriptional repression and is an important mechanism of inactivation of tumor suppressor genes in neoplastic cells. We have determined the genome-wide promoter methylation profiles of 47 pT1a and 2 pT1b clear cell, papillary or chromophobe RCC, 25 benign renal oncocytoma ≤ 4 cm and 4 normal renal parenchyma specimens by Infinium HumanMethylation27 beadchip technology. We identify gene promoter hypermethylation signatures that distinguish clear cell and papillary from each other, from chromophobe and oncocytoma, and from normal renal cells. Pairwise comparisons revealed genes aberrantly hypermethylated in a tumor type but unmethylated in normal, and often unmethylated in the other renal tumor types. About 0.4% to 1.7% of genes comprised the promoter methylome in SRMs. The Infinium methylation score for representative genes was verified by gold standard technologies. The genes identified as differentially methylated implicate pathways involved in metabolism, tissue response to injury, epithelial to mesenchymal transition (EMT, signal transduction and G-protein coupled receptors (GPCRs, cancer, and stem cell regulation in the biology of RCC. Our findings contribute towards an improved understanding of the development of RCC, the different biology and behavior of histological types, and discovery of molecular subtypes. The differential methylation signatures may have utility in early

  3. Genome-wide methylation analyses in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Rose K Lai

    Full Text Available Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM. Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1, 5 methyl-deoxycytidine (5m-dC and 5 hydroxylmethyl-deoxycytidine (5hm-dC in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.

  4. A novel statistic for genome-wide interaction analysis.

    Science.gov (United States)

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDRanalysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  5. Genome-wide association mapping for female fertility traits in Danish and Swedish Holstein cattle

    DEFF Research Database (Denmark)

    Sahana, G; Guldbrandtsen, B; Bendixen, C;

    2010-01-01

    A genome-wide association study was conducted using a mixed model analysis for QTL for fertility traits in Danish and Swedish Holstein cattle. The analysis incorporated 2,531 progeny tested bulls, and a total of 36 387 SNP markers on 29 bovine autosomes were used. Eleven fertility traits were...

  6. Genome-wide association study and premature ovarian failure.

    Science.gov (United States)

    Christin-Maitre, S; Tachdjian, G

    2010-05-01

    Premature ovarian failure (POF) is defined as an amenorrhea for more than 4months, associated with elevated gonadotropins, usually higher than 20mIU/ml, occurring in a woman before the age of 40. Some candidate genes have been identified in the past 15years, such as FOXL2, FSHR, BMP15, GDF9, Xfra premutation. However, POF etiology remains unknown in more than 90% of cases. The first strategy to identify candidate gene, apart from studying genes involved in ovarian failure in animal models, relies on the study of X chromosome deletions and X;autosome translocations in patients. The second strategy is based on linkage analysis, the third one on Comparative Genomic Hybridization (CGH) array. The latest strategy relies on Genome-Wide Association Studies (GWAS). This technique consists in screening single nucleotide polymorphisms (SNPs) in patients and controls. So far, three studies have been performed and have identified different loci potentially linked to POF, such as PTHB1 and ADAMTS19. However, replications in independent cohorts need to be performed. GWAS studies on large cohorts of women with POF should find new candidate genes in the near future.

  7. Genome-Wide Association Study of Serum Selenium Concentrations

    Directory of Open Access Journals (Sweden)

    Ulrike Peters

    2013-05-01

    Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.

  8. Genome-wide DNA methylation maps in follicular lymphoma cells determined by methylation-enriched bisulfite sequencing.

    Directory of Open Access Journals (Sweden)

    Jeong-Hyeon Choi

    Full Text Available BACKGROUND: Follicular lymphoma (FL is a form of non-Hodgkin's lymphoma (NHL that arises from germinal center (GC B-cells. Despite the significant advances in immunotherapy, FL is still not curable. Beyond transcriptional profiling and genomics datasets, there currently is no epigenome-scale dataset or integrative biology approach that can adequately model this disease and therefore identify novel mechanisms and targets for successful prevention and treatment of FL. METHODOLOGY/PRINCIPAL FINDINGS: We performed methylation-enriched genome-wide bisulfite sequencing of FL cells and normal CD19(+ B-cells using 454 sequencing technology. The methylated DNA fragments were enriched with methyl-binding proteins, treated with bisulfite, and sequenced using the Roche-454 GS FLX sequencer. The total number of bases covered in the human genome was 18.2 and 49.3 million including 726,003 and 1.3 million CpGs in FL and CD19(+ B-cells, respectively. 11,971 and 7,882 methylated regions of interest (MRIs were identified respectively. The genome-wide distribution of these MRIs displayed significant differences between FL and normal B-cells. A reverse trend in the distribution of MRIs between the promoter and the gene body was observed in FL and CD19(+ B-cells. The MRIs identified in FL cells also correlated well with transcriptomic data and ChIP-on-Chip analyses of genome-wide histone modifications such as tri-methyl-H3K27, and tri-methyl-H3K4, indicating a concerted epigenetic alteration in FL cells. CONCLUSIONS/SIGNIFICANCE: This study is the first to provide a large scale and comprehensive analysis of the DNA methylation sequence composition and distribution in the FL epigenome. These integrated approaches have led to the discovery of novel and frequent targets of aberrant epigenetic alterations. The genome-wide bisulfite sequencing approach developed here can be a useful tool for profiling DNA methylation in clinical samples.

  9. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    Science.gov (United States)

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.

  10. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    Science.gov (United States)

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viňuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10−8) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  11. De novo assembly of a genome-wide transcriptome map of Vicia faba (L.) for transfer cell research.

    Science.gov (United States)

    Arun-Chinnappa, Kiruba S; McCurdy, David W

    2015-01-01

    Vicia faba (L.) is an important cool-season grain legume species used widely in agriculture but also in plant physiology research, particularly as an experimental model to study transfer cell (TC) development. TCs are specialized nutrient transport cells in plants, characterized by invaginated wall ingrowths with amplified plasma membrane surface area enriched with transporter proteins that facilitate nutrient transfer. Many TCs are formed by trans-differentiation from differentiated cells at apoplasmic/symplasmic boundaries in nutrient transport. Adaxial epidermal cells of isolated cotyledons can be induced to form functional TCs, thus providing a valuable experimental system to investigate genetic regulation of TC trans-differentiation. The genome of V. faba is exceedingly large (ca. 13 Gb), however, and limited genomic information is available for this species. To provide a resource for future transcript profiling of epidermal TC differentiation, we have undertaken de novo assembly of a genome-wide transcriptome map for V. faba. Illumina paired-end sequencing of total RNA pooled from different tissues and different stages, including isolated cotyledons induced to form epidermal TCs, generated 69.5 M reads, of which 65.8 M were used for assembly following trimming and quality control. Assembly using a De-Bruijn graph-based approach generated 21,297 contigs, of which 80.6% were successfully annotated against GO terms. The assembly was validated against known V. faba cDNAs held in GenBank, including transcripts previously identified as being specifically expressed in epidermal cells across TC trans-differentiation. This genome-wide transcriptome map therefore provides a valuable tool for future transcript profiling of epidermal TC trans-differentiation, and also enriches the genetic resources available for this important legume crop species.

  12. A novel statistic for genome-wide interaction analysis.

    Directory of Open Access Journals (Sweden)

    Xuesen Wu

    2010-09-01

    Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  13. Genome-wide de Novo Prediction of Proximal and Distal Tissue-Specific Enhancers

    Energy Technology Data Exchange (ETDEWEB)

    Loots, G G; Ovcharenko, I V

    2005-11-03

    Determining how transcriptional regulatory networks are encoded in the human genome is essential for understanding how cellular processes are directed. Here, we present a novel approach for systematically predicting tissue specific regulatory elements (REs) that blends genome-wide expression profiling, vertebrate genome comparisons, and pattern analysis of transcription factor binding sites. This analysis yields 4,670 candidate REs in the human genome with distinct tissue specificities, the majority of which reside far away from transcription start sites. We identify key transcription factors (TFs) for 34 distinct tissues and demonstrate that tissue-specific gene expression relies on multiple regulatory pathways employing similar, but different cohorts of interacting TFs. The methods and results we describe provide a global view of tissue specific gene regulation in humans, and propose a strategy for deciphering the transcriptional regulatory code in eukaryotes.

  14. Genome-wide identification of expression quantitative trait loci (eQTLs in human heart.

    Directory of Open Access Journals (Sweden)

    Tamara T Koopmann

    Full Text Available In recent years genome-wide association studies (GWAS have uncovered numerous chromosomal loci associated with various electrocardiographic traits and cardiac arrhythmia predisposition. A considerable fraction of these loci lie within inter-genic regions. The underlying trait-associated variants likely reside in regulatory regions and exert their effect by modulating gene expression. Hence, the key to unraveling the molecular mechanisms underlying these cardiac traits is to interrogate variants for association with differential transcript abundance by expression quantitative trait locus (eQTL analysis. In this study we conducted an eQTL analysis of human heart. For a total of 129 left ventricular samples that were collected from non-diseased human donor hearts, genome-wide transcript abundance and genotyping was determined using microarrays. Each of the 18,402 transcripts and 897,683 SNP genotypes that remained after pre-processing and stringent quality control were tested for eQTL effects. We identified 771 eQTLs, regulating 429 unique transcripts. Overlaying these eQTLs with cardiac GWAS loci identified novel candidates for studies aimed at elucidating the functional and transcriptional impact of these loci. Thus, this work provides for the first time a comprehensive eQTL map of human heart: a powerful and unique resource that enables systems genetics approaches for the study of cardiac traits.

  15. Improved statistics for genome-wide interaction analysis.

    Science.gov (United States)

    Ueki, Masao; Cordell, Heather J

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new "joint effects" statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al

  16. Genome-wide association mapping in Arabidopsis identifies previously known flowering time and pathogen resistance genes.

    Directory of Open Access Journals (Sweden)

    María José Aranzana

    2005-11-01

    Full Text Available There is currently tremendous interest in the possibility of using genome-wide association mapping to identify genes responsible for natural variation, particularly for human disease susceptibility. The model plant Arabidopsis thaliana is in many ways an ideal candidate for such studies, because it is a highly selfing hermaphrodite. As a result, the species largely exists as a collection of naturally occurring inbred lines, or accessions, which can be genotyped once and phenotyped repeatedly. Furthermore, linkage disequilibrium in such a species will be much more extensive than in a comparable outcrossing species. We tested the feasibility of genome-wide association mapping in A. thaliana by searching for associations with flowering time and pathogen resistance in a sample of 95 accessions for which genome-wide polymorphism data were available. In spite of an extremely high rate of false positives due to population structure, we were able to identify known major genes for all phenotypes tested, thus demonstrating the potential of genome-wide association mapping in A. thaliana and other species with similar patterns of variation. The rate of false positives differed strongly between traits, with more clinal traits showing the highest rate. However, the false positive rates were always substantial regardless of the trait, highlighting the necessity of an appropriate genomic control in association studies.

  17. A genome-wide characterization of microRNA genes in maize.

    Directory of Open Access Journals (Sweden)

    Lifang Zhang

    2009-11-01

    Full Text Available MicroRNAs (miRNAs are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families. For 25 families, expression was verified by deep-sequencing of small RNA libraries that were prepared from an assortment of maize tissues. PCR-RACE amplification of 68 miRNA transcript precursors, representing 18 families conserved across several plant species, showed that splice variation and the use of alternative transcriptional start and stop sites is common within this class of genes. Comparison of sequence variation data from diverse maize inbred lines versus teosinte accessions suggest that the mature miRNAs are under strong purifying selection while the flanking sequences evolve equivalently to other genes. Since maize is derived from an ancient tetraploid, the effect of whole-genome duplication on miRNA evolution was examined. We found that, like protein-coding genes, duplicated miRNA genes underwent extensive gene-loss, with approximately 35% of ancestral sites retained as duplicate homoeologous miRNA genes. This number is higher than that observed with protein-coding genes. A search for putative miRNA targets indicated bias towards genes in regulatory and metabolic pathways. As maize is one of the principal models for plant growth and development, this study will serve as a foundation for future research into the functional roles of miRNA genes.

  18. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    Science.gov (United States)

    ... Q R S T U V W X Y Z We want to hear from you You are here: News & Events 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of ...

  19. A genome-wide scan for preeclampsia in the Netherlands

    NARCIS (Netherlands)

    Lachmeijer, AMA; Arngrimsson, R; Bastiaans, EJ; Frigge, ML; Pals, G; Sigurdardottir, S; Stefansson, H; Palsson, B; Nicolae, D; Kong, A; Aarnoudse, JG; Gulcher, [No Value; Dekker, GA; ten Kate, LP; Stefansson, K

    2001-01-01

    Preeclampsia, hallmarked by de novo hypertension and proteinuria in pregnancy, has a familial tendency. Recently, a large Icelandic genome-wide scan provided evidence for a maternal susceptibility locus for preeclampsia on chromosome 2p13 which was confirmed by a genome scan from Australia and New

  20. Genome-wide RNA Tomography in the Zebrafish Embryo

    NARCIS (Netherlands)

    Junker, Jan Philipp; Noël, Emily S; Guryev, Victor; Peterson, Kevin A; Shah, Gopi; Huisken, Jan; McMahon, Andrew P; Berezikov, Eugene; Bakkers, Jeroen; van Oudenaarden, Alexander

    2014-01-01

    Advancing our understanding of embryonic development is heavily dependent on identification of novel pathways or regulators. Although genome-wide techniques such as RNA sequencing are ideally suited for discovering novel candidate genes, they are unable to yield spatially resolved information in

  1. Genome-wide RNA Tomography in the zebrafish embryo

    NARCIS (Netherlands)

    Junker, Jan Philipp; Noël, Emily S; Guryev, Victor; Peterson, Kevin A; Shah, Gopi; Huisken, Jan; McMahon, Andrew P; Berezikov, Eugene; Bakkers, Jeroen; van Oudenaarden, Alexander

    2014-01-01

    Advancing our understanding of embryonic development is heavily dependent on identification of novel pathways or regulators. Although genome-wide techniques such as RNA sequencing are ideally suited for discovering novel candidate genes, they are unable to yield spatially resolved information in

  2. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  3. A genome-wide association study of anorexia nervosa

    NARCIS (Netherlands)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, Annemarie; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countri

  4. Genome-Wide Association Analysis in Primary Sclerosing Cholangitis

    NARCIS (Netherlands)

    T.H. Karlsen; A. Franke; E. Melum; A.. Kaser; J.R. Hov; T. Balschun; B.A. Lie; A. Bergquist; C. Schramm; T.J. Weismüller; D. Gotthardt; C. Rust; E.E.R. Philipp; T. Fritz; L. Henckaerts; R. Weersma; P. Stokkers; C.Y. Ponsioen; C. Wijmenga; M. Sterneck; M. Nothnagel; J. Hampe; A. Teufel; H. Runz; P. Rosenstiel; A. Stiehl; S. Vermeire; U. Beuers; M. Manns; E. Schrumpf; K.M. Boberg; S. Schreiber

    2010-01-01

    BACKGROUND & AIMS: We aimed to characterize the genetic susceptibility to primary sclerosing cholangitis (PSC) by means of a genome-wide association analysis of single nucleotide polymorphism (SNP) markers. METHODS: A total of 443,816 SNPs on the Affymetrix SNP Array 5.0 (Affymetrix, Santa Clara, CA

  5. Genome-wide association study of Tourette's syndrome

    NARCIS (Netherlands)

    Scharf, J. M.; Yu, D.; Mathews, C. A.; Neale, B. M.; Stewart, S. E.; Fagerness, J. A.; Evans, P.; Gamazon, E.; Edlund, C. K.; Service, S. K.; Tikhomirov, A.; Osiecki, L.; Illmann, C.; Pluzhnikov, A.; Konkashbaev, A.; Davis, L. K.; Han, B.; Crane, J.; Moorjani, P.; Crenshaw, A. T.; Parkin, M. A.; Reus, V. I.; Lowe, T. L.; Rangel-Lugo, M.; Chouinard, S.; Dion, Y.; Girard, S.; Cath, D. C.; Smit, J. H.; King, R. A.; Fernandez, T. V.; Leckman, J. F.; Kidd, K. K.; Kidd, J. R.; Pakstis, A. J.; State, M. W.; Herrera, L. D.; Romero, R.; Fournier, E.; Sandor, P.; Barr, C. L.; Phan, N.; Gross-Tsur, V.; Benarroch, F.; Pollak, Y.; Budman, C. L.; Bruun, R. D.; Erenberg, G.; Naarden, A. L.; Lee, P. C.; Weiss, N.; Kremeyer, B.; Berrio, G. B.; Campbell, D. D.; Cardona Silgado, J. C.; Ochoa, W. C.; Mesa Restrepo, S. C.; Muller, H.; Valencia Duarte, A. V.; Lyon, G. J.; Leppert, M.; Morgan, J.; Weiss, R.; Grados, M. A.; Anderson, K.; Davarya, S.; Singer, H.; Walkup, J.; Jankovic, J.; Tischfield, J. A.; Heiman, G. A.; Gilbert, D. L.; Hoekstra, P. J.; Robertson, M. M.; Kurlan, R.; Liu, C.; Gibbs, J. R.; Singleton, A.; Hardy, J.; Strengman, E.; Ophoff, R. A.; Wagner, M.; Moessner, R.; Mirel, D. B.; Posthuma, D.; Sabatti, C.; Eskin, E.; Conti, D. V.; Knowles, J. A.; Ruiz-Linares, A.; Rouleau, G. A.; Purcell, S.; Heutink, P.; Oostra, B. A.; McMahon, W. M.; Freimer, N. B.; Cox, N. J.; Pauls, D. L.

    2013-01-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association

  6. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  7. Genome-wide association study identifies five new schizophrenia loci

    DEFF Research Database (Denmark)

    Ripke, Stephan; Sanders, Alan R; Kendler, Kenneth S

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yiel...

  8. Genome-wide significant risk associations for mucinous ovarian carcinoma

    DEFF Research Database (Denmark)

    Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan;

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at...

  9. Probabilistic protein function prediction from heterogeneous genome-wide data.

    Directory of Open Access Journals (Sweden)

    Naoki Nariai

    Full Text Available Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function.

  10. From human monocytes to genome-wide binding sites--a protocol for small amounts of blood: monocyte isolation/ChIP-protocol/library amplification/genome wide computational data analysis.

    Directory of Open Access Journals (Sweden)

    Sebastian Weiterer

    Full Text Available Chromatin immunoprecipitation in combination with a genome-wide analysis via high-throughput sequencing is the state of the art method to gain genome-wide representation of histone modification or transcription factor binding profiles. However, chromatin immunoprecipitation analysis in the context of human experimental samples is limited, especially in the case of blood cells. The typically extremely low yields of precipitated DNA are usually not compatible with library amplification for next generation sequencing. We developed a highly reproducible protocol to present a guideline from the first step of isolating monocytes from a blood sample to analyse the distribution of histone modifications in a genome-wide manner.The protocol describes the whole work flow from isolating monocytes from human blood samples followed by a high-sensitivity and small-scale chromatin immunoprecipitation assay with guidance for generating libraries compatible with next generation sequencing from small amounts of immunoprecipitated DNA.

  11. Genome-wide analysis of interactions between ATP-dependent chromatin remodeling and histone modifications

    Directory of Open Access Journals (Sweden)

    Wang Jiang

    2009-07-01

    Full Text Available Abstract Background ATP-dependent chromatin remodeling and the covalent modification of histones play central roles in determining chromatin structure and function. Although several specific interactions between these two activities have been elaborated, the global landscape remains to be elucidated. Results In this paper, we have developed a computational method to generate the first genome-wide landscape of interactions between ATP-dependent chromatin remodeling and the covalent modification of histones in Saccharomyces cerevisiae. Our method succeeds in identifying known interactions and uncovers many previously unknown interactions between these two activities. Analysis of the genome-wide picture revealed that transcription-related modifications tend to interact with more chromatin remodelers. Our results also demonstrate that most chromatin remodeling-modification interactions act via interactions of remodelers with both histone-modifying enzymes and histone residues. We also found that the co-occurrence of both modification and remodeling has significantly different influences on multiple gene features (e.g. nucleosome occupancy compared with the presence of either one. Conclusion We gave the first genome-wide picture of ATP-dependent chromatin remodeling-histone modification interactions. We also revealed how these two activities work together to regulate chromatin structure and function. Our results suggest that distinct strategies for regulating chromatin activity are selectively employed by genes with different properties.

  12. Novel Loci Associated with Usual Sleep Duration: The CHARGE Consortium Genome-Wide Association Study

    Science.gov (United States)

    Gottlieb, Daniel J.; Hek, Karin; Chen, Ting-hsu; Watson, Nathaniel F.; Eiriksdottir, Gudny; Byrne, Enda M.; Cornelis, Marilyn; Warby, Simon C.; Bandinelli, Stefania; Cherkas, Lynn; Evans, Daniel S.; Grabe, Hans J.; Lahti, Jari; Li, Man; Lehtimäki, Terho; Lumley, Thomas; Marciante, Kristin D.; Pérusse, Louis; Psaty, Bruce M.; Robbins, John; Tranah, Gregory J.; Vink, Jacqueline M.; Wilk, Jemma B.; Stafford, Jeanette M.; Bellis, Claire; Biffar, Reiner; Bouchard, Claude; Cade, Brian; Curhan, Gary C.; Eriksson, Johan G.; Ewert, Ralf; Ferrucci, Luigi; Fülöp, Tibor; Gehrman, Philip R.; Goodloe, Robert; Harris, Tamara B.; Heath, Andrew C.; Hernandez, Dena; Hofman, Albert; Hottenga, Jouke-Jan; Hunter, David J.; Jensen, Majken K.; Johnson, Andrew D.; Kähönen, Mika; Kao, Linda; Kraft, Peter; Larkin, Emma K.; Lauderdale, Diane S.; Luik, Annemarie I.; Medici, Marco; Montgomery, Grant W.; Palotie, Aarno; Patel, Sanjay R.; Pistis, Giorgio; Porcu, Eleonora; Quaye, Lydia; Raitakari, Olli; Redline, Susan; Rimm, Eric B.; Rotter, Jerome I.; Smith, Albert V.; Spector, Tim D.; Teumer, Alexander; Uitterlinden, André G.; Vohl, Marie-Claude; Widen, Elisabeth; Willemsen, Gonneke; Young, Terry; Zhang, Xiaoling; Liu, Yongmei; Blangero, John; Boomsma, Dorret I.; Gudnason, Vilmundur; Hu, Frank; Mangino, Massimo; Martin, Nicholas G.; O’Connor, George T.; Stone, Katie L.; Tanaka, Toshiko; Viikari, Jorma; Gharib, Sina A.; Punjabi, Naresh M.; Räikkönen, Katri; Völzke, Henry; Mignot, Emmanuel; Tiemeier, Henning

    2015-01-01

    Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study of usual sleep duration was conducted using 18 population-based cohorts totaling 47,180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35–80 kb upstream from the thyroid-specific transcription factor PAX8 (lowest p=1.1 ×10−9). This finding was replicated in an African-American sample of 4771 individuals (lowest p=9.3 × 10−4). The strongest combined association was at rs1823125 (p=1.5 × 10−10, minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 minutes longer per night. The alleles associated with longer sleep duration were associated in previous genome-wide association studies with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease. PMID:25469926

  13. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  14. Genome-wide signatures of convergent evolution in echolocating mammals.

    Science.gov (United States)

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J

    2013-10-10

    Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized.

  15. Genome-wide association between DNA methylation and alternative splicing in an invertebrate

    Directory of Open Access Journals (Sweden)

    Flores Kevin

    2012-09-01

    Full Text Available Abstract Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee and Nasonia vitripennis (jewel wasp analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice

  16. Thermodynamic and Kinetic Modeling of Transcriptional Pausing

    National Research Council Canada - National Science Library

    Vasisht R. Tadigotla; Dáibhid Ó. Maoiléidigh; Anirvan M. Sengupta; Vitaly Epshtein; Richard H. Ebright; Evgeny Nudler; Andrei E. Ruckenstein

    2006-01-01

    We present a statistical mechanics approach for the prediction of backtracked pauses in bacterial transcription elongation derived from structural models of the transcription elongation complex (EC...

  17. Cooperative genome-wide analysis shows increased homozygosity in early onset Parkinson's disease.

    Directory of Open Access Journals (Sweden)

    Javier Simón-Sánchez

    Full Text Available Parkinson's disease (PD occurs in both familial and sporadic forms, and both monogenic and complex genetic factors have been identified. Early onset PD (EOPD is particularly associated with autosomal recessive (AR mutations, and three genes, PARK2, PARK7 and PINK1, have been found to carry mutations leading to AR disease. Since mutations in these genes account for less than 10% of EOPD patients, we hypothesized that further recessive genetic factors are involved in this disorder, which may appear in extended runs of homozygosity.We carried out genome wide SNP genotyping to look for extended runs of homozygosity (ROHs in 1,445 EOPD cases and 6,987 controls. Logistic regression analyses showed an increased level of genomic homozygosity in EOPD cases compared to controls. These differences are larger for ROH of 9 Mb and above, where there is a more than three-fold increase in the proportion of cases carrying a ROH. These differences are not explained by occult recessive mutations at existing loci. Controlling for genome wide homozygosity in logistic regression analyses increased the differences between cases and controls, indicating that in EOPD cases ROHs do not simply relate to genome wide measures of inbreeding. Homozygosity at a locus on chromosome19p13.3 was identified as being more common in EOPD cases as compared to controls. Sequencing analysis of genes and predicted transcripts within this locus failed to identify a novel mutation causing EOPD in our cohort.There is an increased rate of genome wide homozygosity in EOPD, as measured by an increase in ROHs. These ROHs are a signature of inbreeding and do not necessarily harbour disease-causing genetic variants. Although there might be other regions of interest apart from chromosome 19p13.3, we lack the power to detect them with this analysis.

  18. DNA immunoprecipitation semiconductor sequencing (DIP-SC-seq) as a rapid method to generate genome wide epigenetic signatures.

    Science.gov (United States)

    Thomson, John P; Fawkes, Angie; Ottaviano, Raffaele; Hunter, Jennifer M; Shukla, Ruchi; Mjoseng, Heidi K; Clark, Richard; Coutts, Audrey; Murphy, Lee; Meehan, Richard R

    2015-05-14

    Modification of DNA resulting in 5-methylcytosine (5 mC) or 5-hydroxymethylcytosine (5hmC) has been shown to influence the local chromatin environment and affect transcription. Although recent advances in next generation sequencing technology allow researchers to map epigenetic modifications across the genome, such experiments are often time-consuming and cost prohibitive. Here we present a rapid and cost effective method of generating genome wide DNA modification maps utilising commercially available semiconductor based technology (DNA immunoprecipitation semiconductor sequencing; "DIP-SC-seq") on the Ion Proton sequencer. Focussing on the 5hmC mark we demonstrate, by directly comparing with alternative sequencing strategies, that this platform can successfully generate genome wide 5hmC patterns from as little as 500 ng of genomic DNA in less than 4 days. Such a method can therefore facilitate the rapid generation of multiple genome wide epigenetic datasets.

  19. Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates

    OpenAIRE

    Gautier, Mathieu

    2015-01-01

    In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (i...

  20. Brewing yeast genomes and genome-wide expression and proteome profiling during fermentation.

    Science.gov (United States)

    Smart, Katherine A

    2007-11-01

    The genome structure, ancestry and instability of the brewing yeast strains have received considerable attention. The hybrid nature of brewing lager yeast strains provides adaptive potential but yields genome instability which can adversely affect fermentation performance. The requirement to differentiate between production strains and assess master cultures for genomic instability has led to significant adoption of specialized molecular tool kits by the industry. Furthermore, the development of genome-wide transcriptional and protein expression technologies has generated significant interest from brewers. The opportunity presented to explore, and the concurrent requirement to understand both, the constraints and potential of their strains to generate existing and new products during fermentation is discussed.

  1. Genome-wide association studies and resting heart rate

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas

    2016-01-01

    Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...

  2. Genome-wide patterns of selection in 230 ancient Eurasians

    Science.gov (United States)

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  3. Genome wide copy number analysis of single cells

    Science.gov (United States)

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS), combines flow sorting of single nuclei based on DNA content, whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  4. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    Science.gov (United States)

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  5. Genome-wide mapping of the cohesin complex in the yeast Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    Earl F Glynn

    2004-09-01

    Full Text Available In eukaryotic cells, cohesin holds sister chromatids together until they separate into daughter cells during mitosis. We have used chromatin immunoprecipitation coupled with microarray analysis (ChIP chip to produce a genome-wide description of cohesin binding to meiotic and mitotic chromosomes of Saccharomyces cerevisiae. A computer program, PeakFinder, enables flexible, automated identification and annotation of cohesin binding peaks in ChIP chip data. Cohesin sites are highly conserved in meiosis and mitosis, suggesting that chromosomes share a common underlying structure during different developmental programs. These sites occur with a semiperiodic spacing of 11 kb that correlates with AT content. The number of sites correlates with chromosome size; however, binding to neighboring sites does not appear to be cooperative. We observed a very strong correlation between cohesin sites and regions between convergent transcription units. The apparent incompatibility between transcription and cohesin binding exists in both meiosis and mitosis. Further experiments reveal that transcript elongation into a cohesin-binding site removes cohesin. A negative correlation between cohesin sites and meiotic recombination sites suggests meiotic exchange is sensitive to the chromosome structure provided by cohesin. The genome-wide view of mitotic and meiotic cohesin binding provides an important framework for the exploration of cohesins and cohesion in other genomes.

  6. Genome-wide association study of relative telomere length.

    Science.gov (United States)

    Prescott, Jennifer; Kraft, Peter; Chasman, Daniel I; Savage, Sharon A; Mirabello, Lisa; Berndt, Sonja I; Weissfeld, Joel L; Han, Jiali; Hayes, Richard B; Chanock, Stephen J; Hunter, David J; De Vivo, Immaculata

    2011-05-10

    Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003) identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  7. Genome-wide association study of relative telomere length.

    Directory of Open Access Journals (Sweden)

    Jennifer Prescott

    Full Text Available Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003 identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  8. Genome-wide association studies in pediatric endocrinology.

    Science.gov (United States)

    Dauber, Andrew; Hirschhorn, Joel N

    2011-01-01

    Genome-wide association (GWA) studies are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWA studies in type 1 diabetes, stature, pubertal timing, obesity, and vitamin D deficiency. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of novel therapeutic agents.

  9. Statistical Approaches in Genome-Wide Association Studies

    OpenAIRE

    Yazdani, Akram

    2014-01-01

    Genome-wide association studies, GWAS, typically contain hundreds of thousands single nucleotide polymorphisms, SNPs, genotyped for few numbers of samples. The aim of these studies is to identify regions harboring SNPs or to predict the outcomes of interest. Since the number of predictors in the GWAS far exceeds the number of samples, it is impossible to analyze the data with classical statistical methods. In the current GWAS, the widely applied methods are based on single marker analysis th...

  10. Clinical implication of genome-wide profiling in diffuse large B-cell lymphoma and other subtypes of B-cell lymphoma

    DEFF Research Database (Denmark)

    Iqbal, Javeed; Joshi, Shantaram; Patel, Kavita N

    2007-01-01

    of Lymphoid Neoplasms (REAL) and World Health Organization (WHO) classifications. These classification methods were based on histological, immunophenotypic and cytogenetic markers and widely accepted by pathologists and oncologists worldwide. During last several decades, great progress has been made...... technology. The genome-wide transcriptional measurement, also called gene expression profile (GEP) can accurately define the biological phenotype of the tumor. In this review, important discoveries made by genome-wide GEP in understanding the biology of lymphoma and additionally the diagnostic and prognostic...

  11. Clinical implication of genome-wide profiling in diffuse large B-cell lymphoma and other subtypes of B-cell lymphoma

    DEFF Research Database (Denmark)

    Iqbal, Javeed; Joshi, Shantaram; Patel, Kavita N

    2007-01-01

    of Lymphoid Neoplasms (REAL) and World Health Organization (WHO) classifications. These classification methods were based on histological, immunophenotypic and cytogenetic markers and widely accepted by pathologists and oncologists worldwide. During last several decades, great progress has been made...... technology. The genome-wide transcriptional measurement, also called gene expression profile (GEP) can accurately define the biological phenotype of the tumor. In this review, important discoveries made by genome-wide GEP in understanding the biology of lymphoma and additionally the diagnostic and prognostic...

  12. Integrative genome-wide approaches in embryonic stem cell research.

    Science.gov (United States)

    Zhang, Xinyue; Huang, Jing

    2010-10-01

    Embryonic stem (ES) cells are derived from blastocysts. They can differentiate into the three embryonic germ layers and essentially any type of somatic cells. They therefore hold great potential in tissue regeneration therapy. The ethical issues associated with the use of human embryonic stem cells are resolved by the technical break-through of generating induced pluripotent stem (iPS) cells from various types of somatic cells. However, how ES and iPS cells self-renew and maintain their pluripotency is still largely unknown in spite of the great progress that has been made in the last two decades. Integrative genome-wide approaches, such as the gene expression microarray, chromatin immunoprecipitation based microarray (ChIP-chip) and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) offer unprecedented opportunities to elucidate the mechanism of the pluripotency, reprogramming and DNA damage response of ES and iPS cells. This frontier article summarizes the fundamental biological questions about ES and iPS cells and reviews the recent advances in ES and iPS cell research using genome-wide technologies. To this end, we offer our perspectives on the future of genome-wide studies on stem cells.

  13. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  14. Genome-wide Escherichia coli stress response and improved tolerance towards industrially relevant chemicals

    DEFF Research Database (Denmark)

    Rau, Martin Holm; Calero Valdayo, Patricia; Lennen, Rebecca

    2016-01-01

    of transcription changes within and between chemical groups, with functions such as energy metabolism, stress response, membrane modification, transporters and iron metabolism being affected. Regulon enrichment analysis identified key regulators likely mediating the transcriptional response, including CRP, Rpo...... approach was employed to understand the chemical stress response of Escherichia coli, including a genome-wide screen for mutants with increased fitness during chemical stress. Twelve chemicals with significant production potential were selected, consisting of organic solvent-like chemicals (butanol...... stress identified 294 enriched and 336 depleted mutants and experimental validation revealed up to 60 % increase in mutant growth rates. Mutants enriched in several conditions contained, among others, insertions in genes of the Mar-Sox-Rob regulon as well as transcription and translation related gene...

  15. Off-Target Effects of Psychoactive Drugs Revealed by Genome-Wide Assays in Yeast

    OpenAIRE

    2008-01-01

    To better understand off-target effects of widely prescribed psychoactive drugs, we performed a comprehensive series of chemogenomic screens using the budding yeast Saccharomyces cerevisiae as a model system. Because the known human targets of these drugs do not exist in yeast, we could employ the yeast gene deletion collections and parallel fitness profiling to explore potential off-target effects in a genome-wide manner. Among 214 tested, documented psychoactive drugs, we identified 81 comp...

  16. MicroTrout: A comprehensive, genome-wide miRNA target prediction framework for rainbow trout, Oncorhynchus mykiss.

    Science.gov (United States)

    Mennigen, Jan A; Zhang, Dapeng

    2016-12-01

    Rainbow trout represent an important teleost research model and aquaculture species. As such, rainbow trout are employed in diverse areas of biological research, including basic biological disciplines such as comparative physiology, toxicology, and, since rainbow trout have undergone both teleost- and salmonid-specific rounds of genome duplication, molecular evolution. In recent years, microRNAs (miRNAs, small non-protein coding RNAs) have emerged as important posttranscriptional regulators of gene expression in animals. Given the increasingly recognized importance of miRNAs as an additional layer in the regulation of gene expression and hence biological function, recent efforts using RNA- and genome sequencing approaches have resulted in the creation of several resources for the construction of a comprehensive repertoire of rainbow trout miRNAs and isomiRs (variant miRNA sequences that all appear to derive from the same gene but vary in sequence due to post-transcriptional processing). Importantly, through the recent publication of the rainbow trout genome (Berthelot et al., 2014), mRNA 3'UTR information has become available, allowing for the first time the genome-wide prediction of miRNA-target RNA relationships in this species. We here report the creation of the microtrout database, a comprehensive resource for rainbow trout miRNA and annotated 3'UTRs. The comprehensive database was used to implement an algorithm to predict genome-wide rainbow trout-specific miRNA-mRNA target relationships, generating an improved predictive framework over previously published approaches. This work will serve as a useful framework and sequence resource to experimentally address the role of miRNAs in several research areas using the rainbow trout model, examples of which are discussed. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Genome-wide identification of genes regulated by the Rcs phosphorelay system in Erwinia amylovora.

    Science.gov (United States)

    Wang, Dongping; Qi, Mingsheng; Calla, Bernarda; Korban, Schuyler S; Clough, Steven J; Cock, Peter J A; Sundin, George W; Toth, Ian; Zhao, Youfu

    2012-01-01

    The exopolysaccharide amylovoran is one of the major pathogenicity factors in Erwinia amylovora, the causal agent of fire blight of apples and pears. We have previously demonstrated that the RcsBCD phosphorelay system is essential for virulence by controlling amylovoran biosynthesis. We have also found that the hybrid sensor kinase RcsC differentially regulates amylovoran production in vitro and in vivo. To further understand how the Rcs system regulates E. amylovora virulence gene expression, we conducted genome-wide microarray analyses to determine the regulons of RcsB and RcsC in liquid medium and on immature pear fruit. Array analyses identified a total of 648 genes differentially regulated by RcsCB in vitro and in vivo. Consistent with our previous findings, RcsB acts as a positive regulator in both conditions, while RcsC positively controls expression of amylovoran biosynthetic genes in vivo but negatively controls expression in vitro. Besides amylovoran biosynthesis and regulatory genes, cell-wall and cell-envelope (membrane) as well as regulatory genes were identified as the major components of the RcsBC regulon, including many novel genes. We have also demonstrated that transcripts of rcsA, rcsC, and rcsD genes but not the rcsB gene were up-regulated when bacterial cells were grown in minimal medium or following infection of pear fruits compared with those grown in Luria Bertani medium. Furthermore, using the genome of E. amylovora ATCC 49946, a hidden Markov model predicted 60 genes with a candidate RcsB binding site in the intergenic region, 28 of which were identified in the microarray assay. Based on these findings as well as previous reported data, a working model has been proposed to illustrate how the Rcs phosphorelay system regulates virulence gene expression in E. amylovora.

  18. Genome-wide association study of colorectal cancer in Hispanics

    Science.gov (United States)

    Schmit, Stephanie L.; Schumacher, Fredrick R.; Edlund, Christopher K.; Conti, David V.; Ihenacho, Ugonna; Wan, Peggy; Van Den Berg, David; Casey, Graham; Fortini, Barbara K.; Lenz, Heinz-Josef; Tusié-Luna, Teresa; Aguilar-Salinas, Carlos A.; Moreno-Macías, Hortensia; Huerta-Chagoya, Alicia; Ordóñez-Sánchez, María Luisa; Rodríguez-Guillén, Rosario; Cruz-Bautista, Ivette; Rodríguez-Torres, Maribel; Muñóz-Hernández, Linda Liliana; Arellano-Campos, Olimpia; Gómez, Donají; Alvirde, Ulices; González-Villalpando, Clicerio; González-Villalpando, María Elena; Le Marchand, Loic; Haiman, Christopher A.; Figueiredo, Jane C.

    2016-01-01

    Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10−8. Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10−6) at 1p34.3 (rs7528276; Odds Ratio (OR) = 1.86 [95% confidence interval (CI): 1.47–2.36); P = 2.5×10−7], 2q23.3 (rs1367374; OR = 1.37 (95% CI: 1.21–1.55); P = 4.0×10−7), 14q24.2 (rs143046984; OR = 1.65 (95% CI: 1.36–2.01); P = 4.1×10−7) and 16q12.2 [rs142319636; OR = 1.69 (95% CI: 1.37–2.08); P=7.8×10−7]. Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10–5) and 11q12.2 (P = 6.8×10−5), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations. PMID:27207650

  19. Polygenic analysis of genome-wide SNP data identifies common variants on allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...

  20. Modeling transcriptional networks regulating secondary growth and wood formation in forest trees.

    Science.gov (United States)

    Liu, Lijun; Filkov, Vladimir; Groover, Andrew

    2014-06-01

    The complex interactions among the genes that underlie a biological process can be modeled and presented as a transcriptional network, in which genes (nodes) and their interactions (edges) are shown in a graphical form similar to a wiring diagram. A large number of genes have been identified that are expressed during the radial woody growth of tree stems (secondary growth), but a comprehensive understanding of how these genes interact to influence woody growth is currently lacking. Modeling transcriptional networks has recently been made tractable by next-generation sequencing-based technologies that can comprehensively catalog gene expression and transcription factor-binding genome-wide, but has not yet been extensively applied to undomesticated tree species or woody growth. Here we discuss basic features of transcriptional networks, approaches for modeling biological networks, and examples of biological network models developed for forest trees to date. We discuss how transcriptional network research is being developed in the model forest tree genus, Populus, and how this research area can be further developed and applied. Transcriptional network models for forest tree secondary growth and wood formation could ultimately provide new predictive models to accelerate hypothesis-driven research and develop new breeding applications.

  1. A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

    Science.gov (United States)

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368

  2. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    Science.gov (United States)

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  3. Genome-wide transcriptome analysis of gametophyte development in Physcomitrella patens

    Directory of Open Access Journals (Sweden)

    Xiao Lihong

    2011-12-01

    Full Text Available Abstract Background Regulation of gene expression plays a pivotal role in controlling the development of multicellular plants. To explore the molecular mechanism of plant developmental-stage transition and cell-fate determination, a genome-wide analysis was undertaken of sequential developmental time-points and individual tissue types in the model moss Physcomitrella patens because of the short life cycle and relative structural simplicity of this plant. Results Gene expression was analyzed by digital gene expression tag profiling of samples taken from P. patens protonema at 3, 14 and 24 days, and from leafy shoot tissues at 30 days, after protoplast isolation, and from 14-day-old caulonemal and chloronemal tissues. In total, 4333 genes were identified as differentially displayed. Among these genes, 4129 were developmental-stage specific and 423 were preferentially expressed in either chloronemal or caulonemal tissues. Most of the differentially displayed genes were assigned to functions in organic substance and energy metabolism or macromolecule biosynthetic and catabolic processes based on gene ontology descriptions. In addition, some regulatory genes identified as candidates might be involved in controlling the developmental-stage transition and cell differentiation, namely MYB-like, HB-8, AL3, zinc finger family proteins, bHLH superfamily, GATA superfamily, GATA and bZIP transcription factors, protein kinases, genes related to protein/amino acid methylation, and auxin, ethylene, and cytokinin signaling pathways. Conclusions These genes that show highly dynamic changes in expression during development in P. patens are potential targets for further functional characterization and evolutionary developmental biology studies.

  4. Genome-wide association analysis of forage quality in maize mature stalk.

    Science.gov (United States)

    Wang, Hongwu; Li, Kun; Hu, Xiaojiao; Liu, Zhifang; Wu, Yujin; Huang, Changling

    2016-10-21

    Plant digestibility of silage maize (Zea mays L.) has a large influence on nutrition intake for animal feeding. Improving forage quality will enhance the utilization efficiency and feeding value of forage maize. Dissecting the genetic basis of forage quality will improve our understanding of the complex nature of cell wall biosynthesis and degradation, which is also helpful for breeding good quality silage maize. Acid detergent fiber (ADF), neutral detergent fiber (NDF) and in vitro dry matter digestibility (IVDMD) of stalk were evaluated in a diverse maize population, which is comprised of 368 inbred lines and planted across seven environments. Using a mixed model accounting for population structure and polygenic background effects, a genome-wide association study was conducted to identify single nucleotide polymorphisms (SNPs) significantly associated with forage quality. Scanning 559,285 SNPs across the whole genome, 73, 41 and 82 SNPs were found to be associated with ADF, NDF, and IVDMD, respectively. Each significant SNP explained 4.2 %-6.2 % of the phenotypic variation. Underlying these associated loci, 56 genes were proposed as candidate genes for forage quality. Of all the candidate genes proposed by GWAS, we only found a C3H gene (ZmC3H2) that is directly involved in cell wall component biosynthesis. The candidate genes found in this study are mainly involved in signal transduction, stress resistance, and transcriptional regulation of cell wall biosynthetic gene expression. Adding high digestibility maize into the association panel would be helpful for increasing genetic variability and identifying more genes associated with forage quality traits. Cloning and functional validation of these genes would be helpful for understanding the molecular mechanism of the fiber content and digestibility. These findings provide us new insights into cell wall formation and deposition.

  5. Genome-wide analysis and molecular dissection of the SPL gene family in Salvia miltiorrhiza

    Institute of Scientific and Technical Information of China (English)

    Linsu Zhang; Bin Wu; Degang Zhao; Caili Li; Fenjuan Shao; Shanfa Lu

    2014-01-01

    SQUAMOSA promoter binding protein-likes (SPLs) are plant-specific transcription factors playing vital regulatory roles in plant growth and development. There is no information about SPLs in Salvia miltiorrhiza (Danshen), a significant medicinal plant widely used in Traditional Chinese medicine (TCM) for>1,700 years and an emerging model plant for TCM studies. Through genome-wide identification and subsequent molecular cloning, we identified a total 15 SmSPLs with divergent sequence features, gene structures, and motifs. Comparative analysis showed sequence conservation between SmSPLs and their Arabidopsis counterparts. A phylogenetic tree clusters SmSPLs into six groups. Many of the motifs identified commonly exist in a group/subgroup, implying their functional redundancy. Eight SmSPLs were predicted and experimental y validated to be targets of miR156/157. SmSPLs were differen-tial y expressed in various tissues of S. milltiorrhiza. The expression of miR156/157-targeted SmSPLs was increased with the maturation of S. miltiorrhiza, whereas the expression of miR156/157 was decreased, confirming the regulatory roles of miR156/157 in SmSPLs and suggesting the functions of SmSPLs in S. miltiorrhiza development. The expression of miR156/157 was negatively correlated with miR172 during the maturation of S. miltiorrhiza. The results indicate the significance and complexity of SmSPL-, miR156-, and miR172-mediated regula-tion of developmental timing in S. miltiorrhiza.

  6. Genome-wide Association Study of Obsessive-Compulsive Disorder

    Science.gov (United States)

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  7. Genome-wide landscapes of human local adaptation in Asia.

    Directory of Open Access Journals (Sweden)

    Wei Qian

    Full Text Available Genetic studies of human local adaptation have been facilitated greatly by recent advances in high-throughput genotyping and sequencing technologies. However, few studies have investigated local adaptation in Asian populations on a genome-wide scale and with a high geographic resolution. In this study, taking advantage of the dense population coverage in Southeast Asia, which is the part of the world least studied in term of natural selection, we depicted genome-wide landscapes of local adaptations in 63 Asian populations representing the majority of linguistic and ethnic groups in Asia. Using genome-wide data analysis, we discovered many genes showing signs of local adaptation or natural selection. Notable examples, such as FOXQ1, MAST2, and CDH4, were found to play a role in hair follicle development and human cancer, signal transduction, and tumor repression, respectively. These showed strong indications of natural selection in Philippine Negritos, a group of aboriginal hunter-gatherers living in the Philippines. MTTP, which has associations with metabolic syndrome, body mass index, and insulin regulation, showed a strong signature of selection in Southeast Asians, including Indonesians. Functional annotation analysis revealed that genes and genetic variants underlying natural selections were generally enriched in the functional category of alternative splicing. Specifically, many genes showing significant difference with respect to allele frequency between northern and southern Asian populations were found to be associated with human height and growth and various immune pathways. In summary, this study contributes to the overall understanding of human local adaptation in Asia and has identified both known and novel signatures of natural selection in the human genome.

  8. Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells.

    Science.gov (United States)

    de Dieuleveult, Maud; Yen, Kuangyu; Hmitou, Isabelle; Depaux, Arnaud; Boussouar, Fayçal; Bou Dargham, Daria; Jounier, Sylvie; Humbertclaude, Hélène; Ribierre, Florence; Baulard, Céline; Farrell, Nina P; Park, Bongsoo; Keime, Céline; Carrière, Lucie; Berlivet, Soizick; Gut, Marta; Gut, Ivo; Werner, Michel; Deleuze, Jean-François; Olaso, Robert; Aude, Jean-Christophe; Chantalat, Sophie; Pugh, B Franklin; Gérard, Matthieu

    2016-02-01

    ATP-dependent chromatin remodellers allow access to DNA for transcription factors and the general transcription machinery, but whether mammalian chromatin remodellers target specific nucleosomes to regulate transcription is unclear. Here we present genome-wide remodeller-nucleosome interaction profiles for the chromatin remodellers Chd1, Chd2, Chd4, Chd6, Chd8, Chd9, Brg1 and Ep400 in mouse embryonic stem (ES) cells. These remodellers bind one or both full nucleosomes that flank micrococcal nuclease (MNase)-defined nucleosome-free promoter regions (NFRs), where they separate divergent transcription. Surprisingly, large CpG-rich NFRs that extend downstream of annotated transcriptional start sites are nevertheless bound by non-nucleosomal or subnucleosomal histone variants (H3.3 and H2A.Z) and marked by H3K4me3 and H3K27ac modifications. RNA polymerase II therefore navigates hundreds of base pairs of altered chromatin in the sense direction before encountering an MNase-resistant nucleosome at the 3' end of the NFR. Transcriptome analysis after remodeller depletion reveals reciprocal mechanisms of transcriptional regulation by remodellers. Whereas at active genes individual remodellers have either positive or negative roles via altering nucleosome stability, at polycomb-enriched bivalent genes the same remodellers act in an opposite manner. These findings indicate that remodellers target specific nucleosomes at the edge of NFRs, where they regulate ES cell transcriptional programs.

  9. Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells

    Science.gov (United States)

    de Dieuleveult, Maud; Yen, Kuangyu; Hmitou, Isabelle; Depaux, Arnaud; Boussouar, Fayçal; Dargham, Daria Bou; Jounier, Sylvie; Humbertclaude, Hélène; Ribierre, Florence; Baulard, Céline; Farrell, Nina P.; Park, Bongsoo; Keime, Céline; Carrière, Lucie; Berlivet, Soizick; Gut, Marta; Gut, Ivo; Werner, Michel; Deleuze, Jean-François; Olaso, Robert; Aude, Jean-Christophe; Chantalat, Sophie; Pugh, B. Franklin; Gérard, Matthieu

    2015-01-01

    Summary ATP-dependent chromatin remodellers allow access to DNA for transcription factors and the general transcription machinery, but whether mammalian chromatin remodellers1–3 target specific nucleosomes to regulate transcription is unclear. Here, we present genome-wide remodeller-nucleosome interaction profiles for Chd1, Chd2, Chd4, Chd6, Chd8, Chd9, Brg1 and Ep400 in mouse embryonic stem (ES) cells. These remodellers bind one or both full nucleosomes that flank MNase-defined nucleosome-free promoter regions (NFRs), where they separate divergent transcription. Surprisingly, large CpG-rich NFRs that extend downstream of annotated transcriptional start sites (TSSs) are nevertheless chromatinized with non-nucleosomal or subnucleosomal histone variants (H3.3 and H2A.Z) and modifications (H3K4me3 and H3K27ac). RNA polymerase (pol) II therefore navigates hundreds of bp of altered chromatin in the sense direction before encountering an MNase-resistant nucleosome at the 3′ end of the NFR. Transcriptome analysis upon remodeller depletion reveals reciprocal mechanisms of transcriptional regulation by remodellers. Whereas at active genes individual remodellers play either positive or negative roles via altering nucleosome stability, at polycomb-enriched bivalent genes the same remodellers act in an opposite manner. These findings indicate that remodellers target specific nucleosomes at the edge of NFRs, where they regulate ES cell transcriptional programs. PMID:26814966

  10. The development of a genome wide SNP set for the Barnacle goose Branta leucopsis.

    Directory of Open Access Journals (Sweden)

    Rudy M Jonker

    Full Text Available Migratory birds are of particular interest for population genetics because of the high connectivity between habitats and populations. A high degree of connectivity requires using many genetic markers to achieve the required statistical power, and a genome wide SNP set can fit this purpose. Here we present the development of a genome wide SNP set for the Barnacle Goose Branta leucopsis, a model species for the study of bird migration. We used the genome of a different waterfowl species, Mallard Anas platyrhynchos, as a reference to align Barnacle Goose second generation sequence reads from an RRL library and detected 2188 SNPs genome wide. Furthermore, we used chimeric flanking sequences, merged from both Mallard and Barnacle Goose DNA sequence information, to create primers for validation by genotyping. Validation with a 384 SNP genotyping set resulted in 374 (97% successfully typed SNPs in the assay, of which 358 (96% were polymorphic. Additionally, we validated our SNPs on relatively old (30 years museum samples, which resulted in a success rate of at least 80%. This shows that museum samples could be used in standard SNP genotyping assays. Our study also shows that the genome of a related species can be used as reference to detect genome wide SNPs in birds, because genomes of birds are highly conserved. This is illustrated by the use of chimeric flanking sequences, which showed that the incorporation of flanking nucleotides from Mallard into Barnacle Goose sequences lead to equal genotyping performance when compared to flanking sequences solely composed of Barnacle Goose sequence.

  11. Genome-wide association studies and contribution to cardiovascular physiology.

    Science.gov (United States)

    Munroe, Patricia B; Tinker, Andrew

    2015-09-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology.

  12. [Genome-wide association study for adolescent idiopathic scoliosis].

    Science.gov (United States)

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  13. Genome-wide approaches to understanding human ageing

    Directory of Open Access Journals (Sweden)

    Kaeberlein Matt

    2006-06-01

    Full Text Available Abstract The use of genomic technologies in biogerontology has the potential to greatly enhance our understanding of human ageing. High-throughput screens for alleles correlated with survival in long-lived people have uncovered novel genes involved in age-associated disease. Genome-wide longevity studies in simple eukaryotes are identifying evolutionarily conserved pathways that determine longevity. It is hoped that validation of these 'public' aspects of ageing in mice, along with analyses of variation in candidate human ageing genes, will provide targets for future interventions to slow the ageing process and retard the onset of age-associated pathologies.

  14. A genome-wide analysis of gene-caffeine consumption interaction on basal cell carcinoma.

    Science.gov (United States)

    Li, Xin; Cornelis, Marilyn C; Liang, Liming; Song, Fengju; De Vivo, Immaculata; Giovannucci, Edward; Tang, Jean Y; Han, Jiali

    2016-12-01

    Animal models have suggested that oral or topical administration of caffeine could inhibit ultraviolet-induced carcinogenesis via the ataxia telangiectasia and rad3 (ATR)-related apoptosis. Previous epidemiological studies have demonstrated that increased caffeine consumption is associated with reduced risk of basal cell carcinoma (BCC). To identify common genetic markers that may modify this association, we tested gene-caffeine intake interaction on BCC risk in a genome-wide analysis. We included 3383 BCC cases and 8528 controls of European ancestry from the Nurses' Health Study and Health Professionals Follow-up Study. Single nucleotide polymorphism (SNP) rs142310826 near the NEIL3 gene showed a genome-wide significant interaction with caffeine consumption (P = 1.78 × 10(-8) for interaction) on BCC risk. There was no gender difference for this interaction (P = 0.64 for heterogeneity). NEIL3, a gene belonging to the base excision DNA repair pathway, encodes a DNA glycosylase that recognizes and removes lesions produced by oxidative stress. In addition, we identified several loci with P value for interaction caffeine consumption-related SNPs reported by previous genome-wide association studies and risk of BCC, both individually and jointly, but found no significant association. In sum, we identified a DNA repair gene that could be involved in caffeine-mediated skin tumor inhibition. Further studies are warranted to confirm these findings. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  15. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    Science.gov (United States)

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases.

  16. On the Analysis of a Repeated Measure Design in Genome-Wide Association Analysis

    Directory of Open Access Journals (Sweden)

    Young Lee

    2014-11-01

    Full Text Available Longitudinal data enables detecting the effect of aging/time, and as a repeated measures design is statistically more efficient compared to cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE. A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs.

  17. NSD1 mutations generate a genome-wide DNA methylation signature.

    LENUS (Irish Health Repository)

    Choufani, S

    2015-12-22

    Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth\\/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+\\/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+\\/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing.

  18. Bayesian inference based modelling for gene transcriptional dynamics by integrating multiple source of knowledge

    Directory of Open Access Journals (Sweden)

    Wang Shu-Qiang

    2012-07-01

    Full Text Available Abstract Background A key challenge in the post genome era is to identify genome-wide transcriptional regulatory networks, which specify the interactions between transcription factors and their target genes. Numerous methods have been developed for reconstructing gene regulatory networks from expression data. However, most of them are based on coarse grained qualitative models, and cannot provide a quantitative view of regulatory systems. Results A binding affinity based regulatory model is proposed to quantify the transcriptional regulatory network. Multiple quantities, including binding affinity and the activity level of transcription factor (TF are incorporated into a general learning model. The sequence features of the promoter and the possible occupancy of nucleosomes are exploited to estimate the binding probability of regulators. Comparing with the previous models that only employ microarray data, the proposed model can bridge the gap between the relative background frequency of the observed nucleotide and the gene's transcription rate. Conclusions We testify the proposed approach on two real-world microarray datasets. Experimental results show that the proposed model can effectively identify the parameters and the activity level of TF. Moreover, the kinetic parameters introduced in the proposed model can reveal more biological sense than previous models can do.

  19. A genome-wide association study of female sexual dysfunction.

    Directory of Open Access Journals (Sweden)

    Andrea Burri

    Full Text Available BACKGROUND: Female sexual dysfunction (FSD is an important but controversial problem with serious negative impact on women's quality of life. Data from twin studies have shown a genetic contribution to the development and maintenance of FSD. METHODOLOGY/PRINCIPAL FINDINGS: We performed a genome-wide association study (GWAS on 2.5 million single-nucleotide polymorphisms (SNPs in 1,104 female twins (25-81 years of age in a population-based register and phenotypic data on lifelong sexual functioning. Although none reached conventional genome-wide level of significance (10 × -8, we found strongly suggestive associations with the phenotypic dimension of arousal (rs13202860, P = 1.2 × 10(-7; rs1876525, P = 1.2 × 10(-7; and rs13209281 P = 8.3 × 10(-7 on chromosome 6, around 500 kb upstream of the locus HTR1E (5-hydroxytryptamine receptor 1E locus, related to the serotonin brain pathways. We could not replicate previously reported candidate SNPs associated with FSD in the DRD4, 5HT2A and IL-1B loci. CONCLUSIONS/SIGNIFICANCE: We report the first GWAS of FSD symptoms in humans. This has pointed to several "risk alleles" and the implication of the serotonin and GABA pathways. Ultimately, understanding key mechanisms via this research may lead to new FSD treatments and inform clinical practice and developments in psychiatric nosology.

  20. A genome-wide association study of aging.

    Science.gov (United States)

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  1. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Directory of Open Access Journals (Sweden)

    Varun Warrier

    Full Text Available Asperger Syndrome (AS is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC, which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448 were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448 lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  2. Genome-wide association studies of obesity and metabolic syndrome.

    Science.gov (United States)

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  3. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Science.gov (United States)

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  4. Planning and executing a genome wide association study (GWAS).

    Science.gov (United States)

    Sale, Michèle M; Mychaleckyj, Josyf C; Chen, Wei-Min

    2009-01-01

    In recent years, genome-wide association approaches have proven a powerful and successful strategy to identify genetic contributors to complex traits, including a number of endocrine disorders. Their success has meant that genome wide association studies (GWAS) are fast becoming the default study design for discovery of new genetic variants that influence a clinical trait or phenotype. This chapter focuses on a number of key elements that require consideration for the successful conduct of a GWAS. Although many of the considerations are common to any genetic study, the greater cost, extreme multiple testing, and greater openness to data sharing require specific awareness and planning by investigators. In the section on designing a GWAS, we reflect on ethical considerations, study design, selection of phenotype/s, power considerations, sample tracking and storage issues, and genotyping product selection. During execution, important considerations include DNA quantity and preparation, genotyping methods, quality control checks of genotype data, in silico genotyping (imputation), tests of association, and replication of association signals. Although the field of human genetics is rapidly evolving, recent experiences can help guide an investigator in making practical and methodological choices that will eventually determine the overall quality of GWAS results. Given the investment to recruit patient populations or cohorts that are powered for a GWAS, and the still substantial costs associated with genotyping, it is helpful to be aware of these aspects to maximize the likelihood of success, especially where there is an opportunity for implementing them prospectively.

  5. Genome-Wide Scan for Methylation Profiles in Keloids

    Directory of Open Access Journals (Sweden)

    Lamont R. Jones

    2015-01-01

    Full Text Available Keloids are benign fibroproliferative tumors of the skin which commonly occur after injury mainly in darker skinned patients. Medical treatment is fraught with high recurrence rates mainly because of an incomplete understanding of the biological mechanisms that lead to keloids. The purpose of this project was to examine keloid pathogenesis from the epigenome perspective of DNA methylation. Genome-wide profiling used the Infinium HumanMethylation450 BeadChip to interrogate DNA from 6 fresh keloid and 6 normal skin samples from 12 anonymous donors. A 3-tiered approach was used to call out genes most differentially methylated between keloid and normal. When compared to normal, of the 685 differentially methylated CpGs at Tier 3, 510 were hypomethylated and 175 were hypermethylated with 190 CpGs in promoter and 495 in nonpromoter regions. The 190 promoter region CpGs corresponded to 152 genes: 96 (63% were hypomethylated and 56 (37% hypermethylated. This exploratory genome-wide scan of the keloid methylome highlights a predominance of hypomethylated genomic landscapes, favoring nonpromoter regions. DNA methylation, as an additional mechanism for gene regulation in keloid pathogenesis, holds potential for novel treatments that reverse deleterious epigenetic changes. As an alternative mechanism for regulating genes, epigenetics may explain why gene mutations alone do not provide definitive mechanisms for keloid formation.

  6. Heat shock factors in tomatoes: genome-wide identification, phylogenetic analysis and expression profiling under development and heat stress

    Directory of Open Access Journals (Sweden)

    Xuedong Yang

    2016-05-01

    Full Text Available The HSF (heat shock factor gene family contains highly conserved plant-specific transcription factors that play an important role in plant high-temperature stress responses. The present study aimed to characterize the HSF transcription factor genes in tomato (Solanum lycopersicum, which is an important vegetable crop worldwide and the model plant for fruit development studies. Twenty-six SlyHSF genes were identified in tomato, and the phylogenetic analysis showed the possible evolution profile of subgroups among in the plant kingdom. A new group O was identified that involved HSF genes in primitive plant species, like in the green algae, mosses and lycophytes. The gene structure and motifs of each SlyHSF were comprehensively analyzed. We identified orthologous, co-orthologous and paralogous HSF gene pairs in tomato, Arabidopsis and rice, and constructed a complex interaction network among these genes. The SlyHSF genes were expressed differentially in different species and at a higher level in mature fruits. The qPCR analysis was performed and showed SlyHSF genes greatly participate in plant heat tolerant pathways. Our comprehensive genome-wide analysis provided insights into the HSF gene family of tomatoes.

  7. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    Science.gov (United States)

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.

  8. Some additional considerations in modelling the dynamic traits and genome-wide association studies. Comments on "Mapping complex traits as a dynamic system" by L. Sun and R. Wu

    Science.gov (United States)

    Das, Kiranmoy

    2015-06-01

    The revolution in the genetic research in our time is mainly due to (i) the successful completion of human genome project (2003) and its derivative hapmap project (2005), (ii) advanced statistical methodologies for analyzing ultrahigh dimensional data and (iii) the availability of statistical softwares (R, SAS etc.) to analyze large datasets. When complex traits are to be modeled as dynamic systems, the statistical issues regarding the complexity in the model, predictive power of the model, computational cost etc. are to be addressed adequately for powerful inferences. I will mention two additional considerations (statistical) which make dynamic models more meaningful and the results from GWAS more reliable.

  9. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma

    Science.gov (United States)

    Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.

    2015-01-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132

  10. Layers of epistasis: genome-wide regulatory networks and network approaches to genome-wide association studies

    Science.gov (United States)

    Cowper-Sal·lari, Richard; Cole, Michael D.; Karagas, Margaret R.; Lupien, Mathieu; Moore, Jason H.

    2010-01-01

    The conceptual foundation of the genome-wide association study (GWAS) has advanced unchecked since its conception. A revision might seem premature as the potential of GWAS has not been fully realized. Multiple technical and practical limitations need to be overcome before GWAS can be fairly criticized. But with the completion of hundreds of studies and a deeper understanding of the genetic architecture of disease, warnings are being raised. The results compiled to date indicate that risk-associated variants lie predominantly in non-coding regions of the genome. Additionally, alternative methodologies are uncovering large and heterogeneous sets of rare variants underlying disease. The fear is that, even in its fulfilment, the current GWAS paradigm might be incapable of dissecting all kinds of phenotypes. In the following text we review several initiatives that aim to overcome these limitations. The overarching theme of these studies is the inclusion of biological knowledge to both the analysis and interpretation of genotyping data. GWAS is uninformed of biology by design and although there is some virtue in its simplicity it is also its most conspicuous deficiency. We propose a framework in which to integrate these novel approaches, both empirical and theoretical, in the form of a genome-wide regulatory network (GWRN). By processing experimental data into networks, emerging data types based on chromatin-immunoprecipitation are made computationally tractable. This will give GWAS re-analysis efforts the most current and relevant substrates, and root them firmly on our knowledge of human disease. PMID:21197657

  11. Genome wide identification of aberrant alternative splicing events in myotonic dystrophy type 2.

    Science.gov (United States)

    Perfetti, Alessandra; Greco, Simona; Fasanaro, Pasquale; Bugiardini, Enrico; Cardani, Rosanna; Garcia-Manteiga, Jose M; Manteiga, Jose M Garcia; Riba, Michela; Cittaro, Davide; Stupka, Elia; Meola, Giovanni; Martelli, Fabio

    2014-01-01

    Myotonic dystrophy type 2 (DM2) is a genetic, autosomal dominant disease due to expansion of tetraplet (CCTG) repetitions in the first intron of the ZNF9/CNBP gene. DM2 is a multisystemic disorder affecting the skeletal muscle, the heart, the eye and the endocrine system. According to the proposed pathological mechanism, the expanded tetraplets have an RNA toxic effect, disrupting the splicing of many mRNAs. Thus, the identification of aberrantly spliced transcripts is instrumental for our understanding of the molecular mechanisms underpinning the disease. The aim of this study was the identification of new aberrant alternative splicing events in DM2 patients. By genome wide analysis of 10 DM2 patients and 10 controls (CTR), we identified 273 alternative spliced exons in 218 genes. While many aberrant splicing events were already identified in the past, most were new. A subset of these events was validated by qPCR assays in 19 DM2 and 15 CTR subjects. To gain insight into the molecular pathways involving the identified aberrantly spliced genes, we performed a bioinformatics analysis with Ingenuity system. This analysis indicated a deregulation of development, cell survival, metabolism, calcium signaling and contractility. In conclusion, our genome wide analysis provided a database of aberrant splicing events in the skeletal muscle of DM2 patients. The affected genes are involved in numerous pathways and networks important for muscle physio-pathology, suggesting that the identified variants may contribute to DM2 pathogenesis.

  12. Genome wide identification of aberrant alternative splicing events in myotonic dystrophy type 2.

    Directory of Open Access Journals (Sweden)

    Alessandra Perfetti

    Full Text Available Myotonic dystrophy type 2 (DM2 is a genetic, autosomal dominant disease due to expansion of tetraplet (CCTG repetitions in the first intron of the ZNF9/CNBP gene. DM2 is a multisystemic disorder affecting the skeletal muscle, the heart, the eye and the endocrine system. According to the proposed pathological mechanism, the expanded tetraplets have an RNA toxic effect, disrupting the splicing of many mRNAs. Thus, the identification of aberrantly spliced transcripts is instrumental for our understanding of the molecular mechanisms underpinning the disease. The aim of this study was the identification of new aberrant alternative splicing events in DM2 patients. By genome wide analysis of 10 DM2 patients and 10 controls (CTR, we identified 273 alternative spliced exons in 218 genes. While many aberrant splicing events were already identified in the past, most were new. A subset of these events was validated by qPCR assays in 19 DM2 and 15 CTR subjects. To gain insight into the molecular pathways involving the identified aberrantly spliced genes, we performed a bioinformatics analysis with Ingenuity system. This analysis indicated a deregulation of development, cell survival, metabolism, calcium signaling and contractility. In conclusion, our genome wide analysis provided a database of aberrant splicing events in the skeletal muscle of DM2 patients. The affected genes are involved in numerous pathways and networks important for muscle physio-pathology, suggesting that the identified variants may contribute to DM2 pathogenesis.

  13. Genome-wide translocation sequencing reveals mechanisms of chromosome breaks and rearrangements in B cells.

    Science.gov (United States)

    Chiarle, Roberto; Zhang, Yu; Frock, Richard L; Lewis, Susanna M; Molinie, Benoit; Ho, Yu-Jui; Myers, Darienne R; Choi, Vivian W; Compagno, Mara; Malkin, Daniel J; Neuberg, Donna; Monti, Stefano; Giallourakis, Cosmas C; Gostissa, Monica; Alt, Frederick W

    2011-09-30

    Whereas chromosomal translocations are common pathogenetic events in cancer, mechanisms that promote them are poorly understood. To elucidate translocation mechanisms in mammalian cells, we developed high-throughput, genome-wide translocation sequencing (HTGTS). We employed HTGTS to identify tens of thousands of independent translocation junctions involving fixed I-SceI meganuclease-generated DNA double-strand breaks (DSBs) within the c-myc oncogene or IgH locus of B lymphocytes induced for activation-induced cytidine deaminase (AID)-dependent IgH class switching. DSBs translocated widely across the genome but were preferentially targeted to transcribed chromosomal regions. Additionally, numerous AID-dependent and AID-independent hot spots were targeted, with the latter comprising mainly cryptic I-SceI targets. Comparison of translocation junctions with genome-wide nuclear run-ons revealed a marked association between transcription start sites and translocation targeting. The majority of translocation junctions were formed via end-joining with short microhomologies. Our findings have implications for diverse fields, including gene therapy and cancer genomics.

  14. Genome-wide analysis of alternative splicing during human heart development

    Science.gov (United States)

    Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping

    2016-01-01

    Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS, while its analysis in human foetal heart transition to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference of AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and deferential gene expression may play determinative roles in human heart development. PMID:27752099

  15. Genome-wide analysis of DNA methylation in Arabidopsis using MeDIP-chip.

    Science.gov (United States)

    Cortijo, Sandra; Wardenaar, René; Colomé-Tatché, Maria; Johannes, Frank; Colot, Vincent

    2014-01-01

    DNA methylation is an epigenetic mark that is essential for preserving genome integrity and normal development in plants and mammals. Although this modification may serve a variety of purposes, it is best known for its role in stable transcriptional silencing of transposable elements and epigenetic regulation of some genes. In addition, it is increasingly recognized that alterations in DNA methylation patterns can sometimes be inherited across multiple generations and thus are a source of heritable phenotypic variation that is independent of any DNA sequence changes. With the advent of genomics, it is now possible to analyze DNA methylation genome-wide with high precision, which is a prerequisite for understanding fully the various functions and phenotypic impact of this modification. Indeed, several so-called epigenomic mapping methods have been developed for the analysis of DNA methylation. Among these, immunoprecipitation of methylated DNA followed by hybridization to genome tiling arrays (MeDIP-chip) arguably offers a reasonable compromise between cost, ease of implementation, and sensitivity to date. Here we describe the application of this method, from DNA extraction to data analysis, to the study of DNA methylation genome-wide in Arabidopsis.

  16. Genome-wide analysis of alternative splicing during human heart development

    Science.gov (United States)

    Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping

    2016-10-01

    Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS, while its analysis in human foetal heart transition to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference of AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and deferential gene expression may play determinative roles in human heart development.

  17. Profiling the genome-wide DNA methylation pattern of porcine ovaries using reduced representation bisulfite sequencing.

    Science.gov (United States)

    Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi

    2016-02-25

    Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.

  18. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    Science.gov (United States)

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep.

  19. Genome-wide detection of genes targeted by non-Ig somatic hypermutation in lymphoma.

    Directory of Open Access Journals (Sweden)

    Yanwen Jiang

    Full Text Available The processes of somatic hypermutation (SHM and class switch recombination introduced by activation-induced cytosine deaminase (AICDA at the Immunoglobulin (Ig loci are key steps for creating a pool of diversified antibodies in germinal center B cells (GCBs. Unfortunately, AICDA can also accidentally introduce mutations at bystander loci, particularly within the 5' regulatory regions of proto-oncogenes relevant to diffuse large B cell lymphomas (DLBCL. Since current methods for genomewide sequencing such as Exon Capture and RNAseq only target mutations in coding regions, to date non-Ig promoter SHMs have been studied only in a handful genes. We designed a novel approach integrating bioinformatics tools with next generation sequencing technology to identify regulatory loci targeted by SHM genome-wide. We observed increased numbers of SHM associated sequence variant hotspots in lymphoma cells as compared to primary normal germinal center B cells. Many of these SHM hotspots map to genes that have not been reported before as mutated, including BACH2, BTG2, CXCR4, CIITA, EBF1, PIM2, and TCL1A, etc., all of which have potential roles in B cell survival, differentiation, and malignant transformation. In addition, using BCL6 and BACH2 as examples, we demonstrated that SHM sites identified in these 5' regulatory regions greatly altered their transcription activities in a reporter assay. Our approach provides a first cost-efficient, genome-wide method to identify regulatory mutations and non-Ig SHM hotspots.

  20. A genome-wide association study of optic disc parameters.

    Directory of Open Access Journals (Sweden)

    Wishal D Ramdas

    2010-06-01

    Full Text Available The optic nerve head is involved in many ophthalmic disorders, including common diseases such as myopia and open-angle glaucoma. Two of the most important parameters are the size of the optic disc area and the vertical cup-disc ratio (VCDR. Both are highly heritable but genetically largely undetermined. We performed a meta-analysis of genome-wide association (GWA data to identify genetic variants associated with optic disc area and VCDR. The gene discovery included 7,360 unrelated individuals from the population-based Rotterdam Study I and Rotterdam Study II cohorts. These cohorts revealed two genome-wide significant loci for optic disc area, rs1192415 on chromosome 1p22 (p = 6.72x10(-19 within 117 kb of the CDC7 gene and rs1900004 on chromosome 10q21.3-q22.1 (p = 2.67x10(-33 within 10 kb of the ATOH7 gene. They revealed two genome-wide significant loci for VCDR, rs1063192 on chromosome 9p21 (p = 6.15x10(-11 in the CDKN2B gene and rs10483727 on chromosome 14q22.3-q23 (p = 2.93x10(-10 within 40 kbp of the SIX1 gene. Findings were replicated in two independent Dutch cohorts (Rotterdam Study III and Erasmus Rucphen Family study; N = 3,612, and the TwinsUK cohort (N = 843. Meta-analysis with the replication cohorts confirmed the four loci and revealed a third locus at 16q12.1 associated with optic disc area, and four other loci at 11q13, 13q13, 17q23 (borderline significant, and 22q12.1 for VCDR. ATOH7 was also associated with VCDR independent of optic disc area. Three of the loci were marginally associated with open-angle glaucoma. The protein pathways in which the loci of optic disc area are involved overlap with those identified for VCDR, suggesting a common genetic origin.

  1. Genome-wide Pleiotropy Between Parkinson Disease and Autoimmune Diseases.

    Science.gov (United States)

    Witoelar, Aree; Jansen, Iris E; Wang, Yunpeng; Desikan, Rahul S; Gibbs, J Raphael; Blauwendraat, Cornelis; Thompson, Wesley K; Hernandez, Dena G; Djurovic, Srdjan; Schork, Andrew J; Bettella, Francesco; Ellinghaus, David; Franke, Andre; Lie, Benedicte A; McEvoy, Linda K; Karlsen, Tom H; Lesage, Suzanne; Morris, Huw R; Brice, Alexis; Wood, Nicholas W; Heutink, Peter; Hardy, John; Singleton, Andrew B; Dale, Anders M; Gasser, Thomas; Andreassen, Ole A; Sharma, Manu

    2017-07-01

    Recent genome-wide association studies (GWAS) and pathway analyses supported long-standing observations of an association between immune-mediated diseases and Parkinson disease (PD). The post-GWAS era provides an opportunity for cross-phenotype analyses between different complex phenotypes. To test the hypothesis that there are common genetic risk variants conveying risk of both PD and autoimmune diseases (ie, pleiotropy) and to identify new shared genetic variants and their pathways by applying a novel statistical framework in a genome-wide approach. Using the conjunction false discovery rate method, this study analyzed GWAS data from a selection of archetypal autoimmune diseases among 138 511 individuals of European ancestry and systemically investigated pleiotropy between PD and type 1 diabetes, Crohn disease, ulcerative colitis, rheumatoid arthritis, celiac disease, psoriasis, and multiple sclerosis. NeuroX data (6927 PD cases and 6108 controls) were used for replication. The study investigated the biological correlation between the top loci through protein-protein interaction and changes in the gene expression and methylation levels. The dates of the analysis were June 10, 2015, to March 4, 2017. The primary outcome was a list of novel loci and their pathways involved in PD and autoimmune diseases. Genome-wide conjunctional analysis identified 17 novel loci at false discovery rate less than 0.05 with overlap between PD and autoimmune diseases, including known PD loci adjacent to GAK, HLA-DRB5, LRRK2, and MAPT for rheumatoid arthritis, ulcerative colitis and Crohn disease. Replication confirmed the involvement of HLA, LRRK2, MAPT, TRIM10, and SETD1A in PD. Among the novel genes discovered, WNT3, KANSL1, CRHR1, BOLA2, and GUCY1A3 are within a protein-protein interaction network with known PD genes. A subset of novel loci was significantly associated with changes in methylation or expression levels of adjacent genes. The study findings provide novel mechanistic

  2. Genome-wide association study of antisocial personality disorder

    Science.gov (United States)

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  3. A unified model for yeast transcript definition.

    Science.gov (United States)

    de Boer, Carl G; van Bakel, Harm; Tsui, Kyle; Li, Joyce; Morris, Quaid D; Nislow, Corey; Greenblatt, Jack F; Hughes, Timothy R

    2014-01-01

    Identifying genes in the genomic context is central to a cell's ability to interpret the genome. Yet, in general, the signals used to define eukaryotic genes are poorly described. Here, we derived simple classifiers that identify where transcription will initiate and terminate using nucleic acid sequence features detectable by the yeast cell, which we integrate into a Unified Model (UM) that models transcription as a whole. The cis-elements that denote where transcription initiates function primarily through nucleosome depletion, and, using a synthetic promoter system, we show that most of these elements are sufficient to initiate transcription in vivo. Hrp1 binding sites are the major characteristic of terminators; these binding sites are often clustered in terminator regions and can terminate transcription bidirectionally. The UM predicts global transcript structure by modeling transcription of the genome using a hidden Markov model whose emissions are the outputs of the initiation and termination classifiers. We validated the novel predictions of the UM with available RNA-seq data and tested it further by directly comparing the transcript structure predicted by the model to the transcription generated by the cell for synthetic DNA segments of random design. We show that the UM identifies transcription start sites more accurately than the initiation classifier alone, indicating that the relative arrangement of promoter and terminator elements influences their function. Our model presents a concrete description of how the cell defines transcript units, explains the existence of nongenic transcripts, and provides insight into genome evolution.

  4. Genome-wide analysis of regions similar to promoters of histone genes

    KAUST Repository

    Chowdhary, Rajesh

    2010-05-28

    Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes.Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters.Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that

  5. A genome-wide analysis of small regulatory RNAs in the human pathogen group A Streptococcus.

    Directory of Open Access Journals (Sweden)

    Nataly Perez

    Full Text Available The coordinated regulation of gene expression is essential for pathogens to infect and cause disease. A recently appreciated mechanism of regulation is that afforded by small regulatory RNA (sRNA molecules. Here, we set out to assess the prevalence of sRNAs in the human bacterial pathogen group A Streptococcus (GAS. Genome-wide identification of candidate GAS sRNAs was performed through a tiling Affymetrix microarray approach and identified 40 candidate sRNAs within the M1T1 GAS strain MGAS2221. Together with a previous bioinformatic approach this brings the number of novel candidate sRNAs in GAS to 75, a number that approximates the number of GAS transcription factors. Transcripts were confirmed by Northern blot analysis for 16 of 32 candidate sRNAs tested, and the abundance of several of these sRNAs were shown to be temporally regulated. Six sRNAs were selected for further study and the promoter, transcriptional start site, and Rho-independent terminator identified for each. Significant variation was observed between the six sRNAs with respect to their stability during growth, and with respect to their inter- and/or intra-serotype-specific levels of abundance. To start to assess the contribution of sRNAs to gene regulation in M1T1 GAS we deleted the previously described sRNA PEL from four clinical isolates. Data from genome-wide expression microarray, quantitative RT-PCR, and Western blot analyses are consistent with PEL having no regulatory function in M1T1 GAS. The finding that candidate sRNA molecules are prevalent throughout the GAS genome provides significant impetus to the study of this fundamental gene-regulatory mechanism in an important human pathogen.

  6. Advances in the integration of transcriptional regulatory information into genome-scale metabolic models.

    Science.gov (United States)

    Vivek-Ananth, R P; Samal, Areejit

    2016-09-01

    A major goal of systems biology is to build predictive computational models of cellular metabolism. Availability of complete genome sequences and wealth of legacy biochemical information has led to the reconstruction of genome-scale metabolic networks in the last 15 years for several organisms across the three domains of life. Due to paucity of information on kinetic parameters associated with metabolic reactions, the constraint-based modelling approach, flux balance analysis (FBA), has proved to be a vital alternative to investigate the capabilities of reconstructed metabolic networks. In parallel, advent of high-throughput technologies has led to the generation of massive amounts of omics data on transcriptional regulation comprising mRNA transcript levels and genome-wide binding profile of transcriptional regulators. A frontier area in metabolic systems biology has been the development of methods to integrate the available transcriptional regulatory information into constraint-based models of reconstructed metabolic networks in order to increase the predictive capabilities of computational models and understand the regulation of cellular metabolism. Here, we review the existing methods to integrate transcriptional regulatory information into constraint-based models of metabolic networks.

  7. High-resolution genome-wide mapping of histone modifications.

    Science.gov (United States)

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  8. Genome-wide association studies in pharmacogenomics of antidepressants.

    Science.gov (United States)

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  9. Genome-wide association studies in pediatric chronic kidney disease.

    Science.gov (United States)

    Gupta, Jayanta; Kanetsky, Peter A; Wuttke, Matthias; Köttgen, Anna; Schaefer, Franz; Wong, Craig S

    2016-08-01

    The genome-wide association study (GWAS) has become an established scientific method that provides an unbiased screen for genetic loci potentially associated with phenotypes of clinical interest, such as chronic kidney disease (CKD). Thus, GWAS provides opportunities to gain new perspectives regarding the genetic architecture of CKD progression by identifying new candidate genes and targets for intervention. As such, it has become an important arm of translational science providing a complementary line of investigation to identify novel therapeutics to treat CKD. In this review, we describe the method and the challenges of performing GWAS in the pediatric CKD population. We also provide an overview of successful GWAS for kidney disease, and we discuss the established pediatric CKD cohorts in North America and Europe that are poised to identify genetic risk variants associated with CKD progression.

  10. Chapter 10: Mining genome-wide genetic markers.

    Directory of Open Access Journals (Sweden)

    Xiang Zhang

    Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.

  11. Genome-wide genetic changes during modern breeding of maize.

    Science.gov (United States)

    Jiao, Yinping; Zhao, Hainan; Ren, Longhui; Song, Weibin; Zeng, Biao; Guo, Jinjie; Wang, Baobao; Liu, Zhipeng; Chen, Jing; Li, Wei; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-06-03

    The success of modern maize breeding has been demonstrated by remarkable increases in productivity over the last four decades. However, the underlying genetic changes correlated with these gains remain largely unknown. We report here the sequencing of 278 temperate maize inbred lines from different stages of breeding history, including deep resequencing of 4 lines with known pedigree information. The results show that modern breeding has introduced highly dynamic genetic changes into the maize genome. Artificial selection has affected thousands of targets, including genes and non-genic regions, leading to a reduction in nucleotide diversity and an increase in the proportion of rare alleles. Genetic changes during breeding happen rapidly, with extensive variation (SNPs, indels and copy-number variants (CNVs)) occurring, even within identity-by-descent regions. Our genome-wide assessment of genetic changes during modern maize breeding provides new strategies as well as practical targets for future crop breeding and biotechnology.

  12. AID/APOBEC cytosine deaminase induces genome-wide kataegis

    Directory of Open Access Journals (Sweden)

    Lada Artem G

    2012-12-01

    Full Text Available Abstract Clusters of localized hypermutation in human breast cancer genomes, named “kataegis” (from the Greek for thunderstorm, are hypothesized to result from multiple cytosine deaminations catalyzed by AID/APOBEC proteins. However, a direct link between APOBECs and kataegis is still lacking. We have sequenced the genomes of yeast mutants induced in diploids by expression of the gene for PmCDA1, a hypermutagenic deaminase from sea lamprey. Analysis of the distribution of 5,138 induced mutations revealed localized clusters very similar to those found in tumors. Our data provide evidence that unleashed cytosine deaminase activity is an evolutionary conserved, prominent source of genome-wide kataegis events. Reviewers This article was reviewed by: Professor Sandor Pongor, Professor Shamil R. Sunyaev, and Dr Vladimir Kuznetsov.

  13. DNA Break Mapping Reveals Topoisomerase II Activity Genome-Wide

    Directory of Open Access Journals (Sweden)

    Laura Baranello

    2014-07-01

    Full Text Available Genomic DNA is under constant assault by endogenous and exogenous DNA damaging agents. DNA breakage can represent a major threat to genome integrity but can also be necessary for genome function. Here we present approaches to map DNA double-strand breaks (DSBs and single-strand breaks (SSBs at the genome-wide scale by two methods called DSB- and SSB-Seq, respectively. We tested these methods in human colon cancer cells and validated the results using the Topoisomerase II (Top2-poisoning agent etoposide (ETO. Our results show that the combination of ETO treatment with break-mapping techniques is a powerful method to elaborate the pattern of Top2 enzymatic activity across the genome.

  14. Genome-Wide Association Study of Coronary Artery Disease

    Directory of Open Access Journals (Sweden)

    Naomi Ogawa

    2010-01-01

    Full Text Available Coronary artery disease (CAD is a multifactorial disease with environmental and genetic determinants. The genetic determinants of CAD have previously been explored by the candidate gene approach. Recently, the data from the International HapMap Project and the development of dense genotyping chips have enabled us to perform genome-wide association studies (GWAS on a large number of subjects without bias towards any particular candidate genes. In 2007, three chip-based GWAS simultaneously revealed the significant association between common variants on chromosome 9p21 and CAD. This association was replicated among other ethnic groups and also in a meta-analysis. Further investigations have detected several other candidate loci associated with CAD. The chip-based GWAS approach has identified novel and unbiased genetic determinants of CAD and these insights provide the important direction to better understand the pathogenesis of CAD and to develop new and improved preventive measures and treatments for CAD.

  15. A comparison of multivariate genome-wide association methods

    DEFF Research Database (Denmark)

    Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M

    2014-01-01

    Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... correlation. We compared the power of the methods using empirically fixed significance thresholds (α = 0.05). Our results showed that the multivariate methods implemented in PLINK, SNPTEST, MultiPhen and BIMBAM performed best for the majority of the tested scenarios, with a notable increase in power...

  16. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    Science.gov (United States)

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  17. Genome-wide association and genomic selection in animal breeding.

    Science.gov (United States)

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  18. Genome-wide significant loci for addiction and anxiety

    Science.gov (United States)

    Hodgson, K.; Almasy, L.; Knowles, E.E.M.; Kent, J.W.; Curran, J.E.; Dyer, T.D.; Göring, H.H.H.; Olvera, R.L.; Fox, P.T.; Pearlson, G.D.; Krystal, J.H.; Duggirala, R.; Blangero, J.; Glahn, D.C.

    2017-01-01

    Background Psychiatric comorbidity is common among individuals with addictive disorders, with patients frequently suffering from anxiety disorders. While the genetic architecture of comorbid addictive and anxiety disorders remains unclear, elucidating the genes involved could provide important insights into the underlying etiology. Methods Here we examine a sample of 1284 Mexican-Americans from randomly selected extended pedigrees. Variance decomposition methods were used to examine the role of genetics in addiction phenotypes (lifetime history of alcohol dependence, drug dependence or chronic smoking) and various forms of clinically relevant anxiety. Genome-wide univariate and bivariate linkage scans were conducted to localize the chromosomal regions influencing these traits. Results Addiction phenotypes and anxiety were shown to be heritable and univariate genome-wide linkage scans revealed significant quantitative trait loci for drug dependence (14q13.2–q21.2, LOD = 3.322) and a broad anxiety phenotype (12q24.32–q24.33, LOD = 2.918). Significant positive genetic correlations were observed between anxiety and each of the addiction subtypes (ρg = 0.550–0.655) and further investigation with bivariate linkage analyses identified significant pleiotropic signals for alcohol dependence-anxiety (9q33.1–q33.2, LOD = 3.054) and drug dependence-anxiety (18p11.23–p11.22, LOD = 3.425). Conclusions This study confirms the shared genetic underpinnings of addiction and anxiety and identifies genomic loci involved in the etiology of these comorbid disorders. The linkage signal for anxiety on 12q24 spans the location of TMEM132D, an emerging gene of interest from previous GWAS of anxiety traits, whilst the bivariate linkage signal identified for anxiety-alcohol on 9q33 peak coincides with a region where rare CNVs have been associated with psychiatric disorders. Other signals identified implicate novel regions of the genome in addiction genetics. PMID:27318301

  19. A genome-wide methylation study on obesity

    Science.gov (United States)

    Xu, Xiaojing; Su, Shaoyong; Barnes, Vernon A.; De Miguel, Carmen; Pollock, Jennifer; Ownby, Dennis; Shi, Huidong; Zhu, Haidong; Snieder, Harold; Wang, Xiaoling

    2013-01-01

    Besides differential methylation, DNA methylation variation has recently been proposed and demonstrated to be a potential contributing factor to cancer risk. Here we aim to examine whether differential variability in methylation is also an important feature of obesity, a typical non-malignant common complex disease. We analyzed genome-wide methylation profiles of over 470,000 CpGs in peripheral blood samples from 48 obese and 48 lean African-American youth aged 14–20 y old. A substantial number of differentially variable CpG sites (DVCs), using statistics based on variances, as well as a substantial number of differentially methylated CpG sites (DMCs), using statistics based on means, were identified. Similar to the findings in cancers, DVCs generally exhibited an outlier structure and were more variable in cases than in controls. By randomly splitting the current sample into a discovery and validation set, we observed that both the DVCs and DMCs identified from the first set could independently predict obesity status in the second set. Furthermore, both the genes harboring DMCs and the genes harboring DVCs showed significant enrichment of genes identified by genome-wide association studies on obesity and related diseases, such as hypertension, dyslipidemia, type 2 diabetes and certain types of cancers, supporting their roles in the etiology and pathogenesis of obesity. We generalized the recent finding on methylation variability in cancer research to obesity and demonstrated that differential variability is also an important feature of obesity-related methylation changes. Future studies on the epigenetics of obesity will benefit from both statistics based on means and statistics based on variances. PMID:23644594

  20. Genome-wide association studies in asthma: progress and pitfalls

    Directory of Open Access Journals (Sweden)

    March ME

    2015-01-01

    Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers

  1. A genome-wide association study of anorexia nervosa

    Science.gov (United States)

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  2. A genome-wide association study of anorexia nervosa.

    Science.gov (United States)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.

  3. BioMet Toolbox: genome-wide analysis of metabolism

    DEFF Research Database (Denmark)

    Cvijovic, M.; Olivares Hernandez, Roberto; Agren, R.

    2010-01-01

    models. Systematic analysis of biological processes by means of modelling and simulations has made the identification of metabolic networks and prediction of metabolic capabilities under different conditions possible. For facilitating such systemic analysis, we have developed the BioMet Toolbox, a web......-based resource for stoichiometric analysis and for integration of transcriptome and interactome data, thereby exploiting the capabilities of genome-scale metabolic models. The BioMet Toolbox provides an effective user-friendly way to perform linear programming simulations towards maximized or minimized growth...... rates, substrate uptake rates and metabolic production rates by detecting relevant fluxes, simulate single and double gene deletions or detect metabolites around which major transcriptional changes are concentrated. These tools can be used for high-throughput in silico screening and allows fully...

  4. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Science.gov (United States)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  5. Biomarker-based classification of bacterial and fungal whole-blood infections in a genome-wide expression study

    Directory of Open Access Journals (Sweden)

    Andreas eDix

    2015-03-01

    Full Text Available Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97% for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83% for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances.

  6. Integration of least angle regression with empirical Bayes for multi-locus genome-wide association studies

    Science.gov (United States)

    Multi-locus genome-wide association studies has become the state-of-the-art procedure to identify quantitative trait loci (QTL) associated with traits simultaneously. However, implementation of multi-locus model is still difficult. In this study, we integrated least angle regression with empirical B...

  7. Genome wide study of maternal and parent-of-origin effects on the etiology of orofacial clefts

    DEFF Research Database (Denmark)

    Shi, Min; Murray, Jeff; Marazita, Mary L

    2012-01-01

    We performed a genome wide association analysis of maternally-mediated genetic effects and parent-of-origin (POO) effects on risk of orofacial clefting (OC) using over 2,000 case-parent triads collected through an international cleft consortium. We used log-linear regression models to test...

  8. Principal components analysis corrects for stratification in genome-wide association studies.

    Science.gov (United States)

    Price, Alkes L; Patterson, Nick J; Plenge, Robert M; Weinblatt, Michael E; Shadick, Nancy A; Reich, David

    2006-08-01

    Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.

  9. Clinical, polysomnographic and genome-wide association analyses of narcolepsy with cataplexy

    DEFF Research Database (Denmark)

    Luca, Gianina; Haba-Rubio, José; Dauvilliers, Yves

    2013-01-01

    diagnosed according to International Classification of Sleep Disorders-2. Demographic and clinical characteristics, polysomnography and multiple sleep latency test data, hypocretin-1 levels, and genome-wide genotypes were available. We found a significantly lower age at sleepiness onset (men versus women...... diagnostic delay were young age at diagnosis, cataplexy as the first symptom and higher frequency of cataplexy attacks. The mean multiple sleep latency negatively correlated with Epworth Sleepiness Scale (ESS) and with the number of sleep-onset rapid eye movement periods (SOREMPs), but none...... of the polysomnographic variables was associated with subjective or objective measures of sleepiness. Variant rs2859998 in UBXN2B gene showed a strong association (P = 1.28E-07) with the age at onset of excessive daytime sleepiness, and rs12425451 near the transcription factor TEAD4 (P = 1.97E-07) with the age at onset...

  10. Genome-wide differential gene expression in children exposed to air pollution in the Czech Republic

    DEFF Research Database (Denmark)

    van Leeuwen, D M; van Herwijnen, M H M; Pedersen, Marie

    2006-01-01

    The Teplice area in the Czech Republic is a mining district where elevated levels of air pollution including airborne carcinogens, have been demonstrated, especially during winter time. This environmental exposure can impact human health; in particular children may be more vulnerable. To study....... This suggests an effect of air pollution on the primary structural unit of the condensed DNA. In addition, several other pathways were modulated. Based on the results of this study, we suggest that transcriptomic analysis represents a promising biomarker for environmental carcinogenesis....... the impact of air pollution in children at the transcriptional level, peripheral blood cells were subjected to whole genome response analysis, in order to identify significantly modulated biological pathways and processes as a result of exposure. Using genome-wide oligonucleotide microarrays, we investigated...

  11. An Integrated Genome-Wide Systems Genetics Screen for Breast Cancer Metastasis Susceptibility Genes.

    Directory of Open Access Journals (Sweden)

    Ling Bai

    2016-04-01

    Full Text Available Metastasis remains the primary cause of patient morbidity and mortality in solid tumors and is due to the action of a large number of tumor-autonomous and non-autonomous factors. Here we report the results of a genome-wide integrated strategy to identify novel metastasis susceptibility candidate genes and molecular pathways in breast cancer metastasis. This analysis implicates a number of transcriptional regulators and suggests cell-mediated immunity is an important determinant. Moreover, the analysis identified novel or FDA-approved drugs as potentially useful for anti-metastatic therapy. Further explorations implementing this strategy may therefore provide a variety of information for clinical applications in the control and treatment of advanced neoplastic disease.

  12. Genome-wide gene expression changes in an industrial clavulanic acid overproduction strain of Streptomyces clavuligerus.

    Science.gov (United States)

    Medema, Marnix H; Alam, Mohammad T; Heijne, Wilbert H M; van den Berg, Marco A; Müller, Ulrike; Trefzer, Axel; Bovenberg, Roel A L; Breitling, Rainer; Takano, Eriko

    2011-03-01

    To increase production of the important pharmaceutical compound clavulanic acid, a β-lactamase inhibitor, both random mutagenesis approaches and rational engineering of Streptomyces clavuligerus strains have been extensively applied. Here, for the first time, we compared genome-wide gene expression of an industrial S. clavuligerus strain, obtained through iterative mutagenesis, with that of the wild-type strain. Intriguingly, we found that the majority of the changes contributed not to a complex rewiring of primary metabolism but consisted of a simple upregulation of various antibiotic biosynthesis gene clusters. A few additional transcriptional changes in primary metabolism at key points seem to divert metabolic fluxes to the biosynthetic precursors for clavulanic acid. In general, the observed changes largely coincide with genes that have been targeted by rational engineering in recent years, yet the presence of a number of previously unexplored genes clearly demonstrates that functional genomic analysis can provide new leads for strain improvement in biotechnology.

  13. Genome-wide analyses for dissecting gene regulatory networks in the shoot apical meristem.

    Science.gov (United States)

    Bustamante, Mariana; Matus, José Tomás; Riechmann, José Luis

    2016-03-01

    Shoot apical meristem activity is controlled by complex regulatory networks in which components such as transcription factors, miRNAs, small peptides, hormones, enzymes and epigenetic marks all participate. Many key genes that determine the inherent characteristics of the shoot apical meristem have been identified through genetic approaches. Recent advances in genome-wide studies generating extensive transcriptomic and DNA-binding datasets have increased our understanding of the interactions within the regulatory networks that control the activity of the meristem, identifying new regulators and uncovering connections between previously unlinked network components. In this review, we focus on recent studies that illustrate the contribution of whole genome analyses to understand meristem function. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  14. Traumatic Brain Injury Induces Genome-Wide Transcriptomic, Methylomic, and Network Perturbations in Brain and Blood Predicting Neurological Disorders

    Directory of Open Access Journals (Sweden)

    Qingying Meng

    2017-02-01

    Full Text Available The complexity of the traumatic brain injury (TBI pathology, particularly concussive injury, is a serious obstacle for diagnosis, treatment, and long-term prognosis. Here we utilize modern systems biology in a rodent model of concussive injury to gain a thorough view of the impact of TBI on fundamental aspects of gene regulation, which have the potential to drive or alter the course of the TBI pathology. TBI perturbed epigenomic programming, transcriptional activities (expression level and alternative splicing, and the organization of genes in networks centered around genes such as Anax2, Ogn, and Fmod. Transcriptomic signatures in the hippocampus are involved in neuronal signaling, metabolism, inflammation, and blood function, and they overlap with those in leukocytes from peripheral blood. The homology between genomic signatures from blood and brain elicited by TBI provides proof of concept information for development of biomarkers of TBI based on composite genomic patterns. By intersecting with human genome-wide association studies, many TBI signature genes and network regulators identified in our rodent model were causally associated with brain disorders with relevant link to TBI. The overall results show that concussive brain injury reprograms genes which could lead to predisposition to neurological and psychiatric disorders, and that genomic information from peripheral leukocytes has the potential to predict TBI pathogenesis in the brain.

  15. Traumatic Brain Injury Induces Genome-Wide Transcriptomic, Methylomic, and Network Perturbations in Brain and Blood Predicting Neurological Disorders.

    Science.gov (United States)

    Meng, Qingying; Zhuang, Yumei; Ying, Zhe; Agrawal, Rahul; Yang, Xia; Gomez-Pinilla, Fernando

    2017-02-01

    The complexity of the traumatic brain injury (TBI) pathology, particularly concussive injury, is a serious obstacle for diagnosis, treatment, and long-term prognosis. Here we utilize modern systems biology in a rodent model of concussive injury to gain a thorough view of the impact of TBI on fundamental aspects of gene regulation, which have the potential to drive or alter the course of the TBI pathology. TBI perturbed epigenomic programming, transcriptional activities (expression level and alternative splicing), and the organization of genes in networks centered around genes such as Anax2, Ogn, and Fmod. Transcriptomic signatures in the hippocampus are involved in neuronal signaling, metabolism, inflammation, and blood function, and they overlap with those in leukocytes from peripheral blood. The homology between genomic signatures from blood and brain elicited by TBI provides proof of concept information for development of biomarkers of TBI based on composite genomic patterns. By intersecting with human genome-wide association studies, many TBI signature genes and network regulators identified in our rodent model were causally associated with brain disorders with relevant link to TBI. The overall results show that concussive brain injury reprograms genes which could lead to predisposition to neurological and psychiatric disorders, and that genomic information from peripheral leukocytes has the potential to predict TBI pathogenesis in the brain.

  16. Genome-wide association analysis identifies multiple loci related to resting heart rate

    Science.gov (United States)

    Eijgelsheim, Mark; Newton-Cheh, Christopher; Sotoodehnia, Nona; de Bakker, Paul I.W.; Müller, Martina; Morrison, Alanna C.; Smith, Albert V.; Isaacs, Aaron; Sanna, Serena; Dörr, Marcus; Navarro, Pau; Fuchsberger, Christian; Nolte, Ilja M.; de Geus, Eco J.C.; Estrada, Karol; Hwang, Shih-Jen; Bis, Joshua C.; Rückert, Ina-Maria; Alonso, Alvaro; Launer, Lenore J.; Hottenga, Jouke Jan; Rivadeneira, Fernando; Noseworthy, Peter A.; Rice, Kenneth M.; Perz, Siegfried; Arking, Dan E.; Spector, Tim D.; Kors, Jan A.; Aulchenko, Yurii S.; Tarasov, Kirill V.; Homuth, Georg; Wild, Sarah H.; Marroni, Fabio; Gieger, Christian; Licht, Carmilla M.; Prineas, Ronald J.; Hofman, Albert; Rotter, Jerome I.; Hicks, Andrew A.; Ernst, Florian; Najjar, Samer S.; Wright, Alan F.; Peters, Annette; Fox, Ervin R.; Oostra, Ben A.; Kroemer, Heyo K.; Couper, David; Völzke, Henry; Campbell, Harry; Meitinger, Thomas; Uda, Manuela; Witteman, Jacqueline C.M.; Psaty, Bruce M.; Wichmann, H-Erich; Harris, Tamara B.; Kääb, Stefan; Siscovick, David S.; Jamshidi, Yalda; Uitterlinden, André G.; Folsom, Aaron R.; Larson, Martin G.; Wilson, James F.; Penninx, Brenda W.; Snieder, Harold; Pramstaller, Peter P.; van Duijn, Cornelia M.; Lakatta, Edward G.; Felix, Stephan B.; Gudnason, Vilmundur; Pfeufer, Arne; Heckbert, Susan R.; Stricker, Bruno H.Ch.; Boerwinkle, Eric; O'Donnell, Christopher J.

    2010-01-01

    Higher resting heart rate is associated with increased cardiovascular disease and mortality risk. Though heritable factors play a substantial role in population variation, little is known about specific genetic determinants. This knowledge can impact clinical care by identifying novel factors that influence pathologic heart rate states, modulate heart rate through cardiac structure and function or by improving our understanding of the physiology of heart rate regulation. To identify common genetic variants associated with heart rate, we performed a meta-analysis of 15 genome-wide association studies (GWAS), including 38 991 subjects of European ancestry, estimating the association between age-, sex- and body mass-adjusted RR interval (inverse heart rate) and ∼2.5 million markers. Results with P < 5 × 10−8 were considered genome-wide significant. We constructed regression models with multiple markers to assess whether results at less stringent thresholds were likely to be truly associated with RR interval. We identified six novel associations with resting heart rate at six loci: 6q22 near GJA1; 14q12 near MYH7; 12p12 near SOX5, c12orf67, BCAT1, LRMP and CASC1; 6q22 near SLC35F1, PLN and c6orf204; 7q22 near SLC12A9 and UfSp1; and 11q12 near FADS1. Associations at 6q22 400 kb away from GJA1, at 14q12 MYH6 and at 1q32 near CD34 identified in previously published GWAS were confirmed. In aggregate, these variants explain ∼0.7% of RR interval variance. A multivariant regression model including 20 variants with P < 10−5 increased the explained variance to 1.6%, suggesting that some loci falling short of genome-wide significance are likely truly associated. Future research is warranted to elucidate underlying mecha