Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and pCRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.
Chen, Gary K; Zheng, Tian; Witte, John S; Goode, Ellen L; Gao, Lei; Hu, Pingzhao; Suh, Young Ju; Suktitipat, Bhoom; Szymczak, Silke; Woo, Jung Hoon; Zhang, Wei
A number of issues arise when analyzing the large amount of data from high-throughput genotype and expression microarray experiments, including design and interpretation of genome-wide association studies of expression phenotypes. These issues were considered by contributions submitted to Group 1 of the Genetic Analysis Workshop 15 (GAW15), which focused on the association of quantitative expression data. These contributions evaluated diverse hypotheses, including those relevant to cancer and obesity research, and used various analytic techniques, many of which were derived from information theory. Several observations from these reports stand out. First, one needs to consider the genetic model of the trait of interest and carefully select which single nucleotide polymorphisms and individuals are included early in the design stage of a study. Second, by targeting specific pathways when analyzing genome-wide data, one can generate more interpretable results than agnostic approaches. Finally, for datasets with small sample sizes but a large number of features like the Genetic Analysis Workshop 15 dataset, machine learning approaches may be more practical than traditional parametric approaches. (c) 2007 Wiley-Liss, Inc.
Barbara E Stranger
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Zheutlin, Amanda B; Viehman, Rachael W; Fortgang, Rebecca; Borg, Jacqueline; Smith, Desmond J; Suvisaari, Jaana; Therman, Sebastian; Hultman, Christina M; Cannon, Tyrone D
We performed a whole-genome expression study to clarify the nature of the biological processes mediating between inherited genetic variations and cognitive dysfunction in schizophrenia. Gene expression was assayed from peripheral blood mononuclear cells using Illumina Human WG6 v3.0 chips in twins discordant for schizophrenia or bipolar disorder and control twins. After quality control, expression levels of 18,559 genes were screened for association with the California Verbal Learning Test (CVLT) performance, and any memory-related probes were then evaluated for variation by diagnostic status in the discovery sample (N = 190), and in an independent replication sample (N = 73). Heritability of gene expression using the twin design was also assessed. After Bonferroni correction (p schizophrenia patients, with comparable effect sizes in the same direction in the replication sample. For 41 of these 43 transcripts, expression levels were heritable. Nearly all identified genes contain common or de novo mutations associated with schizophrenia in prior studies. Genes increasing risk for schizophrenia appear to do so in part via effects on signaling cascades influencing memory. The genes implicated in these processes are enriched for those related to RNA processing and DNA replication and include genes influencing G-protein coupled signal transduction, cytokine signaling, and oligodendrocyte function. (c) 2015 APA, all rights reserved).
Viñuela Rodriguez, A.; Snoek, L.B.; Riksen, J.A.G.; Kammenga, J.E.
Gene expression becomes more variable with age, and it is widely assumed that this is due to a decrease in expression regulation. But currently there is no understanding how gene expression regulatory patterns progress with age. Here we explored genome-wide gene expression variation and regulatory
The plant pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters has comprehensively been researched in relation to transport of antifungal agents and resistant pathogens. In our study, analyses of the whole family of PDR genes present in the potato genome were provided. This analysis ...
Madsen, Claus Desler; Durhuus, Jon Ambæk; Rasmussen, Lene Juel
A hypothetical protein (HP) is defined as a protein that is predicted to be expressed from an open reading frame, but for which there is no experimental evidence of translation. HPs constitute a substantial fraction of proteomes of human as well as of other organisms. With the general belief that...... that the majority of HPs are the product of pseudogenes, it is essential to have a tool with the ability of pinpointing the minority of HPs with a high probability of being expressed....
Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata
Abstract Background In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eu...
Keurentjes, Joost J.B.; Fu, Jingyuan; Terpstra, Inez R.; Garcia, Juan M.; Ackerveken, Guido van den; Snoek, L. Basten; Peeters, Anton J.M.; Vreugdenhil, Dick; Koornneef, Maarten; Jansen, Ritsert C.
Accessions of a plant species can show considerable genetic differences that are analyzed effectively by using recombinant inbred line (RIL) populations. Here we describe the results of genome-wide expression variation analysis in an RIL population of Arabidopsis thaliana. For many genes, variation
Full Text Available Aquaporins (Aqps are integral membrane proteins that facilitate the transport of water and small solutes across cell membranes. Among vertebrate species, Aqps are highly conserved in both gene structure and amino acid sequence. These proteins are vital for maintaining water homeostasis in living organisms, especially for aquatic animals such as teleost fish. Studies on teleost Aqps are mainly limited to several model species with diploid genomes. Common carp, which has a tetraploidized genome, is one of the most common aquaculture species being adapted to a wide range of aquatic environments. The complete common carp genome has recently been released, providing us the possibility for gene evolution of aqp gene family after whole genome duplication.In this study, we identified a total of 37 aqp genes from common carp genome. Phylogenetic analysis revealed that most of aqps are highly conserved. Comparative analysis was performed across five typical vertebrate genomes. We found that almost all of the aqp genes in common carp were duplicated in the evolution of the gene family. We postulated that the expansion of the aqp gene family in common carp was the result of an additional whole genome duplication event and that the aqp gene family in other teleosts has been lost in their evolution history with the reason that the functions of genes are redundant and conservation. Expression patterns were assessed in various tissues, including brain, heart, spleen, liver, intestine, gill, muscle, and skin, which demonstrated the comprehensive expression profiles of aqp genes in the tetraploidized genome. Significant gene expression divergences have been observed, revealing substantial expression divergences or functional divergences in those duplicated aqp genes post the latest WGD event.To some extent, the gene families are also considered as a unique source for evolutionary studies. Moreover, the whole set of common carp aqp gene family provides an
Fries, Gabriel R; Dimitrov, Dimitre H; Lee, Shuko; Braida, Nicole; Yantis, Jesse; Honaker, Craig; Cuellar, Joe; Walss-Bass, Consuelo
This study aimed to test whether a dysregulation of gene expression may be the underlying cause of previously reported elevated levels of inflammatory cytokines in veterans with schizophrenia. We performed a genome-wide expression analysis in peripheral blood mononuclear cells from veterans with schizophrenia and controls, and our results show that 167 genes and putative loci were differently expressed between groups. These genes were enriched primarily for pathways related to inflammatory mechanisms and formed networks related to cell death and survival, immune cell trafficking, among others, which is in line with previous reports and further validates the inflammatory hypothesis of schizophrenia. Copyright © 2017 Elsevier B.V. All rights reserved.
Shivaraj, S M; Deshmukh, Rupesh K; Rai, Rhitu; Bélanger, Richard; Agrawal, Pawan K; Dash, Prasanta K
Membrane intrinsic proteins (MIPs) form transmembrane channels and facilitate transport of myriad substrates across the cell membrane in many organisms. Majority of plant MIPs have water transporting ability and are commonly referred as aquaporins (AQPs). In the present study, we identified aquaporin coding genes in flax by genome-wide analysis, their structure, function and expression pattern by pan-genome exploration. Cross-genera phylogenetic analysis with known aquaporins from rice, arabidopsis, and poplar showed five subgroups of flax aquaporins representing 16 plasma membrane intrinsic proteins (PIPs), 17 tonoplast intrinsic proteins (TIPs), 13 NOD26-like intrinsic proteins (NIPs), 2 small basic intrinsic proteins (SIPs), and 3 uncharacterized intrinsic proteins (XIPs). Amongst aquaporins, PIPs contained hydrophilic aromatic arginine (ar/R) selective filter but TIP, NIP, SIP and XIP subfamilies mostly contained hydrophobic ar/R selective filter. Analysis of RNA-seq and microarray data revealed high expression of PIPs in multiple tissues, low expression of NIPs, and seed specific expression of TIP3 in flax. Exploration of aquaporin homologs in three closely related Linum species bienne, grandiflorum and leonii revealed presence of 49, 39 and 19 AQPs, respectively. The genome-wide identification of aquaporins, first in flax, provides insight to elucidate their physiological and developmental roles in flax.
Full Text Available Abstract Background Cancer development is accompanied by genetic phenomena like deletion and amplification of chromosome parts or alterations of chromatin structure. It is expected that these mechanisms have a strong effect on regional gene expression. Results We investigated genome-wide gene expression in colorectal carcinoma (CRC and normal epithelial tissues from 25 patients using oligonucleotide arrays. This allowed us to identify 81 distinct chromosomal islands with aberrant gene expression. Of these, 38 islands show a gain in expression and 43 a loss of expression. In total, 7.892 genes (25.3% of all human genes are located in aberrantly expressed islands. Many chromosomal regions that are linked to hereditary colorectal cancer show deregulated expression. Also, many known tumor genes localize to chromosomal islands of misregulated expression in CRC. Conclusion An extensive comparison with published CGH data suggests that chromosomal regions known for frequent deletions in colon cancer tend to show reduced expression. In contrast, regions that are often amplified in colorectal tumors exhibit heterogeneous expression patterns: even show a decrease of mRNA expression. Because for several islands of deregulated expression chromosomal aberrations have never been observed, we speculate that additional mechanisms (like abnormal states of regional chromatin also have a substantial impact on the formation of co-expression islands in colorectal carcinoma.
Briones, M R S; Bosco, F
Gene expression "noise" is commonly defined as the stochastic variation of gene expression levels in different cells of the same population under identical growth conditions. Here, we tested whether this "noise" is amplified with time, as a consequence of decoherence in global gene expression profiles (genome-wide microarrays) of synchronized cells. The stochastic component of transcription causes fluctuations that tend to be amplified as time progresses, leading to a decay of correlations of expression profiles, in perfect analogy with elementary relaxation processes. Measuring decoherence, defined here as a decay in the auto-correlation function of yeast genome-wide expression profiles, we found a slowdown in the decay of correlations, opposite to what would be expected if, as in mixing systems, correlations decay exponentially as the equilibrium state is reached. Our results indicate that the populational variation in gene expression (noise) is a consequence of temporal decoherence, in which the slow decay of correlations is a signature of strong interdependence of the transcription dynamics of different genes.
Full Text Available The different environments that humans experience are likely to impact physiology and disease susceptibility. In order to estimate the magnitude of the impact of environment on transcript abundance, we examined gene expression in peripheral blood leukocyte samples from 46 desert nomadic, mountain agrarian and coastal urban Moroccan Amazigh individuals. Despite great expression heterogeneity in humans, as much as one third of the leukocyte transcriptome was found to be associated with differences among regions. Genome-wide polymorphism analysis indicates that genetic differentiation in the total sample is limited and is unlikely to explain the expression divergence. Methylation profiling of 1,505 CpG sites suggests limited contribution of methylation to the observed differences in gene expression. Genetic network analysis further implies that specific aspects of immune function are strongly affected by regional factors and may influence susceptibility to respiratory and inflammatory disease. Our results show a strong genome-wide gene expression signature of regional population differences that presumably include lifestyle, geography, and biotic factors, implying that these can play at least as great a role as genetic divergence in modulating gene expression variation in humans.
Freed, Emily F; Winkler, James D; Weiss, Sophie J; Garst, Andrew D; Mutalik, Vivek K; Arkin, Adam P; Knight, Rob; Gill, Ryan T
The reliable engineering of biological systems requires quantitative mapping of predictable and context-independent expression over a broad range of protein expression levels. However, current techniques for modifying expression levels are cumbersome and are not amenable to high-throughput approaches. Here we present major improvements to current techniques through the design and construction of E. coli genome-wide libraries using synthetic DNA cassettes that can tune expression over a ∼10(4) range. The cassettes also contain molecular barcodes that are optimized for next-generation sequencing, enabling rapid and quantitative tracking of alleles that have the highest fitness advantage. We show these libraries can be used to determine which genes and expression levels confer greater fitness to E. coli under different growth conditions.
Full Text Available Abstract Background In ovo electroporation is a widely used technique to study gene function in developmental biology. Despite the widespread acceptance of this technique, no genome-wide analysis of the effects of in ovo electroporation, principally the current applied across the tissue and exogenous vector DNA introduced, on endogenous gene expression has been undertaken. Here, the effects of electric current and expression of a GFP-containing construct, via electroporation into the midbrain of Hamburger-Hamilton stage 10 chicken embryos, are analysed by microarray. Results Both current alone and in combination with exogenous DNA expression have a small but reproducible effect on endogenous gene expression, changing the expression of the genes represented on the array by less than 0.1% (current and less than 0.5% (current + DNA, respectively. The subset of genes regulated by electric current and exogenous DNA span a disparate set of cellular functions. However, no genes involved in the regional identity were affected. In sharp contrast to this, electroporation of a known transcription factor, Dmrt5, caused a much greater change in gene expression. Conclusions These findings represent the first systematic genome-wide analysis of the effects of in ovo electroporation on gene expression during embryonic development. The analysis reveals that this process has minimal impact on the genetic basis of cell fate specification. Thus, the study demonstrates the validity of the in ovo electroporation technique to study gene function and expression during development. Furthermore, the data presented here can be used as a resource to refine the set of transcriptional responders in future in ovo electroporation studies of specific gene function.
Full Text Available Background. Previous studies, including ours, have examined the regulation of microRNAs (miRNAs by DNA methylation, but whether this regulation occurs at a genome-wide level in hepatocellular carcinoma (HCC is unclear. Subjects/Methods. Using a two-phase study design, we conducted genome-wide screening for DNA methylation and miRNA expression to explore the potential role of methylation alterations in miRNAs regulation. Results. We found that expressions of 25 miRNAs were statistically significantly different between tumor and nontumor tissues and perfectly differentiated HCC tumor from nontumor. Six miRNAs were overexpressed, and 19 were repressed in tumors. Among 133 miRNAs with inverse correlations between methylation and expression, 8 miRNAs (6% showed statistically significant differences in expression between tumor and nontumor tissues. Six miRNAs were validated in 56 additional paired HCC tissues, and significant inverse correlations were observed for miR-125b and miR-199a, which is consistent with the inactive chromatin pattern found in HepG2 cells. Conclusion. These data suggest that the expressions of miR-125b and miR-199a are dramatically regulated by DNA hypermethylation that plays a key role in hepatocarcinogenesis.
Diao, Wei-Ping; Snyder, John C; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge
The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed depending on WRKY domain' sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophtora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; Some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper.
Adomas Aleksandra B
Full Text Available Abstract Background Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. Results We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Conclusions Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and
Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui
MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Full Text Available The microarray dataset attached to this report is related to the research article with the title: “A genomic approach to susceptibility and pathogenesis leads to identifying potential novel therapeutic targets in androgenetic alopecia” (Dey-Rao and Sinha, 2017 . Male-pattern hair loss that is induced by androgens (testosterone in genetically predisposed individuals is known as androgenetic alopecia (AGA. The raw dataset is being made publicly available to enable critical and/or extended analyses. Our related research paper utilizes the attached raw dataset, for genome-wide gene-expression associated investigations. Combined with several in silico bioinformatics-based analyses we were able to delineate five strategic molecular elements as potential novel targets towards future AGA-therapy.
Full Text Available Paulownia tomentosa is a fast-growing tree species with multiple uses. It is grown worldwide, but is native to China, where it is widely cultivated in saline regions. We previously confirmed that autotetraploid P. tomentosa plants are more stress-tolerant than the diploid plants. However, the molecular mechanism underlying P. tomentosa salinity tolerance has not been fully characterized. Using the complete Paulownia fortunei genome as a reference, we applied next-generation RNA-sequencing technology to analyze the effects of salt stress on diploid and autotetraploid P. tomentosa plants. We generated 175 million clean reads and identified 15,873 differentially expressed genes (DEGs from four P. tomentosa libraries (two diploid and two autotetraploid. Functional annotations of the differentially expressed genes using the Gene Ontology and Kyoto Encyclopedia of Genes and Genomes databases revealed that plant hormone signal transduction and photosynthetic activities are vital for plant responses to high-salt conditions. We also identified several transcription factors, including members of the AP2/EREBP, bHLH, MYB, and NAC families. Quantitative real-time PCR analysis validated the expression patterns of eight differentially expressed genes. Our findings and the generated transcriptome data may help to accelerate the genetic improvement of cultivated P. tomentosa and other plant species for enhanced growth in saline soils.
Hakelien, Anne-Mari; Gaustad, Kristine G.; Taranger, Christel K.; Skalhegg, Bjorn S.; Kuentziger, Thomas; Collas, Philippe
We demonstrate a cell extract-based, genome-wide and heritable reprogramming of gene expression in vitro. Kidney epithelial 293T cells have previously been shown to take on T cell properties following a brief treatment with an extract of Jurkat T cells. We show here that 293T cells exposed for 1 h to a Jurkat cell extract undergo genome-wide, target cell-type-specific and long-lasting transcriptional changes. Microarray analyses indicate that on any given week after extract treatment, ∼2500 genes are upregulated >3-fold, of which ∼900 are also expressed in Jurkat cells. Concomitantly, ∼1500 genes are downregulated or repressed, of which ∼500 are also downregulated in Jurkat cells. Gene expression changes persist for over 30 passages (∼80 population doublings) in culture. Target cell-type specificity of these changes is shown by the lack of activation or repression of Jurkat-specific genes by extracts of 293T cells or carcinoma cells. Quantitative RT-PCR analysis confirms the long-term transcriptional activation of genes involved in key T cell functions. Additionally, growth of cells in suspended aggregates, expression of CD3 and CD28 T cell surface markers, and interleukin-2 secretion by 293T cells treated with extract of adult peripheral blood T cells illustrate a functional nuclear reprogramming. Therefore, target cell-type-specific and heritable changes in gene expression, and alterations in cell function, can be promoted by extracts derived from transformed cells as well as from adult primary cells
Full Text Available Efforts to unravel the mechanisms underlying taste sensation (gustation have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM procured fungiform (FG and circumvallate (CV taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology.
Khan Meraj A
Full Text Available Abstract Background In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from fertile women (n = 18 suffering from moderate (stage 3; n = 8 or severe (stage 4; n = 10 ovarian endometriosis during proliferative (n = 13 and secretory (n = 5 phases of menstrual cycle was performed. Methods Individual pure RNA samples were subjected to Agilent’s Whole Human Genome 44K microarray experiments. Microarray data were validated (P Results Higher clustering effect of pairing (cluster distance, cd = 0.1 in samples from same individuals on expressional arrays among eutopic and ectopic samples was observed as compared to that of clinical stages of severity (cd = 0.5 and phases of menstrual cycle (cd = 0.6. Post hoc analysis revealed anomaly in the expressional profiles of several genes associated with immunological, neuracrine and endocrine functions and gynecological cancers however with no overt oncogenic potential in endometriotic tissue. Dys-regulation of three (CLOCK, ESR1, and MYC major transcription factors appeared to be significant causative factors in the pathogenesis of ovarian endometriosis. A novel cohort of twenty-eight (28 genes representing potential marker for ovarian endometriosis in fertile women was discovered. Conclusions Dysfunctional expression of immuno-neuro-endocrine behaviour in endometrium appeared critical to endometriosis. Although no overt oncogenic potential was evident, several genes associated with gynecological cancers were
Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata
In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from fertile women (n=18) suffering from moderate (stage 3; n=8) or severe (stage 4; n=10) ovarian endometriosis during proliferative (n=13) and secretory (n=5) phases of menstrual cycle was performed. Individual pure RNA samples were subjected to Agilent's Whole Human Genome 44K microarray experiments. Microarray data were validated (Pcopy numbers by performing real time RT-PCR of seven (7) arbitrarily selected genes in all samples. The data obtained were subjected to differential expression (DE) and differential co-expression (DC) analyses followed by networks and enrichment analysis, and gene set enrichment analysis (GSEA). The reproducibility of prediction based on GSEA implementation of DC results was assessed by examining the relative expressions of twenty eight (28) selected genes in RNA samples obtained from fresh pool of eutopic and ectopic samples from confirmed ovarian endometriosis patients with stages 3 and 4 (n=4/each) during proliferative and secretory (n=4/each) phases. Higher clustering effect of pairing (cluster distance, cd=0.1) in samples from same individuals on expressional arrays among eutopic and ectopic samples was observed as compared to that of clinical stages of severity (cd=0.5) and phases of menstrual cycle (cd=0.6). Post hoc analysis revealed anomaly in the expressional profiles of several genes associated with immunological, neuracrine and endocrine functions and gynecological cancers however with no overt oncogenic
Full Text Available In order to assess whether gene expression variability could be influenced by several SNPs acting in cis, either through additive or more complex haplotype effects, a systematic genome-wide search for cis haplotype expression quantitative trait loci (eQTL was conducted in a sample of 758 individuals, part of the Cardiogenics Transcriptomic Study, for which genome-wide monocyte expression and GWAS data were available. 19,805 RNA probes were assessed for cis haplotypic regulation through investigation of ~2,1 × 10(9 haplotypic combinations. 2,650 probes demonstrated haplotypic p-values >10(4-fold smaller than the best single SNP p-value. Replication of significant haplotype effects were tested for 412 probes for which SNPs (or proxies that defined the detected haplotypes were available in the Gutenberg Health Study composed of 1,374 individuals. At the Bonferroni correction level of 1.2 × 10(-4 (~0.05/412, 193 haplotypic signals replicated. 1000 G imputation was then conducted, and 105 haplotypic signals still remained more informative than imputed SNPs. In-depth analysis of these 105 cis eQTL revealed that at 76 loci genetic associations were compatible with additive effects of several SNPs, while for the 29 remaining regions data could be compatible with a more complex haplotypic pattern. As 24 of the 105 cis eQTL have previously been reported to be disease-associated loci, this work highlights the need for conducting haplotype-based and 1000 G imputed cis eQTL analysis before commencing functional studies at disease-associated loci.
Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750
Wang, Jinglu; Qu, Susu; Wang, Weixiao; Guo, Liyuan; Zhang, Kunlin; Chang, Suhua; Wang, Jing
Numbers of gene expression profiling studies of bipolar disorder have been published. Besides different array chips and tissues, variety of the data processes in different cohorts aggravated the inconsistency of results of these genome-wide gene expression profiling studies. By searching the gene expression databases, we obtained six data sets for prefrontal cortex (PFC) of bipolar disorder with raw data and combinable platforms. We used standardized pre-processing and quality control procedures to analyze each data set separately and then combined them into a large gene expression matrix with 101 bipolar disorder subjects and 106 controls. A standard linear mixed-effects model was used to calculate the differentially expressed genes (DEGs). Multiple levels of sensitivity analyses and cross validation with genetic data were conducted. Functional and network analyses were carried out on basis of the DEGs. In the result, we identified 198 unique differentially expressed genes in the PFC of bipolar disorder and control. Among them, 115 DEGs were robust to at least three leave-one-out tests or different pre-processing methods; 51 DEGs were validated with genetic association signals. Pathway enrichment analysis showed these DEGs were related with regulation of neurological system, cell death and apoptosis, and several basic binding processes. Protein-protein interaction network further identified one key hub gene. We have contributed the most comprehensive integrated analysis of bipolar disorder expression profiling studies in PFC to date. The DEGs, especially those with multiple validations, may denote a common signature of bipolar disorder and contribute to the pathogenesis of disease. Copyright © 2016 Elsevier Ltd. All rights reserved.
Celik Altunoglu, Yasemin; Baloglu, Mehmet Cengiz; Baloglu, Pinar; Yer, Esra Nurten; Kara, Sibel
Late embryogenesis abundant (LEA) proteins are large and diverse group of polypeptides which were first identified during seed dehydration and then in vegetative plant tissues during different stress responses. Now, gene family members of LEA proteins have been detected in various organisms. However, there is no report for this protein family in watermelon and melon until this study. A total of 73 LEA genes from watermelon ( ClLEA ) and 61 LEA genes from melon ( CmLEA ) were identified in this comprehensive study. They were classified into four and three distinct clusters in watermelon and melon, respectively. There was a correlation between gene structure and motif composition among each LEA groups. Segmental duplication played an important role for LEA gene expansion in watermelon. Maximum gene ontology of LEA genes was observed with poplar LEA genes. For evaluation of tissue specific expression patterns of ClLEA and CmLEA genes, publicly available RNA-seq data were analyzed. The expression analysis of selected LEA genes in root and leaf tissues of drought-stressed watermelon and melon were examined using qRT-PCR. Among them, ClLEA - 12 - 17 - 46 genes were quickly induced after drought application. Therefore, they might be considered as early response genes for water limitation conditions in watermelon. In addition, CmLEA - 42 - 43 genes were found to be up-regulated in both tissues of melon under drought stress. Our results can open up new frontiers about understanding of functions of these important family members under normal developmental stages and stress conditions by bioinformatics and transcriptomic approaches.
Seol, Min-A; Kim, Dae-Ghon; Chu, In-Sun; Lee, Mi-Jin; Yu, Goung-Ran; Cui, Xiang-Dan; Cho, Baik-Hwan; Ahn, Eun-Kyung; Leem, Sun-Hee; Kim, In-Hee
The molecular mechanisms of CC (cholangiocarcinoma) oncogenesis and progression are poorly understood. This study aimed to determine the genome-wide expression of genes related to CC oncogenesis and sarcomatous transdifferentiation. Genes that were differentially expressed between CC cell lines or tissues and cultured normal biliary epithelial (NBE) cells were identified using DNA microarray technology. Expressions were validated in human CC tissues and cells. Using unsupervised hierarchical clustering analysis of the cell line and tissue samples, we identified a set of 342 commonly regulated (>2-fold change) genes. Of these, 53, including tumor-related genes, were upregulated, and 289, including tumor suppressor genes, were downregulated (<0.5 fold change). Expression of SPP1, EFNB2, E2F2, IRX3, PTTG1, PPARγ, KRT17, UCHL1, IGFBP7 and SPARC proteins was immunohistochemically verified in human and hamster CC tissues. Additional unsupervised hierarchical clustering analysis of sarcomatoid CC cells compared to three adenocarcinomatous CC cell lines revealed 292 differentially upregulated genes (>4-fold change), and 267 differentially downregulated genes (<0.25 fold change). The expression of 12 proteins was validated in the CC cell lines by immunoblot analysis and immunohistochemical staining. Of the proteins analyzed, we found upregulation of the expression of the epithelial-mesenchymal transition (EMT)-related proteins VIM and TWIST1, and restoration of the methylation-silenced proteins LDHB, BNIP3, UCHL1, and NPTX2 during sarcomatoid transdifferentiation of CC. The deregulation of oncogenes, tumor suppressor genes, and methylation-related genes may be useful in identifying molecular targets for CC diagnosis and prognosis
Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang
Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occured in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.
Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the
Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more number of introns in the middle as compared with C- and N-terminal regions of the predicted genes. Intronless MYB-genes are highly conserved both in rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of 1kb upstream region (5′ to translation initiation codon) of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several members of MYBs are up-regulated by various abiotic stresses both in rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and their expression analysis
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.
Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…
Full Text Available SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development.
Full Text Available Abstract In spite of only a 1-2 per cent genomic DNA sequence difference, humans and chimpanzees differ considerably in behaviour and cognition. Affymetrix microarray technology provides a novel approach to addressing a long-term debate on whether the difference between humans and chimpanzees results from the alteration of gene expressions. Here, we used several statistical methods (distance method, two-sample t-tests, regularised t-tests, ANOVA and bootstrapping to detect the differential expression pattern between humans and great apes. Our analysis shows that the pattern we observed before is robust against various statistical methods; that is, the pronounced expression changes occurred on the human lineage after the split from chimpanzees, and that the dramatic brain expression alterations in humans may be mainly driven by a set of genes with increased expression (up-regulated rather than decreased expression (down-regulated.
Full Text Available Abstract Background HIP1 Protein Interactor (HIPPI is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS, present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. Results We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Conclusions Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a
van Leeuwen, D M; van Herwijnen, M H M; Pedersen, Marie
The Teplice area in the Czech Republic is a mining district where elevated levels of air pollution including airborne carcinogens, have been demonstrated, especially during winter time. This environmental exposure can impact human health; in particular children may be more vulnerable. To study....... This suggests an effect of air pollution on the primary structural unit of the condensed DNA. In addition, several other pathways were modulated. Based on the results of this study, we suggest that transcriptomic analysis represents a promising biomarker for environmental carcinogenesis....... the impact of air pollution in children at the transcriptional level, peripheral blood cells were subjected to whole genome response analysis, in order to identify significantly modulated biological pathways and processes as a result of exposure. Using genome-wide oligonucleotide microarrays, we investigated...
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute
in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported......Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly...
Nilsson, Emil K; Boström, Adrian E; Mwinyi, Jessica; Schiöth, Helgi B
Despite an established link between sleep deprivation and epigenetic processes in humans, it remains unclear to what extent sleep deprivation modulates DNA methylation. We performed a within-subject randomized blinded study with 16 healthy subjects to examine the effect of one night of total sleep deprivation (TSD) on the genome-wide methylation profile in blood compared with that in normal sleep. Genome-wide differences in methylation between both conditions were assessed by applying a paired regression model that corrected for monocyte subpopulations. In addition, the correlations between the methylation of genes detected to be modulated by TSD and gene expression were examined in a separate, publicly available cohort of 10 healthy male donors (E-GEOD-49065). Sleep deprivation significantly affected the DNA methylation profile both independently and in dependency of shifts in monocyte composition. Our study detected differential methylation of 269 probes. Notably, one CpG site was located 69 bp upstream of ING5, which has been shown to be differentially expressed after sleep deprivation. Gene set enrichment analysis detected the Notch and Wnt signaling pathways to be enriched among the differentially methylated genes. These results provide evidence that total acute sleep deprivation alters the methylation profile in healthy human subjects. This is, to our knowledge, the first study that systematically investigated the impact of total acute sleep deprivation on genome-wide DNA methylation profiles in blood and related the epigenomic findings to the expression data.
Full Text Available Abstract Background Renal cell carcinoma (RCC is characterized by a number of diverse molecular aberrations that differ among individuals. Recent approaches to molecularly classify RCC were based on clinical, pathological as well as on single molecular parameters. As a consequence, gene expression patterns reflecting the sum of genetic aberrations in individual tumors may not have been recognized. In an attempt to uncover such molecular features in RCC, we used a novel, unbiased and integrative approach. Methods We integrated gene expression data from 97 primary RCC of different pathologic parameters, 15 RCC metastases as well as 34 cancer cell lines for two-way nonsupervised hierarchical clustering using gene groups suggested by the PANTHER Classification System. We depicted the genomic landscape of the resulted tumor groups by means of Single Nuclear Polymorphism (SNP technology. Finally, the achieved results were immunohistochemically analyzed using a tissue microarray (TMA composed of 254 RCC. Results We found robust, genome wide expression signatures, which split RCC into three distinct molecular subgroups. These groups remained stable even if randomly selected gene sets were clustered. Notably, the pattern obtained from RCC cell lines was clearly distinguishable from that of primary tumors. SNP array analysis demonstrated differing frequencies of chromosomal copy number alterations among RCC subgroups. TMA analysis with group-specific markers showed a prognostic significance of the different groups. Conclusion We propose the existence of characteristic and histologically independent genome-wide expression outputs in RCC with potential biological and clinical relevance.
Full Text Available Pallister Killian syndrome (OMIM: # 601803 is a rare multisystem disorder typically caused by tissue limited mosaic tetrasomy of chromosome 12p (isochromosome 12p. The clinical manifestations of Pallister Killian syndrome are variable with the most common findings including craniofacial dysmorphia, hypotonia, cognitive impairment, hearing loss, skin pigmentary differences and epilepsy. Isochromosome 12p is identified primarily in skin fibroblast cultures and in chorionic villus and amniotic fluid cell samples and may be identified in blood lymphocytes during the neonatal and early childhood period. We performed genomic expression profiling correlated with interphase fluorescent in situ hybridization and single nucleotide polymorphism array quantification of degree of mosaicism in fibroblasts from 17 Caucasian probands with Pallister Killian syndrome and 9 healthy age, gender and ethnicity matched controls. We identified a characteristic profile of 354 (180 up- and 174 down-regulated differentially expressed genes in Pallister Killian syndrome probands and supportive evidence for a Pallister Killian syndrome critical region on 12p13.31. The differentially expressed genes were enriched for developmentally important genes such as homeobox genes. Among the differentially expressed genes, we identified several genes whose misexpression may be associated with the clinical phenotype of Pallister Killian syndrome such as downregulation of ZFPM2, GATA6 and SOX9, and overexpression of IGFBP2.
Full Text Available Cassava is an important food and potential biofuel crop that is tolerant to multiple abiotic stressors. The mechanisms underlying these tolerances are currently less known. CBL-interacting protein kinases (CIPKs have been shown to play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to abiotic stress. However, no data is currently available about the CPK family in cassava. In this study, a total of 25 CIPK genes were identified from cassava genome based on our previous genome sequencing data. Phylogenetic analysis suggested that 25 MeCIPKs could be classified into four subfamilies, which was supported by exon-intron organizations and the architectures of conserved protein motifs. Transcriptomic analysis of a wild subspecies and two cultivated varieties showed that most MeCIPKs had different expression patterns between wild subspecies and cultivatars in different tissues or in response to drought stress. Some orthologous genes involved in CIPK interaction networks were identified between Arabidopsis and cassava. The interaction networks and co-expression patterns of these orthologous genes revealed that the crucial pathways controlled by CIPK networks may be involved in the differential response to drought stress in different accessions of cassava. Nine MeCIPK genes were selected to investigate their transcriptional response to various stimuli and the results showed the comprehensive response of the tested MeCIPK genes to osmotic, salt, cold, oxidative stressors, and ABA signaling. The identification and expression analysis of CIPK family suggested that CIPK genes are important components of development and multiple signal transduction pathways in cassava. The findings of this study will help lay a foundation for the functional characterization of the CIPK gene family and provide an improved understanding of abiotic stress responses and signaling transduction in cassava.
Full Text Available The quality of tissue samples and extracted mRNA is a major source of variability in tumor transcriptome analysis using genome-wide expression microarrays. During and immediately after surgical tumor resection, tissues are exposed to metabolic, biochemical and physical stresses characterized as "warm ischemia". Current practice advocates cryopreservation of biosamples within 30 minutes of resection, but this recommendation has not been systematically validated by measurements of mRNA decay over time. Using Illumina HumanHT-12 v3 Expression BeadChips, providing a genome-wide coverage of over 24,000 genes, we have analyzed gene expression variation in samples of 3 hepatocellular carcinomas (HCC and 3 lung carcinomas (LC cryopreserved at times up to 2 hours after resection. RNA Integrity Numbers (RIN revealed no significant deterioration of mRNA up to 2 hours after resection. Genome-wide transcriptome analysis detected non-significant gene expression variations of -3.5%/hr (95% CI: -7.0%/hr to 0.1%/hr; p = 0.054. In LC, no consistent gene expression pattern was detected in relation with warm ischemia. In HCC, a signature of 6 up-regulated genes (CYP2E1, IGLL1, CABYR, CLDN2, NQO1, SCL13A5 and 6 down-regulated genes (MT1G, MT1H, MT1E, MT1F, HABP2, SPINK1 was identified (FDR <0.05. Overall, our observations support current recommendation of time to cryopreservation of up to 30 minutes and emphasize the need for identifying tissue-specific genes deregulated following resection to avoid misinterpreting expression changes induced by warm ischemia as pathologically significant changes.
Full Text Available Modifications to histones, including acetylation and methylation processes, play crucial roles in the regulation of gene expression in plant development as well as in stress responses. However, limited information on the enzymes catalyzing histone acetylation and methylation in non-model plants is currently available. In this study, several histone modifier (HM types, including six histone acetyltransferases (HATs, 11 histone deacetylases (HDACs, 48 histone methyltransferases (HMTs, and 22 histone demethylases (HDMs, are identified in litchi (Litchi chinensis Sonn. cv. Feizixiao based on similarities in their sequences to homologs in Arabidopsis (A. thaliana, tomato (Solanum lycopersicum, and rice (Oryza sativa. Phylogenetic analyses reveal that HM enzymes can be grouped into four HAT, two HDAC, two HMT, and two HDM subfamilies, respectively, while further expression profile analyses demonstrate that 17 HMs were significantly altered during fruit abscission in two field treatments. Analyses reveal that these genes exhibit four distinct patterns of expression in response to fruit abscission, while an in vitro assay was used to confirm the HDAC activity of LcHDA2, LcHDA6, and LcSRT2. Our findings are the first in-depth analysis of HMs in the litchi genome, and imply that some are likely to play important roles in fruit abscission in this commercially important plant.
Gu, Yan-bing; Ji, Zhi-rui; Chi, Fu-mei; Qiao, Zhuang; Xu, Cheng-nan; Zhang, Jun-xiang; Zhou, Zong-shan; Dong, Qing-long
The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.
Zhang, Ningbo; Li, Ruimin; Shen, Wei; Jiao, Shuzhen; Zhang, Junxiang; Xu, Weirong
The major latex protein/ripening-related protein (MLP/RRP) subfamily is known to be involved in a wide range of biological processes of plant development and various stress responses. However, the biological function of MLP/RRP proteins is still far from being clear and identification of them may provide important clues for understanding their roles. Here, we report a genome-wide evolutionary characterization and gene expression analysis of the MLP family in European Vitis species. A total of 14 members, was found in the grape genome, all of which are located on chromosome 1, where are predominantly arranged in tandem clusters. We have noticed, most surprisingly, promoter-sharing by several non-identical but highly similar gene members to a greater extent than expected by chance. Synteny analysis between the grape and Arabidopsis thaliana genomes suggested that 3 grape MLP genes arose before the divergence of the two species. Phylogenetic analysis provided further insights into the evolutionary relationship between the genes, as well as their putative functions, and tissue-specific expression analysis suggested distinct biological roles for different members. Our expression data suggested a couple of candidate genes involved in abiotic stresses and phytohormone responses. The present work provides new insight into the evolution and regulation of Vitis MLP genes, which represent targets for future studies and inclusion in tolerance-related molecular breeding programs.
Full Text Available The WRKY family, a large family of transcription factors (TFs found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta. In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing 3 exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.
Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian
The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava.
Reuscher, Stefan; Akiyama, Masahito; Mori, Chiharu; Aoki, Koh; Shibata, Daisuke; Shiratake, Katsuhiro
The family of aquaporins, also called water channels or major intrinsic proteins, is characterized by six transmembrane domains that together facilitate the transport of water and a variety of low molecular weight solutes. They are found in all domains of life, but show their highest diversity in plants. Numerous studies identified aquaporins as important targets for improving plant performance under drought stress. The phylogeny of aquaporins is well established based on model species like Arabidopsis thaliana, which can be used as a template to investigate aquaporins in other species. In this study we comprehensively identified aquaporin encoding genes in tomato (Solanum lycopersicum), which is an important vegetable crop and also serves as a model for fleshy fruit development. We found 47 aquaporin genes in the tomato genome and analyzed their structural features. Based on a phylogenetic analysis of the deduced amino acid sequences the aquaporin genes were assigned to five subfamilies (PIPs, TIPs, NIPs, SIPs and XIPs) and their substrate specificity was assessed on the basis of key amino acid residues. As ESTs were available for 32 genes, expression of these genes was analyzed in 13 different tissues and developmental stages of tomato. We detected tissue-specific and development-specific expression of tomato aquaporin genes, which is a first step towards revealing the contribution of aquaporins to water and solute transport in leaves and during fruit development.
Full Text Available The family of aquaporins, also called water channels or major intrinsic proteins, is characterized by six transmembrane domains that together facilitate the transport of water and a variety of low molecular weight solutes. They are found in all domains of life, but show their highest diversity in plants. Numerous studies identified aquaporins as important targets for improving plant performance under drought stress. The phylogeny of aquaporins is well established based on model species like Arabidopsis thaliana, which can be used as a template to investigate aquaporins in other species. In this study we comprehensively identified aquaporin encoding genes in tomato (Solanum lycopersicum, which is an important vegetable crop and also serves as a model for fleshy fruit development. We found 47 aquaporin genes in the tomato genome and analyzed their structural features. Based on a phylogenetic analysis of the deduced amino acid sequences the aquaporin genes were assigned to five subfamilies (PIPs, TIPs, NIPs, SIPs and XIPs and their substrate specificity was assessed on the basis of key amino acid residues. As ESTs were available for 32 genes, expression of these genes was analyzed in 13 different tissues and developmental stages of tomato. We detected tissue-specific and development-specific expression of tomato aquaporin genes, which is a first step towards revealing the contribution of aquaporins to water and solute transport in leaves and during fruit development.
Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping
The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .
He, Qiuling; Jones, Don C.; Li, Wei; Xie, Fuliang; Ma, Jun; Sun, Runrun; Wang, Qinglian; Zhu, Shuijin; Zhang, Baohong
The R2R3-MYB is one of the largest families of transcription factors, which have been implicated in multiple biological processes. There is great diversity in the number of R2R3-MYB genes in different plants. However, there is no report on genome-wide characterization of this gene family in cotton. In the present study, a total of 205 putative R2R3-MYB genes were identified in cotton D genome (Gossypium raimondii), that are much larger than that found in other cash crops with fully sequenced genomes. These GrMYBs were classified into 13 groups with the R2R3-MYB genes from Arabidopsis and rice. The amino acid motifs and phylogenetic tree were predicted and analyzed. The sequences of GrMYBs were distributed across 13 chromosomes at various densities. The results showed that the expansion of the G. Raimondii R2R3-MYB family was mainly attributable to whole genome duplication and segmental duplication. Moreover, the expression pattern of 52 selected GrMYBs and 46 GaMYBs were tested in roots and leaves under different abiotic stress conditions. The results revealed that the MYB genes in cotton were differentially expressed under salt and drought stress treatment. Our results will be useful for determining the precise role of the MYB genes during stress responses with crop improvement. PMID:27009386
Jue, Dengwei; Sang, Xuelian; Lu, Shengqiao; Dong, Chen; Zhao, Qiufang; Chen, Hongliang; Jia, Liqiang
Background Ubiquitination is a post-translation modification where ubiquitin is attached to a substrate. Ubiquitin-conjugating enzymes (E2s) play a major role in the ubiquitin transfer pathway, as well as a variety of functions in plant biological processes. To date, no genome-wide characterization of this gene family has been conducted in maize (Zea mays). Methodology/Principal Findings In the present study, a total of 75 putative ZmUBC genes have been identified and located in the maize genome. Phylogenetic analysis revealed that ZmUBC proteins could be divided into 15 subfamilies, which include 13 ubiquitin-conjugating enzymes (ZmE2s) and two independent ubiquitin-conjugating enzyme variant (UEV) groups. The predicted ZmUBC genes were distributed across 10 chromosomes at different densities. In addition, analysis of exon-intron junctions and sequence motifs in each candidate gene has revealed high levels of conservation within and between phylogenetic groups. Tissue expression analysis indicated that most ZmUBC genes were expressed in at least one of the tissues, indicating that these are involved in various physiological and developmental processes in maize. Moreover, expression profile analyses of ZmUBC genes under different stress treatments (4°C, 20% PEG6000, and 200 mM NaCl) and various expression patterns indicated that these may play crucial roles in the response of plants to stress. Conclusions Genome-wide identification, chromosome organization, gene structure, evolutionary and expression analyses of ZmUBC genes have facilitated in the characterization of this gene family, as well as determined its potential involvement in growth, development, and stress responses. This study provides valuable information for better understanding the classification and putative functions of the UBC-encoding genes of maize. PMID:26606743
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions
Full Text Available WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. olearecea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to various stress tolerance. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to the three multiple stresses simultaneously, which suggests that these 3 WRKY may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related
Lin, Hailan; Xia, Xiaofeng; Yu, Liying; Vasseur, Liette; Gurr, Geoff M; Yao, Fengluan; Yang, Guang; You, Minsheng
Serine proteases (SPs) are crucial proteolytic enzymes responsible for digestion and other processes including signal transduction and immune responses in insects. Serine protease homologs (SPHs) lack catalytic activity but are involved in innate immunity. This study presents a genome-wide investigation of SPs and SPHs in the diamondback moth, Plutella xylostella (L.), a globally-distributed destructive pest of cruciferous crops. A total of 120 putative SPs and 101 putative SPHs were identified in the P. xylostella genome by bioinformatics analysis. Based on the features of trypsin, 38 SPs were putatively designated as trypsin genes. The distribution, transcription orientation, exon-intron structure and sequence alignments suggested that the majority of trypsin genes evolved from tandem duplications. Among the 221 SP/SPH genes, ten SP and three SPH genes with one or more clip domains were predicted and designated as PxCLIPs. Phylogenetic analysis of CLIPs in P. xylostella, two other Lepidoptera species (Bombyx mori and Manduca sexta), and two more distantly related insects (Drosophila melanogaster and Apis mellifera) showed that seven of the 13 PxCLIPs were clustered with homologs of the Lepidoptera rather than other species. Expression profiling of the P. xylostella SP and SPH genes in different developmental stages and tissues showed diverse expression patterns, suggesting high functional diversity with roles in digestion and development. This is the first genome-wide investigation on the SP and SPH genes in P. xylostella. The characterized features and profiled expression patterns of the P. xylostella SPs and SPHs suggest their involvement in digestion, development and immunity of this species. Our findings provide a foundation for further research on the functions of this gene family in P. xylostella, and a better understanding of its capacity to rapidly adapt to a wide range of environmental variables including host plants and insecticides.
Carroll, Ronan K; Weiss, Andy; Broach, William H; Wiemels, Richard E; Mogen, Austin B; Rice, Kelly C; Shaw, Lindsey N
In Staphylococcus aureus, hundreds of small regulatory or small RNAs (sRNAs) have been identified, yet this class of molecule remains poorly understood and severely understudied. sRNA genes are typically absent from genome annotation files, and as a consequence, their existence is often overlooked, particularly in global transcriptomic studies. To facilitate improved detection and analysis of sRNAs in S. aureus, we generated updated GenBank files for three commonly used S. aureus strains (MRSA252, NCTC 8325, and USA300), in which we added annotations for >260 previously identified sRNAs. These files, the first to include genome-wide annotation of sRNAs in S. aureus, were then used as a foundation to identify novel sRNAs in the community-associated methicillin-resistant strain USA300. This analysis led to the discovery of 39 previously unidentified sRNAs. Investigating the genomic loci of the newly identified sRNAs revealed a surprising degree of inconsistency in genome annotation in S. aureus, which may be hindering the analysis and functional exploration of these elements. Finally, using our newly created annotation files as a reference, we perform a global analysis of sRNA gene expression in S. aureus and demonstrate that the newly identified tsr25 is the most highly upregulated sRNA in human serum. This study provides an invaluable resource to the S. aureus research community in the form of our newly generated annotation files, while at the same time presenting the first examination of differential sRNA expression in pathophysiologically relevant conditions. Despite a large number of studies identifying regulatory or small RNA (sRNA) genes in Staphylococcus aureus, their annotation is notably lacking in available genome files. In addition to this, there has been a considerable lack of cross-referencing in the wealth of studies identifying these elements, often leading to the same sRNA being identified multiple times and bearing multiple names. In this work
Salazar, Margarita Pena; Vongsangnak, Wanwipa; Panagiotou, Gianni
Glycerol is catabolized by a wide range of microorganisms including Aspergillus species. To identify the transcriptional regulation of glycerol metabolism in Aspergillus, we analyzed data from triplicate batch fermentations of three different Aspergilli (Aspergillus nidulans, Aspergillus oryzae...... and Aspergillus niger) with glucose and glycerol as carbon sources. Protein comparisons and cross-analysis with gene expression data of all three species resulted in the identification of 88 genes having a conserved response across the three Aspergilli. A promoter analysis of the up-regulated genes led...... to the identification of a conserved binding site for a putative regulator to be 5′-TGCGGGGA-3′, a binding site that is similar to the binding site for Adr1 in yeast and humans. We show that this Adr1 consensus binding sequence was over-represented on promoter regions of several genes in A. nidulans, A. oryzae and A...
Sonuga-Barke, E.; Lasky-Su, J.; Neale, B.; Oades, R.D.; Chen, W.; Franke, B.; Buitelaar, J.K.; Banaschewski, T.; Ebstein, R.; Gill, M.; Anney, R.J.; Miranda, A.; Mulas, F.; Roeyers, H.; Rothenberger, A.
Studies of gene x environment (G x E) interaction in ADHD have previously focused on known risk genes for ADHD and environmentally mediated biological risk. Here we use G x E analysis in the context of a genome-wide association scan to identify novel genes whose effects on ADHD symptoms and comorbid conduct disorder are moderated by high maternal expressed emotion (EE). SNPs (600,000) were genotyped in 958 ADHD proband-parent trios. After applying data cleaning procedures we examined 429,981 ...
Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe
Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies?CYP93A?K, with t...
Full Text Available Management of men with prostate cancer is fraught with uncertainty as physicians and patients balance efficacy with potential toxicity and diminished quality of life. Utilization of genomics as a prognostic biomarker has improved the informed decision-making process by enabling more rationale treatment choices. Recently investigations have begun to determine whether genomic information from tumor transcriptome data can be used to impact clinical decision-making beyond prognosis. Here we discuss the potential of genomics to alter management of a patient who presented with high-risk prostate adenocarcinoma. We suggest that this information help selecting patients for advanced imaging, chemotherapies, or clinical trial.
Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K; Asif, Mehar H
The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development including fruit ripening process respectively.
Full Text Available The WRKY gene family plays an important role in the development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana respectively. Evolutionary analysis suggests that the WRKY gene family expanded much before the speciation in both the species. Most of the orthologs retained in two species were from the γ duplication event which occurred prior to α and β genome-wide duplication (GWD events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs which might be involved in neo-functionalization of different WRKY members in two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/ development including fruit ripening process respectively.
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute; Blake, Jonathon; Schwager, Christian; Ansorge, Wilhelm; Nielsen, John E; Skakkebaek, Niels E; Rajpert-De Meyts, Ewa; Leffers, Henrik
Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly expressed in testicular CIS, including many never reported in testicular neoplasms. Expression was further verified by semiquantitative reverse transcription-PCR and in situ hybridization. Among the highest expressed genes were NANOG and POU5F1, and reverse transcription-PCR revealed possible changes in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported as unstable in cultured ESCs. The close similarity between CIS and ESCs explains the pluripotency of CIS. Moreover, the findings are consistent with an early prenatal origin of TGCTs and thus suggest that etiologic factors operating in utero are of primary importance for the incidence trends of TGCTs. Finally, some of the highly expressed genes identified in this study are promising candidates for new diagnostic markers for CIS and/or TGCTs.
Full Text Available One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739, previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1 is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178, which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644 was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the
Full Text Available Abstract Background Cervical carcinoma (CC is a leading cause of death among women worldwide. Human papilloma virus (HPV is a major etiological factor in CC and HPV 16 is the more frequent viral type present. Our aim was to characterize metabolic pathways altered in HPV 16 tumor samples by means of transcriptome wide analysis and bioinformatics tools for visualizing expression data in the context of KEGG biological pathways. Results We found 2,067 genes significantly up or down-modulated (at least 2-fold in tumor clinical samples compared to normal tissues, representing ~3.7% of analyzed genes. Cervical carcinoma was associated with an important up-regulation of Wnt signaling pathway, which was validated by in situ hybridization in clinical samples. Other up-regulated pathways were those of calcium signaling and MAPK signaling, as well as cell cycle-related genes. There was down-regulation of focal adhesion, TGF-β signaling, among other metabolic pathways. Conclusion This analysis of HPV 16 tumors transcriptome could be useful for the identification of genes and molecular pathways involved in the pathogenesis of cervical carcinoma. Understanding the possible role of these proteins in the pathogenesis of CC deserves further studies.
Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J; Girbal, Laurence; Cocaign-Bousquet, Muriel
Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation.
Full Text Available We provide microarray data comparing genome-wide differential expression and pathology throughout life in four lines of “amyloid” transgenic mice (mutant human APP, PSEN1, or APP/PSEN1 and “TAU” transgenic mice (mutant human MAPT gene. Microarray data were validated by qPCR and by comparison to human studies, including genome-wide association study (GWAS hits. Immune gene expression correlated tightly with plaques whereas synaptic genes correlated negatively with neurofibrillary tangles. Network analysis of immune gene modules revealed six hub genes in hippocampus of amyloid mice, four in common with cortex. The hippocampal network in TAU mice was similar except that Trem2 had hub status only in amyloid mice. The cortical network of TAU mice was entirely different with more hub genes and few in common with the other networks, suggesting reasons for specificity of cortical dysfunction in FTDP17. This Resource opens up many areas for investigation. All data are available and searchable at http://www.mouseac.org.
Huang, D.; Li, M.; He, S.
ROP/RAC GTPases is a plant-specific subfamily of Rho GTPases that plays a versatile role in the regulation of plant growth, development, in hormone signal transduction and response to the environment. Prior to the present study, only one Rop gene in pepper has been described. However, with the recent release of the draft genome sequence of pepper allowes us to conduct a genome wide search to identify how many Rop family members existed in pepper genome. We carried out bioinformatics analysis to establish the conserved as well as divergent regions on the protein sequences, phylogenetically analysis and the corresponding result shows that, CaROPs could be distributed into four groups as described in the literature for their homologs in Arabidopsis. To understand the function of nine Rop genes in pepper, we accordingly studied the tissue, fruit development and ripening expression patterns of CaRop genes by obtained RNA-seq data from public database. From our analysis, we realized that the expression of CaRop genes shows no total tissue or developmental specific expression. Furthermore, gene expression profiles of CaRop in response to environment stresses and hormone treatment, such as inoculated with Ralstonia solanacearum, by heat stress as well as treated with four phytohormones respectively and evaluated with real time RT-PCR. The potential involvement of specific CaRop genes in growth, fruit development, ripening, environment stresses as well as hormone responses discussed and may lay the foundation for future functional analysis to unravel their biological roles. (author)
Full Text Available Half-size adenosine triphosphate-binding cassette transporter subgroup G (ABCG genes play crucial roles in regulating the movements of a variety of substrates and have been well studied in several plants. However, half-size ABCGs have not been characterized in detail in apple (Malus × domestica Borkh.. Here, we performed a genome-wide identification and expression analysis of the half-size ABCG gene family in apple. A total of 46 apple half-size ABCGs were identified and divided into six clusters according to the phylogenetic analysis. A gene structural analysis showed that most half-size ABCGs in the same cluster shared a similar exon–intron organization. A gene duplication analysis showed that segmental, tandem and whole-genome duplications could account for the expansion of half-size ABCG transporters in M. domestica. Moreover, a promoter scan, digital expression analysis and RNA-seq revealed that MdABCG21 may be involved in root's cytokinin transport and that ABCG17 may be involved in the lateral bud development of M. spectabilis ‘Bly114’ by mediating cytokinin transport. The data presented here lay the foundation for further investigations into the biological and physiological processes and functions of half-size ABCG genes in apple. Keywords: apple, ABCG gene, duplication, gene expression
Li, Qingtian; Liu, Jia; Tan, Dunxian; Allan, Andrew C; Jiang, Yuzhuang; Xu, Xuefeng; Han, Zhenhai; Kong, Jin
In some areas of cultivation, a lack of salt tolerance severely affects plant productivity. Apple, Malus x domestica Borkh., is sensitive to salt, and, as a perennial woody plant the mechanism of salt stress adaption will be different from that of annual herbal model plants, such as Arabidopsis. Malus zumi is a salt tolerant apple rootstock, which survives high salinity (up to 0.6% NaCl). To examine the mechanism underlying this tolerance, a genome-wide expression analysis was performed, using a cDNA library constructed from salt-treated seedlings of Malus zumi. A total of 15,000 cDNA clones were selected for microarray analysis. In total a group of 576 cDNAs, of which expression changed more than four-fold, were sequenced and 18 genes were selected to verify their expression pattern under salt stress by semi-quantitative RT-PCR. Our genome-wide expression analysis resulted in the isolation of 50 novel Malus genes and the elucidation of a new apple-specific mechanism of salt tolerance, including the stabilization of photosynthesis under stress, involvement of phenolic compounds, and sorbitol in ROS scavenging and osmoprotection. The promoter regions of 111 genes were analyzed by PlantCARE, suggesting an intensive cross-talking of abiotic stress in Malus zumi. An interaction network of salt responsive genes was constructed and molecular regulatory pathways of apple were deduced. Our research will contribute to gene function analysis and further the understanding of salt-tolerance mechanisms in fruit trees.
Athma A Pai
Full Text Available The modification of DNA by methylation is an important epigenetic mechanism that affects the spatial and temporal regulation of gene expression. Methylation patterns have been described in many contexts within and across a range of species. However, the extent to which changes in methylation might underlie inter-species differences in gene regulation, in particular between humans and other primates, has not yet been studied. To this end, we studied DNA methylation patterns in livers, hearts, and kidneys from multiple humans and chimpanzees, using tissue samples for which genome-wide gene expression data were also available. Using the multi-species gene expression and methylation data for 7,723 genes, we were able to study the role of promoter DNA methylation in the evolution of gene regulation across tissues and species. We found that inter-tissue methylation patterns are often conserved between humans and chimpanzees. However, we also found a large number of gene expression differences between species that might be explained, at least in part, by corresponding differences in methylation levels. In particular, we estimate that, in the tissues we studied, inter-species differences in promoter methylation might underlie as much as 12%-18% of differences in gene expression levels between humans and chimpanzees.
Full Text Available Non-small cell lung cancer (NSCLC represents a genomically unstable cancer type with extensive copy number aberrations. The relationship of gene copy number alterations and subsequent mRNA levels has only fragmentarily been described. The aim of this study was to conduct a genome-wide analysis of gene copy number gains and corresponding gene expression levels in a clinically well annotated NSCLC patient cohort (n = 190 and their association with survival. While more than half of all analyzed gene copy number-gene expression pairs showed statistically significant correlations (10,296 of 18,756 genes, high correlations, with a correlation coefficient >0.7, were obtained only in a subset of 301 genes (1.6%, including KRAS, EGFR and MDM2. Higher correlation coefficients were associated with higher copy number and expression levels. Strong correlations were frequently based on few tumors with high copy number gains and correspondingly increased mRNA expression. Among the highly correlating genes, GO groups associated with posttranslational protein modifications were particularly frequent, including ubiquitination and neddylation. In a meta-analysis including 1,779 patients we found that survival associated genes were overrepresented among highly correlating genes (61 of the 301 highly correlating genes, FDR adjusted p<0.05. Among them are the chaperone CCT2, the core complex protein NUP107 and the ubiquitination and neddylation associated protein CAND1. In conclusion, in a comprehensive analysis we described a distinct set of highly correlating genes. These genes were found to be overrepresented among survival-associated genes based on gene expression in a large collection of publicly available datasets.
Zhou, Xiaojin; Li, Suzhen; Zhao, Qianqian; Liu, Xiaoqing; Zhang, Shaojun; Sun, Cheng; Fan, Yunliu; Zhang, Chunyi; Chen, Rumei
Background Nicotianamine (NA), a ubiquitous molecule in plants, is an important metal ion chelator and the main precursor for phytosiderophores biosynthesis. Considerable progress has been achieved in cloning and characterizing the functions of nicotianamine synthase (NAS) in plants including barley, Arabidopsis and rice. Maize is not only an important cereal crop, but also a model plant for genetics and evolutionary study. The genome sequencing of maize was completed, and many gene families ...
Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua
Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088
Full Text Available Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation.
Changes in the environment, such as those caused by climate change, can exert stress on plant growth, diversity and ultimately global food security. Thus, focused efforts to fully understand plant response to stress are urgently needed in order to develop strategies to cope with the effects of climate change. Because Physcomitrella patens holds a key evolutionary position bridging the gap between green algae and higher plants, and because it exhibits a well-developed stress tolerance, it is an excellent model for such exploration. Here, we have used Physcomitrella patens to study genome-wide responses to abiotic stress through transcriptomic analysis by a high-throughput sequencing platform. We report a comprehensive analysis of transcriptome dynamics, defining profiles of elicited gene regulation responses to abiotic stress-associated hormone Abscisic Acid (ABA), cold, drought, and salt treatments. We identified more than 20,000 genes expressed under each aforementioned stress treatments, of which 9,668 display differential expression in response to stress. The comparison of Physcomitrella patens stress regulated genes with unicellular algae, vascular and flowering plants revealed genomic delineation concomitant with the evolutionary movement to land, including a general gene family complexity and loss of genes associated with different functional groups.
Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.
Yadav, Ruby; Singh, Puneet K; Shukla, Pratyoosh
Probiotic supplements in food industry have attracted a lot of attention and shown a remarkable growth in this field. Metabolic engineering (ME) approaches enable understanding their mechanism of action and increases possibility of designing probiotic strains with desired functions. Probiotic microorganisms generally referred as industrially important lactic acid bacteria (LAB) which are involved in fermenting dairy products, food, beverages and produces lactic acid as final product. A number of illustrations of metabolic engineering approaches in industrial probiotic bacteria have been described in this review including transcriptomic studies of Lactobacillus reuteri and improvement in exopolysaccharide (EPS) biosynthesis yield in Lactobacillus casei LC2W. This review summaries various metabolic engineering approaches for exploring metabolic pathways. These approaches enable evaluation of cellular metabolic state and effective editing of microbial genome or introduction of novel enzymes to redirect the carbon fluxes. In addition, various system biology tools such as in silico design commonly used for improving strain performance is also discussed. Finally, we discuss the integration of metabolic engineering and genome profiling which offers a new way to explore metabolic interactions, fluxomics and probiogenomics using probiotic bacteria like Bifidobacterium spp and Lactobacillus spp. Copyright© Bentham Science Publishers; For any queries, please email at email@example.com.
Bassett, Anne S; Lowther, Chelsea; Merico, Daniele; Costain, Gregory; Chow, Eva W C; van Amelsvoort, Therese; McDonald-McGinn, Donna; Gur, Raquel E; Swillen, Ann; Van den Bree, Marianne; Murphy, Kieran; Gothelf, Doron; Bearden, Carrie E; Eliez, Stephan; Kates, Wendy; Philip, Nicole; Sashi, Vandana; Campbell, Linda; Vorstman, Jacob; Cubells, Joseph; Repetto, Gabriela M; Simon, Tony; Boot, Erik; Heung, Tracy; Evers, Rens; Vingerhoets, Claudia; van Duin, Esther; Zackai, Elaine; Vergaelen, Elfi; Devriendt, Koen; Vermeesch, Joris R; Owen, Michael; Murphy, Clodagh; Michaelovosky, Elena; Kushan, Leila; Schneider, Maude; Fremont, Wanda; Busa, Tiffany; Hooper, Stephen; McCabe, Kathryn; Duijff, Sasja; Isaev, Karin; Pellecchia, Giovanna; Wei, John; Gazzellone, Matthew J; Scherer, Stephen W; Emanuel, Beverly S; Guo, Tingwei; Morrow, Bernice E; Marshall, Christian R
Chromosome 22q11.2 deletion syndrome (22q11.2DS) is associated with a more than 20-fold increased risk for developing schizophrenia. The aim of this study was to identify additional genetic factors (i.e., "second hits") that may contribute to schizophrenia expression. Through an international consortium, the authors obtained DNA samples from 329 psychiatrically phenotyped subjects with 22q11.2DS. Using a high-resolution microarray platform and established methods to assess copy number variation (CNV), the authors compared the genome-wide burden of rare autosomal CNV, outside of the 22q11.2 deletion region, between two groups: a schizophrenia group and those with no psychotic disorder at age ≥25 years. The authors assessed whether genes overlapped by rare CNVs were overrepresented in functional pathways relevant to schizophrenia. Rare CNVs overlapping one or more protein-coding genes revealed significant between-group differences. For rare exonic duplications, six of 19 gene sets tested were enriched in the schizophrenia group; genes associated with abnormal nervous system phenotypes remained significant in a stepwise logistic regression model and showed significant interactions with 22q11.2 deletion region genes in a connectivity analysis. For rare exonic deletions, the schizophrenia group had, on average, more genes overlapped. The additional rare CNVs implicated known (e.g., GRM7, 15q13.3, 16p12.2) and novel schizophrenia risk genes and loci. The results suggest that additional rare CNVs overlapping genes outside of the 22q11.2 deletion region contribute to schizophrenia risk in 22q11.2DS, supporting a multigenic hypothesis for schizophrenia. The findings have implications for understanding expression of psychotic illness and herald the importance of whole-genome sequencing to appreciate the overall genomic architecture of schizophrenia.
Yun Peng eCao
Full Text Available The MYB family is one of the largest families of transcription factors in plants. Although some MYBs have been reported to play roles in secondary metabolism, no comprehensive study of the MYB family in Chinese pear (Pyrus bretschneideri Rehd. has been reported. In the present study, we performed genome-wide analysis of MYB genes in Chinese pear, designated as PbMYBs, including analyses of their phylogenic relationships, structures, chromosomal locations, promoter regions, GO annotations and collinearity. A total of 129 PbMYB genes were identified in the pear genome and were divided into 31 subgroups based on phylogenetic analysis. These PbMYBs were unevenly distributed among 16 chromosomes (total of 17 chromosomes. The occurrence of gene duplication events indicated that whole-genome duplication and segmental duplication likely played key roles in expansion of the PbMYB gene family. Ka/Ks analysis suggested that the duplicated PbMYBs mainly experienced purifying selection with restrictive functional divergence after the duplication events. Interspecies microsynteny analysis revealed maximum orthology between pear and peach, followed by plum and strawberry. Subsequently, the expression patterns of 20 PbMYB genes that may be involved in lignin biosynthesis according to their phylogenetic relationships were examined throughout fruit development. Among the twenty genes examined, PbMYB25 and PbMYB52 exhibited expression patterns consistent with the typical variations in the lignin content previously reported. Moreover, sub-cellular localization analysis revealed that two proteins PbMYB25 and PbMYB52 were localized to the nucleus. All together, PbMYB25 and PbMYB52 were inferred to be candidate genes involved in the regulation of lignin biosynthesis during the development of pear fruit. This study provides useful information for further functional analysis of the MYB gene family in pear.
Wen, Feng; Zhu, Hong; Li, Peng; Jiang, Min; Mao, Wenqing; Ong, Chermaine; Chu, Zhaoqing
Members of plant WRKY gene family are ancient transcription factors that function in plant growth and development and respond to biotic and abiotic stresses. In our present study, we have investigated WRKY family genes in Brachypodium distachyon, a new model plant of family Poaceae. We identified a total of 86 WRKY genes from B. distachyon and explored their chromosomal distribution and evolution, domain alignment, promoter cis-elements, and expression profiles. Combining the analysis of phylogenetic tree of BdWRKY genes and the result of expression profiling, results showed that most of clustered gene pairs had higher similarities in the WRKY domain, suggesting that they might be functionally redundant. Neighbour-joining analysis of 301 WRKY domains from Oryza sativa, Arabidopsis thaliana, and B. distachyon suggested that BdWRKY domains are evolutionarily more closely related to O. sativa WRKY domains than those of A. thaliana. Moreover, tissue-specific expression profile of BdWRKY genes and their responses to phytohormones and several biotic or abiotic stresses were analysed by quantitative real-time PCR. The results showed that the expression of BdWRKY genes was rapidly regulated by stresses and phytohormones, and there was a strong correlation between promoter cis-elements and the phytohormones-induced BdWRKY gene expression. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Olsen, Jørgen; Gerds, Thomas A; Seidelin, Jakob B
Background: Endoscopically obtained mucosal biopsies play an important role in the differential diagnosis between ulcerative colitis (UC) and Crohn's disease (CD), but in some cases where neither macroscopic nor microscopic signs of inflammation are present the biopsies provide only inconclusive...... biopsies from 78 patients were included. A diagnostic model was derived with the random forest method based on 71 biopsies from 60 patients. The model-internal out-of-bag performance measure yielded perfect classification. Furthermore, the model was validated in independent 18 noninflamed biopsies from 18...... of random forest modeling of genome-wide gene expression data for distinguishing quiescent and active UC colonic mucosa versus control and CD colonic mucosa.(Inflamm Bowel Dis 2009)....
Christine E. Baer
Full Text Available To capture global responses to metal poisoning and mechanistic insights into metal toxicity, gene expression changes were evaluated in whole adult male zebrafish following acute 24 h high dose exposure to three metals with known human health risks. Male adult zebrafish were exposed to nickel chloride, cobalt chloride or sodium dichromate at concentrations corresponding to their respective 96 h LC20, LC40 and LC60 (i.e. 96 h concentrations at which 20%, 40% and 60% lethality is expected, respectively. Histopathology was performed on a subset of metal-exposed zebrafish to phenotypically anchor transcriptional changes associated with each metal exposure. Here we describe in detail the contents and quality controls for the gene expression and other data associated with the study published by Hussainzada and colleagues in BMC Pharmacology and Toxicology (Hussainzada et al., 2014 with the data uploaded to Gene Expression Omnibus (accession number GSE50648.
Kristensen, Nanna Ny; Olsen, Jørgen; Gad, Monika
BACKGROUND: In the adoptive transfer model of colitis it has been shown that regulatory T cells (Treg) can hinder disease development and cure already existing mild colitis. The mechanisms underlying this regulatory effect of CD4(+)CD25(+) Tregs are not well understood. METHODS: To identify......Chip Mouse Genome 430 2.0 Array), which enabled an analysis of a complete set of RNA transcript levels in each sample. Array results were confirmed by real-time reverse-transcriptase polymerase chain reaction (RT-PCR). RESULTS: Data were analyzed using combined projections to latent structures and functional...... annotation analysis. The colitic samples were clearly distinguishable from samples from normal mice by a vast number of inflammation- and growth factor-related transcripts. In contrast, the Treg-protected animals could not be distinguished from either the normal BALB/c mice or the normal SCID mice. mRNA...
Full Text Available Abstract Background Tanshinone IIA (Tan IIA is a diterpene quinone extracted from the root of Salvia miltiorrhiza, a Chinese traditional herb. Although previous studies have reported the anti-tumor effects of Tan IIA on various human cancer cells, the underlying mechanisms are not clear. The current study was undertaken to investigate the molecular mechanisms of Tan IIA's apoptotic effects on leukemia cells in vitro. Methods The cytotoxicity of Tan IIA on different types of leukemia cell lines was evaluated by the 3-[4,5-dimethylthiazol-2,5]-diphenyl tetrazolium bromide (MTT assay on cells treated without or with Tan IIA at different concentrations for different time periods. Cellular apoptosis progression with and without Tan IIA treatment was analyzed by Annexin V and Caspase 3 assays. Gene expression profiling was used to identify the genes regulated after Tan IIA treatment and those differentially expressed among the five cell lines. Confirmation of these expression regulations was carried out using real-time quantitative PCR and ELISA. The antagonizing effect of a PXR inhibitor L-SFN on Tan IIA treatment was tested using Colony Forming Unit Assay. Results Our results revealed that Tan IIA had different cytotoxic activities on five types of leukemia cells, with the highest toxicity on U-937 cells. Tan IIA inhibited the growth of U-937 cells in a time- and dose-dependent manner. Annexin V and Caspase-3 assays showed that Tan IIA induced apoptosis in U-937 cells. Using gene expression profiling, 366 genes were found to be significantly regulated after Tan IIA treatment and differentially expressed among the five cell lines. Among these genes, CCL2 was highly expressed in untreated U-937 cells and down-regulated significantly after Tan IIA treatment in a dose-dependent manner. RT-qPCR analyses validated the expression regulation of 80% of genes. Addition of L- sulforaphane (L-SFN, an inhibitor of Pregnane × receptor (PXR significantly
Mende, Susanne; Royer, Loic; Herr, Alexander; Schmiedel, Janet; Deschauer, Marcus; Klopstock, Thomas; Kostic, Vladimir S; Schroeder, Michael; Reichmann, Heinz; Storch, Alexander
The heteroplasmic mitochondrial DNA (mtDNA) mutation A3243G causes the mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes (MELAS) syndrome as one of the most frequent mitochondrial diseases. The process of reconfiguration of nuclear gene expression profile to accommodate cellular processes to the functional status of mitochondria might be a key to MELAS disease manifestation and could contribute to its diverse phenotypic presentation. To determine master regulatory protein networks and disease-modifying genes in MELAS syndrome. Analyses of whole blood transcriptomes from 10 MELAS patients using a novel strategy by combining classic Affymetrix oligonucleotide microarray profiling with regulatory and protein interaction network analyses. Hierarchical cluster analysis elucidated that the relative abundance of mutant mtDNA molecules is decisive for the nuclear gene expression response. Further analyses confirmed not only transcription factors already known to be involved in mitochondrial diseases (such as TFAM), but also detected the hypoxia-inducible factor 1 complex, nuclear factor Y and cAMP responsive element-binding protein-related transcription factors as novel master regulators for reconfiguration of nuclear gene expression in response to the MELAS mutation. Correlation analyses of gene alterations and clinico-genetic data detected significant correlations between A3243G-induced nuclear gene expression changes and mutant mtDNA load as well as disease characteristics. These potential disease-modifying genes influencing the expression of the MELAS phenotype are mainly related to clusters primarily unrelated to cellular energy metabolism, but important for nucleic acid and protein metabolism, and signal transduction. Our data thus provide a framework to search for new pathogenetic concepts and potential therapeutic approaches to treat the MELAS syndrome.
Full Text Available Because myostatin normally limits skeletal muscle growth, there are extensive efforts to develop myostatin inhibitors for clinical use. One potential concern is that in muscle degenerative diseases, inducing hypertrophy may increase stress on dystrophic fibers. Our study shows that blocking this pathway in dysferlin deficient mice results in early improvement in histopathology but ultimately accelerates muscle degeneration. Hence, benefits of this approach should be weighed against these potential detrimental effects. Here, we present detailed experimental methods and analysis for the gene expression profiling described in our recently published study in Human Molecular Genetics (Lee et al., 2015. Our data sets have been deposited in the Gene Expression Omnibus (GEO database (GSE62945 and are available at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE62945. Our data provide a resource for exploring molecular mechanisms that are related to hypertrophy-induced, accelerated muscular degeneration in dysferlinopathy.
Brudzewsky, D.; Pedersen, A. E.; Claesson, M. H.
Inflammatory bowel disease (IBD) is a multifactorial disorder with an unknown aetiology. The aim of this study is to employ a murine model of IBD to identify pathways and genes, which may play a key role in the pathogenesis of IBD and could be important for discovery of new disease markers in human...... and colitis mice, and among these genes there is an overrepresentation of genes involved in inflammatory processes. Some of the most significant genes showing higher expression encode S100A proteins and chemokines involved in trafficking of leucocytes in inflammatory areas. Classification by gene clustering...... based on the genes with the significantly altered gene expression corresponds to two different levels of inflammation as established by the histological scoring of the inflamed rectum. These data demonstrate that this SCID T-cell transfer model is a useful animal model for human IBD and can be used...
Pellegrino, R; Sunaga, D Y; Guindalini, C; Martins, R C S; Mazzotti, D R; Wei, Z; Daye, Z J; Andersen, M L; Tufik, S
Although the specific functions of sleep have not been completely elucidated, the literature has suggested that sleep is essential for proper homeostasis. Sleep loss is associated with changes in behavioral, neurochemical, cellular, and metabolic function as well as impaired immune response. Using high-resolution microarrays we evaluated the gene expression profiles of healthy male volunteers who underwent 60 h of prolonged wakefulness (PW) followed by 12 h of sleep recovery (SR). Peripheral whole blood was collected at 8 am in the morning before the initiation of PW (Baseline), after the second night of PW, and one night after SR. We identified over 500 genes that were differentially expressed. Notably, these genes were related to DNA damage and repair and stress response, as well as diverse immune system responses, such as natural killer pathways including killer cell lectin-like receptors family, as well as granzymes and T-cell receptors, which play important roles in host defense. These results support the idea that sleep loss can lead to alterations in molecular processes that result in perturbation of cellular immunity, induction of inflammatory responses, and homeostatic imbalance. Moreover, expression of multiple genes was downregulated following PW and upregulated after SR compared with PW, suggesting an attempt of the body to re-establish internal homeostasis. In silico validation of alterations in the expression of CETN3, DNAJC, and CEACAM genes confirmed previous findings related to the molecular effects of sleep deprivation. Thus, the present findings confirm that the effects of sleep loss are not restricted to the brain and can occur intensely in peripheral tissues.
Full Text Available The UGlcAE has the capability of interconverting UDP-d-galacturonic acid and UDP-d-glucuronic acid, and UDP-d-galacturonic acid is an activated precursor for the synthesis of pectins in plants. In this study, we identified nine UGlcAE protein-encoding genes in tomato. The nine UGlcAE genes that were distributed on eight chromosomes in tomato, and the corresponding proteins contained one or two trans-membrane domains. The phylogenetic analysis showed that SlUGlcAE genes could be divided into seven groups, designated UGlcAE1 to UGlcAE6, of which the UGlcAE2 were classified into two groups. Expression profile analysis revealed that the SlUGlcAE genes display diverse expression patterns in various tomato tissues. Selective pressure analysis indicated that all of the amino acid sites of SlUGlcAE proteins are undergoing purifying selection. Fifteen stress-, hormone-, and development-related elements were identified in the upstream regions (0.5 kb of these SlUGlcAE genes. Furthermore, we investigated the expression patterns of SlUGlcAE genes in response to three hormones (indole-3-acetic acid (IAA, gibberellin (GA, and salicylic acid (SA. We detected firmness, pectin contents, and expression levels of UGlcAE family genes during the development of tomato fruit. Here, we systematically summarize the general characteristics of the SlUGlcAE genes in tomato, which could provide a basis for further function studies of tomato UGlcAE genes.
Nilsson, Emil K.; Bostr?m, Adrian E.; Mwinyi, Jessica; Schi?th, Helgi B.
Abstract Despite an established link between sleep deprivation and epigenetic processes in humans, it remains unclear to what extent sleep deprivation modulates DNA methylation. We performed a within-subject randomized blinded study with 16 healthy subjects to examine the effect of one night of total sleep deprivation (TSD) on the genome-wide methylation profile in blood compared with that in normal sleep. Genome-wide differences in methylation between both conditions were assessed by applyin...
Wang, John; Ross, Kenneth G; Keller, Laurent
Explaining how interactions between genes and the environment influence social behavior is a fundamental research goal, yet there is limited relevant information for species exhibiting natural variation in social organization. The fire ant Solenopsis invicta is characterized by a remarkable form of social polymorphism, with the presence of one or several queens per colony and the expression of other phenotypic and behavioral differences being completely associated with allelic variation at a single Mendelian factor marked by the gene Gp-9. Microarray analyses of adult workers revealed that differences in the Gp-9 genotype are associated with the differential expression of an unexpectedly small number of genes, many of which have predicted functions, implying a role in chemical communication relevant to the regulation of colony queen number. Even more surprisingly, worker gene expression profiles are more strongly influenced by indirect effects associated with the Gp-9 genotypic composition within their colony than by the direct effect of their own Gp-9 genotype. This constitutes an unusual example of an "extended phenotype" and suggests a complex genetic architecture with a single Mendelian factor, directly and indirectly influencing the individual behaviors that, in aggregate, produce an emergent colony-level phenotype.
Chen, Min; Tan, Qiuping; Sun, Mingyue; Li, Dongmei; Fu, Xiling; Chen, Xiude; Xiao, Wei; Li, Ling; Gao, Dongsheng
Bud dormancy in deciduous fruit trees is an important adaptive mechanism for their survival in cold climates. The WRKY genes participate in several developmental and physiological processes, including dormancy. However, the dormancy mechanisms of WRKY genes have not been studied in detail. We conducted a genome-wide analysis and identified 58 WRKY genes in peach. These putative genes were located on all eight chromosomes. In bioinformatics analyses, we compared the sequences of WRKY genes from peach, rice, and Arabidopsis. In a cluster analysis, the gene sequences formed three groups, of which group II was further divided into five subgroups. Gene structure was highly conserved within each group, especially in groups IId and III. Gene expression analyses by qRT-PCR showed that WRKY genes showed different expression patterns in peach buds during dormancy. The mean expression levels of six WRKY genes (Prupe.6G286000, Prupe.1G393000, Prupe.1G114800, Prupe.1G071400, Prupe.2G185100, and Prupe.2G307400) increased during endodormancy and decreased during ecodormancy, indicating that these six WRKY genes may play a role in dormancy in a perennial fruit tree. This information will be useful for selecting fruit trees with desirable dormancy characteristics or for manipulating dormancy in genetic engineering programs.
Ma, Jin-Qi; Jian, Hong-Ju; Yang, Bo; Lu, Kun; Zhang, Ao-Xiang; Liu, Pu; Li, Jia-Na
Growth regulating-factors (GRFs) are plant-specific transcription factors that help regulate plant growth and development. Genome-wide identification and evolutionary analyses of GRF gene families have been performed in Arabidopsis thaliana, Zea mays, Oryza sativa, and Brassica rapa, but a comprehensive analysis of the GRF gene family in oilseed rape (Brassica napus) has not yet been reported. In the current study, we identified 35 members of the BnGRF family in B. napus. We analyzed the chromosomal distribution, phylogenetic relationships (Bayesian Inference and Neighbor Joining method), gene structures, and motifs of the BnGRF family members, as well as the cis-acting regulatory elements in their promoters. We also analyzed the expression patterns of 15 randomly selected BnGRF genes in various tissues and in plant varieties with different harvest indices and gibberellic acid (GA) responses. The expression levels of BnGRFs under GA treatment suggested the presence of possible negative feedback regulation. The evolutionary patterns and expression profiles of BnGRFs uncovered in this study increase our understanding of the important roles played by these genes in oilseed rape. Copyright © 2017. Published by Elsevier B.V.
Full Text Available Cytochrome P450 93 family (CYP93 belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies-CYP93A-K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution-CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis.
Bhati, Kaushal K; Sharma, Shivani; Aggarwal, Sipla; Kaur, Mandeep; Shukla, Vishnu; Kaur, Jagdeep; Mantri, Shrikant; Pandey, Ajay K
The ABCC multidrug resistance associated proteins (ABCC-MRP), a subclass of ABC transporters are involved in multiple physiological processes that include cellular homeostasis, metal detoxification, and transport of glutathione-conjugates. Although they are well-studied in humans, yeast, and Arabidopsis, limited efforts have been made to address their possible role in crop like wheat. In the present work, 18 wheat ABCC-MRP proteins were identified that showed the uniform distribution with sub-families from rice and Arabidopsis. Organ-specific quantitative expression analysis of wheat ABCC genes indicated significantly higher accumulation in roots (TaABCC2, TaABCC3, and TaABCC11 and TaABCC12), stem (TaABCC1), leaves (TaABCC16 and TaABCC17), flag leaf (TaABCC14 and TaABCC15), and seeds (TaABCC6, TaABCC8, TaABCC12, TaABCC13, and TaABCC17) implicating their role in the respective tissues. Differential transcript expression patterns were observed for TaABCC genes during grain maturation speculating their role during seed development. Hormone treatment experiments indicated that some of the ABCC genes could be transcriptionally regulated during seed development. In the presence of Cd or hydrogen peroxide, distinct molecular expression of wheat ABCC genes was observed in the wheat seedlings, suggesting their possible role during heavy metal generated oxidative stress. Functional characterization of the wheat transporter, TaABCC13 a homolog of maize LPA1 confirms its role in glutathione-mediated detoxification pathway and is able to utilize adenine biosynthetic intermediates as a substrate. This is the first comprehensive inventory of wheat ABCC-MRP gene subfamily.
Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju
A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed), is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since, response of this plant to drought stress has not been addressed at the molecular level; we conducted microarray analysis to capture transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis also revealed especially the altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together the present study has developed foundational and new transcriptome data sets for drought stress in flax.
Nathan M. Good
Full Text Available Methyloversatilis universalis FAM5 utilizes single carbon compounds such as methanol or methylamine as a sole source of carbon and energy. Expression profiling reveals distinct sets of genes altered during growth on methylamine vs methanol. As expected, all genes for the N-methylglutamate pathway were induced during growth on methylamine. Among other functions responding to the aminated source of C1-carbon, are a heme-containing amine dehydrogenase (Qhp, a distant homologue of formaldehyde activating enzyme (Fae3, molybdenum-containing formate dehydrogenase, ferredoxin reductase, a set of homologues to urea/ammonium transporters and amino-acid permeases. Mutants lacking one of the functional subunits of the amine dehydrogenase (ΔqhpA or Δfae3 showed no growth defect on C1-compounds. M. universalis FAM5 strains with a lesion in the H4-folate pathway were not able to use any C1-compound, methanol or methylamine. Genes essential for C1-assimilation (the serine cycle and glyoxylate shunt and H4MTP-pathway for formaldehyde oxidation showed similar levels of expression on both C1-carbon sources. M. universalis FAM5 possesses three homologs of the formaldehyde activating enzyme, a key enzyme of the H4MTP-pathway. Strains lacking the canonical Fae (fae1 lost the ability to grow on both C1-compounds. However, upon incubation on methylamine the fae1-mutant produced revertants (Δfae1R, which regained the ability to grow on methylamine. Double and triple mutants (Δfae1RΔfae3, or Δfae1RΔfae2 or Δfae1RΔfae2Δfae3 constructed in the revertant strain background showed growth similar to the Δfae1R phenotype. The metabolic pathways for utilization of methanol and methylamine in Methyloversatilis universalis FAM5 are reconstructed based on these gene expression and phenotypic data.
Fomenko, Dmitri E.; Koc, Ahmet; Agisheva, Natalia; Jacobsen, Michael; Kaya, Alaattin; Malinouski, Mikalai; Rutherford, Julian C.; Siu, Kam-Leung; Jin, Dong-Yan; Winge, Dennis R.; Gladyshev, Vadim N.
Hydrogen peroxide is thought to regulate cellular processes by direct oxidation of numerous cellular proteins, whereas antioxidants, most notably thiol peroxidases, are thought to reduce peroxides and inhibit H2O2 response. However, thiol peroxidases have also been implicated in activation of transcription factors and signaling. It remains unclear if these enzymes stimulate or inhibit redox regulation and whether this regulation is widespread or limited to a few cellular components. Herein, we found that Saccharomyces cerevisiae cells lacking all eight thiol peroxidases were viable and withstood redox stresses. They transcriptionally responded to various redox treatments, but were unable to activate and repress gene expression in response to H2O2. Further studies involving redox transcription factors suggested that thiol peroxidases are major regulators of global gene expression in response to H2O2. The data suggest that thiol peroxidases sense and transfer oxidative signals to the signaling proteins and regulate transcription, whereas a direct interaction between H2O2 and other cellular proteins plays a secondary role. PMID:21282621
Full Text Available Calcineurin B-like (CBL proteins, as calcium sensors, play pivotal roles in plant responses to diverse abiotic stresses and in growth and development through interaction with CBL-interacting protein kinases (CIPKs. However, knowledge about functions and evolution of CBLs in Gossypium plants is scarce. Here, we conducted a genome-wide survey and identified 13, 13 and 22 CBL genes in the progenitor diploid Gossypium arboreum and Gossypium raimondii, and the cultivated allotetraploid Gossypium hirsutum, respectively. Analysis of physical properties, chromosomal locations, conserved domains and phylogeny indicated rather conserved nature of CBLs among the three Gossypium species. Moreover, these CBLs have closer genetic evolutionary relationship with the CBLs from cocoa than with those from other plants. Most CBL genes underwent evolution under purifying selection in the three Gossypium plants. Additionally, nearly all G. hirsutum CBL (GhCBL genes were expressed in the root, stem, leaf, flower and fiber. Many GhCBLs were preferentially expressed in the flower while several GhCBLs were mainly expressed in roots. Expression patterns of GhCBL genes in response to potassium deficiency were also studied. The expression of most GhCBLs were moderately induced in roots after treatments with low-potassium stress. Yeast two-hybrid experiments indicated that GhCBL1-2, GhCBL1-3, GhCBL4-4, GhCBL8, GhCBL9 and GhCBL10-3 interacted with GhCIPK23, respectively. Our results provided a comprehensive view of the CBLs and valuable information for researchers to further investigate the roles and functional mechanisms of the CBLs in Gossypium.
Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi
MYB family genes are widely distributed in plants and comprise one of the largest transcription factors involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis forcitrus. PMID:25375352
Full Text Available BACKGROUND: High altitude pulmonary edema (HAPE is a life-threatening form of non-cardiogenic edema which occurs in unacclimatized but otherwise normal individuals within two to four days after rapid ascent to altitude beyond 3000 m. The precise pathoetiology and inciting mechanisms regulating HAPE remain unclear. METHODOLOGY/PRINCIPLE FINDINGS: We performed global gene expression profiling in individuals with established HAPE compared to acclimatized individuals. Our data suggests concurrent modulation of multiple pathways which regulate vascular homeostasis and consequently lung fluid dynamics. These pathways included those which regulate vasoconstriction through smooth muscle contraction, cellular actin cytoskeleton rearrangements and endothelial permeability/dysfunction. Some notable genes within these pathways included MYLK; rho family members ARGEF11, ARHGAP24; cell adhesion molecules such as CLDN6, CLDN23, PXN and VCAM1 besides other signaling intermediates. Further, several important regulators of systemic/pulmonary hypertension including ADRA1D, ECE1, and EDNRA were upregulated in HAPE. We also observed significant upregulation of genes involved in paracrine signaling through chemokines and lymphocyte activation pathways during HAPE represented by transcripts of TNF, JAK2, MAP2K2, MAP2K7, MAPK10, PLCB1, ARAF, SOS1, PAK3 and RELA amongst others. Perturbation of such pathways can potentially skew vascular homeostatic equilibrium towards altered vascular permeability. Additionally, differential regulation of hypoxia-sensing, hypoxia-response and OXPHOS pathway genes in individuals with HAPE were also observed. CONCLUSIONS/SIGNIFICANCE: Our data reveals specific components of the complex molecular circuitry underlying HAPE. We show concurrent perturbation of multiple pathways regulating vascular homeostasis and suggest multi-genic nature of regulation of HAPE.
Full Text Available The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI, with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.
Full Text Available BACKGROUND: Rheumatoid arthritis (RA and osteoarthritis (OA are two major types of joint diseases that share multiple common symptoms. However, their pathological mechanism remains largely unknown. The aim of our study is to identify RA and OA related-genes and gain an insight into the underlying genetic basis of these diseases. METHODS: We collected 11 whole genome-wide expression profiling datasets from RA and OA cohorts and performed a meta-analysis to comprehensively investigate their expression signatures. This method can avoid some pitfalls of single dataset analyses. RESULTS AND CONCLUSION: We found that several biological pathways (i.e., the immunity, inflammation and apoptosis related pathways are commonly involved in the development of both RA and OA. Whereas several other pathways (i.e., vasopressin-related pathway, regulation of autophagy, endocytosis, calcium transport and endoplasmic reticulum stress related pathways present significant difference between RA and OA. This study provides novel insights into the molecular mechanisms underlying this disease, thereby aiding the diagnosis and treatment of the disease.
Liu, Hongyun; Qin, Jiajia; Fan, Hui; Cheng, Jinjin; Li, Lin; Liu, Zheng
As a member of the GRAS gene family, SCARECROW - LIKE ( SCL ) genes encode transcriptional regulators that are involved in plant information transmission and signal transduction. In this study, 44 SCL genes including two SCARECROW genes in millet were identified to be distributed on eight chromosomes, except chromosome 6. All the millet genes contain motifs 6-8, indicating that these motifs are conserved during the evolution. SCL genes of millet were divided into eight groups based on the phylogenetic relationship and classification of Arabidopsis SCL genes. Several putative millet orthologous genes in Arabidopsis , maize and rice were identified. High throughput RNA sequencing revealed that the expressions of millet SCL genes in root, stem, leaf, spica, and along leaf gradient varied greatly. Analyses combining the gene expression patterns, gene structures, motif compositions, promoter cis -elements identification, alternative splicing of transcripts and phylogenetic relationship of SCL genes indicate that the these genes may play diverse functions. Functionally characterized SCL genes in maize, rice and Arabidopsis would provide us some clues for future characterization of their homologues in millet. To the best of our knowledge, this is the first study of millet SCL genes at the genome wide level. Our work provides a useful platform for functional analysis of SCL genes in millet, a model crop for C 4 photosynthesis and bioenergy studies.
Liu, Taibo; Huang, Binbin; Chen, Lin; Xian, Zhiqiang; Song, Shiwei; Chen, Riyuan; Hao, Yanwei
Polyamines (PAs), including putrescine (Put), spermidine (Spd), spermine (Spm), and thermospermine (T-Spm), play key roles in plant development, including fruit setting and ripening, morphogenesis, and abiotic/biotic stress. Their functions appear to be intimately related to their synthesis, which occurs via arginine/ornithine decarboxylase (ADC/ODC), Spd synthase (SPDS), Spm synthase (SPMS), and Acaulis5 (ACL5), respectively. Unfortunately, the expression and function of these PA synthesis-relate genes during specific developmental process or under stress have not been fully elucidated. Here, we present the results of a genome-wide analysis of the PA synthesis genes (ADC, ODC, SPDS, SPMS, ACL5) in the tomato (Solanum lycopersicum). In total, 14 PA synthesis-related genes were identified. Further analysis of their structures, conserved domains, phylogenetic trees, predicted subcellular localization, and promoter cis-regulatory elements were analyzed. Furthermore, we also performed experiments to evaluate their tissue expression patterns and under hormone and various stress treatments. To our knowledge, this is the first study to elucidate the mechanisms underlying PA function in this variety of tomato. Taken together, these data provide valuable information for future functional characterization of specific genes in the PA synthesis pathway in this and other plant species. Although additional research is required, the insight gained by this and similar studies can be used to improve our understanding of PA metabolism ultimately leading to more effective and consistent plant cultivation. Copyright © 2018 Elsevier B.V. All rights reserved.
Full Text Available Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97% for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83% for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances.
Chen, Dijun; Kaufmann, Kerstin
Key transcription factors (TFs) controlling the morphogenesis of flowers and leaves have been identified in the model plant Arabidopsis thaliana. Recent genome-wide approaches based on chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) enable systematic identification of genome-wide TF binding sites (TFBSs) of these regulators. Here, we describe a computational pipeline for analyzing ChIP-seq data to identify TFBSs and to characterize gene regulatory networks (GRNs) with applications to the regulatory studies of flower development. In particular, we provide step-by-step instructions on how to download, analyze, visualize, and integrate genome-wide data in order to construct GRNs for beginners of bioinformatics. The practical guide presented here is ready to apply to other similar ChIP-seq datasets to characterize GRNs of interest.
Full Text Available Nitrous oxide (N2O is a stable, ozone depleting greenhouse gas. Emissions of N2O into the atmosphere continue to rise, primarily due to the use of nitrogen-containing fertilizers by soil denitrifying microbes. It is clear more effective mitigation strategies are required to reduce emissions. One way to help develop future mitigation strategies is to address the currently poor understanding of transcriptional regulation of the enzymes used to produce and consume N2O. With this ultimate aim in mind we performed RNA-seq on a model soil denitrifier, Paracoccus denitrificans, cultured anaerobically under high N2O and low N2O emitting conditions, and aerobically under zero N2O emitting conditions to identify small RNAs (sRNAs with potential regulatory functions transcribed under these conditions. sRNAs are short (∼40–500 nucleotides non-coding RNAs that regulate a wide range of activities in many bacteria. 167 sRNAs were identified throughout the P. denitrificans genome which are either present in intergenic regions or located antisense to ORFs. Furthermore, many of these sRNAs are differentially expressed under high N2O and low N2O emitting conditions respectively, suggesting they may play a role in production or reduction of N2O. Expression of 16 of these sRNAs have been confirmed by RT-PCR. 90% of the sRNAs are predicted to form secondary structures. Predicted targets include transporters and a number of transcriptional regulators. A number of sRNAs were conserved in other members of the α-proteobacteria. Better understanding of the sRNA factors which contribute to expression of the machinery required to reduce N2O will, in turn, help to inform strategies for mitigation of N2O emissions.
Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh
NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.
Mommer, Brett C; Bell, Alison M
There is growing evidence for nongenetic effects of maternal experience on offspring. For example, previous studies have shown that female threespined stickleback fish (Gasterosteus aculeatus) exposed to predation risk produce offspring with altered behavior, metabolism and stress physiology. Here, we investigate the effect of maternal exposure to predation risk on the embryonic transcriptome in sticklebacks. Using RNA-sequencing we compared genome-wide transcription in three day post-fertilization embryos of predator-exposed and control mothers. There were hundreds of differentially expressed transcripts between embryos of predator-exposed mothers and embryos of control mothers including several non-coding RNAs. Gene Ontology analysis revealed biological pathways involved in metabolism, epigenetic inheritance, and neural proliferation and differentiation that differed between treatments. Interestingly, predation risk is associated with an accelerated life history in many vertebrates, and several of the genes and biological pathways that were identified in this study suggest that maternal exposure to predation risk accelerates the timing of embryonic development. Consistent with this hypothesis, embryos of predator-exposed mothers were larger than embryos of control mothers. These findings point to some of the molecular mechanisms that might underlie maternal effects.
Brett C Mommer
Full Text Available There is growing evidence for nongenetic effects of maternal experience on offspring. For example, previous studies have shown that female threespined stickleback fish (Gasterosteus aculeatus exposed to predation risk produce offspring with altered behavior, metabolism and stress physiology. Here, we investigate the effect of maternal exposure to predation risk on the embryonic transcriptome in sticklebacks. Using RNA-sequencing we compared genome-wide transcription in three day post-fertilization embryos of predator-exposed and control mothers. There were hundreds of differentially expressed transcripts between embryos of predator-exposed mothers and embryos of control mothers including several non-coding RNAs. Gene Ontology analysis revealed biological pathways involved in metabolism, epigenetic inheritance, and neural proliferation and differentiation that differed between treatments. Interestingly, predation risk is associated with an accelerated life history in many vertebrates, and several of the genes and biological pathways that were identified in this study suggest that maternal exposure to predation risk accelerates the timing of embryonic development. Consistent with this hypothesis, embryos of predator-exposed mothers were larger than embryos of control mothers. These findings point to some of the molecular mechanisms that might underlie maternal effects.
Sohn, Sung-Hwa; Ko, Eunjung; Chung, Hwan-Suck; Lee, Eun-Young; Kim, Sung-Hoon; Shin, Minkyu; Hong, Moochang; Bae, Hyunsu
AIM The rhizome of turmeric, Curcuma longa (CL), is a herbal medicine used in many traditional prescriptions. It has previously been shown that CL treatment showed greater than 47% recovery from cisplatin-induced cell damage in human kidney HEK 293 cells. This study was conducted to evaluate the recovery mechanisms of CL that occur during cisplatin induced nephrotoxicity by examining the genome wide mRNA expression profiles of HEK 293 -cells. METHOD Recovery mechanisms of CL that occur during cisplatin-induced nephrotoxicity were determined by microarray, real-time PCR, immunofluorescent confocal microscopy and Western blot analysis. RESULTS The results of microarray analysis and real-time PCR revealed that NFκB pathway-related genes and apoptosis-related genes were down-regulated in CL-treated HEK 293 cells. In addition, immunofluorescent confocal microscopy and Western blot analysis revealed that NFκB p65 nuclear translocation was inhibited in CL-treated HEK 293 cells. Therefore, the mechanism responsible for the effects of CL on HEK 293 cells is closely associated with regulation of the NFκB pathway. CONCLUSION CL possesses novel therapeutic agents that can be used for the prevention or treatment of cisplatin-induced renal disorders. PMID:20840446
Full Text Available The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps in plants, but little is known about this family in tomato (Solanum lycopersicum, an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20 gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83% were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root and reproductive organs (floral bud and flower, suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript
Yu, Jiahong; Cheng, Yuan; Feng, Kun; Ruan, Meiying; Ye, Qingjing; Wang, Rongqing; Li, Zhimiao; Zhou, Guozhi; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian
The Hsp20 genes are involved in the response of plants to environment stresses including heat shock and also play a vital role in plant growth and development. They represent the most abundant small heat shock proteins (sHsps) in plants, but little is known about this family in tomato (Solanum lycopersicum), an important vegetable crop in the world. Here, we characterized heat shock protein 20 (SlHsp20) gene family in tomato through integration of gene structure, chromosome location, phylogenetic relationship, and expression profile. Using bioinformatics-based methods, we identified at least 42 putative SlHsp20 genes in tomato. Sequence analysis revealed that most of SlHsp20 genes possessed no intron or a relatively short intron in length. Chromosome mapping indicated that inter-arm and intra-chromosome duplication events contributed remarkably to the expansion of SlHsp20 genes. Phylogentic tree of Hsp20 genes from tomato and other plant species revealed that SlHsp20 genes were grouped into 13 subfamilies, indicating that these genes may have a common ancestor that generated diverse subfamilies prior to the mono-dicot split. In addition, expression analysis using RNA-seq in various tissues and developmental stages of cultivated tomato and the wild relative Solanum pimpinellifolium revealed that most of these genes (83%) were expressed in at least one stage from at least one genotype. Out of 42 genes, 4 genes were expressed constitutively in almost all the tissues analyzed, implying that these genes might have specific housekeeping function in tomato cell under normal growth conditions. Two SlHsp20 genes displayed differential expression levels between cultivated tomato and S. pimpinellifolium in vegetative (leaf and root) and reproductive organs (floral bud and flower), suggesting inter-species diversification for functional specialization during the process of domestication. Based on genome-wide microarray analysis, we showed that the transcript levels of SlHsp20
Sonuga-Barke, E.; Lasky-Su, J.; Neale, B.; Oades, R.D.; Chen, W.; Franke, B.; Buitelaar, J.K.; Banaschewski, T.; Ebstein, R.; Gill, M.; Anney, R.J.; Miranda, A.; Mulas, F.; Roeyers, H.; Rothenberger, A.; Sergeant, J.A.; Steinhausen, H.C.; Thompson, M.; Asherson, P.; Faraone, S.V.
Studies of gene x environment (G x E) interaction in ADHD have previously focused on known risk genes for ADHD and environmentally mediated biological risk. Here we use G x E analysis in the context of a genome-wide association scan to identify novel genes whose effects on ADHD symptoms and comorbid
Sonuga-Barke, E.J.S.; Lasky-Su, J.; Neale, B.; Oades, R.D.; Chen, W.; Franke, B.; Buitelaar, J.K.; Banaschewski, T.; Ebstein, R.P.; Gill, M.; Anney, R.; Miranda, A.; Mulas, F.; Roeyers, H.; Rothenberger, A.; Sergeant, J.A.; Steinhausen, H.C.; Thompson, M.; Asherson, P.; Faraone, S.V.
Studies of gene x environment (G x E) interaction in ADHD have previously focused on known risk genes for ADHD and environmentally mediated biological risk. Here we use G x E analysis in the context of a genome-wide association scan to identify novel genes whose effects on ADHD symptoms and comorbid
Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and systematic
Full Text Available The APETALA2/ethylene-responsive element binding factor (AP2/ERF family is one of the largest transcription factor (TF families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding, ERF (ethylene responsive factors and RAV (Related to ABI3/VP. AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.. A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI. Duplication analysis revealed that 12 (∼7% SiAP2/ERF genes were tandem repeated and 22 (∼13% were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes, maize (14 genes, rice (9 genes and Brachypodium (6 genes showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. Altogether the present study provides new insights into evolution, divergence and
Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia
MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.
Ang Leonard PK
Full Text Available Abstract Background Pterygium is a common ocular surface disease characterized by fibrovascular invasion of the cornea and is sight-threatening due to astigmatism, tear film disturbance, or occlusion of the visual axis. However, the mechanisms for formation and post-surgical recurrence of pterygium are not understood, and a valid animal model does not exist. Here, we investigated the possible mechanisms of pterygium pathogenesis and recurrence. Methods First we performed a genome wide expression analysis (human Affymetrix Genechip, >22000 genes with principal component analysis and clustering techniques, and validated expression of key molecules with PCR. The controls for this study were the un-involved conjunctival tissue of the same eye obtained during the surgical resection of the lesions. Interesting molecules were further investigated with immunohistochemistry, Western blots, and comparison with tear proteins from pterygium patients. Results Principal component analysis in pterygium indicated a signature of matrix-related structural proteins, including fibronectin-1 (both splice-forms, collagen-1A2, keratin-12 and small proline rich protein-1. Immunofluorescence showed strong expression of keratin-6A in all layers, especially the superficial layers, of pterygium epithelium, but absent in the control, with up-regulation and nuclear accumulation of the cell adhesion molecule CD24 in the pterygium epithelium. Western blot shows increased protein expression of beta-microseminoprotein, a protein up-regulated in human cutaneous squamous cell carcinoma. Gene products of 22 up-regulated genes in pterygium have also been found by us in human tears using nano-electrospray-liquid chromatography/mass spectrometry after pterygium surgery. Recurrent disease was associated with up-regulation of sialophorin, a negative regulator of cell adhesion, and never in mitosis a-5, known to be involved in cell motility. Conclusion Aberrant wound healing is therefore
Hattori, Hiroyoshi; Janky, Rekin's; Nietfeld, Wilfried; Aerts, Stein; Madan Babu, M; Venkitaraman, Ashok R
The human DNA damage response (DDR) triggers profound changes in gene expression, whose nature and regulation remain uncertain. Although certain micro-(mi)RNA species including miR34, miR-18, miR-16 and miR-143 have been implicated in the DDR, there is as yet no comprehensive description of genome-wide changes in the expression of miRNAs triggered by DNA breakage in human cells. We have used next-generation sequencing (NGS), combined with rigorous integrative computational analyses, to describe genome-wide changes in the expression of miRNAs during the human DDR. The changes affect 150 of 1523 miRNAs known in miRBase v18 from 4-24 h after the induction of DNA breakage, in cell-type dependent patterns. The regulatory regions of the most-highly regulated miRNA species are enriched in conserved binding sites for p53. Indeed, genome-wide changes in miRNA expression during the DDR are markedly altered in TP53-/- cells compared to otherwise isogenic controls. The expression levels of certain damage-induced, p53-regulated miRNAs in cancer samples correlate with patient survival. Our work reveals genome-wide and cell type-specific alterations in miRNA expression during the human DDR, which are regulated by the tumor suppressor protein p53. These findings provide a genomic resource to identify new molecules and mechanisms involved in the DDR, and to examine their role in tumor suppression and the clinical outcome of cancer patients.
Full Text Available Abstract Background Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21 is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessary for preventing damage caused by F. graminearum. Therefore, we investigated important cellular regulatory processes in a host containing FgV1-DK21 as compared to an uninfected parent using a transcriptional approach. Results Using a 3′-tiling microarray covering all known F. graminearum genes, we carried out genome-wide expression analyses of F. graminearum at two different time points. At the early point of growth of an infected strain as compared to an uninfected strain, genes associated with protein synthesis, including ribosome assembly, nucleolus, and ribosomal RNA processing, were significantly up-regulated. In addition, genes required for transcription and signal transduction, including fungal-specific transcription factors and cAMP signaling, respectively, were actively up-regulated. In contrast, genes involved in various metabolic pathways, particularly in producing carboxylic acids, aromatic amino acids, nitrogen compounds, and polyamines, showed dramatic down-regulation at the early time point. Moreover, genes associated with transport systems localizing to transmembranes were down-regulated at both time points. Conclusion This is the first report of global change in the prominent cellular pathways in the Fusarium host containing FgV1-DK21. The significant increase in transcripts for transcription and translation machinery in fungal host cells seems to be related to virus replication. In addition, significant down-regulation of genes required for metabolism and transporting systems in a fungal host containing the virus appears to be related to the host defense mechanism and fungal virulence. Taken together
Background In spite of its high clinical relevance, the relationship between disc degeneration and low back pain is still not well understood. Recent studies have shown that genome-wide gene expression studies utilizing ontology searches provide an efficient and valuable methodology for identification of clinically relevant genes. Here we use this approach in analysis of pain-, nerve-, and neurotrophin-related gene expression patterns in specimens of human disc tissue. Control, non-herniated clinical, and herniated clinical specimens of human annulus tissue were studied following Institutional Review Board approval. Results Analyses were performed on more generated (Thompson grade IV and V) discs vs. less degenerated discs (grades I-III), on surgically operated discs vs. control discs, and on herniated vs. control discs. Analyses of more degenerated vs. less degenerated discs identified significant upregulation of well-recognized pain-related genes (bradykinin receptor B1, calcitonin gene-related peptide and catechol-0-methyltransferase). Nerve growth factor was significantly upregulated in surgical vs. control and in herniated vs. control discs. All three analyses also found significant changes in numerous proinflammatory cytokine- and chemokine-related genes. Nerve, neurotrophin and pain-ontology searches identified many matrix, signaling and functional genes which have known importance in the disc. Immunohistochemistry was utilized to confirm the presence of calcitonin gene-related peptide, catechol-0-methyltransferase and bradykinin receptor B1 at the protein level in the human annulus. Conclusions Findings point to the utility of microarray analyses in identification of pain-, neurotrophin and nerve-related genes in the disc, and point to the importance of future work exploring functional interactions between nerve and disc cells in vitro and in vivo. Nerve, pain and neurotrophin ontology searches identified numerous changes in proinflammatory cytokines and
Full Text Available Abstract Background Social insects, such as honey bees, use molecular, physiological and behavioral responses to combat pathogens and parasites. The honey bee genome contains all of the canonical insect immune response pathways, and several studies have demonstrated that pathogens can activate expression of immune effectors. Honey bees also use behavioral responses, termed social immunity, to collectively defend their hives from pathogens and parasites. These responses include hygienic behavior (where workers remove diseased brood and allo-grooming (where workers remove ectoparasites from nestmates. We have previously demonstrated that immunostimulation causes changes in the cuticular hydrocarbon profiles of workers, which results in altered worker-worker social interactions. Thus, cuticular hydrocarbons may enable workers to identify sick nestmates, and adjust their behavior in response. Here, we test the specificity of behavioral, chemical and genomic responses to immunostimulation by challenging workers with a panel of different immune stimulants (saline, Sephadex beads and Gram-negative bacteria E. coli. Results While only bacteria-injected bees elicited altered behavioral responses from healthy nestmates compared to controls, all treatments resulted in significant changes in cuticular hydrocarbon profiles. Immunostimulation caused significant changes in expression of hundreds of genes, the majority of which have not been identified as members of the canonical immune response pathways. Furthermore, several new candidate genes that may play a role in cuticular hydrocarbon biosynthesis were identified. Effects of immune challenge expression of several genes involved in immune response, cuticular hydrocarbon biosynthesis, and the Notch signaling pathway were confirmed using quantitative real-time PCR. Finally, we identified common genes regulated by pathogen challenge in honey bees and other insects. Conclusions These results demonstrate that
Tang, Xin; Liu, Huawei; Chen, Quanmei; Wang, Xin; Xiong, Ying; Zhao, Ping
The solute carrier 6 (SLC6) gene family, initially known as the neurotransmitter transporters, plays vital roles in the regulation of neurotransmitter signaling, nutrient absorption and motor behavior. In this study, a total of 16 candidate genes were identified as SLC6 family gene homologs in the silkworm (Bombyx mori) genome. Spatio-temporal expression patterns of silkworm SLC6 gene transcripts indicated that these genes were highly and specifically expressed in midgut, brain and gonads; moreover, these genes were expressed primarily at the feeding stage or adult stage. Levels of expression for most midgut-specific and midgut-enriched gene transcripts were down-regulated after starvation but up-regulated after re-feeding. In addition, we observed that expression levels of these genes except for BmSLC6-15 and BmGT1 were markedly up-regulated by a juvenile hormone analog. Moreover, brain-enriched genes showed differential expression patterns during wandering and mating processes, suggesting that these genes may be involved in modulating wandering and mating behaviors. Our results improve our understanding of the expression patterns and potential physiological functions of the SLC6 gene family, and provide valuable information for the comprehensive functional analysis of the SLC6 gene family.
Full Text Available Abstract Background Sox genes are HMG-domain containing transcription factors with important roles in developmental processes in animals; many of them appear to have conserved functions among eumetazoans. Demosponges have fewer Sox genes than eumetazoans, but their roles remain unclear. The aim of this study is to gain insight into the early evolutionary history of the Sox gene family by identification and expression analysis of Sox genes in the calcareous sponge Sycon ciliatum. Methods Calcaronean Sox related sequences were retrieved by searching recently generated genomic and transcriptome sequence resources and analyzed using variety of phylogenetic methods and identification of conserved motifs. Expression was studied by whole mount in situ hybridization. Results We have identified seven Sox genes and four Sox-related genes in the complete genome of Sycon ciliatum. Phylogenetic and conserved motif analyses showed that five of Sycon Sox genes represent groups B, C, E, and F present in cnidarians and bilaterians. Two additional genes are classified as Sox genes but cannot be assigned to specific subfamilies, and four genes are more similar to Sox genes than to other HMG-containing genes. Thus, the repertoire of Sox genes is larger in this representative of calcareous sponges than in the demosponge Amphimedon queenslandica. It remains unclear whether this is due to the expansion of the gene family in Sycon or a secondary reduction in the Amphimedon genome. In situ hybridization of Sycon Sox genes revealed a variety of expression patterns during embryogenesis and in specific cell types of adult sponges. Conclusions In this study, we describe a large family of Sox genes in Sycon ciliatum with dynamic expression patterns, indicating that Sox genes are regulators in development and cell type determination in sponges, as observed in higher animals. The revealed differences between demosponge and calcisponge Sox genes repertoire highlight the need to
Full Text Available VQ motif-containing proteins play crucial roles in abiotic stress responses in plants. Recent studies have shown that some VQ proteins physically interact with WRKY transcription factors to activate downstream genes. In the present study, we identified and characterized genes encoding VQ motif-containing proteins using the most recent version of the maize genome sequence. In total, 61VQ genes were identified. In a cluster analysis, these genes clustered into nine groups together with their homologous genes in rice and Arabidopsis. Most of the VQ genes (57 out of 61 numbers identified in maize were found to be single-copy genes. Analyses of RNA-seq data obtained using seedlings under long-term drought treatment showed that the expression levels of most ZmVQ genes (41 out of 61 members changed during the drought stress response. Quantitative real-time PCR analyses showed that most of the ZmVQ genes were responsive to NaCl treatment. Also, approximately half of the ZmVQ genes were co-expressed with ZmWRKY genes. The identification of these VQ genes in the maize genome and knowledge of their expression profiles under drought and osmotic stresses will provide a solid foundation for exploring their specific functions in the abiotic stress responses of maize.
Xia, Xiaofeng; Yu, Liying; Xue, Minqian; Yu, Xiaoqiang; Vasseur, Liette; Gurr, Geoff M; Baxter, Simon W; Lin, Hailan; Lin, Junhan; You, Minsheng
The diamondback moth, Plutella xylostella (L.), is a destructive pest that attacks cruciferous crops worldwide. Immune responses are important for interactions between insects and pathogens and information on these underpins the development of strategies for biocontrol-based pest management. Little, however, is known about immune genes and their regulation patterns in P. xylostella. A total of 149 immune-related genes in 20 gene families were identified through comparison of P. xylostella genome with the genomes of other insects. Complete and conserved Toll, IMD and JAK-STAT signaling pathways were found in P. xylostella. Genes involved in pathogen recognition were expanded and more diversified than genes associated with intracellular signal transduction. Gene expression profiles showed that the IMD pathway may regulate expression of antimicrobial peptide (AMP) genes in the midgut, and be related to an observed down-regulation of AMPs in experimental lines of insecticide-resistant P. xylostella. A bacterial feeding study demonstrated that P. xylostella could activate different AMPs in response to bacterial infection. This study has established a framework of comprehensive expression profiles that highlight cues for immune regulation in a major pest. Our work provides a foundation for further studies on the functions of P. xylostella immune genes and mechanisms of innate immunity.
Hansen, Morten; Gerds, Thomas Alexander; Nielsen, Ole Haagen
Analyzing data obtained from genome-wide gene expression experiments is challenging due to the quantity of variables, the need for multivariate analyses, and the demands of managing large amounts of data. Here we present the R package pcaGoPromoter, which facilitates the interpretation of genome.......g., cell cycle progression and the predicted involvement of expected transcription factors, including E2F. In addition, unexpected results, e.g., cholesterol synthesis in serum-depleted cells and NF-¿B activation in inhibitor treated cells, were noted. In summary, the pcaGoPromoter R package provides...
Full Text Available Flavonoids, the compounds that impart color to fruits, flowers, and seeds, are the most widespread secondary metabolites in plants. However, a systematic analysis of these loci has not been performed in Brassicaceae. In this study, we isolated 649 nucleotide sequences related to flavonoid biosynthesis, i.e., the Transparent Testa (TT genes, and their associated amino acid sequences in 17 Brassicaceae species, grouped into Arabidopsis or Brassicaceae subgroups. Moreover, 36 copies of 21 genes of the flavonoid biosynthesis pathway were identified in A. thaliana, 53 were identified in B. rapa, 50 in B. oleracea, and 95 in B. napus, followed the genomic distribution, collinearity analysis and genes triplication of them among Brassicaceae species. The results showed that the extensive gene loss, whole genome triplication, and diploidization that occurred after divergence from the common ancestor. Using qRT-PCR methods, we analyzed the expression of eighteen flavonoid biosynthesis genes in 6 yellow- and black-seeded B. napus inbred lines with different genetic background, found that 12 of which were preferentially expressed during seed development, whereas the remaining genes were expressed in all B. napus tissues examined. Moreover, fourteen of these genes showed significant differences in expression level during seed development, and all but four of these (i.e., BnTT5, BnTT7, BnTT10, and BnTTG1 had similar expression patterns among the yellow- and black-seeded B. napus. Results showed that the structural genes (BnTT3, BnTT18 and BnBAN, regulatory genes (BnTTG2 and BnTT16 and three encoding transfer proteins (BnTT12, BnTT19, and BnAHA10 might play an crucial roles in the formation of different seed coat colors in B. napus. These data will be helpful for illustrating the molecular mechanisms of flavonoid biosynthesis in Brassicaceae species.
Full Text Available The Lohmann Selected Leghorn (LSL and Lohmann Brown (LB layer lines have been selected for high egg production since more than 50 years and belong to the worldwide leading commercial layer lines. The objectives of the present study were to characterize the molecular processes that are different among these two layer lines using whole genome RNA expression profiles. The hens were kept in the newly developed small group housing system Eurovent German with two different group sizes. Differential expression was observed for 6,276 microarray probes (FDR adjusted P-value <0.05 among the two layer lines LSL and LB. A 2-fold or greater change in gene expression was identified on 151 probe sets. In LSL, 72 of the 151 probe sets were up- and 79 of them were down-regulated. Gene ontology (GO enrichment analysis accounting for biological processes evinced 18 GO-terms for the 72 probe sets with higher expression in LSL, especially those taking part in immune system processes and membrane organization. A total of 32 enriched GO-terms were determined among the 79 down-regulated probe sets of LSL. Particularly, these terms included phosphorus metabolic processes and signaling pathways. In conclusion, the phenotypic differences among the two layer lines LSL and LB are clearly reflected in their gene expression profiles of the cerebrum. These novel findings provide clues for genes involved in economically important line characteristics of commercial laying hens.
Fic, A; Mlakar, S Jurković; Juvan, P; Mlakar, V; Marc, J; Dolenc, M Sollner; Broberg, K; Mašič, L Peterlin
The bisphenols AF (BPAF) and S (BPS) are structural analogs of the endocrine disruptor bisphenol A (BPA), and are used in common products as a replacement for BPA. To elucidate genome-wide gene expression responses, estrogen-dependent osteosarcoma cells were cultured with 10 nM BPA, BPAF, or BPS, for 8 h and 3 months. Genome-wide gene expression was analyzed using the Illumina Expression BeadChip. Three months exposure had significant effects on gene expression, particularly for BPS, followed by BPAF and BPA, according to the number of differentially expressed genes (1980, 778, 60, respectively), the magnitude of changes in gene expression, and the number of enriched biological processes (800, 415, 33, respectively) and pathways (77, 52, 6, respectively). 'Embryonic skeletal system development' was the most enriched bone-related process, which was affected only by BPAF and BPS. Interestingly, all three bisphenols showed highest down-regulation of genes related to the cardiovascular system (e.g., NPPB, NPR3, TXNIP). BPA only and BPA/BPAF/BPS also affected genes related to the immune system and fetal development, respectively. For BPAF and BPS, the 'isoprenoid biosynthetic process' was enriched (up-regulated genes: HMGCS1, PDSS1, ACAT2, RCE1, DHDDS). Compared to BPA, BPAF and BPS had more effects on gene expression after long-term exposure. These findings stress the need for careful toxicological characterization of BPA analogs in the future. Copyright © 2015 Elsevier Ltd. All rights reserved.
Full Text Available Plant zinc finger-homeodomain (ZHD genes encode a family of transcription factors that have been demonstrated to play an important role in the regulation of plant growth and development. In this study, we identified a total of 13 ZHD genes (VvZHD in the grape genome that were further classified into at least seven groups. Genome synteny analysis revealed that a number of VvZHD genes were present in the corresponding syntenic blocks of Arabidopsis, indicating that they arose before the divergence of these two species. Gene expression analysis showed that the identified VvZHD genes displayed distinct spatiotemporal expression patterns, and were differentially regulated under various stress conditions and hormone treatments, suggesting that the grape VvZHDs might be also involved in plant response to a variety of biotic and abiotic insults. Our work provides insightful information and knowledge about the ZHD genes in grape, which provides a framework for further characterization of their roles in regulation of stress tolerance as well as other aspects of grape productivity.
Mo, Qianxing; Lu, Shifang; Garippa, Carrie; Brownstein, Michael J; Simon, Neal G
Dehydroepiandrosterone (DHEA) is the most abundant steroid in humans and a multi-functional neuroactive steroid that has been implicated in a variety of biological effects in both the periphery and central nervous system. Mechanistic studies of DHEA in the periphery have emphasized its role as a prohormone and those in the brain have focused on effects exerted at cell surface receptors. Recent results demonstrated that DHEA is intrinsically androgenic. It competes with DHT for binding to androgen receptor (AR), induces AR-regulated reporter gene expression in vitro, and exogenous DHEA administration regulates gene expression in peripheral androgen-dependent tissues and LnCAP prostate cancer cells, indicating genomic effects and adding a level of complexity to functional models. The absence of information about the effect of DHEA on gene expression in the CNS is a significant gap in light of continuing clinical interest in the compound as a hormone replacement therapy in older individuals, patients with adrenal insufficiency, and as a treatment that improves sense of well-being, increases libido, relieves depressive symptoms, and serves as a neuroprotective agent. In the present study, ovariectomized CF-1 female mice, an established model for assessing CNS effects of androgens, were treated with DHEA (1mg/day), dihydrotestosterone (DHT, a potent androgen used as a positive control; 0.1mg/day) or vehicle (negative control) for 7 days. The effects of DHEA on gene expression were assessed in two regions of the CNS that are enriched in AR, hypothalamus and hippocampus, using DNA microarray, real-time RT-PCR, and immunohistochemistry. RIA of serum samples assessed treatment effects on circulating levels of major steroids. In hypothalamus, DHEA and DHT significantly up-regulated the gene expression of hypocretin (Hcrt; also called orexin), pro-melanin-concentrating hormone (Pmch), and protein kinase C delta (Prkcd), and down-regulated the expression of deleted in bladder
Full Text Available WRKY proteins play important regulatory roles in plant developmental processes such as senescence, trichome initiation and embryo morphogenesis. In strawberry, only FaWRKY1 (Fragaria × ananassa has been characterized, leaving numerous WRKY genes to be identified and their function characterized. The publication of the draft genome sequence of the strawberry genome allowed us to conduct a genome-wide search for WRKY proteins in Fragaria vesca, and to compare the identified proteins with their homologs in model plants. Fifty-nine FvWRKY genes were identified and annotated from the F. vesca genome. Detailed analysis, including gene classification, annotation, phylogenetic evaluation, conserved motif determination and expression profiling, based on RNA-seq data, were performed on all members of the family. Additionally, the expression patterns of the WRKY genes in different fruit developmental stages were further investigated using qRT-PCR, to provide a foundation for further comparative genomics and functional studies of this important class of transcriptional regulators in strawberry.
Eichenberger, Ramon M; Ramakrishnan, Chandra; Russo, Giancarlo; Deplazes, Peter; Hehl, Adrian B
Infections of dogs with virulent strains of Babesia canis are characterized by rapid onset and high mortality, comparable to complicated human malaria. As in other apicomplexan parasites, most Babesia virulence factors responsible for survival and pathogenicity are secreted to the host cell surface and beyond where they remodel and biochemically modify the infected cell interacting with host proteins in a very specific manner. Here, we investigated factors secreted by B. canis during acute infections in dogs and report on in silico predictions and experimental analysis of the parasite's exportome. As a backdrop, we generated a fully annotated B. canis genome sequence of a virulent Hungarian field isolate (strain BcH-CHIPZ) underpinned by extensive genome-wide RNA-seq analysis. We find evidence for conserved factors in apicomplexan hemoparasites involved in immune-evasion (e.g. VESA-protein family), proteins secreted across the iRBC membrane into the host bloodstream (e.g. SA- and Bc28 protein families), potential moonlighting proteins (e.g. profilin and histones), and uncharacterized antigens present during acute crisis in dogs. The combined data provides a first predicted and partially validated set of potential virulence factors exported during fatal infections, which can be exploited for urgently needed innovative intervention strategies aimed at facilitating diagnosis and management of canine babesiosis.
Hazel En En Wong
Full Text Available Both Neisseria meningitidis and the closely related bacterium Neisseria lactamica colonise human nasopharyngeal mucosal surface, but only N. meningitidis invades the bloodstream to cause potentially life-threatening meningitis and septicaemia. We have hypothesised that the two neisserial species differentially modulate host respiratory epithelial cell gene expression reflecting their disease potential. Confluent monolayers of 16HBE14 human bronchial epithelial cells were exposed to live and/or dead N. meningitidis (including capsule and pili mutants and N. lactamica, and their transcriptomes were compared using whole genome microarrays. Changes in expression of selected genes were subsequently validated using Q-RT-PCR and ELISAs. Live N. meningitidis and N. lactamica induced genes involved in host energy production processes suggesting that both bacterial species utilise host resources. N. meningitidis infection was associated with down-regulation of host defence genes. N. lactamica, relative to N. meningitidis, initiates up-regulation of proinflammatory genes. Bacterial secreted proteins alone induced some of the changes observed. The results suggest N. meningitidis and N. lactamica differentially regulate host respiratory epithelial cell gene expression through colonisation and/or protein secretion, and that this may contribute to subsequent clinical outcomes associated with these bacteria.
Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan
The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and
Shi, Ming-Zhu; Xie, De-Yu
We report metabolic engineering of Arabidopsis red cells and genome-wide gene expression analysis associated with anthocyanin biosynthesis and other metabolic pathways between red cells and wild-type (WT) cells. Red cells of A. thaliana were engineered for the first time from the leaves of production of anthocyanin pigment 1-Dominant (pap1-D). These red cells produced seven anthocyanin molecules including a new one that was characterized by LC-MS analysis. Wild-type cells established as a control did not produce anthocyanins. A genome-wide microarray analysis revealed that nearly 66 and 65% of genes in the genome were expressed in the red cells and wild-type cells, respectively. In comparison with the WT cells, 3.2% of expressed genes in the red cells were differentially expressed. The expression levels of 14 genes involved in the biosynthetic pathway of anthocyanin were significantly higher in the red cells than in the WT cells. Microarray and RT-PCR analyses demonstrated that the TTG1-GL3/TT8-PAP1 complex regulated the biosynthesis of anthocyanins. Furthermore, most of the genes with significant differential expression levels in the red cells versus the WT cells were characterized with diverse biochemical functions, many of which were mapped to different metabolic pathways (e.g., ribosomal protein biosynthesis, photosynthesis, glycolysis, glyoxylate metabolism, and plant secondary metabolisms) or organelles (e.g., chloroplast). We suggest that the difference in gene expression profiles between the two cell lines likely results from cell types, the overexpression of PAP1, and the high metabolic flux toward anthocyanins.
Huang, Zejun; Van Houten, Jason; Gonzalez, Geoffrey; Xiao, Han; van der Knaap, Esther
Members of the plant-specific gene families IQD/SUN, OFP and YABBY are thought to play important roles in plant growth and development. YABBY family members are involved in lateral organ polarity and growth; OFP members encode transcriptional repressors, whereas the role of IQD/SUN members is less clear. The tomato fruit shape genes SUN, OVATE, and FASCIATED belong to IQD/SUN, OFP and the YABBY gene family, respectively. A gene duplication resulting in high expression of SUN leads to elongated fruit, whereas a premature stop codon in OVATE and a large inversion within FASCIATED control fruit elongation and a flat fruit shape, respectively. In this study, we identified 34 SlSUN, 31 SlOFP and 9 SlYABBY genes in tomato and identified their position on 12 chromosomes. Genome mapping analysis showed that the SlSUN, SlOFP, and SlYABBY genes were enriched on the top and bottom segments of several chromosomes. In particular, on chromosome 10, a cluster of SlOFPs were found to originate from tandem duplication events. We also constructed three phylogenetic trees based on the protein sequences of the IQ67, OVATE and YABBY domains, respectively, from members of these families in Arabidopsis and tomato. The closest putative orthologs of the Arabidopsis and tomato genes were determined by the position on the phylogenetic tree and sequence similarity. Furthermore, expression analysis showed that some family members exhibited tissue-specific expression, whereas others were more ubiquitously expressed. Also, certain family members overlapped with known QTLs controlling fruit shape in Solanaceous plants. Combined, these results may help elucidate the roles of SUN, OFP and YABBY family members in plant growth and development.
Hu, Wei; Hou, Xiaowan; Huang, Chao; Yan, Yan; Tie, Weiwei; Ding, Zehong; Wei, Yunxie; Liu, Juhua; Miao, Hongxia; Lu, Zhiwei; Li, Meiying; Xu, Biyu; Jin, Zhiqiang
Aquaporins (AQPs) function to selectively control the flow of water and other small molecules through biological membranes, playing crucial roles in various biological processes. However, little information is available on the AQP gene family in bananas. In this study, we identified 47 banana AQP genes based on the banana genome sequence. Evolutionary analysis of AQPs from banana, Arabidopsis, poplar, and rice indicated that banana AQPs (MaAQPs) were clustered into four subfamilies. Conserved motif analysis showed that all banana AQPs contained the typical AQP-like or major intrinsic protein (MIP) domain. Gene structure analysis suggested the majority of MaAQPs had two to four introns with a highly specific number and length for each subfamily. Expression analysis of MaAQP genes during fruit development and postharvest ripening showed that some MaAQP genes exhibited high expression levels during these stages, indicating the involvement of MaAQP genes in banana fruit development and ripening. Additionally, some MaAQP genes showed strong induction after stress treatment and therefore, may represent potential candidates for improving banana resistance to abiotic stress. Taken together, this study identified some excellent tissue-specific, fruit development- and ripening-dependent, and abiotic stress-responsive candidate MaAQP genes, which could lay a solid foundation for genetic improvement of banana cultivars. PMID:26307965
Full Text Available The homedodomain zipper family (HD-ZIP of transcription factors is present only in plants and plays important role in the regulation of plant-specific processes. The subfamily IV of HDZ transcription factors (HD-ZIP IV has primarily been implicated in the regulation of epidermal structure development. Though this gene family is present in all lineages of land plants, members of this gene family have not been identified in banana, which is one of the major staple fruit crops. In the present work, we identified 21 HDZIV genes in banana by the computational analysis of banana genome resource. Our analysis suggested that these genes putatively encode proteins having all the characteristic domains of HDZIV transcription factors. The phylogenetic analysis of the banana HDZIV family genes further confirmed that after separation from a common ancestor, the banana and poales lineages might have followed distinct evolutionary paths. Further, we conclude that segmental duplication played a major role in the evolution of banana HDZIV genes. All the identified banana HDZIV genes expresses in different banana tissue, however at varying levels. The transcript levels of some of the banana HDZIV genes were also detected in banana fruit pulp, suggesting their putative role in fruit attributes. A large number of genes of this family showed modulated expression under drought and salinity stress. Taken together, the present work lays a foundation for elucidation of functional aspects of the banana HDZIV genes and for their possible use in the banana improvement programs.
Luo, Wei; Hu, Qiang; Wang, Dan; Deeb, Kristin K.; Ma, Yingyu; Morrison, Carl D.; Liu, Song; Johnson, Candace S.; Trump, Donald L.
Endothelial cells (ECs) are an important component involved in the angiogenesis. Little is known about the global gene expression and epigenetic regulation in tumor endothelial cells. The identification of gene expression and epigenetic difference between human prostate tumor-derived endothelial cells (TdECs) and those in normal tissues may uncover unique biological features of TdEC and facilitate the discovery of new anti-angiogenic targets. We established a method for isolation of CD31+ endothelial cells from malignant and normal prostate tissues obtained at prostatectomy. TdECs and normal-derived ECs (NdECs) showed >90% enrichment in primary culture and demonstrated microvascular endothelial cell characteristics such as cobblestone morphology in monolayer culture, diI-acetyl-LDL uptake and capillary-tube like formation in Matrigel®. In vitro primary cultures of ECs maintained expression of endothelial markers such as CD31, von Willebrand factor, intercellular adhesion molecule, vascular endothelial growth factor receptor 1, and vascular endothelial growth factor receptor 2. We then conducted a pilot study of transcriptome and methylome analysis of TdECs and matched NdECs from patients with prostate cancer. We observed a wide spectrum of differences in gene expression and methylation patterns in endothelial cells, between malignant and normal prostate tissues. Array-based expression and methylation data were validated by qRT-PCR and bisulfite DNA pyrosequencing. Further analysis of transcriptome and methylome data revealed a number of differentially expressed genes with loci whose methylation change is accompanied by an inverse change in gene expression. Our study demonstrates the feasibility of isolation of ECs from histologically normal prostate and prostate cancer via CD31+ selection. The data, although preliminary, indicates that there exist widespread differences in methylation and transcription between TdECs and NdECs. Interestingly, only a small
Full Text Available Sphingomonas wittichii strain RW1 (RW1 is one of the few strains that can grow on dibenzo-p-dioxin (DD. We conducted a transcriptomic study of RW1 using RNA-Seq to outline transcriptional responses to DD, dibenzofuran (DF, and the smectite clay mineral saponite with succinate as carbon source. The ability to grow on DD is rare compared to growth on the chemically similar DF even though the same initial dioxygenase may be involved in oxidation of both substrates. Therefore, we hypothesized the reason for this lies beyond catabolic pathways and may concern genes involved in processes for cell-substrate interactions such as substrate recognition, transport, and detoxification. Compared to succinate (SUC as control carbon source, DF caused over 240 protein-coding genes to be differentially expressed, whereas more than 300 were differentially expressed with DD. Stress response genes were up-regulated in response to both DD and DF. This effect was stronger with DD than DF, suggesting a higher toxicity of DD compared to DF. Both DD and DF caused changes in expression of genes involved in active cross-membrane transport such as TonB-dependent receptor proteins, but the patterns of change differed between the two substrates. Multiple transcription factor genes also displayed expression patterns distinct to DD and DF growth. DD and DF induced the catechol ortho- and the salicylate/gentisate pathways, respectively. Both DD and DF induced the shared down-stream aliphatic intermediate compound pathway. Clay caused category-wide down-regulation of genes for cell motility and chemotaxis, particularly those involved in the synthesis, assembly and functioning of flagella. This is an environmentally important finding because clay is a major component of soil microbes' microenvironment influencing local chemistry and may serve as a geosorbent for toxic pollutants. Similar to clay, DD and DF also affected expression of genes involved in motility and chemotaxis.
Genome-wide identification and expression profiling reveal tissue-specific expression and differentially-regulated genes involved in gibberellin metabolism between Williams banana and its dwarf mutant.
Chen, Jingjing; Xie, Jianghui; Duan, Yajie; Hu, Huigang; Hu, Yulin; Li, Weiming
Dwarfism is one of the most valuable traits in banana breeding because semi-dwarf cultivars show good resistance to damage by wind and rain. Moreover, these cultivars present advantages of convenient cultivation, management, and so on. We obtained a dwarf mutant '8818-1' through EMS (ethyl methane sulphonate) mutagenesis of Williams banana 8818 (Musa spp. AAA group). Our research have shown that gibberellins (GAs) content in 8818-1 false stems was significantly lower than that in its parent 8818 and the dwarf type of 8818-1 could be restored by application of exogenous GA3. Although GA exerts important impacts on the 8818-1 dwarf type, our understanding of the regulation of GA metabolism during banana dwarf mutant development remains limited. Genome-wide screening revealed 36 candidate GA metabolism genes were systematically identified for the first time; these genes included 3 MaCPS, 2 MaKS, 1 MaKO, 2 MaKAO, 10 MaGA20ox, 4 MaGA3ox, and 14 MaGA2ox genes. Phylogenetic tree and conserved protein domain analyses showed sequence conservation and divergence. GA metabolism genes exhibited tissue-specific expression patterns. Early GA biosynthesis genes were constitutively expressed but presented differential regulation in different tissues in Williams banana. GA oxidase family genes were mainly transcribed in young fruits, thus suggesting that young fruits were the most active tissue involved in GA metabolism, followed by leaves, bracts, and finally approximately mature fruits. Expression patterns between 8818 and 8818-1 revealed that MaGA20ox4, MaGA20ox5, and MaGA20ox7 of the MaGA20ox gene family and MaGA2ox7, MaGA2ox12, and MaGA2ox14 of the MaGA2ox gene family exhibited significant differential expression and high-expression levels in false stems. These genes are likely to be responsible for the regulation of GAs content in 8818-1 false stems. Overall, phylogenetic evolution, tissue specificity and differential expression analyses of GA metabolism genes can provide a
Jing, Zhaobin; Liu, Zhande
As one of the largest transcriptional factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, o total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is analyzed, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplication were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.
Full Text Available Abstract Background Fluconazole (FLC, a triazole antifungal drug, is widely used for the maintenance therapy of cryptococcal meningoencephalitis, the most common opportunistic infection in AIDS patients. In this study, we examined changes in the gene expression profile of the C. neoformans reference strain H99 (serotype A following FLC treatment in order to investigate the adaptive cellular responses to drug stress. Results Simultaneous analysis of over 6823 transcripts revealed that 476 genes were responsive to FLC. As expected up-regulation of genes involved in ergosterol biosynthesis was observed, including the azole target gene ERG11 and ERG13, ERG1, ERG7, ERG25, ERG2, ERG3 and ERG5. In addition, SRE1 which is a gene encoding a well-known regulator of sterol homeostasis in C. neoformans was up-regulated. Several other genes such as those involved in a variety of important cellular processes (i.e. lipid and fatty acid metabolism, cell wall maintenance, stress and virulence were found to be up-regulated in response to FLC treatment. Conversely, expression of AFR1, the major transporter of azoles in C. neoformans, was not regulated by FLC. Conclusions Short-term exposure of C. neoformans to FLC resulted in a complex altered gene expression profile. Some of the observed changes could represent specific adaptive responses to the antifungal agent in this pathogenic yeast.
Full Text Available Pectin methylesterase inhibitor genes (PMEIs are a large multigene family and play crucial roles in cell wall modifications in plant growth and development. Here, a comprehensive analysis of the PMEI gene family in Brassica campestris, an important leaf vegetable, was performed. We identified 100 Brassica campestris PMEI genes (BcPMEIs, among which 96 BcPMEIs were unevenly distributed on 10 chromosomes and nine tandem arrays containing 20 BcPMEIs were found. We also detected 80 pairs of syntenic PMEI orthologs. These findings indicated that whole-genome triplication (WGT and tandem duplication (TD were the main mechanisms accounting for the current number of BcPMEIs. In evolution, BcPMEIs were retained preferentially and biasedly, consistent with the gene balance hypothesis and two-step theory, respectively. The molecular evolution analysis of BcPMEIs manifested that they evolved through purifying selection and the divergence time is in accordance with the WGT data of B. campestris. To obtain the functional information of BcPMEIs, the expression patterns in five tissues and the cis-elements distributed in promoter regions were investigated. This work can provide a better understanding of the molecular evolution and biological function of PMEIs in B. campestris.
Full Text Available Auxin response factors (ARFs have been reported to play vital roles during plant growth and development. In order to reveal specific functions related to vegetative organs in grasses, an in-depth study of the ARF gene family was carried out in switchgrass (Panicum virgatum L., a warm-season C4 perennial grass that is mostly used as bioenergy and animal feedstock. A total of 47 putative ARF genes (PvARFs were identified in the switchgrass genome (2n = 4x = 36, 42 of which were anchored to the seven pairs of chromosomes and found to be unevenly distributed. Sixteen PvARFs were predicted to be potential targets of small RNAs (microRNA160 and 167. Phylogenetically speaking, PvARFs were divided into seven distinct subgroups based on the phylogeny, exon/intron arrangement, and conserved motif distribution. Moreover, 15 pairs of PvARFs have different temporal-spatial expression profiles in vegetative organs (2nd, 3rd, and 4th internode and leaves, which implies that different PvARFs have specific functions in switchgrass growth and development. In addition, at least 14 pairs of PvARFs respond to naphthylacetic acid (NAA treatment, which might be helpful for us to study on auxin response in switchgrass. The comprehensive analysis, described here, will facilitate the future functional analysis of ARF genes in grasses.
Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia
The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.
Full Text Available Sugar transporters (STs play pivotal roles in the growth, development, and stress responses of phloem-sucking insects, such as the whitefly, Bemisia tabaci. In this study, 137 sugar transporters (STs were identified based on analysis of the genome and transcriptome of B. tabaci MEAM1. B. tabaci MEAM1 encodes a larger number of STs than other selected insects. Phylogenetic and molecular evolution analysis showed that the 137 STs formed three expanded clades and that the genes in Sternorrhyncha expanded clades had accelerated rates of evolution. B. tabaci sugar transporters (BTSTs were divided into three groups based on their expression profiles across developmental stages; however, no host-specific BTST was found in B. tabaci fed on different host plants. Feeding of B. tabaci adults with feeding diet containing dsRNA significantly reduced the transcript level of the target genes in B. tabaci and mortality was significantly improved in B. tabaci fed on dsRNA compared to the control, which indicates the sugar transporters may be used as potential RNAi targets for B. tabaci bio-control. These results provide a foundation for further studies of STs in B. tabaci.
Sun, Jiangmei; Li, Leiting; Wang, Peng; Zhang, Shaoling; Wu, Juyou
Leucine-rich repeat receptor-like protein kinase (LRR-RLK) is the largest gene family of receptor-like protein kinases (RLKs) and actively participates in regulating the growth, development, signal transduction, immunity, and stress responses of plants. However, the patterns of LRR-RLK gene family evolution in the five main Rosaceae species for which genome sequences are available have not yet been reported. In this study, we performed a comprehensive analysis of LRR-RLK genes for five Rosaceae species: Fragaria vesca (strawberry), Malus domestica (apple), Pyrus bretschneideri (Chinese white pear), Prunus mume (mei), and Prunus persica (peach), which contained 201, 244, 427, 267, and 258 LRR-RLK genes, respectively. All LRR-RLK genes were further grouped into 23 subfamilies based on the hidden Markov models approach. RLK-Pelle_LRR-XII-1, RLK-Pelle_LRR-XI-1, and RLK-Pelle_LRR-III were the three largest subfamilies. Synteny analysis indicated that there were 236 tandem duplicated genes in the five Rosaceae species, among which subfamilies XII-1 (82 genes) and XI-1 (80 genes) comprised 68.6%. Our results indicate that tandem duplication made a large contribution to the expansion of the subfamilies. The gene expression, tissue-specific expression, and subcellular localization data revealed that LRR-RLK genes were differentially expressed in various organs and tissues, and the largest subfamily XI-1 was highly expressed in all five Rosaceae species, suggesting that LRR-RLKs play important roles in each stage of plant growth and development. Taken together, our results provide an overview of the LRR-RLK family in Rosaceae genomes and the basis for further functional studies.
Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H; Trivedi, Prabodh K
Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana.
Rea, Matthew; Eckstein, Meredith; Eleazer, Rebekah; Smith, Caroline; Fondufe-Mittendorf, Yvonne N.
Chronic low dose inorganic arsenic (iAs) exposure leads to changes in gene expression and epithelial-to-mesenchymal transformation. During this transformation, cells adopt a fibroblast-like phenotype accompanied by profound gene expression changes. While many mechanisms have been implicated in this transformation, studies that focus on the role of epigenetic alterations in this process are just emerging. DNA methylation controls gene expression in physiologic and pathologic states. Several studies show alterations in DNA methylation patterns in iAs-mediated pathogenesis, but these studies focused on single genes. We present a comprehensive genome-wide DNA methylation analysis using methyl-sequencing to measure changes between normal and iAs-transformed cells. Additionally, these differential methylation changes correlated positively with changes in gene expression and alternative splicing. Interestingly, most of these differentially methylated genes function in cell adhesion and communication pathways. To gain insight into how genomic DNA methylation patterns are regulated during iAs-mediated carcinogenesis, we show that iAs probably targets CTCF binding at the promoter of DNA methyltransferases, regulating their expression. These findings reveal how CTCF binding regulates DNA methyltransferase to reprogram the methylome in response to an environmental toxin.
Saulnier, Delphine M.; Santos, Filipe; Roos, Stefan; Mistretta, Toni-Ann; Spinler, Jennifer K.; Molenaar, Douwe; Teusink, Bas; Versalovic, James
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison...
Saulnier, D.M.A.; Santos, F.; Roos, S.; Mistretta, T.-A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Saulnier, D.M.; Santos, F.; Roos, S.; Mistretta, T.A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Saulnier, D.M.; santos, F.; Roos, S.; Mistretta, T.A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Liu, Xin; Li, Rong; Dai, Yaqing; Chen, Xuesen; Wang, Xiaoyun
The B-box proteins (BBXs) are a family of zinc finger proteins containing one/two B-box domain(s). Compared with intensive studies of animal BBXs, investigations of the plant BBX family are limited, though some specific plant BBXs have been demonstrated to act as transcription factors in the regulation of flowering and photomorphogenesis. In this study, using a global search of the apple (Malus domestica Borkh.) genome, a total of 64 members of BBX (MdBBX) were identified. All the MdBBXs were divided into five groups based on the phylogenetic relationship, numbers of B-boxes contained and whether there was with an additional CCT domain. According to the characteristics of organ-specific expression, MdBBXs were divided into three groups based on the microarray information. An analysis of cis-acting elements showed that elements related to the stress response were prevalent in the promoter sequences of most MdBBXs. Twelve MdBBX members from different groups were randomly selected and exposed to abiotic stresses. Their expressions were up-regulated to some extent in the roots and leaves. Six among 12 MdBBXs were sensitive to osmotic pressure, salt, cold stress and exogenous abscisic acid treatment, with their expressions enhanced more than 20-fold. Our results suggested that MdBBXs may take part in response to abiotic stress.
Liu, Chaoyang; Xie, Tao; Chen, Chenjie; Luan, Aiping; Long, Jianmei; Li, Chuhao; Ding, Yaqi; He, Yehua
The MYB proteins comprise one of the largest families of plant transcription factors, which are involved in various plant physiological and biochemical processes. Pineapple (Ananas comosus) is one of three most important tropical fruits worldwide. The completion of pineapple genome sequencing provides a great opportunity to investigate the organization and evolutionary traits of pineapple MYB genes at the genome-wide level. In the present study, a total of 94 pineapple R2R3-MYB genes were identified and further phylogenetically classified into 26 subfamilies, as supported by the conserved gene structures and motif composition. Collinearity analysis indicated that the segmental duplication events played a crucial role in the expansion of pineapple MYB gene family. Further comparative phylogenetic analysis suggested that there have been functional divergences of MYB gene family during plant evolution. RNA-seq data from different tissues and developmental stages revealed distinct temporal and spatial expression profiles of the AcMYB genes. Further quantitative expression analysis showed the specific expression patterns of the selected putative stress-related AcMYB genes in response to distinct abiotic stress and hormonal treatments. The comprehensive expression analysis of the pineapple MYB genes, especially the tissue-preferential and stress-responsive genes, could provide valuable clues for further function characterization. In this work, we systematically identified AcMYB genes by analyzing the pineapple genome sequence using a set of bioinformatics approaches. Our findings provide a global insight into the organization, phylogeny and expression patterns of the pineapple R2R3-MYB genes, and hence contribute to the greater understanding of their biological roles in pineapple.
Castle, John C.; Armour, Christopher D.; Löwer, Martin; Haynor, David; Biery, Matthew; Bouzek, Heather; Chen, Ronghua; Jackson, Stuart; Johnson, Jason M.; Rohl, Carol A.; Raymond, Christopher K.
Non-coding RNAs (ncRNAs) are an essential class of molecular species that have been difficult to monitor on high throughput platforms due to frequent lack of polyadenylation. Using a polyadenylation-neutral amplification protocol and next-generation sequencing, we explore ncRNA expression in eleven human tissues. ncRNAs 7SL, U2, 7SK, and HBII-52 are expressed at levels far exceeding mRNAs. C/D and H/ACA box snoRNAs are associated with rRNA methylation and pseudouridylation, respectively: spleen expresses both, hypothalamus expresses mainly C/D box snoRNAs, and testes show enriched expression of both H/ACA box snoRNAs and RNA telomerase TERC. Within the snoRNA 14q cluster, 14q(I-6) is expressed at much higher levels than other cluster members. More reads align to mitochondrial than nuclear tRNAs. Many lincRNAs are actively transcribed, particularly those overlapping known ncRNAs. Within the Prader-Willi syndrome loci, the snoRNA HBII-85 (group I) cluster is highly expressed in hypothalamus, greater than in other tissues and greater than group II or III. Additionally, within the disease locus we find novel transcription across a 400,000 nt span in ovaries. This genome-wide polyA-neutral expression compendium demonstrates the richness of ncRNA expression, their high expression patterns, their function-specific expression patterns, and is publicly available. PMID:20668672
John C Castle
Full Text Available Non-coding RNAs (ncRNAs are an essential class of molecular species that have been difficult to monitor on high throughput platforms due to frequent lack of polyadenylation. Using a polyadenylation-neutral amplification protocol and next-generation sequencing, we explore ncRNA expression in eleven human tissues. ncRNAs 7SL, U2, 7SK, and HBII-52 are expressed at levels far exceeding mRNAs. C/D and H/ACA box snoRNAs are associated with rRNA methylation and pseudouridylation, respectively: spleen expresses both, hypothalamus expresses mainly C/D box snoRNAs, and testes show enriched expression of both H/ACA box snoRNAs and RNA telomerase TERC. Within the snoRNA 14q cluster, 14q(I-6 is expressed at much higher levels than other cluster members. More reads align to mitochondrial than nuclear tRNAs. Many lincRNAs are actively transcribed, particularly those overlapping known ncRNAs. Within the Prader-Willi syndrome loci, the snoRNA HBII-85 (group I cluster is highly expressed in hypothalamus, greater than in other tissues and greater than group II or III. Additionally, within the disease locus we find novel transcription across a 400,000 nt span in ovaries. This genome-wide polyA-neutral expression compendium demonstrates the richness of ncRNA expression, their high expression patterns, their function-specific expression patterns, and is publicly available.
Volkov, Petr; Olsson, Anders H; Gillberg, Linn
Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men, w...... and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dys)metabolic traits associated with the development of obesity and diabetes.......Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men......, where 592,794 single nucleotide polymorphisms (SNPs) were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs) in cis and 5...
Magdalena B Wozniak
Full Text Available Gene expression microarray and next generation sequencing efforts on conventional, clear cell renal cell carcinoma (ccRCC have been mostly performed in North American and Western European populations, while the highest incidence rates are found in Central/Eastern Europe. We conducted whole-genome expression profiling on 101 pairs of ccRCC tumours and adjacent non-tumour renal tissue from Czech patients recruited within the "K2 Study", using the Illumina HumanHT-12 v4 Expression BeadChips to explore the molecular variations underlying the biological and clinical heterogeneity of this cancer. Differential expression analysis identified 1650 significant probes (fold change ≥2 and false discovery rate <0.05 mapping to 630 up- and 720 down-regulated unique genes. We performed similar statistical analysis on the RNA sequencing data of 65 ccRCC cases from the Cancer Genome Atlas (TCGA project and identified 60% (402 of the downregulated and 74% (469 of the upregulated genes found in the K2 series. The biological characterization of the significantly deregulated genes demonstrated involvement of downregulated genes in metabolic and catabolic processes, excretion, oxidation reduction, ion transport and response to chemical stimulus, while simultaneously upregulated genes were associated with immune and inflammatory responses, response to hypoxia, stress, wounding, vasculature development and cell activation. Furthermore, genome-wide DNA methylation analysis of 317 TCGA ccRCC/adjacent non-tumour renal tissue pairs indicated that deregulation of approximately 7% of genes could be explained by epigenetic changes. Finally, survival analysis conducted on 89 K2 and 464 TCGA cases identified 8 genes associated with differential prognostic outcomes. In conclusion, a large proportion of ccRCC molecular characteristics were common to the two populations and several may have clinical implications when validated further through large clinical cohorts.
Full Text Available In spite of intensive and increasingly successful attempts to determine the multiple steps involved in colorectal carcinogenesis, the mechanisms responsible for metastasis of colorectal tumors to the liver remain to be clarified. To identify genes that are candidates for involvement in the metastatic process, we analyzed genome-wide expression profiles of 10 primary colorectal cancers and their corresponding metastatic lesions by means of a cDNA microarray consisting of 9121 human genes. This analysis identified 40 genes whose expression was commonly upregulated in metastatic lesions, and 7 that were commonly downregulated. The upregulated genes encoded proteins involved in cell adhesion, or remodeling of the actin cytoskeleton. Investigation of the functions of more of the altered genes should improve our understanding of metastasis and may identify diagnostic markers and/or novel molecular targets for prevention or therapy of metastatic lesions.
Full Text Available Ubiquitin-conjugating enzymes (E2s or UBC enzymes play vital roles in plant development and combat various biotic and abiotic stresses. Longan (Dimocarpus longan Lour. is an important fruit tree in the subtropical region of Southeast Asia and Australia; however the characteristics of the UBC gene family in longan remain unknown. In this study, 40 D. longan UBC genes (DlUBCs, which were classified into 15 groups, were identified in the longan genome. An RNA-seq based analysis showed that DlUBCs showed distinct expression in nine longan tissues. Genome-wide RNA-seq and qRT-PCR based gene expression analysis revealed that 11 DlUBCs were up- or down-regualted in the cultivar “Sijimi” (SJ, suggesting that these genes may be important for flower induction. Finally, qRT-PCR analysis showed that the mRNA levels of 13 DlUBCs under SA (salicylic acid treatment, seven under methyl jasmonate (MeJA treatment, 27 under heat treatment, and 16 under cold treatment were up- or down-regulated, respectively. These results indicated that the DlUBCs may play important roles in responses to abiotic stresses. Taken together, our results provide a comprehensive insight into the organization, phylogeny, and expression patterns of the longan UBC genes, and therefore contribute to the greater understanding of their biological roles in longan.
van Wieringen, W.N.; Belien, J.A.M.; Vosse, S.; Achame, E.M.; Ylstra, B.
Summary: We describe a tool, called ACE-it (Array CGH Expression integration tool). ACE-it links the chromosomal position of the gene dosage measured by array CGH to the genes measured by the expression array. ACE-it uses this link to statistically test whether gene dosage affects RNA expression. ©
Full Text Available Waterlogging causes extensive damage to maize crops in tropical and subtropical regions. The identification of tolerance genes and their interactions at the molecular level will be helpful to engineer tolerant genotypes. A whole-genome transcriptome assay revealed the specific role of genes in response to waterlogging stress in susceptible and tolerant genotypes. Genes involved in the synthesis of ethylene and auxin, cell wall metabolism, activation of G-proteins and formation of aerenchyma and adventitious roots, were upregulated in the tolerant genotype. Many transcription factors, particularly ERFs, MYB, HSPs, MAPK, and LOB-domain protein were involved in regulation of these traits. Genes responsible for scavenging of ROS generated under stress were expressed along with those involved in carbohydrate metabolism. The physical locations of 21 genes expressed in the tolerant genotype were found to correspond with the marker intervals of known QTLs responsible for development of adaptive traits. Among the candidate genes, most showed synteny with genes of sorghum and foxtail millet. Co-expression analysis of 528 microarray samples including 16 samples from the present study generated seven functional modules each in the two genotypes, with differing characteristics. In the tolerant genotype, stress genes were co-expressed along with peroxidase and fermentation pathway genes.
Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin
In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes) were identified. These genes were categorized to 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a numbers of genes were expressed highly in flesh of ripening fruit and some of them showed the increasing expression levels along with the fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs response to the infection of blue mold (Penicillium digitatum), which is the most devastating pathogen in citrus post-harvest process. The results indicated that 20 of them showed the strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during the fruit development and the blue mold infection responses.
Full Text Available Osteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain a small proportion of the heritability for complex traits. Due to the modest genetic effect size and inadequate power, true association signals may not be revealed based on a stringent genome-wide significance threshold. Here, we take advantage of SNP and transcript arrays and integrate GWAS and expression signature profiling relevant to the skeletal system in cellular and animal models to prioritize the discovery of novel candidate genes for osteoporosis-related traits, including bone mineral density (BMD at the lumbar spine (LS and femoral neck (FN, as well as geometric indices of the hip (femoral neck-shaft angle, NSA; femoral neck length, NL; and narrow-neck width, NW. A two-stage meta-analysis of GWAS from 7,633 Caucasian women and 3,657 men, revealed three novel loci associated with osteoporosis-related traits, including chromosome 1p13.2 (RAP1A, p = 3.6x10(-8, 2q11.2 (TBC1D8, and 18q11.2 (OSBPL1A, and confirmed a previously reported region near TNFRSF11B/OPG gene. We also prioritized 16 suggestive genome-wide significant candidate genes based on their potential involvement in skeletal metabolism. Among them, 3 candidate genes were associated with BMD in women. Notably, 2 out of these 3 genes (GPR177, p = 2.6x10(-13; SOX6, p = 6.4x10(-10 associated with BMD in women have been successfully replicated in a large-scale meta-analysis of BMD, but none of the non-prioritized candidates (associated with BMD did. Our results support the concept of our prioritization strategy. In the absence of direct biological support for identified genes, we highlighted the efficiency of subsequent functional characterization using publicly available expression profiling relevant
Awdhesh Kumar Mishra
Full Text Available WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V. Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement
Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj
WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and
Full Text Available Zero gravity causes several changes in metabolic and functional aspects of the human body and experiments in space flight have demonstrated alterations in cancer growth and progression. This study reports the genome wide expression profiling of a colorectal cancer cell line-DLD-1, and a lymphoblast leukemic cell line-MOLT-4, under simulated microgravity in an effort to understand central processes and cellular functions that are dysregulated among both cell lines. Altered cell morphology, reduced cell viability and an aberrant cell cycle profile in comparison to their static controls were observed in both cell lines under microgravity. The process of cell cycle in DLD-1 cells was markedly affected with reduced viability, reduced colony forming ability, an apoptotic population and dysregulation of cell cycle genes, oncogenes, and cancer progression and prognostic markers. DNA microarray analysis revealed 1801 (upregulated and 2542 (downregulated genes (>2 fold in DLD-1 cultures under microgravity while MOLT-4 cultures differentially expressed 349 (upregulated and 444 (downregulated genes (>2 fold under microgravity. The loss in cell proliferative capacity was corroborated with the downregulation of the cell cycle process as demonstrated by functional clustering of DNA microarray data using gene ontology terms. The genome wide expression profile also showed significant dysregulation of post transcriptional gene silencing machinery and multiple microRNA host genes that are potential tumor suppressors and proto-oncogenes including MIR22HG, MIR17HG and MIR21HG. The MIR22HG, a tumor-suppressor gene was one of the highest upregulated genes in the microarray data showing a 4.4 log fold upregulation under microgravity. Real time PCR validated the dysregulation in the host gene by demonstrating a 4.18 log fold upregulation of the miR-22 microRNA. Microarray data also showed dysregulation of direct targets of miR-22, SP1, CDK6 and CCNA2.
Full Text Available Abstract Background Plant stilbenes are a small group of phenylpropanoids, which have been detected in at least 72 unrelated plant species and accumulate in response to biotic and abiotic stresses such as infection, wounding, UV-C exposure and treatment with chemicals. Stilbenes are formed via the phenylalanine/polymalonate-route, the last step of which is catalyzed by the enzyme stilbene synthase (STS, a type III polyketide synthase (PKS. Stilbene synthases are closely related to chalcone synthases (CHS, the key enzymes of the flavonoid pathway, as illustrated by the fact that both enzymes share the same substrates. To date, STSs have been cloned from peanut, pine, sorghum and grapevine, the only stilbene-producing fruiting-plant for which the entire genome has been sequenced. Apart from sorghum, STS genes appear to exist as a family of closely related genes in these other plant species. Results In this study a complete characterization of the STS multigenic family in grapevine has been performed, commencing with the identification, annotation and phylogenetic analysis of all members and integration of this information with a comprehensive set of gene expression analyses including healthy tissues at differential developmental stages and in leaves exposed to both biotic (downy mildew infection and abiotic (wounding and UV-C exposure stresses. At least thirty-three full length sequences encoding VvSTS genes were identified, which, based on predicted amino acid sequences, cluster in 3 principal groups designated A, B and C. The majority of VvSTS genes cluster in groups B and C and are located on chr16 whereas the few gene family members in group A are found on chr10. Microarray and mRNA-seq expression analyses revealed different patterns of transcript accumulation between the different groups of VvSTS family members and between VvSTSs and VvCHSs. Indeed, under certain conditions the transcriptional response of VvSTS and VvCHS genes appears to be
Lui, Julian C; Nilsson, Ola; Chan, Yingleong; Palmer, Cameron D; Andrade, Anenisia C; Hirschhorn, Joel N; Baron, Jeffrey
Previous meta-analysis of genome-wide association (GWA) studies has identified 180 loci that influence adult height. However, each GWA locus typically comprises a set of contiguous genes, only one of which presumably modulates height. We reasoned that many of the causative genes within these loci influence height because they are expressed in and function in the growth plate, a cartilaginous structure that causes bone elongation and thus determines stature. Therefore, we used expression microarray studies of mouse and rat growth plate, human disease databases and a mouse knockout phenotype database to identify genes within the GWAS loci that are likely required for normal growth plate function. Each of these approaches identified significantly more genes within the GWA height loci than at random genomic locations (P analysis strongly implicates 78 genes in growth plate function, including multiple genes that participate in PTHrP-IHH, BMP and CNP signaling, and many genes that have not previously been implicated in the growth plate. Thus, this analysis reveals a large number of novel genes that regulate human growth plate chondrogenesis and thereby contribute to the normal variations in human adult height. The analytic approach developed for this study may be applied to GWA studies for other common polygenic traits and diseases, thus providing a new general strategy to identify causative genes within GWA loci and to translate genetic associations into mechanistic biological insights.
Islam, Md Shiful; Choudhury, Mouraj; Majlish, Al-Nahian Khan; Islam, Tahmina; Ghosh, Ajit
Glutathione S-transferases (GSTs) are ubiquitous enzymes which play versatile functions including cellular detoxification and stress tolerance. In this study, a comprehensive genome-wide identification of GST gene family was carried out in potato (Solanum tuberosum L.). The result demonstrated the presence of at least 90 GST genes in potato which is greater than any other reported species. According to the phylogenetic analyses of Arabidopsis, rice and potato GST members, GSTs could be subdivided into ten different classes and each class is found to be highly conserved. The largest class of potato GST family is tau with 66 members, followed by phi and lambda. The chromosomal localization analysis revealed the highly uneven distribution of StGST genes across the potato genome. Transcript profiling of 55 StGST genes showed the tissue-specific expression for most of the members. Moreover, expression of StGST genes were mainly repressed in response to abiotic stresses, while largely induced in response to biotic and hormonal elicitations. Further analysis of StGST gene's promoter identified the presence of various stress responsive cis-regulatory elements. Moreover, one of the highly stress responsive StGST members, StGSTU46, showed strong affinity towards flurazole with lowest binding energy of -7.6kcal/mol that could be used as antidote to protect crop against herbicides. These findings will facilitate the further functional and evolutionary characterization of GST genes in potato. Copyright © 2017 Elsevier B.V. All rights reserved.
Delphine M Saulnier
Full Text Available The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison of two breast milk-derived isolates representative of each group: an established probiotic strain (L. reuteri ATCC 55730 and a strain with promising probiotic features (L. reuteri ATCC PTA 6475. Transcriptomes of L. reuteri strains in different growth phases were monitored using strain-specific microarrays, and compared using a pan-metabolic model representing all known metabolic reactions present in these strains. Both strains contained candidate genes involved in the survival and persistence in the gut such as mucus-binding proteins and enzymes scavenging reactive oxygen species. A large operon predicted to encode the synthesis of an exopolysaccharide was identified in strain 55730. Both strains were predicted to produce health-promoting factors, including antimicrobial agents and vitamins (folate, vitamin B(12. Additionally, a complete pathway for thiamine biosynthesis was predicted in strain 55730 for the first time in this species. Candidate genes responsible for immunomodulatory properties of each strain were identified by transcriptomic comparisons. The production of bioactive metabolites by human-derived probiotics may be predicted using metabolic modeling and transcriptomics. Such strategies may facilitate selection and optimization of probiotics for health promotion, disease prevention and amelioration.
Saulnier, Delphine M; Santos, Filipe; Roos, Stefan; Mistretta, Toni-Ann; Spinler, Jennifer K; Molenaar, Douwe; Teusink, Bas; Versalovic, James
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison of two breast milk-derived isolates representative of each group: an established probiotic strain (L. reuteri ATCC 55730) and a strain with promising probiotic features (L. reuteri ATCC PTA 6475). Transcriptomes of L. reuteri strains in different growth phases were monitored using strain-specific microarrays, and compared using a pan-metabolic model representing all known metabolic reactions present in these strains. Both strains contained candidate genes involved in the survival and persistence in the gut such as mucus-binding proteins and enzymes scavenging reactive oxygen species. A large operon predicted to encode the synthesis of an exopolysaccharide was identified in strain 55730. Both strains were predicted to produce health-promoting factors, including antimicrobial agents and vitamins (folate, vitamin B(12)). Additionally, a complete pathway for thiamine biosynthesis was predicted in strain 55730 for the first time in this species. Candidate genes responsible for immunomodulatory properties of each strain were identified by transcriptomic comparisons. The production of bioactive metabolites by human-derived probiotics may be predicted using metabolic modeling and transcriptomics. Such strategies may facilitate selection and optimization of probiotics for health promotion, disease prevention and amelioration.
Chandra, Saket; Kazmi, Andaleeb Z; Ahmed, Zainab; Roychowdhury, Gargi; Kumari, Veena; Kumar, Manish; Mukhopadhyay, Kunal
NB-ARC domain-containing resistance genes from the wheat genome were identified, characterized and localized on chromosome arms that displayed differential yet positive response during incompatible and compatible leaf rust interactions. Wheat (Triticum aestivum L.) is an important cereal crop; however, its production is affected severely by numerous diseases including rusts. An efficient, cost-effective and ecologically viable approach to control pathogens is through host resistance. In wheat, high numbers of resistance loci are present but only few have been identified and cloned. A comprehensive analysis of the NB-ARC-containing genes in complete wheat genome was accomplished in this study. Complete NB-ARC encoding genes were mined from the Ensembl Plants database to predict 604 NB-ARC containing sequences using the HMM approach. Genome-wide analysis of orthologous clusters in the NB-ARC-containing sequences of wheat and other members of the Poaceae family revealed maximum homology with Oryza sativa indica and Brachypodium distachyon. The identification of overlap between orthologous clusters enabled the elucidation of the function and evolution of resistance proteins. The distributions of the NB-ARC domain-containing sequences were found to be balanced among the three wheat sub-genomes. Wheat chromosome arms 4AL and 7BL had the most NB-ARC domain-containing contigs. The spatio-temporal expression profiling studies exemplified the positive role of these genes in resistant and susceptible wheat plants during incompatible and compatible interaction in response to the leaf rust pathogen Puccinia triticina. Two NB-ARC domain-containing sequences were modelled in silico, cloned and sequenced to analyze their fine structures. The data obtained in this study will augment isolation, characterization and application NB-ARC resistance genes in marker-assisted selection based breeding programs for improving rust resistance in wheat.
Full Text Available Nucleotide binding site leucine-rich repeats (NBS-LRR disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR and coiled coil (CC (1 ∶ 1 was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple.
Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K
Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1 ∶ 1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple.
Park, Wonkeun; Scheffler, Brian E; Bauer, Philip J; Campbell, B Todd
Cotton is the world's primary fiber crop and is a major agricultural commodity in over 30 countries. Like many other global commodities, sustainable cotton production is challenged by restricted natural resources. In response to the anticipated increase of agricultural water demand, a major research direction involves developing crops that use less water or that use water more efficiently. In this study, our objective was to identify differentially expressed genes in response to water deficit stress in cotton. A global expression analysis using cDNA-Amplified Fragment Length Polymorphism was conducted to compare root and leaf gene expression profiles from a putative drought resistant cotton cultivar grown under water deficit stressed and well watered field conditions. We identified a total of 519 differentially expressed transcript derived fragments. Of these, 147 transcript derived fragment sequences were functionally annotated according to their gene ontology. Nearly 70 percent of transcript derived fragments belonged to four major categories: 1) unclassified, 2) stress/defense, 3) metabolism, and 4) gene regulation. We found heat shock protein-related and reactive oxygen species-related transcript derived fragments to be among the major parts of functional pathways induced by water deficit stress. Also, twelve novel transcripts were identified as both water deficit responsive and cotton specific. A subset of differentially expressed transcript derived fragments was verified using reverse transcription-polymerase chain reaction. Differential expression analysis also identified five pairs of duplicated transcript derived fragments in which four pairs responded differentially between each of their two homologues under water deficit stress. In this study, we detected differentially expressed transcript derived fragments from water deficit stressed root and leaf tissues in tetraploid cotton and provided their gene ontology, functional/biological distribution, and
Junnila, Siina; Kokkola, Arto; Karjalainen-Lindsberg, Marja-Liisa; Puolakkainen, Pauli; Monni, Outi
Gastric cancer is one of the most common malignancies worldwide and the second most common cause of cancer related death. Gene copy number alterations play an important role in the development of gastric cancer and a change in gene copy number is one of the main mechanisms for a cancer cell to control the expression of potential oncogenes and tumor suppressor genes. To highlight genes of potential biological and clinical relevance in gastric cancer, we carried out a systematic array-based survey of gene expression and copy number levels in primary gastric tumors and gastric cancer cell lines and validated the results using an affinity capture based transcript analysis (TRAC assay) and real-time qRT-PCR. Integrated microarray analysis revealed altogether 256 genes that were located in recurrent regions of gains or losses and had at least a 2-fold copy number- associated change in their gene expression. The expression levels of 13 of these genes, ALPK2, ASAP1, CEACAM5, CYP3A4, ENAH, ERBB2, HHIPL2, LTB4R, MMP9, PERLD1, PNMT, PTPRA, and OSMR, were validated in a total of 118 gastric samples using either the qRT-PCR or TRAC assay. All of these 13 genes were differentially expressed between cancerous samples and nonmalignant tissues (p < 0.05) and the association between copy number and gene expression changes was validated for nine (69.2%) of these genes (p < 0.05). In conclusion, integrated gene expression and copy number microarray analysis highlighted genes that may be critically important for gastric carcinogenesis. TRAC and qRT-PCR analyses validated the microarray results and therefore the role of these genes as potential biomarkers for gastric cancer
Full Text Available Galactinol synthase (GolS is a key enzyme in raffinose family oligosaccharide (RFO biosynthesis. The finding that GolS accumulates in plants exposed to abiotic stresses indicates RFOs function in environmental adaptation. However, the evolutionary relationships and biological functions of GolS family in rapeseed (Brassica napus and tobacco (Nicotiana tabacum remain unclear. In this study, we identified 20 BnGolS and 9 NtGolS genes. Subcellular localization predictions showed that most of the proteins are localized to the cytoplasm. Phylogenetic analysis identified a lost event of an ancient GolS copy in the Solanaceae and an ancient duplication event leading to evolution of GolS4/7 in the Brassicaceae. The three-dimensional structures of two GolS proteins were conserved, with an important DxD motif for binding to UDP-galactose (uridine diphosphate-galactose and inositol. Expression profile analysis indicated that BnGolS and NtGolS genes were expressed in most tissues and highly expressed in one or two specific tissues. Hormone treatments strongly induced the expression of most BnGolS genes and homologous genes in the same subfamilies exhibited divergent-induced expression. Our study provides a comprehensive evolutionary analysis of GolS genes among the Brassicaceae and Solanaceae as well as an insight into the biological function of GolS genes in hormone response in plants.
Bjerrum, Jacob Tveiten; Hansen, Morten; Olsen, Jørgen
colonocytes from UC patients and controls in order to identify the cell types responsible for the continuous inflammatory state. Methods: Adjacent mucosal colonic biopsies were obtained endoscopically from the descending colon in patients with active UC (n = 8), quiescent UC (n = 9), and with irritable bowel......Background: Genome-wide gene expression (GWGE) profiles of mucosal colonic biopsies have suggested the existence of a continuous inflammatory state in quiescent ulcerative colitis (UC). The aim of this study was to use DNA microarray-based GWGE profiling of mucosal colonic biopsies and isolated......-discriminant analysis using the SIMCA-P 11 software (Umetrics, Umea, Sweden). Results: A clear separation between active UC, quiescent UC, and control biopsies were found, whereas the model for the colonocytes was unable to distinguish between quiescent UC and controls. The differentiation between quiescent UC...
Simon Horst H
Full Text Available Abstract Midbrain dopaminergic neurons are involved in control of emotion, motivation and motor behavior. The loss of one of the subpopulations, substantia nigra pars compacta, is the pathological hallmark of one of the most prominent neurological disorders, Parkinson's disease. Several groups have looked at the molecular identity of midbrain dopaminergic neurons and have suggested the gene expression profile of these neurons. Here, after determining the efficiency of each screen, we provide a linked database of the genes, expressed in this neuronal population, by combining and comparing the results of six previous studies and verification of expression of each gene in dopaminergic neurons, using the collection of in situ hybridization in the Allen Brain Atlas.
Campbell, Thomas Mark; Trudel, Guy; Wong, Kayleigh Kristin; Laneuville, Odette
Knee flexion contractures (KFC) are limitations in the ability to fully extend the knee joint. In people with knee osteoarthritis (OA), KFC are common, impair function, and worsen outcomes after arthroplasty. In KFC, the posterior knee capsule is believed to play a key role, but the pathophysiology remains poorly understood. We sought to identify gene expression differences in the posterior knee capsule of patients with OA with and without KFC. Capsule tissue was obtained from the knees of 12 subjects diagnosed with advanced-stage OA at the time of knee arthroplasty surgery. The presence or absence of KFC allocated patients into 2 groups using a case-control design. Genomewide capsular gene expression was compared between the 2 patient groups. Confirmation of differential expression of the corresponding proteins was performed by immunohistochemistry on tissue sections. There were no significant demographic differences between the patients with OA with KFC and without KFC save for reduced extension in their surgical knee (pKFC patients showed a 6.4-fold decrease in CSN1S1 (p=0.017) gene expression and a 3.7-, 2.0-, and 2.6-fold increase in CHAD, Sox9, and Cyr61 gene expression, respectively (p=0.001, 0.004, 0.001, respectively). There were corresponding increases in protein levels for chondroadherin, sex determining region Y-box 9, and casein alphaS1 (all pKFC exhibited differential expression of 4 genes all previously documented to be associated with tissue fibrosis.
Pócsi, István; Miskei, Márton; Karányi, Zsolt; Emri, Tamás; Ayoubi, Patricia; Pusztahelyi, Tünde; Balla, György; Prade, Rolf A
Background In addition to their cytotoxic nature, reactive oxygen species (ROS) are also signal molecules in diverse cellular processes in eukaryotic organisms. Linking genome-wide transcriptional changes to cellular physiology in oxidative stress-exposed Aspergillus nidulans cultures provides the opportunity to estimate the sizes of peroxide (O22-), superoxide (O2•-) and glutathione/glutathione disulphide (GSH/GSSG) redox imbalance responses. Results Genome-wide transcriptional changes triggered by diamide, H2O2 and menadione in A. nidulans vegetative tissues were recorded using DNA microarrays containing 3533 unique PCR-amplified probes. Evaluation of LOESS-normalized data indicated that 2499 gene probes were affected by at least one stress-inducing agent. The stress induced by diamide and H2O2 were pulse-like, with recovery after 1 h exposure time while no recovery was observed with menadione. The distribution of stress-responsive gene probes among major physiological functional categories was approximately the same for each agent. The gene group sizes solely responsive to changes in intracellular O22-, O2•- concentrations or to GSH/GSSG redox imbalance were estimated at 7.7, 32.6 and 13.0 %, respectively. Gene groups responsive to diamide, H2O2 and menadione treatments and gene groups influenced by GSH/GSSG, O22- and O2•- were only partly overlapping with distinct enrichment profiles within functional categories. Changes in the GSH/GSSG redox state influenced expression of genes coding for PBS2 like MAPK kinase homologue, PSK2 kinase homologue, AtfA transcription factor, and many elements of ubiquitin tagging, cell division cycle regulators, translation machinery proteins, defense and stress proteins, transport proteins as well as many enzymes of the primary and secondary metabolisms. Meanwhile, a separate set of genes encoding transport proteins, CpcA and JlbA amino acid starvation-responsive transcription factors, and some elements of sexual development
Leyfer, Dmitriy; Weng, Zhiping
A holistic approach to the study of cellular processes is identifying both gene-expression changes and regulatory elements promoting such changes. Cellular regulatory processes can be viewed as transcriptional modules (TMs), groups of coexpressed genes regulated by groups of transcription factors (TFs). We set out to devise a method that would identify TMs while avoiding arbitrary thresholds on TM sizes and number. Assuming that gene expression is determined by TFs that bind to the gene's promoter, clustering of genes based on TF binding sites (cis-elements) should create gene groups similar to those obtained by gene expression clustering. Intersections between the expression and cis-element-based gene clusters reveal TMs. Statistical significance assigned to each TM allows identification of regulatory units of any size. Our method correctly identifies the number and sizes of TMs on simulated datasets. We demonstrate that yeast experimental TMs are biologically relevant by comparing them with MIPS and GO categories. Our modules are in statistically significant agreement with TMs from other research groups. This work suggests that there is no preferential division of biological processes into regulatory units; each degree of partitioning exhibits a slice of biological network revealing hierarchical modular organization of transcriptional regulation.
Bassett, Anne S.; Lowther, Chelsea; Merico, Daniele; Costain, Gregory; Chow, Eva W C; Van Amelsvoort, Therese; McDonald-Mcginn, Donna M.; Gur, Raquel E.; Swillen, Ann; van den Bree, Marianne B M; Murphy, Kieran C.; Gothelf, Doron; Bearden, Carrie E.; Eliez, Stephan; Kates, Wendy R.; Philip, Nicole; Sashi, Vandana; Campbell, Linda E.; Vorstman, Jacob; Cubells, Joseph; Repetto, Gabriela M.; Simon, Tony J.; Boot, Erik; Heung, Tracy; Evers, Rens; Vingerhoets, Claudia; Van Duin, Esther; Zackai, Elaine; Vergaelen, Elfi; Devriendt, Koen; Vermeesch, Joris R.; Owen, Michael J; Murphy, Clodagh M.; Michaelovosky, Elena; Kushan, Leila; Schneider, Maude; Fremont, Wanda; Busa, Tiffany; Hooper, Stephen R.; McCabe, Kathryn; Duijff, Sasja; Isaev, Karin; Pellecchia, Giovanna; Wei, John; Gazzellone, Matthew J.; Scherer, Stephen W.; Emanuel, Beverly S.; Guo, Tingwei; Morrow, Bernice E.; Marshall, Christian R.
Objective: Chromosome 22q11.2 deletion syndrome (22q11.2DS) is associated with a more than 20-fold increased risk for developing schizophrenia. The aim of this studywas to identify additional genetic factors (i.e., "second hits") that may contribute to schizophrenia expression. Method: Through an
Full Text Available Mitogen-activated protein kinases (MAPKs play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars.
Takahashi, N; Rieneck, K; van der Kraan, P M
To elucidate the antagonism between interleukin-1 (IL-1) and transforming growth factor-beta (TGF-beta) at the gene expression level, as IL-1 and TGF-beta are postulated to be critical mediators of cartilage degeneration/protection in rheumatic diseases....
Full Text Available Glyoxalase is an evolutionary highly conserved pathway present in all organisms. Conventional glyoxalase pathway has two enzymes, glyoxalase I (GLYI and glyoxalase II (GLYII that act sequentially to detoxify a highly cytotoxic compound methylglyoxal (MG to D-lactate with the help of reduced glutathione. Recently, proteins with DJ-1/PfpI domain have been reported to perform the same conversion in a single step without the help of any cofactor and thus termed as “unique glyoxalase III” enzyme. Genome-wide analysis of glyoxalase genes have been previously conducted in Arabidopsis, rice and Soybean plants, but no such study was performed for one of the agricultural important model legume species, Medicago truncatula. A comprehensive genome-wide analysis of Medicago identified a total of putative 29 GLYI, 14 GLYII genes, and 5 glyoxalase III (DJ-1 genes. All these identified genes and their corresponding proteins were analyzed in detail including their chromosomal distribution, gene duplication, phylogenetic relationship, and the presence of conserved domain(s. Expression of all these genes was analyzed in different tissues as well as under two devastating abiotic stresses- salinity and drought using publicly available transcript data. This study revealed that MtGLYI-4, MtGLYII-6, and MtDJ-1A are the constitutive members with a high level of expression at all 17 analyzed tissues; while MtGLYI-1, MtGLYI-11, MtGLYI-5, MtGLYI-7, and MtGLYII-13 showed tissue-specific expression. Moreover, most of the genes displayed similar pattern of expression in response to both salinity and drought stress, irrespective of stress duration and tissue type. MtGLYI-8, MtGLYI-11, MtGLYI-6, MtGLYI-16, MtGLYI-21, and MtGLYII-9 showed up-regulation, while MtGLYI-17 and MtGLYI-7/9 showed down-regulation in response to both stresses. Interestingly, MtGLYI-14/15 showed completely opposite pattern of expression in these two stresses. This study provides an initial basis
Martins, Cristina de Paula Santos; Pedrosa, Andresa Muniz; Du, Dongliang; Gonçalves, Luana Pereira; Yu, Qibin; Gmitter, Frederick G; Costa, Marcio Gilberto Cardoso
The family of aquaporins (AQPs), or major intrinsic proteins (MIPs), includes integral membrane proteins that function as transmembrane channels for water and other small molecules of physiological significance. MIPs are classified into five subfamilies in higher plants, including plasma membrane (PIPs), tonoplast (TIPs), NOD26-like (NIPs), small basic (SIPs) and unclassified X (XIPs) intrinsic proteins. This study reports a genome-wide survey of MIP encoding genes in sweet orange (Citrus sinensis L. Osb.), the most widely cultivated Citrus spp. A total of 34 different genes encoding C. sinensis MIPs (CsMIPs) were identified and assigned into five subfamilies (CsPIPs, CsTIPs, CsNIPs, CsSIPs and CsXIPs) based on sequence analysis and also on their phylogenetic relationships with clearly classified MIPs of Arabidopsis thaliana. Analysis of key amino acid residues allowed the assessment of the substrate specificity of each CsMIP. Gene structure analysis revealed that the CsMIPs possess an exon-intron organization that is highly conserved within each subfamily. CsMIP loci were precisely mapped on every sweet orange chromosome, indicating a wide distribution of the gene family in the sweet orange genome. Investigation of their expression patterns in different tissues and upon drought and salt stress treatments, as well as with 'Candidatus Liberibacter asiaticus' infection, revealed a tissue-specific and coordinated regulation of the different CsMIP isoforms, consistent with the organization of the stress-responsive cis-acting regulatory elements observed in their promoter regions. A special role in regulating the flow of water and nutrients is proposed for CsTIPs and CsXIPs during drought stress, and for most CsMIPs during salt stress and the development of HLB disease. These results provide a valuable reference for further exploration of the CsMIPs functions and applications to the genetic improvement of both abiotic and biotic stress tolerance in citrus.
Cristina de Paula Santos Martins
Full Text Available The family of aquaporins (AQPs, or major intrinsic proteins (MIPs, includes integral membrane proteins that function as transmembrane channels for water and other small molecules of physiological significance. MIPs are classified into five subfamilies in higher plants, including plasma membrane (PIPs, tonoplast (TIPs, NOD26-like (NIPs, small basic (SIPs and unclassified X (XIPs intrinsic proteins. This study reports a genome-wide survey of MIP encoding genes in sweet orange (Citrus sinensis L. Osb., the most widely cultivated Citrus spp. A total of 34 different genes encoding C. sinensis MIPs (CsMIPs were identified and assigned into five subfamilies (CsPIPs, CsTIPs, CsNIPs, CsSIPs and CsXIPs based on sequence analysis and also on their phylogenetic relationships with clearly classified MIPs of Arabidopsis thaliana. Analysis of key amino acid residues allowed the assessment of the substrate specificity of each CsMIP. Gene structure analysis revealed that the CsMIPs possess an exon-intron organization that is highly conserved within each subfamily. CsMIP loci were precisely mapped on every sweet orange chromosome, indicating a wide distribution of the gene family in the sweet orange genome. Investigation of their expression patterns in different tissues and upon drought and salt stress treatments, as well as with 'Candidatus Liberibacter asiaticus' infection, revealed a tissue-specific and coordinated regulation of the different CsMIP isoforms, consistent with the organization of the stress-responsive cis-acting regulatory elements observed in their promoter regions. A special role in regulating the flow of water and nutrients is proposed for CsTIPs and CsXIPs during drought stress, and for most CsMIPs during salt stress and the development of HLB disease. These results provide a valuable reference for further exploration of the CsMIPs functions and applications to the genetic improvement of both abiotic and biotic stress tolerance in citrus.
Yadav, Amita; Khan, Yusuf; Prasad, Manoj
A set of novel and known dehydration-responsive miRNAs have been identified in foxtail millet. These findings provide new insights into understanding the functional role of miRNAs and their respective targets in regulating plant response to dehydration stress. MicroRNAs perform significant regulatory roles in growth, development and stress response of plants. Though the miRNA-mediated gene regulatory networks under dehydration stress remain largely unexplored in plant including foxtail millet (Setaria italica), which is a natural abiotic stress tolerant crop. To find out the dehydration-responsive miRNAs at the global level, four small RNA libraries were constructed from control and dehydration stress treated seedlings of two foxtail millet cultivars showing contrasting tolerance behavior towards dehydration stress. Using Illumina sequencing technology, 55 known and 136 novel miRNAs were identified, representing 22 and 48 miRNA families, respectively. Eighteen known and 33 novel miRNAs were differentially expressed during dehydration stress. After the stress treatment, 32 dehydration-responsive miRNAs were up-regulated in tolerant cultivar and 22 miRNAs were down-regulated in sensitive cultivar, suggesting that miRNA-mediated molecular regulation might play important roles in providing contrasting characteristics to these cultivars. Predicted targets of identified miRNAs were found to encode various transcription factors and functional enzymes, indicating their involvement in broad spectrum regulatory functions and biological processes. Further, differential expression patterns of seven known miRNAs were validated by northern blot and expression of ten novel dehydration-responsive miRNAs were confirmed by SL-qRT PCR. Differential expression behavior of five miRNA-target genes was verified under dehydration stress treatment and two of them also validated by RLM RACE. Overall, the present study highlights the importance of dehydration stress-associated post
Full Text Available The heavy metal ATPase (HMA family plays an important role in transition metal transport in plants. However, this gene family has not been extensively studied in Populus trichocarpa. We identified 17 HMA genes in P. trichocarpa (PtHMAs, of which PtHMA1–PtHMA4 belonged to the zinc (Zn/cobalt (Co/cadmium (Cd/lead (Pb subgroup, and PtHMA5–PtHMA8 were members of the copper (Cu/silver (Ag subgroup. Most of the genes were localized to chromosomes I and III. Gene structure, gene chromosomal location, and synteny analyses of PtHMAs indicated that tandem and segmental duplications likely contributed to the expansion and evolution of the PtHMAs. Most of the HMA genes contained abiotic stress-related cis-elements. Tissue-specific expression of PtHMA genes showed that PtHMA1 and PtHMA4 had relatively high expression levels in the leaves, whereas Cu/Ag subgroup (PtHMA5.1- PtHMA8 genes were upregulated in the roots. High concentrations of Cu, Ag, Zn, Cd, Co, Pb and Mn differentially regulated the expression of PtHMAs in various tissues. The preliminary results of the present study generated basic information on the HMA family of Populus that may serve as foundation for future functional studies.
Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng
Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
Chen, Aiqun; Chen, Xiao; Wang, Huimin; Liao, Dehua; Gu, Mian; Qu, Hongye; Sun, Shubin; Xu, Guohua
Phosphorus (P) deficiency is one of the major nutrient stresses limiting plant growth. The uptake of P by plants is well considered to be mediated by a number of high-affinity phosphate (Pi) transporters belonging to the Pht1 family. Although the Pht1 genes have been extensively identified in several plant species, there is a lack of systematic analysis of the Pht1 gene family in any solanaceous species thus far. Here, we report the genome-wide analysis, phylogenetic evolution and expression patterns of the Pht1 genes in tomato (Solanum lycopersicum). A total of eight putative Pht1 genes (LePT1 to 8), distributed on three chromosomes (3, 6 and 9), were identified through extensive searches of the released tomato genome sequence database. Chromosomal organization and phylogenetic tree analysis suggested that the six Pht1 paralogues, LePT1/3, LePT2/6 and LePT4/5, which were assigned into three pairs with very close physical distance, were produced from recent tandem duplication events that occurred after Solanaceae splitting with other dicot families. Expression analysis of these Pht1 members revealed that except LePT8, of which the transcript was undetectable in all tissues, the other seven paralogues showed differential but partial-overlapping expression patterns. LePT1 and LePT7 were ubiquitously expressed in all tissues examined, and their transcripts were induced abundantly in response to Pi starvation; LePT2 and LePT6, the two paralogues harboring identical coding sequence, were predominantly expressed in Pi-deficient roots; LePT3, LePT4 and LePT5 were strongly activated in the roots colonized by arbuscular mycorrhizal fungi under low-P, but not high-P condition. Histochemical analysis revealed that a 1250-bp LePT3 promoter fragment and a 471-bp LePT5 promoter fragment containing the two elements, MYCS and P1BS, were sufficient to direct the GUS reporter expression in mycorrhizal roots and were limited to distinct cells harboring AM fungal structures
The main goal of this study is to identify the molecular mechanisms involved in the response to low doses of ionizing radiation by using oligonucleotide micro-arrays. The results presented in this study demonstrate the relevance of this tool in the characterization of changes in gene expression at low doses of g radiation. All experimentations conducted were conducted on T CD4+ human lymphocytes. From this model we tried to characterize in 600 minutes after irradiation the molecular players in response to a dose range of 5 and 500 mGy. In this study, we highlight two types of molecular response. First, a dose-dependent response to irradiation involving groups of genes whose proteins are implicated in the response to DNA damage and in the p53 signaling pathway. The expression of the majority of the genes is modulated from 100 mGy. The second type of response is independent of the irradiation dose and the proteins encoded are involved in the mechanism of oxidative phosphorylation. Some of these genes could potentially be regulated by a family of transcription factor with an ETS domain. These ETS factors are known to be potential targets of the MAPKinase pathway. All these genes are modulated from 5 mGy and are therefore potential molecular players involved in the cellular response to ionizing stress with a low intensity. (author)
J Stephen Dumler
Full Text Available Anaplasma phagocytophilum, an obligate intracellular prokaryote, infects neutrophils and alters cardinal functions via reprogrammed transcription. Large contiguous regions of neutrophil chromosomes are differentially expressed during infection. Secreted A. phagocytophilum effector AnkA transits into the neutrophil or granulocyte nucleus to complex with DNA in heterochromatin across all chromosomes. AnkA binds to gene promoters to dampen cis-transcription and also has features of matrix attachment region (MAR-binding proteins that regulate three-dimensional chromatin architecture and coordinate transcriptional programs encoded in topologically-associated chromatin domains. We hypothesize that identification of additional AnkA binding sites will better delineate how A. phagocytophilum infection results in reprogramming of the neutrophil genome. Using AnkA-binding ChIP-seq, we showed that AnkA binds broadly throughout all chromosomes in a reproducible pattern, especially at: i intergenic regions predicted to be matrix attachment regions (MARs; ii within predicted lamina-associated domains; and iii at promoters ≤3,000 bp upstream of transcriptional start sites. These findings provide genome-wide support for AnkA as a regulator of cis-gene transcription. Moreover, the dominant mark of AnkA in distal intergenic regions known to be AT-enriched, coupled with frequent enrichment in the nuclear lamina, provides strong support for its role as a MAR-binding protein and genome re-organizer. AnkA must be considered a prime candidate to promote neutrophil reprogramming and subsequent functional changes that belie improved microbial fitness and pathogenicity.
Sugimoto, Manabu; Gusev, Oleg; Wheeler, Raymond; Levinskikh, Margarita; Sychev, Vladimir; Bingham, Gail; Hummerick, Mary; Oono, Youko; Matsumoto, Takashi; Yazawa, Takayuki
We have developed a plant growth system, namely Lada, which was installed in ISS to study and grow plants, including vegetables in a spaceflight environment. We have succeeded in cultivating Mizuna, tomato, pea, radish, wheat, rice, and barley in long-term spaceflight. Transcription levels of superoxide dismutase, glutamyl transferase, catalase, and ascorbate peroxidase were increased in the barley germinated and grown for 26 days in Lada, though the whole-plant growth and development of the barley in spaceflight were the same as in the ground control barley. In this study, we investigated the response of the ROS gene network in Mizuna, Brassica rapa var. nipposinica, cultivated under spaceflight condition. Seeds of Mizuna were sown in the root module of LADA aboard the Zvezda module of ISS and the seedlings were grown under 24h lighting in the leaf chamber. After 27 days of cultivation, the plants were harvested and stored at -80(°) C in MELFI aboard the Destiny module, and were transported to the ground at < -20(°) C in GLACIER aboard Space Shuttle. Ground control cultivation was carried out under the same conditions in LADA. Total RNA isolated from leaves was subjected to mRNA-Seq using next generation sequencing (NGS) technology. A total of 20 in 32 ROS oxidative marker genes were up-regulated, including high expression of four hallmarks, and preferentially expressed genes associated with ROS-scavenging including thioredoxin, glutaredoxin, and alternative oxidase genes. In the transcription factors of the ROS gene network, MEKK1-MKK4-MPK3, OXI1-MKK4-MPK3, and OXI1-MPK3 of MAP cascades, induction of WRKY22 by MEKK1-MKK4-MPK3 cascade, induction of WRKY25 and repression of Zat7 by Zat12 were suggested. These results revealed that the spaceflight environment induced oxidative stress and the ROS gene network activation in the space-grown Mizuna.
Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila
The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.
Li, Xiao-xu; Liu, Cheng; Li, Wei; Zhang, Zeng-lin; Gao, Xiao-ming; Zhou, Hui; Guo, Yong-feng
Members of the plant-specific WOX transcription factor family have been reported to play important roles in cell to cell communication as well as other physiological and developmental processes. In this study, ten members of the WOX transcription factor family were identified in Solanum lycopersicum with HMMER. Neighbor-joining phylogenetic tree, maximum-likelihood tree and Bayesian-inference tree were constructed and similar topologies were shown using the protein sequences of the homeodomain. Phylogenetic study revealed that the 25 WOX family members from Arabidopsis and tomato fall into three clades and nine subfamilies. The patterns of exon-intron structures and organization of conserved domains in Arabidopsis and tomato were consistent based on the phylogenetic results. Transcriptome analysis showed that the expression patterns of SlWOXs were different in different tissue types. Gene Ontology (GO) analysis suggested that, as transcription factors, the SlWOX family members could be involved in a number of biological processes including cell to cell communication and tissue development. Our results are useful for future studies on WOX family members in tomato and other plant species.
Silva Monteiro de Almeida, Dayanne; Oliveira Jordão do Amaral, Daniel; Del-Bem, Luiz-Eduardo; Bronze Dos Santos, Emily; Santana Silva, Raner José; Peres Gramacho, Karina; Vincentz, Michel; Micheli, Fabienne
Transcriptional regulation, led by transcription factors (TFs) such as those of the WRKY family, is a mechanism used by the organism to enhance or repress gene expression in response to stimuli. Here, we report on the genome-wide analysis of the Theobroma cacao WRKY TF family and also investigate the expression of WRKY genes in cacao infected by the fungus Moniliophthora perniciosa. In the cacao genome, 61 non-redundant WRKY sequences were found and classified in three groups (I to III) according to the WRKY and zinc-finger motif types. The 61 putative WRKY sequences were distributed on the 10 cacao chromosomes and 24 of them came from duplication events. The sequences were phylogenetically organized according to the general WRKY groups. The phylogenetic analysis revealed that subgroups IIa and IIb are sister groups and share a common ancestor, as well as subgroups IId and IIe. The most divergent groups according to the plant origin were IIc and III. According to the phylogenetic analysis, 7 TcWRKY genes were selected and analyzed by RT-qPCR in susceptible and resistant cacao plants infected (or not) with M. perniciosa. Some TcWRKY genes presented interesting responses to M. perniciosa such as Tc01_p014750/Tc06_p013130/AtWRKY28, Tc09_p001530/Tc06_p004420/AtWRKY40, Tc04_p016130/AtWRKY54 and Tc10_p016570/ AtWRKY70. Our results can help to select appropriate candidate genes for further characterization in cacao or in other Theobroma species.
Silva Monteiro de Almeida, Dayanne; Oliveira Jordão do Amaral, Daniel; Del-Bem, Luiz-Eduardo; Bronze dos Santos, Emily; Santana Silva, Raner José; Peres Gramacho, Karina; Vincentz, Michel
Transcriptional regulation, led by transcription factors (TFs) such as those of the WRKY family, is a mechanism used by the organism to enhance or repress gene expression in response to stimuli. Here, we report on the genome-wide analysis of the Theobroma cacao WRKY TF family and also investigate the expression of WRKY genes in cacao infected by the fungus Moniliophthora perniciosa. In the cacao genome, 61 non-redundant WRKY sequences were found and classified in three groups (I to III) according to the WRKY and zinc-finger motif types. The 61 putative WRKY sequences were distributed on the 10 cacao chromosomes and 24 of them came from duplication events. The sequences were phylogenetically organized according to the general WRKY groups. The phylogenetic analysis revealed that subgroups IIa and IIb are sister groups and share a common ancestor, as well as subgroups IId and IIe. The most divergent groups according to the plant origin were IIc and III. According to the phylogenetic analysis, 7 TcWRKY genes were selected and analyzed by RT-qPCR in susceptible and resistant cacao plants infected (or not) with M. perniciosa. Some TcWRKY genes presented interesting responses to M. perniciosa such as Tc01_p014750/Tc06_p013130/AtWRKY28, Tc09_p001530/Tc06_p004420/AtWRKY40, Tc04_p016130/AtWRKY54 and Tc10_p016570/ AtWRKY70. Our results can help to select appropriate candidate genes for further characterization in cacao or in other Theobroma species. PMID:29084273
Xu, Yingchun; Wang, Yanjie; Mattson, Neil; Yang, Liu; Jin, Qijiang
Trehalose-6-phosphate synthase (TPS) serves important functions in plant desiccation tolerance and response to environmental stimuli. At present, a comprehensive analysis, i.e. functional classification, molecular evolution, and expression patterns of this gene family are still lacking in Solanum tuberosum (potato). In this study, a comprehensive analysis of the TPS gene family was conducted in potato. A total of eight putative potato TPS genes (StTPSs) were identified by searching the latest potato genome sequence. The amino acid identity among eight StTPSs varied from 59.91 to 89.54%. Analysis of d N /d S ratios suggested that regions in the TPP (trehalose-6-phosphate phosphatase) domains evolved faster than the TPS domains. Although the sequence of the eight StTPSs showed high similarity (2571-2796 bp), their gene length is highly differentiated (3189-8406 bp). Many of the regulatory elements possibly related to phytohormones, abiotic stress and development were identified in different TPS genes. Based on the phylogenetic tree constructed using TPS genes of potato, and four other Solanaceae plants, TPS genes could be categorized into 6 distinct groups. Analysis revealed that purifying selection most likely played a major role during the evolution of this family. Amino acid changes detected in specific branches of the phylogenetic tree suggests relaxed constraints might have contributed to functional divergence among groups. Moreover, StTPSs were found to exhibit tissue and treatment specific expression patterns upon analysis of transcriptome data, and performing qRT-PCR. This study provides a reference for genome-wide identification of the potato TPS gene family and sets a framework for further functional studies of this important gene family in development and stress response.
Full Text Available Marek’s disease (MD is a commercially important neoplastic disease of chickens caused by Marek’s disease virus (MDV, an oncogenic alphaherpesvirus. Selecting for increased genetic resistance to MD is a control strategy that can augment vaccinal control measures. To identify high-confidence candidate MD resistance genes, we conducted a genome-wide screen for allele-specific expression (ASE amongst F1 progeny of two inbred chicken lines that differ in MD resistance. High throughput sequencing was used to profile transcriptomes from pools of uninfected and infected individuals at 4 days post-infection to identify any genes showing ASE in response to MDV infection. RNA sequencing identified 22,655 single nucleotide polymorphisms (SNPs of which 5,360 in 3,773 genes exhibited significant allelic imbalance. Illumina GoldenGate assays were subsequently used to quantify regulatory variation controlled at the gene (cis and elsewhere in the genome (trans by examining differences in expression between F1 individuals and artificial F1 RNA pools over 6 time periods in 1,536 of the most significant SNPs identified by RNA sequencing. Allelic imbalance as a result of cis-regulatory changes was confirmed in 861 of the 1,233 GoldenGate assays successfully examined. Furthermore we have identified 7 genes that display trans-regulation only in infected animals and approximately 500 SNP that show a complex interaction between cis- and trans-regulatory changes. Our results indicate ASE analyses are a powerful approach to identify regulatory variation responsible for differences in transcript abundance in genes underlying complex traits. And the genes with SNPs exhibiting ASE provide a strong foundation to further investigate the causative polymorphisms and genetic mechanisms for MD resistance. Finally, the methods used here for identifying specific genes and SNPs may have practical implications for applying marker-assisted selection to complex traits that are
Full Text Available Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL analysis in human adipose tissue of 119 men, where 592,794 single nucleotide polymorphisms (SNPs were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs in cis and 5,342 SNP-CpG pairs in trans showing significant associations between genotype and DNA methylation in adipose tissue after correction for multiple testing, where cis is defined as distance less than 500 kb between a SNP and CpG site. These mQTLs include reported obesity, lipid and type 2 diabetes loci, e.g. ADCY3/POMC, APOA5, CETP, FADS2, GCKR, SORT1 and LEPR. Significant mQTLs were overrepresented in intergenic regions meanwhile underrepresented in promoter regions and CpG islands. We further identified 635 SNPs in significant cis-mQTLs associated with expression of 86 genes in adipose tissue including CHRNA5, G6PC2, GPX7, RPL27A, THNSL2 and ZFP57. SNPs in significant mQTLs were also associated with body mass index (BMI, lipid traits and glucose and insulin levels in our study cohort and public available consortia data. Importantly, the Causal Inference Test (CIT demonstrates how genetic variants mediate their effects on metabolic traits (e.g. BMI, cholesterol, high-density lipoprotein (HDL, hemoglobin A1c (HbA1c and homeostatic model assessment of insulin resistance (HOMA-IR via altered DNA methylation in human adipose tissue. This study identifies genome-wide interactions between genetic and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dysmetabolic traits associated with the development of
Hirashima, Kyotaro; Seimiya, Hiroyuki
Telomere erosion causes cell mortality, suggesting that longer telomeres enable more cell divisions. In telomerase-positive human cancer cells, however, telomeres are often kept shorter than those of surrounding normal tissues. Recently, we showed that cancer cell telomere elongation represses innate immune genes and promotes their differentiation in vivo. This implies that short telomeres contribute to cancer malignancy, but it is unclear how such genetic repression is caused by elongated telomeres. Here, we report that telomeric repeat-containing RNA (TERRA) induces a genome-wide alteration of gene expression in telomere-elongated cancer cells. Using three different cell lines, we found that telomere elongation up-regulates TERRA signal and down-regulates innate immune genes such as STAT1, ISG15 and OAS3 in vivo. Ectopic TERRA oligonucleotides repressed these genes even in cells with short telomeres under three-dimensional culture conditions. This appeared to occur from the action of G-quadruplexes (G4) in TERRA, because control oligonucleotides had no effect and a nontelomeric G4-forming oligonucleotide phenocopied the TERRA oligonucleotide. Telomere elongation and G4-forming oligonucleotides showed similar gene expression signatures. Most of the commonly suppressed genes were involved in the innate immune system and were up-regulated in various cancers. We propose that TERRA G4 counteracts cancer malignancy by suppressing innate immune genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zhang, Jin; Liu, Bobin; Li, Jianbo; Zhang, Li; Wang, Yan; Zheng, Huanquan; Lu, Mengzhu; Chen, Jun
Heat shock proteins (Hsps) are molecular chaperones that are involved in many normal cellular processes and stress responses, and heat shock factors (Hsfs) are the transcriptional activators of Hsps. Hsfs and Hsps are widely coordinated in various biological processes. Although the roles of Hsfs and Hsps in stress responses have been well characterized in Arabidopsis, their roles in perennial woody species undergoing various environmental stresses remain unclear. Here, a comprehensive identification and analysis of Hsf and Hsp families in poplars is presented. In Populus trichocarpa, we identified 42 paralogous pairs, 66.7% resulting from a whole genome duplication. The gene structure and motif composition are relatively conserved in each subfamily. Microarray and quantitative real-time RT-PCR analyses showed that most of the Populus Hsf and Hsp genes are differentially expressed upon exposure to various stresses. A coexpression network between Populus Hsf and Hsp genes was generated based on their expression. Coordinated relationships were validated by transient overexpression and subsequent qPCR analyses. The comprehensive analysis indicates that different sets of PtHsps are downstream of particular PtHsfs and provides a basis for functional studies aimed at revealing the roles of these families in poplar development and stress responses.
Rund, Samuel S C; Yoo, Boyoung; Alam, Camille; Green, Taryn; Stephens, Melissa T; Zeng, Erliang; George, Gary F; Sheppard, Aaron D; Duffield, Giles E; Milenković, Tijana; Pfrender, Michael E
Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel
Full Text Available Alien chromosome substitution (CS lines are treated as vital germplasms for breeding and genetic mapping. Previously, a whole set of nine Brassica rapa-oleracea monosonic alien addition lines (MAALs, C1-C9 was established in the background of natural B. napus genotype “Oro,” after the restituted B. rapa (RBR for Oro was realized. Herein, a monosomic substitution line with one alien C1 chromosome (Cs1 in the RBR complement was selected in the progenies of MAAL C1 and RBR, by the PCR amplification of specific gene markers and fluorescence in situ hybridization. Cs1 exhibited the whole plant morphology similar to RBR except for the defective stamens without fertile pollen grains, but it produced some seeds and progeny plants carrying the C1 chromosome at high rate besides those without the alien chromosome after pollinated by RBR. The viability of the substitution and its progeny for the RBR diploid further elucidated the functional compensation between the chromosome pairs with high homoeology. To reveal the impact of such aneuploidy on genome-wide gene expression, the transcriptomes of MAAL C1, Cs1 and euploid RBR were analyzed. Compared to RBR, Cs1 had sharply reduced gene expression level across chromosome A1, demonstrating the loss of one copy of A1 chromosome. Both additional chromosome C1 in MAAL and substitutional chromosome C1 in Cs1 caused not only cis-effect but also prevalent trans-effect differentially expressed genes. A dominant gene dosage effects prevailed among low expressed genes across chromosome A1 in Cs1, and moreover, dosage effects for some genes potentially contributed to the phenotype deviations. Our results provided novel insights into the transcriptomic perturbation and gene dosage effects on phenotype in CS related to one naturally evolved allopolyploid.
Full Text Available Abstract Background Cellular function is regulated by the balance of stringently regulated amounts of mRNA. Previous reports revealed that RNA polymerase II (RNAPII, which transcribes mRNA, can be classified into the pausing state and the active transcription state according to the phosphorylation state of RPB1, the catalytic subunit of RNAPII. However, genome-wide association between mRNA expression level and the phosphorylation state of RNAPII is unclear. While the functional importance of pausing genes is clear, such as in mouse Embryonic Stem cells for differentiation, understanding this association is critical for distinguishing pausing genes from active transcribing genes in expression profiling data, such as microarrays and RNAseq. Therefore, we examined the correlation between the phosphorylation of RNAPII and mRNA expression levels using a combined analysis by ChIPseq and RNAseq. Results We first performed a precise quantitative measurement of mRNA by performing an optimized calculation in RNAseq. We then visualized the recruitment of various phosphorylated RNAPIIs, such as Ser2P and Ser5P. A combined analysis using optimized RNAseq and ChIPseq for phosphorylated RNAPII revealed that mRNA levels correlate with the various phosphorylation states of RNAPII. Conclusions We demonstrated that the amount of mRNA is precisely reflected by the phased phosphorylation of Ser2 and Ser5. In particular, even the most "pausing" genes, for which only Ser5 is phosphorylated, were detectable at a certain level of mRNA. Our analysis indicated that the complexity of quantitative regulation of mRNA levels could be classified into three categories according to the phosphorylation state of RNAPII.
Dung Tien Le
Full Text Available The availability of complete genome sequence of soybean has allowed research community to design the 66 K Affymetrix Soybean Array GeneChip for genome-wide expression profiling of soybean. In this study, we carried out microarray analysis of leaf tissues of soybean plants, which were subjected to drought stress from late vegetative V6 and from full bloom reproductive R2 stages. Our data analyses showed that out of 46,093 soybean genes, which were predicted with high confidence among approximately 66,000 putative genes, 41,059 genes could be assigned with a known function. Using the criteria of a ratio change > = 2 and a q-value<0.05, we identified 1458 and 1818 upregulated and 1582 and 1688 downregulated genes in drought-stressed V6 and R2 leaves, respectively. These datasets were classified into 19 most abundant biological categories with similar proportions. There were only 612 and 463 genes that were overlapped among the upregulated and downregulated genes, respectively, in both stages, suggesting that both conserved and unconserved pathways might be involved in regulation of drought response in different stages of plant development. A comparative expression analysis using our datasets and that of drought stressed Arabidopsis leaves revealed the existence of both conserved and species-specific mechanisms that regulate drought responses. Many upregulated genes encode either regulatory proteins, such as transcription factors, including those with high homology to Arabidopsis DREB, NAC, AREB and ZAT/STZ transcription factors, kinases and two-component system members, or functional proteins, e.g. late embryogenesis-abundant proteins, glycosyltransferases, glycoside hydrolases, defensins and glyoxalase I family proteins. A detailed analysis of the GmNAC family and the hormone-related gene category showed that expression of many GmNAC and hormone-related genes was altered by drought in V6 and/or R2 leaves. Additionally, the downregulation of
Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.
Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the
Väremo, Leif; Nielsen, Jens; Nookaew, Intawat
Gene set analysis (GSA) is used to elucidate genome-wide data, in particular transcriptome data. A multitude of methods have been proposed for this step of the analysis, and many of them have been compared and evaluated. Unfortunately, there is no consolidated opinion regarding what methods should be preferred, and the variety of available GSA software and implementations pose a difficulty for the end-user who wants to try out different methods. To address this, we have developed the R package Piano that collects a range of GSA methods into the same system, for the benefit of the end-user. Further on we refine the GSA workflow by using modifications of the gene-level statistics. This enables us to divide the resulting gene set P-values into three classes, describing different aspects of gene expression directionality at gene set level. We use our fully implemented workflow to investigate the impact of the individual components of GSA by using microarray and RNA-seq data. The results show that the evaluated methods are globally similar and the major separation correlates well with our defined directionality classes. As a consequence of this, we suggest to use a consensus scoring approach, based on multiple GSA runs. In combination with the directionality classes, this constitutes a more thorough basis for an enriched biological interpretation. PMID:23444143
Coppola, Mariateresa; van Meijgaarden, Krista E.; Franken, Kees L. M. C.
-wide transcriptomics of Mtb infected lungs we developed data sets and methods to identify IVE-TB (in-vivo expressed Mtb) antigens expressed in the lung. Quantitative expression analysis of 2,068 Mtb genes from the predicted first operons identified the most upregulated IVE-TB genes during in-vivo pulmonary infection...
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Matschegewski, Claudia; Zetzsche, Holger; Hasan, Yaser; Leibeguth, Lena; Briggs, William; Ordon, Frank; Uptmoor, Ralf
Cauliflower (Brassica oleracea var. botrytis) is a vernalization-responsive crop. High ambient temperatures delay harvest time. The elucidation of the genetic regulation of floral transition is highly interesting for a precise harvest scheduling and to ensure stable market supply. This study aims at genetic dissection of temperature-dependent curd induction in cauliflower by genome-wide association studies and gene expression analysis. To assess temperature-dependent curd induction, two greenhouse trials under distinct temperature regimes were conducted on a diversity panel consisting of 111 cauliflower commercial parent lines, genotyped with 14,385 SNPs. Broad phenotypic variation and high heritability (0.93) were observed for temperature-related curd induction within the cauliflower population. GWA mapping identified a total of 18 QTL localized on chromosomes O1, O2, O3, O4, O6, O8, and O9 for curding time under two distinct temperature regimes. Among those, several QTL are localized within regions of promising candidate flowering genes. Inferring population structure and genetic relatedness among the diversity set assigned three main genetic clusters. Linkage disequilibrium (LD) patterns estimated global LD extent of r2 = 0.06 and a maximum physical distance of 400 kb for genetic linkage. Transcriptional profiling of flowering genes FLOWERING LOCUS C (BoFLC) and VERNALIZATION 2 (BoVRN2) was performed, showing increased expression levels of BoVRN2 in genotypes with faster curding. However, functional relevance of BoVRN2 and BoFLC2 could not consistently be supported, which probably suggests to act facultative and/or might evidence for BoVRN2/BoFLC-independent mechanisms in temperature-regulated floral transition in cauliflower. Genetic insights in temperature-regulated curd induction can underpin genetically informed phenology models and benefit molecular breeding strategies toward the development of thermo-tolerant cultivars. PMID:26442034
Zhao, Peng; Wang, Dongdong; Wang, Ruoqiu; Kong, Nana; Zhang, Chao; Yang, Chenghui; Wu, Wentao; Ma, Haoli; Chen, Qin
Heat shock proteins (Hsps) are essential components in plant tolerance mechanism under various abiotic stresses. Hsp20 is the major family of heat shock proteins, but little of Hsp20 family is known in potato (Solanum tuberosum), which is an important vegetable crop that is thermosensitive. To reveal the mechanisms of potato Hsp20s coping with abiotic stresses, analyses of the potato Hsp20 gene family were conducted using bioinformatics-based methods. In total, 48 putative potato Hsp20 genes (StHsp20s) were identified and named according to their chromosomal locations. A sequence analysis revealed that most StHsp20 genes (89.6%) possessed no, or only one, intron. A phylogenetic analysis indicated that all of the StHsp20 genes, except 10, were grouped into 12 subfamilies. The 48 StHsp20 genes were randomly distributed on 12 chromosomes. Nineteen tandem duplicated StHsp20s and one pair of segmental duplicated genes (StHsp20-15 and StHsp20-48) were identified. A cis-element analysis inferred that StHsp20s, except for StHsp20-41, possessed at least one stress response cis-element. A heatmap of the StHsp20 gene family showed that the genes, except for StHsp20-2 and StHsp20-45, were expressed in various tissues and organs. Real-time quantitative PCR was used to detect the expression level of StHsp20 genes and demonstrated that the genes responded to multiple abiotic stresses, such as heat, salt or drought stress. The relative expression levels of 14 StHsp20 genes (StHsp20-4, 6, 7, 9, 20, 21, 33, 34, 35, 37, 41, 43, 44 and 46) were significantly up-regulated (more than 100-fold) under heat stress. These results provide valuable information for clarifying the evolutionary relationship of the StHsp20 family and in aiding functional characterization of StHsp20 genes in further research.
Full Text Available Abstract Background Interferons (IFNs are potent antiviral cytokines capable of reprogramming the macrophage phenotype through the induction of interferon-stimulated genes (ISGs. Here we have used targeted RNA interference to suppress the expression of a number of key genes associated with IFN signalling in murine macrophages prior to stimulation with interferon-gamma. Genome-wide changes in transcript abundance caused by siRNA activity were measured using exon-level microarrays in the presence or absence of IFNγ. Results Transfection of murine bone-marrow derived macrophages (BMDMs with a non-targeting (control siRNA and 11 sequence-specific siRNAs was performed using a cationic lipid transfection reagent (Lipofectamine2000 prior to stimulation with IFNγ. Total RNA was harvested from cells and gene expression measured on Affymetrix GeneChip Mouse Exon 1.0 ST Arrays. Network-based analysis of these data revealed six siRNAs to cause a marked shift in the macrophage transcriptome in the presence or absence IFNγ. These six siRNAs targeted the Ifnb1, Irf3, Irf5, Stat1, Stat2 and Nfkb2 transcripts. The perturbation of the transcriptome by the six siRNAs was highly similar in each case and affected the expression of over 600 downstream transcripts. Regulated transcripts were clustered based on co-expression into five major groups corresponding to transcriptional networks associated with the type I and II IFN response, cell cycle regulation, and NF-KB signalling. In addition we have observed a significant non-specific immune stimulation of cells transfected with siRNA using Lipofectamine2000, suggesting use of this reagent in BMDMs, even at low concentrations, is enough to induce a type I IFN response. Conclusion Our results provide evidence that the type I IFN response in murine BMDMs is dependent on Ifnb1, Irf3, Irf5, Stat1, Stat2 and Nfkb2, and that siRNAs targeted to these genes results in perturbation of key transcriptional networks associated
Desta, Zeratsion Abera; Ortiz, Rodomiro
Association analysis is used to measure relations between markers and quantitative trait loci (QTL). Their estimation ignores genes with small effects that trigger underpinning quantitative traits. By contrast, genome-wide selection estimates marker effects across the whole genome on the target population based on a prediction model developed in the training population (TP). Whole-genome prediction models estimate all marker effects in all loci and capture small QTL effects. Here, we review several genomic selection (GS) models with respect to both the prediction accuracy and genetic gain from selection. Phenotypic selection or marker-assisted breeding protocols can be replaced by selection, based on whole-genome predictions in which phenotyping updates the model to build up the prediction accuracy. Copyright © 2014 Elsevier Ltd. All rights reserved.
Background The Hepatitis B Virus (HBV) HBx regulatory protein is required for HBV replication and involved in HBV-related carcinogenesis. HBx interacts with chromatin modifying enzymes and transcription factors to modulate histone post-translational modifications and to regulate viral cccDNA transcription and cellular gene expression. Aiming to identify genes and non-coding RNAs (ncRNAs) directly targeted by HBx, we performed a chromatin immunoprecipitation sequencing (ChIP-Seq) to analyse HBV recruitment on host cell chromatin in cells replicating HBV. Results ChIP-Seq high throughput sequencing of HBx-bound fragments was used to obtain a high-resolution, unbiased, mapping of HBx binding sites across the genome in HBV replicating cells. Protein-coding genes and ncRNAs involved in cell metabolism, chromatin dynamics and cancer were enriched among HBx targets together with genes/ncRNAs known to modulate HBV replication. The direct transcriptional activation of genes/miRNAs that potentiate endocytosis (Ras-related in brain (RAB) GTPase family) and autophagy (autophagy related (ATG) genes, beclin-1, miR-33a) and the transcriptional repression of microRNAs (miR-138, miR-224, miR-576, miR-596) that directly target the HBV pgRNA and would inhibit HBV replication, contribute to HBx-mediated increase of HBV replication. Conclusions Our ChIP-Seq analysis of HBx genome wide chromatin recruitment defined the repertoire of genes and ncRNAs directly targeted by HBx and led to the identification of new mechanisms by which HBx positively regulates cccDNA transcription and HBV replication.
Melvin Anyasi Ambele
Full Text Available We have undertaken an in-depth transcriptome analysis of adipogenesis in human adipose-derived stromal cells (ASCs induced to differentiate into adipocytes in vitro. Gene expression was assessed on days 1, 7, 14 and 21 post-induction and genes differentially expressed numbered 128, 218, 253 and 240 respectively. Up-regulated genes were associated with blood vessel development, leukocyte migration, as well as tumor growth, invasion and metastasis. They also shared common pathways with certain obesity-related pathophysiological conditions. Down-regulated genes were enriched for immune response processes. KLF15, LMO3, FOXO1 and ZBTB16 transcription factors were up-regulated throughout the differentiation process. CEBPA, PPARG, ZNF117, MLXIPL, MMP3 and RORB were up-regulated only on days 14 and 21, which coincide with the maturation of adipocytes and could possibly serve as candidates for controlling fat accumulation and the size of mature adipocytes. In summary, we have identified genes that were up-regulated only on days 1 and 7 or days 14 and 21 that could serve as potential early and late-stage differentiation markers.
Navarro, Fábio C P; Galante, Pedro A F
Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Full Text Available Acetyl-CoA carboxylase is an important enzyme, which catalyzes acetyl-CoA’s carboxylation to produce malonyl-CoA and to serve as a committed step for de novo fatty acid biosynthesis in plastids. In this study, 24 putative cotton BCCP genes were identified based on the lately published genome data in Gossypium. Among them, 4, 4, 8, and 8 BCCP homologs were identified in Gossypium raimondii, G. arboreum, G. hirsutum, and G. barbadense, respectively. These genes were divided into two classes based on a phylogenetic analysis. In each class, these homologs were relatively conserved in gene structure and motifs. The chromosomal distribution pattern revealed that all the BCCP genes were distributed equally on corresponding chromosomes or scaffold in the four cotton species. Segmental duplication was a predominant duplication event in both of G. hirsutum and G. barbadense. The analysis of the expression profile showed that 8 GhBCCP genes expressed in all the tested tissues with changed expression levels, and GhBCCP genes belonging to class II were predominantly expressed in developing ovules. Meanwhile, the expression analysis for the 16 cotton BCCP genes from G. raimondii, G. arboreum and G. hirsutum showed that they were induced or suppressed by cold or salt stress, and their expression patterns varied among different tissues. These findings will help to determine the functional and evolutionary characteristics of the BCCP genes in Gossypium species.
Wei, Ling; Yang, Chao; Tao, Wenjing; Wang, Deshou
The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG) box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus), and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Full Text Available The Sox transcription factor family is characterized with the presence of a Sry-related high-mobility group (HMG box and plays important roles in various biological processes in animals, including sex determination and differentiation, and the development of multiple organs. In this study, 27 Sox genes were identified in the genome of the Nile tilapia (Oreochromis niloticus, and were classified into seven groups. The members of each group of the tilapia Sox genes exhibited a relatively conserved exon-intron structure. Comparative analysis showed that the Sox gene family has undergone an expansion in tilapia and other teleost fishes following their whole genome duplication, and group K only exists in teleosts. Transcriptome-based analysis demonstrated that most of the tilapia Sox genes presented stage-specific and/or sex-dimorphic expressions during gonadal development, and six of the group B Sox genes were specifically expressed in the adult brain. Our results provide a better understanding of gene structure and spatio-temporal expression of the Sox gene family in tilapia, and will be useful for further deciphering the roles of the Sox genes during sex determination and gonadal development in teleosts.
Zega, Alessandra; D'Ovidio, Renato
Pectin methyl esterase (PME) genes code for enzymes that are involved in structural modifications of the plant cell wall during plant growth and development. They are also involved in plant-pathogen interaction. PME genes belong to a multigene family and in this study we report the first comprehensive analysis of the PME gene family in bread wheat (Triticum aestivum L.). Like in other species, the members of the TaPME family are dispersed throughout the genome and their encoded products retain the typical structural features of PMEs. qRT-PCR analysis showed variation in the expression pattern of TaPME genes in different tissues and revealed that these genes are mainly expressed in flowering spikes. In our attempt to identify putative TaPME genes involved in wheat defense, we revealed a strong variation in the expression of the TaPME following Fusarium graminearum infection, the causal agent of Fusarium head blight (FHB). Particularly interesting was the finding that the expression profile of some PME genes was markedly different between the FHB-resistant wheat cultivar Sumai3 and the FHB-susceptible cultivar Bobwhite, suggesting a possible involvement of these PME genes in FHB resistance. Moreover, the expression analysis of the TaPME genes during F. graminearum progression within the spike revealed those genes that responded more promptly to pathogen invasion. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Atopic dermatitis (AD) is a common inflammatory skin disorder with a strong genetic component. Genome-wide association studies have been successful in the identification of common single nucleotide polymorphisms associated with AD, but their functional relevance has not been investigated yet. This work presents a comprehensive functional characterization of common and infrequent variants at the AD-associated C11orf30/LRRC32 locus. Analyses of cutaneous gene expression profiles in AD patients ...
Bergholdt, Regine; Brorsson, Caroline; Palleja, Albert
Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated with dis......-cells. Our results provide novel insight to the mechanisms behind type 1 diabetes pathogenesis and, thus, may provide the basis for the design of novel treatment strategies.......Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated...... with disease, and they do not typically inform the broader context in which the disease genes operate. Here, we integrated type 1 diabetes GWAS data with protein-protein interactions to construct biological networks of relevance for disease. A total of 17 networks were identified. To prioritize...
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa , Zea mays , Sorghum bicolor , Cicer arietinum , and Vitis vinifera , and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii , Physcomitrella patens , and Amborella trichopoda , revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice ( OsAlba ), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu
Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.
Guo, Meng; Lu, Jin-Ping; Zhai, Yu-Fei; Chai, Wei-Guo; Gong, Zhen-Hui; Lu, Ming-Hui
Heat shock factors (Hsfs) play crucial roles in plant developmental and defence processes. The production and quality of pepper (Capsicum annuum L.), an economically important vegetable crop, are severely reduced by adverse environmental stress conditions, such as heat, salt and osmotic stress. Although the pepper genome has been fully sequenced, the characterization of the Hsf gene family under abiotic stress conditions remains incomplete. A total of 25 CaHsf members were identified in the pepper genome by bioinformatics analysis and PCR assays. They were grouped into three classes, CaHsfA, B and C, based on highly conserved Hsf domains, were distributed over 11 of 12 chromosomes, with none found on chromosome 11, and all of them, except CaHsfA5, formed a protein-protein interaction network. According to the RNA-seq data of pepper cultivar CM334, most CaHsf members were expressed in at least one tissue among root, stem, leaf, pericarp and placenta. Quantitative real-time PCR assays showed that all of the CaHsfs responded to heat stress (40 °C for 2 h), except CaHsfC1 in thermotolerant line R9 leaves, and that the expression patterns were different from those in thermosensitive line B6. Many CaHsfs were also regulated by salt and osmotic stresses, as well as exogenous Ca(2+), putrescine, abscisic acid and methyl jasmonate. Additionally, CaHsfA2 was located in the nucleus and had transcriptional activity, consistent with the typical features of Hsfs. Time-course expression profiling of CaHsfA2 in response to heat stress revealed differences in its expression level and pattern between the pepper thermosensitive line B6 and thermotolerant line R9. Twenty-five Hsf genes were identified in the pepper genome and most of them responded to heat, salt, osmotic stress, and exogenous substances, which provided potential clues for further analyses of CaHsfs functions in various kinds of abiotic stresses and of corresponding signal transduction pathways in pepper.
Full Text Available A number of transcriptome datasets for differential expression (DE genes have been widely used for understanding organismal biology, but these datasets also contain untapped information that can be used to develop more precise analytical tools. With the use of transcriptome data generated from poplar/canker disease interaction system, we describe a methodology to identify candidate reference genes from high-throughput sequencing data. This methodology will improve the accuracy of RT-qPCR and will lead to better standards for the normalization of expression data. Expression stability analysis from xylem and phloem of Populus bejingensis inoculated with the fungal canker pathogen Botryosphaeria dothidea revealed that 729 poplar transcripts (1.11% were stably expressed, at a threshold level of coefficient of variance (CV of FPKM < 20% and maximum fold change (MFC of FPKM < 2.0. Expression stability and bioinformatics analysis suggested that commonly used house-keeping (HK genes were not the most appropriate internal controls: 70 of the 72 commonly used HK genes were not stably expressed, 45 of the 72 produced multiple isoform transcripts, and some of their reported primers produced unspecific amplicons in PCR amplification. RT-qPCR analysis to compare and evaluate the expression stability of 10 commonly used poplar HK genes and 20 of the 729 newly-identified stably expressed transcripts showed that some of the newly-identified genes (such as SSU_S8e, LSU_L5e, and 20S_PSU had higher stability ranking than most of commonly used HK genes. Based on these results, we recommend a pipeline for deriving reference genes from transcriptome data. An appropriate candidate gene should have a unique transcript, constitutive expression, CV value of expression < 20% (or possibly 30% and MFC value of expression <2, and an expression level of 50–1,000 units. Lastly, when four of the newly identified HK genes were used in the normalization of expression data for 20
Maximova, Siela N; Florez, Sergio; Shen, Xiangling; Niemenak, Nicolas; Zhang, Yufan; Curtis, Wayne; Guiltinan, Mark J
Theobroma cacao L. is a tropical fruit tree, the seeds of which are used to create chocolate. In vitro somatic embryogenesis (SE) of cacao is a propagation system useful for rapid mass-multiplication to accelerate breeding programs and to provide plants directly to farmers. Two major limitations of cacao SE remain: the efficiency of embryo production is highly genotype dependent and the lack of full cotyledon development results in low embryo to plant conversion rates. With the goal to better understand SE development and to improve the efficiency of SE conversion we examined gene expression differences between zygotic and somatic embryos using a whole genome microarray. The expression of 28,752 genes was determined at 4 developmental time points during zygotic embryogenesis (ZE) and 2 time points during cacao somatic embryogenesis (SE). Within the ZE time course, 10,288 differentially expressed genes were enriched for functions related to responses to abiotic and biotic stimulus, metabolic and cellular processes. A comparison ZE and SE expression profiles identified 10,175 differentially expressed genes. Many TF genes, putatively involved in ethylene metabolism and response, were more strongly expressed in SEs as compared to ZEs. Expression levels of genes involved in fatty acid metabolism, flavonoid biosynthesis and seed storage protein genes were also differentially expressed in the two types of embryos. Large numbers of genes were differentially regulated during various stages of both ZE and SE development in cacao. The relatively higher expression of ethylene and flavonoid related genes during SE suggests that the developing tissues may be experiencing high levels of stress during SE maturation caused by the in vitro environment. The expression of genes involved in the synthesis of auxin, polyunsaturated fatty acids and secondary metabolites was higher in SEs relative to ZEs despite lack of lipid and metabolite accumulation. These differences in gene
Full Text Available Most eukaryotic pathogens have complex life cycles in which gene expression networks orchestrate the formation of cells specialized for dissemination or host colonization. In the oomycete Phytophthora infestans, the potato late blight pathogen, major shifts in mRNA profiles during developmental transitions were identified using microarrays. We used those data with search algorithms to discover about 100 motifs that are over-represented in promoters of genes up-regulated in hyphae, sporangia, sporangia undergoing zoosporogenesis, swimming zoospores, or germinated cysts forming appressoria (infection structures. Most of the putative stage-specific transcription factor binding sites (TFBSs thus identified had features typical of TFBSs such as position or orientation bias, palindromy, and conservation in related species. Each of six motifs tested in P. infestans transformants using the GUS reporter gene conferred the expected stage-specific expression pattern, and several were shown to bind nuclear proteins in gel-shift assays. Motifs linked to the appressoria-forming stage, including a functionally validated TFBS, were over-represented in promoters of genes encoding effectors and other pathogenesis-related proteins. To understand how promoter and genome architecture influence expression, we also mapped transcription patterns to the P. infestans genome assembly. Adjacent genes were not typically induced in the same stage, including genes transcribed in opposite directions from small intergenic regions, but co-regulated gene pairs occurred more than expected by random chance. These data help illuminate the processes regulating development and pathogenesis, and will enable future attempts to purify the cognate transcription factors.
Nault, Rance; Kim, Suntae; Zacharewski, Timothy R., E-mail: firstname.lastname@example.org
Although the structure and function of the AhR are conserved, emerging evidence suggests that downstream effects are species-specific. In this study, rat hepatic gene expression data from the DrugMatrix database (National Toxicology Program) were compared to mouse hepatic whole-genome gene expression data following treatment with 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD). For the DrugMatrix study, male Sprague–Dawley rats were gavaged daily with 20 μg/kg TCDD for 1, 3 and 5 days, while female C57BL/6 ovariectomized mice were examined 1, 3 and 7 days after a single oral gavage of 30 μg/kg TCDD. A total of 649 rat and 1386 mouse genes (|fold change| ≥ 1.5, P1(t) ≥ 0.99) were differentially expressed following treatment. HomoloGene identified 11,708 orthologs represented across the rat Affymetrix 230 2.0 GeneChip (12,310 total orthologs), and the mouse 4 × 44K v.1 Agilent oligonucleotide array (17,578 total orthologs). Comparative analysis found 563 and 922 orthologs differentially expressed in response to TCDD in the rat and mouse, respectively, with 70 responses associated with immune function and lipid metabolism in common to both. Moreover, QRTPCR analysis of Ceacam1, showed divergent expression (induced in rat; repressed in mouse) functionally consistent with TCDD-elicited hepatic steatosis in the mouse but not the rat. Functional analysis identified orthologs involved in nucleotide binding and acetyltransferase activity in rat, while mouse-specific responses were associated with steroid, phospholipid, fatty acid, and carbohydrate metabolism. These results provide further evidence that TCDD elicits species-specific regulation of distinct gene networks, and outlines considerations for future comparisons of publicly available microarray datasets. - Highlights: ► We performed a whole-genome comparison of TCDD-regulated genes in mice and rats. ► Previous species comparisons were extended using data from the DrugMatrix database. ► Less than 15% of TCDD
Full Text Available The natural sex reversal severely affects the sex ratio and thus decreases the productivity of the rice field eel (Monopterus albus. How to understand and manipulate this process is one of the major issues for the rice field eel stocking. So far the genomics and transcriptomics data available for this species are still scarce. Here we provide a comprehensive study of transcriptomes of brain and gonad tissue in three sex stages (female, intersex and male from the rice field eel to investigate changes in transcriptional level during the sex reversal process.Approximately 195 thousand unigenes were generated and over 44.4 thousand were functionally annotated. Comparative study between stages provided multiple differentially expressed genes in brain and gonad tissue. Overall 4668 genes were found to be of unequal abundance between gonad tissues, far more than that of the brain tissues (59 genes. These genes were enriched in several different signaling pathways. A number of 231 genes were found with different levels in gonad in each stage, with several reproduction-related genes included. A total of 19 candidate genes that could be most related to sex reversal were screened out, part of these genes' expression patterns were validated by RT-qPCR. The expression of spef2, maats1, spag6 and dmc1 were abundant in testis, but was barely detected in females, while the 17β-hsd12, zpsbp3, gal3 and foxn5 were only expressed in ovary.This study investigated the complexity of brain and gonad transcriptomes in three sex stages of the rice field eel. Integrated analysis of different gene expression and changes in signaling pathways, such as PI3K-Akt pathway, provided crucial data for further study of sex transformation mechanisms.
Linka, Y; Ginzel, S; Krüger, M; Novosel, A; Gombert, M; Kremmer, E; Harbott, J; Thiele, R; Borkhardt, A; Landgraf, P
The reciprocal translocation t(12;21)(p13;q22), the most common structural genomic alteration in B-cell precursor acute lymphoblastic leukaemia in children, results in a chimeric transcription factor TEL-AML1 (ETV6-RUNX1). We identified directly and indirectly regulated target genes utilizing an inducible TEL-AML1 system derived from the murine pro B-cell line BA/F3 and a monoclonal antibody directed against TEL-AML1. By integration of promoter binding identified with chromatin immunoprecipitation (ChIP)-on-chip, gene expression and protein output through microarray technology and stable labelling of amino acids in cell culture, we identified 217 directly and 118 indirectly regulated targets of the TEL-AML1 fusion protein. Directly, but not indirectly, regulated promoters were enriched in AML1-binding sites. The majority of promoter regions were specific for the fusion protein and not bound by native AML1 or TEL. Comparison with gene expression profiles from TEL-AML1-positive patients identified 56 concordantly misregulated genes with negative effects on proliferation and cellular transport mechanisms and positive effects on cellular migration, and stress responses including immunological responses. In summary, this work for the first time gives a comprehensive insight into how TEL-AML1 expression may directly and indirectly contribute to alter cells to become prone for leukemic transformation
Han, Jeonghoon; Kim, Duck-Hyun; Kim, Hui-Su; Nelson, David R; Lee, Jae-Seong
Cytochrome P450s (CYPs) are enzymes with a heme-binding domain that are found in all living organisms. CYP enzymes have important roles associated with detoxification of xenobiotics and endogenous compounds (e.g. steroids, fatty acids, and hormones). Although CYP enzymes have been reported in several invertebrates, including insects, little is known about copepod CYPs. Here, we identified the entire repertoire of CYP genes (n=52) from whole genome and transcriptome sequences of the benthic copepod Tigriopus japonicus, including a tandem duplication (CYP3026A3, CYP3026A4, CYP3026A5), and examined patterns of gene expression over various developmental stages and in response to benzo[α]pyrene (B[α]P) exposure. Through phylogenetic analysis, the 52 T. japonicus CYP genes were assigned to five distinct clans: CYP2 (22 genes), CYP3 (19 genes), CYP4 (two genes), CYP20 (one gene), and mitochondrial (eight genes). Developmental stage and gender-specific expression patterns of the 52 T. japonicus CYPs were analyzed. CYP3022A1 was constitutively expressed during all developmental stages. CYP genes in clans 2 and 3 were induced in response to B[α]P, suggesting that these differentially modulated CYP transcripts are likely involved in defense against exposure to B[α]P and other pollutants. This study enhances our understanding of the repertoire of CYP genes in copepods and of their potential role in development and detoxification in copepods. Copyright © 2017 Elsevier Inc. All rights reserved.
Jing, Hua; Li, Chao; Ma, Fang; Ma, Ji-Hui; Khan, Abid; Wang, Xiao; Zhao, Li-Yang; Gong, Zhen-Hui; Chen, Ru-Gang
Dehydrins (DHNs) play a crucial role in enhancing abiotic stress tolerance in plants. Although DHNs have been identified and characterized in many plants, there is little known about Capsicum annuum L., one of the economically important vegetable crops. In this study, seven CaDHNs in the pepper genome were identified, which could be divided into two classes: YnSKn- and SKn-type, based on their highly conserved domains. Quantitative real-time PCR (qRT-PCR) results showed that the seven DHN genes were expressed in all tissues and might be involved in the growth and development of pepper. The gene expression profiles analysis suggested that most of the CaDHN genes were induced by various stresses (low temperature, salt and mannitol) and signaling molecules (ABA, SA and MeJA). Furthermore, the CaDHN3 (YSK2)-silenced pepper plants showed obvious lower resistance to abiotic stresses (cold, salt and mannitol) than the control plants (TRV2:00). So the CaDHN3 might act as a positive role in resisting abiotic stresses. This study lays the foundation for further studies into the regulation of their expression under various conditions.
Liu, Jianxia; Wang, Runmei; Liu, Wenying; Zhang, Hongli; Guo, Yaodong; Wen, Riyu
Heat-shock proteins (HSPs) are ubiquitous proteins with important roles in response to biotic and abiotic stress. The 70-kDa heat-shock genes ( Hsp70s ) encode a group of conserved chaperone proteins that play central roles in cellular networks of molecular chaperones and folding catalysts across all the studied organisms including bacteria, plants and animals. Several Hsp70s involved in drought tolerance have been well characterized in various plants, whereas no research on Chenopodium quinoa HSPs has been completed. Here, we analyzed the genome of C. quinoa and identified sixteen Hsp70 members in quinoa genome. Phylogenetic analysis revealed the independent origination of those Hsp70 members, with eight paralogous pairs comprising the Hsp70 family in quinoa. While the gene structure and motif analysis showed high conservation of those paralogous pairs, the synteny analysis of those paralogous pairs provided evidence for expansion coming from the polyploidy event. With several subcellular localization signals detected in CqHSP70 protein paralogous pairs, some of the paralogous proteins lost the localization information, indicating the diversity of both subcellular localizations and potential functionalities of those HSP70s. Further gene expression analyses revealed by quantitative polymerase chain reaction (qPCR) analysis illustrated the significant variations of Cqhsp70s in response to drought stress. In conclusion, the sixteen Cqhsp70 s undergo lineage-specific expansions and might play important and varied roles in response to drought stress.
Tan, Yanxiao; Wang, Suncai; Liang, Dong; Li, Mingjun; Ma, Fengwang
Cystatins or phytocystatins (PhyCys) comprise a family of plant-specific inhibitors of cysteine proteinases. Such inhibitors are thought to be involved in the regulation of several endogenous processes as well as defense against biotic or abiotic stresses. However, information about this family is limited in apple. We identified 26 PhyCys genes within the entire apple genome. They were clustered into three distinct groups distributed across several chromosomes. All of their putative proteins contained one or two typical cystatin domains, which shared the characteristic motifs of PhyCys. Eight selected genes displayed differential expression patterns in various tissues. Moreover, their transcript levels were also up-regulated significantly in leaves during maturation, senescence or in response to treatment with one or more abiotic stresses. Our results indicated that members of this family may function in tissue development, leaf senescence, and adaptation to adverse environments in apple. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Xu, Li-Xin; Holland, Heidrun; Kirsten, Holger; Ahnert, Peter; Krupp, Wolfgang; Bauer, Manfred; Schober, Ralf; Mueller, Wolf; Fritzsch, Dominik; Meixensberger, Jürgen; Koschny, Ronald
According to the World Health Organization gangliogliomas are classified as well-differentiated and slowly growing neuroepithelial tumors, composed of neoplastic mature ganglion and glial cells. It is the most frequent tumor entity observed in patients with long-term epilepsy. Comprehensive cytogenetic and molecular cytogenetic data including high-resolution genomic profiling (single nucleotide polymorphism (SNP)-array) of gangliogliomas are scarce but necessary for a better oncological understanding of this tumor entity. For a detailed characterization at the single cell and cell population levels, we analyzed genomic alterations of three gangliogliomas using trypsin-Giemsa banding (GTG-banding) and by spectral karyotyping (SKY) in combination with SNP-array and gene expression array experiments. By GTG and SKY, we could confirm frequently detected chromosomal aberrations (losses within chromosomes 10, 13 and 22; gains within chromosomes 5, 7, 8 and 12), and identify so far unknown genetic aberrations like the unbalanced non-reciprocal translocation t(1;18)(q21;q21). Interestingly, we report on the second so far detected ganglioglioma with ring chromosome 1. Analyses of SNP-array data from two of the tumors and respective germline DNA (peripheral blood) identified few small gains and losses and a number of copy-neutral regions with loss of heterozygosity (LOH) in germline and in tumor tissue. In comparison to germline DNA, tumor tissues did not show substantial regions with significant loss or gain or with newly developed LOH. Gene expression analyses of tumor-specific genes revealed similarities in the profile of the analyzed samples regarding different relevant pathways. Taken together, we describe overlapping but also distinct and novel genetic aberrations of three gangliogliomas. © 2014 Japanese Society of Neuropathology.
Wang, Ningning; Zhang, Di; Wang, Zhenhui; Xun, Hongwei; Ma, Jian; Wang, Hui; Huang, Wei; Liu, Ying; Lin, Xiuyun; Li, Ning; Ou, Xiufang; Zhang, Chunyu; Wang, Ming-Bo; Liu, Bao
Endogenous small (sm) RNAs (primarily si- and miRNAs) are important trans/cis-acting regulators involved in diverse cellular functions. In plants, the RNA-dependent RNA polymerases (RDRs) are essential for smRNA biogenesis. It has been established that RDR2 is involved in the 24 nt siRNA-dependent RNA-directed DNA methylation (RdDM) pathway. Recent studies have suggested that RDR1 is involved in a second RdDM pathway that relies mostly on 21 nt smRNAs and functions to silence a subset of genomic loci that are usually refractory to the normal RdDM pathway in Arabidopsis. Whether and to what extent the homologs of RDR1 may have similar functions in other plants remained unknown. We characterized a loss-of-function mutant (Osrdr1) of the OsRDR1 gene in rice (Oryza sativa L.) derived from a retrotransposon Tos17 insertion. Microarray analysis identified 1,175 differentially expressed genes (5.2% of all expressed genes in the shoot-tip tissue of rice) between Osrdr1 and WT, of which 896 and 279 genes were up- and down-regulated, respectively, in Osrdr1. smRNA sequencing revealed regional alterations in smRNA clusters across the rice genome. Some of the regions with altered smRNA clusters were associated with changes in DNA methylation. In addition, altered expression of several miRNAs was detected in Osrdr1, and at least some of which were associated with altered expression of predicted miRNA target genes. Despite these changes, no phenotypic difference was identified in Osrdr1 relative to WT under normal condition; however, ephemeral phenotypic fluctuations occurred under some abiotic stress conditions. Our results showed that OsRDR1 plays a role in regulating a substantial number of endogenous genes with diverse functions in rice through smRNA-mediated pathways involving DNA methylation, and which participates in abiotic stress response.
He, Peng; Zhao, Peng; Wang, Limin; Zhang, Yuzhou; Wang, Xiaosi; Xiao, Hui; Yu, Jianing; Xiao, Guanghui
Cell elongation and expansion are significant contributors to plant growth and morphogenesis, and are often regulated by environmental cues and endogenous hormones. Auxin is one of the most important phytohormones involved in the regulation of plant growth and development and plays key roles in plant cell expansion and elongation. Cotton fiber cells are a model system for studying cell elongation due to their large size. Cotton is also the world's most utilized crop for the production of natural fibers for textile and garment industries, and targeted expression of the IAA biosynthetic gene iaaM increased cotton fiber initiation. Polar auxin transport, mediated by PIN and AUX/LAX proteins, plays a central role in the control of auxin distribution. However, very limited information about PIN-FORMED (PIN) efflux carriers in cotton is known. In this study, 17 PIN-FORMED (PIN) efflux carrier family members were identified in the Gossypium hirsutum (G. hirsutum) genome. We found that PIN1-3 and PIN2 genes originated from the At subgenome were highly expressed in roots. Additionally, evaluation of gene expression patterns indicated that PIN genes are differentially induced by various abiotic stresses. Furthermore, we found that the majority of cotton PIN genes contained auxin (AuxREs) and salicylic acid (SA) responsive elements in their promoter regions were significantly up-regulated by exogenous hormone treatment. Our results provide a comprehensive analysis of the PIN gene family in G. hirsutum, including phylogenetic relationships, chromosomal locations, and gene expression and gene duplication analyses. This study sheds light on the precise roles of PIN genes in cotton root development and in adaption to stress responses.
Full Text Available The Hsp20 genes are present in all plant species and play important roles in alleviating heat stress and enhancing plant thermotolerance by preventing the irreversible aggregation of denaturing proteins. However, very little is known about the CaHsp20 gene family in pepper (Capsicum annuum L., an important vegetable crop with character of temperate but thermosensitive. In this study, a total of 35 putative pepper Hsp20 genes (CaHsp20s were identified and renamed on the basis of their molecular weight, and then their gene structure, genome location, gene duplication, phylogenetic relationship and interaction network were also analyzed. The expression patterns of CaHsp20 genes in four different tissues (root, stem, leaf and flower from the thermotolerant line R9 under heat stress condition were measured using semi-quantitative RT-PCR. The transcripts of most CaHsp20 genes maintained a low level in all of the four tissues under normal temperature condition, but were highly induced by heat stress, while the expression of CaHsp16.6b, 16.7 and 23.8 were only detected in specific tissues and were not so sensitive to heat stress like other CaHsp20 genes. In addition, compared to those in thermotolerant line R9, the expression peak of most CaHsp20 genes in thermosensitive line B6 under heat stress was hysteretic, and several CaHsp20 genes (CaHsp16.4, 18.2a, 18.7, 21.2, 22.0, 25.8 and 25.9 showed higher expression levels in both line B6 and R9. These data suggest that the CaHsp20 genes may be involved in heat stress and defense responses in pepper, which provides the basis for further functional analyses of CaHsp20s in the formation of pepper acquired thermotoleance.
Full Text Available Although there have been numerous observations of vitamin D deficiency and its links to chronic diseases, no studies have reported on how vitamin D status and vitamin D3 supplementation affects broad gene expression in humans. The objective of this study was to determine the effect of vitamin D status and subsequent vitamin D supplementation on broad gene expression in healthy adults. (Trial registration: ClinicalTrials.gov NCT01696409.A randomized, double-blind, single center pilot trial was conducted for comparing vitamin D supplementation with either 400 IUs (n = 3 or 2000 IUs (n = 5 vitamin D3 daily for 2 months on broad gene expression in the white blood cells collected from 8 healthy adults in the winter. Microarrays of the 16 buffy coats from eight subjects passed the quality control filters and normalized with the RMA method. Vitamin D3 supplementation that improved serum 25-hydroxyvitamin D concentrations was associated with at least a 1.5 fold alteration in the expression of 291 genes. There was a significant difference in the expression of 66 genes between subjects at baseline with vitamin D deficiency (25(OHD20 ng/ml. After vitamin D3 supplementation gene expression of these 66 genes was similar for both groups. Seventeen vitamin D-regulated genes with new candidate vitamin D response elements including TRIM27, CD83, COPB2, YRNA and CETN3 which have been shown to be important for transcriptional regulation, immune function, response to stress and DNA repair were identified.Our data suggest that any improvement in vitamin D status will significantly affect expression of genes that have a wide variety of biologic functions of more than 160 pathways linked to cancer, autoimmune disorders and cardiovascular disease with have been associated with vitamin D deficiency. This study reveals for the first time molecular finger prints that help explain the nonskeletal health benefits of vitamin D.ClinicalTrials.gov NCT01696409.
Wang, Ronghua; Mei, Yi; Xu, Liang; Zhu, Xianwen; Wang, Yan; Guo, Jun; Liu, Liwang
Heat stress (HS) causes detrimental effects on plant morphology, physiology, and biochemistry that lead to drastic reduction in plant biomass production and economic yield worldwide. To date, little is known about HS-responsive genes involved in thermotolerance mechanism in radish. In this study, a total of 6600 differentially expressed genes (DEGs) from the control and Heat24 cDNA libraries of radish were isolated by high-throughput sequencing. With Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis, some genes including MAPK, DREB, ERF, AP2, GST, Hsf, and Hsp were predominantly assigned in signal transductions, metabolic pathways, and biosynthesis and abiotic stress-responsive pathways. These pathways played significant roles in reducing stress-induced damages and enhancing heat tolerance in radish. Expression patterns of 24 candidate genes were validated by reverse-transcription quantitative PCR (RT-qPCR). Based mainly on the analysis of DEGs combining with the previous miRNAs analysis, the schematic model of HS-responsive regulatory network was proposed. To counter the effects of HS, a rapid response of the plasma membrane leads to the opening of specific calcium channels and cytoskeletal reorganization, after which HS-responsive genes are activated to repair damaged proteins and ultimately facilitate further enhancement of thermotolerance in radish. These results could provide fundamental insight into the regulatory network underlying heat tolerance in radish and facilitate further genetic manipulation of thermotolerance in root vegetable crops.
Etebari, Kayvan; Furlong, Michael J.; Asgari, Sassan
Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells. PMID:26411386
Full Text Available The LATERAL ORGAN BOUNDARIES DOMAIN (LBD gene family has been well-studied in Arabidopsis and play crucial roles in the diverse growth and development processes including establishment and maintenance of boundary of developmental lateral organs. In this study we identified and characterized 38 LBD genes in Lotus japonicus (LjLBD and 57 LBD genes in Medicago truncatula (MtLBD, both of which are model legume plants that have some specific development features absent in Arabidopsis. The phylogenetic relationships, their locations in the genome, genes structure and conserved motifs were examined. The results revealed that all LjLBD and MtLBD genes could be distinctly divided into two classes: Class I and II. The evolutionary analysis showed that Type I functional divergence with some significantly site-specific shifts may be the main force for the divergence between Class I and Class II. In addition, the expression patterns of LjLBD genes uncovered the diverse functions in plant development. Interestingly, we found that two LjLBD proteins that were highly expressed during compound leaf and pulvinus development, can interact via yeast two-hybrid assays. Taken together, our findings provide an evolutionary and genetic foundation in further understanding the molecular basis of LBD gene family in general, specifically in L. japonicus and M. truncatula.
Chien Van Ha
Full Text Available The plant-specific NAC transcription factors (TFs play important roles in regulation of diverse biological processes, including development, growth, cell division and responses to environmental stimuli. In this study, we identified the members of the NAC TF family of chickpea (Cicer arietinum and assess their expression profiles during plant development and under dehydration and abscisic acid (ABA treatments in a systematic manner. Seventy-one CaNAC genes were detected from the chickpea genome, including 8 membrane-bound members of which many might be involved in dehydration responses as judged from published literature. Phylogenetic analysis of the chickpea and well-known stress-related Arabidopsis and rice NACs enabled us to predict several putative stress-related CaNACs. By exploring available transcriptome data, we provided a comprehensive expression atlas of CaNACs in various tissues at different developmental stages. With the highest interest in dehydration responses, we examined the expression of the predicted stress-related and membrane-bound CaNACs in roots and leaves of chickpea seedlings, subjected to well-watered (control, dehydration and ABA treatments, using real-time quantitative PCR (RT-qPCR. Nine-teen of the 23 CaNACs examined were found to be dehydration-responsive in chickpea roots and/or leaves in either ABA-dependent or -independent pathway. Our results have provided a solid foundation for selection of promising tissue-specific and/or dehydration-responsive CaNAC candidates for detailed in planta functional analyses, leading to development of transgenic chickpea varieties with improved productivity under drought.
Xu, Zong-Chang; Kong, Yingzhen
Cellulose-synthase proteins (CESAs) are membrane localized proteins and they form protein complexes to produce cellulose in the plasma membrane. CESA proteins play very important roles in cell wall construction during plant growth and development. In this study, a total of 21 NtCESA gene sequences were identified by using PF03552 conserved protein sequence and 10 AtCESA protein sequences of Arabidopsis thaliana to blast against the common tobacco (Nicotiana tabacum L.) genome database with TBLASTN protocol. We analyzed the physical and chemical properties of protein sequences based on some software or on-line analysis tools. The results showed that there were no significant variances in terms of the physical and chemical properties of the 21 NtCESA proteins. First, phylogenetic tree analysis showed that 21 NtCESA genes and 10 AtCESA genes were clustered into five groups, and the gene structures were similar among the genes that are clustered into the same group. Second, in all of the 21 NtCESA proteins the conserved zinc finger domain was identified in the N-terminus, transmembrane domains were identified in the C-terminus and the DDD-QXXRW conserved domains were also identified. Third, gene expression analysis results indicated that most NtCESA genes were expressed in roots and leaves of seedling or mature tissues of tobacco, seeds and callus tissues. The genes that clustered into the same group share similar expression patterns. Importantly, NtCESA proteins that are involved in secondary cell wall cellulose synthesis have two extra transmembrane domains compared with that involved in primary cell wall cellulose biosynthesis. In addition, subcellular localization results showed that NtCESA9 and NtCESA14 were two plasma membrane anchored proteins. This study will lay a foundation for further functional characterization of these NtCESA genes.
Genome-wide gene expression profiling and a forward genetic screen show that differential expression of the sodium ion transporter Ena21 contributes to the differential tolerance of Candida albicans and Candida dubliniensis to osmotic stress.
Candida albicans is more pathogenic than Candida dubliniensis. However, this disparity in virulence is surprising given the high level of sequence conservation and the wide range of phenotypic traits shared by these two species. Increased sensitivity to environmental stresses has been suggested to be a possible contributory factor to the lower virulence of C. dubliniensis. In this study, we investigated, in the first comparison of C. albicans and C. dubliniensis by transcriptional profiling, global gene expression in each species when grown under conditions in which the two species exhibit differential stress tolerance. The profiles revealed similar core responses to stresses in both species, but differences in the amplitude of the general transcriptional responses to thermal, salt and oxidative stress. Differences in the regulation of specific stress genes were observed between the two species. In particular, ENA21, encoding a sodium ion transporter, was strongly induced in C. albicans but not in C. dubliniensis. In addition, ENA21 was identified in a forward genetic screen for C. albicans genomic sequences that increase salt tolerance in C. dubliniensis. Introduction of a single copy of CaENA21 was subsequently shown to be sufficient to confer salt tolerance upon C. dubliniensis.
Lada Artem G
Full Text Available Abstract Clusters of localized hypermutation in human breast cancer genomes, named “kataegis” (from the Greek for thunderstorm, are hypothesized to result from multiple cytosine deaminations catalyzed by AID/APOBEC proteins. However, a direct link between APOBECs and kataegis is still lacking. We have sequenced the genomes of yeast mutants induced in diploids by expression of the gene for PmCDA1, a hypermutagenic deaminase from sea lamprey. Analysis of the distribution of 5,138 induced mutations revealed localized clusters very similar to those found in tumors. Our data provide evidence that unleashed cytosine deaminase activity is an evolutionary conserved, prominent source of genome-wide kataegis events. Reviewers This article was reviewed by: Professor Sandor Pongor, Professor Shamil R. Sunyaev, and Dr Vladimir Kuznetsov.
Full Text Available The AP2/ERF transcription factor family, one of the largest families unique to plants, performs a significant role in terms of regulation of growth and development, and responses to biotic and abiotic stresses. Moso bamboo (Phyllostachys edulis is a fast-growing non-timber forest species with the highest ecological, economic and social values of all bamboos in Asia. The draft genome of moso bamboo and the available genomes of other plants provide great opportunities to research global information on the AP2/ERF family in moso bamboo. In total, 116 AP2/ERF transcription factors were identified in moso bamboo. The phylogeny analyses indicated that the 116 AP2/ERF genes could be divided into three subfamilies: AP2, RAV and ERF; and the ERF subfamily genes were divided into 11 groups. The gene structures, exons/introns and conserved motifs of the PeAP2/ERF genes were analyzed. Analysis of the evolutionary patterns and divergence showed the PeAP2/ERF genes underwent a large-scale event around 15 million years ago (MYA and the division time of AP2/ERF family genes between rice and moso bamboo was 15-23 MYA. We surveyed the putative promoter regions of the PeDREBs and showed that largely stress-related cis-elements existed in these genes. Further analysis of expression patterns of PeDREBs revealed that the most were strongly induced by drought, low-temperature and/or high salinity stresses in roots and, in contrast, most PeDREB genes had negative functions in leaves under the same respective stresses. In this study there were two main interesting points: there were fewer members of the PeDREB subfamily in moso bamboo than in other plants and there were differences in DREB gene expression profiles between leaves and roots triggered in response to abiotic stress. The information produced from this study may be valuable in overcoming challenges in cultivating moso bamboo.
Pritykin, Yuri; Ghersi, Dario; Singh, Mona
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
to protein: through epigenetic modifications, transcription regulators or post-transcriptional controls. The following papers concern several layers of gene regulation with questions answered by different HTS approaches. Genome-wide screening of epigenetic changes by ChIP-seq allowed us to study both spatial...... and temporal alterations of histone modifications (Papers I and II). Coupling the data with machine learning approaches, we established a prediction framework to assess the most informative histone marks as well as their most influential nucleosome positions in predicting the promoter usages. (Papers I...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V...
Han, Jeonghoon; Kim, Duck-Hyun; Kim, Hui-Su; Kim, Hee-Jin; Declerck, Steven A J; Hagiwara, Atsushi; Lee, Jae-Seong
While marine invertebrate cytochrome P450 (CYP) genes and their roles in detoxification mechanisms have been studied, little information is available regarding freshwater rotifer CYPs and their functions. Here, we used genomic sequences and RNA-seq databases to identify 31 CYP genes in the freshwater rotifer Brachionus calyciflorus. The 31 Bc-CYP genes with a few tandem duplications were clustered into CYP 2, 3, 4, mitochondrial, and 46 clans with two marine rotifers Brachionus plicatilis and Brachionus koreanus. To understand the molecular responses of these 31 Bc-CYP genes, we also examined their expression patterns in response to benzo[α]pyrene (B[α]P). Three Bc-CYP genes (Bc-CYP3044B3, Bc-CYP3049B4, Bc-CYP3049B6) were significantly upregulated (P<0.05) in response to B[α]P, suggesting that these CYP genes can be involved in detoxification in response to B[α]P exposure. These genes might be useful as biomarkers of B[α]P exposure in B. calyciflorus. Overall, our findings expand the repertoire of known CYPs and shed light on their potential roles in xenobiotic detoxification in rotifers. Copyright © 2017 Elsevier Inc. All rights reserved.
Sharma, Ranu; Rawat, Vimal; Suresh, C G
The nucleotide binding site-leucine rich repeat (NBS-LRR) proteins play an important role in the defense mechanisms against pathogens. Using bioinformatics approach, we identified and annotated 104 NBS-LRR genes in chickpea. Phylogenetic analysis points to their diversification into two families namely TIR-NBS-LRR and non-TIR-NBS-LRR. Gene architecture revealed intron gain/loss events in this resistance gene family during their independent evolution into two families. Comparative genomics analysis elucidated its evolutionary relationship with other fabaceae species. Around 50% NBS-LRRs reside in macro-syntenic blocks underlining positional conservation along with sequence conservation of NBS-LRR genes in chickpea. Transcriptome sequencing data provided evidence for their transcription and tissue-specific expression. Four cis -regulatory elements namely WBOX, DRE, CBF, and GCC boxes, that commonly occur in resistance genes, were present in the promoter regions of these genes. Further, the findings will provide a strong background to use candidate disease resistance NBS-encoding genes and identify their specific roles in chickpea.
Zhang, Zhenzhu; Chen, Xiuling; Guan, Xin; Liu, Yang; Chen, Hongyu; Wang, Tingting; Mouekouba, Liana Dalcantara Ongouya; Li, Jingfu; Wang, Aoxue
Homeodomain-leucine zipper (HD-Zip) proteins are a kind of transcriptional factors that play a vital role in plant growth and development. However, no detailed information of HD-Zip family in tomato has been reported till now. In this study, 51 HD-Zip genes (SlHZ01-51) in this family were identified and categorized into 4 classes by exon-intron and protein structure in tomato (Solanum lycopersicum) genome. The synthetical phylogenetic tree of tomato, Arabidopsis and rice HD-Zip genes were established for an insight into their evolutionary relationships and putative functions. The results showed that the contribution of segmental duplication was larger than that of tandem duplication for expansion and evolution of genes in this family of tomato. The expression profile results under abiotic stress suggested that all SlHZ I genes were responsive to cold stress. This study will provide a clue for the further investigation of functional identification and the role of tomato HD-Zip I subfamily in plant cold stress responses and developmental events.
Abdul Kayum, Md.; Nath, Ujjal Kumar; Park, Jong-In; Choi, Eung Kyoo; Song, Jae-Young; Kim, Hoy-Taek; Nou, Ill-Sup
Plant growth and development can be adversely affected by cold stress, limiting productivity. The glutathione S-transferase (GST) family comprises important detoxifying enzymes, which play major roles in biotic and abiotic stress responses by reducing the oxidative damage caused by reactive oxygen species. Pumpkins (Cucurbita maxima) are widely grown, economically important, and nutritious; however, their yield can be severely affected by cold stress. The identification of putative candidate genes responsible for cold-stress tolerance, including the GST family genes, is therefore vital. For the first time, we identified 32 C. maxima GST (CmaGST) genes using a combination of bioinformatics approaches and characterized them by expression profiling. These CmaGST genes represent seven of the 14 known classes of plant GSTs, with 18 CmaGSTs categorized into the tau class. The CmaGSTs were distributed across 13 of pumpkin’s 20 chromosomes, with the highest numbers found on chromosomes 4 and 6. The large number of CmaGST genes resulted from gene duplication; 11 and 5 pairs of CmaGST genes were segmental- and tandem-duplicated, respectively. In addition, all CmaGST genes showed organ-specific expression. The expression of the putative GST genes in pumpkin was examined under cold stress in two lines with contrasting cold tolerance: cold-tolerant CP-1 (C. maxima) and cold-susceptible EP-1 (Cucurbita moschata). Seven genes (CmaGSTU3, CmaGSTU7, CmaGSTU8, CmaGSTU9, CmaGSTU11, CmaGSTU12, and CmaGSTU14) were highly expressed in the cold-tolerant line and are putative candidates for use in breeding cold-tolerant crop varieties. These results increase our understanding of the cold-stress-related functions of the GST family, as well as potentially enhancing pumpkin breeding programs. PMID:29439434
Liu, Pu; Zhang, Chao; Ma, Jin-Qi; Zhang, Li-Yuan; Yang, Bo; Tang, Xin-Yu; Huang, Ling; Zhou, Xin-Tong; Lu, Kun; Li, Jia-Na
Cytokinin oxidase/dehydrogenases (CKXs) play a critical role in the irreversible degradation of cytokinins, thereby regulating plant growth and development. Brassica napus is one of the most widely cultivated oilseed crops worldwide. With the completion of whole-genome sequencing of B. napus , genome-wide identification and expression analysis of the BnCKX gene family has become technically feasible. In this study, we identified 23 BnCKX genes and analyzed their phylogenetic relationships, gene structures, conserved motifs, protein subcellular localizations, and other properties. We also analyzed the expression of the 23 BnCKX genes in the B. napus cultivar Zhong Shuang 11 ('ZS11') by quantitative reverse-transcription polymerase chain reaction (qRT-PCR), revealing their diverse expression patterns. We selected four BnCKX genes based on the results of RNA-sequencing and qRT-PCR and compared their expression in cultivated varieties with extremely long versus short siliques. The expression levels of BnCKX5-1 , 5-2 , 6-1 , and 7-1 significantly differed between the two lines and changed during pod development, suggesting they might play roles in determining silique length and in pod development. Finally, we investigated the effects of treatment with the synthetic cytokinin 6-benzylaminopurine (6-BA) and the auxin indole-3-acetic acid (IAA) on the expression of the four selected BnCKX genes. Our results suggest that regulating BnCKX expression is a promising way to enhance the harvest index and stress resistance in plants.
Fu, Xing-Zheng; Tong, Ya-Hua; Zhou, Xue; Ling, Li-Li; Chun, Chang-Pin; Cao, Li; Zeng, Ming; Peng, Liang-Zhi
Plant metal tolerance proteins (MTPs) play important roles in heavy metal homeostasis; however, related information in citrus plants is limited. Citrus genome sequencing and assembly have enabled us to perform a systematic analysis of the MTP gene family. We identified 12 MTP genes in sweet orange, which we have named as CitMTP1 and CitMTP3 to CitMTP12 based on their sequence similarity to Arabidopsis thaliana MTPs. The CitMTPs were predicted to encode proteins of 864 to 2556 amino acids in length that included 4 to 6 putative transmembrane domains (TMDs). Furthermore, all the CitMTPs contained a highly conserved signature sequence encompassing the TMD-II and the start of the TMD-III. Phylogenetic analysis further classified the CitMTPs into Fe/Zn-MTP, Mn-MTP, and Zn-MTP subgroups, which coincided with the MTPs of A. thaliana and rice. The closely clustered CitMTPs shared a similar gene structure. Expression analysis indicated that most CitMTP transcripts were upregulated to various extents under heavy metal stress. Among these, CitMTP5 in the roots and CitMTP11 in the leaves during Zn stress, CitMTP8 in the roots and CitMTP8.1 in the leaves during Mn stress, CitMTP12 in the roots and CitMTP1 in the leaves during Cu stress, and CitMTP11 in the roots and CitMTP1 in the leaves during Cd stress showed the highest extent of upregulation. These findings are suggestive of their individual roles in heavy metal detoxification. Copyright © 2017 Elsevier B.V. All rights reserved.
Le, Dung Tien; Nishiyama, Rie; Watanabe, Yasuko; Mochida, Keiichi; Yamaguchi-Shinozaki, Kazuko; Shinozaki, Kazuo; Tran, Lam-Son Phan
Plant-specific NAC transcription factors (TFs) play important roles in regulating diverse biological processes, including development, senescence, growth, cell division and responses to environmental stress stimuli. Within the soybean genome, we identified 152 full-length GmNAC TFs, including 11 membrane-bound members. In silico analysis of the GmNACs, together with their Arabidopsis and rice counterparts, revealed similar NAC architecture. Next, we explored the soybean Affymetrix array and Illumina transcriptome sequence data to analyse tissue-specific expression profiles of GmNAC genes. Phylogenetic analysis using stress-related NAC TFs from Arabidopsis and rice as seeding sequences identified 58 of the 152 GmNACs as putative stress-responsive genes, including eight previously reported dehydration-responsive GmNACs. We could design gene-specific primers for quantitative real-time PCR verification of 38 out of 50 newly predicted stress-related genes. Twenty-five and six GmNACs were found to be induced and repressed 2-fold or more, respectively, in soybean roots and/or shoots in response to dehydration. GmNAC085, whose amino acid sequence was 39%; identical to that of well-known SNAC1/ONAC2, was the most induced gene upon dehydration, showing 390-fold and 20-fold induction in shoots and roots, respectively. Our systematic analysis has identified excellent tissue-specific and/or dehydration-responsive candidate GmNAC genes for in-depth characterization and future development of improved drought-tolerant transgenic soybeans.
Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A
Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq TM (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.
Yuan, Xiguo; Yu, Guoqiang; Hou, Xuchu; Shih, Ie-Ming; Clarke, Robert; Zhang, Junying; Hoffman, Eric P; Wang, Roger R; Zhang, Zhen; Wang, Yue
Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open-source and platform-independent SAIC software is
Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC, a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1 exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2 performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3 iteratively detecting Significant Copy Number Aberrations (SCAs and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma. When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC or tumor suppressor genes (e.g., CDKN2A/B. Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes
Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda; Yoon, Sungjoo Kim; Jang, Yangsoo; Beaty, Terri H.
Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP associated with mean log adiponectin was rs3865188 in CDH13 on chromosome 16 (p = 1.69 × 10−15 in the initial sample, p = 6.58 × 10−39 in the second genome-wide sample, and p = 2.12 × 10−32 in the replication sample). The meta-analysis p value for rs3865188 in all 6,305 individuals was 2.82 × 10−83. The association of rs3865188 with high-molecular-weight adiponectin (p = 7.36 × 10−58) was even stronger in the third sample. A reporter assay that evaluated the effects of a CDH13 promoter SNP in complete linkage disequilibrium with rs3865188 revealed that the major allele increased expression 2.2-fold. This study clearly shows that genetic variants in CDH13 influence adiponectin levels in Korean adults. PMID:20887962
Lin, Tao; Gao, Lihui
population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Han, Jeonghoon; Kim, Duck-Hyun; Kim, Hui-Su; Kim, Hee-Jin; Declerck, S.A.J.; Hagiwara, Atsushi; Lee, Jae-Seong
While marine invertebrate cytochrome P450 (CYP) genes and their roles in detoxification mechanisms have been studied, little information is available regarding freshwater rotifer CYPs and their functions. Here, we used genomic sequences and RNA-seq databases to identify 31 CYP genes in the
Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora
Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. Copyright © 2011 Elsevier Inc. All rights reserved.
Makarova, Olga; MacLean, Allyson M.; Hogenhout, Saskia A.
this technique for reliable gene expression quantification of phytoplasmas on a large scale. In our experimental setup, 242 genes of aster yellows phytoplasma strain witches' broom (AY-WB) were tested for differences in expression in plant and insect host environments, and were shown to be predominantly...
Full Text Available We aimed to identify novel molecular associations between chronic intermittent hypoxia with re-oxygenation and adverse consequences in obstructive sleep apnea (OSA. We analyzed gene expression profiles of peripheral blood mononuclear cells from 48 patients with sleep-disordered breathing stratified into four groups: primary snoring (PS, moderate to severe OSA (MSO, very severe OSA (VSO, and very severe OSA patients on long-term continuous positive airway pressure treatment (VSOC. Comparisons of the microarray gene expression data identified eight genes up-regulated with OSA and down-regulated with CPAP treatment, and five genes down-regulated with OSA and up-regulated with CPAP treatment. Protein expression levels of two genes related to endothelial tight junction (AMOT P130, and PLEKHH3, and three genes related to anti-or pro-apoptosis (BIRC3, ADAR1 P150, and LGALS3 were all increased in the VSO group, while AMOT P130 was further increased, and PLEKHH3, BIRC3, and ADAR1 P150 were all decreased in the VSOC group. Subgroup analyses revealed that AMOT P130 protein expression was increased in OSA patients with excessive daytime sleepiness, BIRC3 protein expression was decreased in OSA patients with hypertension, and LGALS3 protein expression was increased in OSA patients with chronic kidney disease. In vitro short-term intermittent hypoxia with re-oxygenation experiment showed immediate over-expression of ADAR1 P150. In conclusion, we identified a novel association between AMOT/PLEKHH3/BIRC3/ADAR1/LGALS3 over-expressions and high severity index in OSA patients. AMOT and GALIG may constitute an important determinant for the development of hypersomnia and kidney injury, respectively, while BIRC3 may play a protective role in the development of hypertension.
Conforto Tara L
Full Text Available Abstract Background Early liver development and the transcriptional transitions during hepatogenesis are well characterized. However, gene expression changes during the late postnatal/pre-pubertal to young adulthood period are less well understood, especially with regards to sex-specific gene expression. Methods Microarray analysis of male and female mouse liver was carried out at 3, 4, and 8 wk of age to elucidate developmental changes in gene expression from the late postnatal/pre-pubertal period to young adulthood. Results A large number of sex-biased and sex-independent genes showed significant changes during this developmental period. Notably, sex-independent genes involved in cell cycle, chromosome condensation, and DNA replication were down regulated from 3 wk to 8 wk, while genes associated with metal ion binding, ion transport and kinase activity were up regulated. A majority of genes showing sex differential expression in adult liver did not display sex differences prior to puberty, at which time extensive changes in sex-specific gene expression were seen, primarily in males. Thus, in male liver, 76% of male-specific genes were up regulated and 47% of female-specific genes were down regulated from 3 to 8 wk of age, whereas in female liver 67% of sex-specific genes showed no significant change in expression. In both sexes, genes up regulated from 3 to 8 wk were significantly enriched (p p Ihh; female-specific Cdx4, Cux2, Tox, and Trim24 and may contribute to the developmental changes that lead to global acquisition of liver sex-specificity by 8 wk of age. Conclusions Overall, the observed changes in gene expression during postnatal liver development reflect the deceleration of liver growth and the induction of specialized liver functions, with widespread changes in sex-specific gene expression primarily occurring in male liver.
Full Text Available Drosophila melanogaster head development represents a valuable process to study the developmental control of various organs, such as the antennae, the dorsal ocelli and the compound eyes from a common precursor, the eye-antennal imaginal disc. While the gene regulatory network underlying compound eye development has been extensively studied, the key transcription factors regulating the formation of other head structures from the same imaginal disc are largely unknown. We obtained the developmental transcriptome of the eye-antennal discs covering late patterning processes at the late 2nd larval instar stage to the onset and progression of differentiation at the end of larval development. We revealed the expression profiles of all genes expressed during eye-antennal disc development and we determined temporally co-expressed genes by hierarchical clustering. Since co-expressed genes may be regulated by common transcriptional regulators, we combined our transcriptome dataset with publicly available ChIP-seq data to identify central transcription factors that co-regulate genes during head development. Besides the identification of already known and well-described transcription factors, we show that the transcription factor Hunchback (Hb regulates a significant number of genes that are expressed during late differentiation stages. We confirm that hb is expressed in two polyploid subperineurial glia cells (carpet cells and a thorough functional analysis shows that loss of Hb function results in a loss of carpet cells in the eye-antennal disc. Additionally, we provide for the first time functional data indicating that carpet cells are an integral part of the blood-brain barrier. Eventually, we combined our expression data with a de novo Hb motif search to reveal stage specific putative target genes of which we find a significant number indeed expressed in carpet cells.
Torres-Oliva, Montserrat; Schneider, Julia; Wiegleb, Gordon
Drosophila melanogaster head development represents a valuable process to study the developmental control of various organs, such as the antennae, the dorsal ocelli and the compound eyes from a common precursor, the eye-antennal imaginal disc. While the gene regulatory network underlying compound eye development has been extensively studied, the key transcription factors regulating the formation of other head structures from the same imaginal disc are largely unknown. We obtained the developmental transcriptome of the eye-antennal discs covering late patterning processes at the late 2nd larval instar stage to the onset and progression of differentiation at the end of larval development. We revealed the expression profiles of all genes expressed during eye-antennal disc development and we determined temporally co-expressed genes by hierarchical clustering. Since co-expressed genes may be regulated by common transcriptional regulators, we combined our transcriptome dataset with publicly available ChIP-seq data to identify central transcription factors that co-regulate genes during head development. Besides the identification of already known and well-described transcription factors, we show that the transcription factor Hunchback (Hb) regulates a significant number of genes that are expressed during late differentiation stages. We confirm that hb is expressed in two polyploid subperineurial glia cells (carpet cells) and a thorough functional analysis shows that loss of Hb function results in a loss of carpet cells in the eye-antennal disc. Additionally, we provide for the first time functional data indicating that carpet cells are an integral part of the blood-brain barrier. Eventually, we combined our expression data with a de novo Hb motif search to reveal stage specific putative target genes of which we find a significant number indeed expressed in carpet cells. PMID:29360820
Full Text Available Abstract Background Temperature adaptation is one of the most important determinants of distribution and population size of organisms in nature. Recently, quantitative trait loci (QTL mapping and gene expression profiling approaches have been used for detecting candidate genes for heat resistance. However, the resolution of QTL mapping is not high enough to examine the individual effects of various genes in each QTL. Heat stress-responsive genes, characterized by gene expression profiling studies, are not necessarily responsible for heat resistance. Some of these genes may be regulated in association with the heat stress response of other genes. Results To evaluate which heat-responsive genes are potential candidates for heat resistance with higher resolution than previous QTL mapping studies, we performed genome-wide deficiency screen for QTL for heat resistance. We screened 439 isogenic deficiency strains from the DrosDel project, covering 65.6% of the Drosophila melanogaster genome in order to map QTL for thermal resistance. As a result, we found 19 QTL for heat resistance, including 3 novel QTL outside the QTL found in previous studies. Conclusion The QTL found in this study encompassed 19 heat-responsive genes found in the previous gene expression profiling studies, suggesting that they were strong candidates for heat resistance. This result provides new insights into the genetic architecture of heat resistance. It also emphasizes the advantages of genome-wide deficiency screen using isogenic deficiency libraries.
Full Text Available Abstract Background Bovine follicular development is regulated by numerous molecular mechanisms and biological pathways. In this study, we tried to identify differentially expressed genes between largest (F1 and second-largest follicles (F2, and classify them by global gene expression profiling using a combination of microarray and quantitative real-time PCR (QPCR analysis. The follicular status of F1 and F2 were further evaluated in terms of healthy and atretic conditions by investigating mRNA localization of identified genes. Methods Global gene expression profiles of F1 (10.7 +/- 0.7 mm and F2 (7.8 +/- 0.2 mm were analyzed by hierarchical cluster analysis and expression profiles of 16 representative genes were confirmed by QPCR analysis. In addition, localization of six identified transcripts was investigated in healthy and atretic follicles using in situ hybridization. The healthy or atretic condition of examined follicles was classified by progesterone and estradiol concentrations in follicular fluid. Results Hierarchical cluster analysis of microarray data classified the follicles into two clusters. Cluster A was composed of only F2 and was characterized by high expression of 31 genes including IGFBP5, whereas cluster B contained only F1 and predominantly expressed 45 genes including CYP19 and FSHR. QPCR analysis confirmed AMH, CYP19, FSHR, GPX3, PlGF, PLA2G1B, SCD and TRB2 were greater in F1 than F2, while CCL2, GADD45A, IGFBP5, PLAUR, SELP, SPP1, TIMP1 and TSP2 were greater in F2 than in F1. In situ hybridization showed that AMH and CYP19 were detected in granulosa cells (GC of healthy as well as atretic follicles. PlGF was localized in GC and in the theca layer (TL of healthy follicles. IGFBP5 was detected in both GC and TL of atretic follicles. GADD45A and TSP2 were localized in both GC and TL of atretic follicles, whereas healthy follicles expressed them only in GC. Conclusion We demonstrated that global gene expression profiling of F
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder.
Full Text Available Genes and miRNAs involved in sarcomagenesis related pathways are unknown and therefore signaling events leading to mesenchymal cell transformation to sarcoma are poorly elucidated. Exiqon and Illumina microarray study on human chondrosarcoma JJ012 and chondrocytes C28 cell lines to compare and analyze the differentially expressed miRNAs and their gene targets was recently published in the Journal Tumor Biology in 2014. Here we describe in details the contents and quality controls for the miRNA and gene expression data associated with the study that is relevant to this dataset.
Clemmensen, Anders; Andersen, Klaus E; Clemmensen, Ole
the differential molecular events induced in the epidermis by different irritants, we collected sequential biopsies ((1/2), 4, and 24 hours after a single exposure and at day 11 after repeated exposure) from human volunteers exposed to either sodium lauryl sulfate (SLS) or nonanoic acid (NON). Gene expression...
Vilella Albert J
Full Text Available Abstract Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i exhaustive population-genetic analyses including those based on the coalescent theory; ii analysis adapted to the shallow data generated by the high-throughput genome projects; iii use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Li, Guo; Liu, Yong; Liu, Chao; Su, Zhongwu; Ren, Shuling; Wang, Yunyun; Deng, Tengbo; Huang, Donghai; Tian, Yongquan; Qiu, Yuanzheng
Radioresistance is one of the major factors limiting the therapeutic efficacy and prognosis of patients with nasopharyngeal carcinoma (NPC). Accumulating evidence has suggested that aberrant expression of long noncoding RNAs (lncRNAs) contributes to cancer progression. Therefore, here we identified lncRNAs associated with radioresistance in NPC. The differential expression profiles of lncRNAs associated with NPC radioresistance were constructed by next-generation deep sequencing by comparing radioresistant NPC cells with their parental cells. LncRNA-related mRNAs were predicted and analyzed using bioinformatics algorithms compared with the mRNA profiles related to radioresistance obtained in our previous study. Several lncRNAs and associated mRNAs were validated in established NPC radioresistant cell models and NPC tissues. By comparison between radioresistant CNE-2-Rs and parental CNE-2 cells by next-generation deep sequencing, a total of 781 known lncRNAs and 2054 novel lncRNAs were annotated. The top five upregulated and downregulated known/novel lncRNAs were detected using quantitative real-time reverse transcription-polymerase chain reaction, and 7/10 known lncRNAs and 3/10 novel lncRNAs were demonstrated to have significant differential expression trends that were the same as those predicted by deep sequencing. From the prediction process, 13 pairs of lncRNAs and their associated genes were acquired, and the prediction trends of three pairs were validated in both radioresistant CNE-2-Rs and 6-10B-Rs cell lines, including lncRNA n373932 and SLITRK5, n409627 and PRSS12, and n386034 and RIMKLB. LncRNA n373932 and its related SLITRK5 showed dramatic expression changes in post-irradiation radioresistant cells and a negative expression correlation in NPC tissues (R = -0.595, p < 0.05). Our study provides an overview of the expression profiles of radioresistant lncRNAs and potentially related mRNAs, which will facilitate future investigations into the
Gao, Lijie; Wang, Yunqi; Li, Yi; Dong, Ya; Yang, Aimin; Zhang, Jie; Li, Fengying; Zhang, Rongqiang
Comprehensive bioinformatics analyses were performed to explore the key biomarkers in response to HIV infection of CD4 + and CD8 + T cells. The numbers of CD4 + and CD8 + T cells of HIV infected individuals were analyzed and the GEO database (GSE6740) was screened for differentially expressed genes (DEGs) in HIV infected CD4 + and CD8 + T cells. Gene Ontology enrichment, KEGG pathway analyses, and protein-protein interaction (PPI) network were performed to identify the key pathway and core proteins in anti-HIV virus process of CD4 + and CD8 + T cells. Finally, we analyzed the expressions of key proteins in HIV-infected T cells (GSE6740 dataset) and peripheral blood mononuclear cells(PBMCs) (GSE511 dataset). 1) CD4 + T cells counts and ratio of CD4 + /CD8 + T cells decreased while CD8 + T cells counts increased in HIV positive individuals; 2) 517 DEGs were found in HIV infected CD4 + and CD8 + T cells at acute and chronic stage with the criterial of P-value T cells. The main biological processes of the DEGs were response to virus and defense response to virus. At chronic stage, ISG15 protein, in conjunction with IFN-1 pathway might play key roles in anti-HIV responses of CD4 + T cells; and 4) The expression of ISG15 increased in both T cells and PBMCs after HIV infection. Gene expression profile of CD4 + and CD8 + T cells changed significantly in HIV infection, in which ISG15 gene may play a central role in activating the natural antiviral process of immune cells. © 2018 Wiley Periodicals, Inc.
Clemmensen, Anders; Andersen, Klaus E; Clemmensen, Ole; Tan, Qihua; Petersen, Thomas K; Kruse, Torben A; Thomassen, Mads
The pathogenesis of irritant contact dermatitis (ICD) is poorly understood, and genes participating in the epidermal response to chemical irritants are only partly known. It is commonly accepted that different irritants have different mechanisms of action in the development of ICD. To define the differential molecular events induced in the epidermis by different irritants, we collected sequential biopsies ((1/2), 4, and 24 hours after a single exposure and at day 11 after repeated exposure) from human volunteers exposed to either sodium lauryl sulfate (SLS) or nonanoic acid (NON). Gene expression analysis using high-density oligonucleotide microarrays (representing 47,000 transcripts) revealed essentially different pathway responses (1/2)hours after exposure: NON transiently induced the IL-6 pathway as well as a number of mitogen-activated signaling cascades including extracellular signal-regulated kinase and growth factor receptor signaling, whereas SLS transiently downregulated cellular energy metabolism pathways. Differential expression of the cyclooxygenase-2 and matrix metalloproteinase 3 transcripts was confirmed immunohistochemically. After cumulative exposure, 883 genes were differentially expressed, whereas we identified 23 suggested common biomarkers for ICD. In conclusion, we bring new insights into two hitherto less well-elucidated phases of skin irritancy: the very initial as well as the late phase after single and cumulative mild exposures, respectively.
Full Text Available Plant 14-3-3 proteins act as critical components of various cellular signaling processes and play an important role in regulating multiple physiological processes. However, less information is known about the 14-3-3 gene family in banana. In this study, 25 14-3-3 genes were identified from the banana genome. Based on the evolutionary analysis, banana 14-3-3 proteins were clustered into ε and non-ε groups. Conserved motif analysis showed that all identified banana 14-3-3 genes had the typical 14-3-3 motif. The gene structure of banana 14-3-3 genes showed distinct class-specific divergence between the ε group and the non-ε group. Most banana 14-3-3 genes showed strong transcript accumulation changes during fruit development and postharvest ripening in two banana varieties, indicating that they might be involved in regulating fruit development and ripening. Moreover, some 14-3-3 genes also showed great changes after osmotic, cold, and salt treatments in two banana varieties, suggested their potential role in regulating banana response to abiotic stress. Taken together, this systemic analysis reveals the involvement of banana 14-3-3 genes in fruit development, postharvest ripening, and response to abiotic stress and provides useful information for understanding the functions of 14-3-3 genes in banana.
Yuan, Fengjie; Yu, Xiaomin; Dong, Dekun; Yang, Qinghua; Fu, Xujun; Zhu, Shenlong; Zhu, Danhua
Seed germination is important to soybean (Glycine max) growth and development, ultimately affecting soybean yield. A lower seed field emergence has been the main hindrance for breeding soybeans low in phytate. Although this reduction could be overcome by additional breeding and selection, the mechanisms of seed germination in different low phytate mutants remain unknown. In this study, we performed a comparative transcript analysis of two low phytate soybean mutants (TW-1 and TW-1-M), which have the same mutation, a 2 bp deletion in GmMIPS1, but show a significant difference in seed field emergence, TW-1-M was higher than that of TW-1 . Numerous genes analyzed by RNA-Seq showed markedly different expression levels between TW-1-M and TW-1 mutants. Approximately 30,000-35,000 read-mapped genes and ~21000-25000 expressed genes were identified for each library. There were ~3900-9200 differentially expressed genes (DEGs) in each contrast library, the number of up-regulated genes was similar with down-regulated genes in the mutant TW-1and TW-1-M. Gene ontology functional categories of DEGs indicated that the ethylene-mediated signaling pathway, the abscisic acid-mediated signaling pathway, response to hormone, ethylene biosynthetic process, ethylene metabolic process, regulation of hormone levels, and oxidation-reduction process, regulation of flavonoid biosynthetic process and regulation of abscisic acid-activated signaling pathway had high correlations with seed germination. In total, 2457 DEGs involved in the above functional categories were identified. Twenty-two genes with 20 biological functions were the most highly up/down- regulated (absolute value Log2FC >5) in the high field emergence mutant TW-1-M and were related to metabolic or signaling pathways. Fifty-seven genes with 36 biological functions had the greatest expression abundance (FRPM >100) in germination-related pathways. Seed germination in the soybean low phytate mutants is a very complex process
Mohanty, Sujata; Khanna, Radhika
Comparative analysis of multiple genomes of closely or distantly related Drosophila species undoubtedly creates excitement among evolutionary biologists in exploring the genomic changes with an ecology and evolutionary perspective. We present herewith the de novo assembled whole genome sequences of four Drosophila species, D. bipectinata, D. takahashii, D. biarmipes and D. nasuta of Indian origin using Next Generation Sequencing technology on an Illumina platform along with their detailed assembly statistics. The comparative genomics analysis, e.g. gene predictions and annotations, functional and orthogroup analysis of coding sequences and genome wide SNP distribution were performed. The whole genome of Zaprionus indianus of Indian origin published earlier by us and the genome sequences of previously sequenced 12 Drosophila species available in the NCBI database were included in the analysis. The present work is a part of our ongoing genomics project of Indian Drosophila species.
Frech, Christian; Chen, Nansheng
Correct classification of genes into gene families is important for understanding gene function and evolution. Although gene families of many species have been resolved both computationally and experimentally with high accuracy, gene family classification in most newly sequenced genomes has not been done with the same high standard. This project has been designed to develop a strategy to effectively and accurately classify gene families across genomes. We first examine and compare the performance of computer programs developed for automated gene family classification. We demonstrate that some programs, including the hierarchical average-linkage clustering algorithm MC-UPGMA and the popular Markov clustering algorithm TRIBE-MCL, can reconstruct manual curation of gene families accurately. However, their performance is highly sensitive to parameter setting, i.e. different gene families require different program parameters for correct resolution. To circumvent the problem of parameterization, we have developed a comparative strategy for gene family classification. This strategy takes advantage of existing curated gene families of reference species to find suitable parameters for classifying genes in related genomes. To demonstrate the effectiveness of this novel strategy, we use TRIBE-MCL to classify chemosensory and ABC transporter gene families in C. elegans and its four sister species. We conclude that fully automated programs can establish biologically accurate gene families if parameterized accordingly. Comparative gene family classification finds optimal parameters automatically, thus allowing rapid insights into gene families of newly sequenced species. PMID:20976221
Rajeev K. Varshney
Full Text Available Biotic stress in legume crops is one of the major threats to crop yield and productivity. Being sessile organisms, plants have evolved a myriad of mechanisms to combat different stresses imposed on them. One such mechanism, deciphered in the last decade, is small RNA (sRNA mediated defense in plants. Small RNAs (sRNAs have emerged as one of the major players in gene expression regulation in plants during developmental stages and under stress conditions. They are known to act both at transcriptional and post-transcriptional levels. Dicer-like (DCL, Argonaute (AGO, and RNA dependent RNA polymerase (RDR constitute the major components of sRNA biogenesis machinery and are known to play a significant role in combating biotic and abiotic stresses. This study is, therefore, focused on identification and characterization of sRNA biogenesis proteins in three important legume crops, namely chickpea, pigeonpea, and groundnut. Phylogenetic analysis of these proteins between legume species classified them into distinct clades and suggests the evolutionary conservation of these genes across the members of Papillionidoids subfamily. Variable expression of sRNA biogenesis genes in response to the biotic stresses among the three legumes indicate the possible existence of specialized regulatory mechanisms in different legumes. This is the first ever study to understand the role of sRNA biogenesis genes in response to pathogen attacks in the studied legumes.
Gao, Junpeng; Cao, Xiaoli; Shi, Shandang; Ma, Yuling; Wang, Kai; Liu, Shengjie; Chen, Dan; Chen, Qin; Ma, Haoli
The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. Copyright © 2016 Elsevier Inc. All rights reserved.
Full Text Available In seeking more specific biomarkers of the cystic fibrosis (CF lung inflammatory disease that would be sensitive to antibiotic therapy, we sought to evaluate the gene expression profiles of neutrophils in CF patients before treatment in comparison with non-CF healthy individuals and after antibiotic treatment. Genes involved in neutrophil-mediated inflammation, i.e. chemotaxis, respiratory burst, apoptosis, and granule exocytosis, were the targets of this study. Microarray analysis was carried out in blood and airway neutrophils from CF patients and in control subjects. A fold change (log threshold of 1.4 and a cut-off of p<0.05 were utilized to identify significant genes. Community networks and principal component analysis were used to distinguish the groups of controls, pre- and post-therapy patients. Control subjects and CF patients before therapy were readily separated, whereas a clear distinction between patients before and after antibiotic therapy was not possible. Blood neutrophils before therapy presented 269 genes down-regulated and 56 up-regulated as compared with control subjects. Comparison between the same patients before and after therapy showed instead 44 genes down-regulated and 72 up-regulated. Three genes appeared to be sensitive to therapy and returned to "healthy" condition: phorbol-12-myristate-13-acetate-induced protein 1 (PMAIP1, hydrogen voltage-gated channel 1 (HVCN1, and β-arrestin 1 (ARRB1. The up-regulation of these genes after therapy were confirmed by real time PCR. In airway neutrophils, 1029 genes were differentially expressed post- vs pre-therapy. Of these, 30 genes were up-regulated and 75 down-regulated following antibiotic treatment. However, biological plausibility determined that only down-regulated genes belonged to the gene classes studied for blood neutrophils. Finally, it was observed that commonly expressed genes showed a greater variability in airway neutrophils than that found in blood neutrophils
Conese, Massimo; Castellani, Stefano; Lepore, Silvia; Palumbo, Orazio; Manca, Antonio; Santostasi, Teresa; Polizzi, Angela Maria; Copetti, Massimiliano; Di Gioia, Sante; Casavola, Valeria; Guerra, Lorenzo; Diana, Anna; Montemurro, Pasqualina; Mariggiò, Maria Addolorata; Gallo, Crescenzio; Maffione, Angela Bruna; Carella, Massimo
In seeking more specific biomarkers of the cystic fibrosis (CF) lung inflammatory disease that would be sensitive to antibiotic therapy, we sought to evaluate the gene expression profiles of neutrophils in CF patients before treatment in comparison with non-CF healthy individuals and after antibiotic treatment. Genes involved in neutrophil-mediated inflammation, i.e. chemotaxis, respiratory burst, apoptosis, and granule exocytosis, were the targets of this study. Microarray analysis was carried out in blood and airway neutrophils from CF patients and in control subjects. A fold change (log) threshold of 1.4 and a cut-off of p<0.05 were utilized to identify significant genes. Community networks and principal component analysis were used to distinguish the groups of controls, pre- and post-therapy patients. Control subjects and CF patients before therapy were readily separated, whereas a clear distinction between patients before and after antibiotic therapy was not possible. Blood neutrophils before therapy presented 269 genes down-regulated and 56 up-regulated as compared with control subjects. Comparison between the same patients before and after therapy showed instead 44 genes down-regulated and 72 up-regulated. Three genes appeared to be sensitive to therapy and returned to "healthy" condition: phorbol-12-myristate-13-acetate-induced protein 1 (PMAIP1), hydrogen voltage-gated channel 1 (HVCN1), and β-arrestin 1 (ARRB1). The up-regulation of these genes after therapy were confirmed by real time PCR. In airway neutrophils, 1029 genes were differentially expressed post- vs pre-therapy. Of these, 30 genes were up-regulated and 75 down-regulated following antibiotic treatment. However, biological plausibility determined that only down-regulated genes belonged to the gene classes studied for blood neutrophils. Finally, it was observed that commonly expressed genes showed a greater variability in airway neutrophils than that found in blood neutrophils, both before and
Mao, Ke; Dong, Qinglong; Li, Chao; Liu, Changhai; Ma, Fengwang
The bHLH (basic helix-loop-helix) transcription factor family is the second largest in plants. It occurs in all three eukaryotic kingdoms, and plays important roles in regulating growth and development. However, family members have not previously been studied in apple. Here, we identified 188 MdbHLH proteins in apple "Golden Delicious" ( Malus × domestica Borkh.), which could be classified into 18 groups. We also investigated the gene structures and 12 conserved motifs in these MdbHLH s. Coupled with expression analysis and protein interaction network prediction, we identified several genes that might be responsible for abiotic stress responses. This study provides insight and rich resources for subsequent investigations of such proteins in apple.
Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C
Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Hu, Wei; Yan, Yan; Shi, Haitao; Liu, Juhua; Miao, Hongxia; Tie, Weiwei; Ding, Zehong; Ding, XuPo; Wu, Chunlai; Liu, Yang; Wang, Jiashui; Xu, Biyu; Jin, Zhiqiang
Abscisic acid (ABA) signaling plays a crucial role in developmental and environmental adaptation processes of plants. However, the PYL-PP2C-SnRK2 families that function as the core components of ABA signaling are not well understood in banana. In the present study, 24 PYL, 87 PP2C, and 11 SnRK2 genes were identified from banana, which was further supported by evolutionary relationships, conserved motif and gene structure analyses. The comprehensive transcriptomic analyses showed that banana PYL-PP2C-SnRK2 genes are involved in tissue development, fruit development and ripening, and response to abiotic stress in two cultivated varieties. Moreover, comparative expression analyses of PYL-PP2C-SnRK2 genes between BaXi Jiao (BX) and Fen Jiao (FJ) revealed that PYL-PP2C-SnRK2-mediated ABA signaling might positively regulate banana fruit ripening and tolerance to cold, salt, and osmotic stresses. Finally, interaction networks and co-expression assays demonstrated that the core components of ABA signaling were more active in FJ than in BX in response to abiotic stress, further supporting the crucial role of the genes in tolerance to abiotic stress in banana. This study provides new insights into the complicated transcriptional control of PYL-PP2C-SnRK2 genes, improves the understanding of PYL-PP2C-SnRK2-mediated ABA signaling in the regulation of fruit development, ripening, and response to abiotic stress, and identifies some candidate genes for genetic improvement of banana.
Gao, Junpeng; Cao, Xiaoli; Shi, Shandang; Ma, Yuling; Wang, Kai; Liu, Shengjie; Chen, Dan; Chen, Qin; Ma, Haoli
The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. - Highlights: • A systematic analysis of the potato AUX/IAA gene family were performed. • StIAA genes were related to auxin perception and signal transduction. • Candidate StIAA genes likely related to tuber initiation and expansion were screened.
Gao, Junpeng [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China); Innovation Experimental College, Northwest A& F University, Yangling, Shaanxi 712100 (China); Cao, Xiaoli; Shi, Shandang [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China); Ma, Yuling [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China); Innovation Experimental College, Northwest A& F University, Yangling, Shaanxi 712100 (China); Wang, Kai; Liu, Shengjie [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China); Chen, Dan [School of Life Sciences and Technology, Xidian University, Xi' an, Shaanxi 710071 (China); Chen, Qin [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China); Ma, Haoli, E-mail: email@example.com [State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A& F University, Yangling, Shaanxi 712100 (China)
The Auxin/indole-3-acetic acid (Aux/IAA) genes encode short-lived nuclear proteins that are known to be involved in the primary cellular responses to auxin. To date, systematic analysis of the Aux/IAA genes in potato (Solanum tuberosum) has not been conducted. In this study, a total of 26 potato Aux/IAA genes were identified (designated from StIAA1 to StIAA26), and the distribution of four conserved domains shared by the StIAAs were analyzed based on multiple sequence alignment and a motif-based sequence analysis. A phylogenetic analysis of the Aux/IAA gene families of potato and Arabidopsis was also conducted. In order to assess the roles of StIAA genes in tuber development, the results of RNA-seq studies were reformatted to analyze the expression patterns of StIAA genes, and then verified by quantitative real-time PCR. A large number of StIAA genes (12 genes) were highly expressed in stolon organs and in during the tuber initiation and expansion developmental stages, and most of these genes were responsive to indoleacetic acid treatment. Our results suggested that StIAA genes were involved in the process of tuber development and provided insights into functional roles of potato Aux/IAA genes. - Highlights: • A systematic analysis of the potato AUX/IAA gene family were performed. • StIAA genes were related to auxin perception and signal transduction. • Candidate StIAA genes likely related to tuber initiation and expansion were screened.
Li, Hui; Wang, Yu; Wu, Mei; Li, Lihong; Li, Cong; Han, Zhanpin; Yuan, Jiye; Chen, Chengbin; Song, Wenqin; Wang, Chunguo
The AP2/ERF transcription factors (TFs) comprise one of the largest gene superfamilies in plants. These TFs perform vital roles in plant growth, development, and responses to biotic and abiotic stresses. In this study, 171 AP2/ERF TFs were identified in cauliflower (Brassica oleracea L. var. botrytis), one of the most important horticultural crops in Brassica. Among these TFs, 15, 9, and 1 TFs were classified into the AP2, RAV, and Soloist family, respectively. The other 146 TFs belong to ERF family, which were further divided into the ERF and DREB subfamilies. The ERF subfamily contained 91 TFs, while the DREB subfamily contained 55 TFs. Phylogenetic analysis results indicated that the AP2/ERF TFs can be classified into 13 groups, in which 25 conserved motifs were confirmed. Some motifs were group- or subgroup- specific, implying that they are significant to the functions of the AP2/ERF TFs of these clades. In addition, 35 AP2/ERF TFs from the 13 groups were selected randomly and then used for expression pattern analysis under salt and drought stresses. The majority of these AP2/ERF TFs exhibited positive responses to these stress conditions. In specific, Bra-botrytis-ERF054a, Bra-botrytis-ERF056, and Bra-botrytis-CRF2a demonstrated rapid responses. By contrast, six AP2/ERF TFs were showed to delay responses to both stresses. The AP2/ERF TFs exhibiting specific expression patterns under salt or drought stresses were also confirmed. Further functional analysis indicated that ectopic overexpression of Bra-botrytis-ERF056 could increase tolerance to both salt and drought treatments. These findings provide new insights into the AP2/ERF TFs present in cauliflower, and offer candidate AP2/ERF TFs for further studies on their roles in salt and drought stress tolerance. PMID:28642765
Dijkstra, Akkelies E; Smolonska, Joanna; van den Berge, Maarten
by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism...... (SNP). RESULTS: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6), OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression...... of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. METHODS: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed...
Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.
Genome-Wide Analyses of the NAC Transcription Factor Gene Family in Pepper (Capsicum annuum L.: Chromosome Location, Phylogeny, Structure, Expression Patterns, Cis-Elements in the Promoter, and Interaction Network
Full Text Available The NAM, ATAF1/2, and CUC2 (NAC transcription factors form a large plant-specific gene family, which is involved in the regulation of tissue development in response to biotic and abiotic stress. To date, there have been no comprehensive studies investigating chromosomal location, gene structure, gene phylogeny, conserved motifs, or gene expression of NAC in pepper (Capsicum annuum L.. The recent release of the complete genome sequence of pepper allowed us to perform a genome-wide investigation of Capsicum annuum L. NAC (CaNAC proteins. In the present study, a comprehensive analysis of the CaNAC gene family in pepper was performed, and a total of 104 CaNAC genes were identified. Genome mapping analysis revealed that CaNAC genes were enriched on four chromosomes (chromosomes 1, 2, 3, and 6. In addition, phylogenetic analysis of the NAC domains from pepper, potato, Arabidopsis, and rice showed that CaNAC genes could be clustered into three groups (I, II, and III. Group III, which contained 24 CaNAC genes, was exclusive to the Solanaceae plant family. Gene structure and protein motif analyses showed that these genes were relatively conserved within each subgroup. The number of introns in CaNAC genes varied from 0 to 8, with 83 (78.9% of CaNAC genes containing two or less introns. Promoter analysis confirmed that CaNAC genes are involved in pepper growth, development, and biotic or abiotic stress responses. Further, the expression of 22 selected CaNAC genes in response to seven different biotic and abiotic stresses [salt, heat shock, drought, Phytophthora capsici, abscisic acid, salicylic acid (SA, and methyl jasmonate (MeJA] was evaluated by quantitative RT-PCR to determine their stress-related expression patterns. Several putative stress-responsive CaNAC genes, including CaNAC72 and CaNAC27, which are orthologs of the known stress-responsive Arabidopsis gene ANAC055 and potato gene StNAC30, respectively, were highly regulated by treatment with
Full Text Available Because most human stroke victims are elderly, studies of experimental stroke in the aged rather than the young rat model may be optimal for identifying clinically relevant cellular responses, as well for pinpointing beneficial interventions.We employed the Affymetrix platform to analyze the whole-gene transcriptome following temporary ligation of the middle cerebral artery in aged and young rats. The correspondence, heat map, and dendrogram analyses independently suggest a differential, age-group-specific behaviour of major gene clusters after stroke. Overall, the pattern of gene expression strongly suggests that the response of the aged rat brain is qualitatively rather than quantitatively different from the young, i.e. the total number of regulated genes is comparable in the two age groups, but the aged rats had great difficulty in mounting a timely response to stroke. Our study indicates that four genes related to neuropathic syndrome, stress, anxiety disorders and depression (Acvr1c, Cort, Htr2b and Pnoc may have impaired response to stroke in aged rats. New therapeutic options in aged rats may also include Calcrl, Cyp11b1, Prcp, Cebpa, Cfd, Gpnmb, Fcgr2b, Fcgr3a, Tnfrsf26, Adam 17 and Mmp14. An unexpected target is the enzyme 3-hydroxy-3-methylglutaryl-Coenzyme A synthase 1 in aged rats, a key enzyme in the cholesterol synthesis pathway. Post-stroke axonal growth was compromised in both age groups.We suggest that a multi-stage, multimodal treatment in aged animals may be more likely to produce positive results. Such a therapeutic approach should be focused on tissue restoration but should also address other aspects of patient post-stroke therapy such as neuropathic syndrome, stress, anxiety disorders, depression, neurotransmission and blood pressure.
Nguyen, Van Ngoc Tuyet; Moon, Sunok; Jung, Ki-Hong
Although the super family of ATP-binding cassette (ABC) proteins plays key roles in the physiology and development of plants, the functions of members of this interesting family mostly remain to be clarified, especially in crop plants. Thus, systematic analysis of this family in rice (Oryza sativa), a major model crop plant, will be helpful in the design of effective strategies for functional analysis. Phylogenomic analysis that integrates anatomy and stress meta-profiling data based on a large collection of rice Affymetrix array data into the phylogenic context provides useful clues into the functions for each of the ABC transporter family members in rice. Using anatomy data, we identified 17 root-preferred and 16-shoot preferred genes at the vegetative stage, and 3 pollen, 2 embryo, 2 ovary, 2 endosperm, and 1 anther-preferred gene at the reproductive stage. The stress data revealed significant up-regulation or down-regulation of 47 genes under heavy metal treatment, 16 genes under nutrient deficient conditions, and 51 genes under abiotic stress conditions. Of these, we confirmed the differential expression patterns of 14 genes in root samples exposed to drought stress using quantitative real-time PCR. Network analysis using RiceNet suggests a functional gene network involving nine rice ABC transporters that are differentially regulated by drought stress in root, further enhancing the prediction of biological function. Our analysis provides a molecular basis for the study of diverse biological phenomena mediated by the ABC family in rice and will contribute to the enhancement of crop yield and stress tolerance. Copyright © 2014 Elsevier GmbH. All rights reserved.
Full Text Available As Ca2+ sensors and effectors, calcium-dependent protein kinases (CDPKs play important roles in regulating the downstream components of calcium signaling, which are ubiquitously involved in plant growth, development, and response to environmental cues. However, no CDPKs have been characterized in Capsicum annuum thus far. Herein, a comprehensive analysis of genes encoding pepper CDPKs and CDPK-related protein kinases (CRKs was performed, and 31 CDPK genes and five closely related kinase genes were identified, which were phylogenetically divided into four distinct subfamilies and unevenly distributed across nine chromosomes. Conserved sequence and exon-intron structures were found to be shared by pepper CDPKs within the same subfamily, and the expansion of the CaCPK family in pepper was found to be due to segmental duplication events. Five CDPKs in the Capsicum annuum variety CM334 were found to be mutated in the Chiltepin variety, and one CDPK present in CM334 was lost in Chiltepin. The majority of CDPK and CRK genes were expressed in different pepper tissues and developmental stages, and 10, 12, and eight CDPK genes were transcriptionally modified by salt, heat, and Ralstonia solanacearum stresses, respectively. Furthermore, these genes were found to respond specifically to one stress as well as respond synergistically to two stresses or three stresses, suggesting that these CDPK genes might be involved in the specific or synergistic response of pepper to salt, heat, and R. solanacearum. Our results lay the foundation for future functional characterization of pepper CDPK and its closely related gene families.
Apr 1, 2010 ... Genome-wide association studies (GWAS) examine the entire human genome with the goal of identifying genetic variants. (usually single nucleotide polymorphisms (SNPs)) that are associated with phenotypic traits such as disease status and drug response. The discordance of significantly associated ...
Full Text Available Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution.
Levinson, Douglas F; Shi, Jianxin; Wang, Kai
The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....
Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H
Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia.......Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....
SNPs from the African American breast cancer scan to COGs , a European collaborative study which is has designed a SNP array with that will be genotyped...Award Number: W81XWH-08-1-0383 TITLE: A Genome-wide Breast Cancer Scan in African Americans PRINCIPAL INVESTIGATOR: Christopher A...SUBTITLE A Genome-wide Breast Cancer Scan in African Americans 5a. CONTRACT NUMBER 5b. GRANT NUMBER W81XWH-08-1-0383 5c. PROGRAM
Full Text Available Abstract Background Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. Results We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. Conclusions The GWAMA (Genome-Wide Association Meta-Analysis software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967
Willour, Virginia L.; Seifuddin, Fayaz; Mahon, Pamela B.; Jancic, Dubravka; Pirooznia, Mehdi; Steele, Jo; Schweizer, Barbara; Goes, Fernando S.; Mondimore, Francis M.; MacKinnon, Dean F.; Perlis, Roy H.; Lee, Phil Hyoun; Huang, Jie; Kelsoe, John R.; Shilling, Paul D.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nicholas; DePaulo, J. Raymond; Schulze, Thomas G.; McMahon, Francis J.; Zandi, Peter P.; Potash, James B.
The heritable component to attempted and completed suicide is partly related to psychiatric disorders and also partly independent of them. While attempted suicide linkage regions have been identified on 2p11–12 and 6q25–26, there are likely many more such loci, the discovery of which will require a much higher resolution approach, such as the genome-wide association study (GWAS). With this in mind, we conducted an attempted suicide GWAS that compared the single nucleotide polymorphism (SNP) genotypes of 1,201 bipolar (BP) subjects with a history of suicide attempts to the genotypes of 1,497 BP subjects without a history of suicide attempts. 2,507 SNPs with evidence for association at p<0.001 were identified. These associated SNPs were subsequently tested for association in a large and independent BP sample set. None of these SNPs were significantly associated in the replication sample after correcting for multiple testing, but the combined analysis of the two sample sets produced an association signal on 2p25 (rs300774) at the threshold of genome-wide significance (p= 5.07 × 10−8). The associated SNPs on 2p25 fall in a large linkage disequilibrium block containing the ACP1 gene, a gene whose expression is significantly elevated in BP subjects who have completed suicide. Furthermore, the ACP1 protein is a tyrosine phosphatase that influences Wnt signaling, a pathway regulated by lithium, making ACP1 a functional candidate for involvement in the phenotype. Larger GWAS sample sets will be required to confirm the signal on 2p25 and to identify additional genetic risk factors increasing susceptibility for attempted suicide. PMID:21423239
Trinidad, Susan Brown; Fullerton, Stephanie M; Bares, Julie M; Jarvik, Gail P; Larson, Eric B; Burke, Wylie
Sharing study data within the research community generates tension between two important goods: promoting scientific goals and protecting the privacy interests of study participants. This study was designed to explore the perceptions, beliefs, and attitudes of research participants and possible future participants regarding genome-wide association studies and repository-based research. Focus group sessions with (1) current research participants, (2) surrogate decision-makers, and (3) three age-defined cohorts (18-34 years, 35-50, >50). Participants expressed a variety of opinions about the acceptability of wide sharing of genetic and phenotypic information for research purposes through large, publicly accessible data repositories. Most believed that making de-identified study data available to the research community is a social good that should be pursued. Privacy and confidentiality concerns were common, although they would not necessarily preclude participation. Many participants voiced reservations about sharing data with for-profit organizations. Trust is central in participants' views regarding data sharing. Further research is needed to develop governance models that enact the values of stewardship.
Akkelies E Dijkstra
Full Text Available Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations.GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP.A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.
Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001
Rönn, Tina; Volkov, Petr; Gillberg, Linn
Increased age, BMI and HbA1c levels are risk factors for several non-communicable diseases. However, the impact of these factors on the genome-wide DNA methylation pattern in human adipose tissue remains unknown. We analyzed the DNA methylation of ∼480 000 sites in human adipose tissue from 96 ma...
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095
Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda
Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP a...
Full Text Available Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease.With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC study (n = 8,747.Subjects were, on average, 54 (range 45-64 years old at baseline enrollment, 47% (n = 4,117 were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8 ± 1.8 and 7.6 ± 2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015 on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn.Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament.
Full Text Available Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV. A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown.
Oskari Kilpeläinen, Tuomas
Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...
Full Text Available The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level.
Ripke, S.; Sanders, A. R.; Kendler, K. S.; Levinson, D. F.; Sklar, P.; Holmans, P. A.; Lin, D. Y.; Duan, J.; Ophoff, R. A.; Andreassen, O. A.; Scolnick, E.; Cichon, S.; St Clair, D.; Corvin, A.; Gurling, H.; Werge, T.; Rujescu, D.; Blackwood, D. H.; Pato, C. N.; Malhotra, A. K.; Purcell, S.; Dudbridge, F.; Neale, B. M.; Rossin, L.; Visscher, P. M.; Posthuma, D.; Ruderfer, D. M.; Fanous, A.; Stefansson, H.; Steinberg, S.; Mowry, B. J.; Golimbet, V.; de Hert, M.; Jonsson, E. G.; Bitter, I.; Pietilainen, O. P.; Collier, D. A.; Tosato, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bass, N.; Bergen, S. E.; Black, D. W.; Borglum, A. D.; Brown, M. A.; Bruggeman, R.; Buccola, N. G.; Byerley, W. F.; Cahn, W.; Cantor, R. M.; Carr, V. J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Craddock, N.; Danoy, P. A.; Datta, S.; de Haan, L.; Demontis, D.; Dikeos, D.; Djurovic, S.; Donnely, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fink-Jensen, A.; Freedman, R.; Freimer, N. B.; Friedl, M.; Georgieva, L.; Giegling, I.; Gill, M.; Glenthoj, B.; Godard, S.; Hamshere, M.; Hansen, M.; Hartmann, A. M.; Henskens, F. A.; Hougaard, D. M.; Hultman, C. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K. D.; Jay, M.; Jurgens, G.; Kahn, R. S.; Keller, M. C.; Kenis, G.; Kenny, E.; Kim, Y.; Kirov, G. K.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lawrence, J.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lichtenstein, P.; Lieberman, J. A.; Linszen, D. H.; Lonnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B. S.; Maier, W.; Mallet, J.; Malloy, P.; Mattheisen, M.; Mattingsdal, M.; McGhee, K. A.; McGrath, J. J.; McIntosh, A.; McLean, D. E.; McQuillin, A.; Melle, I.; Michie, P. T.; Milanova, V.; Morris, D. W.; Mors, O.; Mortensen, P. B.; Moskvina, V.; Muglia, P.; Myin-Germeys, I.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nikolov, I.; Nordentoft, M.; Norton, N.; Nothen, M. M.; O'Dushlaine, C. T.; Olincy, A.; Olsen, L.; O'Neill, F. A.; Orntoft, T. F.; Owen, M. J.; Pantelis, C.; Papadimitriou, G.; Pato, M. T.; Peltonen, L.; Petursson, H.; Pickard, B.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Quinn, E. M.; Rasmussen, H. B.; Rethelyi, J. M.; Ribble, R.; Rietschel, M.; Riley, B. P.; Ruggeri, M.; Schall, U.; Schulze, T. G.; Schwab, S. G.; Scott, R. J.; Shi, J.; Sigurdsson, E.; Silvermann, J. M.; Spencer, C. C.; Stefansson, K.; Strange, A.; Strengman, E.; Stroup, T. S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Thygesen, J. H.; Timm, S.; Toncheva, D.; van den Oord, E.; van Os, J.; van Winkel, R.; Veldink, J.; Walsh, D.; Wang, A. G.; Wiersma, D.; Wildenauer, D. B.; Williams, H. J.; Williams, N. M.; Wormley, B.; Zammit, S.; Sullivan, P. F.; O'Donovan, M. C.; Daly, M. J.; Gejman, P. V.
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded
Beekman, Marian; Blanché, Hélène; Perola, Markus
Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian...
Scharf, J. M.; Yu, D.; Mathews, C. A.; Neale, B. M.; Stewart, S. E.; Fagerness, J. A.; Evans, P.; Gamazon, E.; Edlund, C. K.; Service, S. K.; Tikhomirov, A.; Osiecki, L.; Illmann, C.; Pluzhnikov, A.; Konkashbaev, A.; Davis, L. K.; Han, B.; Crane, J.; Moorjani, P.; Crenshaw, A. T.; Parkin, M. A.; Reus, V. I.; Lowe, T. L.; Rangel-Lugo, M.; Chouinard, S.; Dion, Y.; Girard, S.; Cath, D. C.; Smit, J. H.; King, R. A.; Fernandez, T. V.; Leckman, J. F.; Kidd, K. K.; Kidd, J. R.; Pakstis, A. J.; State, M. W.; Herrera, L. D.; Romero, R.; Fournier, E.; Sandor, P.; Barr, C. L.; Phan, N.; Gross-Tsur, V.; Benarroch, F.; Pollak, Y.; Budman, C. L.; Bruun, R. D.; Erenberg, G.; Naarden, A. L.; Hoekstra, P. J.
Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).
Oskari Kilpeläinen, Tuomas; Ingelsson, Erik
Adiposity is strongly heritable and one of the leading risk factors for type 2 diabetes, cardiovascular disease, cancer, and premature death. In the past 8 years, genome-wide association studies (GWAS) have greatly increased our understanding of the genes and biological pathways that regulate...
Conclusion: We identified the consistence and specific DEGs of human placenta and umbilical cord based on the genome-wide comparison. Our results indicated that hMSCs derived from umbilical cord and placenta have different gene expression patterns, and most of specific genes are involved in the cell cycle, cell division, cell death, and cell developmental processes.
Paul, Petra; van den Hoorn, Tineke; Jongsma, Marlieke L. M.; Bakker, Mark J.; Hengeveld, Rutger; Janssen, Lennert; Cresswell, Peter; Egan, David A.; van Ham, Marieke; ten Brinke, Anja; Ovaa, Huib; Beijersbergen, Roderick L.; Kuijl, Coenraad; Neefjes, Jacques
MHC class II molecules (MHC-II) present peptides to T helper cells to facilitate immune responses and are strongly linked to autoimmune diseases. To unravel processes controlling MHC-II antigen presentation, we performed a genome-wide flow cytometry-based RNAi screen detecting MHC-II expression and
Power, Robert A; Parkhill, Julian; de Oliveira, Tulio
The reduced costs of sequencing have led to whole-genome sequences for a large number of microorganisms, enabling the application of microbial genome-wide association studies (GWAS). Given the successes of human GWAS in understanding disease aetiology and identifying potential drug targets, microbial GWAS are likely to further advance our understanding of infectious diseases. These advances include insights into pressing global health problems, such as antibiotic resistance and disease transmission. In this Review, we outline the methodologies of GWAS, the current state of the field of microbial GWAS, and how lessons from human GWAS can direct the future of the field.
Xu, Zhuofei; Zhou, Rui
As is well known, pathogenic microbes evolve rapidly to escape from the host immune system and antibiotics. Genetic variations among microbial populations occur frequently during the long-term pathogen–host evolutionary arms race, and individual mutation beneficial for the fitness can be fixed...... to scan genome-wide alignments for evidence of positive Darwinian selection, recombination, and other evolutionary forces operating on the coding regions. In this chapter, we describe an integrative analysis pipeline and its application to tracking featured evolutionary trajectories on the genome...
Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W
In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.
Renton, Alan E.; Pliner, Hannah A.; Provenzano, Carlo; Evoli, Amelia; Ricciardi, Roberta; Nalls, Michael A.; Marangi, Giuseppe; Abramzon, Yevgeniya; Arepalli, Sampath; Chong, Sean; Hernandez, Dena G.; Johnson, Janel O.; Bartoccioni, Emanuela; Scuderi, Flavia; Maestri, Michelangelo; Raphael Gibbs, J.; Errichiello, Edoardo; Chiò, Adriano; Restagno, Gabriella; Sabatelli, Mario; Macek, Mark; Scholz, Sonja W.; Corse, Andrea; Chaudhry, Vinay; Benatar, Michael; Barohn, Richard J.; McVey, April; Pasnoor, Mamatha; Dimachkie, Mazen M.; Rowin, Julie; Kissel, John; Freimer, Miriam; Kaminski, Henry J.; Sanders, Donald B.; Lipscomb, Bernadette; Massey, Janice M.; Chopra, Manisha; Howard, James F.; Koopman, Wilma J.; Nicolle, Michael W.; Pascuzzi, Robert M.; Pestronk, Alan; Wulf, Charlie; Florence, Julaine; Blackmore, Derrick; Soloway, Aimee; Siddiqi, Zaeem; Muppidi, Srikanth; Wolfe, Gil; Richman, David; Mezei, Michelle M.; Jiwa, Theresa; Oger, Joel; Drachman, Daniel B.; Traynor, Bryan J.
IMPORTANCE Myasthenia gravis is a chronic, autoimmune, neuromuscular disease characterized by fluctuating weakness of voluntary muscle groups. Although genetic factors are known to play a role in this neuroimmunological condition, the genetic etiology underlying myasthenia gravis is not well understood. OBJECTIVE To identify genetic variants that alter susceptibility to myasthenia gravis, we performed a genome-wide association study. DESIGN, SETTING, AND PARTICIPANTS DNA was obtained from 1032 white individuals from North America diagnosed as having acetylcholine receptor antibody–positive myasthenia gravis and 1998 race/ethnicity-matched control individuals from January 2010 to January 2011. These samples were genotyped on Illumina OmniExpress single-nucleotide polymorphism arrays. An independent cohort of 423 Italian cases and 467 Italian control individuals were used for replication. MAIN OUTCOMES AND MEASURES We calculated P values for association between 8114394 genotyped and imputed variants across the genome and risk for developing myasthenia gravis using logistic regression modeling. A threshold P value of 5.0 × 10−8 was set for genome-wide significance after Bonferroni correction for multiple testing. RESULTS In the over all case-control cohort, we identified association signals at CTLA4 (rs231770; P = 3.98 × 10−8; odds ratio, 1.37; 95% CI, 1.25–1.49), HLA-DQA1 (rs9271871; P = 1.08 × 10−8; odds ratio, 2.31; 95% CI, 2.02 – 2.60), and TNFRSF11A (rs4263037; P = 1.60 × 10−9; odds ratio, 1.41; 95% CI, 1.29–1.53). These findings replicated for CTLA4 and HLA-DQA1 in an independent cohort of Italian cases and control individuals. Further analysis revealed distinct, but overlapping, disease-associated loci for early- and late-onset forms of myasthenia gravis. In the late-onset cases, we identified 2 association peaks: one was located in TNFRSF11A (rs4263037; P = 1.32 × 10−12; odds ratio, 1.56; 95% CI, 1.44–1.68) and the other was detected
Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed Missael Vargas
Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence...... data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery...
Sap, J; D'Eustachio, P; Givol, D
We describe the identification of a widely expressed receptor-type (transmembrane) protein tyrosine phosphatase (PTPase; EC 220.127.116.11). Screening of a mouse brain cDNA library under low-stringency conditions with a probe encompassing the intracellular (phosphatase) domain of the CD45 lymphocyte...... antigen yielded cDNA clones coding for a 794-amino acid transmembrane protein [hereafter referred to as receptor protein tyrosine phosphatase alpha (R-PTP-alpha)] with an intracellular domain displaying clear homology to the catalytic domains of CD45 and LAR (45% and 53%, respectively). The 142-amino acid...
Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.
Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi
WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.
Zhang, Na; Huang, Xing; Bao, Yaning; Wang, Bo; Zeng, Hongxia; Cheng, Weishun; Tang, Mi; Li, Yuhua; Ren, Jian; Sun, Yuhong
The early auxin responsive SAUR family is an important gene family in auxin signal transduction. We here present the first report of a genome-wide identification of SAUR genes in watermelon genome. We successfully identified 65 ClaSAURs and provide a genomic framework for future study on these genes. Phylogenetic result revealed a Cucurbitaceae-specific SAUR subfamily and contribute to understanding of the evolutionary pattern of SAUR genes in plants. Quantitative RT-PCR analysis demonstrates the existed expression of 11 randomly selected SAUR genes in watermelon tissues. ClaSAUR36 was highly expressed in fruit, for which further study might bring a new prospective for watermelon fruit development. Moreover, correlation analysis revealed the similar expression profiles of SAUR genes between watermelon and Arabidopsis during shoot organogenesis. This work gives us a new support for the conserved auxin machinery in plants.
Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.
Full Text Available Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr. We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set.
Xavier, Alencar; Muir, William M; Rainey, Katy Martin
Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. Copyright © 2016 Xavie et al.
Lang, M; Leménager, T; Streit, F; Fauth-Bühler, M; Frank, J; Juraeva, D; Witt, S H; Degenhardt, F; Hofmann, A; Heilmann-Heimbach, S; Kiefer, F; Brors, B; Grabe, H-J; John, U; Bischof, A; Bischof, G; Völker, U; Homuth, G; Beutel, M; Lind, P A; Medland, S E; Slutske, W S; Martin, N G; Völzke, H; Nöthen, M M; Meyer, C; Rumpf, H-J; Wurst, F M; Rietschel, M; Mann, K F
Pathological gambling is a behavioural addiction with negative economic, social, and psychological consequences. Identification of contributing genes and pathways may improve understanding of aetiology and facilitate therapy and prevention. Here, we report the first genome-wide association study of pathological gambling. Our aims were to identify pathways involved in pathological gambling, and examine whether there is a genetic overlap between pathological gambling and alcohol dependence. Four hundred and forty-five individuals with a diagnosis of pathological gambling according to the Diagnostic and Statistical Manual of Mental Disorders were recruited in Germany, and 986 controls were drawn from a German general population sample. A genome-wide association study of pathological gambling comprising single marker, gene-based, and pathway analyses, was performed. Polygenic risk scores were generated using data from a German genome-wide association study of alcohol dependence. No genome-wide significant association with pathological gambling was found for single markers or genes. Pathways for Huntington's disease (P-value=6.63×10(-3)); 5'-adenosine monophosphate-activated protein kinase signalling (P-value=9.57×10(-3)); and apoptosis (P-value=1.75×10(-2)) were significant. Polygenic risk score analysis of the alcohol dependence dataset yielded a one-sided nominal significant P-value in subjects with pathological gambling, irrespective of comorbid alcohol dependence status. The present results accord with previous quantitative formal genetic studies which showed genetic overlap between non-substance- and substance-related addictions. Furthermore, pathway analysis suggests shared pathology between Huntington's disease and pathological gambling. This finding is consistent with previous imaging studies. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.
Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924
Full Text Available Asperger Syndrome (AS is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC, which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448 were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448 lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.
Sempéré, Guilhem; Philippe, Florian; Dereeper, Alexis; Ruiz, Manuel; Sarah, Gautier; Larmande, Pierre
Exploring the structure of genomes and analyzing their evolution is essential to understanding the ecological adaptation of organisms. However, with the large amounts of data being produced by next-generation sequencing, computational challenges arise in terms of storage, search, sharing, analysis and visualization. This is particularly true with regards to studies of genomic variation, which are currently lacking scalable and user-friendly data exploration solutions. Here we present Gigwa, a web-based tool that provides an easy and intuitive way to explore large amounts of genotyping data by filtering it not only on the basis of variant features, including functional annotations, but also on genotype patterns. The data storage relies on MongoDB, which offers good scalability properties. Gigwa can handle multiple databases and may be deployed in either single- or multi-user mode. In addition, it provides a wide range of popular export formats. The Gigwa application is suitable for managing large amounts of genomic variation data. Its user-friendly web interface makes such processing widely accessible. It can either be simply deployed on a workstation or be used to provide a shared data portal for a given community of researchers.
Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Katharine J Sepp
Full Text Available While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new
Chen, Piaojuan; Li, Yi; Zhao, Lihua; Hou, Zhimin; Yan, Maokai; Hu, Bingyan; Liu, Yanhui; Azam, Syed Muhammad; Zhang, Ziyan; Rahman, Zia Ur; Liu, Liping; Qin, Yuan
Pineapple ( Ananas comosus L .) cultivation commonly relies on asexual reproduction which is easily impeded by many factors in agriculture production. Sexual reproduction might be a novel approach to improve the pineapple planting. However, genes controlling pineapple sexual reproduction are still remain elusive. In different organisms a conserved superfamily proteins known as ATP binding cassette (ABC) participate in various biological processes. Whereas, till today the ABC gene family has not been identified in pineapple. Here 100 ABC genes were identified in the pineapple genome and grouped into eight subfamilies (5 ABCAs , 20 ABCB s, 16 ABCCs , 2 ABCDs , one ABCEs , 5 ABCFs , 42 ABCGs and 9 ABCIs ). Gene expression profiling revealed the dynamic expression pattern of ABC gene family in various tissues and different developmental stages. AcABCA5, AcABCB6, AcABCC4 , AcABCC7 , AcABCC9 , AcABCG26 , AcABCG38 and AcABCG42 exhibited preferential expression in ovule and stamen. Over-expression of AcABCG38 in the Arabidopsis double mutant abcg1-2abcg16-2 partially restored its pollen abortion defects, indicating that AcABCG38 plays important roles in pollen development. Our study on ABC gene family in pineapple provides useful information for developing sexual pineapple plantation which could be utilized to improve pineapple agricultural production.
Li, Lei; Wang, Xiangfeng; Stolc, Viktor
. We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions...... that share similar compositional properties with the annotated exons and have significant homology to other plant proteins. Elucidating and mapping of all transcribed regions revealed an association between global transcription and cytological chromosome features, and an overall similarity of transcriptional......Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species...
Full Text Available Abstract Background One of the consequences of the rapid and widespread adoption of high-throughput experimental technologies is an exponential increase of the amount of data produced by genome-wide experiments. Researchers increasingly need to handle very large volumes of heterogeneous data, including both the data generated by their own experiments and the data retrieved from publicly available repositories of genomic knowledge. Integration, exploration, manipulation and interpretation of data and information therefore need to become as automated as possible, since their scale and breadth are, in general, beyond the limits of what individual researchers and the basic data management tools in normal use can handle. This paper describes Genephony, a tool we are developing to address these challenges. Results We describe how Genephony can be used to manage large datesets of genomic information, integrating them with existing knowledge repositories. We illustrate its functionalities with an example of a complex annotation task, in which a set of SNPs coming from a genotyping experiment is annotated with genes known to be associated to a phenotype of interest. We show how, thanks to the modular architecture of Genephony and its user-friendly interface, this task can be performed in a few simple steps. Conclusion Genephony is an online tool for the manipulation of large datasets of genomic information. It can be used as a browser for genomic data, as a high-throughput annotation tool, and as a knowledge discovery tool. It is designed to be easy to use, flexible and extensible. Its knowledge management engine provides fine-grained control over individual data elements, as well as efficient operations on large datasets.
Dogan, Meeshanthini V; Beach, Steven R H; Philibert, Robert A
Smoking is the leading cause of death in the United States. It exerts its effects by increasing susceptibility to a variety of complex disorders among those who smoke, and if pregnant, to their unborn children. In prior efforts to understand the epigenetic mechanisms through which this increased vulnerability is conveyed, a number of investigators have conducted genome wide methylation analyses. Unfortunately, secondary to methodological limitations, these studies were unable to examine methylation in gene regions with significant amounts of genetic variation. Using genome wide genetic and epigenetic data from the Framingham Heart Study, we re-examined the relationship of smoking status to genome wide methylation status. When only methylation status is considered, smoking was significantly associated with differential methylation in 310 genes that map to a variety of biological process and cellular differentiation pathways. However, when SNP effects on the magnitude of smoking associated methylation changes are also considered, cis and trans-interaction effects were noted at a total of 266 and 4353 genes with no marked enrichment for any biological pathways. Furthermore, the SNP variation participating in the significant interaction effects is enriched for loci previously associated with complex medical illnesses. The enlarged scope of the methylome shown to be affected by smoking may better explicate the mediational pathways linking smoking with a myriad of smoking related complex syndromes. Additionally, these results strongly suggest that combined epigenetic and genetic data analyses may be critical for a more complete understanding of the relationship between environmental variables, such as smoking, and pathophysiological outcomes. © 2017 Wiley Periodicals, Inc.
Full Text Available Since the first report of a genome-wide association study (GWAS on human age-related macular degeneration, GWAS has successfully been used to discover genetic variants for a variety of complex human diseases and/or traits, and thousands of associated loci have been identified. However, the underlying mechanisms for these loci remain largely unknown. To make these GWAS findings more useful, it is necessary to perform in-depth data mining. The data analysis in the post-GWAS era will include the following aspects: fine-mapping of susceptibility regions to identify susceptibility genes for elucidating the biological mechanism of action; joint analysis of susceptibility genes in different diseases; integration of GWAS, transcriptome, and epigenetic data to analyze expression and methylation quantitative trait loci at the whole-genome level, and find single-nucleotide polymorphisms that influence gene expression and DNA methylation; genome-wide association analysis of disease-related DNA copy number variations. Applying these strategies and methods will serve to strengthen GWAS data to enhance the utility and significance of GWAS in improving understanding of the genetics of complex diseases or traits and translate these findings for clinical applications. Keywords: Genome-wide association study, Data mining, Integrative data analysis, Polymorphism, Copy number variation
Gong, Jian; Hsu, Li; Harrison, Tabitha
Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated...... this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening and the Women’s Health Initiative (WHI). We...... tested association between 2,474,333 single nucleotide polymorphisms (SNPs) and serum selenium concentrations using linear regression models. In the first stage (PLCO) 41 SNPs clustered in 15 regions had p
Full Text Available Drought is a common and recurring climatic condition in many parts of the world, and it can have disastrous impacts on plant growth and development. Many genes involved in the drought response of plants have been identified. Transcriptome, microRNA (miRNA, and degradome analyses are rapid ways of identifying drought-responsive genes. The reference genome sequence of Paulownia fortunei (Seem Hemsl. is now available, which makes it easier to explore gene expression, transcriptional regulation, and post-transcriptional in this species. In this study, four transcriptome, small RNA, and degradome libraries were sequenced by Illumina sequencing, respectively. A total of 258 genes and 11 miRNAs were identified for drought-responsive genes and miRNAs in P. fortunei. Degradome sequencing detected 28 miRNA target genes that were cleaved by members of nine conserved miRNA families and 12 novel miRNAs. The results here will contribute toward enriching our understanding of the response of Paulownia fortunei trees to drought stress and may provide new direction for further experimental studies related the development of molecular markers, the genetic map construction, and other genomic research projects in Paulownia.
Johnson Andrew D
Full Text Available Abstract Background The number of genome-wide association studies (GWAS is growing rapidly leading to the discovery and replication of many new disease loci. Combining results from multiple GWAS datasets may potentially strengthen previous conclusions and suggest new disease loci, pathways or pleiotropic genes. However, no database or centralized resource currently exists that contains anywhere near the full scope of GWAS results. Methods We collected available results from 118 GWAS articles into a database of 56,411 significant SNP-phenotype associations and accompanying information, making this database freely available here. In doing so, we met and describe here a number of challenges to creating an open access database of GWAS results. Through preliminary analyses and characterization of available GWAS, we demonstrate the potential to gain new insights by querying a database across GWAS. Results Using a genomic bin-based density analysis to search for highly associated regions of the genome, positive control loci (e.g., MHC loci were detected with high sensitivity. Likewise, an analysis of highly repeated SNPs across GWAS identified replicated loci (e.g., APOE, LPL. At the same time we identified novel, highly suggestive loci for a variety of traits that did not meet genome-wide significant thresholds in prior analyses, in some cases with strong support from the primary medical genetics literature (SLC16A7, CSMD1, OAS1, suggesting these genes merit further study. Additional adjustment for linkage disequilibrium within most regions with a high density of GWAS associations did not materially alter our findings. Having a centralized database with standardized gene annotation also allowed us to examine the representation of functional gene categories (gene ontologies containing one or more associations among top GWAS results. Genes relating to cell adhesion functions were highly over-represented among significant associations (p -14, a finding
Matheus Sanitá Lima
Full Text Available Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb, indicating that most of the organelle DNA—coding and noncoding—is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells.
Sanitá Lima, Matheus; Smith, David Roy
Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq) data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb), indicating that most of the organelle DNA-coding and noncoding-is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb) and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells. Copyright © 2017 Sanitá Lima and Smith.
A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...
Fall, Tove; Ingelsson, Erik
Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
M. Ilyas Kamboh
Full Text Available Background. The persistent presence of antiphospholipid antibodies (APA may lead to the development of primary or secondary antiphospholipid syndrome. Although the genetic basis of APA has been suggested, the identity of the underlying genes is largely unknown. In this study, we have performed a genome-wide association study (GWAS in an effort to identify susceptibility loci/genes for three main APA: anticardiolipin antibodies (ACL, lupus anticoagulant (LAC, and anti-β2 glycoprotein I antibodies (anti-β2GPI. Methods. DNA samples were genotyped using the Affymetrix 6.0 array containing 906,600 single-nucleotide polymorphisms (SNPs. Association of SNPs with the antibody status (positive/negative was tested using logistic regression under the additive model. Results. We have identified a number of suggestive novel loci with P
Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers
Full Text Available The aim of this study was to identify the evidence of recent selection based on estimation of the integrated Haplotype Score (iHS, population differentiation index (FST and characterize affected regions near QTL associated with traits under strong selection in Pinzgau cattle. In total 21 Austrian and 19 Slovak purebreed bulls genotyped with Illumina bovineHD and bovineSNP50 BeadChip were used to identify genomic regions under selection. Only autosomal loci with call rate higher than 90%, minor allele frequency higher than 0.01 and Hardy-Weinberg equlibrium limit of 0.001 were included in the subsequent analyses of selection sweeps presence. The final dataset was consisted from 30538 SNPs with 81.86 kb average adjacent SNPs spacing. The iHS score were averaged into non-overlapping 500 kb segments across the genome. The FST values were also plotted against genome position based on sliding windows approach and averaged over 8 consecutive SNPs. Based on integrated Haplotype Score evaluation only 7 regions with iHS score higher than 1.7 was found. The average iHS score observed for each adjacent syntenic regions indicated slight effect of recent selection in analysed group of Pinzgau bulls. The level of genetic differentiation between Austrian and Slovak bulls estimated based on FST index was low. Only 24% of FST values calculated for each SNP was greather than 0.01. By using sliding windows approach was found that 5% of analysed windows had higher value than 0.01. Our results indicated use of similar selection scheme in breeding programs of Slovak and Austrian Pinzgau bulls. The evidence for genome-wide association between signatures of selection and regions affecting complex traits such as milk production was insignificant, because the loci in segments identified as affected by selection were very distant from each other. Identification of genomic regions that may be under pressure of selection for phenotypic traits to better understanding of the
Full Text Available The Fungal Genome Initiative of the Broad Institute, in partnership with the Paracoccidioides research community, has recently sequenced the genome of representative isolates of this human-pathogen dimorphic fungus: Pb18 (S1, Pb03 (PS2 and Pb01. The accomplishment of future high-throughput, genome-wide, functional genomics will rely upon appropriate molecular tools and straightforward techniques to streamline the generation of stable loss-of-function phenotypes. In the past decades, RNAi has emerged as the most robust genetic technique to modulate or to suppress gene expression in diverse eukaryotes, including fungi. These molecular tools and techniques, adapted for RNAi, were up until now unavailable for P. brasiliensis.In this paper, we report Agrobacterium tumefaciens mediated transformation of yeast cells for high-throughput applications with which higher transformation frequencies of 150±24 yeast cell transformants per 1×106 viable yeast cells were obtained. Our approach is based on a bifunctional selective marker fusion protein consisted of the Streptoalloteichus hindustanus bleomycin-resistance gene (Shble and the intrinsically fluorescent monomeric protein mCherry which was codon-optimized for heterologous expression in P. brasiliensis. We also report successful GP43 gene knock-down through the expression of intron-containing hairpin RNA (ihpRNA from a Gateway-adapted cassette (cALf which was purpose-built for gene silencing in a high-throughput manner. Gp43 transcript levels were reduced by 73.1±22.9% with this approach.We have a firm conviction that the genetic transformation technique and the molecular tools herein described will have a relevant contribution in future Paracoccidioides spp. functional genomics research.
Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T
Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Behura, Susanta K; Severson, David W
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Balestrini, Raffaella; Sillo, Fabiano; Kohler, Annegret; Schneider, Georg; Faccio, Antonella; Tisserant, Emilie; Martin, Francis; Bonfante, Paola
A genome-wide inventory of proteins involved in cell wall synthesis and remodeling has been obtained by taking advantage of the recently released genome sequence of the ectomycorrhizal Tuber melanosporum black truffle. Genes that encode cell wall biosynthetic enzymes, enzymes involved in cell wall polysaccharide synthesis or modification, GPI-anchored proteins and other cell wall proteins were identified in the black truffle genome. As a second step, array data were validated and the symbiotic stage was chosen as the main focus. Quantitative RT-PCR experiments were performed on 29 selected genes to verify their expression during ectomycorrhizal formation. The results confirmed the array data, and this suggests that cell wall-related genes are required for morphogenetic transition from mycelium growth to the ectomycorrhizal branched hyphae. Labeling experiments were also performed on T. melanosporum mycelium and ectomycorrhizae to localize cell wall components.
Leister, Dario; Varotto, Claudio
The profiling of mRNA expression based on DNA arrays has become a powerful tool to study genome-wide transcription of genes in a number of organisms. GST-PRIME is a software package created to facilitate large-scale primer design for the amplification of probes to be immobilized on arrays for transcriptome analyses, even though it can be also applied in low-throughput approaches. GST-PRIME allows highly efficient, direct amplification of gene-sequence tags (GSTs) from genomic DNA (gDNA), starting from annotated genome or transcript sequences. GST-PRIME provides a customer-friendly platform for automatic primer design, and despite the relative simplicity of the algorithm, experimental tests in the model plant species Arabidopsis thaliana confirmed the reliability of the software. This chapter describes the algorithm used for primer design, its input and output files, and the installation of the standalone package and its use.
Bertram, Lars; Tanzi, Rudolph E
Genome-wide association studies (GWAS) have gained considerable momentum over the last couple of years for the identification of novel complex disease genes. In the field of Alzheimer's disease (AD), there are currently eight published and two provisionally reported GWAS, highlighting over two dozen novel potential susceptibility loci beyond the well-established APOE association. On the basis of the data available at the time of this writing, the most compelling novel GWAS signal has been observed in GAB2 (GRB2-associated binding protein 2), followed by less consistently replicated signals in galanin-like peptide (GALP), piggyBac transposable element derived 1 (PGBD1), tyrosine kinase, non-receptor 1 (TNK1). Furthermore, consistent replication has been recently announced for CLU (clusterin, also known as apolipoprotein J). Finally, there are at least three replicated loci in hitherto uncharacterized genomic intervals on chromosomes 14q32.13, 14q31.2 and 6q24.1 likely implicating the existence of novel AD genes in these regions. In this review, we will discuss the characteristics and potential relevance to pathogenesis of the outcomes of all currently available GWAS in AD. A particular emphasis will be laid on findings with independent data in favor of the original association.
Emily R Davenport
Full Text Available The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both. These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%. For example, we identified an association between a taxon known to affect obesity (genus Akkermansia and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7. Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut.
Full Text Available Abstract Background The study of genome-wide DNA methylation changes has become more accessible with the development of various array-based technologies though when studying species other than human the choice of applications are limited and not always within reach. In this study, we adapted and tested the applicability of Methylation Specific Digital Karyotyping (MSDK, a non-array based method, for the prospective analysis of epigenetic changes after perinatal nutritional modifications in a mouse model of allergic airway disease. MSDK is a sequenced based method that allows a comprehensive and unbiased methylation profiling. The method generates 21 base pairs long sequence tags derived from specific locations in the genome. The resulting tag frequencies determine in a quantitative manner the methylation level of the corresponding loci. Results Genomic DNA from whole lung was isolated and subjected to MSDK analysis using the methylation-sensitive enzyme Not I as the mapping enzyme and Nla III as the fragmenting enzyme. In a pair wise comparison of the generated mouse MSDK libraries we identified 158 loci that are significantly differentially methylated (P-value = 0.05 after perinatal dietary changes in our mouse model. Quantitative methylation specific PCR and sequence analysis of bisulfate modified genomic DNA confirmed changes in methylation at specific loci. Differences in genomic MSDK tag counts for a selected set of genes, correlated well with changes in transcription levels as measured by real-time PCR. Furthermore serial analysis of gene expression profiling demonstrated a dramatic difference in expressed transcripts in mice exposed to perinatal nutritional changes. Conclusion The genome-wide methylation survey applied in this study allowed for an unbiased methylation profiling revealing subtle changes in DNA methylation in mice maternally exposed to dietary changes in methyl-donor content. The MSDK method is applicable for mouse models
Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M
Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... for scenarios with an opposite sign of genetic and residual correlation. All multivariate analyses resulted in a higher power than univariate analyses, even when only one of the traits was associated with the QTL. Hence, use of multivariate GWAS methods can be recommended, even when genetic correlations between...
Full Text Available Genome-wide association studies (GWAS use high-throughput genotyping technology to relate hundreds of thousands of genetic markers (genotypes to clinical conditions and measurable traits (phenotypes. This review is intended to serve as an introduction to GWAS for clinicians, to allow them to better appreciate the value and limitations of GWAS for genotype-disease association studies. The input of clinicians is vital for GWAS, since disease heterogeneity is frequently a confounding factor that can only really be solved by clinicians. For diseases that are difficult to diagnose, clinicians should ensure that the cases do indeed have the disease; for common diseases, clinicians should ensure that the controls are truly disease-free.
Genetic studies have identified >60 loci associated with the risk of developing type 1 diabetes (T1D). The vast majority of these are identified by genome-wide association studies (GWAS) using large case-control cohorts of European ancestry. More than 80% of the heritability of T1D can be explained...... by GWAS data in this population group. However, with few exceptions, their individual contribution to T1D risk is low and understanding their function in disease biology remains a huge challenge. GWAS on its own does not inform us in detail on disease mechanisms, but the combination of GWAS data...... with other omics-data is beginning to advance our understanding of T1D etiology and pathogenesis. Current knowledge supports the notion that genetic variation in both pancreatic β cells and in immune cells is central in mediating T1D risk. Advances, perspectives and limitations of GWAS are discussed...
Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D
Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been...
Full Text Available Rice growth is severely affected by toxic concentrations of the nonessential heavy metal cadmium (Cd. To elucidate the molecular basis of the response to Cd stress, we performed mRNA sequencing of rice following our previous study on exposure to high concentrations of Cd (Oono et al., 2014. In this study, rice plants were hydroponically treated with low concentrations of Cd and approximately 211 million sequence reads were mapped onto the IRGSP-1.0 reference rice genome sequence. Many genes, including some identified under high Cd concentration exposure in our previous study, were found to be responsive to low Cd exposure, with an average of about 11,000 transcripts from each condition. However, genes expressed constitutively across the developmental course responded only slightly to low Cd concentrations, in contrast to their clear response to high Cd concentration, which causes fatal damage to rice seedlings according to phenotypic changes. The expression of metal ion transporter genes tended to correlate with Cd concentration, suggesting the potential of the RNA-Seq strategy to reveal novel Cd-responsive transporters by analyzing gene expression under different Cd concentrations. This study could help to develop novel strategies for improving tolerance to Cd exposure in rice and other cereal crops.
Full Text Available Metals are major contaminants that influence human health. Many metals have physiologic roles, but excessive levels can be harmful. Advances in technology have made toxicogenomic analyses possible to characterize the effects of metal exposure on the entire genome. Much of what is known about cellular responses to metals has come from mammalian systems; however the use of non-mammalian species is gaining wider attention. Caenorhabditis elegans (C. elegans is a small round worm whose genome has been fully sequenced and its development from egg to adult is well characterized. It is an attractive model for high throughput screens due to its short lifespan, ease of genetic mutability, low cost and high homology with humans. Research performed in C. elegans has led to insights in apoptosis, gene expression and neurodegeneration, all of which can be altered by metal exposure. Additionally, by using worms one can potentially study how the mechanisms that underline differential responses to metals in nematodes and humans, allowing for identification of novel pathways and therapeutic targets. In this review, toxicogenomic studies performed in C. elegans exposed to various metals will be discussed, highlighting how this non-mammalian system can be utilized to study cellular processes and pathways induced by metals. Recent work focusing on neurodegeneration in Parkinson’s disease will be discussed as an example of the usefulness of genetic screens in C. elegans and the novel findings that can be produced.
Wexler, Eric M; Rosen, Ezra; Lu, Daning; Osborn, Gregory E; Martin, Elizabeth; Raybould, Helen; Geschwind, Daniel H
Wnt proteins are critical to mammalian brain development and function. The canonical Wnt signaling pathway involves the stabilization and nuclear translocation of β-catenin; however, Wnt also signals through alternative, noncanonical pathways. To gain a systems-level, genome-wide view of Wnt signaling, we analyzed Wnt1-stimulated changes in gene expression by transcriptional microarray analysis in cultured human neural progenitor (hNP) cells at multiple time points over a 72-hour time course. We observed a widespread oscillatory-like pattern of changes in gene expression, involving components of both the canonical and the noncanonical Wnt signaling pathways. A higher-order, systems-level analysis that combined independent component analysis, waveform analysis, and mutual information-based network construction revealed effects on pathways related to cell death and neurodegenerative disease. Wnt effectors were tightly clustered with presenilin1 (PSEN1) and granulin (GRN), which cause dominantly inherited forms of Alzheimer's disease and frontotemporal dementia (FTD), respectively. We further explored a potential link between Wnt1 and GRN and found that Wnt1 decreased GRN expression by hNPs. Conversely, GRN knockdown increased WNT1 expression, demonstrating that Wnt and GRN reciprocally regulate each other. Finally, we provided in vivo validation of the in vitro findings by analyzing gene expression data from individuals with FTD. These unbiased and genome-wide analyses provide evidence for a connection between Wnt signaling and the transcriptional regulation of neurodegenerative disease genes.
Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; van der Laan, Sander W.; Perry, John R. B.; Kong, Augustine; Ahluwalia, Tarunveer; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; van der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Panagiota, Kalafati Ioanna; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathy; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tonjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Daniela, Toniolo; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M.; de Geus, Eco JC.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Felicita, Sala Cinzia; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; de Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hypponen, Elina; Jacobsson, Bo; Jaddoe, Vincent W. V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Michela, Traglia; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda WJH; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tönjes, Anke; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.
The genetic architecture of human reproductive behavior – age at first birth (AFB) and number of children ever born (NEB) – has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified and the underlying mechanisms of AFB and NEB are poorly understood. We report the largest genome-wide association study to date of both sexes including 251,151 individuals for AFB and 343,072 for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study, and four additional loci in a gene-based effort. These loci harbor genes that are likely to play a role – either directly or by affecting non-local gene expression – in human reproduction and infertility, thereby increasing our understanding of these complex traits. PMID:27798627
Wu, Yili; Duan, Haiping; Tian, Xiaocao
Previous genome-wide association studies on anthropometric measurements have identified more than 100 related loci, but only a small portion of heritability in obesity was explained. Here we present a bivariate twin study to look for the genetic variants associated with body mass index and waist......-hip ratio, and to explore the obesity-related pathways in Northern Han Chinese. Cholesky decompositionmodel for 242monozygotic and 140 dizygotic twin pairs indicated a moderate genetic correlation (r = 0.53, 95%CI: 0.42–0.64) between body mass index and waist-hip ratio. Bivariate genome-wide association.......05. Expression quantitative trait loci analysis identified rs2242044 as a significant cis-eQTL in both the normal adipose-subcutaneous (P = 1.7 × 10−9) and adipose-visceral (P = 4.4 × 10−15) tissue. These findings may provide an important entry point to unravel genetic pleiotropy in obesity traits....
Okbay, Aysu; P. Beauchamp, Jonathan; Alan Fontana, Mark
-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural......Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals1. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends...... development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals...
Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.
The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...
Zhu, Haidong; Wang, Xiaoling; Shi, Huidong; Su, Shaoyong; Harshfield, Gregory A.; Gutin, Bernard; Snieder, Harold; Dong, Yanbin
Objectives To test the hypothesis that changes in DNA methylation are involved in vitamin D deficiency-related immune cell regulation using an unbiased genome-wide approach combined with a genomic and epigenomic integrative approach. Study design We performed a genome-wide methylation scan using the
Navarro, Antonia; Yin, Ping; Monsivais, Diana; Lin, Simon M.; Du, Pan; Wei, Jian-Jun; Bulun, Serdar E.
Background Uterine leiomyomas, or fibroids, represent the most common benign tumor of the female reproductive tract. Fibroids become symptomatic in 30% of all women and up to 70% of African American women of reproductive age. Epigenetic dysregulation of individual genes has been demonstrated in leiomyoma cells; however, the in vivo genome-wide distribution of such epigenetic abnormalities remains unknown. Principal Findings We characterized and compared genome-wide DNA methylation and mRNA expression profiles in uterine leiomyoma and matched adjacent normal myometrial tissues from 18 African American women. We found 55 genes with differential promoter methylation and concominant differences in mRNA expression in uterine leiomyoma versus normal myometrium. Eighty percent of the identified genes showed an inverse relationship between DNA methylation status and mRNA expression in uterine leiomyoma tissues, and the majority of genes (62%) displayed hypermethylation associated with gene silencing. We selected three genes, the known tumor suppressors KLF11, DLEC1, and KRT19 and verified promoter hypermethylation, mRNA repression and protein expression using bisulfite sequencing, real-time PCR and western blot. Incubation of primary leiomyoma smooth muscle cells with a DNA methyltransferase inhibitor restored KLF11, DLEC1 and KRT19 mRNA levels. Conclusions These results suggest a possible functional role of promoter DNA methylation-mediated gene silencing in the pathogenesis of uterine leiomyoma in African American women. PMID:22428009
Biazzi, Elisa; Nazzicari, Nelson; Pecetti, Luciano; Brummer, E Charles; Palmonari, Alberto; Tava, Aldo; Annicchiarico, Paolo
Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits
Shasha Dennis E
Full Text Available Abstract Background Gene duplication can lead to genetic redundancy, which masks the function of mutated genes in genetic analyses. Methods to increase sensitivity in identifying genetic redundancy can improve the efficiency of reverse genetics and lend insights into the evolutionary outcomes of gene duplication. Machine learning techniques are well suited to classifying gene family members into redundant and non-redundant gene pairs in model species where sufficient genetic and genomic data is available, such as Arabidopsis thaliana, the test case used here. Results Machine learning techniques that combine multiple attributes led to a dramatic improvement in predicting genetic redundancy over single trait classifiers alone, such as BLAST E-values or expression correlation. In withholding analysis, one of the methods used here, Support Vector Machines, was two-fold more precise than single attribute classifiers, reaching a level where the majority of redundant calls were correctly labeled. Using this higher confidence in identifying redundancy, machine learning predicts that about half of all genes in Arabidopsis showed the signature of predicted redundancy with at least one but typically less than three other family members. Interestingly, a large proportion of predicted redundant gene pairs were relatively old duplications (e.g., Ks > 1, suggesting that redundancy is stable over long evolutionary periods. Conclusions Machine learning predicts that most genes will have a functionally redundant paralog but will exhibit redundancy with relatively few genes within a family. The predictions and gene pair attributes for Arabidopsis provide a new resource for research in genetics and genome evolution. These techniques can now be applied to other organisms.
Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.
Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129
Hayden, Lystra P; Cho, Michael H; McDonald, Merry-Lynn N; Crapo, James D; Beaty, Terri H; Silverman, Edwin K; Hersh, Craig P
Previous studies have indicated that in adult smokers, a history of childhood pneumonia is associated with reduced lung function and chronic obstructive pulmonary disease. There have been few previous investigations using genome-wide association studies to investigate genetic predisposition to pneumonia. This study aims to identify the genetic variants associated with the development of pneumonia during childhood and over the course of the lifetime. Study subjects included current and former smokers with and without chronic obstructive pulmonary disease participating in the COPDGene Study. Pneumonia was defined by subject self-report, with childhood pneumonia categorized as having the first episode at pneumonia (843 cases, 9,091 control subjects) and lifetime pneumonia (3,766 cases, 5,659 control subjects) were performed separately in non-Hispanic whites and African Americans. Non-Hispanic white and African American populations were combined in the meta-analysis. Top genetic variants from childhood pneumonia were assessed in network analysis. No single-nucleotide polymorphisms reached genome-wide significance, although we identified potential regions of interest. In the childhood pneumonia analysis, this included variants in NGR1 (P = 6.3 × 10 -8 ), PAK6 (P = 3.3 × 10 -7 ), and near MATN1 (P = 2.8 × 10 -7 ). In the lifetime pneumonia analysis, this included variants in LOC339862 (P = 8.7 × 10 -7 ), RAPGEF2 (P = 8.4 × 10 -7 ), PHACTR1 (P = 6.1 × 10 -7 ), near PRR27 (P = 4.3 × 10 -7 ), and near MCPH1 (P = 2.7 × 10 -7 ). Network analysis of the genes associated with childhood pneumonia included top networks related to development, blood vessel morphogenesis, muscle contraction, WNT signaling, DNA damage, apoptosis, inflammation, and immune response (P ≤ 0.05). We have identified genes potentially associated with the risk of pneumonia. Further research will be required to confirm these
Full Text Available Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed.We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality and EDAR (associated with hair thickness were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9 were associated with pre-weaning gain in our previous genome-wide association study.Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.
Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin
Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.
Full Text Available The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000-1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs. We estimate that the Roma harbor about 80% West Eurasian ancestry-derived from a combination of European and South Asian sources-and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe.
Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.
Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi
Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after split from A. thaliana. Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa species have undergone stronger negative selection than those in B .oleracea species. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome
Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
Sprung, Carl N; Yang, Yuqing; Forrester, Helen B; Li, Jason; Zaitseva, Marina; Cann, Leonie; Restall, Tina; Anderson, Robin L; Crosbie, Jeffrey C; Rogers, Peter A W
The majority of cancer patients achieve benefit from radiotherapy. A significant limitation of radiotherapy is its relatively low therapeutic index, defined as the maximum radiation dose that causes acceptable normal tissue damage to the minimum dose required to achieve tumor control. Recently, a new radiotherapy modality using synchrotron-generated X-ray microbeam radiotherapy has been demonstrated in animal models to ablate tumors with concurrent sparing of normal tissue. Very little work has been undertaken into the cellular and molecular mechanisms that differentiate microbeam radiotherapy from broad beam. The purpose of this study was to investigate and compare the whole genome transcriptional response of in vivo microbeam radiotherapy versus broad beam irradiated tumors. We hypothesized that gene expression changes after microbeam radiotherapy are different from those seen after broad beam. We found that in EMT6.5 tumors at 4-48 h postirradiation, microbeam radiotherapy differentially regulates a number of genes, including major histocompatibility complex (MHC) class II antigen gene family members, and other immunity-related genes including Ciita, Ifng, Cxcl1, Cxcl9, Indo and Ubd when compared to broad beam. Our findings demonstrate molecular differences in the tumor response to microbeam versus broad beam irradiation and these differences provide insight into the underlying mechanisms of microbeam radiotherapy and broad beam.
Full Text Available The emergence and re-emergence of plant pathogenic microorganisms are processes that imply perturbations in both host and pathogen ecological niches. Global change is largely assumed to drive the emergence of new etiological agents by altering the equilibrium of the ecological habitats which in turn places hosts more in contact with pathogen reservoirs. In this context, the number of epidemics is expected to increase dramatically in the next coming decades both in wild and crop plants. Under these considerations, the identification of the genetic variants underlying natural variation of resistance is a pre-requisite to estimate the adaptive potential of wild plant populations and to develop new breeding resistant cultivars. On the other hand, the prediction of pathogen's genetic determinants underlying disease emergence can help to identify plant resistance alleles. In the genomic era, whole genome sequencing combined with the development of statistical methods led to the emergence of Genome Wide Association (GWA mapping, a powerful tool for detecting genomic regions associated with natural variation of disease resistance in both wild and cultivated plants. However, GWA mapping has been less employed for the detection of genetic variants associated with pathogenicity in microbes. Here, we reviewed GWA studies performed either in plants or in pathogenic microorganisms (bacteria, fungi and oomycetes. In addition, we highlighted the benefits and caveats of the emerging joint GWA mapping approach that allows for the simultaneous identification of genes interacting between genomes of both partners. Finally, based on co-evolutionary processes in wild populations, we highlighted a phenotyping-free joint GWA mapping approach as a promising tool for describing the molecular landscape underlying plant - microbe interactions.
Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan
WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.
Have, Christian Theil; Mørk, Søren
We introduce a new type of probabilistic sequence model, that model the sequential composition of reading frames of genes in a genome. Our approach extends gene finders with a model of the sequential composition of genes at the genome-level -- effectively producing a sequential genome annotation...... as output. The model can be used to obtain the most probable genome annotation based on a combination of i: a gene finder score of each gene candidate and ii: the sequence of the reading frames of gene candidates through a genome. The model --- as well as a higher order variant --- is developed and tested...... and are evaluated by the effect on prediction performance. Since bacterial gene finding to a large extent is a solved problem it forms an ideal proving ground for evaluating the explicit modeling of larger scale gene sequence composition of genomes. We conclude that the sequential composition of gene reading frames...
Aaron L. Statham
Full Text Available DNA methylation and nucleosome positioning are two key mechanisms that contribute to the epigenetic control of gene expression. During carcinogenesis, the expression of many genes is altered alongside extensive changes in the epigenome, with repressed genes often being associated with local DNA hypermethylation and gain of nucleosomes at their promoters. However the spectrum of alterations that occur at distal regulatory regions has not been extensively studied. To address this we used Nucleosome Occupancy and Methylation sequencing (NOMe-seq to compare the genome-wide DNA methylation and nucleosome occupancy profiles between normal and cancer cell line models of the breast and prostate. Here we describe the bioinformatic pipeline and methods that we developed for the processing and analysis of the NOMe-seq data published by (Taberlay et al., 2014  and deposited in the Gene Expression Omnibus with accession GSE57498.
Jolly Emmitt R
Full Text Available Abstract Background A major challenge in computational genomics is the development of methodologies that allow accurate genome-wide prediction of the regulatory targets of a transcription factor. We present a method for target identification that combines experimental characterization of binding requirements with computational genomic analysis. Results Our method identified potential target genes of the transcription factor Ndt80, a key transcriptional regulator involved in yeast sporulation, using the combined information of binding affinity, positional distribution, and conservation of the binding sites across multiple species. We have also developed a mathematical approach to compute the false positive rate and the total number of targets in the genome based on the multiple selection criteria. Conclusion We have shown that combining biochemical characterization and computational genomic analysis leads to accurate identification of the genome-wide targets of a transcription factor. The method can be extended to other transcription factors and can complement other genomic approaches to transcriptional regulation.
Muslihudeen Abdul-Razaq Abdul-Aziz
Full Text Available As our understanding of the human microbiome expands, impacts on health and disease continue to be revealed. Alterations in the microbiome can result in dysbiosis, which has now been linked to subsequent autoimmune and metabolic diseases, highlighting the need to identify factors that shape the microbiome. Research has identified that the composition and functions of the human microbiome can be influenced by diet, age, gender, and environment. More recently, studies have explored how human genetic variation may also influence the microbiome. Here, we review several recent analytical advances in this new research area, including those that use genome-wide association studies to examine host genome-microbiome interactions, while controlling for the influence of other factors. We find that current research is limited by small sample sizes, lack of cohort replication, and insufficient confirmatory mechanistic studies. In addition, we discuss the importance of understanding long-term interactions between the host genome and microbiome, as well as the potential impacts of disrupting this relationship, and explore new research avenues that may provide information about the co-evolutionary history of humans and their microorganisms.
Full Text Available Abstract Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee and Nasonia vitripennis (jewel wasp analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice
Hsiung, Chris C-S; Morrissey, Christapher S; Udugama, Maheshi; Frank, Christopher L; Keller, Cheryl A; Baek, Songjoon; Giardine, Belinda; Crawford, Gregory E; Sung, Myong-Hee; Hardison, Ross C; Blobel, Gerd A
Mitosis entails global alterations to chromosome structure and nuclear architecture, concomitant with transient silencing of transcription. How cells transmit transcriptional states through mitosis remains incompletely understood. While many nuclear factors dissociate from mitotic chromosomes, the observation that certain nuclear factors and chromatin features remain associated with individual loci during mitosis originated the hypothesis that such mitotically retained molecular signatures could provide transcriptional memory through mitosis. To understand the role of chromatin structure in mitotic memory, we performed the first genome-wide comparison of DNase I sensitivity of chromatin in mitosis and interphase, using a murine erythroblast model. Despite chromosome condensation during mitosis visible by microscopy, the landscape of chromatin accessibility at the macromolecular level is largely unaltered. However, mitotic chromatin accessibility is locally dynamic, with individual loci maintaining none, some, or all of their interphase accessibility. Mitotic reduction in accessibility occurs primarily within narrow, highly DNase hypersensitive sites that frequently coincide with transcription factor binding sites, whereas broader domains of moderate accessibility tend to be more stable. In mitosis, proximal promoters generally maintain their accessibility more strongly, whereas distal regulatory elements tend to lose accessibility. Large domains of DNA hypomethylation mark a subset of promoters that retain accessibility during mitosis and across many cell types in interphase. Erythroid transcription factor GATA1 exerts site-specific changes in interphase accessibility that are most pronounced at distal regulatory elements, but has little influence on mitotic accessibility. We conclude that features of open chromatin are remarkably stable through mitosis, but are modulated at the level of individual genes and regulatory elements. © 2015 Hsiung et al.; Published by
Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.
Full Text Available DNA methylation plays a central role in regulating many aspects of growth and development in mammals through regulating gene expression. The development of next generation sequencing technologies have paved the way for genome-wide, high resolution analysis of DNA methylation landscapes using methodology known as reduced representation bisulfite sequencing (RRBS. While RRBS has proven to be effective in understanding DNA methylation landscapes in humans, mice, and rats, to date, few studies have utilised this powerful method for investigating DNA methylation in agricultural animals. Here we describe the utilisation of RRBS to investigate DNA methylation in sheep Longissimus dorsi muscles. RRBS analysis of ∼1% of the genome from Longissimus dorsi muscles provided data of suitably high precision and accuracy for DNA methylation analysis, at all levels of resolution from genome-wide to individual nucleotides. Combining RRBS data with mRNAseq data allowed the sheep Longissimus dorsi muscle methylome to be compared with methylomes from other species. While some species differences were identified, many similarities were observed between DNA methylation patterns in sheep and other more commonly studied species. The RRBS data presented here highlights the complexity of epigenetic regulation of genes. However, the similarities observed across species are promising, in that knowledge gained from epigenetic studies in human and mice may be applied, with caution, to agricultural species. The ability to accurately measure DNA methylation in agricultural animals will contribute an additional layer of information to the genetic analyses currently being used to maximise production gains in these species.
Huang, Yen-Tsung; Hsu, Thomas; Christiani, David C
The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of X (2) distributions that can be obtained using permutation with scaled X (2) approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed a genome-wide copy number data from 264 patients of non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (number data, and causal mechanisms of the five pathways require further study.
Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T
Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural. © 2014 Wiley Periodicals, Inc.
Down Thomas A
Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.
Zhang, Hao; Shaffer, John R.; Hansen, Thomas; Esserlind, Ann-Louise; Boyd, Heather A.; Nohr, Ellen A.; Timpson, Nicholas J.; Fatemifar, Ghazaleh; Paternoster, Lavinia; Evans, David M.; Weyant, Robert J.; Levy, Steven M.; Lathrop, Mark; Smith, George Davey; Murray, Jeffrey C.; Olesen, Jes; Werge, Thomas; Marazita, Mary L.; Sørensen, Thorkild I. A.; Melbye, Mads
The sequence and timing of permanent tooth eruption is thought to be highly heritable and can have important implications for the risk of malocclusion, crowding, and periodontal disease. We conducted a genome-wide association study of number of permanent teeth erupted between age 6 and 14 years, analyzed as age-adjusted standard deviation score averaged over multiple time points, based on childhood records for 5,104 women from the Danish National Birth Cohort. Four loci showed association at Peruption and were also known to influence height and breast cancer, respectively. The two other loci pointed to genomic regions without any previous significant genome-wide association study results. The intronic SNP rs7924176 in ADK could be linked to gene expression in monocytes. The combined effect of the four genetic variants was most pronounced between age 10 and 12 years, where children with 6 to 8 delayed tooth eruption alleles had on average 3.5 (95% confidence interval: 2.9–4.1) fewer permanent teeth than children with 0 or 1 of these alleles. PMID:21931568
Wen, Zixiang; Boyse, John F; Song, Qijian; Cregan, Perry B; Wang, Dechun
Crop improvement always involves selection of specific alleles at genes controlling traits of agronomic importance, likely resulting in detectable signatures of selection within the genome of modern soybean (Glycine max L. Merr.). The identification of these signatures of selection is meaningful from the perspective of evolutionary biology and for uncovering the genetic architecture of agronomic traits. To this end, two populations of soybean, consisting of 342 landraces and 1062 improved lines, were genotyped with the SoySNP50K Illumina BeadChip containing 52,041 single nucleotide polymorphisms (SNPs), and systematically phenotyped for 9 agronomic traits. A cross-population composite likelihood ratio (XP-CLR) method was used to screen the signals of selective sweeps. A total of 125 candidate selection regions were identified, many of which harbored genes potentially involved in crop improvement. To further investigate whether these candidate regions were in fact enriched for genes affected by selection, genome-wide association studies (GWAS) were conducted on 7 selection traits targeted in soybean breeding (grain yield, plant height, lodging, maturity date, seed coat color, seed protein and oil content) and 2 non-selection traits (pubescence and flower color). Major genomic regions associated with selection traits overlapped with candidate selection regions, whereas no overlap of this kind occurred for the non-selection traits, suggesting that the selection sweeps identified are associated with traits of agronomic importance. Multiple novel loci and refined map locations of known loci related to these traits were also identified. These findings illustrate that comparative genomic analyses, especially when combined with GWAS, are a promising approach to dissect the genetic architecture of complex traits.
Chiu, Yu-Chiao; Wang, Li-Ju; Hsiao, Tzu-Hung; Chuang, Eric Y; Chen, Yidong
With the advances in high-throughput gene profiling technologies, a large volume of gene interaction maps has been constructed. A higher-level layer of gene-gene interaction, namely modulate gene interaction, is composed of gene pairs of which interaction strengths are modulated by (i.e., dependent on) the expression level of a key modulator gene. Systematic investigations into the modulation by estrogen receptor (ER), the best-known modulator gene, have revealed the functional and prognostic significance in breast cancer. However, a genome-wide identification of key modulator genes that may further unveil the landscape of modulated gene interaction is still lacking. We proposed a systematic workflow to screen for key modulators based on genome-wide gene expression profiles. We designed four modularity parameters to measure the ability of a putative modulator to perturb gene interaction networks. Applying the method to a dataset of 286 breast tumors, we comprehensively characterized the modularity parameters and identified a total of 973 key modulator genes. The modularity of these modulators was verified in three independent breast cancer datasets. ESR1, the encoding gene of ER, appeared in the list, and abundant novel modulators were illuminated. For instance, a prognostic predictor of breast cancer, SFRP1, was found the second modulator. Functional annotation analysis of the 973 modulators revealed involvements in ER-related cellular processes as well as immune- and tumor-associated functions. Here we present, as far as we know, the first comprehensive analysis of key modulator genes on a genome-wide scale. The validity of filtering parameters as well as the conservativity of modulators among cohorts were corroborated. Our data bring new insights into the modulated layer of gene-gene interaction and provide candidates for further biological investigations.
Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu
Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS' and controls' granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS' and controls' granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls'. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology.
Full Text Available Mycobacterium avium subspecies paratuberculosis (M. ap, the causative agent of Johne’s disease (JD, infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium strains isolated from various hosts and environments. Using Next-generation sequencing technology, all 6 M. ap isolates showed a high percentage of homology (98% to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77 showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels were not detected, and only unique single nucleotide polymorphisms (SNPs were observed among the 6 M. ap strains. While most of the SNPs (~100 in M. ap genomes were non-synonymous, a total of ~ 6000 SNPs were detected among M. avium genomes, most of them were synonymous suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNPs-based phylo-genomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10 strain while the human isolate (M. ap 4B is closely related to the environmental strains, indicating environmental source to human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.
Full Text Available Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become "immortal" by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15
Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang
Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
Sasayama, Daimei; Hattori, Kotaro; Ogawa, Shintaro; Yokota, Yuuki; Matsumura, Ryo; Teraishi, Toshiya; Hori, Hiroaki; Ota, Miho; Yoshida, Sumiko; Kunugi, Hiroshi
Cerebrospinal fluid (CSF) is virtually the only one accessible source of proteins derived from the central nervous system (CNS) of living humans and possibly reflects the pathophysiology of a variety of neuropsychiatric diseases. However, little is known regarding the genetic basis of variation in protein levels of human CSF. We examined CSF levels of 1,126 proteins in 133 subjects and performed a genome-wide association analysis of 514,227 single nucleotide polymorphisms (SNPs) to detect protein quantitative trait loci (pQTLs). To be conservative, Spearman's correlation was used to identify an association between genotypes of SNPs and protein levels. A total of 421 cis and 25 trans SNP-protein pairs were significantly correlated at a false discovery rate (FDR) of less than 0.01 (nominal P genome-wide association studies. The present findings suggest that genetic variations play an important role in the regulation of protein expression in the CNS. The obtained database may serve as a valuable resource to understand the genetic bases for CNS protein expression pattern in humans. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Iqbal, Javeed; Joshi, Shantaram; Patel, Kavita N
of Lymphoid Neoplasms (REAL) and World Health Organization (WHO) classifications. These classification methods were based on histological, immunophenotypic and cytogenetic markers and widely accepted by pathologists and oncologists worldwide. During last several decades, great progress has been made...... technology. The genome-wide transcriptional measurement, also called gene expression profile (GEP) can accurately define the biological phenotype of the tumor. In this review, important discoveries made by genome-wide GEP in understanding the biology of lymphoma and additionally the diagnostic and prognostic...
Zhao, Qian; Ma, Dongna; Huang, Yuping; He, Weiyi; Li, Yiying; Vasseur, Liette; You, Minsheng
Transcription factors (TFs), which play a vital role in regulating gene expression, are prevalent in all organisms and characterization of them may provide important clues for understanding regulation in vivo. The present study reports a genome-wide investigation of TFs in the diamondback moth, Plutella xylostella (L.), a worldwide pest of crucifers. A total of 940 TFs distributed among 133 families were identified. Phylogenetic analysis of insect species showed that some of these families were found to have expanded during the evolution of P. xylostella or Lepidoptera. RNA-seq analysis showed that some of the TF families, such as zinc fingers, homeobox, bZIP, bHLH, and MADF_DNA_bdg genes, were highly expressed in certain tissues including midgut, salivary glands, fat body, and hemocytes, with an obvious sex-biased expression pattern. In addition, a number of TFs showed significant differences in expression between insecticide susceptible and resistant strains, suggesting that these TFs play a role in regulating genes related to insecticide resistance. Finally, we identified an expansion of the HOX cluster in Lepidoptera, which might be related to Lepidoptera-specific evolution. Knockout of this cluster using CRISPR/Cas9 showed that the egg cannot hatch, indicating that this cluster may be related to egg development and maturation. This is the first comprehensive study on identifying and characterizing TFs in P. xylostella. Our results suggest that some TF families are expanded in the P. xylostella genome, and these TFs may have important biological roles in growth, development, sexual dimorphism, and resistance to insecticides. The present work provides a solid foundation for understanding regulation via TFs in P. xylostella and insights into the evolution of the P. xylostella genome.
Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G
Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC...
Minster, Ryan L; Sanders, Jason L; Singh, Jatinder
BACKGROUND: The Healthy Aging Index (HAI) is a tool for measuring the extent of health and disease across multiple systems. METHODS: We conducted a genome-wide association study and a genome-wide linkage analysis to map quantitative trait loci associated with the HAI and a modified HAI weighted...
Sud, Amit; Thomsen, Hauke; Law, Philip J.
Several susceptibility loci for classical Hodgkin lymphoma have been reported. However, much of the heritable risk is unknown. Here, we perform a meta-analysis of two existing genome-wide association studies, a new genome-wide association study, and replication totalling 5,314 cases and 16,749 co...
Sud, A. (Amit); Thomsen, H. (Hauke); Law, P.J. (Philip J.); A. Försti (Asta); Filho, M.I.D.S. (Miguel Inacio Da Silva); Holroyd, A. (Amy); P. Broderick (Peter); Orlando, G. (Giulia); Lenive, O. (Oleg); Wright, L. (Lauren); R. Cooke (Rosie); D.F. Easton (Douglas); P.D.P. Pharoah (Paul); A.M. Dunning (Alison); J. Peto (Julian); F. Canzian (Federico); Eeles, R. (Rosalind); Z. Kote-Jarai; K.R. Muir (K.); Pashayan, N. (Nora); B.E. Henderson (Brian); C.A. Haiman (Christopher); S. Benlloch (Sara); F.R. Schumacher (Fredrick R); Olama, A.A.A. (Ali Amin Al); S.I. Berndt (Sonja); G. Conti (Giario); F. Wiklund (Fredrik); S.J. Chanock (Stephen); Stevens, V.L. (Victoria L.); C.M. Tangen (Catherine M.); Batra, J. (Jyotsna); Clements, J. (Judith); H. Grönberg (Henrik); Schleutker, J. (Johanna); D. Albanes (Demetrius); Weinstein, S. (Stephanie); K. Wolk (Kerstin); West, C. (Catharine); Mucci, L. (Lorelei); Cancel-Tassin, G. (Géraldine); Koutros, S. (Stella); Sorensen, K.D. (Karina Dalsgaard); L. Maehle; D. Neal (David); S.P.L. Travis (Simon); Hamilton, R.J. (Robert J.); S.A. Ingles (Sue); B.S. Rosenstein (Barry S.); Lu, Y.-J. (Yong-Jie); Giles, G.G. (Graham G.); A. Kibel (Adam); Vega, A. (Ana); M. Kogevinas (Manolis); Penney, K.L. (Kathryn L.); Park, J.Y. (Jong Y.); Stanford, J.L. (Janet L.); C. Cybulski (Cezary); B.G. Nordestgaard (Børge); Br