WorldWideScience

Sample records for gene set enrichment

  1. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  2. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  3. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample.

  4. Ranking metrics in gene set enrichment analysis: do they matter?

    Science.gov (United States)

    Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

    2017-05-12

    There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner

  5. Constellation Map: Downstream visualization and interpretation of gene set enrichment results [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Yan Tan

    2015-06-01

    Full Text Available Summary: Gene set enrichment analysis (GSEA approaches are widely used to identify coordinately regulated genes associated with phenotypes of interest. Here, we present Constellation Map, a tool to visualize and interpret the results when enrichment analyses yield a long list of significantly enriched gene sets. Constellation Map identifies commonalities that explain the enrichment of multiple top-scoring gene sets and maps the relationships between them. Constellation Map can help investigators take full advantage of GSEA and facilitates the biological interpretation of enrichment results. Availability: Constellation Map is freely available as a GenePattern module at http://www.genepattern.org.

  6. Gene set of nuclear-encoded mitochondrial regulators is enriched for common inherited variation in obesity.

    Directory of Open Access Journals (Sweden)

    Nadja Knoll

    Full Text Available There are hints of an altered mitochondrial function in obesity. Nuclear-encoded genes are relevant for mitochondrial function (3 gene sets of known relevant pathways: (1 16 nuclear regulators of mitochondrial genes, (2 91 genes for oxidative phosphorylation and (3 966 nuclear-encoded mitochondrial genes. Gene set enrichment analysis (GSEA showed no association with type 2 diabetes mellitus in these gene sets. Here we performed a GSEA for the same gene sets for obesity. Genome wide association study (GWAS data from a case-control approach on 453 extremely obese children and adolescents and 435 lean adult controls were used for GSEA. For independent confirmation, we analyzed 705 obesity GWAS trios (extremely obese child and both biological parents and a population-based GWAS sample (KORA F4, n = 1,743. A meta-analysis was performed on all three samples. In each sample, the distribution of significance levels between the respective gene set and those of all genes was compared using the leading-edge-fraction-comparison test (cut-offs between the 50(th and 95(th percentile of the set of all gene-wise corrected p-values as implemented in the MAGENTA software. In the case-control sample, significant enrichment of associations with obesity was observed above the 50(th percentile for the set of the 16 nuclear regulators of mitochondrial genes (p(GSEA,50 = 0.0103. This finding was not confirmed in the trios (p(GSEA,50 = 0.5991, but in KORA (p(GSEA,50 = 0.0398. The meta-analysis again indicated a trend for enrichment (p(MAGENTA,50 = 0.1052, p(MAGENTA,75 = 0.0251. The GSEA revealed that weak association signals for obesity might be enriched in the gene set of 16 nuclear regulators of mitochondrial genes.

  7. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  8. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  9. Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2015-12-01

    Full Text Available Background: The presence of diverse types of nanomaterials (NMs in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2, carbon black (CB or carbon nanotubes (CNTs to determine the disease significance of these data-driven gene sets.Results: Biclusters representing inflammation (chemokine activity, DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032. The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles.Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

  10. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Full Text Available Abstract Background The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.

  11. Gene-Based Analysis of Regionally Enriched Cortical Genes in GWAS Data Sets of Cognitive Traits and Psychiatric Disorders

    DEFF Research Database (Denmark)

    Ersland, Kari M; Christoforou, Andrea; Stansberg, Christine

    2012-01-01

    the regionally enriched cortical genes to mine a genome-wide association study (GWAS) of the Norwegian Cognitive NeuroGenetics (NCNG) sample of healthy adults for association to nine psychometric tests measures. In addition, we explored GWAS data sets for the serious psychiatric disorders schizophrenia (SCZ) (n...

  12. NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

    Science.gov (United States)

    Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun

    2017-09-21

    High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of the high-throughput experimental results, such as differential expression gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in the functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make users difficult to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen achieved a superior performance than the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when accessed to the curated literatures. NetGen has been implemented in the R package CopTea publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.

  13. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    experimental gene sets, both for different global networks and for different types of interactions. Using examples of thyroid cancer and apoptosis networks, we have shown that the links over-represented in the analyzed network in comparison with the random ones make possible a biological interpretation of the original gene/protein sets. The FunGeneNet web tool for assessment of the functional enrichment of networks is available at http://www-bionet.sscc.ru/fungenenet/ .

  14. A cross-study gene set enrichment analysis identifies critical pathways in endometriosis

    Directory of Open Access Journals (Sweden)

    Bai Chunyan

    2009-09-01

    Full Text Available Abstract Background Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.

  15. Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome

    Directory of Open Access Journals (Sweden)

    Gaora Peadar Ó

    2010-10-01

    Full Text Available Abstract Background Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Results Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p Conclusion Bi-directional gene set enrichment analysis more accurately reflects dynamic regulatory behaviour in biochemical pathways, and as such highlighted biologically relevant changes that were not detected using a traditional approach. In such cases where transcriptomic response to treatment is exceptionally large, canonical correlation analysis in conjunction with Fisher's exact test highlights the subset of pathways showing strongest correlation with the clinical markers of interest. In this case, we have identified selenoamino acid metabolism and steroid biosynthesis as key pathways mediating the observed relationship between metabolic health and high-CLA beef. These results indicate that this type of

  16. Integrative set enrichment testing for multiple omics platforms

    Directory of Open Access Journals (Sweden)

    Poisson Laila M

    2011-11-01

    Full Text Available Abstract Background Enrichment testing assesses the overall evidence of differential expression behavior of the elements within a defined set. When we have measured many molecular aspects, e.g. gene expression, metabolites, proteins, it is desirable to assess their differential tendencies jointly across platforms using an integrated set enrichment test. In this work we explore the properties of several methods for performing a combined enrichment test using gene expression and metabolomics as the motivating platforms. Results Using two simulation models we explored the properties of several enrichment methods including two novel methods: the logistic regression 2-degree of freedom Wald test and the 2-dimensional permutation p-value for the sum-of-squared statistics test. In relation to their univariate counterparts we find that the joint tests can improve our ability to detect results that are marginal univariately. We also find that joint tests improve the ranking of associated pathways compared to their univariate counterparts. However, there is a risk of Type I error inflation with some methods and self-contained methods lose specificity when the sets are not representative of underlying association. Conclusions In this work we show that consideration of data from multiple platforms, in conjunction with summarization via a priori pathway information, leads to increased power in detection of genomic associations with phenotypes.

  17. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

    Science.gov (United States)

    Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

    2013-04-15

    System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

  18. Tracking difference in gene expression in a time-course experiment using gene set enrichment analysis.

    Directory of Open Access Journals (Sweden)

    Pui Shan Wong

    Full Text Available Fistulifera sp. strain JPCC DA0580 is a newly sequenced pennate diatom that is capable of simultaneously growing and accumulating lipids. This is a unique trait, not found in other related microalgae so far. It is able to accumulate between 40 to 60% of its cell weight in lipids, making it a strong candidate for the production of biofuel. To investigate this characteristic, we used RNA-Seq data gathered at four different times while Fistulifera sp. strain JPCC DA0580 was grown in oil accumulating and non-oil accumulating conditions. We then adapted gene set enrichment analysis (GSEA to investigate the relationship between the difference in gene expression of 7,822 genes and metabolic functions in our data. We utilized information in the KEGG pathway database to create the gene sets and changed GSEA to use re-sampling so that data from the different time points could be included in the analysis. Our GSEA method identified photosynthesis, lipid synthesis and amino acid synthesis related pathways as processes that play a significant role in oil production and growth in Fistulifera sp. strain JPCC DA0580. In addition to GSEA, we visualized the results by creating a network of compounds and reactions, and plotted the expression data on top of the network. This made existing graph algorithms available to us which we then used to calculate a path that metabolizes glucose into triacylglycerol (TAG in the smallest number of steps. By visualizing the data this way, we observed a separate up-regulation of genes at different times instead of a concerted response. We also identified two metabolic paths that used less reactions than the one shown in KEGG and showed that the reactions were up-regulated during the experiment. The combination of analysis and visualization methods successfully analyzed time-course data, identified important metabolic pathways and provided new hypotheses for further research.

  19. The Molecular Signatures Database (MSigDB) hallmark gene set collection.

    Science.gov (United States)

    Liberzon, Arthur; Birger, Chet; Thorvaldsdóttir, Helga; Ghandi, Mahmoud; Mesirov, Jill P; Tamayo, Pablo

    2015-12-23

    The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.

  20. Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods

    Science.gov (United States)

    Väremo, Leif; Nielsen, Jens; Nookaew, Intawat

    2013-01-01

    Gene set analysis (GSA) is used to elucidate genome-wide data, in particular transcriptome data. A multitude of methods have been proposed for this step of the analysis, and many of them have been compared and evaluated. Unfortunately, there is no consolidated opinion regarding what methods should be preferred, and the variety of available GSA software and implementations pose a difficulty for the end-user who wants to try out different methods. To address this, we have developed the R package Piano that collects a range of GSA methods into the same system, for the benefit of the end-user. Further on we refine the GSA workflow by using modifications of the gene-level statistics. This enables us to divide the resulting gene set P-values into three classes, describing different aspects of gene expression directionality at gene set level. We use our fully implemented workflow to investigate the impact of the individual components of GSA by using microarray and RNA-seq data. The results show that the evaluated methods are globally similar and the major separation correlates well with our defined directionality classes. As a consequence of this, we suggest to use a consensus scoring approach, based on multiple GSA runs. In combination with the directionality classes, this constitutes a more thorough basis for an enriched biological interpretation. PMID:23444143

  1. Knowledge Enrichment Analysis for Human Tissue- Specific Genes Uncover New Biological Insights

    Directory of Open Access Journals (Sweden)

    Gong Xiu-Jun

    2012-06-01

    Full Text Available The expression and regulation of genes in different tissues are fundamental questions to be answered in biology. Knowledge enrichment analysis for tissue specific (TS and housekeeping (HK genes may help identify their roles in biological process or diseases and gain new biological insights.In this paper, we performed the knowledge enrichment analysis for 17,343 genes in 84 human tissues using Gene Set Enrichment Analysis (GSEA and Hypergeometric Analysis (HA against three biological ontologies: Gene Ontology (GO, KEGG pathways and Disease Ontology (DO respectively.The analyses results demonstrated that the functions of most gene groups are consistent with their tissue origins. Meanwhile three interesting new associations for HK genes and the skeletal muscle tissuegenes are found. Firstly, Hypergeometric analysis against KEGG database for HK genes disclosed that three disease terms (Parkinson’s disease, Huntington’s disease, Alzheimer’s disease are intensively enriched.Secondly, Hypergeometric analysis against the KEGG database for Skeletal Muscle tissue genes shows that two cardiac diseases of “Hypertrophic cardiomyopathy (HCM” and “Arrhythmogenic right ventricular cardiomyopathy (ARVC” are heavily enriched, which are also considered as no relationship with skeletal functions.Thirdly, “Prostate cancer” is intensively enriched in Hypergeometric analysis against the disease ontology (DO for the Skeletal Muscle tissue genes, which is a much unexpected phenomenon.

  2. Enrichment of putative PAX8 target genes at serous epithelial ovarian cancer susceptibility loci

    DEFF Research Database (Denmark)

    Kar, Siddhartha P; Adler, Emily; Tyrer, Jonathan

    2017-01-01

    BACKGROUND: Genome-wide association studies (GWAS) have identified 18 loci associated with serous ovarian cancer (SOC) susceptibility but the biological mechanisms driving these findings remain poorly characterised. Germline cancer risk loci may be enriched for target genes of transcription factors...... (TFs) critical to somatic tumorigenesis. METHODS: All 615 TF-target sets from the Molecular Signatures Database were evaluated using gene set enrichment analysis (GSEA) and three GWAS for SOC risk: discovery (2196 cases/4396 controls), replication (7035 cases/21 693 controls; independent from discovery...... to interact with PAX8 in the literature to the PAX8-target set and applying an alternative to GSEA, interval enrichment, further confirmed this association (P=0.006). Fifteen of the 157 genes from this expanded PAX8 pathway were near eight loci associated with SOC risk at P

  3. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

    Directory of Open Access Journals (Sweden)

    Steinfeld Israel

    2009-02-01

    Full Text Available Abstract Background Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results. Results GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression. GOrilla employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms. Conclusion GOrilla is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at: http://cbl-gorilla.cs.technion.ac.il

  4. Statistical assessment of crosstalk enrichment between gene groups in biological networks.

    Science.gov (United States)

    McCormack, Theodore; Frings, Oliver; Alexeyenko, Andrey; Sonnhammer, Erik L L

    2013-01-01

    Analyzing groups of functionally coupled genes or proteins in the context of global interaction networks has become an important aspect of bioinformatic investigations. Assessing the statistical significance of crosstalk enrichment between or within groups of genes can be a valuable tool for functional annotation of experimental gene sets. Here we present CrossTalkZ, a statistical method and software to assess the significance of crosstalk enrichment between pairs of gene or protein groups in large biological networks. We demonstrate that the standard z-score is generally an appropriate and unbiased statistic. We further evaluate the ability of four different methods to reliably recover crosstalk within known biological pathways. We conclude that the methods preserving the second-order topological network properties perform best. Finally, we show how CrossTalkZ can be used to annotate experimental gene sets using known pathway annotations and that its performance at this task is superior to gene enrichment analysis (GEA). CrossTalkZ (available at http://sonnhammer.sbc.su.se/download/software/CrossTalkZ/) is implemented in C++, easy to use, fast, accepts various input file formats, and produces a number of statistics. These include z-score, p-value, false discovery rate, and a test of normality for the null distributions.

  5. Genes misregulated in C. elegans deficient in Dicer, RDE-4, or RDE-1 are enriched for innate immunity genes.

    Science.gov (United States)

    Welker, Noah C; Habig, Jeffrey W; Bass, Brenda L

    2007-07-01

    We describe the first microarray analysis of a whole animal containing a mutation in the Dicer gene. We used adult Caenorhabditis elegans and, to distinguish among different roles of Dicer, we also performed microarray analyses of animals with mutations in rde-4 and rde-1, which are involved in silencing by siRNA, but not miRNA. Surprisingly, we find that the X chromosome is greatly enriched for genes regulated by Dicer. Comparison of all three microarray data sets indicates the majority of Dicer-regulated genes are not dependent on RDE-4 or RDE-1, including the X-linked genes. However, all three data sets are enriched in genes important for innate immunity and, specifically, show increased expression of innate immunity genes.

  6. Gene Ontology and KEGG Enrichment Analyses of Genes Related to Age-Related Macular Degeneration

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-01-01

    Full Text Available Identifying disease genes is one of the most important topics in biomedicine and may facilitate studies on the mechanisms underlying disease. Age-related macular degeneration (AMD is a serious eye disease; it typically affects older adults and results in a loss of vision due to retina damage. In this study, we attempt to develop an effective method for distinguishing AMD-related genes. Gene ontology and KEGG enrichment analyses of known AMD-related genes were performed, and a classification system was established. In detail, each gene was encoded into a vector by extracting enrichment scores of the gene set, including it and its direct neighbors in STRING, and gene ontology terms or KEGG pathways. Then certain feature-selection methods, including minimum redundancy maximum relevance and incremental feature selection, were adopted to extract key features for the classification system. As a result, 720 GO terms and 11 KEGG pathways were deemed the most important factors for predicting AMD-related genes.

  7. An Independent Filter for Gene Set Testing Based on Spectral Enrichment

    NARCIS (Netherlands)

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in

  8. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  9. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  10. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    Science.gov (United States)

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional

  11. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  12. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  13. Gene set analysis using variance component tests.

    Science.gov (United States)

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  14. Transcriptional profiles of supragranular-enriched genes associate with corticocortical network architecture in the human brain.

    Science.gov (United States)

    Krienen, Fenna M; Yeo, B T Thomas; Ge, Tian; Buckner, Randy L; Sherwood, Chet C

    2016-01-26

    The human brain is patterned with disproportionately large, distributed cerebral networks that connect multiple association zones in the frontal, temporal, and parietal lobes. The expansion of the cortical surface, along with the emergence of long-range connectivity networks, may be reflected in changes to the underlying molecular architecture. Using the Allen Institute's human brain transcriptional atlas, we demonstrate that genes particularly enriched in supragranular layers of the human cerebral cortex relative to mouse distinguish major cortical classes. The topography of transcriptional expression reflects large-scale brain network organization consistent with estimates from functional connectivity MRI and anatomical tracing in nonhuman primates. Microarray expression data for genes preferentially expressed in human upper layers (II/III), but enriched only in lower layers (V/VI) of mouse, were cross-correlated to identify molecular profiles across the cerebral cortex of postmortem human brains (n = 6). Unimodal sensory and motor zones have similar molecular profiles, despite being distributed across the cortical mantle. Sensory/motor profiles were anticorrelated with paralimbic and certain distributed association network profiles. Tests of alternative gene sets did not consistently distinguish sensory and motor regions from paralimbic and association regions: (i) genes enriched in supragranular layers in both humans and mice, (ii) genes cortically enriched in humans relative to nonhuman primates, (iii) genes related to connectivity in rodents, (iv) genes associated with human and mouse connectivity, and (v) 1,454 gene sets curated from known gene ontologies. Molecular innovations of upper cortical layers may be an important component in the evolution of long-range corticocortical projections.

  15. The Schizophrenia-Associated BRD1 Gene Regulates Behavior, Neurotransmission, and Expression of Schizophrenia Risk Enriched Gene Sets in Mice.

    Science.gov (United States)

    Qvist, Per; Christensen, Jane Hvarregaard; Vardya, Irina; Rajkumar, Anto Praveen; Mørk, Arne; Paternoster, Veerle; Füchtbauer, Ernst-Martin; Pallesen, Jonatan; Fryland, Tue; Dyrvig, Mads; Hauberg, Mads Engel; Lundsberg, Birgitte; Fejgin, Kim; Nyegaard, Mette; Jensen, Kimmo; Nyengaard, Jens Randel; Mors, Ole; Didriksen, Michael; Børglum, Anders Dupont

    2017-07-01

    The schizophrenia-associated BRD1 gene encodes a transcriptional regulator whose comprehensive chromatin interactome is enriched with schizophrenia risk genes. However, the biology underlying the disease association of BRD1 remains speculative. This study assessed the transcriptional drive of a schizophrenia-associated BRD1 risk variant in vitro. Accordingly, to examine the effects of reduced Brd1 expression, we generated a genetically modified Brd1 +/- mouse and subjected it to behavioral, electrophysiological, molecular, and integrative genomic analyses with focus on schizophrenia-relevant parameters. Brd1 +/- mice displayed cerebral histone H3K14 hypoacetylation and a broad range of behavioral changes with translational relevance to schizophrenia. These behaviors were accompanied by striatal dopamine/serotonin abnormalities and cortical excitation-inhibition imbalances involving loss of parvalbumin immunoreactive interneurons. RNA-sequencing analyses of cortical and striatal micropunches from Brd1 +/- and wild-type mice revealed differential expression of genes enriched for schizophrenia risk, including several schizophrenia genome-wide association study risk genes (e.g., calcium channel subunits [Cacna1c and Cacnb2], cholinergic muscarinic receptor 4 [Chrm4)], dopamine receptor D 2 [Drd2], and transcription factor 4 [Tcf4]). Integrative analyses further found differentially expressed genes to cluster in functional networks and canonical pathways associated with mental illness and molecular signaling processes (e.g., glutamatergic, monoaminergic, calcium, cyclic adenosine monophosphate [cAMP], dopamine- and cAMP-regulated neuronal phosphoprotein 32 kDa [DARPP-32], and cAMP responsive element binding protein signaling [CREB]). Our study bridges the gap between genetic association and pathogenic effects and yields novel insights into the unfolding molecular changes in the brain of a new schizophrenia model that incorporates genetic risk at three levels: allelic

  16. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    Science.gov (United States)

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.

  17. Pathway profiles based on gene-set enrichment analysis in the honey bee Apis mellifera under brood rearing-suppressed conditions.

    Science.gov (United States)

    Kim, Kyungmun; Kim, Ju Hyeon; Kim, Young Ho; Hong, Seong-Eui; Lee, Si Hyeock

    2018-01-01

    Perturbation of normal behaviors in honey bee colonies by any external factor can immediately reduce the colony's capacity for brood rearing, which can eventually lead to colony collapse. To investigate the effects of brood-rearing suppression on the biology of honey bee workers, gene-set enrichment analysis of the transcriptomes of worker bees with or without suppressed brood rearing was performed. When brood rearing was suppressed, pathways associated with both protein degradation and synthesis were simultaneously over-represented in both nurses and foragers, and their overall pathway representation profiles resembled those of normal foragers and nurses, respectively. Thus, obstruction of normal labor induced over-representation in pathways related with reshaping of worker bee physiology, suggesting that transition of labor is physiologically reversible. In addition, some genes associated with the regulation of neuronal excitability, cellular and nutritional stress and aggressiveness were over-expressed under brood rearing suppression perhaps to manage in-hive stress under unfavorable conditions. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. Integrative analysis of survival-associated gene sets in breast cancer.

    Science.gov (United States)

    Varn, Frederick S; Ung, Matthew H; Lou, Shao Ke; Cheng, Chao

    2015-03-12

    Patient gene expression information has recently become a clinical feature used to evaluate breast cancer prognosis. The emergence of prognostic gene sets that take advantage of these data has led to a rich library of information that can be used to characterize the molecular nature of a patient's cancer. Identifying robust gene sets that are consistently predictive of a patient's clinical outcome has become one of the main challenges in the field. We inputted our previously established BASE algorithm with patient gene expression data and gene sets from MSigDB to develop the gene set activity score (GSAS), a metric that quantitatively assesses a gene set's activity level in a given patient. We utilized this metric, along with patient time-to-event data, to perform survival analyses to identify the gene sets that were significantly correlated with patient survival. We then performed cross-dataset analyses to identify robust prognostic gene sets and to classify patients by metastasis status. Additionally, we created a gene set network based on component gene overlap to explore the relationship between gene sets derived from MSigDB. We developed a novel gene set based on this network's topology and applied the GSAS metric to characterize its role in patient survival. Using the GSAS metric, we identified 120 gene sets that were significantly associated with patient survival in all datasets tested. The gene overlap network analysis yielded a novel gene set enriched in genes shared by the robustly predictive gene sets. This gene set was highly correlated to patient survival when used alone. Most interestingly, removal of the genes in this gene set from the gene pool on MSigDB resulted in a large reduction in the number of predictive gene sets, suggesting a prominent role for these genes in breast cancer progression. The GSAS metric provided a useful medium by which we systematically investigated how gene sets from MSigDB relate to breast cancer patient survival. We used

  19. Using OWL reasoning to support the generation of novel gene sets for enrichment analysis.

    Science.gov (United States)

    Osumi-Sutherland, David J; Ponta, Enrico; Courtot, Melanie; Parkinson, Helen; Badi, Laura

    2018-02-14

    The Gene Ontology (GO) consists of over 40,000 terms for biological processes, cell components and gene product activities linked into a graph structure by over 90,000 relationships. It has been used to annotate the functions and cellular locations of several million gene products. The graph structure is used by a variety of tools to group annotated genes into sets whose products share function or location. These gene sets are widely used to interpret the results of genomics experiments by assessing which sets are significantly over- or under-represented in results lists. F Hoffmann-La Roche Ltd. has developed a bespoke, manually maintained controlled vocabulary (RCV) for use in over-representation analysis. Many terms in this vocabulary group GO terms in novel ways that cannot easily be derived using the graph structure of the GO. For example, some RCV terms group GO terms by the cell, chemical or tissue type they refer to. Recent improvements in the content and formal structure of the GO make it possible to use logical queries in Web Ontology Language (OWL) to automatically map these cross-cutting classifications to sets of GO terms. We used this approach to automate mapping between RCV and GO, largely replacing the increasingly unsustainable manual mapping process. We then tested the utility of the resulting groupings for over-representation analysis. We successfully mapped 85% of RCV terms to logical OWL definitions and showed that these could be used to recapitulate and extend manual mappings between RCV terms and the sets of GO terms subsumed by them. We also show that gene sets derived from the resulting GO terms sets can be used to detect the signatures of cell and tissue types in whole genome expression data. The rich formal structure of the GO makes it possible to use reasoning to dynamically generate novel, biologically relevant groupings of GO terms. GO term groupings generated with this approach can be used in. over-representation analysis to detect

  20. Gene-ontology enrichment analysis in two independent family-based samples highlights biologically plausible processes for autism spectrum disorders.

    LENUS (Irish Health Repository)

    Anney, Richard J L

    2012-02-01

    Recent genome-wide association studies (GWAS) have implicated a range of genes from discrete biological pathways in the aetiology of autism. However, despite the strong influence of genetic factors, association studies have yet to identify statistically robust, replicated major effect genes or SNPs. We apply the principle of the SNP ratio test methodology described by O\\'Dushlaine et al to over 2100 families from the Autism Genome Project (AGP). Using a two-stage design we examine association enrichment in 5955 unique gene-ontology classifications across four groupings based on two phenotypic and two ancestral classifications. Based on estimates from simulation we identify excess of association enrichment across all analyses. We observe enrichment in association for sets of genes involved in diverse biological processes, including pyruvate metabolism, transcription factor activation, cell-signalling and cell-cycle regulation. Both genes and processes that show enrichment have previously been examined in autistic disorders and offer biologically plausibility to these findings.

  1. DNMT1 is associated with cell cycle and DNA replication gene sets in diffuse large B-cell lymphoma.

    Science.gov (United States)

    Loo, Suet Kee; Ab Hamid, Suzina Sheikh; Musa, Mustaffa; Wong, Kah Keng

    2018-01-01

    Dysregulation of DNA (cytosine-5)-methyltransferase 1 (DNMT1) is associated with the pathogenesis of various types of cancer. It has been previously shown that DNMT1 is frequently expressed in diffuse large B-cell lymphoma (DLBCL), however its functions remain to be elucidated in the disease. In this study, we gene expression profiled (GEP) shRNA targeting DNMT1(shDNMT1)-treated germinal center B-cell-like DLBCL (GCB-DLBCL)-derived cell line (i.e. HT) compared with non-silencing shRNA (control shRNA)-treated HT cells. Independent gene set enrichment analysis (GSEA) performed using GEPs of shRNA-treated HT cells and primary GCB-DLBCL cases derived from two publicly-available datasets (i.e. GSE10846 and GSE31312) produced three separate lists of enriched gene sets for each gene sets collection from Molecular Signatures Database (MSigDB). Subsequent Venn analysis identified 268, 145 and six consensus gene sets from analyzing gene sets in C2 collection (curated gene sets), C5 sub-collection [gene sets from gene ontology (GO) biological process ontology] and Hallmark collection, respectively to be enriched in positive correlation with DNMT1 expression profiles in shRNA-treated HT cells, GSE10846 and GSE31312 datasets [false discovery rate (FDR) 0.8) with DNMT1 expression and significantly downregulated (log fold-change <-1.35; p<0.05) following DNMT1 silencing in HT cells. These results suggest the involvement of DNMT1 in the activation of cell cycle and DNA replication in DLBCL cells. Copyright © 2017 Elsevier GmbH. All rights reserved.

  2. Nociceptor-Enriched Genes Required for Normal Thermal Nociception

    Directory of Open Access Journals (Sweden)

    Ken Honjo

    2016-07-01

    Full Text Available Here, we describe a targeted reverse genetic screen for thermal nociception genes in Drosophila larvae. Using laser capture microdissection and microarray analyses of nociceptive and non-nociceptive neurons, we identified 275 nociceptor-enriched genes. We then tested the function of the enriched genes with nociceptor-specific RNAi and thermal nociception assays. Tissue-specific RNAi targeted against 14 genes caused insensitive thermal nociception while targeting of 22 genes caused hypersensitive thermal nociception. Previously uncategorized genes were named for heat resistance (i.e., boilerman, fire dancer, oven mitt, trivet, thawb, and bunker gear or heat sensitivity (firelighter, black match, eucalyptus, primacord, jet fuel, detonator, gasoline, smoke alarm, and jetboil. Insensitive nociception phenotypes were often associated with severely reduced branching of nociceptor neurites and hyperbranched dendrites were seen in two of the hypersensitive cases. Many genes that we identified are conserved in mammals.

  3. Optimal set of selected uranium enrichments that minimizes blending consequences

    International Nuclear Information System (INIS)

    Nachlas, J.A.; Kurstedt, H.A. Jr.; Lobber, J.S. Jr.

    1977-01-01

    Identities, quantities, and costs associated with producing a set of selected enrichments and blending them to provide fuel for existing reactors are investigated using an optimization model constructed with appropriate constraints. Selected enrichments are required for either nuclear reactor fuel standardization or potential uranium enrichment alternatives such as the gas centrifuge. Using a mixed-integer linear program, the model minimizes present worth costs for a 39-product-enrichment reference case. For four ingredients, the marginal blending cost is only 0.18% of the total direct production cost. Natural uranium is not an optimal blending ingredient. Optimal values reappear in most sets of ingredient enrichments

  4. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  5. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  6. Literature mining, gene-set enrichment and pathway analysis for target identification in Behçet's disease.

    Science.gov (United States)

    Wilson, Paul; Larminie, Christopher; Smith, Rona

    2016-01-01

    To use literature mining to catalogue Behçet's associated genes, and advanced computational methods to improve the understanding of the pathways and signalling mechanisms that lead to the typical clinical characteristics of Behçet's patients. To extend this technique to identify potential treatment targets for further experimental validation. Text mining methods combined with gene enrichment tools, pathway analysis and causal analysis algorithms. This approach identified 247 human genes associated with Behçet's disease and the resulting disease map, comprising 644 nodes and 19220 edges, captured important details of the relationships between these genes and their associated pathways, as described in diverse data repositories. Pathway analysis has identified how Behçet's associated genes are likely to participate in innate and adaptive immune responses. Causal analysis algorithms have identified a number of potential therapeutic strategies for further investigation. Computational methods have captured pertinent features of the prominent disease characteristics presented in Behçet's disease and have highlighted NOD2, ICOS and IL18 signalling as potential therapeutic strategies.

  7. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  8. Integrated Enrichment Analysis of Variants and Pathways in Genome-Wide Association Studies Indicates Central Role for IL-2 Signaling Genes in Type 1 Diabetes, and Cytokine Signaling Genes in Crohn's Disease

    Science.gov (United States)

    Carbonetto, Peter; Stephens, Matthew

    2013-01-01

    Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn's disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and “Measles” pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14

  9. Uniform approximation is more appropriate for Wilcoxon Rank-Sum Test in gene set analysis.

    Directory of Open Access Journals (Sweden)

    Zhide Fang

    Full Text Available Gene set analysis is widely used to facilitate biological interpretations in the analyses of differential expression from high throughput profiling data. Wilcoxon Rank-Sum (WRS test is one of the commonly used methods in gene set enrichment analysis. It compares the ranks of genes in a gene set against those of genes outside the gene set. This method is easy to implement and it eliminates the dichotomization of genes into significant and non-significant in a competitive hypothesis testing. Due to the large number of genes being examined, it is impractical to calculate the exact null distribution for the WRS test. Therefore, the normal distribution is commonly used as an approximation. However, as we demonstrate in this paper, the normal approximation is problematic when a gene set with relative small number of genes is tested against the large number of genes in the complementary set. In this situation, a uniform approximation is substantially more powerful, more accurate, and less intensive in computation. We demonstrate the advantage of the uniform approximations in Gene Ontology (GO term analysis using simulations and real data sets.

  10. Redundancy control in pathway databases (ReCiPa): an application for improving gene-set enrichment analysis in Omics studies and "Big data" biology.

    Science.gov (United States)

    Vivar, Juan C; Pemu, Priscilla; McPherson, Ruth; Ghosh, Sujoy

    2013-08-01

    Abstract Unparalleled technological advances have fueled an explosive growth in the scope and scale of biological data and have propelled life sciences into the realm of "Big Data" that cannot be managed or analyzed by conventional approaches. Big Data in the life sciences are driven primarily via a diverse collection of 'omics'-based technologies, including genomics, proteomics, metabolomics, transcriptomics, metagenomics, and lipidomics. Gene-set enrichment analysis is a powerful approach for interrogating large 'omics' datasets, leading to the identification of biological mechanisms associated with observed outcomes. While several factors influence the results from such analysis, the impact from the contents of pathway databases is often under-appreciated. Pathway databases often contain variously named pathways that overlap with one another to varying degrees. Ignoring such redundancies during pathway analysis can lead to the designation of several pathways as being significant due to high content-similarity, rather than truly independent biological mechanisms. Statistically, such dependencies also result in correlated p values and overdispersion, leading to biased results. We investigated the level of redundancies in multiple pathway databases and observed large discrepancies in the nature and extent of pathway overlap. This prompted us to develop the application, ReCiPa (Redundancy Control in Pathway Databases), to control redundancies in pathway databases based on user-defined thresholds. Analysis of genomic and genetic datasets, using ReCiPa-generated overlap-controlled versions of KEGG and Reactome pathways, led to a reduction in redundancy among the top-scoring gene-sets and allowed for the inclusion of additional gene-sets representing possibly novel biological mechanisms. Using obesity as an example, bioinformatic analysis further demonstrated that gene-sets identified from overlap-controlled pathway databases show stronger evidence of prior association

  11. Composting-Like Conditions Are More Efficient for Enrichment and Diversity of Organisms Containing Cellulase-Encoding Genes than Submerged Cultures.

    Directory of Open Access Journals (Sweden)

    Senta Heiss-Blanquet

    Full Text Available Cost-effective biofuel production from lignocellulosic biomass depends on efficient degradation of the plant cell wall. One of the major obstacles for the development of a cost-efficient process is the lack of resistance of currently used fungal enzymes to harsh conditions such as high temperature. Adapted, thermophilic microbial communities provide a huge reservoir of potentially interesting lignocellulose-degrading enzymes for improvement of the cellulose hydrolysis step. In order to identify such enzymes, a leaf and wood chip compost was enriched on a mixture of thermo-chemically pretreated wheat straw, poplar and Miscanthus under thermophile conditions, but in two different set-ups. Unexpectedly, metagenome sequencing revealed that incubation of the lignocellulosic substrate with compost as inoculum in a suspension culture resulted in an impoverishment of putative cellulase- and hemicellulase-encoding genes. However, mimicking composting conditions without liquid phase yielded a high number and diversity of glycoside hydrolase genes and an enrichment of genes encoding cellulose binding domains. These identified genes were most closely related to species from Actinobacteria, which seem to constitute important players of lignocellulose degradation under the applied conditions. The study highlights that subtle changes in an enrichment set-up can have an important impact on composition and functions of the microcosm. Composting-like conditions were found to be the most successful method for enrichment in species with high biomass degrading capacity.

  12. Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales).

    Science.gov (United States)

    Moore, Abigail J; Vos, Jurriaan M De; Hancock, Lillian P; Goolsby, Eric; Edwards, Erika J

    2018-05-01

    Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the "portullugo" (Caryophyllales), a moderately sized lineage of flowering plants (~ 2200 species) that includes the cacti and harbors many evolutionary transitions to C$_{\\mathrm{4}}$ and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C$_{\\mathrm{4}}$ and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C$_{\\mathrm{4}}$ and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75-218 loci across 74 taxa, with ~ 50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae $+$ Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.

  13. Separate enrichment analysis of pathways for up- and downregulated genes.

    Science.gov (United States)

    Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

    2014-03-06

    Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.

  14. Diversity of reductive dehalogenase genes from environmental samples and enrichment cultures identified with degenerate primer PCR screens.

    Directory of Open Access Journals (Sweden)

    Laura Audrey Hug

    2013-11-01

    Full Text Available Reductive dehalogenases are the critical enzymes for anaerobic organohalide respiration, a microbial metabolic process that has been harnessed for bioremediation efforts to resolve chlorinated solvent contamination in groundwater and is implicated in the global halogen cycle. Reductive dehalogenase sequence diversity is informative for the dechlorination potential of the site or enrichment culture. A suite of degenerate PCR primers targeting a comprehensive curated set of reductive dehalogenase genes was designed and applied to twelve DNA samples extracted from contaminated and pristine sites, as well as six enrichment cultures capable of reducing chlorinated compounds to non-toxic end-products. The amplified gene products from four environmental sites and two enrichment cultures were sequenced using Illumina HiSeq, and the reductive dehalogenase complement of each sample determined. The results indicate that the diversity of the reductive dehalogenase gene family is much deeper than is currently accounted for: one-third of the translated proteins have less than 70% pairwise amino acid identity to database sequences. Approximately 60% of the sequenced reductive dehalogenase genes were broadly distributed, being identified in four or more samples, and often in previously sequenced genomes as well. In contrast, 17% of the sequenced reductive dehalogenases were unique, present in only a single sample and bearing less than 90% pairwise amino acid identity to any previously identified proteins. Many of the broadly distributed reductive dehalogenases are uncharacterized in terms of their substrate specificity, making these intriguing targets for further biochemical experimentation. Finally, comparison of samples from a contaminated site and an enrichment culture derived from the same site eight years prior allowed examination of the effect of the enrichment process.

  15. Mutation intolerant genes and targets of FMRP are enriched for nonsynonymous alleles in schizophrenia.

    Science.gov (United States)

    Leonenko, Ganna; Richards, Alexander L; Walters, James T; Pocklington, Andrew; Chambert, Kimberly; Al Eissa, Mariam M; Sharp, Sally I; O'Brien, Niamh L; Curtis, David; Bass, Nicholas J; McQuillin, Andrew; Hultman, Christina; Moran, Jennifer L; McCarroll, Steven A; Sklar, Pamela; Neale, Benjamin M; Holmans, Peter A; Owen, Michael J; Sullivan, Patrick F; O'Donovan, Michael C

    2017-10-01

    Risk of schizophrenia is conferred by alleles occurring across the full spectrum of frequencies from common SNPs of weak effect through to ultra rare alleles, some of which may be moderately to highly penetrant. Previous studies have suggested that some of the risk of schizophrenia is attributable to uncommon alleles represented on Illumina exome arrays. Here, we present the largest study of exomic variation in schizophrenia to date, using samples from the United Kingdom and Sweden (10,011 schizophrenia cases and 13,791 controls). Single variants, genes, and gene sets were analyzed for association with schizophrenia. No single variant or gene reached genome-wide significance. Among candidate gene sets, we found significant enrichment for rare alleles (minor allele frequency [MAF] schizophrenia by excluding a role for uncommon exomic variants (0.01 ≤ MAF ≥ 0.001) that confer a relatively large effect (odds ratio [OR] > 4). We also show risk alleles within this frequency range exist, but confer smaller effects and should be identified by larger studies. © 2017 Wiley Periodicals, Inc.

  16. The Resistome of Farmed Fish Feces Contributes to the Enrichment of Antibiotic Resistance Genes in Sediments below Baltic Sea Fish Farms.

    Science.gov (United States)

    Muziasari, Windi I; Pitkänen, Leena K; Sørum, Henning; Stedtfeld, Robert D; Tiedje, James M; Virta, Marko

    2016-01-01

    Our previous studies showed that particular antibiotic resistance genes (ARGs) were enriched locally in sediments below fish farms in the Northern Baltic Sea, Finland, even when the selection pressure from antibiotics was negligible. We assumed that a constant influx of farmed fish feces could be the plausible source of the ARGs enriched in the farm sediments. In the present study, we analyzed the composition of the antibiotic resistome from the intestinal contents of 20 fish from the Baltic Sea farms. We used a high-throughput method, WaferGen qPCR array with 364 primer sets to detect and quantify ARGs, mobile genetic elements (MGE), and the 16S rRNA gene. Despite a considerably wide selection of qPCR primer sets, only 28 genes were detected in the intestinal contents. The detected genes were ARGs encoding resistance to sulfonamide ( sul1 ), trimethoprim ( dfrA1 ), tetracycline [ tet(32), tetM, tetO, tetW ], aminoglycoside ( aadA1, aadA2 ), chloramphenicol ( catA1 ), and efflux-pumps resistance genes ( emrB, matA, mefA, msrA ). The detected genes also included class 1 integron-associated genes ( intI1, qacE Δ 1 ) and transposases ( tnpA ). Importantly, most of the detected genes were the same genes enriched in the farm sediments. This preliminary study suggests that feces from farmed fish contribute to the ARG enrichment in farm sediments despite the lack of contemporaneous antibiotic treatments at the farms. We observed that the intestinal contents of individual farmed fish had their own resistome compositions. Our result also showed that the total relative abundances of transposases and tet genes were significantly correlated ( p = 0.001, R 2 = 0.71). In addition, we analyzed the mucosal skin and gill filament resistomes of the farmed fish but only one multidrug-efflux resistance gene ( emrB ) was detected. To our knowledge, this is the first study reporting the resistome of farmed fish using a culture-independent method. Determining the possible sources of

  17. Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

    Directory of Open Access Journals (Sweden)

    Jesús Lascorz

    2011-01-01

    Full Text Available Background: A large number of gene expression profiling (GEP studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9% had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO categories or Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.

  18. Mammalian transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes and are predicted to act as transcriptional activator hubs.

    Science.gov (United States)

    Joshi, Anagha

    2014-12-30

    Transcriptional hotspots are defined as genomic regions bound by multiple factors. They have been identified recently as cell type specific enhancers regulating developmentally essential genes in many species such as worm, fly and humans. The in-depth analysis of hotspots across multiple cell types in same species still remains to be explored and can bring new biological insights. We therefore collected 108 transcription-related factor (TF) ChIP sequencing data sets in ten murine cell types and classified the peaks in each cell type in three groups according to binding occupancy as singletons (low-occupancy), combinatorials (mid-occupancy) and hotspots (high-occupancy). The peaks in the three groups clustered largely according to the occupancy, suggesting priming of genomic loci for mid occupancy irrespective of cell type. We then characterized hotspots for diverse structural functional properties. The genes neighbouring hotspots had a small overlap with hotspot genes in other cell types and were highly enriched for cell type specific function. Hotspots were enriched for sequence motifs of key TFs in that cell type and more than 90% of hotspots were occupied by pioneering factors. Though we did not find any sequence signature in the three groups, the H3K4me1 binding profile had bimodal peaks at hotspots, distinguishing hotspots from mono-modal H3K4me1 singletons. In ES cells, differentially expressed genes after perturbation of activators were enriched for hotspot genes suggesting hotspots primarily act as transcriptional activator hubs. Finally, we proposed that ES hotspots might be under control of SetDB1 and not DNMT for silencing. Transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes. In ES cells, they are predicted to act as transcriptional activator hubs and might be under SetDB1 control for silencing.

  19. Network-based functional enrichment

    Directory of Open Access Journals (Sweden)

    Poirel Christopher L

    2011-11-01

    Full Text Available Abstract Background Many methods have been developed to infer and reason about molecular interaction networks. These approaches often yield networks with hundreds or thousands of nodes and up to an order of magnitude more edges. It is often desirable to summarize the biological information in such networks. A very common approach is to use gene function enrichment analysis for this task. A major drawback of this method is that it ignores information about the edges in the network being analyzed, i.e., it treats the network simply as a set of genes. In this paper, we introduce a novel method for functional enrichment that explicitly takes network interactions into account. Results Our approach naturally generalizes Fisher’s exact test, a gene set-based technique. Given a function of interest, we compute the subgraph of the network induced by genes annotated to this function. We use the sequence of sizes of the connected components of this sub-network to estimate its connectivity. We estimate the statistical significance of the connectivity empirically by a permutation test. We present three applications of our method: i determine which functions are enriched in a given network, ii given a network and an interesting sub-network of genes within that network, determine which functions are enriched in the sub-network, and iii given two networks, determine the functions for which the connectivity improves when we merge the second network into the first. Through these applications, we show that our approach is a natural alternative to network clustering algorithms. Conclusions We presented a novel approach to functional enrichment that takes into account the pairwise relationships among genes annotated by a particular function. Each of the three applications discovers highly relevant functions. We used our methods to study biological data from three different organisms. Our results demonstrate the wide applicability of our methods. Our algorithms are

  20. Length bias correction in gene ontology enrichment analysis using logistic regression.

    Science.gov (United States)

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.

  1. Two new loci and gene sets related to sex determination and cancer progression are associated with susceptibility to testicular germ cell tumor.

    Science.gov (United States)

    Kristiansen, Wenche; Karlsson, Robert; Rounge, Trine B; Whitington, Thomas; Andreassen, Bettina K; Magnusson, Patrik K; Fosså, Sophie D; Adami, Hans-Olov; Turnbull, Clare; Haugen, Trine B; Grotmol, Tom; Wiklund, Fredrik

    2015-07-15

    Genome-wide association (GWA) studies have reported 19 distinct susceptibility loci for testicular germ cell tumor (TGCT). A GWA study for TGCT was performed by genotyping 610 240 single-nucleotide polymorphisms (SNPs) in 1326 cases and 6687 controls from Sweden and Norway. No novel genome-wide significant associations were observed in this discovery stage. We put forward 27 SNPs from 15 novel regions and 12 SNPs previously reported, for replication in 710 case-parent triads and 289 cases and 290 controls. Predefined biological pathways and processes, in addition to a custom-built sex-determination gene set, were subject to enrichment analyses using Meta-Analysis Gene Set Enrichment of Variant Associations (M) and Improved Gene Set Enrichment Analysis for Genome-wide Association Study (I). In the combined meta-analysis, we observed genome-wide significant association for rs7501939 on chromosome 17q12 (OR = 0.78, 95% CI = 0.72-0.84, P = 1.1 × 10(-9)) and rs2195987 on chromosome 19p12 (OR = 0.76, 95% CI: 0.69-0.84, P = 3.2 × 10(-8)). The marker rs7501939 on chromosome 17q12 is located in an intron of the HNF1B gene, encoding a member of the homeodomain-containing superfamily of transcription factors. The sex-determination gene set (false discovery rate, FDRM cancer and apoptosis, was associated with TGCT (FDR utero are implicated in the development of TGCT. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  2. Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2006-01-01

    Full Text Available Background: Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results: We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM with those in the probeset-level model (PSLM. Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many

  3. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

     Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...

  4. GOMA: functional enrichment analysis tool based on GO modules

    Institute of Scientific and Technical Information of China (English)

    Qiang Huang; Ling-Yun Wu; Yong Wang; Xiang-Sun Zhang

    2013-01-01

    Analyzing the function of gene sets is a critical step in interpreting the results of high-throughput experiments in systems biology.A variety of enrichment analysis tools have been developed in recent years,but most output a long list of significantly enriched terms that are often redundant,making it difficult to extract the most meaningful functions.In this paper,we present GOMA,a novel enrichment analysis method based on the new concept of enriched functional Gene Ontology (GO) modules.With this method,we systematically revealed functional GO modules,i.e.,groups of functionally similar GO terms,via an optimization model and then ranked them by enrichment scores.Our new method simplifies enrichment analysis results by reducing redundancy,thereby preventing inconsistent enrichment results among functionally similar terms and providing more biologically meaningful results.

  5. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  6. Benchmarking methods and data sets for ligand enrichment assessment in virtual screening.

    Science.gov (United States)

    Xia, Jie; Tilahun, Ermias Lemma; Reid, Terry-Elinor; Zhang, Liangren; Wang, Xiang Simon

    2015-01-01

    Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate ligand enrichments of VS approaches in the prospective (i.e. real-world) efforts. However, the intrinsic differences of benchmarking sets to the real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods as well as data sets and highlight three main types of biases found in benchmarking sets, i.e. "analogue bias", "artificial enrichment" and "false negative". In addition, we introduce our recent algorithm to build maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its implementations to three important human histone deacetylases (HDACs) isoforms, i.e. HDAC1, HDAC6 and HDAC8. The leave-one-out cross-validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased as measured by property matching, ROC curves and AUCs. Copyright © 2014 Elsevier Inc. All rights reserved.

  7. Determining Semantically Related Significant Genes.

    Science.gov (United States)

    Taha, Kamal

    2014-01-01

    GO relation embodies some aspects of existence dependency. If GO term xis existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years for analyzing gene sets enrichment. However, most of these methods overlook the structural dependencies between GO terms in GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term xcannot be existence-dependent on GO term y, if x- and y- have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiment to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.

  8. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    Full Text Available A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C 3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X , using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X , in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level, and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R . We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions. This property is true for all cardinalities of X motifs (from 4 to 20 and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non- X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  9. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first

  10. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  11. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    Science.gov (United States)

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  12. Principles for the organization of gene-sets.

    Science.gov (United States)

    Li, Wentian; Freudenberg, Jan; Oswald, Michaela

    2015-12-01

    A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. GeneTopics - interpretation of gene sets via literature-driven topic models

    Science.gov (United States)

    2013-01-01

    Background Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly

  14. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  15. Digital gene expression profiling of flax (Linum usitatissimum L.) stem peel identifies genes enriched in fiber-bearing phloem tissue.

    Science.gov (United States)

    Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu

    2017-08-30

    To better understand the molecular mechanisms and gene expression characteristics associated with development of bast fiber cell within flax stem phloem, the gene expression profiling of flax stem peels and leaves were screened, using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leave, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) genes have protein-coding annotation, were identified as phloem enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differential expression genes (DEGs) was validated using quantitative RT-PCR, the expression pattern of all nine genes determined by qRT-PCR fitted in well with that obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to metabolic process, catalytic activity and binding category were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Pathway enrichment analysis approach based on topological structure and updated annotation of pathway.

    Science.gov (United States)

    Yang, Qian; Wang, Shuyuan; Dai, Enyu; Zhou, Shunheng; Liu, Dianming; Liu, Haizhou; Meng, Qianqian; Jiang, Bin; Jiang, Wei

    2017-08-16

    Pathway enrichment analysis has been widely used to identify cancer risk pathways, and contributes to elucidating the mechanism of tumorigenesis. However, most of the existing approaches use the outdated pathway information and neglect the complex gene interactions in pathway. Here, we first reviewed the existing widely used pathway enrichment analysis approaches briefly, and then, we proposed a novel topology-based pathway enrichment analysis (TPEA) method, which integrated topological properties and global upstream/downstream positions of genes in pathways. We compared TPEA with four widely used pathway enrichment analysis tools, including database for annotation, visualization and integrated discovery (DAVID), gene set enrichment analysis (GSEA), centrality-based pathway enrichment (CePa) and signaling pathway impact analysis (SPIA), through analyzing six gene expression profiles of three tumor types (colorectal cancer, thyroid cancer and endometrial cancer). As a result, we identified several well-known cancer risk pathways that could not be obtained by the existing tools, and the results of TPEA were more stable than that of the other tools in analyzing different data sets of the same cancer. Ultimately, we developed an R package to implement TPEA, which could online update KEGG pathway information and is available at the Comprehensive R Archive Network (CRAN): https://cran.r-project.org/web/packages/TPEA/. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. Differential Gene Expression Profiling of Enriched Human Spermatogonia after Short- and Long-Term Culture

    Directory of Open Access Journals (Sweden)

    Sabine Conrad

    2014-01-01

    Full Text Available This study aimed to provide a molecular signature for enriched adult human stem/progenitor spermatogonia during short-term (<2 weeks and long-term culture (up to more than 14 months in comparison to human testicular fibroblasts and human embryonic stem cells. Human spermatogonia were isolated by CD49f magnetic activated cell sorting and collagen−/laminin+ matrix binding from primary testis cultures obtained from ten adult men. For transcriptomic analysis, single spermatogonia-like cells were collected based on their morphology and dimensions using a micromanipulation system from the enriched germ cell cultures. Immunocytochemical, RT-PCR and microarray analyses revealed that the analyzed populations of cells were distinct at the molecular level. The germ- and pluripotency-associated genes and genes of differentiation/spermatogenesis pathway were highly expressed in enriched short-term cultured spermatogonia. After long-term culture, a proportion of cells retained and aggravated the “spermatogonial” gene expression profile with the expression of germ and pluripotency-associated genes, while in the majority of long-term cultured cells this molecular profile, typical for the differentiation pathway, was reduced and more genes related to the extracellular matrix production and attachment were expressed. The approach we provide here to study the molecular status of in vitro cultured spermatogonia may be important to optimize the culture conditions and to evaluate the germ cell plasticity in the future.

  19. Diversity of bacteria and glycosyl hydrolase family 48 genes in cellulolytic consortia enriched from thermophilic biocompost.

    Science.gov (United States)

    Izquierdo, Javier A; Sizova, Maria V; Lynd, Lee R

    2010-06-01

    The enrichment from nature of novel microbial communities with high cellulolytic activity is useful in the identification of novel organisms and novel functions that enhance the fundamental understanding of microbial cellulose degradation. In this work we identify predominant organisms in three cellulolytic enrichment cultures with thermophilic compost as an inoculum. Community structure based on 16S rRNA gene clone libraries featured extensive representation of clostridia from cluster III, with minor representation of clostridial clusters I and XIV and a novel Lutispora species cluster. Our studies reveal different levels of 16S rRNA gene diversity, ranging from 3 to 18 operational taxonomic units (OTUs), as well as variability in community membership across the three enrichment cultures. By comparison, glycosyl hydrolase family 48 (GHF48) diversity analyses revealed a narrower breadth of novel clostridial genes associated with cultured and uncultured cellulose degraders. The novel GHF48 genes identified in this study were related to the novel clostridia Clostridium straminisolvens and Clostridium clariflavum, with one cluster sharing as little as 73% sequence similarity with the closest known relative. In all, 14 new GHF48 gene sequences were added to the known diversity of 35 genes from cultured species.

  20. Screening Key Genes Associated with the Development and Progression of Non-small Cell Lung Cancer Based on Gene-enrichment Analysis and Meta-analysis

    Directory of Open Access Journals (Sweden)

    Wenwu HE

    2012-07-01

    Full Text Available Background and objective Non-small cell lung cancer (NSCLC is one of the most common malignant tumors; however, its causes are still not completely understood. This study was designed to screen the key genes and pathways related to NSCLC occurrence and development and to establish the scientific foundation for the genetic mechanisms and targeted therapy of NSCLC. Methods Both gene set-enrichment analysis (GSEA and meta-analysis (meta were used to screen the critical pathways and genes that might be corretacted with the development and progression of lung cancer at the transcription level. Results Using the GSEA and meta methods, focal adhesion and regulation of actin cytoskeleton were determined to be the more prominent overlapping significant pathways. In the focal adhesion pathway, 31 genes were statistically significant (P<0.05, whereas in the regulation of actin cytoskeleton pathway, 32 genes were statistically significant (P<0.05. Conclusion The focal adhesion and the regulation of actin cytoskeleton pathways might play important roles in the occurrence and development of NSCLC. Further studies are needed to determine the biological function for the positiue genes.

  1. Amygdala-enriched genes identified by microarray technology are restricted to specific amygdaloid subnuclei

    OpenAIRE

    Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.

    2001-01-01

    Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...

  2. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  3. Microbial gene functions enriched in the Deepwater Horizon deep-sea oil plume

    Energy Technology Data Exchange (ETDEWEB)

    Lu, Z.; Deng, Y.; Nostrand, J.D. Van; He, Z.; Voordeckers, J.; Zhou, A.; Lee, Y.-J.; Mason, O.U.; Dubinsky, E.; Chavarria, K.; Tom, L.; Fortney, J.; Lamendella, R.; Jansson, J.K.; D?haeseleer, P.; Hazen, T.C.; Zhou, J.

    2011-06-15

    The Deepwater Horizon oil spill in the Gulf of Mexico is the deepest and largest offshore spill in U.S. history and its impacts on marine ecosystems are largely unknown. Here, we showed that the microbial community functional composition and structure were dramatically altered in a deep-sea oil plume resulting from the spill. A variety of metabolic genes involved in both aerobic and anaerobic hydrocarbon degradation were highly enriched in the plume compared to outside the plume, indicating a great potential for intrinsic bioremediation or natural attenuation in the deep-sea. Various other microbial functional genes relevant to carbon, nitrogen, phosphorus, sulfur and iron cycling, metal resistance, and bacteriophage replication were also enriched in the plume. Together, these results suggest that the indigenous marine microbial communities could play a significant role in biodegradation of oil spills in deep-sea environments.

  4. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene.

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  5. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting. PMID:29619024

  6. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Directory of Open Access Journals (Sweden)

    Monica Casucci

    2018-03-01

    Full Text Available Chimeric antigen receptor (CAR-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR. We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6. We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  7. Evidence for intron length conservation in a set of mammalian genes associated with embryonic development

    LENUS (Irish Health Repository)

    2011-10-05

    Abstract Background We carried out an analysis of intron length conservation across a diverse group of nineteen mammalian species. Motivated by recent research suggesting a role for time delays associated with intron transcription in gene expression oscillations required for early embryonic patterning, we searched for examples of genes that showed the most extreme conservation of total intron content in mammals. Results Gene sets annotated as being involved in pattern specification in the early embryo or containing the homeobox DNA-binding domain, were significantly enriched among genes with highly conserved intron content. We used ancestral sequences reconstructed with probabilistic models that account for insertion and deletion mutations to distinguish insertion and deletion events on lineages leading to human and mouse from their last common ancestor. Using a randomization procedure, we show that genes containing the homeobox domain show less change in intron content than expected, given the number of insertion and deletion events within their introns. Conclusions Our results suggest selection for gene expression precision or the existence of additional development-associated genes for which transcriptional delay is functionally significant.

  8. Using targeted enrichment of nuclear genes to increase phylogenetic resolution in the neotropical rain forest genus Inga (Leguminosae: Mimosoideae

    Directory of Open Access Journals (Sweden)

    James A Nicholls

    2015-09-01

    Full Text Available Evolutionary radiations are prominent and pervasive across many plant lineages in diverse geographical and ecological settings; in neotropical rainforests there is growing evidence suggesting that a significant fraction of species richness is the result of recent radiations. Understanding the evolutionary trajectories and mechanisms underlying these radiations demands much greater phylogenetic resolution than is currently available for these groups. The neotropical tree genus Inga (Leguminosae is a good example, with ~300 extant species and a crown age of 2-10 MY, yet over 6kb of plastid and nuclear DNA sequence data gives only poor phylogenetic resolution among species. Here we explore the use of larger-scale nuclear gene data obtained though targeted enrichment to increase phylogenetic resolution within Inga. Transcriptome data from three Inga species were used to select 264 nuclear loci for targeted enrichment and sequencing. Following quality control to remove probable paralogs from these sequence data, the final dataset comprised 259,313 bases from 194 loci for 24 accessions representing 22 Inga species and an outgroup (Zygia. Bayesian phylogenies reconstructed using either all loci concatenated or a subset of 60 loci in a gene-tree/species-tree approach yielded highly resolved phylogenies. We used coalescent approaches to show that the same targeted enrichment data also have significant power to discriminate among alternative within-species population histories in the widespread species I. umbellifera. In either application, targeted enrichment simplifies the informatics challenge of identifying orthologous loci associated with de novo genome sequencing. We conclude that targeted enrichment provides the large volumes of phylogenetically-informative sequence data required to resolve relationships within recent plant species radiations, both at the species level and for within-species phylogeographic studies.

  9. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  10. MAGMA: generalized gene-set analysis of GWAS data.

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  11. Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

    Science.gov (United States)

    Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

    2011-11-01

    Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.

  12. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...

  13. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  14. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  15. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  16. Neuron-Enriched Gene Expression Patterns are Regionally Anti-Correlated with Oligodendrocyte-Enriched Patterns in the Adult Mouse and Human Brain.

    Science.gov (United States)

    Tan, Powell Patrick Cheng; French, Leon; Pavlidis, Paul

    2013-01-01

    An important goal in neuroscience is to understand gene expression patterns in the brain. The recent availability of comprehensive and detailed expression atlases for mouse and human creates opportunities to discover global patterns and perform cross-species comparisons. Recently we reported that the major source of variation in gene transcript expression in the adult normal mouse brain can be parsimoniously explained as reflecting regional variation in glia to neuron ratios, and is correlated with degree of connectivity and location in the brain along the anterior-posterior axis. Here we extend this investigation to two gene expression assays of adult normal human brains that consisted of over 300 brain region samples, and perform comparative analyses of brain-wide expression patterns to the mouse. We performed principal components analysis (PCA) on the regional gene expression of the adult human brain to identify the expression pattern that has the largest variance. As in the mouse, we observed that the first principal component is composed of two anti-correlated patterns enriched in oligodendrocyte and neuron markers respectively. However, we also observed interesting discordant patterns between the two species. For example, a few mouse neuron markers show expression patterns that are more correlated with the human oligodendrocyte-enriched pattern and vice-versa. In conclusion, our work provides insights into human brain function and evolution by probing global relationships between regional cell type marker expression patterns in the human and mouse brain.

  17. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  18. Alteration of synaptic activity-regulating genes underlying functional improvement by long-term exposure to an enriched environment in the adult brain.

    Science.gov (United States)

    Lee, Min-Young; Yu, Ji Hea; Kim, Ji Yeon; Seo, Jung Hwa; Park, Eun Sook; Kim, Chul Hoon; Kim, Hyongbum; Cho, Sung-Rae

    2013-01-01

    Housing animals in an enriched environment (EE) enhances behavioral function. However, the mechanism underlying this EE-mediated functional improvement and the resultant changes in gene expression have yet to be elucidated. We attempted to investigate the underlying mechanisms associated with long-term exposure to an EE by evaluating gene expression patterns. We housed 6-week-old CD-1 (ICR) mice in standard cages or an EE comprising a running wheel, novel objects, and social interaction for 2 months. Motor and cognitive performances were evaluated using the rotarod test and passive avoidance test, and gene expression profile was investigated in the cerebral hemispheres using microarray and gene set enrichment analysis (GSEA). In behavioral assessment, an EE significantly enhanced rotarod performance and short-term working memory. Microarray analysis revealed that genes associated with neuronal activity were significantly altered by an EE. GSEA showed that genes involved in synaptic transmission and postsynaptic signal transduction were globally upregulated, whereas those associated with reuptake by presynaptic neurotransmitter transporters were downregulated. In particular, both microarray and GSEA demonstrated that EE exposure increased opioid signaling, acetylcholine release cycle, and postsynaptic neurotransmitter receptors but decreased Na+ / Cl- -dependent neurotransmitter transporters, including dopamine transporter Slc6a3 in the brain. Western blotting confirmed that SLC6A3, DARPP32 (PPP1R1B), and P2RY12 were largely altered in a region-specific manner. An EE enhanced motor and cognitive function through the alteration of synaptic activity-regulating genes, improving the efficient use of neurotransmitters and synaptic plasticity by the upregulation of genes associated with postsynaptic receptor activity and downregulation of presynaptic reuptake by neurotransmitter transporters.

  19. SNP-based pathway enrichment analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Potkin Steven G

    2011-04-01

    Full Text Available Abstract Background Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs, have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can only explain a small fraction of the genetic risks associated with the complex diseases. New strategies and statistical approaches are needed to address this lack of explanation. One such approach is the pathway analysis, which considers the genetic variations underlying a biological pathway, rather than separately as in the traditional GWAS studies. A critical challenge in the pathway analysis is how to combine evidences of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs. Results We describe a SNP-based pathway enrichment method for GWAS studies. The method consists of the following two main steps: 1 for a given pathway, using an adaptive truncated product statistic to identify all representative (potentially more than one SNPs of each gene, calculating the average number of representative SNPs for the genes, then re-selecting the representative SNPs of genes in the pathway based on this number; and 2 ranking all selected SNPs by the significance of their statistical association with a trait of interest, and testing if the set of SNPs from a particular pathway is significantly enriched with high ranks using a weighted Kolmogorov-Smirnov test. We applied our method to two large genetically distinct GWAS data sets of schizophrenia, one

  20. The Gene Ontology Differs in Bursa of Fabricius Between Two Breeds of Ducks Post Hatching by Enriching the Differentially Expressed Genes

    Directory of Open Access Journals (Sweden)

    H Liu

    Full Text Available ABSTRACT The bursa of Fabricius (BF is the central humoral immune organ unique to birds. The present study investigated the possible difference on a molecular level between two duck breeds. The digital gene expression profiling (DGE technology was used to enrich the differentially expressed genes (DEGs in BF between the Jianchang and Nonghua-P strains of ducks. DGE data identified 195 DEGs in the bursa. Gene Ontology (GO analysis suggested that DEGs were mainly enriched in the metabolic pathways and ribosome components. Pathways analysis identified the spliceosome, RNA transport, RNA degradation process, Jak-STAT signaling pathway, TNF signaling pathway and B cell receptor signaling pathway. The results indicated that the main difference in the BF between the two duck strains was in the capabilities of protein formation and B cell development. These data have revealed the main divergence in the BF on a molecular level between genetically different duck breeds and may help to perform molecular breeding programs in poultry in the future.

  1. Hyb-Seq: Combining Target Enrichment and Genome Skimming for Plant Phylogenomics

    Directory of Open Access Journals (Sweden)

    Kevin Weitemier

    2014-08-01

    Full Text Available Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics.

  2. Benchmarking Methods and Data Sets for Ligand Enrichment Assessment in Virtual Screening

    Science.gov (United States)

    Xia, Jie; Tilahun, Ermias Lemma; Reid, Terry-Elinor; Zhang, Liangren; Wang, Xiang Simon

    2014-01-01

    Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate ligand enrichments of VS approaches in the prospective (i.e. real-world) efforts. However, the intrinsic differences of benchmarking sets to the real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods as well as data sets and highlight three main types of biases found in benchmarking sets, i.e. “analogue bias”, “artificial enrichment” and “false negative”. In addition, we introduced our recent algorithm to build maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its implementations to three important human histone deacetylase (HDAC) isoforms, i.e. HDAC1, HDAC6 and HDAC8. The Leave-One-Out Cross-Validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased in terms of property matching, ROC curves and AUCs. PMID:25481478

  3. Development of a versatile enrichment analysis tool reveals associations between the maternal brain and mental health disorders, including autism

    Science.gov (United States)

    2013-01-01

    Background A recent study of lateral septum (LS) suggested a large number of autism-related genes with altered expression in the postpartum state. However, formally testing the findings for enrichment of autism-associated genes proved to be problematic with existing software. Many gene-disease association databases have been curated which are not currently incorporated in popular, full-featured enrichment tools, and the use of custom gene lists in these programs can be difficult to perform and interpret. As a simple alternative, we have developed the Modular Single-set Enrichment Test (MSET), a minimal tool that enables one to easily evaluate expression data for enrichment of any conceivable gene list of interest. Results The MSET approach was validated by testing several publicly available expression data sets for expected enrichment in areas of autism, attention deficit hyperactivity disorder (ADHD), and arthritis. Using nine independent, unique autism gene lists extracted from association databases and two recent publications, a striking consensus of enrichment was detected within gene expression changes in LS of postpartum mice. A network of 160 autism-related genes was identified, representing developmental processes such as synaptic plasticity, neuronal morphogenesis, and differentiation. Additionally, maternal LS displayed enrichment for genes associated with bipolar disorder, schizophrenia, ADHD, and depression. Conclusions The transition to motherhood includes the most fundamental social bonding event in mammals and features naturally occurring changes in sociability. Some individuals with autism, schizophrenia, or other mental health disorders exhibit impaired social traits. Genes involved in these deficits may also contribute to elevated sociability in the maternal brain. To date, this is the first study to show a significant, quantitative link between the maternal brain and mental health disorders using large scale gene expression data. Thus, the

  4. Omega-3 Fatty Acid Enriched Chevon (Goat Meat Lowers Plasma Cholesterol Levels and Alters Gene Expressions in Rats

    Directory of Open Access Journals (Sweden)

    Mahdi Ebrahimi

    2014-01-01

    Full Text Available In this study, control chevon (goat meat and omega-3 fatty acid enriched chevon were obtained from goats fed a 50% oil palm frond diet and commercial goat concentrate for 100 days, respectively. Goats fed the 50% oil palm frond diet contained high amounts of α-linolenic acid (ALA in their meat compared to goats fed the control diet. The chevon was then used to prepare two types of pellets (control or enriched chevon that were then fed to twenty-male-four-month-old Sprague-Dawley rats (n=10 in each group for 12 weeks to evaluate their effects on plasma cholesterol levels, tissue fatty acids, and gene expression. There was a significant increase in ALA and docosahexaenoic acid (DHA in the muscle tissues and liver of the rats fed the enriched chevon compared with the control group. Plasma cholesterol also decreased (P<0.05 in rats fed the enriched chevon compared to the control group. The rat pellets containing enriched chevon significantly upregulated the key transcription factor PPAR-γ and downregulated SREBP-1c expression relative to the control group. The results showed that the omega-3 fatty acid enriched chevon increased the omega-3 fatty acids in the rat tissues and altered PPAR-γ and SREBP-1c genes expression.

  5. Omega-3 fatty acid enriched chevon (goat meat) lowers plasma cholesterol levels and alters gene expressions in rats.

    Science.gov (United States)

    Ebrahimi, Mahdi; Rajion, Mohamed Ali; Meng, Goh Yong; Soleimani Farjam, Abdoreza

    2014-01-01

    In this study, control chevon (goat meat) and omega-3 fatty acid enriched chevon were obtained from goats fed a 50% oil palm frond diet and commercial goat concentrate for 100 days, respectively. Goats fed the 50% oil palm frond diet contained high amounts of α-linolenic acid (ALA) in their meat compared to goats fed the control diet. The chevon was then used to prepare two types of pellets (control or enriched chevon) that were then fed to twenty-male-four-month-old Sprague-Dawley rats (n = 10 in each group) for 12 weeks to evaluate their effects on plasma cholesterol levels, tissue fatty acids, and gene expression. There was a significant increase in ALA and docosahexaenoic acid (DHA) in the muscle tissues and liver of the rats fed the enriched chevon compared with the control group. Plasma cholesterol also decreased (P < 0.05) in rats fed the enriched chevon compared to the control group. The rat pellets containing enriched chevon significantly upregulated the key transcription factor PPAR-γ and downregulated SREBP-1c expression relative to the control group. The results showed that the omega-3 fatty acid enriched chevon increased the omega-3 fatty acids in the rat tissues and altered PPAR-γ and SREBP-1c genes expression.

  6. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior.

    Science.gov (United States)

    Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J

    2016-08-01

    In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene-set

  7. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Science.gov (United States)

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the

  8. Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics1

    Science.gov (United States)

    Weitemier, Kevin; Straub, Shannon C. K.; Cronn, Richard C.; Fishbein, Mark; Schmickl, Roswitha; McDonnell, Angela; Liston, Aaron

    2014-01-01

    • Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. • Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics. PMID:25225629

  9. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-01-01

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991

  10. Discovery of cancer common and specific driver gene sets

    Science.gov (United States)

    2017-01-01

    Abstract Cancer is known as a disease mainly caused by gene alterations. Discovery of mutated driver pathways or gene sets is becoming an important step to understand molecular mechanisms of carcinogenesis. However, systematically investigating commonalities and specificities of driver gene sets among multiple cancer types is still a great challenge, but this investigation will undoubtedly benefit deciphering cancers and will be helpful for personalized therapy and precision medicine in cancer treatment. In this study, we propose two optimization models to de novo discover common driver gene sets among multiple cancer types (ComMDP) and specific driver gene sets of one certain or multiple cancer types to other cancers (SpeMDP), respectively. We first apply ComMDP and SpeMDP to simulated data to validate their efficiency. Then, we further apply these methods to 12 cancer types from The Cancer Genome Atlas (TCGA) and obtain several biologically meaningful driver pathways. As examples, we construct a common cancer pathway model for BRCA and OV, infer a complex driver pathway model for BRCA carcinogenesis based on common driver gene sets of BRCA with eight cancer types, and investigate specific driver pathways of the liquid cancer lymphoblastic acute myeloid leukemia (LAML) versus other solid cancer types. In these processes more candidate cancer genes are also found. PMID:28168295

  11. APPRIS 2017: principal isoforms for multiple gene sets

    Science.gov (United States)

    Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

    2018-01-01

    Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475

  12. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Directory of Open Access Journals (Sweden)

    Hettne Kristina M

    2013-01-01

    Full Text Available Abstract Background Availability of chemical response-specific lists of genes (gene sets for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM, and that these can be used with gene set analysis (GSA methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human and 588 (mouse gene sets from the Comparative Toxicogenomics Database (CTD. We tested for significant differential expression (SDE (false discovery rate -corrected p-values Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.

  13. Enrichment of HP1a on Drosophila chromosome 4 genes creates an alternate chromatin structure critical for regulation in this heterochromatic domain.

    Directory of Open Access Journals (Sweden)

    Nicole C Riddle

    2012-09-01

    Full Text Available Chromatin environments differ greatly within a eukaryotic genome, depending on expression state, chromosomal location, and nuclear position. In genomic regions characterized by high repeat content and high gene density, chromatin structure must silence transposable elements but permit expression of embedded genes. We have investigated one such region, chromosome 4 of Drosophila melanogaster. Using chromatin-immunoprecipitation followed by microarray (ChIP-chip analysis, we examined enrichment patterns of 20 histone modifications and 25 chromosomal proteins in S2 and BG3 cells, as well as the changes in several marks resulting from mutations in key proteins. Active genes on chromosome 4 are distinct from those in euchromatin or pericentric heterochromatin: while there is a depletion of silencing marks at the transcription start sites (TSSs, HP1a and H3K9me3, but not H3K9me2, are enriched strongly over gene bodies. Intriguingly, genes on chromosome 4 are less frequently associated with paused polymerase. However, when the chromatin is altered by depleting HP1a or POF, the RNA pol II enrichment patterns of many chromosome 4 genes shift, showing a significant decrease over gene bodies but not at TSSs, accompanied by lower expression of those genes. Chromosome 4 genes have a low incidence of TRL/GAGA factor binding sites and a low T(m downstream of the TSS, characteristics that could contribute to a low incidence of RNA polymerase pausing. Our data also indicate that EGG and POF jointly regulate H3K9 methylation and promote HP1a binding over gene bodies, while HP1a targeting and H3K9 methylation are maintained at the repeats by an independent mechanism. The HP1a-enriched, POF-associated chromatin structure over the gene bodies may represent one type of adaptation for genes embedded in repetitive DNA.

  14. Identification of Genes Enriched in GnRH Neurons by Translating Ribosome Affinity Purification and RNAseq in Mice.

    Science.gov (United States)

    Burger, Laura L; Vanacker, Charlotte; Phumsatitpong, Chayarndorn; Wagenmaker, Elizabeth R; Wang, Luhong; Olson, David P; Moenter, Suzanne M

    2018-04-01

    Gonadotropin-releasing hormone (GnRH) neurons are a nexus of fertility regulation. We used translating ribosome affinity purification coupled with RNA sequencing to examine messenger RNAs of GnRH neurons in adult intact and gonadectomized (GDX) male and female mice. GnRH neuron ribosomes were tagged with green fluorescent protein (GFP) and GFP-labeled polysomes isolated by immunoprecipitation, producing one RNA fraction enhanced for GnRH neuron transcripts and one RNA fraction depleted. Complementary DNA libraries were created from each fraction and 50-base, paired-end sequencing done and differential expression (enhanced fraction/depleted fraction) determined with a threshold of >1.5- or <0.66-fold (false discovery rate P ≤ 0.05). A core of ∼840 genes was differentially expressed in GnRH neurons in all treatments, including enrichment for Gnrh1 (∼40-fold), and genes critical for GnRH neuron and/or gonadotrope development. In contrast, non-neuronal transcripts were not enriched or were de-enriched. Several epithelial markers were also enriched, consistent with the olfactory epithelial origins of GnRH neurons. Interestingly, many synaptic transmission pathways were de-enriched, in accordance with relatively low innervation of GnRH neurons. The most striking difference between intact and GDX mice of both sexes was a marked downregulation of genes associated with oxidative phosphorylation and upregulation of glucose transporters in GnRH neurons from GDX mice. This may suggest that GnRH neurons switch to an alternate fuel to increase adenosine triphosphate production in the absence of negative feedback when GnRH release is elevated. Knowledge of the GnRH neuron translatome and its regulation can guide functional studies and can be extended to disease states, such as polycystic ovary syndrome.

  15. Groundwater fluoride enrichment in an active rift setting: Central Kenya Rift case study

    Energy Technology Data Exchange (ETDEWEB)

    Olaka, Lydia A., E-mail: lydiaolaka@gmail.com [Department of Geology, University of Nairobi, P.O Box 30197, Nairobi (Kenya); Wilke, Franziska D.H. [Geoforschungs Zentrum, Telegrafenberg, 14473 Potsdam (Germany); Olago, Daniel O.; Odada, Eric O. [Department of Geology, University of Nairobi, P.O Box 30197, Nairobi (Kenya); Mulch, Andreas [Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325 Frankfurt (Germany); Institut für Geowissenschaften, Goethe Universität Frankfurt, Altenhöferallee 1, 60438 Frankfurt (Germany); Musolff, Andreas [UFZ-Helmholtz-Centre for Environmental Research, Department of Hydrogeology, Permoserstr. 15, 04318 Leipzig (Germany)

    2016-03-01

    Groundwater is used extensively in the Central Kenya Rift for domestic and agricultural demands. In these active rift settings groundwater can exhibit high fluoride levels. In order to address water security and reduce human exposure to high fluoride in drinking water, knowledge of the source and geochemical processes of enrichment are required. A study was therefore carried out within the Naivasha catchment (Kenya) to understand the genesis, enrichment and seasonal variations of fluoride in the groundwater. Rocks, rain, surface and groundwater sources were sampled for hydrogeochemical and isotopic investigations, the data was statistically and geospatially analyzed. Water sources have variable fluoride concentrations between 0.02–75 mg/L. 73% exceed the health limit (1.5 mg/L) in both dry and wet seasons. F{sup −} concentrations in rivers are lower (0.2–9.2 mg/L) than groundwater (0.09 to 43.6 mg/L) while saline lake waters have the highest concentrations (0.27–75 mg/L). The higher values are confined to elevations below 2000 masl. Oxygen (δ{sup 18}O) and hydrogen (δD) isotopic values range from − 6.2 to + 5.8‰ and − 31.3 to + 33.3‰, respectively, they are also highly variable in the rift floor where they attain maximum values. Fluoride base levels in the precursor vitreous volcanic rocks are higher (between 3750–6000 ppm) in minerals such as cordierite and muscovite while secondary minerals like illite and kaolinite have lower remnant fluoride (< 1000 ppm). Thus, geochemical F{sup −} enrichment in regional groundwater is mainly due to a) rock alteration, i.e. through long residence times and natural discharge and/or enhanced leakages of deep seated geothermal water reservoirs, b) secondary concentration fortification of natural reservoirs through evaporation, through reduced recharge and/or enhanced abstraction and c) through additional enrichment of fluoride after volcanic emissions. The findings are useful to help improve water management

  16. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population.

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-06-02

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

    Directory of Open Access Journals (Sweden)

    Bharti Arvind K

    2008-12-01

    Full Text Available Abstract Background Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR and methylation spanning linker libraries (MSLL. These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig, while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%. These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of

  18. twzPEA: A Topology and Working Zone Based Pathway Enrichment Analysis Framework

    Science.gov (United States)

    Sensitive detection of involvement and adaptation of key signaling, regulatory, and metabolic pathways holds the key to deciphering molecular mechanisms such as those in the biomass-to-biofuel conversion process in yeast. Typical gene set enrichment analyses often do not use topology information in...

  19. Sex hormones and gene expression signatures in peripheral blood from postmenopausal women - the NOWAC postgenome study

    Directory of Open Access Journals (Sweden)

    Rylander Charlotta

    2011-03-01

    Full Text Available Abstract Background Postmenopausal hormone therapy (HT influences endogenous hormone concentrations and increases the risk of breast cancer. Gene expression profiling may reveal the mechanisms behind this relationship. Our objective was to explore potential associations between sex hormones and gene expression in whole blood from a population-based, random sample of postmenopausal women Methods Gene expression, as measured by the Applied Biosystems microarray platform, was compared between hormone therapy (HT users and non-users and between high and low hormone plasma concentrations using both gene-wise analysis and gene set analysis. Gene sets found to be associated with HT use were further analysed for enrichment in functional clusters and network predictions. The gene expression matrix included 285 samples and 16185 probes and was adjusted for significant technical variables. Results Gene-wise analysis revealed several genes significantly associated with different types of HT use. The functional cluster analyses provided limited information on these genes. Gene set analysis revealed 22 gene sets that were enriched between high and low estradiol concentration (HT-users excluded. Among these were seven oestrogen related gene sets, including our gene list associated with systemic estradiol use, which thereby represents a novel oestrogen signature. Seven gene sets were related to immune response. Among the 15 gene sets enriched for progesterone, 11 overlapped with estradiol. No significant gene expression patterns were found for testosterone, follicle stimulating hormone (FSH or sex hormone binding globulin (SHBG. Conclusions Distinct gene expression patterns associated with sex hormones are detectable in a random group of postmenopausal women, as demonstrated by the finding of a novel oestrogen signature.

  20. Expressed sequence enrichment for candidate gene analysis of citrus tristeza virus resistance.

    Science.gov (United States)

    Bernet, G P; Bretó, M P; Asins, M J

    2004-02-01

    Several studies have reported markers linked to a putative resistance gene from Poncirus trifoliata ( Ctv-R) located at linkage group 4 that confers resistance against one of the most important citrus pathogens, citrus tristeza virus (CTV). To be successful in both marker-assisted selection and transformation experiments, its accurate mapping is needed. Several factors may affect its localization, among them two are considered here: the definition of resistance and the genetic background of progeny. Two progenies derived from P. trifoliata, by self-pollination and by crossing with sour orange ( Citrus aurantium), a citrus rootstock well-adapted to arid and semi-arid areas, were used for linkage group-4 marker enrichment. Two new methodologies were used to enrich this region with expressed sequences. The enrichment of group 4 resulted in the fusion of several C. aurantium linkage groups. The new one A(7+3+4) is now saturated with 48 markers including expressed sequences. Surprisingly, sour orange was as resistant to the CTV isolate tested as was P. trifoliata, and three hybrids that carry Ctv-R, as deduced from its flanking markers, are susceptible to CTV. The new linkage maps were used to map Ctv-R under the hypothesis of monogenic inheritance. Its position on linkage group 4 of P. trifoliata differs from the location previously reported in other progenies. The genetic analysis of virus-plant interaction in the family derived from C. aurantium after a CTV chronic infection showed the segregation of five types of interaction, which is not compatible with the hypothesis of a single gene controlling resistance. Two major issues are discussed: another type of genetic analysis of CTV resistance is needed to avoid the assumption of monogenic inheritance, and transferring Ctv-R from P. trifoliata to sour orange might not avoid the CTV decline of sweet orange trees.

  1. Enrichment of target sequences for next-generation sequencing applications in research and diagnostics.

    Science.gov (United States)

    Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter

    2014-02-01

    Abstract Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled about the requirement to take decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.

  2. A novel CpG island set identifies tissue-specific methylation at developmental gene loci.

    Directory of Open Access Journals (Sweden)

    Robert Illingworth

    2008-01-01

    Full Text Available CpG islands (CGIs are dense clusters of CpG sequences that punctuate the CpG-deficient human genome and associate with many gene promoters. As CGIs also differ from bulk chromosomal DNA by their frequent lack of cytosine methylation, we devised a CGI enrichment method based on nonmethylated CpG affinity chromatography. The resulting library was sequenced to define a novel human blood CGI set that includes many that are not detected by current algorithms. Approximately half of CGIs were associated with annotated gene transcription start sites, the remainder being intra- or intergenic. Using an array representing over 17,000 CGIs, we established that 6%-8% of CGIs are methylated in genomic DNA of human blood, brain, muscle, and spleen. Inter- and intragenic CGIs are preferentially susceptible to methylation. CGIs showing tissue-specific methylation were overrepresented at numerous genetic loci that are essential for development, including HOX and PAX family members. The findings enable a comprehensive analysis of the roles played by CGI methylation in normal and diseased human tissues.

  3. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, D.A. van; Goeman, J.J.; Jong, E. de; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    BACKGROUND: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  4. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, van D.A.M.; Goeman, J.J.; Jong, de E.; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  5. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    Energy Technology Data Exchange (ETDEWEB)

    Pandi, Narayanan Sathiya, E-mail: sathiyapandi@gmail.com; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.

  6. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    International Nuclear Information System (INIS)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-01-01

    Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC

  7. Gene set analysis: limitations in popular existing methods and proposed improvements.

    Science.gov (United States)

    Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

    2014-10-01

    Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-value that requires large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistics is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Schizophrenia and vitamin D related genes could have been subject to latitude-driven adaptation.

    Science.gov (United States)

    Amato, Roberto; Pinelli, Michele; Monticelli, Antonella; Miele, Gennaro; Cocozza, Sergio

    2010-11-11

    Many natural phenomena are directly or indirectly related to latitude. Living at different latitudes, indeed, has its consequences with being exposed to different climates, diets, light/dark cycles, etc. In humans, one of the best known examples of genetic traits following a latitudinal gradient is skin pigmentation. Nevertheless, also several diseases show latitudinal clinals such as hypertension, cancer, dismetabolic conditions, schizophrenia, Parkinson's disease and many more. We investigated, for the first time on a wide genomic scale, the latitude-driven adaptation phenomena. In particular, we selected a set of genes showing signs of latitude-dependent population differentiation. The biological characterization of these genes showed enrichment for neural-related processes. In light of this, we investigated whether genes associated to neuropsychiatric diseases were enriched by Latitude-Related Genes (LRGs). We found a strong enrichment of LRGs in the set of genes associated to schizophrenia. In an attempt to try to explain this possible link between latitude and schizophrenia, we investigated their associations with vitamin D. We found in a set of vitamin D related genes a significant enrichment of both LRGs and of genes involved in schizophrenia. Our results suggest a latitude-driven adaptation for both schizophrenia and vitamin D related genes. In addition we confirm, at a molecular level, the link between schizophrenia and vitamin D. Finally, we discuss a model in which schizophrenia is, at least partly, a maladaptive by-product of latitude dependent adaptive changes in vitamin D metabolism.

  9. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior

    NARCIS (Netherlands)

    J. Windhorst (Judith); V. Mileva-Seitz (Viara); R.C.A. Rippe (Ralph C.A.); H.W. Tiemeier (Henning); V.W.V. Jaddoe (Vincent); F.C. Verhulst (Frank); M.H. van IJzendoorn (Rien); M.J. Bakermans-Kranenburg (Marian)

    2016-01-01

    textabstractBackground: In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and

  10. Effect of bioaugmentation by cellulolytic bacteria enriched from sheep rumen on methane production from wheat straw.

    Science.gov (United States)

    Ozbayram, E Gozde; Kleinsteuber, Sabine; Nikolausz, Marcell; Ince, Bahar; Ince, Orhan

    2017-08-01

    The aim of this study was to determine the potential of bioaugmentation with cellulolytic rumen microbiota to enhance the anaerobic digestion of lignocellulosic feedstock. An anaerobic cellulolytic culture was enriched from sheep rumen fluid using wheat straw as substrate under mesophilic conditions. To investigate the effects of bioaugmentation on methane production from straw, the enrichment culture was added to batch reactors in proportions of 2% (Set-1) and 4% (Set-2) of the microbial cell number of the standard inoculum slurry. The methane production in the bioaugmented reactors was higher than in the control reactors. After 30 days of batch incubation, the average methane yield was 154 mL N CH 4 g VS -1 in the control reactors. Addition of 2% enrichment culture did not enhance methane production, whereas in Set-2 the methane yield was increased by 27%. The bacterial communities were examined by 454 amplicon sequencing of 16S rRNA genes, while terminal restriction fragment length polymorphism (T-RFLP) fingerprinting of mcrA genes was applied to analyze the methanogenic communities. The results highlighted that relative abundances of Ruminococcaceae and Lachnospiraceae increased during the enrichment. However, Cloacamonaceae, which were abundant in the standard inoculum, dominated the bacterial communities of all batch reactors. T-RFLP profiles revealed that Methanobacteriales were predominant in the rumen fluid, whereas the enrichment culture was dominated by Methanosarcinales. In the batch rectors, the most abundant methanogens were affiliated to Methanobacteriales and Methanomicrobiales. Our results suggest that bioaugmentation with sheep rumen enrichment cultures can enhance the performance of digesters treating lignocellulosic feedstock. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Enrichment of short interspersed transposable elements to embryonic stem cell-specific hypomethylated gene regions.

    Science.gov (United States)

    Muramoto, Hiroki; Yagi, Shintaro; Hirabayashi, Keiji; Sato, Shinya; Ohgane, Jun; Tanaka, Satoshi; Shiota, Kunio

    2010-08-01

    Embryonic stem cells (ESCs) have a distinctive epigenome, which includes their genome-wide DNA methylation modification status, as represented by the ESC-specific hypomethylation of tissue-dependent and differentially methylated regions (T-DMRs) of Pou5f1 and Nanog. Here, we conducted a genome-wide investigation of sequence characteristics associated with T-DMRs that were differentially methylated between ESCs and somatic cells, by focusing on transposable elements including short interspersed elements (SINEs), long interspersed elements (LINEs) and long terminal repeats (LTRs). We found that hypomethylated T-DMRs were predominantly present in SINE-rich/LINE-poor genomic loci. The enrichment for SINEs spread over 300 kb in cis and there existed SINE-rich genomic domains spreading continuously over 1 Mb, which contained multiple hypomethylated T-DMRs. The characterization of sequence information showed that the enriched SINEs were relatively CpG rich and belonged to specific subfamilies. A subset of the enriched SINEs were hypomethylated T-DMRs in ESCs at Dppa3 gene locus, although SINEs are overall methylated in both ESCs and the liver. In conclusion, we propose that SINE enrichment is the genomic property of regions harboring hypomethylated T-DMRs in ESCs, which is a novel aspect of the ESC-specific epigenomic information.

  12. Pathway-Enriched Gene Signature Associated with 53BP1 Response to PARP Inhibition in Triple-Negative Breast Cancer.

    Science.gov (United States)

    Hassan, Saima; Esch, Amanda; Liby, Tiera; Gray, Joe W; Heiser, Laura M

    2017-12-01

    Effective treatment of patients with triple-negative (ER-negative, PR-negative, HER2-negative) breast cancer remains a challenge. Although PARP inhibitors are being evaluated in clinical trials, biomarkers are needed to identify patients who will most benefit from anti-PARP therapy. We determined the responses of three PARP inhibitors (veliparib, olaparib, and talazoparib) in a panel of eight triple-negative breast cancer cell lines. Therapeutic responses and cellular phenotypes were elucidated using high-content imaging and quantitative immunofluorescence to assess markers of DNA damage (53BP1) and apoptosis (cleaved PARP). We determined the pharmacodynamic changes as percentage of cells positive for 53BP1, mean number of 53BP1 foci per cell, and percentage of cells positive for cleaved PARP. Inspired by traditional dose-response measures of cell viability, an EC 50 value was calculated for each cellular phenotype and each PARP inhibitor. The EC 50 values for both 53BP1 metrics strongly correlated with IC 50 values for each PARP inhibitor. Pathway enrichment analysis identified a set of DNA repair and cell cycle-associated genes that were associated with 53BP1 response following PARP inhibition. The overall accuracy of our 63 gene set in predicting response to olaparib in seven breast cancer patient-derived xenograft tumors was 86%. In triple-negative breast cancer patients who had not received anti-PARP therapy, the predicted response rate of our gene signature was 45%. These results indicate that 53BP1 is a biomarker of response to anti-PARP therapy in the laboratory, and our DNA damage response gene signature may be used to identify patients who are most likely to respond to PARP inhibition. Mol Cancer Ther; 16(12); 2892-901. ©2017 AACR . ©2017 American Association for Cancer Research.

  13. Genomic Analysis Reveals Contrasting PIFq Contribution to Diurnal Rhythmic Gene Expression in PIF-Induced and -Repressed Genes.

    Science.gov (United States)

    Martin, Guiomar; Soy, Judit; Monte, Elena

    2016-01-01

    Members of the PIF quartet (PIFq; PIF1, PIF3, PIF4, and PIF5) collectively contribute to induce growth in Arabidopsis seedlings under short day (SD) conditions, specifically promoting elongation at dawn. Their action involves the direct regulation of growth-related and hormone-associated genes. However, a comprehensive definition of the PIFq-regulated transcriptome under SD is still lacking. We have recently shown that SD and free-running (LL) conditions correspond to "growth" and "no growth" conditions, respectively, correlating with greater abundance of PIF protein in SD. Here, we present a genomic analysis whereby we first define SD-regulated genes at dawn compared to LL in the wild type, followed by identification of those SD-regulated genes whose expression depends on the presence of PIFq. By using this sequential strategy, we have identified 349 PIF/SD-regulated genes, approximately 55% induced and 42% repressed by both SD and PIFq. Comparison with available databases indicates that PIF/SD-induced and PIF/SD-repressed sets are differently phased at dawn and mid-morning, respectively. In addition, we found that whereas rhythmicity of the PIF/SD-induced gene set is lost in LL, most PIF/SD-repressed genes keep their rhythmicity in LL, suggesting differential regulation of both gene sets by the circadian clock. Moreover, we also uncovered distinct overrepresented functions in the induced and repressed gene sets, in accord with previous studies in other examined PIF-regulated processes. Interestingly, promoter analyses showed that, whereas PIF/SD-induced genes are enriched in direct PIF targets, PIF/SD-repressed genes are mostly indirectly regulated by the PIFs and might be more enriched in ABA-regulated genes.

  14. Multiplex Real-Time PCR for Detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL) Genes from Selective Enrichments from Animals and Retail Meat

    Science.gov (United States)

    Velasco, Valeria; Sherwood, Julie S.; Rojas-García, Pedro P.; Logue, Catherine M.

    2014-01-01

    The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL) genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat). The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus), mecA (associated with methicillin resistance) and PVL (virulence factor), and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68–0.88 (from substantial to almost perfect agreement) and 0.29–0.77 (from fair to substantial agreement) for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0–0.49 (from no agreement beyond that expected by chance to moderate agreement) for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA) using

  15. Multiplex real-time PCR for detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL genes from selective enrichments from animals and retail meat.

    Directory of Open Access Journals (Sweden)

    Valeria Velasco

    Full Text Available The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat. The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus, mecA (associated with methicillin resistance and PVL (virulence factor, and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68-0.88 (from substantial to almost perfect agreement and 0.29-0.77 (from fair to substantial agreement for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0-0.49 (from no agreement beyond that expected by chance to moderate agreement for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA

  16. Annotating gene sets by mining large literature collections with protein networks.

    Science.gov (United States)

    Wang, Sheng; Ma, Jianzhu; Yu, Michael Ku; Zheng, Fan; Huang, Edward W; Han, Jiawei; Peng, Jian; Ideker, Trey

    2018-01-01

    Analysis of patient genomes and transcriptomes routinely recognizes new gene sets associated with human disease. Here we present an integrative natural language processing system which infers common functions for a gene set through automatic mining of the scientific literature with biological networks. This system links genes with associated literature phrases and combines these links with protein interactions in a single heterogeneous network. Multiscale functional annotations are inferred based on network distances between phrases and genes and then visualized as an ontology of biological concepts. To evaluate this system, we predict functions for gene sets representing known pathways and find that our approach achieves substantial improvement over the conventional text-mining baseline method. Moreover, our system discovers novel annotations for gene sets or pathways without previously known functions. Two case studies demonstrate how the system is used in discovery of new cancer-related pathways with ontological annotations.

  17. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

    Science.gov (United States)

    Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

    2012-04-01

    Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.

  18. Schizophrenia and vitamin D related genes could have been subject to latitude-driven adaptation

    Directory of Open Access Journals (Sweden)

    Monticelli Antonella

    2010-11-01

    Full Text Available Abstract Background Many natural phenomena are directly or indirectly related to latitude. Living at different latitudes, indeed, has its consequences with being exposed to different climates, diets, light/dark cycles, etc. In humans, one of the best known examples of genetic traits following a latitudinal gradient is skin pigmentation. Nevertheless, also several diseases show latitudinal clinals such as hypertension, cancer, dismetabolic conditions, schizophrenia, Parkinson's disease and many more. Results We investigated, for the first time on a wide genomic scale, the latitude-driven adaptation phenomena. In particular, we selected a set of genes showing signs of latitude-dependent population differentiation. The biological characterization of these genes showed enrichment for neural-related processes. In light of this, we investigated whether genes associated to neuropsychiatric diseases were enriched by Latitude-Related Genes (LRGs. We found a strong enrichment of LRGs in the set of genes associated to schizophrenia. In an attempt to try to explain this possible link between latitude and schizophrenia, we investigated their associations with vitamin D. We found in a set of vitamin D related genes a significant enrichment of both LRGs and of genes involved in schizophrenia. Conclusions Our results suggest a latitude-driven adaptation for both schizophrenia and vitamin D related genes. In addition we confirm, at a molecular level, the link between schizophrenia and vitamin D. Finally, we discuss a model in which schizophrenia is, at least partly, a maladaptive by-product of latitude dependent adaptive changes in vitamin D metabolism.

  19. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  20. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

    Science.gov (United States)

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-03-01

    comprehensive gene data set of sex pheromone biosynthesis and degradation enzyme related genes in DBM created by genome- and transcriptome-wide identification, characterization and expression profiling. Our findings provide a basis to better understand the function of genes with tissue enriched expression. The results also provide information on the genes involved in sex pheromone biosynthesis and degradation, and may be useful to identify potential gene targets for pest control strategies by disrupting the insect-insect communication using pheromone-based behavioral antagonists.

  1. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer.

    Science.gov (United States)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC. Copyright © 2013 Elsevier Inc. All rights reserved.

  2. RS-SNP: a random-set method for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Mukherjee Sayan

    2011-03-01

    Full Text Available Abstract Background The typical objective of Genome-wide association (GWA studies is to identify single-nucleotide polymorphisms (SNPs and corresponding genes with the strongest evidence of association (the 'most-significant SNPs/genes' approach. Borrowing ideas from micro-array data analysis, we propose a new method, named RS-SNP, for detecting sets of genes enriched in SNPs moderately associated to the phenotype. RS-SNP assesses whether the number of significant SNPs, with p-value P ≤ α, belonging to a given SNP set is statistically significant. The rationale of proposed method is that two kinds of null hypotheses are taken into account simultaneously. In the first null model the genotype and the phenotype are assumed to be independent random variables and the null distribution is the probability of the number of significant SNPs in greater than observed by chance. The second null model assumes the number of significant SNPs in depends on the size of and not on the identity of the SNPs in . Statistical significance is assessed using non-parametric permutation tests. Results We applied RS-SNP to the Crohn's disease (CD data set collected by the Wellcome Trust Case Control Consortium (WTCCC and compared the results with GENGEN, an approach recently proposed in literature. The enrichment analysis using RS-SNP and the set of pathways contained in the MSigDB C2 CP pathway collection highlighted 86 pathways rich in SNPs weakly associated to CD. Of these, 47 were also indicated to be significant by GENGEN. Similar results were obtained using the MSigDB C5 pathway collection. Many of the pathways found to be enriched by RS-SNP have a well-known connection to CD and often with inflammatory diseases. Conclusions The proposed method is a valuable alternative to other techniques for enrichment analysis of SNP sets. It is well founded from a theoretical and statistical perspective. Moreover, the experimental comparison with GENGEN highlights that it is

  3. Environmental enrichment increases transcriptional and epigenetic differentiation between mouse dorsal and ventral dentate gyrus.

    Science.gov (United States)

    Zhang, Tie-Yuan; Keown, Christopher L; Wen, Xianglan; Li, Junhao; Vousden, Dulcie A; Anacker, Christoph; Bhattacharyya, Urvashi; Ryan, Richard; Diorio, Josie; O'Toole, Nicholas; Lerch, Jason P; Mukamel, Eran A; Meaney, Michael J

    2018-01-19

    Early life experience influences stress reactivity and mental health through effects on cognitive-emotional functions that are, in part, linked to gene expression in the dorsal and ventral hippocampus. The hippocampal dentate gyrus (DG) is a major site for experience-dependent plasticity associated with sustained transcriptional alterations, potentially mediated by epigenetic modifications. Here, we report comprehensive DNA methylome, hydroxymethylome and transcriptome data sets from mouse dorsal and ventral DG. We find genome-wide transcriptional and methylation differences between dorsal and ventral DG, including at key developmental transcriptional factors. Peripubertal environmental enrichment increases hippocampal volume and enhances dorsal DG-specific differences in gene expression. Enrichment also enhances dorsal-ventral differences in DNA methylation, including at binding sites of the transcription factor NeuroD1, a regulator of adult neurogenesis. These results indicate a dorsal-ventral asymmetry in transcription and methylation that parallels well-known functional and anatomical differences, and that may be enhanced by environmental enrichment.

  4. Witnessing stressful events induces glutamatergic synapse pathway alterations and gene set enrichment of positive EPSP regulation within the VTA of adult mice: An ontology based approach

    Science.gov (United States)

    Brewer, Jacob S.

    It is well known that exposure to severe stress increases the risk for developing mood disorders. Currently, the neurobiological and genetic mechanisms underlying the functional effects of psychological stress are poorly understood. Presenting a major obstacle to the study of psychological stress is the inability of current animal models of stress to distinguish between physical and psychological stressors. A novel paradigm recently developed by Warren et al., is able to tease apart the effects of physical and psychological stress in adult mice by allowing these mice to "witness," the social defeat of another mouse thus removing confounding variables associated with physical stressors. Using this 'witness' model of stress and RNA-Seq technology, the current study aims to study the genetic effects of psychological stress. After, witnessing the social defeat of another mouse, VTA tissue was extracted, sequenced, and analyzed for differential expression. Since genes often work together in complex networks, a pathway and gene ontology (GO) analysis was performed using data from the differential expression analysis. The pathway and GO analyzes revealed a perturbation of the glutamatergic synapse pathway and an enrichment of positive excitatory post-synaptic potential regulation. This is consistent with the excitatory synapse theory of depression. Together these findings demonstrate a dysregulation of the mesolimbic reward pathway at the gene level as a result of psychological stress potentially contributing to depressive like behaviors.

  5. Histone H4 Lys 20 methyltransferase SET8 promotes androgen receptor-mediated transcription activation in prostate cancer

    Energy Technology Data Exchange (ETDEWEB)

    Yao, Lushuai [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); University of Chinese Academy of Sciences, Beijing 100049 (China); Li, Yanyan; Du, Fengxia [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); Han, Xiao [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); University of Chinese Academy of Sciences, Beijing 100049 (China); Li, Xiaohua [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); Niu, Yuanjie [Chawnshang Chang Sex Hormone Research Center, Tianjin Institute of Urology, Tianjin Medical University, Tianjin 300070 (China); Ren, Shancheng, E-mail: renshancheng@gmail.com [Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai 200433 (China); Sun, Yingli, E-mail: sunyl@big.ac.cn [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China)

    2014-07-18

    Highlights: • Dihydrotestosterone stimulates H4K20me1 enrichment at the PSA promoter. • SET8 promotes AR-mediated transcription activation. • SET8 interacts with AR and promotes cell proliferation. - Abstract: Histone methylation status in different lysine residues has an important role in transcription regulation. The effect of H4K20 monomethylation (H4K20me1) on androgen receptor (AR)-mediated gene transcription remains unclear. Here we show that AR agonist stimulates the enrichment of H4K20me1 and SET8 at the promoter of AR target gene PSA in an AR dependent manner. Furthermore, SET8 is crucial for the transcription activation of PSA. Co-immunoprecipitation analyses demonstrate that SET8 interacts with AR. Therefore, we conclude that SET8 is involved in AR-mediated transcription activation, possibly through its interaction with AR and H4K20me1 modification.

  6. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    K.M. Hettne (Kristina); J. Boorsma (Jeffrey); D.A.M. van Dartel (Dorien A M); J.J. Goeman (Jelle); E.C. de Jong (Esther); A.H. Piersma (Aldert); R.H. Stierum (Rob); J. Kleinjans (Jos); J.A. Kors (Jan)

    2013-01-01

    textabstractBackground: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with

  7. Machine learning approaches to supporting the identification of photoreceptor-enriched genes based on expression data

    Directory of Open Access Journals (Sweden)

    Simpson David

    2006-03-01

    Full Text Available Abstract Background Retinal photoreceptors are highly specialised cells, which detect light and are central to mammalian vision. Many retinal diseases occur as a result of inherited dysfunction of the rod and cone photoreceptor cells. Development and maintenance of photoreceptors requires appropriate regulation of the many genes specifically or highly expressed in these cells. Over the last decades, different experimental approaches have been developed to identify photoreceptor enriched genes. Recent progress in RNA analysis technology has generated large amounts of gene expression data relevant to retinal development. This paper assesses a machine learning methodology for supporting the identification of photoreceptor enriched genes based on expression data. Results Based on the analysis of publicly-available gene expression data from the developing mouse retina generated by serial analysis of gene expression (SAGE, this paper presents a predictive methodology comprising several in silico models for detecting key complex features and relationships encoded in the data, which may be useful to distinguish genes in terms of their functional roles. In order to understand temporal patterns of photoreceptor gene expression during retinal development, a two-way cluster analysis was firstly performed. By clustering SAGE libraries, a hierarchical tree reflecting relationships between developmental stages was obtained. By clustering SAGE tags, a more comprehensive expression profile for photoreceptor cells was revealed. To demonstrate the usefulness of machine learning-based models in predicting functional associations from the SAGE data, three supervised classification models were compared. The results indicated that a relatively simple instance-based model (KStar model performed significantly better than relatively more complex algorithms, e.g. neural networks. To deal with the problem of functional class imbalance occurring in the dataset, two data re

  8. Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    International Nuclear Information System (INIS)

    Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

    2007-01-01

    Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients

  9. A Bayesian variable selection procedure for ranking overlapping gene sets

    DEFF Research Database (Denmark)

    Skarman, Axel; Mahdi Shariati, Mohammad; Janss, Luc

    2012-01-01

    Background Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been de...

  10. Enriched expression of the ciliopathy gene Ick in cell proliferating regions of adult mice.

    Science.gov (United States)

    Tsutsumi, Ryotaro; Chaya, Taro; Furukawa, Takahisa

    2018-04-07

    Cilia are essential for sensory and motile functions across species. In humans, ciliary dysfunction causes "ciliopathies", which show severe developmental abnormalities in various tissues. Several missense mutations in intestinal cell kinase (ICK) gene lead to endocrine-cerebro-osteodysplasia syndrome or short rib-polydactyly syndrome, lethal recessive developmental ciliopathies. We and others previously reported that Ick-deficient mice exhibit neonatal lethality with developmental defects. Mechanistically, Ick regulates intraflagellar transport and cilia length at ciliary tips. Although Ick plays important roles during mammalian development, roles of Ick at the adult stage are poorly understood. In the current study, we investigated the Ick gene expression in adult mouse tissues. RT-PCR analysis showed that Ick is ubiquitously expressed, with enrichment in the retina, brain, lung, intestine, and reproductive system. In the adult brain, we found that Ick expression is enriched in the walls of the lateral ventricle, in the rostral migratory stream of the olfactory bulb, and in the subgranular zone of the hippocampal dentate gyrus by in situ hybridization analysis. We also observed that Ick staining pattern is similar to pachytene spermatocyte to spermatid markers in the mature testis and to an intestinal stem cell marker in the adult small intestine. These results suggest that Ick is expressed in proliferating regions in the adult mouse brain, testis, and intestine. Copyright © 2018 Elsevier B.V. All rights reserved.

  11. Identification of key pathways and genes influencing prognosis in bladder urothelial carcinoma

    Directory of Open Access Journals (Sweden)

    Ning X

    2017-03-01

    Full Text Available Xin Ning, Yaoliang Deng Department of Urology, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Province, People’s Republic of China Background: Genomic profiling can be used to identify the predictive effect of genomic subsets for determining prognosis in bladder urothelial carcinoma (BUC after radical cystectomy. This study aimed to investigate potential gene and pathway markers associated with prognosis in BUC.Methods: A microarray dataset of BUC was obtained from The Cancer Genome Atlas database. Differentially expressed genes (DEGs were identified by DESeq of the R platform. Kaplan–Meier analysis was applied for prognostic markers. Key pathways and genes were identified using bioinformatics tools, such as gene set enrichment analysis, gene ontology, the Kyoto Encyclopedia of Genes and Genomes, gene multiple association network integration algorithm (GeneMANIA, Search Tool for the Retrieval of Interacting Genes/Proteins, and Molecular Complex Detection.Results: A comparative gene set enrichment analysis of tumor and adjacent normal tissues suggested BUC tumorigenesis resulted mainly from enrichment of cell cycle and DNA damage and repair-related biological processes and pathways, including TP53 and mitotic recombination. Two hundred and fifty-six genes were identified as potential prognosis-related DEGs. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analyses showed that the potential prognosis-related DEGs were enriched in angiogenesis, including the cyclic adenosine monophosphate biosynthetic process, cyclic guanosine monophosphate-protein kinase G, mitogen-activated protein kinase, Rap1, and phosphoinositide-3-kinase-AKT signaling pathway. Nine hub genes, TAGLN, ACTA2, MYH11, CALD1, MYLK, GEM, PRELP, TPM2, and OGN, were identified from the intersection of protein–protein interaction and GeneMANIA networks. Module analysis of protein–protein interaction and GeneMANIA networks mainly showed

  12. Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.

    Science.gov (United States)

    Springer, Mark S; Gatesy, John

    2018-02-26

    coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset-the 'recombination ratchet'-is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d'etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful

  13. Microarray analysis identifies a common set of cellular genes modulated by different HCV replicon clones

    Directory of Open Access Journals (Sweden)

    Gerosolimo Germano

    2008-06-01

    Full Text Available Abstract Background Hepatitis C virus (HCV RNA synthesis and protein expression affect cell homeostasis by modulation of gene expression. The impact of HCV replication on global cell transcription has not been fully evaluated. Thus, we analysed the expression profiles of different clones of human hepatoma-derived Huh-7 cells carrying a self-replicating HCV RNA which express all viral proteins (HCV replicon system. Results First, we compared the expression profile of HCV replicon clone 21-5 with both the Huh-7 parental cells and the 21-5 cured (21-5c cells. In these latter, the HCV RNA has been eliminated by IFN-α treatment. To confirm data, we also analyzed microarray results from both the 21-5 and two other HCV replicon clones, 22-6 and 21-7, compared to the Huh-7 cells. The study was carried out by using the Applied Biosystems (AB Human Genome Survey Microarray v1.0 which provides 31,700 probes that correspond to 27,868 human genes. Microarray analysis revealed a specific transcriptional program induced by HCV in replicon cells respect to both IFN-α-cured and Huh-7 cells. From the original datasets of differentially expressed genes, we selected by Venn diagrams a final list of 38 genes modulated by HCV in all clones. Most of the 38 genes have never been described before and showed high fold-change associated with significant p-value, strongly supporting data reliability. Classification of the 38 genes by Panther System identified functional categories that were significantly enriched in this gene set, such as histones and ribosomal proteins as well as extracellular matrix and intracellular protein traffic. The dataset also included new genes involved in lipid metabolism, extracellular matrix and cytoskeletal network, which may be critical for HCV replication and pathogenesis. Conclusion Our data provide a comprehensive analysis of alterations in gene expression induced by HCV replication and reveal modulation of new genes potentially useful

  14. Enriched pathways for major depressive disorder identified from a genome-wide association study.

    Science.gov (United States)

    Kao, Chung-Feng; Jia, Peilin; Zhao, Zhongming; Kuo, Po-Hsiu

    2012-11-01

    Major depressive disorder (MDD) has caused a substantial burden of disease worldwide with moderate heritability. Despite efforts through conducting numerous association studies and now, genome-wide association (GWA) studies, the success of identifying susceptibility loci for MDD has been limited, which is partially attributed to the complex nature of depression pathogenesis. A pathway-based analytic strategy to investigate the joint effects of various genes within specific biological pathways has emerged as a powerful tool for complex traits. The present study aimed to identify enriched pathways for depression using a GWA dataset for MDD. For each gene, we estimated its gene-wise p value using combined and minimum p value, separately. Canonical pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) and BioCarta were used. We employed four pathway-based analytic approaches (gene set enrichment analysis, hypergeometric test, sum-square statistic, sum-statistic). We adjusted for multiple testing using Benjamini & Hochberg's method to report significant pathways. We found 17 significantly enriched pathways for depression, which presented low-to-intermediate crosstalk. The top four pathways were long-term depression (p⩽1×10-5), calcium signalling (p⩽6×10-5), arrhythmogenic right ventricular cardiomyopathy (p⩽1.6×10-4) and cell adhesion molecules (p⩽2.2×10-4). In conclusion, our comprehensive pathway analyses identified promising pathways for depression that are related to neurotransmitter and neuronal systems, immune system and inflammatory response, which may be involved in the pathophysiological mechanisms underlying depression. We demonstrated that pathway enrichment analysis is promising to facilitate our understanding of complex traits through a deeper interpretation of GWA data. Application of this comprehensive analytic strategy in upcoming GWA data for depression could validate the findings reported in this study.

  15. Direct cloning from enrichment cultures, a reliable strategy for isolation of complete operons and genes from microbial consortia.

    Science.gov (United States)

    Entcheva, P; Liebl, W; Johann, A; Hartsch, T; Streit, W R

    2001-01-01

    Enrichment cultures of microbial consortia enable the diverse metabolic and catabolic activities of these populations to be studied on a molecular level and to be explored as potential sources for biotechnology processes. We have used a combined approach of enrichment culture and direct cloning to construct cosmid libraries with large (>30-kb) inserts from microbial consortia. Enrichment cultures were inoculated with samples from five environments, and high amounts of avidin were added to the cultures to favor growth of biotin-producing microbes. DNA was extracted from three of these enrichment cultures and used to construct cosmid libraries; each library consisted of between 6,000 and 35,000 clones, with an average insert size of 30 to 40 kb. The inserts contained a diverse population of genomic DNA fragments isolated from the consortia organisms. These three libraries were used to complement the Escherichia coli biotin auxotrophic strain ATCC 33767 Delta(bio-uvrB). Initial screens resulted in the isolation of seven different complementing cosmid clones, carrying biotin biosynthesis operons. Biotin biosynthesis capabilities and growth under defined conditions of four of these clones were studied. Biotin measured in the different culture supernatants ranged from 42 to 3,800 pg/ml/optical density unit. Sequencing the identified biotin synthesis genes revealed high similarities to bio operons from gram-negative bacteria. In addition, random sequencing identified other interesting open reading frames, as well as two operons, the histidine utilization operon (hut), and the cluster of genes involved in biosynthesis of molybdopterin cofactors in bacteria (moaABCDE).

  16. Gene expression profiles of lung adenocarcinoma linked to histopathological grading and survival but not to EGF-R status: a microarray study

    Directory of Open Access Journals (Sweden)

    Passlick Bernward

    2010-03-01

    Full Text Available Abstract Background Several different gene expression signatures have been proposed to predict response to therapy and clinical outcome in lung adenocarcinoma. Herein, we investigate if elements of published gene sets can be reproduced in a small dataset, and how gene expression profiles based on limited sample size relate to clinical parameters including histopathological grade and EGFR protein expression. Methods Affymetrix Human Genome U133A platform was used to obtain gene expression profiles of 28 pathologically and clinically annotated adenocarcinomas of the lung. EGFR status was determined by fluorescent in situ hybridization and immunohistochemistry. Results Using unsupervised clustering algorithms, the predominant gene expression signatures correlated with the histopathological grade but not with EGFR protein expression as detected by immunohistochemistry. In a supervised analysis, the signature of high grade tumors but not of EGFR overexpressing cases showed significant enrichment of gene sets reflecting MAPK activation and other potential signaling cascades downstream of EGFR. Out of four different previously published gene sets that had been linked to prognosis, three showed enrichment in the gene expression signature associated with favorable prognosis. Conclusions In this dataset, histopathological tumor grades but not EGFR status were associated with dominant gene expression signatures and gene set enrichment reflecting oncogenic pathway activation, suggesting that high immunohistochemistry EGFR scores may not necessarily be linked to downstream effects that cause major changes in gene expression patterns. Published gene sets showed association with patient survival; however, the small sample size of this study limited the options for a comprehensive validation of previously reported prognostic gene expression signatures.

  17. Gene set analysis for interpreting genetic studies

    DEFF Research Database (Denmark)

    Pers, Tune H

    2016-01-01

    Interpretation of genome-wide association study (GWAS) results is lacking behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...

  18. Improving Gene Therapy Efficiency through the Enrichment of Human Hematopoietic Stem Cells.

    Science.gov (United States)

    Masiuk, Katelyn E; Brown, Devin; Laborada, Jennifer; Hollis, Roger P; Urbinati, Fabrizia; Kohn, Donald B

    2017-09-06

    Lentiviral vector (LV)-based hematopoietic stem cell (HSC) gene therapy is becoming a promising clinical strategy for the treatment of genetic blood diseases. However, the current approach of modifying 1 × 10 8 to 1 × 10 9 CD34 + cells per patient requires large amounts of LV, which is expensive and technically challenging to produce at clinical scale. Modification of bulk CD34 + cells uses LV inefficiently, because the majority of CD34 + cells are short-term progenitors with a limited post-transplant lifespan. Here, we utilized a clinically relevant, immunomagnetic bead (IB)-based method to purify CD34 + CD38 - cells from human bone marrow (BM) and mobilized peripheral blood (mPB). IB purification of CD34 + CD38 - cells enriched severe combined immune deficiency (SCID) repopulating cell (SRC) frequency an additional 12-fold beyond standard CD34 + purification and did not affect gene marking of long-term HSCs. Transplant of purified CD34 + CD38 - cells led to delayed myeloid reconstitution, which could be rescued by the addition of non-transduced CD38 + cells. Importantly, LV modification and transplantation of IB-purified CD34 + CD38 - cells/non-modified CD38 + cells into immune-deficient mice achieved long-term gene-marked engraftment comparable with modification of bulk CD34 + cells, while utilizing ∼7-fold less LV. Thus, we demonstrate a translatable method to improve the clinical and commercial viability of gene therapy for genetic blood cell diseases. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  19. IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis.

    Science.gov (United States)

    Zhang, Fan; Drabier, Renee

    2012-01-01

    Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, and there is an urgent need for a better way to accurately interpret and distill such large amounts of data. Extensive pathway and network analysis allow for the discovery of highly significant pathways from a set of disease vs. healthy samples in the NGS and GWAS. Knowledge of activation of these processes will lead to elucidation of the complex biological pathways affected by drug treatment, to patient stratification studies of new and existing drug treatments, and to understanding the underlying anti-cancer drug effects. There are approximately 141 biological human pathway resources as of Jan 2012 according to the Pathguide database. However, most currently available resources do not contain disease, drug or organ specificity information such as disease-pathway, drug-pathway, and organ-pathway associations. Systematically integrating pathway, disease, drug and organ specificity together becomes increasingly crucial for understanding the interrelationships between signaling, metabolic and regulatory pathway, drug action, disease susceptibility, and organ specificity from high-throughput omics data (genomics, transcriptomics, proteomics and metabolomics). We designed the Integrated Pathway Analysis Database for Systematic Enrichment Analysis (IPAD, http://bioinfo.hsc.unt.edu/ipad), defining inter-association between pathway, disease, drug and organ specificity, based on six criteria: 1) comprehensive pathway coverage; 2) gene/protein to pathway/disease/drug/organ association; 3) inter-association between pathway, disease, drug, and organ; 4) multiple and quantitative measurement of enrichment and inter-association; 5) assessment of enrichment and inter-association analysis with the context of the existing biological knowledge and a "gold standard" constructed from reputable and reliable sources; and 6) cross-linking of

  20. Dimethylated H3K27 Is a Repressive Epigenetic Histone Mark in the Protist Entamoeba histolytica and Is Significantly Enriched in Genes Silenced via the RNAi Pathway*

    Science.gov (United States)

    Foda, Bardees M.; Singh, Upinder

    2015-01-01

    RNA interference (RNAi) is a fundamental biological process that plays a crucial role in regulation of gene expression in many organisms. Transcriptional gene silencing (TGS) is one of the important nuclear roles of RNAi. Our previous data show that Entamoeba histolytica has a robust RNAi pathway that links to TGS via Argonaute 2-2 (Ago2-2) associated 27-nucleotide small RNAs with 5′-polyphosphate termini. Here, we report the first repressive histone mark to be identified in E. histolytica, dimethylation of H3K27 (H3K27Me2), and demonstrate that it is enriched at genes that are silenced by RNAi-mediated TGS. An RNAi-silencing trigger can induce H3K27Me2 deposits at both episomal and chromosomal loci, mediating gene silencing. Our data support two phases of RNAi-mediated TGS: an active silencing phase where the RNAi trigger is present and both H3K27Me2 and Ago2-2 concurrently enrich at chromosomal loci; and an established silencing phase in which the RNAi trigger is removed, but gene silencing with H3K27Me2 enrichment persist independently of Ago2-2 deposition. Importantly, some genes display resistance to chromosomal silencing despite induction of functional small RNAs. In those situations, the RNAi-triggering plasmid that is maintained episomally gets partially silenced and has H3K27Me2 enrichment, but the chromosomal copy displays no repressive histone enrichment. Our data are consistent with a model in which H3K27Me2 is a repressive histone modification, which is strongly associated with transcriptional repression. This is the first example of an epigenetic histone modification that functions to mediate RNAi-mediated TGS in the deep-branching eukaryote E. histolytica. PMID:26149683

  1. Dimethylated H3K27 Is a Repressive Epigenetic Histone Mark in the Protist Entamoeba histolytica and Is Significantly Enriched in Genes Silenced via the RNAi Pathway.

    Science.gov (United States)

    Foda, Bardees M; Singh, Upinder

    2015-08-21

    RNA interference (RNAi) is a fundamental biological process that plays a crucial role in regulation of gene expression in many organisms. Transcriptional gene silencing (TGS) is one of the important nuclear roles of RNAi. Our previous data show that Entamoeba histolytica has a robust RNAi pathway that links to TGS via Argonaute 2-2 (Ago2-2) associated 27-nucleotide small RNAs with 5'-polyphosphate termini. Here, we report the first repressive histone mark to be identified in E. histolytica, dimethylation of H3K27 (H3K27Me2), and demonstrate that it is enriched at genes that are silenced by RNAi-mediated TGS. An RNAi-silencing trigger can induce H3K27Me2 deposits at both episomal and chromosomal loci, mediating gene silencing. Our data support two phases of RNAi-mediated TGS: an active silencing phase where the RNAi trigger is present and both H3K27Me2 and Ago2-2 concurrently enrich at chromosomal loci; and an established silencing phase in which the RNAi trigger is removed, but gene silencing with H3K27Me2 enrichment persist independently of Ago2-2 deposition. Importantly, some genes display resistance to chromosomal silencing despite induction of functional small RNAs. In those situations, the RNAi-triggering plasmid that is maintained episomally gets partially silenced and has H3K27Me2 enrichment, but the chromosomal copy displays no repressive histone enrichment. Our data are consistent with a model in which H3K27Me2 is a repressive histone modification, which is strongly associated with transcriptional repression. This is the first example of an epigenetic histone modification that functions to mediate RNAi-mediated TGS in the deep-branching eukaryote E. histolytica. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  2. Genome-wide survey and developmental expression mapping of zebrafish SET domain-containing genes.

    Directory of Open Access Journals (Sweden)

    Xiao-Jian Sun

    Full Text Available SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development.

  3. BiNChE: a web tool and library for chemical enrichment analysis based on the ChEBI ontology.

    Science.gov (United States)

    Moreno, Pablo; Beisken, Stephan; Harsha, Bhavana; Muthukrishnan, Venkatesh; Tudose, Ilinca; Dekker, Adriano; Dornfeldt, Stefanie; Taruttis, Franziska; Grosse, Ivo; Hastings, Janna; Neumann, Steffen; Steinbeck, Christoph

    2015-02-21

    Ontology-based enrichment analysis aids in the interpretation and understanding of large-scale biological data. Ontologies are hierarchies of biologically relevant groupings. Using ontology annotations, which link ontology classes to biological entities, enrichment analysis methods assess whether there is a significant over or under representation of entities for ontology classes. While many tools exist that run enrichment analysis for protein sets annotated with the Gene Ontology, there are only a few that can be used for small molecules enrichment analysis. We describe BiNChE, an enrichment analysis tool for small molecules based on the ChEBI Ontology. BiNChE displays an interactive graph that can be exported as a high-resolution image or in network formats. The tool provides plain, weighted and fragment analysis based on either the ChEBI Role Ontology or the ChEBI Structural Ontology. BiNChE aids in the exploration of large sets of small molecules produced within Metabolomics or other Systems Biology research contexts. The open-source tool provides easy and highly interactive web access to enrichment analysis with the ChEBI ontology tool and is additionally available as a standalone library.

  4. Metagenomic survey of methanesulfonic acid (MSA catabolic genes in an Atlantic Ocean surface water sample and in a partial enrichment

    Directory of Open Access Journals (Sweden)

    Ana C. Henriques

    2016-10-01

    Full Text Available Methanesulfonic acid (MSA is a relevant intermediate of the biogeochemical cycle of sulfur and environmental microorganisms assume an important role in the mineralization of this compound. Several methylotrophic bacterial strains able to grow on MSA have been isolated from soil or marine water and two conserved operons, msmABCD coding for MSA monooxygenase and msmEFGH coding for a transport system, have been repeatedly encountered in most of these strains. Homologous sequences have also been amplified directly from the environment or observed in marine metagenomic data, but these showed a base composition (G + C content very different from their counterparts from cultivated bacteria. The aim of this study was to understand which microorganisms within the coastal surface oceanic microflora responded to MSA as a nutrient and how the community evolved in the early phases of an enrichment by means of metagenome and gene-targeted amplicon sequencing. From the phylogenetic point of view, the community shifted significantly with the disappearance of all signals related to the Archaea, the Pelagibacteraceae and phylum SAR406, and the increase in methylotroph-harboring taxa, accompanied by other groups so far not known to comprise methylotrophs such as the Hyphomonadaceae. At the functional level, the abundance of several genes related to sulfur metabolism and methylotrophy increased during the enrichment and the allelic distribution of gene msmA diagnostic for MSA monooxygenase altered considerably. Even more dramatic was the disappearance of MSA import-related gene msmE, which suggests that alternative transporters must be present in the enriched community and illustrate the inadequacy of msmE as an ecofunctional marker for MSA degradation at sea.

  5. The Candidate Cancer Gene Database: a database of cancer driver genes from forward genetic screens in mice.

    Science.gov (United States)

    Abbott, Kenneth L; Nyre, Erik T; Abrahante, Juan; Ho, Yen-Yi; Isaksson Vogel, Rachel; Starr, Timothy K

    2015-01-01

    Identification of cancer driver gene mutations is crucial for advancing cancer therapeutics. Due to the overwhelming number of passenger mutations in the human tumor genome, it is difficult to pinpoint causative driver genes. Using transposon mutagenesis in mice many laboratories have conducted forward genetic screens and identified thousands of candidate driver genes that are highly relevant to human cancer. Unfortunately, this information is difficult to access and utilize because it is scattered across multiple publications using different mouse genome builds and strength metrics. To improve access to these findings and facilitate meta-analyses, we developed the Candidate Cancer Gene Database (CCGD, http://ccgd-starrlab.oit.umn.edu/). The CCGD is a manually curated database containing a unified description of all identified candidate driver genes and the genomic location of transposon common insertion sites (CISs) from all currently published transposon-based screens. To demonstrate relevance to human cancer, we performed a modified gene set enrichment analysis using KEGG pathways and show that human cancer pathways are highly enriched in the database. We also used hierarchical clustering to identify pathways enriched in blood cancers compared to solid cancers. The CCGD is a novel resource available to scientists interested in the identification of genetic drivers of cancer. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

    Science.gov (United States)

    Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

    2005-09-01

    We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.

  7. Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

    Directory of Open Access Journals (Sweden)

    Nilotpal Chowdhury

    Full Text Available Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis.The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets.Four microarray series (having 742 patients were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA.Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed.To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and

  8. Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

    Science.gov (United States)

    Chowdhury, Nilotpal; Sapru, Shantanu

    2015-01-01

    Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting

  9. Transcriptome and Gene Ontology (GO) Enrichment Analysis Reveals Genes Involved in Biotin Metabolism That Affect L-Lysine Production in Corynebacterium glutamicum.

    Science.gov (United States)

    Kim, Hong-Il; Kim, Jong-Hyeon; Park, Young-Jin

    2016-03-09

    Corynebacterium glutamicum is widely used for amino acid production. In the present study, 543 genes showed a significant change in their mRNA expression levels in L-lysine-producing C. glutamicum ATCC21300 than that in the wild-type C. glutamicum ATCC13032. Among these 543 differentially expressed genes (DEGs), 28 genes were up- or downregulated. In addition, 454 DEGs were functionally enriched and categorized based on BLAST sequence homologies and gene ontology (GO) annotations using the Blast2GO software. Interestingly, NCgl0071 (bioB, encoding biotin synthase) was expressed at levels ~20-fold higher in the L-lysine-producing ATCC21300 strain than that in the wild-type ATCC13032 strain. Five other genes involved in biotin metabolism or transport--NCgl2515 (bioA, encoding adenosylmethionine-8-amino-7-oxononanoate aminotransferase), NCgl2516 (bioD, encoding dithiobiotin synthetase), NCgl1883, NCgl1884, and NCgl1885--were also expressed at significantly higher levels in the L-lysine-producing ATCC21300 strain than that in the wild-type ATCC13032 strain, which we determined using both next-generation RNA sequencing and quantitative real-time PCR analysis. When we disrupted the bioB gene in C. glutamicum ATCC21300, L-lysine production decreased by approximately 76%, and the three genes involved in biotin transport (NCgl1883, NCgl1884, and NCgl1885) were significantly downregulated. These results will be helpful to improve our understanding of C. glutamicum for industrial amino acid production.

  10. New generation enrichment monitoring technology for gas centrifuge enrichment plants

    International Nuclear Information System (INIS)

    Ianakiev, Kiril D.; Alexandrov, Boian S.; Boyer, Brian D.; Hill, Thomas R.; Macarthur, Duncan W.; Marks, Thomas; Moss, Calvin E.; Sheppard, Gregory A.; Swinhoe, Martyn T.

    2008-01-01

    The continuous enrichment monitor, developed and fielded in the 1990s by the International Atomic Energy Agency, provided a go-no-go capability to distinguish between UF 6 containing low enriched (approximately 4% 235 U) and highly enriched (above 20% 235 U) uranium. This instrument used the 22-keV line from a 109 Cd source as a transmission source to achieve a high sensitivity to the UF 6 gas absorption. The 1.27-yr half-life required that the source be periodically replaced and the instrument recalibrated. The instrument's functionality and accuracy were limited by the fact that measured gas density and gas pressure were treated as confidential facility information. The modern safeguarding of a gas centrifuge enrichment plant producing low-enriched UF 6 product aims toward a more quantitative flow and enrichment monitoring concept that sets new standards for accuracy stability, and confidence. An instrument must be accurate enough to detect the diversion of a significant quantity of material, have virtually zero false alarms, and protect the operator's proprietary process information. We discuss a new concept for advanced gas enrichment assay measurement technology. This design concept eliminates the need for the periodic replacement of a radioactive source as well as the need for maintenance by experts. Some initial experimental results will be presented.

  11. Optimal structural inference of signaling pathways from unordered and overlapping gene sets.

    Science.gov (United States)

    Acharya, Lipi R; Judeh, Thair; Wang, Guangdi; Zhu, Dongxiao

    2012-02-15

    A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a 'search and score' network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and

  12. AnGeLi: A Tool for the Analysis of Gene Lists from Fission Yeast

    Directory of Open Access Journals (Sweden)

    Danny A Bitton

    2015-11-01

    Full Text Available Genome-wide assays and screens typically result in large lists of genes or proteins. Enrichments of functional or other biological properties within such lists can provide valuable insights and testable hypotheses. To systematically detect these enrichments can be challenging and time-consuming, because relevant data to compare against query gene lists are spread over many different sources. We have developed AnGeLi (Analysis of Gene Lists, an intuitive, integrated web-tool for comprehensive and customized interrogation of gene lists from the fission yeast, Schizosaccharomyces pombe. AnGeLi searches for significant enrichments among multiple qualitative and quantitative information sources, including gene and phenotype ontologies, genetic and protein interactions, numerous features of genes, transcripts, translation, and proteins such as copy numbers, chromosomal positions, genetic diversity, RNA polymerase II and ribosome occupancy, localization, conservation, half-lives, domains and molecular weight among others, as well as diverse sets of genes that are co-regulated or lead to the same phenotypes when mutated. AnGeLi uses robust statistics which can be tailored to specific needs. It also provides the option to upload user-defined gene sets to compare against the query list. Through an integrated data submission form, AnGeLi encourages the community to contribute additional curated gene lists to further increase the usefulness of this resource and to get the most from the ever increasing large-scale experiments. AnGeLi offers a rigorous yet flexible statistical analysis platform for rich insights into functional enrichments and biological context for query gene lists, thus providing a powerful exploratory tool through which S. pombe researchers can uncover fresh perspectives and unexpected connections from genomic data. AnGeLi is freely available at: www.bahlerlab.info/AnGeLi

  13. Mechanism-based biomarker gene sets for glutathione depletion-related hepatotoxicity in rats

    International Nuclear Information System (INIS)

    Gao Weihua; Mizukawa, Yumiko; Nakatsu, Noriyuki; Minowa, Yosuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

    2010-01-01

    Chemical-induced glutathione depletion is thought to be caused by two types of toxicological mechanisms: PHO-type glutathione depletion [glutathione conjugated with chemicals such as phorone (PHO) or diethyl maleate (DEM)], and BSO-type glutathione depletion [i.e., glutathione synthesis inhibited by chemicals such as L-buthionine-sulfoximine (BSO)]. In order to identify mechanism-based biomarker gene sets for glutathione depletion in rat liver, male SD rats were treated with various chemicals including PHO (40, 120 and 400 mg/kg), DEM (80, 240 and 800 mg/kg), BSO (150, 450 and 1500 mg/kg), and bromobenzene (BBZ, 10, 100 and 300 mg/kg). Liver samples were taken 3, 6, 9 and 24 h after administration and examined for hepatic glutathione content, physiological and pathological changes, and gene expression changes using Affymetrix GeneChip Arrays. To identify differentially expressed probe sets in response to glutathione depletion, we focused on the following two courses of events for the two types of mechanisms of glutathione depletion: a) gene expression changes occurring simultaneously in response to glutathione depletion, and b) gene expression changes after glutathione was depleted. The gene expression profiles of the identified probe sets for the two types of glutathione depletion differed markedly at times during and after glutathione depletion, whereas Srxn1 was markedly increased for both types as glutathione was depleted, suggesting that Srxn1 is a key molecule in oxidative stress related to glutathione. The extracted probe sets were refined and verified using various compounds including 13 additional positive or negative compounds, and they established two useful marker sets. One contained three probe sets (Akr7a3, Trib3 and Gstp1) that could detect conjugation-type glutathione depletors any time within 24 h after dosing, and the other contained 14 probe sets that could detect glutathione depletors by any mechanism. These two sets, with appropriate scoring

  14. Dissecting the organ specificity of insecticide resistance candidate genes in Anopheles gambiae: known and novel candidate genes.

    Science.gov (United States)

    Ingham, Victoria A; Jones, Christopher M; Pignatelli, Patricia; Balabanidou, Vasileia; Vontas, John; Wagstaff, Simon C; Moore, Jonathan D; Ranson, Hilary

    2014-11-25

    The elevated expression of enzymes with insecticide metabolism activity can lead to high levels of insecticide resistance in the malaria vector, Anopheles gambiae. In this study, adult female mosquitoes from an insecticide susceptible and resistant strain were dissected into four different body parts. RNA from each of these samples was used in microarray analysis to determine the enrichment patterns of the key detoxification gene families within the mosquito and to identify additional candidate insecticide resistance genes that may have been overlooked in previous experiments on whole organisms. A general enrichment in the transcription of genes from the four major detoxification gene families (carboxylesterases, glutathione transferases, UDP glucornyltransferases and cytochrome P450s) was observed in the midgut and malpighian tubules. Yet the subset of P450 genes that have previously been implicated in insecticide resistance in An gambiae, show a surprisingly varied profile of tissue enrichment, confirmed by qPCR and, for three candidates, by immunostaining. A stringent selection process was used to define a list of 105 genes that are significantly (p ≤0.001) over expressed in body parts from the resistant versus susceptible strain. Over half of these, including all the cytochrome P450s on this list, were identified in previous whole organism comparisons between the strains, but several new candidates were detected, notably from comparisons of the transcriptomes from dissected abdomen integuments. The use of RNA extracted from the whole organism to identify candidate insecticide resistance genes has a risk of missing candidates if key genes responsible for the phenotype have restricted expression within the body and/or are over expression only in certain tissues. However, as transcription of genes implicated in metabolic resistance to insecticides is not enriched in any one single organ, comparison of the transcriptome of individual dissected body parts cannot

  15. A stochastic model for identifying differential gene pair co-expression patterns in prostate cancer progression

    Directory of Open Access Journals (Sweden)

    Mao Yu

    2009-07-01

    Full Text Available Abstract Background The identification of gene differential co-expression patterns between cancer stages is a newly developing method to reveal the underlying molecular mechanisms of carcinogenesis. Most researches of this subject lack an algorithm useful for performing a statistical significance assessment involving cancer progression. Lacking this specific algorithm is apparently absent in identifying precise gene pairs correlating to cancer progression. Results In this investigation we studied gene pair co-expression change by using a stochastic process model for approximating the underlying dynamic procedure of the co-expression change during cancer progression. Also, we presented a novel analytical method named 'Stochastic process model for Identifying differentially co-expressed Gene pair' (SIG method. This method has been applied to two well known prostate cancer data sets: hormone sensitive versus hormone resistant, and healthy versus cancerous. From these data sets, 428,582 gene pairs and 303,992 gene pairs were identified respectively. Afterwards, we used two different current statistical methods to the same data sets, which were developed to identify gene pair differential co-expression and did not consider cancer progression in algorithm. We then compared these results from three different perspectives: progression analysis, gene pair identification effectiveness analysis, and pathway enrichment analysis. Statistical methods were used to quantify the quality and performance of these different perspectives. They included: Re-identification Scale (RS and Progression Score (PS in progression analysis, True Positive Rate (TPR in gene pair analysis, and Pathway Enrichment Score (PES in pathway analysis. Our results show small values of RS and large values of PS, TPR, and PES; thus, suggesting that gene pairs identified by the SIG method are highly correlated with cancer progression, and highly enriched in disease-specific pathways. From

  16. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder

    DEFF Research Database (Denmark)

    Naaijen, Jill; Bralten, Janita; Poelmans, Geert

    2017-01-01

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance...... within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms...... is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants...

  17. Robust de novo pathway enrichment with KeyPathwayMiner 5 [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Nicolas Alcaraz

    2016-06-01

    Full Text Available Identifying functional modules or novel active pathways, recently termed de novo pathway enrichment, is a computational systems biology challenge that has gained much attention during the last decade. Given a large biological interaction network, KeyPathwayMiner extracts connected subnetworks that are enriched for differentially active entities from a series of molecular profiles encoded as binary indicator matrices. Since interaction networks constantly evolve, an important question is how robust the extracted results are when the network is modified. We enable users to study this effect through several network perturbation techniques and over a range of perturbation degrees. In addition, users may now provide a gold-standard set to determine how enriched extracted pathways are with relevant genes compared to randomized versions of the original network.

  18. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  19. Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.

    Science.gov (United States)

    Auerbach, Raymond K; Chen, Bin; Butte, Atul J

    2013-08-01

    Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. abutte@stanford.edu Supplementary material is available at Bioinformatics online.

  20. Can survival prediction be improved by merging gene expression data sets?

    Directory of Open Access Journals (Sweden)

    Haleh Yasrebi

    Full Text Available BACKGROUND: High-throughput gene expression profiling technologies generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC and hazard ratio as performance measures, we see in overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a The diversity of microarray platforms used. (b The heterogeneity of patients cohorts. (c The heterogeneity of breast cancer disease. (d Substantial variation of time to death or relapse. (e The reduced number of genes in the merged data

  1. Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

    Science.gov (United States)

    He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

    2017-01-01

    The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF

  2. CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

    Science.gov (United States)

    Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

    2018-03-01

    In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and

  3. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  4. Approaching the axiomatic enrichment of the Gene Ontology from a lexical perspective.

    Science.gov (United States)

    Quesada-Martínez, Manuel; Mikroyannidi, Eleni; Fernández-Breis, Jesualdo Tomás; Stevens, Robert

    2015-09-01

    The main goal of this work is to measure how lexical regularities in biomedical ontology labels can be used for the automatic creation of formal relationships between classes, and to evaluate the results of applying our approach to the Gene Ontology (GO). In recent years, we have developed a method for the lexical analysis of regularities in biomedical ontology labels, and we showed that the labels can present a high degree of regularity. In this work, we extend our method with a cross-products extension (CPE) metric, which estimates the potential interest of a specific regularity for axiomatic enrichment in the lexical analysis, using information on exact matches in external ontologies. The GO consortium recently enriched the GO by using so-called cross-product extensions. Cross-products are generated by establishing axioms that relate a given GO class with classes from the GO or other biomedical ontologies. We apply our method to the GO and study how its lexical analysis can identify and reconstruct the cross-products that are defined by the GO consortium. The label of the classes of the GO are highly regular in lexical terms, and the exact matches with labels of external ontologies affect 80% of the GO classes. The CPE metric reveals that 31.48% of the classes that exhibit regularities have fragments that are classes into two external ontologies that are selected for our experiment, namely, the Cell Ontology and the Chemical Entities of Biological Interest ontology, and 18.90% of them are fully decomposable into smaller parts. Our results show that the CPE metric permits our method to detect GO cross-product extensions with a mean recall of 62% and a mean precision of 28%. The study is completed with an analysis of false positives to explain this precision value. We think that our results support the claim that our lexical approach can contribute to the axiomatic enrichment of biomedical ontologies and that it can provide new insights into the engineering of

  5. Integrated bioinformatics analysis reveals key candidate genes and pathways in breast cancer.

    Science.gov (United States)

    Wang, Yuzhi; Zhang, Yi; Huang, Qian; Li, Chengwen

    2018-04-19

    Breast cancer (BC) is the leading malignancy in women worldwide, yet relatively little is known about the genes and signaling pathways involved in BC tumorigenesis and progression. The present study aimed to elucidate potential key candidate genes and pathways in BC. Five gene expression profile data sets (GSE22035, GSE3744, GSE5764, GSE21422 and GSE26910) were downloaded from the Gene Expression Omnibus (GEO) database, which included data from 113 tumorous and 38 adjacent non‑tumorous tissue samples. Differentially expressed genes (DEGs) were identified using t‑tests in the limma R package. These DEGs were subsequently investigated by pathway enrichment analysis and a protein‑protein interaction (PPI) network was constructed. The most significant module from the PPI network was selected for pathway enrichment analysis. In total, 227 DEGs were identified, of which 82 were upregulated and 145 were downregulated. Pathway enrichment analysis results revealed that the upregulated DEGs were mainly enriched in 'cell division', the 'proteinaceous extracellular matrix (ECM)', 'ECM structural constituents' and 'ECM‑receptor interaction', whereas downregulated genes were mainly enriched in 'response to drugs', 'extracellular space', 'transcriptional activator activity' and the 'peroxisome proliferator‑activated receptor signaling pathway'. The PPI network contained 174 nodes and 1,257 edges. DNA topoisomerase 2‑a, baculoviral inhibitor of apoptosis repeat‑containing protein 5, cyclin‑dependent kinase 1, G2/mitotic‑specific cyclin‑B1 and kinetochore protein NDC80 homolog were identified as the top 5 hub genes. Furthermore, the genes in the most significant module were predominantly involved in 'mitotic nuclear division', 'mid‑body', 'protein binding' and 'cell cycle'. In conclusion, the DEGs, relative pathways and hub genes identified in the present study may aid in understanding of the molecular mechanisms underlying BC progression and provide

  6. Enrichment of colorectal cancer associations in functional regions: Insight for using epigenomics data in the analysis of whole genome sequence-imputed GWAS data.

    Directory of Open Access Journals (Sweden)

    Stephanie A Bien

    Full Text Available The evaluation of less frequent genetic variants and their effect on complex disease pose new challenges for genomic research. To investigate whether epigenetic data can be used to inform aggregate rare-variant association methods (RVAM, we assessed whether variants more significantly associated with colorectal cancer (CRC were preferentially located in non-coding regulatory regions, and whether enrichment was specific to colorectal tissues.Active regulatory elements (ARE were mapped using data from 127 tissues and cell-types from NIH Roadmap Epigenomics and Encyclopedia of DNA Elements (ENCODE projects. We investigated whether CRC association p-values were more significant for common variants inside versus outside AREs, or 2 inside colorectal (CR AREs versus AREs of other tissues and cell-types. We employed an integrative epigenomic RVAM for variants with allele frequency <1%. Gene sets were defined as ARE variants within 200 kilobases of a transcription start site (TSS using either CR ARE or ARE from non-digestive tissues. CRC-set association p-values were used to evaluate enrichment of less frequent variant associations in CR ARE versus non-digestive ARE.ARE from 126/127 tissues and cell-types were significantly enriched for stronger CRC-variant associations. Strongest enrichment was observed for digestive tissues and immune cell types. CR-specific ARE were also enriched for stronger CRC-variant associations compared to ARE combined across non-digestive tissues (p-value = 9.6 × 10-4. Additionally, we found enrichment of stronger CRC association p-values for rare variant sets of CR ARE compared to non-digestive ARE (p-value = 0.029.Integrative epigenomic RVAM may enable discovery of less frequent variants associated with CRC, and ARE of digestive and immune tissues are most informative. Although distance-based aggregation of less frequent variants in CR ARE surrounding TSS showed modest enrichment, future association studies would likely

  7. BRAIN NETWORKS. Correlated gene expression supports synchronous activity in brain networks.

    Science.gov (United States)

    Richiardi, Jonas; Altmann, Andre; Milazzo, Anna-Clare; Chang, Catie; Chakravarty, M Mallar; Banaschewski, Tobias; Barker, Gareth J; Bokde, Arun L W; Bromberg, Uli; Büchel, Christian; Conrod, Patricia; Fauth-Bühler, Mira; Flor, Herta; Frouin, Vincent; Gallinat, Jürgen; Garavan, Hugh; Gowland, Penny; Heinz, Andreas; Lemaître, Hervé; Mann, Karl F; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomáš; Pausova, Zdenka; Rietschel, Marcella; Robbins, Trevor W; Smolka, Michael N; Spanagel, Rainer; Ströhle, Andreas; Schumann, Gunter; Hawrylycz, Mike; Poline, Jean-Baptiste; Greicius, Michael D

    2015-06-12

    During rest, brain activity is synchronized between different regions widely distributed throughout the brain, forming functional networks. However, the molecular mechanisms supporting functional connectivity remain undefined. We show that functional brain networks defined with resting-state functional magnetic resonance imaging can be recapitulated by using measures of correlated gene expression in a post mortem brain tissue data set. The set of 136 genes we identify is significantly enriched for ion channels. Polymorphisms in this set of genes significantly affect resting-state functional connectivity in a large sample of healthy adolescents. Expression levels of these genes are also significantly associated with axonal connectivity in the mouse. The results provide convergent, multimodal evidence that resting-state functional networks correlate with the orchestrated activity of dozens of genes linked to ion channel activity and synaptic function. Copyright © 2015, American Association for the Advancement of Science.

  8. Enrichment of human hematopoietic stem/progenitor cells facilitates transduction for stem cell gene therapy.

    Science.gov (United States)

    Baldwin, Kismet; Urbinati, Fabrizia; Romero, Zulema; Campo-Fernandez, Beatriz; Kaufman, Michael L; Cooper, Aaron R; Masiuk, Katelyn; Hollis, Roger P; Kohn, Donald B

    2015-05-01

    Autologous hematopoietic stem cell (HSC) gene therapy for sickle cell disease has the potential to treat this illness without the major immunological complications associated with allogeneic transplantation. However, transduction efficiency by β-globin lentiviral vectors using CD34-enriched cell populations is suboptimal and large vector production batches may be needed for clinical trials. Transducing a cell population more enriched for HSC could greatly reduce vector needs and, potentially, increase transduction efficiency. CD34(+) /CD38(-) cells, comprising ∼1%-3% of all CD34(+) cells, were isolated from healthy cord blood CD34(+) cells by fluorescence-activated cell sorting and transduced with a lentiviral vector expressing an antisickling form of beta-globin (CCL-β(AS3) -FB). Isolated CD34(+) /CD38(-) cells were able to generate progeny over an extended period of long-term culture (LTC) compared to the CD34(+) cells and required up to 40-fold less vector for transduction compared to bulk CD34(+) preparations containing an equivalent number of CD34(+) /CD38(-) cells. Transduction of isolated CD34(+) /CD38(-) cells was comparable to CD34(+) cells measured by quantitative PCR at day 14 with reduced vector needs, and average vector copy/cell remained higher over time for LTC initiated from CD34(+) /38(-) cells. Following in vitro erythroid differentiation, HBBAS3 mRNA expression was similar in cultures derived from CD34(+) /CD38(-) cells or unfractionated CD34(+) cells. In vivo studies showed equivalent engraftment of transduced CD34(+) /CD38(-) cells when transplanted in competition with 100-fold more CD34(+) /CD38(+) cells. This work provides initial evidence for the beneficial effects from isolating human CD34(+) /CD38(-) cells to use significantly less vector and potentially improve transduction for HSC gene therapy. © 2015 AlphaMed Press.

  9. N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

    Science.gov (United States)

    Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

    2017-05-24

    Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.

  10. Genome-Wide Association Studies Suggest Limited Immune Gene Enrichment in Schizophrenia Compared to 5 Autoimmune Diseases

    DEFF Research Database (Denmark)

    Pouget, Jennie G; Gonçalves, Vanessa F; Spain, Sarah L

    2016-01-01

    There has been intense debate over the immunological basis of schizophrenia, and the potential utility of adjunct immunotherapies. The major histocompatibility complex is consistently the most powerful region of association in genome-wide association studies (GWASs) of schizophrenia and has been...... in immune genes contributes to schizophrenia. We show that there is no enrichment of immune loci outside of the MHC region in the largest genetic study of schizophrenia conducted to date, in contrast to 5 diseases of known immune origin. Among 108 regions of the genome previously associated...

  11. Gene set-based module discovery in the breast cancer transcriptome

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2009-02-01

    Full Text Available Abstract Background Although microarray-based studies have revealed global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which is defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods are not applied cancer transcriptome data. Results In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtypes of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2 is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells. Conclusion These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

  12. Relation of addiction genes to hypothalamic gene changes subserving genesis and gratification of a classic instinct, sodium appetite.

    Science.gov (United States)

    Liedtke, Wolfgang B; McKinley, Michael J; Walker, Lesley L; Zhang, Hao; Pfenning, Andreas R; Drago, John; Hochendoner, Sarah J; Hilton, Donald L; Lawrence, Andrew J; Denton, Derek A

    2011-07-26

    Sodium appetite is an instinct that involves avid specific intention. It is elicited by sodium deficiency, stress-evoked adrenocorticotropic hormone (ACTH), and reproduction. Genome-wide microarrays in sodium-deficient mice or after ACTH infusion showed up-regulation of hypothalamic genes, including dopamine- and cAMP-regulated neuronal phosphoprotein 32 kDa (DARPP-32), dopamine receptors-1 and -2, α-2C- adrenoceptor, and striatally enriched protein tyrosine phosphatase (STEP). Both DARPP-32 and neural plasticity regulator activity-regulated cytoskeleton associated protein (ARC) were up-regulated in lateral hypothalamic orexinergic neurons by sodium deficiency. Administration of dopamine D1 (SCH23390) and D2 receptor (raclopride) antagonists reduced gratification of sodium appetite triggered by sodium deficiency. SCH23390 was specific, having no effect on osmotic-induced water drinking, whereas raclopride also reduced water intake. D1 receptor KO mice had normal sodium appetite, indicating compensatory regulation. Appetite was insensitive to SCH23390, confirming the absence of off-target effects. Bilateral microinjection of SCH23390 (100 nM in 200 nL) into rats' lateral hypothalamus greatly reduced sodium appetite. Gene set enrichment analysis in hypothalami of mice with sodium appetite showed significant enrichment of gene sets previously linked to addiction (opiates and cocaine). This finding of concerted gene regulation was attenuated on gratification with perplexingly rapid kinetics of only 10 min, anteceding significant absorption of salt from the gut. Salt appetite and hedonic liking of salt taste have evolved over >100 million y (e.g., being present in Metatheria). Drugs causing pleasure and addiction are comparatively recent and likely reflect usurping of evolutionary ancient systems with high survival value by the gratification of contemporary hedonic indulgences. Our findings outline a molecular logic for instinctive behavior encoded by the brain with

  13. Pros and cons of HaloPlex enrichment in cancer predisposition genetic diagnosis

    Directory of Open Access Journals (Sweden)

    Agnès Collet

    2015-12-01

    Full Text Available Panel sequencing is a practical option in genetic diagnosis. Enrichment and library preparation steps are critical in the diagnostic setting. In order to test the value of HaloPlex technology in diagnosis, we designed a custom oncogenetic panel including 62 genes. The procedure was tested on a training set of 71 controls and then blindly validated on 48 consecutive hereditary breast/ovarian cancer (HBOC patients tested negative for BRCA1/2 mutation. Libraries were sequenced on HiSeq2500 and data were analysed with our academic bioinformatics pipeline. Point mutations were detected using Varscan2, median size indels were detected using Pindel and large genomic rearrangements (LGR were detected by DESeq. Proper coverage was obtained. However, highly variable read depth was observed within genes. Excluding pseudogene analysis, all point mutations were detected on the training set. All indels were also detected using Pindel. On the other hand, DESeq allowed LGR detection but with poor specificity, preventing its use in diagnostics. Mutations were detected in 8% of BRCA1/2-negative HBOC cases. HaloPlex technology appears to be an efficient and promising solution for gene panel diagnostics. Data analysis remains a major challenge and geneticists should enhance their bioinformatics knowledge in order to ensure good quality diagnostic results.

  14. Modulation of microbial consortia enriched from different polluted environments during petroleum biodegradation.

    Science.gov (United States)

    Omrani, Rahma; Spini, Giulia; Puglisi, Edoardo; Saidane, Dalila

    2018-04-01

    Environmental microbial communities are key players in the bioremediation of hydrocarbon pollutants. Here we assessed changes in bacterial abundance and diversity during the degradation of Tunisian Zarzatine oil by four indigenous bacterial consortia enriched from a petroleum station soil, a refinery reservoir soil, a harbor sediment and seawater. The four consortia were found to efficiently degrade up to 92.0% of total petroleum hydrocarbons after 2 months of incubation. Illumina 16S rRNA gene sequencing revealed that the consortia enriched from soil and sediments were dominated by species belonging to Pseudomonas and Acinetobacter genera, while in the seawater-derived consortia Dietzia, Fusobacterium and Mycoplana emerged as dominant genera. We identified a number of species whose relative abundances bloomed from small to high percentages: Dietzia daqingensis in the seawater microcosms, and three OTUs classified as Acinetobacter venetianus in all two soils and sediment derived microcosms. Functional analyses on degrading genes were conducted by comparing PCR results of the degrading genes alkB, ndoB, cat23, xylA and nidA1 with inferences obtained by PICRUSt analysis of 16S amplicon data: the two data sets were partly in agreement and suggest a relationship between the catabolic genes detected and the rate of biodegradation obtained. The work provides detailed insights about the modulation of bacterial communities involved in petroleum biodegradation and can provide useful information for in situ bioremediation of oil-related pollution.

  15. Identification and functional analysis of endothelial tip cell-enriched genes.

    Science.gov (United States)

    del Toro, Raquel; Prahst, Claudia; Mathivet, Thomas; Siegfried, Geraldine; Kaminker, Joshua S; Larrivee, Bruno; Breant, Christiane; Duarte, Antonio; Takakura, Nobuyuki; Fukamizu, Akiyoshi; Penninger, Josef; Eichmann, Anne

    2010-11-11

    Sprouting of developing blood vessels is mediated by specialized motile endothelial cells localized at the tips of growing capillaries. Following behind the tip cells, endothelial stalk cells form the capillary lumen and proliferate. Expression of the Notch ligand Delta-like-4 (Dll4) in tip cells suppresses tip cell fate in neighboring stalk cells via Notch signaling. In DLL4(+/-) mouse mutants, most retinal endothelial cells display morphologic features of tip cells. We hypothesized that these mouse mutants could be used to isolate tip cells and so to determine their genetic repertoire. Using transcriptome analysis of retinal endothelial cells isolated from DLL4(+/-) and wild-type mice, we identified 3 clusters of tip cell-enriched genes, encoding extracellular matrix degrading enzymes, basement membrane components, and secreted molecules. Secreted molecules endothelial-specific molecule 1, angiopoietin 2, and apelin bind to cognate receptors on endothelial stalk cells. Knockout mice and zebrafish morpholino knockdown of apelin showed delayed angiogenesis and reduced proliferation of stalk cells expressing the apelin receptor APJ. Thus, tip cells may regulate angiogenesis via matrix remodeling, production of basement membrane, and release of secreted molecules, some of which regulate stalk cell behavior.

  16. The null hypothesis of GSEA, and a novel statistical model for competitive gene set analysis

    DEFF Research Database (Denmark)

    Debrabant, Birgit

    2017-01-01

    MOTIVATION: Competitive gene set analysis intends to assess whether a specific set of genes is more associated with a trait than the remaining genes. However, the statistical models assumed to date to underly these methods do not enable a clear cut formulation of the competitive null hypothesis....... This is a major handicap to the interpretation of results obtained from a gene set analysis. RESULTS: This work presents a hierarchical statistical model based on the notion of dependence measures, which overcomes this problem. The two levels of the model naturally reflect the modular structure of many gene set...... analysis methods. We apply the model to show that the popular GSEA method, which recently has been claimed to test the self-contained null hypothesis, actually tests the competitive null if the weight parameter is zero. However, for this result to hold strictly, the choice of the dependence measures...

  17. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    Science.gov (United States)

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they

  18. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  19. Genome-wide Anaplasma phagocytophilum AnkA-DNA interactions are enriched in intergenic regions and gene promoters and correlate with infection-induced differential gene expression.

    Directory of Open Access Journals (Sweden)

    J Stephen Dumler

    2016-09-01

    Full Text Available Anaplasma phagocytophilum, an obligate intracellular prokaryote, infects neutrophils and alters cardinal functions via reprogrammed transcription. Large contiguous regions of neutrophil chromosomes are differentially expressed during infection. Secreted A. phagocytophilum effector AnkA transits into the neutrophil or granulocyte nucleus to complex with DNA in heterochromatin across all chromosomes. AnkA binds to gene promoters to dampen cis-transcription and also has features of matrix attachment region (MAR-binding proteins that regulate three-dimensional chromatin architecture and coordinate transcriptional programs encoded in topologically-associated chromatin domains. We hypothesize that identification of additional AnkA binding sites will better delineate how A. phagocytophilum infection results in reprogramming of the neutrophil genome. Using AnkA-binding ChIP-seq, we showed that AnkA binds broadly throughout all chromosomes in a reproducible pattern, especially at: i intergenic regions predicted to be matrix attachment regions (MARs; ii within predicted lamina-associated domains; and iii at promoters ≤3,000 bp upstream of transcriptional start sites. These findings provide genome-wide support for AnkA as a regulator of cis-gene transcription. Moreover, the dominant mark of AnkA in distal intergenic regions known to be AT-enriched, coupled with frequent enrichment in the nuclear lamina, provides strong support for its role as a MAR-binding protein and genome re-organizer. AnkA must be considered a prime candidate to promote neutrophil reprogramming and subsequent functional changes that belie improved microbial fitness and pathogenicity.

  20. Gene and miRNA expression signature of Lewis lung carcinoma LLC1 cells in extracellular matrix enriched microenvironment

    International Nuclear Information System (INIS)

    Stankevicius, Vaidotas; Vasauskas, Gintautas; Bulotiene, Danute; Butkyte, Stase; Jarmalaite, Sonata; Rotomskis, Ricardas; Suziedelis, Kestutis

    2016-01-01

    The extracellular matrix (ECM), one of the key components of tumor microenvironment, has a tremendous impact on cancer development and highly influences tumor cell features. ECM affects vital cellular functions such as cell differentiation, migration, survival and proliferation. Gene and protein expression levels are regulated in cell-ECM interaction dependent manner as well. The rate of unsuccessful clinical trials, based on cell culture research models lacking the ECM microenvironment, indicates the need for alternative models and determines the shift to three-dimensional (3D) laminin rich ECM models, better simulating tissue organization. Recognized advantages of 3D models suggest the development of new anticancer treatment strategies. This is among the most promising directions of 3D cell cultures application. However, detailed analysis at the molecular level of 2D/3D cell cultures and tumors in vivo is still needed to elucidate cellular pathways most promising for the development of targeted therapies. In order to elucidate which biological pathways are altered during microenvironmental shift we have analyzed whole genome mRNA and miRNA expression differences in LLC1 cells cultured in 2D or 3D culture conditions. In our study we used DNA microarrays for whole genome analysis of mRNA and miRNA expression differences in LLC1 cells cultivated in 2D or 3D culture conditions. Next, we indicated the most common enriched functional categories using KEGG pathway enrichment analysis. Finally, we validated the microarray data by quantitative PCR in LLC1 cells cultured under 2D or 3D conditions or LLC1 tumors implanted in experimental animals. Microarray gene expression analysis revealed that 1884 genes and 77 miRNAs were significantly altered in LLC1 cells after 48 h cell growth under 2D and ECM based 3D cell growth conditions. Pathway enrichment results indicated metabolic pathway, MAP kinase, cell adhesion and immune response as the most significantly altered

  1. Risk score modeling of multiple gene to gene interactions using aggregated-multifactor dimensionality reduction

    Directory of Open Access Journals (Sweden)

    Dai Hongying

    2013-01-01

    Full Text Available Abstract Background Multifactor Dimensionality Reduction (MDR has been widely applied to detect gene-gene (GxG interactions associated with complex diseases. Existing MDR methods summarize disease risk by a dichotomous predisposing model (high-risk/low-risk from one optimal GxG interaction, which does not take the accumulated effects from multiple GxG interactions into account. Results We propose an Aggregated-Multifactor Dimensionality Reduction (A-MDR method that exhaustively searches for and detects significant GxG interactions to generate an epistasis enriched gene network. An aggregated epistasis enriched risk score, which takes into account multiple GxG interactions simultaneously, replaces the dichotomous predisposing risk variable and provides higher resolution in the quantification of disease susceptibility. We evaluate this new A-MDR approach in a broad range of simulations. Also, we present the results of an application of the A-MDR method to a data set derived from Juvenile Idiopathic Arthritis patients treated with methotrexate (MTX that revealed several GxG interactions in the folate pathway that were associated with treatment response. The epistasis enriched risk score that pooled information from 82 significant GxG interactions distinguished MTX responders from non-responders with 82% accuracy. Conclusions The proposed A-MDR is innovative in the MDR framework to investigate aggregated effects among GxG interactions. New measures (pOR, pRR and pChi are proposed to detect multiple GxG interactions.

  2. Toxoplasmosis and Polygenic Disease Susceptibility Genes: Extensive Toxoplasma gondii Host/Pathogen Interactome Enrichment in Nine Psychiatric or Neurological Disorders

    Directory of Open Access Journals (Sweden)

    C. J. Carter

    2013-01-01

    Full Text Available Toxoplasma gondii is not only implicated in schizophrenia and related disorders, but also in Alzheimer's or Parkinson's disease, cancer, cardiac myopathies, and autoimmune disorders. During its life cycle, the pathogen interacts with ~3000 host genes or proteins. Susceptibility genes for multiple sclerosis, Alzheimer's disease, schizophrenia, bipolar disorder, depression, childhood obesity, Parkinson's disease, attention deficit hyperactivity disorder (multiple sclerosis, and autism (, but not anorexia or chronic fatigue are highly enriched in the human arm of this interactome and 18 (ADHD to 33% (MS of the susceptibility genes relate to it. The signalling pathways involved in the susceptibility gene/interactome overlaps are relatively specific and relevant to each disease suggesting a means whereby susceptibility genes could orient the attentions of a single pathogen towards disruption of the specific pathways that together contribute (positively or negatively to the endophenotypes of different diseases. Conditional protein knockdown, orchestrated by T. gondii proteins or antibodies binding to those of the host (pathogen derived autoimmunity and metabolite exchange, may contribute to this disruption. Susceptibility genes may thus be related to the causes and influencers of disease, rather than (and as well as to the disease itself.

  3. Cardiac-enriched BAF chromatin-remodeling complex subunit Baf60c regulates gene expression programs essential for heart development and function

    Directory of Open Access Journals (Sweden)

    Xin Sun

    2018-01-01

    Full Text Available How chromatin-remodeling complexes modulate gene networks to control organ-specific properties is not well understood. For example, Baf60c (Smarcd3 encodes a cardiac-enriched subunit of the SWI/SNF-like BAF chromatin complex, but its role in heart development is not fully understood. We found that constitutive loss of Baf60c leads to embryonic cardiac hypoplasia and pronounced cardiac dysfunction. Conditional deletion of Baf60c in cardiomyocytes resulted in postnatal dilated cardiomyopathy with impaired contractile function. Baf60c regulates a gene expression program that includes genes encoding contractile proteins, modulators of sarcomere function, and cardiac metabolic genes. Many of the genes deregulated in Baf60c null embryos are targets of the MEF2/SRF co-factor Myocardin (MYOCD. In a yeast two-hybrid screen, we identified MYOCD as a BAF60c interacting factor; we showed that BAF60c and MYOCD directly and functionally interact. We conclude that Baf60c is essential for coordinating a program of gene expression that regulates the fundamental functional properties of cardiomyocytes.

  4. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues

    International Nuclear Information System (INIS)

    Ye, Mao; Sun, Mingming; Feng, Yanfang; Wan, Jinzhong; Xie, Shanni; Tian, Da; Zhao, Yu; Wu, Jun; Hu, Feng; Li, Huixin; Jiang, Xin

    2016-01-01

    Highlights: • Biochar can prevent soil sulfonamides from accumulating in lettuce tissues. • ARB enrichment in lettuce tissues decreased significantly after biochar amendment. • Impedance effect of biochar addition on soil ARGs was also quite effective. • Biochar application can be a practical strategy to protect vegetable safety. - Abstract: Considering the potential threat of vegetables growing in antibiotic-polluted soil with high abundance of antibiotic-resistant genes (ARGs) against human health through the food chain, it is thus urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivation with biochar amendment exhibited the greatest soil SA dissipation as well as the significant improvement of lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA resistant bacteria nor sul genes were detected in new leaves. It was the first study to demonstrate that biochar amendment can be a practical strategy to protect lettuce safety growing in SA-polluted soil with rich ARB and ARGs.

  5. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues

    Energy Technology Data Exchange (ETDEWEB)

    Ye, Mao [State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008 (China); Sun, Mingming [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Feng, Yanfang, E-mail: fengyanfang@163.com [Institute of Agricultural Resources and Environment, Jiangsu Academy of Agricultural Sciences, Nanjing 210014 (China); Wan, Jinzhong [Nanjing Institute of Environmental Science, Ministry of Environmental Protection of China, Nanjing 210042 (China); Xie, Shanni; Tian, Da [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Zhao, Yu [Collaborative Innovation Center of Advanced Microstructures, Jiangsu Provincial Key Laboratory of Photonic and Electronic Materials, School of Electronic Science and Engineering, Nanjing University, Nanjing 210093 (China); Wu, Jun; Hu, Feng; Li, Huixin [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Jiang, Xin, E-mail: Jiangxin@issas.ac.cn [State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008 (China)

    2016-05-15

    Highlights: • Biochar can prevent soil sulfonamides from accumulating in lettuce tissues. • ARB enrichment in lettuce tissues decreased significantly after biochar amendment. • Impedance effect of biochar addition on soil ARGs was also quite effective. • Biochar application can be a practical strategy to protect vegetable safety. - Abstract: Considering the potential threat of vegetables growing in antibiotic-polluted soil with high abundance of antibiotic-resistant genes (ARGs) against human health through the food chain, it is thus urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivation with biochar amendment exhibited the greatest soil SA dissipation as well as the significant improvement of lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA resistant bacteria nor sul genes were detected in new leaves. It was the first study to demonstrate that biochar amendment can be a practical strategy to protect lettuce safety growing in SA-polluted soil with rich ARB and ARGs.

  6. A gene pathway analysis highlights the role of cellular adhesion molecules in multiple sclerosis susceptibility

    DEFF Research Database (Denmark)

    Damotte, V; Guillot-Noel, L; Patsopoulos, N A

    2014-01-01

    adhesion molecule (CAMs) biological pathway using Cytoscape software. This network is a strong candidate, as it is involved in the crossing of the blood-brain barrier by the T cells, an early event in MS pathophysiology, and is used as an efficient therapeutic target. We drew up a list of 76 genes...... in interaction with other genes as a group. Pathway analysis is an alternative way to highlight such group of genes. Using SNP association P-values from eight multiple sclerosis (MS) GWAS data sets, we performed a candidate pathway analysis for MS susceptibility by considering genes interacting in the cell...... belonging to the CAM network. We highlighted 64 networks enriched with CAM genes with low P-values. Filtering by a percentage of CAM genes up to 50% and rejecting enriched signals mainly driven by transcription factors, we highlighted five networks associated with MS susceptibility. One of them, constituted...

  7. Environmental enrichment for aquatic animals.

    Science.gov (United States)

    Corcoran, Mike

    2015-05-01

    Aquatic animals are the most popular pets in the United States based on the number of owned pets. They are popular display animals and are increasingly used in research settings. Enrichment of captive animals is an important element of zoo and laboratory medicine. The importance of enrichment for aquatic animals has been slower in implementation. For a long time, there was debate over whether or not fish were able to experience pain or form long-term memories. As that debate has reduced and the consciousness of more aquatic animals is accepted, the need to discuss enrichment for these animals has increased. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Report of the Subcommittee on Domestic Uranium Enrichment

    International Nuclear Information System (INIS)

    1981-01-01

    A report by the Subcommittee on Domestic Uranium Enrichment to the Atomic Energy Commission is described; which covers the procedure of the domestic uranium enrichment by centrifugal process up to the commercial production, reviewing the current situation in this field. Domestic uranium enrichment is important in the aspects of securing stable enrichment service, establishing sound fuel cycle, and others. As the future target, the production around the year 2000 is set at 3,000 tons SWU per year at least. The business of uranium enrichment, which is now developed in the Power Reactor and Nuclear Fuel Development Corporation, is to be carried out by private enterprise. The contents are as follows: demand and supply balance of uranium enrichment service, significance of domestic uranium enrichment, evaluation of centrifugal uranium enrichment technology, the target of domestic uranium enrichment, the policy of domestic uranium enrichment promotion. (J.P.N.)

  9. Mangrove microniches determine the structural and functional diversity of enriched petroleum hydrocarbon-degrading consortia.

    Science.gov (United States)

    Gomes, Newton C M; Flocco, Cecilia G; Costa, Rodrigo; Junca, Howard; Vilchez, Ramiro; Pieper, Dietmar H; Krögerrecklenfort, Ellen; Paranhos, Rodolfo; Mendonça-Hagler, Leda C S; Smalla, Kornelia

    2010-11-01

    In this study, the combination of culture enrichments and molecular tools was used to identify bacterial guilds, plasmids and functional genes potentially important in the process of petroleum hydrocarbon (PH) decontamination in mangrove microniches (rhizospheres and bulk sediment). In addition, we aimed to recover PH-degrading consortia (PHDC) for future use in remediation strategies. The PHDC were enriched with petroleum from rhizosphere and bulk sediment samples taken from a mangrove chronically polluted with oil hydrocarbons. Southern blot hybridization (SBH) assays of PCR amplicons from environmental DNA before enrichments resulted in weak positive signals for the functional gene types targeted, suggesting that PH-degrading genotypes and plasmids were in low abundance in the rhizosphere and bulk sediments. However, after enrichment, these genes were detected and strong microniche-dependent differences in the abundance and composition of hydrocarbonoclastic bacterial populations, plasmids (IncP-1α, IncP-1β, IncP-7 and IncP-9) and functional genes (naphthalene, extradiol and intradiol dioxygenases) were revealed by in-depth molecular analyses [PCR-denaturing gradient gel electrophoresis and hybridization (SBH and microarray)]. Our results suggest that, despite the low abundance of PH-degrading genes and plasmids in the environmental samples, the original bacterial composition of the mangrove microniches determined the structural and functional diversity of the PHDC enriched. © 2010 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  10. Comparative gene expression analysis of two mouse models of autism:transcriptome profiling of the BTBR and En2-/- hippocampus

    Directory of Open Access Journals (Sweden)

    Giovanni Provenzano

    2016-08-01

    Full Text Available Autism spectrum disorders (ASD are characterized by a high degree of genetic heterogeneity. Genomic studies identified common pathological processes underlying the heterogeneous clinical manifestations of ASD, and transcriptome analyses revealed that gene networks involved in synapse development, neuronal activity and immune function are deregulated in ASD. Mouse models provide unique tools to investigate the neurobiological basis of ASD; however, a comprehensive approach to identify transcriptional abnormalities in different ASD models has never been performed. Here we used two well-recognized ASD mouse models, BTBR T+ Itpr3tf/J (BTBR and Engrailed-2 knockout (En2-/-, to identify conserved ASD-related molecular signatures. En2-/- mice bear a mutation within the EN2 transcription factor homeobox, while BTBR is an inbred strain with unknown genetic defects. Hippocampal RNA samples from BTBR, En2-/- and respective control (C57Bl/6J and En2+/+ adult mice were assessed for differential gene expression using microarrays. A total of 153 genes were similarly deregulated in the BTBR and En2-/- hippocampus. Mouse phenotype and gene ontology enrichment analyses were performed on BTBR and En2-/- hippocampal differentially expressed genes (DEGs. Pathways represented in both BTBR and En2-/- hippocampal DEGs included abnormal behavioral response and chemokine/MAP kinase signaling. Genes involved in abnormal function of the immune system and abnormal synaptic transmission/seizures were significantly represented among BTBR and En2-/- DEGs, respectively. Interestingly, both BTBR and En2-/- hippocampal DEGs showed a significant enrichment of ASD and schizophrenia (SCZ-associated genes. Specific gene sets were enriched in the two models: microglial genes were significantly enriched among BTBR DEGs, whereas GABAergic/glutamatergic postsynaptic genes, FMRP-interacting genes and epilepsy-related genes were significantly enriched among En2-/- DEGs. Weighted

  11. Enriching Genomic Resources and Marker Development from Transcript Sequences of Jatropha curcas for Microgravity Studies

    Science.gov (United States)

    Tian, Wenlan; Paudel, Dev

    2017-01-01

    Jatropha (Jatropha curcas L.) is an economically important species with a great potential for biodiesel production. To enrich the jatropha genomic databases and resources for microgravity studies, we sequenced and annotated the transcriptome of jatropha and developed SSR and SNP markers from the transcriptome sequences. In total 1,714,433 raw reads with an average length of 441.2 nucleotides were generated. De novo assembling and clustering resulted in 115,611 uniquely assembled sequences (UASs) including 21,418 full-length cDNAs and 23,264 new jatropha transcript sequences. The whole set of UASs were fully annotated, out of which 59,903 (51.81%) were assigned with gene ontology (GO) term, 12,584 (10.88%) had orthologs in Eukaryotic Orthologous Groups (KOG), and 8,822 (7.63%) were mapped to 317 pathways in six different categories in Kyoto Encyclopedia of Genes and Genome (KEGG) database, and it contained 3,588 putative transcription factors. From the UASs, 9,798 SSRs were discovered with AG/CT as the most frequent (45.8%) SSR motif type. Further 38,693 SNPs were detected and 7,584 remained after filtering. This UAS set has enriched the current jatropha genomic databases and provided a large number of genetic markers, which can facilitate jatropha genetic improvement and many other genetic and biological studies. PMID:28154822

  12. Addiction and Reward-related Genes Show Altered Expression in the Postpartum Nucleus Accumbens

    Directory of Open Access Journals (Sweden)

    Changjiu eZhao

    2014-11-01

    Full Text Available Motherhood involves a switch in natural rewards, whereby offspring become highly rewarding. Nucleus accumbens (NAC is a key CNS region for natural rewards and addictions, but to date no study has evaluated on a large scale the events in NAC that underlie the maternal change in natural rewards. In this study we utilized microarray and bioinformatics approaches to evaluate postpartum NAC gene expression changes in mice. Modular Single-set Enrichment Test (MSET indicated that postpartum (relative to virgin NAC gene expression profile was significantly enriched for genes related to addiction and reward in 5 of 5 independently curated databases (e.g., Malacards, Phenopedia. Over 100 addiction/reward related genes were identified and these included: Per1, Per2, Arc, Homer2, Creb1, Grm3, Fosb, Gabrb3, Adra2a, Ntrk2, Cry1, Penk, Cartpt, Adcy1, Npy1r, Htr1a, Drd1a, Gria1, and Pdyn. ToppCluster analysis found maternal NAC expression profile to be significantly enriched for genes related to the drug action of nicotine, ketamine, and dronabinol. Pathway analysis indicated postpartum NAC as enriched for RNA processing, CNS development/differentiation, and transcriptional regulation. Weighted Gene Coexpression Network Analysis identified possible networks for transcription factors, including Nr1d1, Per2, Fosb, Egr1, and Nr4a1. The postpartum state involves increased risk for mental health disorders and MSET analysis indicated postpartum NAC to be enriched for genes related to depression, bipolar disorder, and schizophrenia. Mental health related genes included: Fabp7, Grm3, Penk, and Nr1d1. We confirmed via quantitative PCR Nr1d1, Per2, Grm3, Penk, Drd1a, and Pdyn. This study indicates for the first time that postpartum NAC involves large scale gene expression alterations linked to addiction and reward. Because the postpartum state also involves decreased response to drugs, the findings could provide insights into how to mitigate addictions.

  13. Gene expression meta-analysis identifies chromosomal regions involved in ovarian cancer survival

    DEFF Research Database (Denmark)

    Thomassen, Mads; Jochumsen, Kirsten M; Mogensen, Ole

    2009-01-01

    the relation of gene expression and chromosomal position to identify chromosomal regions of importance for early recurrence of ovarian cancer. By use of *Gene Set Enrichment Analysis*, we have ranked chromosomal regions according to their association to survival. Over-representation analysis including 1...... using death (P = 0.015) and recurrence (P = 0.002) as outcome. The combined mutation score is strongly associated to upregulation of several growth factor pathways....

  14. GSHR, a Web-Based Platform Provides Gene Set-Level Analyses of Hormone Responses in Arabidopsis

    Directory of Open Access Journals (Sweden)

    Xiaojuan Ran

    2018-01-01

    Full Text Available Phytohormones regulate diverse aspects of plant growth and environmental responses. Recent high-throughput technologies have promoted a more comprehensive profiling of genes regulated by different hormones. However, these omics data generally result in large gene lists that make it challenging to interpret the data and extract insights into biological significance. With the rapid accumulation of theses large-scale experiments, especially the transcriptomic data available in public databases, a means of using this information to explore the transcriptional networks is needed. Different platforms have different architectures and designs, and even similar studies using the same platform may obtain data with large variances because of the highly dynamic and flexible effects of plant hormones; this makes it difficult to make comparisons across different studies and platforms. Here, we present a web server providing gene set-level analyses of Arabidopsis thaliana hormone responses. GSHR collected 333 RNA-seq and 1,205 microarray datasets from the Gene Expression Omnibus, characterizing transcriptomic changes in Arabidopsis in response to phytohormones including abscisic acid, auxin, brassinosteroids, cytokinins, ethylene, gibberellins, jasmonic acid, salicylic acid, and strigolactones. These data were further processed and organized into 1,368 gene sets regulated by different hormones or hormone-related factors. By comparing input gene lists to these gene sets, GSHR helped to identify gene sets from the input gene list regulated by different phytohormones or related factors. Together, GSHR links prior information regarding transcriptomic changes induced by hormones and related factors to newly generated data and facilities cross-study and cross-platform comparisons; this helps facilitate the mining of biologically significant information from large-scale datasets. The GSHR is freely available at http://bioinfo.sibs.ac.cn/GSHR/.

  15. Identification of a conserved set of upregulated genes in mouse skeletal muscle hypertrophy and regrowth.

    Science.gov (United States)

    Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J

    2015-01-01

    The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.

  16. Flavanol-Enriched Cocoa Powder Alters the Intestinal Microbiota, Tissue and Fluid Metabolite Profiles, and Intestinal Gene Expression in Pigs.

    Science.gov (United States)

    Jang, Saebyeol; Sun, Jianghao; Chen, Pei; Lakshman, Sukla; Molokin, Aleksey; Harnly, James M; Vinyard, Bryan T; Urban, Joseph F; Davis, Cindy D; Solano-Aguilar, Gloria

    2016-04-01

    Consumption of cocoa-derived polyphenols has been associated with several health benefits; however, their effects on the intestinal microbiome and related features of host intestinal health are not adequately understood. The objective of this study was to determine the effects of eating flavanol-enriched cocoa powder on the composition of the gut microbiota, tissue metabolite profiles, and intestinal immune status. Male pigs (5 mo old, 28 kg mean body weight) were supplemented with 0, 2.5, 10, or 20 g flavanol-enriched cocoa powder/d for 27 d. Metabolites in serum, urine, the proximal colon contents, liver, and adipose tissue; bacterial abundance in the intestinal contents and feces; and intestinal tissue gene expression of inflammatory markers and Toll-like receptors (TLRs) were then determined. O-methyl-epicatechin-glucuronide conjugates dose-dependently increased (Pcocoa powder. The concentration of 3-hydroxyphenylpropionic acid isomers in urine decreased as the dose of cocoa powder fed to pigs increased (75-85%,Pcocoa powder/d, respectively. Moreover, consumption of cocoa powder reducedTLR9gene expression in ileal Peyer's patches (67-80%,Pcocoa powder/d compared with pigs not supplemented with cocoa powder. This study demonstrates that consumption of cocoa powder by pigs can contribute to gut health by enhancing the abundance ofLactobacillusandBifidobacteriumspecies and modulating markers of localized intestinal immunity. © 2016 American Society for Nutrition.

  17. Integrative ChIP-seq/microarray analysis identifies a CTNNB1 target signature enriched in intestinal stem cells and colon cancer.

    Science.gov (United States)

    Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L; Roberts, Brian S; Arthur, William T; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing

    2014-01-01

    Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells.

  18. Chronic vitamin A-enriched diet feeding regulates hypercholesterolaemia through transcriptional regulation of reverse cholesterol transport pathway genes in obese rat model of WNIN/GR-Ob strain

    Directory of Open Access Journals (Sweden)

    Shanmugam M Jeyakumar

    2016-01-01

    Full Text Available Background & objectives: Hepatic scavenger receptor class B1 (SR-B1, a high-density lipoprotein (HDL receptor, is involved in the selective uptake of HDL-associated esterified cholesterol (EC, thereby regulates cholesterol homoeostasis and improves reverse cholesterol transport. Previously, we reported in euglycaemic obese rats (WNIN/Ob strain that feeding of vitamin A-enriched diet normalized hypercholesterolaemia, possibly through hepatic SR-B1-mediated pathway. This study was aimed to test whether it would be possible to normalize hypercholesterolaemia in glucose-intolerant obese rat model (WNIN/GR/Ob through similar mechanism by feeding identical vitamin A-enriched diet. Methods: In this study, 30 wk old male lean and obese rats of WNIN/GR-Ob strain were divided into two groups and received either stock diet or vitamin A-enriched diet (2.6 mg or 129 mg vitamin A/kg diet for 14 wk. Blood and other tissues were collected for various biochemical analyses. Results: Chronic vitamin A-enriched diet feeding decreased hypercholesterolaemia and normalized abnormally elevated plasma HDL-cholesterol (HDL-C levels in obese rats as compared to stock diet-fed obese groups. Further, decreased free cholesterol (FC and increased esterified cholesterol (EC contents of plasma cholesterol were observed, which were reflected in higher EC to FC ratio of vitamin A-enriched diet-fed obese rats. However, neither lecithin-cholesterol acyltransferase (LCAT activity of plasma nor its expression (both gene and protein in the liver were altered. On the contrary, hepatic cholesterol levels significantly increased in vitamin A-enriched diet fed obese rats. Hepatic SR-B1 expression (both mRNA and protein remained unaltered among groups. Vitamin A-enriched diet fed obese rats showed a significant increase in hepatic low-density lipoprotein receptor mRNA levels, while the expression of genes involved in HDL synthesis, namely, ATP-binding cassette protein 1 (ABCA1 and

  19. DOSE RESPONSE FROM HIGH THROUGHPUT GENE EXPRESSION STUDIES AND THE INFLUENCE OF TIME AND CELL LINE ON INFERRED MODE OF ACTION BY ONTOLOGIC ENRICHMENT (SOT)

    Science.gov (United States)

    Gene expression with ontologic enrichment and connectivity mapping tools is widely used to infer modes of action (MOA) for therapeutic drugs. Despite progress in high-throughput (HT) genomic systems, strategies suitable to identify industrial chemical MOA are needed. The L1000 is...

  20. Enrichment of Acinetobacter spp. from food samples.

    Science.gov (United States)

    Carvalheira, Ana; Ferreira, Vânia; Silva, Joana; Teixeira, Paula

    2016-05-01

    Relatively little is known about the role of foods in the chain of transmission of acinetobacters and the occurrence of different Acinetobacter spp. in foods. Currently, there is no standard procedure to recover acinetobacters from food in order to gain insight into the food-related ecology and epidemiology of acinetobacters. This study aimed to assess whether enrichment in Dijkshoorn enrichment medium followed by plating in CHROMagar™ Acinetobacter medium is a useful method for the isolation of Acinetobacter spp. from foods. Recovery of six Acinetobacter species from food spiked with these organisms was compared for two selective enrichment media (Baumann's enrichment and Dijkshoorn's enrichment). Significantly (p enrichment. Next, the Dijkshoorn's enrichment followed by direct plating on CHROMagar™ Acinetobacter was applied to detect Acinetobacter spp. in different foods. Fourteen different presumptive acinetobacters were recovered and assumed to represent nine different strains on the basis of REP-PCR typing. Eight of these strains were identified by rpoB gene analysis as belonging to the species Acinetobacter johnsonii, Acinetobacter calcoaceticus, Acinetobacter guillouiae and Acinetobacter gandensis. It was not possible to identify the species level of one strain which may suggests that it represents a distinct species. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

    Science.gov (United States)

    Johnson, Emma C; Border, Richard; Melroy-Greif, Whitney E; de Leeuw, Christiaan A; Ehringer, Marissa A; Keller, Matthew C

    2017-11-15

    A recent analysis of 25 historical candidate gene polymorphisms for schizophrenia in the largest genome-wide association study conducted to date suggested that these commonly studied variants were no more associated with the disorder than would be expected by chance. However, the same study identified other variants within those candidate genes that demonstrated genome-wide significant associations with schizophrenia. As such, it is possible that variants within historic schizophrenia candidate genes are associated with schizophrenia at levels above those expected by chance, even if the most-studied specific polymorphisms are not. The present study used association statistics from the largest schizophrenia genome-wide association study conducted to date as input to a gene set analysis to investigate whether variants within schizophrenia candidate genes are enriched for association with schizophrenia. As a group, variants in the most-studied candidate genes were no more associated with schizophrenia than were variants in control sets of noncandidate genes. While a small subset of candidate genes did appear to be significantly associated with schizophrenia, these genes were not particularly noteworthy given the large number of more strongly associated noncandidate genes. The history of schizophrenia research should serve as a cautionary tale to candidate gene investigators examining other phenotypes: our findings indicate that the most investigated candidate gene hypotheses of schizophrenia are not well supported by genome-wide association studies, and it is likely that this will be the case for other complex traits as well. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  2. Laser capture microdissection of enriched populations of neurons or single neurons for gene expression analysis after traumatic brain injury.

    Science.gov (United States)

    Boone, Deborah R; Sell, Stacy L; Hellmich, Helen Lee

    2013-04-10

    Long-term cognitive disability after TBI is associated with injury-induced neurodegeneration in the hippocampus-a region in the medial temporal lobe that is critical for learning, memory and executive function. Hence our studies focus on gene expression analysis of specific neuronal populations in distinct subregions of the hippocampus. The technique of laser capture microdissection (LCM), introduced in 1996 by Emmert-Buck, et al., has allowed for significant advances in gene expression analysis of single cells and enriched populations of cells from heterogeneous tissues such as the mammalian brain that contains thousands of functional cell types. We use LCM and a well established rat model of traumatic brain injury (TBI) to investigate the molecular mechanisms that underlie the pathogenesis of TBI. Following fluid-percussion TBI, brains are removed at pre-determined times post-injury, immediately frozen on dry ice, and prepared for sectioning in a cryostat. The rat brains can be embedded in OCT and sectioned immediately, or stored several months at -80 °C before sectioning for laser capture microdissection. Additionally, we use LCM to study the effects of TBI on circadian rhythms. For this, we capture neurons from the suprachiasmatic nuclei that contain the master clock of the mammalian brain. Here, we demonstrate the use of LCM to obtain single identified neurons (injured and degenerating, Fluoro-Jade-positive, or uninjured, Fluoro-Jade-negative) and enriched populations of hippocampal neurons for subsequent gene expression analysis by real time PCR and/or whole-genome microarrays. These LCM-enabled studies have revealed that the selective vulnerability of anatomically distinct regions of the rat hippocampus are reflected in the different gene expression profiles of different populations of neurons obtained by LCM from these distinct regions. The results from our single-cell studies, where we compare the transcriptional profiles of dying and adjacent surviving

  3. Community Composition of Nitrous Oxide-Related Genes in Salt Marsh Sediments Exposed to Nitrogen Enrichment.

    Science.gov (United States)

    Angell, John H; Peng, Xuefeng; Ji, Qixing; Craick, Ian; Jayakumar, Amal; Kearns, Patrick J; Ward, Bess B; Bowen, Jennifer L

    2018-01-01

    Salt marshes provide many key ecosystem services that have tremendous ecological and economic value. One critical service is the removal of fixed nitrogen from coastal waters, which limits the negative effects of eutrophication resulting from increased nutrient supply. Nutrient enrichment of salt marsh sediments results in higher rates of nitrogen cycling and, commonly, a concurrent increase in the flux of nitrous oxide, an important greenhouse gas. Little is known, however, regarding controls on the microbial communities that contribute to nitrous oxide fluxes in marsh sediments. To address this disconnect, we generated profiles of microbial communities and communities of micro-organisms containing specific nitrogen cycling genes that encode several enzymes ( amoA, norB, nosZ) related to nitrous oxide flux from salt marsh sediments. We hypothesized that communities of microbes responsible for nitrogen transformations will be structured by nitrogen availability. Taxa that respond positively to high nitrogen inputs may be responsible for the elevated rates of nitrogen cycling processes measured in fertilized sediments. Our data show that, with the exception of ammonia-oxidizing archaea, the community composition of organisms involved in the production and consumption of nitrous oxide was altered under nutrient enrichment. These results suggest that previously measured rates of nitrous oxide production and consumption are likely the result of changes in community structure, not simply changes in microbial activity.

  4. A Meta-Analysis of Multiple Matched Copy Number and Transcriptomics Data Sets for Inferring Gene Regulatory Relationships

    Science.gov (United States)

    Newton, Richard; Wernisch, Lorenz

    2014-01-01

    Inferring gene regulatory relationships from observational data is challenging. Manipulation and intervention is often required to unravel causal relationships unambiguously. However, gene copy number changes, as they frequently occur in cancer cells, might be considered natural manipulation experiments on gene expression. An increasing number of data sets on matched array comparative genomic hybridisation and transcriptomics experiments from a variety of cancer pathologies are becoming publicly available. Here we explore the potential of a meta-analysis of thirty such data sets. The aim of our analysis was to assess the potential of in silico inference of trans-acting gene regulatory relationships from this type of data. We found sufficient correlation signal in the data to infer gene regulatory relationships, with interesting similarities between data sets. A number of genes had highly correlated copy number and expression changes in many of the data sets and we present predicted potential trans-acted regulatory relationships for each of these genes. The study also investigates to what extent heterogeneity between cell types and between pathologies determines the number of statistically significant predictions available from a meta-analysis of experiments. PMID:25148247

  5. Medial prefrontal cortex: genes linked to bipolar disorder and schizophrenia have altered expression in the highly social maternal phenotype

    Directory of Open Access Journals (Sweden)

    Brian E Eisinger

    2014-04-01

    Full Text Available The transition to motherhood involves CNS changes that modify sociability and affective state. However, these changes also put females at risk for postpartum depression and psychosis, which impairs parenting abilities and adversely affects children. Thus, changes in expression and interactions in a core subset of genes may be critical for emergence of a healthy maternal phenotype, but inappropriate changes of the same genes could put women at risk for postpartum disorders. This study evaluated microarray gene expression changes in medial prefrontal cortex (mPFC, a region implicated in both maternal behavior and psychiatric disorders. Postpartum mice were compared to virgin controls housed with females and isolated for identical durations. Using the Modular Single-set Enrichment Test (MSET, we found that the genetic landscape of maternal mPFC bears statistical similarity to gene databases associated with schizophrenia (5 of 5 sets and bipolar disorder (BPD, 3 of 3 sets. In contrast to previous studies of maternal lateral septum and medial preoptic area, enrichment of autism and depression-linked genes was not significant (2 of 9 sets, 0 of 4 sets. Among genes linked to multiple disorders were fatty acid binding protein 7 (Fabp7, glutamate metabotropic receptor 3 (Grm3, platelet derived growth factor, beta polypeptide (Pdgfrb, and nuclear receptor subfamily 1, group D, member 1 (Nr1d1. RT-qPCR confirmed these gene changes as well as FMS-like tyrosine kinase 1 (Flt1 and proenkephalin (Penk. Systems-level methods revealed involvement of developmental gene networks in establishing the maternal phenotype and indirectly suggested a role for numerous microRNAs and transcription factors in mediating expression changes. Together, this study suggests that a subset of genes involved in shaping the healthy maternal brain may also be dysregulated in mental health disorders and put females at risk for postpartum psychosis with aspects of schizophrenia and BPD.

  6. Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

    Science.gov (United States)

    Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

    2017-10-01

    During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.

  7. Evaluation of the uranium enrichment demonstration plant project

    International Nuclear Information System (INIS)

    Sugitsue, Noritake

    2001-01-01

    In this report, the organization system of the uranium enrichment business is evaluated, based on the operation of the uranium enrichment demonstration plant. As a result, in uranium enrichment technology development or business, it was acknowledged that maintenance of the organization which has the Trinity of a research/engineering/operation was necessary in an industrialization stage by exceptional R and D cycle. Japan Nuclear Fuel Ltd. (JNFL) set up the Rokkashomura Aomori Uranium Enrichment Research and Development Center in November 2000. As a result, the system that company directly engaged in engineering development was prepared. And results obtained in this place is expected toward certain establishment of the uranium enrichment business of Japan. (author)

  8. 47 CFR 1.2111 - Assignment or transfer of control: unjust enrichment.

    Science.gov (United States)

    2010-10-01

    ... enrichment. 1.2111 Section 1.2111 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL PRACTICE AND...: unjust enrichment. (a) Reporting requirement. An applicant seeking approval for a transfer of control or... an option to purchase; below market financing). (b) Unjust enrichment payment: set-aside. As...

  9. Reduced expression of brain-enriched microRNAs in glioblastomas permits targeted regulation of a cell death gene.

    Directory of Open Access Journals (Sweden)

    Rebecca L Skalsky

    Full Text Available Glioblastoma is a highly aggressive malignant tumor involving glial cells in the human brain. We used high-throughput sequencing to comprehensively profile the small RNAs expressed in glioblastoma and non-tumor brain tissues. MicroRNAs (miRNAs made up the large majority of small RNAs, and we identified over 400 different cellular pre-miRNAs. No known viral miRNAs were detected in any of the samples analyzed. Cluster analysis revealed several miRNAs that were significantly down-regulated in glioblastomas, including miR-128, miR-124, miR-7, miR-139, miR-95, and miR-873. Post-transcriptional editing was observed for several miRNAs, including the miR-376 family, miR-411, miR-381, and miR-379. Using the deep sequencing information, we designed a lentiviral vector expressing a cell suicide gene, the herpes simplex virus thymidine kinase (HSV-TK gene, under the regulation of a miRNA, miR-128, that was found to be enriched in non-tumor brain tissue yet down-regulated in glioblastomas, Glioblastoma cells transduced with this vector were selectively killed when cultured in the presence of ganciclovir. Using an in vitro model to recapitulate expression of brain-enriched miRNAs, we demonstrated that neuronally differentiated SH-SY5Y cells transduced with the miRNA-regulated HSV-TK vector are protected from killing by expression of endogenous miR-128. Together, these results provide an in-depth analysis of miRNA dysregulation in glioblastoma and demonstrate the potential utility of these data in the design of miRNA-regulated therapies for the treatment of brain cancers.

  10. The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin

    Directory of Open Access Journals (Sweden)

    David A. Brown

    2017-09-01

    Full Text Available Chromatin modifications and the promoter-associated epigenome are important for the regulation of gene expression. However, the mechanisms by which chromatin-modifying complexes are targeted to the appropriate gene promoters in vertebrates and how they influence gene expression have remained poorly defined. Here, using a combination of live-cell imaging and functional genomics, we discover that the vertebrate SET1 complex is targeted to actively transcribed gene promoters through CFP1, which engages in a form of multivalent chromatin reading that involves recognition of non-methylated DNA and histone H3 lysine 4 trimethylation (H3K4me3. CFP1 defines SET1 complex occupancy on chromatin, and its multivalent interactions are required for the SET1 complex to place H3K4me3. In the absence of CFP1, gene expression is perturbed, suggesting that normal targeting and function of the SET1 complex are central to creating an appropriately functioning vertebrate promoter-associated epigenome.

  11. Gene Sets for Utilization of Primary and Secondary Nutrition Supplies in the Distal Gut of Endangered Iberian Lynx

    Science.gov (United States)

    Alcaide, María; Messina, Enzo; Richter, Michael; Bargiela, Rafael; Peplies, Jörg; Huws, Sharon A.; Newbold, Charles J.; Golyshin, Peter N.; Simón, Miguel A.; López, Guillermo; Yakimov, Michail M.; Ferrer, Manuel

    2012-01-01

    Recent studies have indicated the existence of an extensive trans-genomic trans-mural co-metabolism between gut microbes and animal hosts that is diet-, host phylogeny- and provenance-influenced. Here, we analyzed the biodiversity at the level of small subunit rRNA gene sequence and the metabolic composition of 18 Mbp of consensus metagenome sequences and activity characteristics of bacterial intra-cellular extracts, in wild Iberian lynx (Lynx pardinus) fecal samples. Bacterial signatures (14.43% of all of the Firmicutes reads and 6.36% of total reads) related to the uncultured anaerobic commensals Anaeroplasma spp., which are typically found in ovine and bovine rumen, were first identified. The lynx gut was further characterized by an over-representation of ‘presumptive’ aquaporin aqpZ genes and genes encoding ‘active’ lysosomal-like digestive enzymes that are possibly needed to acquire glycerol, sugars and amino acids from glycoproteins, glyco(amino)lipids, glyco(amino)glycans and nucleoside diphosphate sugars. Lynx gut was highly enriched (28% of the total glycosidases) in genes encoding α-amylase and related enzymes, although it exhibited low rate of enzymatic activity indicative of starch degradation. The preponderance of β-xylosidase activity in protein extracts further suggests lynx gut microbes being most active for the metabolism of β-xylose containing plant N-glycans, although β-xylosidases sequences constituted only 1.5% of total glycosidases. These collective and unique bacterial, genetic and enzymatic activity signatures suggest that the wild lynx gut microbiota not only harbors gene sets underpinning sugar uptake from primary animal tissues (with the monotypic dietary profile of the wild lynx consisting of 80–100% wild rabbits) but also for the hydrolysis of prey-derived plant biomass. Although, the present investigation corresponds to a single sample and some of the statements should be considered qualitative, the data most likely

  12. Gene sets for utilization of primary and secondary nutrition supplies in the distal gut of endangered Iberian lynx.

    Directory of Open Access Journals (Sweden)

    María Alcaide

    Full Text Available Recent studies have indicated the existence of an extensive trans-genomic trans-mural co-metabolism between gut microbes and animal hosts that is diet-, host phylogeny- and provenance-influenced. Here, we analyzed the biodiversity at the level of small subunit rRNA gene sequence and the metabolic composition of 18 Mbp of consensus metagenome sequences and activity characteristics of bacterial intra-cellular extracts, in wild Iberian lynx (Lynx pardinus fecal samples. Bacterial signatures (14.43% of all of the Firmicutes reads and 6.36% of total reads related to the uncultured anaerobic commensals Anaeroplasma spp., which are typically found in ovine and bovine rumen, were first identified. The lynx gut was further characterized by an over-representation of 'presumptive' aquaporin aqpZ genes and genes encoding 'active' lysosomal-like digestive enzymes that are possibly needed to acquire glycerol, sugars and amino acids from glycoproteins, glyco(aminolipids, glyco(aminoglycans and nucleoside diphosphate sugars. Lynx gut was highly enriched (28% of the total glycosidases in genes encoding α-amylase and related enzymes, although it exhibited low rate of enzymatic activity indicative of starch degradation. The preponderance of β-xylosidase activity in protein extracts further suggests lynx gut microbes being most active for the metabolism of β-xylose containing plant N-glycans, although β-xylosidases sequences constituted only 1.5% of total glycosidases. These collective and unique bacterial, genetic and enzymatic activity signatures suggest that the wild lynx gut microbiota not only harbors gene sets underpinning sugar uptake from primary animal tissues (with the monotypic dietary profile of the wild lynx consisting of 80-100% wild rabbits but also for the hydrolysis of prey-derived plant biomass. Although, the present investigation corresponds to a single sample and some of the statements should be considered qualitative, the data most likely

  13. Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

    Science.gov (United States)

    Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain A; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston-Campbell, Lara E; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A; Monteiro, Alvaro N A; Freedman, Matthew L; Gayther, Simon A; Pharoah, Paul D P

    2015-10-01

    Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations. We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. ©2015 American Association for Cancer Research.

  14. Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

    Science.gov (United States)

    Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

    2017-07-12

    The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.

  15. Flavanol-Enriched Cocoa Powder Alters the Intestinal Microbiota, Tissue and Fluid Metabolite Profiles, and Intestinal Gene Expression in Pigs1234

    Science.gov (United States)

    Jang, Saebyeol; Sun, Jianghao; Chen, Pei; Lakshman, Sukla; Molokin, Aleksey; Harnly, James M; Vinyard, Bryan T; Urban, Joseph F; Davis, Cindy D; Solano-Aguilar, Gloria

    2016-01-01

    Background: Consumption of cocoa-derived polyphenols has been associated with several health benefits; however, their effects on the intestinal microbiome and related features of host intestinal health are not adequately understood. Objective: The objective of this study was to determine the effects of eating flavanol-enriched cocoa powder on the composition of the gut microbiota, tissue metabolite profiles, and intestinal immune status. Methods: Male pigs (5 mo old, 28 kg mean body weight) were supplemented with 0, 2.5, 10, or 20 g flavanol-enriched cocoa powder/d for 27 d. Metabolites in serum, urine, the proximal colon contents, liver, and adipose tissue; bacterial abundance in the intestinal contents and feces; and intestinal tissue gene expression of inflammatory markers and Toll-like receptors (TLRs) were then determined. Results: O-methyl-epicatechin-glucuronide conjugates dose-dependently increased (P cocoa powder. The concentration of 3-hydroxyphenylpropionic acid isomers in urine decreased as the dose of cocoa powder fed to pigs increased (75–85%, P cocoa powder/d, respectively. Moreover, consumption of cocoa powder reduced TLR9 gene expression in ileal Peyer’s patches (67–80%, P cocoa powder/d compared with pigs not supplemented with cocoa powder. Conclusion: This study demonstrates that consumption of cocoa powder by pigs can contribute to gut health by enhancing the abundance of Lactobacillus and Bifidobacterium species and modulating markers of localized intestinal immunity. PMID:26936136

  16. Tissue-Specific Enrichment of Lymphoma Risk Loci in Regulatory Elements.

    Science.gov (United States)

    Hayes, James E; Trynka, Gosia; Vijai, Joseph; Offit, Kenneth; Raychaudhuri, Soumya; Klein, Robert J

    2015-01-01

    Though numerous polymorphisms have been associated with risk of developing lymphoma, how these variants function to promote tumorigenesis is poorly understood. Here, we report that lymphoma risk SNPs, especially in the non-Hodgkin's lymphoma subtype chronic lymphocytic leukemia, are significantly enriched for co-localization with epigenetic marks of active gene regulation. These enrichments were seen in a lymphoid-specific manner for numerous ENCODE datasets, including DNase-hypersensitivity as well as multiple segmentation-defined enhancer regions. Furthermore, we identify putatively functional SNPs that are both in regulatory elements in lymphocytes and are associated with gene expression changes in blood. We developed an algorithm, UES, that uses a Monte Carlo simulation approach to calculate the enrichment of previously identified risk SNPs in various functional elements. This multiscale approach integrating multiple datasets helps disentangle the underlying biology of lymphoma, and more broadly, is generally applicable to GWAS results from other diseases as well.

  17. Identification of upstream transcription factors (TFs) for expression signature genes in breast cancer.

    Science.gov (United States)

    Zang, Hongyan; Li, Ning; Pan, Yuling; Hao, Jingguang

    2017-03-01

    Breast cancer is a common malignancy among women with a rising incidence. Our intention was to detect transcription factors (TFs) for deeper understanding of the underlying mechanisms of breast cancer. Integrated analysis of gene expression datasets of breast cancer was performed. Then, functional annotation of differentially expressed genes (DEGs) was conducted, including Gene Ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment. Furthermore, TFs were identified and a global transcriptional regulatory network was constructed. Seven publically available GEO datasets were obtained, and a set of 1196 DEGs were identified (460 up-regulated and 736 down-regulated). Functional annotation results showed that cell cycle was the most significantly enriched pathway, which was consistent with the fact that cell cycle is closely related to various tumors. Fifty-three differentially expressed TFs were identified, and the regulatory networks consisted of 817 TF-target interactions between 46 TFs and 602 DEGs in the context of breast cancer. Top 10 TFs covering the most downstream DEGs were SOX10, NFATC2, ZNF354C, ARID3A, BRCA1, FOXO3, GATA3, ZEB1, HOXA5 and EGR1. The transcriptional regulatory networks could enable a better understanding of regulatory mechanisms of breast cancer pathology and provide an opportunity for the development of potential therapy.

  18. Integrative ChIP-seq/microarray analysis identifies a CTNNB1 target signature enriched in intestinal stem cells and colon cancer.

    Directory of Open Access Journals (Sweden)

    Kazuhide Watanabe

    Full Text Available Deregulation of canonical Wnt/CTNNB1 (beta-catenin pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells.We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis.Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells.

  19. Differential gene expression from genome-wide microarray analyses distinguishes Lohmann Selected Leghorn and Lohmann Brown layers.

    Directory of Open Access Journals (Sweden)

    Christin Habig

    Full Text Available The Lohmann Selected Leghorn (LSL and Lohmann Brown (LB layer lines have been selected for high egg production since more than 50 years and belong to the worldwide leading commercial layer lines. The objectives of the present study were to characterize the molecular processes that are different among these two layer lines using whole genome RNA expression profiles. The hens were kept in the newly developed small group housing system Eurovent German with two different group sizes. Differential expression was observed for 6,276 microarray probes (FDR adjusted P-value <0.05 among the two layer lines LSL and LB. A 2-fold or greater change in gene expression was identified on 151 probe sets. In LSL, 72 of the 151 probe sets were up- and 79 of them were down-regulated. Gene ontology (GO enrichment analysis accounting for biological processes evinced 18 GO-terms for the 72 probe sets with higher expression in LSL, especially those taking part in immune system processes and membrane organization. A total of 32 enriched GO-terms were determined among the 79 down-regulated probe sets of LSL. Particularly, these terms included phosphorus metabolic processes and signaling pathways. In conclusion, the phenotypic differences among the two layer lines LSL and LB are clearly reflected in their gene expression profiles of the cerebrum. These novel findings provide clues for genes involved in economically important line characteristics of commercial laying hens.

  20. Clinicopathologic and gene expression parameters predict liver cancer prognosis

    International Nuclear Information System (INIS)

    Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

    2011-01-01

    The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome

  1. Mining gene expression data by interpreting principal components

    Directory of Open Access Journals (Sweden)

    Mortazavi Ali

    2006-04-01

    Full Text Available Abstract Background There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. Results We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset. We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.. Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. Conclusion We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It

  2. The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin.

    Science.gov (United States)

    Brown, David A; Di Cerbo, Vincenzo; Feldmann, Angelika; Ahn, Jaewoo; Ito, Shinsuke; Blackledge, Neil P; Nakayama, Manabu; McClellan, Michael; Dimitrova, Emilia; Turberfield, Anne H; Long, Hannah K; King, Hamish W; Kriaucionis, Skirmantas; Schermelleh, Lothar; Kutateladze, Tatiana G; Koseki, Haruhiko; Klose, Robert J

    2017-09-05

    Chromatin modifications and the promoter-associated epigenome are important for the regulation of gene expression. However, the mechanisms by which chromatin-modifying complexes are targeted to the appropriate gene promoters in vertebrates and how they influence gene expression have remained poorly defined. Here, using a combination of live-cell imaging and functional genomics, we discover that the vertebrate SET1 complex is targeted to actively transcribed gene promoters through CFP1, which engages in a form of multivalent chromatin reading that involves recognition of non-methylated DNA and histone H3 lysine 4 trimethylation (H3K4me3). CFP1 defines SET1 complex occupancy on chromatin, and its multivalent interactions are required for the SET1 complex to place H3K4me3. In the absence of CFP1, gene expression is perturbed, suggesting that normal targeting and function of the SET1 complex are central to creating an appropriately functioning vertebrate promoter-associated epigenome. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Combining target enrichment with barcode multiplexing for high throughput SNP discovery

    Directory of Open Access Journals (Sweden)

    Lunke Sebastian

    2010-11-01

    Full Text Available Abstract Background The primary goal of genetic linkage analysis is to identify genes affecting a phenotypic trait. After localisation of the linkage region, efficient genetic dissection of the disease linked loci requires that functional variants are identified across the loci. These functional variations are difficult to detect due to extent of genetic diversity and, to date, incomplete cataloguing of the large number of variants present both within and between populations. Massively parallel sequencing platforms offer unprecedented capacity for variant discovery, however the number of samples analysed are still limited by cost per sample. Some progress has been made in reducing the cost of resequencing using either multiplexing methodologies or through the utilisation of targeted enrichment technologies which provide the ability to resequence genomic areas of interest rather that full genome sequencing. Results We developed a method that combines current multiplexing methodologies with a solution-based target enrichment method to further reduce the cost of resequencing where region-specific sequencing is required. Our multiplex/enrichment strategy produced high quality data with nominal reduction of sequencing depth. We undertook a genotyping study and were successful in the discovery of novel SNP alleles in all samples at uniplex, duplex and pentaplex levels. Conclusion Our work describes the successful combination of a targeted enrichment method and index barcode multiplexing to reduce costs, time and labour associated with processing large sample sets. Furthermore, we have shown that the sequencing depth obtained is adequate for credible SNP genotyping analysis at uniplex, duplex and pentaplex levels.

  4. Variant allele frequency enrichment analysis in vitro reveals sonic hedgehog pathway to impede sustained temozolomide response in GBM.

    Science.gov (United States)

    Biswas, Nidhan K; Chandra, Vikas; Sarkar-Roy, Neeta; Das, Tapojyoti; Bhattacharya, Rabindra N; Tripathy, Laxmi N; Basu, Sunandan K; Kumar, Shantanu; Das, Subrata; Chatterjee, Ankita; Mukherjee, Ankur; Basu, Pryiadarshi; Maitra, Arindam; Chattopadhyay, Ansuman; Basu, Analabha; Dhara, Surajit

    2015-01-21

    Neoplastic cells of Glioblastoma multiforme (GBM) may or may not show sustained response to temozolomide (TMZ) chemotherapy. We hypothesize that TMZ chemotherapy response in GBM is predetermined in its neoplastic clones via a specific set of mutations that alter relevant pathways. We describe exome-wide enrichment of variant allele frequencies (VAFs) in neurospheres displaying contrasting phenotypes of sustained versus reversible TMZ-responses in vitro. Enrichment of VAFs was found on genes ST5, RP6KA1 and PRKDC in cells showing sustained TMZ-effect whereas on genes FREM2, AASDH and STK36, in cells showing reversible TMZ-effect. Ingenuity pathway analysis (IPA) revealed that these genes alter cell-cycle, G2/M-checkpoint-regulation and NHEJ pathways in sustained TMZ-effect cells whereas the lysine-II&V/phenylalanine degradation and sonic hedgehog (Hh) pathways in reversible TMZ-effect cells. Next, we validated the likely involvement of the Hh-pathway in TMZ-response on additional GBM neurospheres as well as on GBM patients, by extracting RNA-sequencing-based gene expression data from the TCGA-GBM database. Finally, we demonstrated TMZ-sensitization of a TMZ non-responder neurosphere in vitro by treating them with the FDA-approved pharmacological Hh-pathway inhibitor vismodegib. Altogether, our results indicate that the Hh-pathway impedes sustained TMZ-response in GBM and could be a potential therapeutic target to enhance TMZ-response in this malignancy.

  5. Classification of Non-Small Cell Lung Cancer Using Significance Analysis of Microarray-Gene Set Reduction Algorithm

    Directory of Open Access Journals (Sweden)

    Lei Zhang

    2016-01-01

    Full Text Available Among non-small cell lung cancer (NSCLC, adenocarcinoma (AC, and squamous cell carcinoma (SCC are two major histology subtypes, accounting for roughly 40% and 30% of all lung cancer cases, respectively. Since AC and SCC differ in their cell of origin, location within the lung, and growth pattern, they are considered as distinct diseases. Gene expression signatures have been demonstrated to be an effective tool for distinguishing AC and SCC. Gene set analysis is regarded as irrelevant to the identification of gene expression signatures. Nevertheless, we found that one specific gene set analysis method, significance analysis of microarray-gene set reduction (SAMGSR, can be adopted directly to select relevant features and to construct gene expression signatures. In this study, we applied SAMGSR to a NSCLC gene expression dataset. When compared with several novel feature selection algorithms, for example, LASSO, SAMGSR has equivalent or better performance in terms of predictive ability and model parsimony. Therefore, SAMGSR is a feature selection algorithm, indeed. Additionally, we applied SAMGSR to AC and SCC subtypes separately to discriminate their respective stages, that is, stage II versus stage I. Few overlaps between these two resulting gene signatures illustrate that AC and SCC are technically distinct diseases. Therefore, stratified analyses on subtypes are recommended when diagnostic or prognostic signatures of these two NSCLC subtypes are constructed.

  6. Enrichment allows identification of diverse, rare elements in metagenomic resistome-virulome sequencing.

    Science.gov (United States)

    Noyes, Noelle R; Weinroth, Maggie E; Parker, Jennifer K; Dean, Chris J; Lakin, Steven M; Raymond, Robert A; Rovira, Pablo; Doster, Enrique; Abdo, Zaid; Martin, Jennifer N; Jones, Kenneth L; Ruiz, Jaime; Boucher, Christina A; Belk, Keith E; Morley, Paul S

    2017-10-17

    Shotgun metagenomic sequencing is increasingly utilized as a tool to evaluate ecological-level dynamics of antimicrobial resistance and virulence, in conjunction with microbiome analysis. Interest in use of this method for environmental surveillance of antimicrobial resistance and pathogenic microorganisms is also increasing. In published metagenomic datasets, the total of all resistance- and virulence-related sequences accounts for enrichment system that incorporates unique molecular indices to count DNA molecules and correct for enrichment bias. The use of the bait-capture and enrichment system significantly increased on-target sequencing of the resistome-virulome, enabling detection of an additional 1441 gene accessions and revealing a low-abundance portion of the resistome-virulome that was more diverse and compositionally different than that detected by more traditional metagenomic assays. The low-abundance portion of the resistome-virulome also contained resistance genes with public health importance, such as extended-spectrum betalactamases, that were not detected using traditional shotgun metagenomic sequencing. In addition, the use of the bait-capture and enrichment system enabled identification of rare resistance gene haplotypes that were used to discriminate between sample origins. These results demonstrate that the rare resistome-virulome contains valuable and unique information that can be utilized for both surveillance and population genetic investigations of resistance. Access to the rare resistome-virulome using the bait-capture and enrichment system validated in this study can greatly advance our understanding of microbiome-resistome dynamics.

  7. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder: association to overlapping traits in ADHD and autism.

    Science.gov (United States)

    Naaijen, J; Bralten, J; Poelmans, G; Glennon, J C; Franke, B; Buitelaar, J K

    2017-01-10

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of single gene or single variant associations was significant on their own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD.

  8. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions

    DEFF Research Database (Denmark)

    Luo, Yonglun; Friis, Jenny Blechingberg; Fernandes, Ana Miguel

    2015-01-01

    at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. Conclusions The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes...... involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.......Background FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins...

  9. Shrinkage covariance matrix approach based on robust trimmed mean in gene sets detection

    Science.gov (United States)

    Karjanto, Suryaefiza; Ramli, Norazan Mohamed; Ghani, Nor Azura Md; Aripin, Rasimah; Yusop, Noorezatty Mohd

    2015-02-01

    Microarray involves of placing an orderly arrangement of thousands of gene sequences in a grid on a suitable surface. The technology has made a novelty discovery since its development and obtained an increasing attention among researchers. The widespread of microarray technology is largely due to its ability to perform simultaneous analysis of thousands of genes in a massively parallel manner in one experiment. Hence, it provides valuable knowledge on gene interaction and function. The microarray data set typically consists of tens of thousands of genes (variables) from just dozens of samples due to various constraints. Therefore, the sample covariance matrix in Hotelling's T2 statistic is not positive definite and become singular, thus it cannot be inverted. In this research, the Hotelling's T2 statistic is combined with a shrinkage approach as an alternative estimation to estimate the covariance matrix to detect significant gene sets. The use of shrinkage covariance matrix overcomes the singularity problem by converting an unbiased to an improved biased estimator of covariance matrix. Robust trimmed mean is integrated into the shrinkage matrix to reduce the influence of outliers and consequently increases its efficiency. The performance of the proposed method is measured using several simulation designs. The results are expected to outperform existing techniques in many tested conditions.

  10. Hyb-Seq: combining target enrichment and genome skimming for plant phylogenomics

    Science.gov (United States)

    Kevin Weitemier; Shannon C.K. Straub; Richard C. Cronn; Mark Fishbein; Roswitha Schmickl; Angela McDonnell; Aaron. Liston

    2014-01-01

    • Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed ( Asclepias syriaca ) were used to design enrichment probes for 3385...

  11. Bioinformatic identification and characterization of human endothelial cell-restricted genes

    Directory of Open Access Journals (Sweden)

    Keskin Derin B

    2010-05-01

    Full Text Available Abstract Background In this study, we used a systematic bioinformatics analysis approach to elucidate genes that exhibit an endothelial cell (EC restricted expression pattern, and began to define their regulation, tissue distribution, and potential biological role. Results Using a high throughput microarray platform, a primary set of 1,191 transcripts that are enriched in different primary ECs compared to non-ECs was identified (LCB >3, FDR Conclusion The study provides an initial catalogue of EC-restricted genes most of which are ubiquitously expressed in different endothelial cells.

  12. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets.

    Science.gov (United States)

    Khan, Aziz; Mathelier, Anthony

    2017-05-31

    A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .

  13. Uranium-enriched granites in Sweden

    International Nuclear Information System (INIS)

    Wilson, M.R.; Aakerblom, G.

    1980-01-01

    Granites with uranium contents higher than normal occur in a variety of geological settings in the Swedish Precambrian, and represent a variety of granite types and ages. They may have been generated by the anatexis of continental crust or processes occurring at a much greater depth. They commonly show enrichment in F, Sn, W and/or Mo. Only in one case is an important uranium mineralization thought to be directly related to a uranium-enriched granite, while the majority of epigenetic uranium mineralizations with economic potential are related to hydrothermal processes in areas where the bedrock is regionally uranium-enhanced. (author)

  14. Uranium enriched granites in Sweden

    International Nuclear Information System (INIS)

    Wilson, M.R.; Aakerblom, G.

    1980-01-01

    Granites with uranium contents higher than normal occur in a variety of geological settings in the Swedish Precambrian, and represent a variety of granite types and ages. They may have been generated by (1) the anatexis of continental crust (2) processes occurring at a much greater depth. They commonly show enrichement in F, Sn, W and/or Mo. Only in one case is an important uranium mineralization thought to be directly related to a uranium-enriched granite, while the majority of epigenetic uranium mineralizations with economic potential are related to hydrothermal processes in areas where the bedrock is regionally uranium-enhanced. (Authors)

  15. Evaluation of endogenous control genes for gene expression studies across multiple tissues and in the specific sets of fat- and muscle-type samples of the pig.

    Science.gov (United States)

    Gu, Y R; Li, M Z; Zhang, K; Chen, L; Jiang, A A; Wang, J Y; Li, X W

    2011-08-01

    To normalize a set of quantitative real-time PCR (q-PCR) data, it is essential to determine an optimal number/set of housekeeping genes, as the abundance of housekeeping genes can vary across tissues or cells during different developmental stages, or even under certain environmental conditions. In this study, of the 20 commonly used endogenous control genes, 13, 18 and 17 genes exhibited credible stability in 56 different tissues, 10 types of adipose tissue and five types of muscle tissue, respectively. Our analysis clearly showed that three optimal housekeeping genes are adequate for an accurate normalization, which correlated well with the theoretical optimal number (r ≥ 0.94). In terms of economical and experimental feasibility, we recommend the use of the three most stable housekeeping genes for calculating the normalization factor. Based on our results, the three most stable housekeeping genes in all analysed samples (TOP2B, HSPCB and YWHAZ) are recommended for accurate normalization of q-PCR data. We also suggest that two different sets of housekeeping genes are appropriate for 10 types of adipose tissue (the HSPCB, ALDOA and GAPDH genes) and five types of muscle tissue (the TOP2B, HSPCB and YWHAZ genes), respectively. Our report will serve as a valuable reference for other studies aimed at measuring tissue-specific mRNA abundance in porcine samples. © 2011 Blackwell Verlag GmbH.

  16. Elevated expression of protein biosynthesis genes in liver and muscle of hibernating black bears (Ursus americanus).

    Science.gov (United States)

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Øivind; Stewart, Nathan C; Gracey, Andrew Y; Chang, Celia; Qin, Shizhen; Pertea, Geo; Quackenbush, John; Showe, Louise C; Showe, Michael K; Boyer, Bert B; Barnes, Brian M

    2009-04-10

    We conducted a large-scale gene expression screen using the 3,200 cDNA probe microarray developed specifically for Ursus americanus to detect expression differences in liver and skeletal muscle that occur during winter hibernation compared with animals sampled during summer. The expression of 12 genes, including RNA binding protein motif 3 (Rbm3), that are mostly involved in protein biosynthesis, was induced during hibernation in both liver and muscle. The Gene Ontology and Gene Set Enrichment analysis consistently showed a highly significant enrichment of the protein biosynthesis category by overexpressed genes in both liver and skeletal muscle during hibernation. Coordinated induction in transcriptional level of genes involved in protein biosynthesis is a distinctive feature of the transcriptome in hibernating black bears. This finding implies induction of translation and suggests an adaptive mechanism that contributes to a unique ability to reduce muscle atrophy over prolonged periods of immobility during hibernation. Comparing expression profiles in bears to small mammalian hibernators shows a general trend during hibernation of transcriptional changes that include induction of genes involved in lipid metabolism and carbohydrate synthesis as well as depression of genes involved in the urea cycle and detoxification function in liver.

  17. Geo-Enrichment and Semantic Enhancement of Metadata Sets to Augment Discovery in Geoportals

    Directory of Open Access Journals (Sweden)

    Bernhard Vockner

    2014-03-01

    Full Text Available Geoportals are established to function as main gateways to find, evaluate, and start “using” geographic information. Still, current geoportal implementations face problems in optimizing the discovery process due to semantic heterogeneity issues, which leads to low recall and low precision in performing text-based searches. Therefore, we propose an enhanced semantic discovery approach that supports multilingualism and information domain context. Thus, we present workflow that enriches existing structured metadata with synonyms, toponyms, and translated terms derived from user-defined keywords based on multilingual thesauri and ontologies. To make the results easier and understandable, we also provide automated translation capabilities for the resource metadata to support the user in conceiving the thematic content of the descriptive metadata, even if it has been documented using a language the user is not familiar with. In addition, to text-enable spatial filtering capabilities, we add additional location name keywords to metadata sets. These are based on the existing bounding box and shall tweak discovery scores when performing single text line queries. In order to improve the user’s search experience, we tailor faceted search strategies presenting an enhanced query interface for geo-metadata discovery that are transparently leveraging the underlying thesauri and ontologies.

  18. Evaluating biomarkers for prognostic enrichment of clinical trials.

    Science.gov (United States)

    Kerr, Kathleen F; Roth, Jeremy; Zhu, Kehao; Thiessen-Philbrook, Heather; Meisner, Allison; Wilson, Francis Perry; Coca, Steven; Parikh, Chirag R

    2017-12-01

    A potential use of biomarkers is to assist in prognostic enrichment of clinical trials, where only patients at relatively higher risk for an outcome of interest are eligible for the trial. We investigated methods for evaluating biomarkers for prognostic enrichment. We identified five key considerations when considering a biomarker and a screening threshold for prognostic enrichment: (1) clinical trial sample size, (2) calendar time to enroll the trial, (3) total patient screening costs and the total per-patient trial costs, (4) generalizability of trial results, and (5) ethical evaluation of trial eligibility criteria. Items (1)-(3) are amenable to quantitative analysis. We developed the Biomarker Prognostic Enrichment Tool for evaluating biomarkers for prognostic enrichment at varying levels of screening stringency. We demonstrate that both modestly prognostic and strongly prognostic biomarkers can improve trial metrics using Biomarker Prognostic Enrichment Tool. Biomarker Prognostic Enrichment Tool is available as a webtool at http://prognosticenrichment.com and as a package for the R statistical computing platform. In some clinical settings, even biomarkers with modest prognostic performance can be useful for prognostic enrichment. In addition to the quantitative analysis provided by Biomarker Prognostic Enrichment Tool, investigators must consider the generalizability of trial results and evaluate the ethics of trial eligibility criteria.

  19. Enrichment of conserved synaptic activity-responsive element in neuronal genes predicts a coordinated response of MEF2, CREB and SRF.

    Directory of Open Access Journals (Sweden)

    Fernanda M Rodríguez-Tornos

    Full Text Available A unique synaptic activity-responsive element (SARE sequence, composed of the consensus binding sites for SRF, MEF2 and CREB, is necessary for control of transcriptional upregulation of the Arc gene in response to synaptic activity. We hypothesize that this sequence is a broad mechanism that regulates gene expression in response to synaptic activation and during plasticity; and that analysis of SARE-containing genes could identify molecular mechanisms involved in brain disorders. To search for conserved SARE sequences in the mammalian genome, we used the SynoR in silico tool, and found the SARE cluster predominantly in the regulatory regions of genes expressed specifically in the nervous system; most were related to neural development and homeostatic maintenance. Two of these SARE sequences were tested in luciferase assays and proved to promote transcription in response to neuronal activation. Supporting the predictive capacity of our candidate list, up-regulation of several SARE containing genes in response to neuronal activity was validated using external data and also experimentally using primary cortical neurons and quantitative real time RT-PCR. The list of SARE-containing genes includes several linked to mental retardation and cognitive disorders, and is significantly enriched in genes that encode mRNA targeted by FMRP (fragile X mental retardation protein. Our study thus supports the idea that SARE sequences are relevant transcriptional regulatory elements that participate in plasticity. In addition, it offers a comprehensive view of how activity-responsive transcription factors coordinate their actions and increase the selectivity of their targets. Our data suggest that analysis of SARE-containing genes will reveal yet-undescribed pathways of synaptic plasticity and additional candidate genes disrupted in mental disease.

  20. Cardiovascular risk and lifestyle habits of consumers of a phytosterol-enriched yogurt in a real-life setting.

    Science.gov (United States)

    Paillard, F; Bruckert, E; Naelten, G; Picard, P; van Ganse, E

    2015-06-01

    Data on the characteristics of consumers of phytosterol-enriched products and modalities of consumption are rare. An observational study evaluating the lifestyle characteristics and cardiovascular risk (CVR) profile of phytosterol-enriched yogurt consumers was performed in France. Subjects were recruited from general practitioners via electronic medical records. Data were obtained from 358 consumers and 422 nonconsumers with 519 subject questionnaires (243 consumers, 276 nonconsumers; 67% response). Consumers had more cardiovascular risk factors than nonconsumers (2.0 ± 1.5 versus 1.6 ± 1.4; P Phytosterol-enriched yogurt intake conformed to recommendations in two-thirds of consumers and was mainly consumed because of concerns over cholesterol levels and CVR. The higher cardiovascular disease risk profile of phytosterol-enriched yogurt consumers corresponds to a population for whom European guidelines recommend lifestyle changes to manage cholesterol. The coherence of the data in terms of risk factors, adherence to lifestyle recommendations and the consumption of phytosterol-enriched yogurt conforming to recommendations reflects a health-conscious consumer population. © 2014 The British Dietetic Association Ltd.

  1. Enrichment and Preservation of Architectural Knowledge

    DEFF Research Database (Denmark)

    Beetz, Jakob; Blümel, Ina; Dietze, Stefan

    2016-01-01

    In the context of the EU FP7 DURAARK project (2013–2016), inter-disciplinary methods, technologies and tools have been researched and developed, that support the Long Term Preservation of semantically enriched digital representations of built structures. The results of the research efforts include...... approaches of semi-automatically deriving building models from point cloud data sets acquired from laser scans and the integration and overlay of such representations with explicit Building Information Models (BIM). We introduce novel ways for the further semantic enrichment of such hybrid building models...

  2. [Analysis of tissue-specific differentially methylated genes with differential gene expression in non-small cell lung cancer].

    Science.gov (United States)

    Yin, L G; Zou, Z Q; Zhao, H Y; Zhang, C L; Shen, J G; Qi, L; Qi, M; Xue, Z Q

    2014-01-01

    Adenocarcinoma (ADC) and squamous cell carcinomas (SCC) are two subtypes of non-small cell lung carcinomas which are regarded as the leading cause of cancer-related malignancy worldwide. The aim of this study is to detect the differentially methylated loci (DMLs) and differentially methylated genes (DMGs) of these two tumor sets, and then to illustrate the different expression level of specific methylated genes. Using TCGA database and Illumina HumanMethylation 27 arrays, we first screened the DMGs and DMLs in tumor samples. Then, we explored the BiologicalProcess terms of hypermethylated and hypomethylated genes using Functional Gene Ontology (GO) catalogues. Hypermethylation intensively occurred in CpG-island, whereas hypomethylation was located in non-CpG-island. Most SCC and ADC hypermethylated genes involved GO function of DNA dependenit regulation of transcription, and hypomethylated genes mainly 'enriched in the term of immune responses. Additionally, the expression level of specific differentially methylated genesis distinctbetween ADC and SCC. It is concluded that ADC and SCC have different methylated status that might play an important role in carcinogenesis.

  3. A comparative analysis of biclustering algorithms for gene expression data

    Science.gov (United States)

    Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.

    2013-01-01

    The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837

  4. Replication error deficient and proficient colorectal cancer gene expression differences caused by 3'UTR polyT sequence deletions

    DEFF Research Database (Denmark)

    Wilding, Jennifer L; McGowan, Simon; Liu, Ying

    2010-01-01

    , and have distinct pathologies. Regulatory sequences controlling all aspects of mRNA processing, especially including message stability, are found in the 3'UTR sequence of most genes. The relevant sequences are typically A/U-rich elements or U repeats. Microarray analysis of 14 RER+ (deficient) and 16 RER......- (proficient) colorectal cancer cell lines confirms a striking difference in expression profiles. Analysis of the incidence of mononucleotide repeat sequences in the 3'UTRs, 5'UTRs, and coding sequences of those genes most differentially expressed in RER+ versus RER- cell lines has shown that much...... of this differential expression can be explained by the occurrence of a massive enrichment of genes with 3'UTR T repeats longer than 11 base pairs in the most differentially expressed genes. This enrichment was confirmed by analysis of two published consensus sets of RER differentially expressed probesets for a large...

  5. Sex Differences in Drosophila Somatic Gene Expression: Variation and Regulation by doublesex

    Directory of Open Access Journals (Sweden)

    Michelle N. Arbeitman

    2016-07-01

    Full Text Available Sex differences in gene expression have been widely studied in Drosophila melanogaster. Sex differences vary across strains, but many molecular studies focus on only a single strain, or on genes that show sexually dimorphic expression in many strains. How extensive variability is and whether this variability occurs among genes regulated by sex determination hierarchy terminal transcription factors is unknown. To address these questions, we examine differences in sexually dimorphic gene expression between two strains in Drosophila adult head tissues. We also examine gene expression in doublesex (dsx mutant strains to determine which sex-differentially expressed genes are regulated by DSX, and the mode by which DSX regulates expression. We find substantial variation in sex-differential expression. The sets of genes with sexually dimorphic expression in each strain show little overlap. The prevalence of different DSX regulatory modes also varies between the two strains. Neither the patterns of DSX DNA occupancy, nor mode of DSX regulation explain why some genes show consistent sex-differential expression across strains. We find that the genes identified as regulated by DSX in this study are enriched with known sites of DSX DNA occupancy. Finally, we find that sex-differentially expressed genes and genes regulated by DSX are highly enriched on the fourth chromosome. These results provide insights into a more complete pool of potential DSX targets, as well as revealing the molecular flexibility of DSX regulation.

  6. Transcriptome-wide selection of a reliable set of reference genes for gene expression studies in potato cyst nematodes (Globodera spp.).

    Science.gov (United States)

    Sabeh, Michael; Duceppe, Marc-Olivier; St-Arnaud, Marc; Mimee, Benjamin

    2018-01-01

    Relative gene expression analyses by qRT-PCR (quantitative reverse transcription PCR) require an internal control to normalize the expression data of genes of interest and eliminate the unwanted variation introduced by sample preparation. A perfect reference gene should have a constant expression level under all the experimental conditions. However, the same few housekeeping genes selected from the literature or successfully used in previous unrelated experiments are often routinely used in new conditions without proper validation of their stability across treatments. The advent of RNA-Seq and the availability of public datasets for numerous organisms are opening the way to finding better reference genes for expression studies. Globodera rostochiensis is a plant-parasitic nematode that is particularly yield-limiting for potato. The aim of our study was to identify a reliable set of reference genes to study G. rostochiensis gene expression. Gene expression levels from an RNA-Seq database were used to identify putative reference genes and were validated with qRT-PCR analysis. Three genes, GR, PMP-3, and aaRS, were found to be very stable within the experimental conditions of this study and are proposed as reference genes for future work.

  7. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  8. Mining tissue specificity, gene connectivity and disease association to reveal a set of genes that modify the action of disease causing genes

    Directory of Open Access Journals (Sweden)

    Reverter Antonio

    2008-09-01

    Full Text Available Abstract Background The tissue specificity of gene expression has been linked to a number of significant outcomes including level of expression, and differential rates of polymorphism, evolution and disease association. Recent studies have also shown the importance of exploring differential gene connectivity and sequence conservation in the identification of disease-associated genes. However, no study relates gene interactions with tissue specificity and disease association. Methods We adopted an a priori approach making as few assumptions as possible to analyse the interplay among gene-gene interactions with tissue specificity and its subsequent likelihood of association with disease. We mined three large datasets comprising expression data drawn from massively parallel signature sequencing across 32 tissues, describing a set of 55,606 true positive interactions for 7,197 genes, and microarray expression results generated during the profiling of systemic inflammation, from which 126,543 interactions among 7,090 genes were reported. Results Amongst the myriad of complex relationships identified between expression, disease, connectivity and tissue specificity, some interesting patterns emerged. These include elevated rates of expression and network connectivity in housekeeping and disease-associated tissue-specific genes. We found that disease-associated genes are more likely to show tissue specific expression and most frequently interact with other disease genes. Using the thresholds defined in these observations, we develop a guilt-by-association algorithm and discover a group of 112 non-disease annotated genes that predominantly interact with disease-associated genes, impacting on disease outcomes. Conclusion We conclude that parameters such as tissue specificity and network connectivity can be used in combination to identify a group of genes, not previously confirmed as disease causing, that are involved in interactions with disease causing

  9. Common Mechanisms Underlying Refractive Error Identified in Functional Analysis of Gene Lists From Genome-Wide Association Study Results in 2 European British Cohorts

    Science.gov (United States)

    Hysi, Pirro G.; Mahroo, Omar A.; Cumberland, Phillippa; Wojciechowski, Robert; Williams, Katie M.; Young, Terri L.; Mackey, David A.; Rahi, Jugnoo S.; Hammond, Christopher J.

    2014-01-01

    IMPORTANCE To date, relatively few genes responsible for a fraction of heritability have been identified by means of large genetic association studies of refractive error. OBJECTIVE To explore the genetic mechanisms that lead to refractive error in the general population. DESIGN, SETTING, AND PARTICIPANTS Genome-wide association studies were carried out in 2 British population-based independent cohorts (N = 5928 participants) to identify genes moderately associated with refractive error. MAIN OUTCOMES AND MEASURES Enrichment analyses were used to identify sets of genes overrepresented in both cohorts. Enriched groups of genes were compared between both participating cohorts as a further measure against random noise. RESULTS Groups of genes enriched at highly significant statistical levels were remarkably consistent in both cohorts. In particular, these results indicated that plasma membrane (P = 7.64 × 10−30), cell-cell adhesion (P = 2.42 × 10−18), synaptic transmission (P = 2.70 × 10−14), calcium ion binding (P = 3.55 × 10−15), and cation channel activity (P = 2.77 × 10−14) were significantly overrepresented in relation to refractive error. CONCLUSIONS AND RELEVANCE These findings provide evidence that development of refractive error in the general population is related to the intensity of photosignal transduced from the retina, which may have implications for future interventions to minimize this disorder. Pathways connected to the procession of the nerve impulse are major mechanisms involved in the development of refractive error in populations of European origin. PMID:24264139

  10. Neonicotinoid Insecticides Alter the Gene Expression Profile of Neuron-Enriched Cultures from Neonatal Rat Cerebellum

    Directory of Open Access Journals (Sweden)

    Junko Kimura-Kuroda

    2016-10-01

    Full Text Available Neonicotinoids are considered safe because of their low affinities to mammalian nicotinic acetylcholine receptors (nAChRs relative to insect nAChRs. However, because of importance of nAChRs in mammalian brain development, there remains a need to establish the safety of chronic neonicotinoid exposures with regards to children’s health. Here we examined the effects of longterm (14 days and low dose (1 μM exposure of neuron-enriched cultures from neonatal rat cerebellum to nicotine and two neonicotinoids: acetamiprid and imidacloprid. Immunocytochemistry revealed no differences in the number or morphology of immature neurons or glial cells in any group versus untreated control cultures. However, a slight disturbance in Purkinje cell dendritic arborization was observed in the exposed cultures. Next we performed transcriptome analysis on total RNAs using microarrays, and identified significant differential expression (p < 0.05, q < 0.05, ≥1.5 fold between control cultures versus nicotine-, acetamiprid-, or imidacloprid-exposed cultures in 34, 48, and 67 genes, respectively. Common to all exposed groups were nine genes essential for neurodevelopment, suggesting that chronic neonicotinoid exposure alters the transcriptome of the developing mammalian brain in a similar way to nicotine exposure. Our results highlight the need for further careful investigations into the effects of neonicotinoids in the developing mammalian brain.

  11. FUN-L: gene prioritization for RNAi screens.

    Science.gov (United States)

    Lees, Jonathan G; Hériché, Jean-Karim; Morilla, Ian; Fernández, José M; Adler, Priit; Krallinger, Martin; Vilo, Jaak; Valencia, Alfonso; Ellenberg, Jan; Ranea, Juan A; Orengo, Christine

    2015-06-15

    Most biological processes remain only partially characterized with many components still to be identified. Given that a whole genome can usually not be tested in a functional assay, identifying the genes most likely to be of interest is of critical importance to avoid wasting resources. Given a set of known functionally related genes and using a state-of-the-art approach to data integration and mining, our Functional Lists (FUN-L) method provides a ranked list of candidate genes for testing. Validation of predictions from FUN-L with independent RNAi screens confirms that FUN-L-produced lists are enriched in genes with the expected phenotypes. In this article, we describe a website front end to FUN-L. The website is freely available to use at http://funl.org © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  12. Transcriptome Analysis of Porcine PBMCs Reveals the Immune Cascade Response and Gene Ontology Terms Related to Cell Death and Fibrosis in the Progression of Liver Failure

    Directory of Open Access Journals (Sweden)

    YiMin Zhang

    2018-01-01

    Full Text Available Background. The key gene sets involved in the progression of acute liver failure (ALF, which has a high mortality rate, remain unclear. This study aims to gain a deeper understanding of the transcriptional response of peripheral blood mononuclear cells (PBMCs following ALF. Methods. ALF was induced by D-galactosamine (D-gal in a porcine model. PBMCs were separated at time zero (baseline group, 36 h (failure group, and 60 h (dying group after D-gal injection. Transcriptional profiling was performed using RNA sequencing and analysed using DAVID bioinformatics resources. Results. Compared with the baseline group, 816 and 1,845 differentially expressed genes (DEGs were identified in the failure and dying groups, respectively. A total of five and two gene ontology (GO term clusters were enriched in 107 GO terms in the failure group and 154 GO terms in the dying group. These GO clusters were primarily immune-related, including genes regulating the inflammasome complex and toll-like receptor signalling pathways. Specifically, GO terms related to cell death, including apoptosis, pyroptosis, and autophagy, and those related to fibrosis, coagulation dysfunction, and hepatic encephalopathy were enriched. Seven Kyoto Encyclopedia of Genes and Genomes (KEGG pathways, cytokine-cytokine receptor interaction, hematopoietic cell lineage, lysosome, rheumatoid arthritis, malaria, and phagosome and pertussis pathways were mapped for DEGs in the failure group. All of these seven KEGG pathways were involved in the 19 KEGG pathways mapped in the dying group. Conclusion. We found that the dramatic PBMC transcriptome changes triggered by ALF progression was predominantly related to immune responses. The enriched GO terms related to cell death, fibrosis, and so on, as indicated by PBMC transcriptome analysis, seem to be useful in elucidating potential key gene sets in the progression of ALF. A better understanding of these gene sets might be of preventive or

  13. Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression

    Directory of Open Access Journals (Sweden)

    Calvo-Dmgz D.

    2012-12-01

    Full Text Available DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. This large amount of gene expression data has been used by researchers seeking diagnosis of diseases like cancer using machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the last decade. This work integrates explicit biological knowledge, provided as gene sets, into the classication process by means of Variable Precision Rough Set Theory (VPRS. The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification which is able to incorporate prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then we apply rough set theory to select the most promising supergenes and to derive a set of easy interpretable classification rules. The proposed model is evaluated over three breast cancer microarrays datasets obtaining successful results compared to classical classification techniques. The experimental results shows that there are not significat differences between our model and classical techniques but it is able to provide a biological-interpretable explanation of how it classifies new samples.

  14. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  15. Enrichment of G2/M cell cycle phase in human pluripotent stem cells enhances HDR-mediated gene repair with customizable endonucleases.

    Science.gov (United States)

    Yang, Diane; Scavuzzo, Marissa A; Chmielowiec, Jolanta; Sharp, Robert; Bajic, Aleksandar; Borowiak, Malgorzata

    2016-02-18

    Efficient gene editing is essential to fully utilize human pluripotent stem cells (hPSCs) in regenerative medicine. Custom endonuclease-based gene targeting involves two mechanisms of DNA repair: homology directed repair (HDR) and non-homologous end joining (NHEJ). HDR is the preferred mechanism for common applications such knock-in, knock-out or precise mutagenesis, but remains inefficient in hPSCs. Here, we demonstrate that synchronizing synchronizing hPSCs in G2/M with ABT phase increases on-target gene editing, defined as correct targeting cassette integration, 3 to 6 fold. We observed improved efficiency using ZFNs, TALENs, two CRISPR/Cas9, and CRISPR/Cas9 nickase to target five genes in three hPSC lines: three human embryonic stem cell lines, neural progenitors and diabetic iPSCs. neural progenitors and diabetic iPSCs. Reversible synchronization has no effect on pluripotency or differentiation. The increase in on-target gene editing is locus-independent and specific to the cell cycle phase as G2/M phase enriched cells show a 6-fold increase in targeting efficiency compared to cells in G1 phase. Concurrently inhibiting NHEJ with SCR7 does not increase HDR or improve gene targeting efficiency further, indicating that HR is the major DNA repair mechanism after G2/M phase arrest. The approach outlined here makes gene editing in hPSCs a more viable tool for disease modeling, regenerative medicine and cell-based therapies.

  16. Gene Set Analyses of Genome-Wide Association Studies on 49 Quantitative Traits Measured in a Single Genetic Epidemiology Dataset

    Directory of Open Access Journals (Sweden)

    Jihye Kim

    2013-09-01

    Full Text Available Gene set analysis is a powerful tool for interpreting a genome-wide association study result and is gaining popularity these days. Comparison of the gene sets obtained for a variety of traits measured from a single genetic epidemiology dataset may give insights into the biological mechanisms underlying these traits. Based on the previously published single nucleotide polymorphism (SNP genotype data on 8,842 individuals enrolled in the Korea Association Resource project, we performed a series of systematic genome-wide association analyses for 49 quantitative traits of basic epidemiological, anthropometric, or blood chemistry parameters. Each analysis result was subjected to subsequent gene set analyses based on Gene Ontology (GO terms using gene set analysis software, GSA-SNP, identifying a set of GO terms significantly associated to each trait (pcorr < 0.05. Pairwise comparison of the traits in terms of the semantic similarity in their GO sets revealed surprising cases where phenotypically uncorrelated traits showed high similarity in terms of biological pathways. For example, the pH level was related to 7 other traits that showed low phenotypic correlations with it. A literature survey implies that these traits may be regulated partly by common pathways that involve neuronal or nerve systems.

  17. Modeling of Transients in an Enrichment Circuit

    International Nuclear Information System (INIS)

    Fernandino, Maria; Delmastro, Dario; Brasnarof, Daniel

    2003-01-01

    In the present work a mathematical model is presented in order to describe the dynamic behavior inside a closed enrichment loop, the latter representing a single stage of an uranium gaseous diffusion enrichment cascade.The analytical model is turned into a numerical model, and implemented through a computational code.Transients of two species separation were numerically analyzed, including setting times of each magnitude, behavior of each one of them during different transients, and redistribution of concentrations along the closed loop

  18. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions.

    Science.gov (United States)

    Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade

    2015-11-14

    FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.

  19. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  20. Measurement of the enrichment of uranium in the pipework of a gas centrifuge enrichment plant

    International Nuclear Information System (INIS)

    Packer, T.W.; Lees, E.W.; Close, D.; Nixon, K.V.; Pratt, J.C.; Strittmatter, R.

    1985-01-01

    The US and UK have been separately working on the development of a NDA instrument to determine the enrichment of gaseous UF 6 at low pressures in cascade header pipework in line with the conclusions of the Hexapartite Safeguards Project viz. the instrument is capable of making a ''go/no go'' decision of whether the enrichment is less than/greater than 20%. Recently, there has been a series of very useful technical exchanges of ideas and information between the two countries. This has led to a technical formulation for such an instrumentation based on γ-ray spectrometry which, although plant-specific in certain features, nevertheless is based on the same physical principles. Experimental results from commercially operating enrichment plants are very encouraging and indicate that a complete measurement including set up time on the pipe should be attainable in about 30 minutes when measuring pipes of diameter around 110 mm. 5 refs., 4 figs

  1. Identification of Multiple Dehalogenase Genes Involved in Tetrachloroethene-to-Ethene Dechlorination in a Dehalococcoides-Dominated Enrichment Culture

    Directory of Open Access Journals (Sweden)

    Mohamed Ismaeil

    2017-01-01

    Full Text Available Chloroethenes (CEs are widespread groundwater toxicants that are reductively dechlorinated to nontoxic ethene (ETH by members of Dehalococcoides. This study established a Dehalococcoides-dominated enrichment culture (designated “YN3” that dechlorinates tetrachloroethene (PCE to ETH with high dechlorination activity, that is, complete dechlorination of 800 μM PCE to ETH within 14 days in the presence of Dehalococcoides species at 5.7±1.9×107 copies of 16S rRNA gene/mL. The metagenome of YN3 harbored 18 rdhA genes (designated YN3rdhA1–18 encoding the catalytic subunit of reductive dehalogenase (RdhA, four of which were suggested to be involved in PCE-to-ETH dechlorination based on significant increases in their transcription in response to CE addition. The predicted proteins for two of these four genes, YN3RdhA8 and YN3RdhA16, showed 94% and 97% of amino acid similarity with PceA and VcrA, which are well known to dechlorinate PCE to trichloroethene (TCE and TCE to ETH, respectively. The other two rdhAs, YN3rdhA6 and YN3rdhA12, which were never proved as rdhA for CEs, showed particularly high transcription upon addition of vinyl chloride (VC, with 75±38 and 16±8.6 mRNA copies per gene, respectively, suggesting their possible functions as novel VC-reductive dehalogenases. Moreover, metagenome data indicated the presence of three coexisting bacterial species, including novel species of the genus Bacteroides, which might promote CE dechlorination by Dehalococcoides.

  2. In Search of 'Birth Month Genes': Using Existing Data Repositories to Locate Genes Underlying Birth Month-Disease Relationships.

    Science.gov (United States)

    Boland, Mary Regina; Tatonetti, Nicholas P

    2016-01-01

    Prenatal and perinatal exposures vary seasonally (e.g., sunlight, allergens) and many diseases are linked with variance in exposure. Epidemiologists often measure these changes using birth month as a proxy for seasonal variance. Likewise, Genome-Wide Association Studies have associated or implicated these same diseases with many genes. Both disparate data types (epidemiological and genetic) can provide key insights into the underlying disease biology. We developed an algorithm that links 1) epidemiological data from birth month studies with 2) genetic data from published gene-disease association studies. Our framework uses existing data repositories - PubMed, DisGeNET and Gene Ontology - to produce a bipartite network that connects enriched seasonally varying biofactorss with birth month dependent diseases (BMDDs) through their overlapping developmental gene sets. As a proof-of-concept, we investigate 7 known BMDDs and highlight three important biological networks revealed by our algorithm and explore some interesting genetic mechanisms potentially responsible for the seasonal contribution to BMDDs.

  3. MADS goes genomic in conifers: towards determining the ancestral set of MADS-box genes in seed plants.

    Science.gov (United States)

    Gramzow, Lydia; Weilandt, Lisa; Theißen, Günter

    2014-11-01

    MADS-box genes comprise a gene family coding for transcription factors. This gene family expanded greatly during land plant evolution such that the number of MADS-box genes ranges from one or two in green algae to around 100 in angiosperms. Given the crucial functions of MADS-box genes for nearly all aspects of plant development, the expansion of this gene family probably contributed to the increasing complexity of plants. However, the expansion of MADS-box genes during one important step of land plant evolution, namely the origin of seed plants, remains poorly understood due to the previous lack of whole-genome data for gymnosperms. The newly available genome sequences of Picea abies, Picea glauca and Pinus taeda were used to identify the complete set of MADS-box genes in these conifers. In addition, MADS-box genes were identified in the growing number of transcriptomes available for gymnosperms. With these datasets, phylogenies were constructed to determine the ancestral set of MADS-box genes of seed plants and to infer the ancestral functions of these genes. Type I MADS-box genes are under-represented in gymnosperms and only a minimum of two Type I MADS-box genes have been present in the most recent common ancestor (MRCA) of seed plants. In contrast, a large number of Type II MADS-box genes were found in gymnosperms. The MRCA of extant seed plants probably possessed at least 11-14 Type II MADS-box genes. In gymnosperms two duplications of Type II MADS-box genes were found, such that the MRCA of extant gymnosperms had at least 14-16 Type II MADS-box genes. The implied ancestral set of MADS-box genes for seed plants shows simplicity for Type I MADS-box genes and remarkable complexity for Type II MADS-box genes in terms of phylogeny and putative functions. The analysis of transcriptome data reveals that gymnosperm MADS-box genes are expressed in a great variety of tissues, indicating diverse roles of MADS-box genes for the development of gymnosperms. This study is

  4. Enrichment of deleterious variants of mitochondrial DNA polymerase gene (POLG1) in bipolar disorder.

    Science.gov (United States)

    Kasahara, Takaoki; Ishiwata, Mizuho; Kakiuchi, Chihiro; Fuke, Satoshi; Iwata, Nakao; Ozaki, Norio; Kunugi, Hiroshi; Minabe, Yoshio; Nakamura, Kazuhiko; Iwata, Yasuhide; Fujii, Kumiko; Kanba, Shigenobu; Ujike, Hiroshi; Kusumi, Ichiro; Kataoka, Muneko; Matoba, Nana; Takata, Atsushi; Iwamoto, Kazuya; Yoshikawa, Takeo; Kato, Tadafumi

    2017-08-01

    Rare missense variants, which likely account for a substantial portion of the genetic 'dark matter' for a common complex disease, are challenging because the impacts of variants on disease development are difficult to substantiate. This study aimed to examine the impacts of amino acid substitution variants in the POLG1 found in bipolar disorder, as an example and proof of concept, in three different modalities of assessment: in silico predictions, in vitro biochemical assays, and clinical evaluation. We then tested whether deleterious variants in POLG1 contributed to the genetics of bipolar disorder. We searched for variants in the POLG1 gene in 796 Japanese patients with bipolar disorder and 767 controls and comprehensively investigated all 23 identified variants in the three modalities of assessment. POLG1 encodes mitochondrial DNA polymerase and is one of the causative genes for a Mendelian-inheritance mitochondrial disease, which is occasionally accompanied by mood disorders. The healthy control data from the Tohoku Medical Megabank Organization were also employed. Although the frequency of carriers of deleterious variants varied from one method to another, every assessment achieved the same conclusion that deleterious POLG1 variants were significantly enriched in the variants identified in patients with bipolar disorder compared to those in controls. Together with mitochondrial dysfunction in bipolar disorder, the present results suggested deleterious POLG1 variants as a credible risk for the multifactorial disease. © 2016 The Authors. Psychiatry and Clinical Neurosciences published by John Wiley & Sons Australia, Ltd on behalf of Japanese Society of Psychiatry and Neurology.

  5. 16S rRNA gene-based molecular analysis of mat-forming and accompanying bacteria covering organically-enriched marine sediments underlying a salmon farm in Southern Chile (Calbuco Island)

    OpenAIRE

    Aranda, Carlos; Paredes, Javier; Valenzuela, Cristian; Lam, Phyllis; Guillou, Laure

    2010-01-01

    The mat forming bacteria covering organic matter-enriched and anoxic marine sediments underlying a salmon farm in Southern Chile, were examined using 16S rRNA gene phylogenies. This mat was absent in the sea bed outside the direct influence of the farm (360 m outside fish cages). Based on nearly complete 16S rRNA gene sequences (-1500 bp), mat-forming filamentous cells were settled as the sulphur-oxidizing and putatively dissimilative nitrate-reducing Beggiatoa spp., being closely related (up...

  6. Synaptic, transcriptional and chromatin genes disrupted in autism.

    Science.gov (United States)

    De Rubeis, Silvia; He, Xin; Goldberg, Arthur P; Poultney, Christopher S; Samocha, Kaitlin; Cicek, A Erucment; Kou, Yan; Liu, Li; Fromer, Menachem; Walker, Susan; Singh, Tarinder; Klei, Lambertus; Kosmicki, Jack; Shih-Chen, Fu; Aleksic, Branko; Biscaldi, Monica; Bolton, Patrick F; Brownfeld, Jessica M; Cai, Jinlu; Campbell, Nicholas G; Carracedo, Angel; Chahrour, Maria H; Chiocchetti, Andreas G; Coon, Hilary; Crawford, Emily L; Curran, Sarah R; Dawson, Geraldine; Duketis, Eftichia; Fernandez, Bridget A; Gallagher, Louise; Geller, Evan; Guter, Stephen J; Hill, R Sean; Ionita-Laza, Juliana; Jimenz Gonzalez, Patricia; Kilpinen, Helena; Klauck, Sabine M; Kolevzon, Alexander; Lee, Irene; Lei, Irene; Lei, Jing; Lehtimäki, Terho; Lin, Chiao-Feng; Ma'ayan, Avi; Marshall, Christian R; McInnes, Alison L; Neale, Benjamin; Owen, Michael J; Ozaki, Noriio; Parellada, Mara; Parr, Jeremy R; Purcell, Shaun; Puura, Kaija; Rajagopalan, Deepthi; Rehnström, Karola; Reichenberg, Abraham; Sabo, Aniko; Sachse, Michael; Sanders, Stephan J; Schafer, Chad; Schulte-Rüther, Martin; Skuse, David; Stevens, Christine; Szatmari, Peter; Tammimies, Kristiina; Valladares, Otto; Voran, Annette; Li-San, Wang; Weiss, Lauren A; Willsey, A Jeremy; Yu, Timothy W; Yuen, Ryan K C; Cook, Edwin H; Freitag, Christine M; Gill, Michael; Hultman, Christina M; Lehner, Thomas; Palotie, Aaarno; Schellenberg, Gerard D; Sklar, Pamela; State, Matthew W; Sutcliffe, James S; Walsh, Christiopher A; Scherer, Stephen W; Zwick, Michael E; Barett, Jeffrey C; Cutler, David J; Roeder, Kathryn; Devlin, Bernie; Daly, Mark J; Buxbaum, Joseph D

    2014-11-13

    The genetic architecture of autism spectrum disorder involves the interplay of common and rare variants and their impact on hundreds of genes. Using exome sequencing, here we show that analysis of rare coding variation in 3,871 autism cases and 9,937 ancestry-matched or parental controls implicates 22 autosomal genes at a false discovery rate (FDR) < 0.05, plus a set of 107 autosomal genes strongly enriched for those likely to affect risk (FDR < 0.30). These 107 genes, which show unusual evolutionary constraint against mutations, incur de novo loss-of-function mutations in over 5% of autistic subjects. Many of the genes implicated encode proteins for synaptic formation, transcriptional regulation and chromatin-remodelling pathways. These include voltage-gated ion channels regulating the propagation of action potentials, pacemaking and excitability-transcription coupling, as well as histone-modifying enzymes and chromatin remodellers-most prominently those that mediate post-translational lysine methylation/demethylation modifications of histones.

  7. The nature of mathematical enrichment: a case study of implementation

    Directory of Open Access Journals (Sweden)

    Jennifer Susan Piggott

    2007-12-01

    Full Text Available This paper reports a framework for describing the nature of mathematics enrichment that emerged from a case study based on the work of the NRICH Project (www.nrich.maths.org team when producing “mathematics enrichment trails” (an ordered set of related mathematics problems and support materials. A range of data sources, including the trails, trail development sessions, related literature and the views of colleagues were used to inform the findings. The data were analysed using NVivo and involved the development of two complementary coding systems. One, drawn from the data itself, gave evidence of views of the content aspects of mathematical enrichment. The other, specifically designed and informed by the literature, was used to aid the analysis of the roles of teaching and learning inherent in views of enrichment described by participants. The framework describes the content of an enrichment curriculum as well as implications for teaching and learning, the experiences of learners and the features of settings where this occurs. To support this, some detail is provided on the role, nature and purpose of problem-solving and what constitutes a good problem. While emerging from a particular context, the framework highlights the need for debate concerning the audience for mathematics enrichment, particularly in questioning the commonly held belief that its value is in supporting the needs of the mathematically most able. The framework also has potential value through offering a focus for debate within the wider community concerning the nature of mathematics enrichment and as a reference point for evaluating the potential of existing or new curriculum to deliver mathematics enrichment.

  8. Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk

    DEFF Research Database (Denmark)

    Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan

    2015-01-01

    BACKGROUND: Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified...... in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). RESULTS: Gene set enrichment analysis...

  9. Genome-Wide Gene Set Analysis for Identification of Pathways Associated with Alcohol Dependence

    Science.gov (United States)

    Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.

    2013-01-01

    It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047

  10. Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

    Science.gov (United States)

    Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

    2018-06-01

    Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.

  11. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues.

    Science.gov (United States)

    Ye, Mao; Sun, Mingming; Feng, Yanfang; Wan, Jinzhong; Xie, Shanni; Tian, Da; Zhao, Yu; Wu, Jun; Hu, Feng; Li, Huixin; Jiang, Xin

    2016-05-15

    Considering the potential threat of vegetables growing in antibiotic-polluted soil with high abundance of antibiotic-resistant genes (ARGs) against human health through the food chain, it is thus urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivation with biochar amendment exhibited the greatest soil SA dissipation as well as the significant improvement of lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA resistant bacteria nor sul genes were detected in new leaves. It was the first study to demonstrate that biochar amendment can be a practical strategy to protect lettuce safety growing in SA-polluted soil with rich ARB and ARGs. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Blood Gene Expression Profiling of Breast Cancer Survivors Experiencing Fibrosis

    International Nuclear Information System (INIS)

    Landmark-Hoyvik, Hege; Dumeaux, Vanessa; Reinertsen, Kristin V.; Edvardsen, Hege; Fossa, Sophie D.; Borresen-Dale, Anne-Lise

    2011-01-01

    Purpose: To extend knowledge on the mechanisms and pathways involved in maintenance of radiation-induced fibrosis (RIF) by performing gene expression profiling of whole blood from breast cancer (BC) survivors with and without fibrosis 3-7 years after end of radiotherapy treatment. Methods and Materials: Gene expression profiles from blood were obtained for 254 BC survivors derived from a cohort of survivors, treated with adjuvant radiotherapy for breast cancer 3-7 years earlier. Analyses of transcriptional differences in blood gene expression between BC survivors with fibrosis (n = 31) and BC survivors without fibrosis (n = 223) were performed using R version 2.8.0 and tools from the Bioconductor project. Gene sets extracted through a literature search on fibrosis and breast cancer were subsequently used in gene set enrichment analysis. Results: Substantial differences in blood gene expression between BC survivors with and without fibrosis were observed, and 87 differentially expressed genes were identified through linear analysis. Transforming growth factor-β1 signaling was identified as the most significant gene set, showing a down-regulation of most of the core genes, together with up-regulation of a transcriptional activator of the inhibitor of fibrinolysis, Plasminogen activator inhibitor 1 in the BC survivors with fibrosis. Conclusion: Transforming growth factor-β1 signaling was found down-regulated during the maintenance phase of fibrosis as opposed to the up-regulation reported during the early, initiating phase of fibrosis. Hence, once the fibrotic tissue has developed, the maintenance phase might rather involve a deregulation of fibrinolysis and altered degradation of extracellular matrix components.

  13. Consensus strategy in genes prioritization and combined bioinformatics analysis for preeclampsia pathogenesis.

    Science.gov (United States)

    Tejera, Eduardo; Cruz-Monteagudo, Maykel; Burgos, Germán; Sánchez, María-Eugenia; Sánchez-Rodríguez, Aminael; Pérez-Castillo, Yunierkis; Borges, Fernanda; Cordeiro, Maria Natália Dias Soeiro; Paz-Y-Miño, César; Rebelo, Irene

    2017-08-08

    Preeclampsia is a multifactorial disease with unknown pathogenesis. Even when recent studies explored this disease using several bioinformatics tools, the main objective was not directed to pathogenesis. Additionally, consensus prioritization was proved to be highly efficient in the recognition of genes-disease association. However, not information is available about the consensus ability to early recognize genes directly involved in pathogenesis. Therefore our aim in this study is to apply several theoretical approaches to explore preeclampsia; specifically those genes directly involved in the pathogenesis. We firstly evaluated the consensus between 12 prioritization strategies to early recognize pathogenic genes related to preeclampsia. A communality analysis in the protein-protein interaction network of previously selected genes was done including further enrichment analysis. The enrichment analysis includes metabolic pathways as well as gene ontology. Microarray data was also collected and used in order to confirm our results or as a strategy to weight the previously enriched pathways. The consensus prioritized gene list was rationally filtered to 476 genes using several criteria. The communality analysis showed an enrichment of communities connected with VEGF-signaling pathway. This pathway is also enriched considering the microarray data. Our result point to VEGF, FLT1 and KDR as relevant pathogenic genes, as well as those connected with NO metabolism. Our results revealed that consensus strategy improve the detection and initial enrichment of pathogenic genes, at least in preeclampsia condition. Moreover the combination of the first percent of the prioritized genes with protein-protein interaction network followed by communality analysis reduces the gene space. This approach actually identifies well known genes related with pathogenesis. However, genes like HSP90, PAK2, CD247 and others included in the first 1% of the prioritized list need to be further

  14. Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?

    Science.gov (United States)

    Kaur, Simranjeet; Pociot, Flemming

    2015-07-13

    Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.

  15. Modular enrichment measurement system for in-situ enrichment assay

    International Nuclear Information System (INIS)

    Stewart, J.P.

    1976-01-01

    A modular enrichment measurement system has been designed and is in operation within General Electric's Nuclear Fuel Fabrication Facility for the in-situ enrichment assay of uranium-bearing materials in process containers. This enrichment assay system, which is based on the ''enrichment meter'' concept, is an integral part of the site's enrichment control program and is used in the in-situ assay of the enrichment of uranium dioxide (UO 2 ) powder in process containers (five gallon pails). The assay system utilizes a commercially available modular counting system and a collimnator designed for compatability with process container transport lines and ease of operator access. The system has been upgraded to include a microprocessor-based controller to perform system operation functions and to provide data acquisition and processing functions. Standards have been fabricated and qualified for the enrichment assay of several types of uranium-bearing materials, including UO 2 powders. The assay system has performed in excess of 20,000 enrichment verification measurements annually and has significantly contributed to the facility's enrichment control program

  16. Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization.

    Science.gov (United States)

    Jia, Zhilong; Zhang, Xiang; Guan, Naiyang; Bo, Xiaochen; Barnes, Michael R; Luo, Zhigang

    2015-01-01

    RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher's discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes' weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher's criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data.

  17. South Australia, uranium enrichment

    International Nuclear Information System (INIS)

    1976-02-01

    The Report sets out the salient data relating to the establishment of a uranium processing centre at Redcliff in South Australia. It is conceived as a major development project for the Commonwealth, the South Australian Government and Australian Industry comprising the refining and enrichment of uranium produced from Australian mines. Using the data currently available in respect of markets, demand, technology and possible financial return from overseas sales, the project could be initiated immediately with hexafluoride production, followed rapidly in stages by enrichment production using the centrifuge process. A conceptual development plan is presented, involving a growth pattern that would be closely synchronised with the mining and production of yellowcake. The proposed development is presented in the form of an eight-and-half-year programme. Costs in this Report are based on 1975 values, unless otherwise stated. (Author)

  18. Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

    Science.gov (United States)

    de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

    2016-08-01

    Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected pneratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.

  19. A set of genes previously implicated in the hypoxia response might be an important modulator in the rat ear tissue response to mechanical stretch

    Directory of Open Access Journals (Sweden)

    Orgill Dennis

    2007-11-01

    Full Text Available Abstract Background Wounds are increasingly important in our aging societies. Pathologies such as diabetes predispose patients to chronic wounds that can cause pain, infection, and amputation. The vacuum assisted closure device shows remarkable outcomes in wound healing. Its mechanism of action is unclear despite several hypotheses advanced. We previously hypothesized that micromechanical forces can heal wounds. To understand better the biological response of soft tissue to forces, rat ears in vivo were stretched and their gene expression patterns over time obtained. The absolute enrichment (AE algorithm that obtains a combined up and down regulated picture of the expression analysis was implemented. Results With the use of AE, the hypoxia gene set was the most important at a highly significant level. A co-expression network analysis showed that important co-regulated members of the hypoxia pathway include a glucose transporter (slc2a8, heme oxygenase, and nitric oxide synthase2 among others. Conclusion It appears that the hypoxia pathway may be an important modulator of response of soft tissue to forces. This finding gives us insights not only into the underlying biology, but also into clinical interventions that could be designed to mimic within wounded tissue the effects of forces without all the negative effects that forces themselves create.

  20. RGFinder: a system for determining semantically related genes using GO graph minimum spanning tree.

    Science.gov (United States)

    Taha, Kamal

    2015-01-01

    Biologists often need to know the set S' of genes that are the most functionally and semantically related to a given set S of genes. For determining the set S', most current gene similarity measures overlook the structural dependencies among the Gene Ontology (GO) terms annotating the set S, which may lead to erroneous results. We introduce in this paper a biological search engine called RGFinder that considers the structural dependencies among GO terms by employing the concept of existence dependency. RGFinder assigns a weight to each edge in GO graph to represent the degree of relatedness between the two GO terms connected by the edge. The value of the weight is determined based on the following factors: 1) type of the relation represented by the edge (e.g., an "is-a" relation is assigned a different weight than a "part-of" relation), 2) the functional relationship between the two GO terms connected by the edge, and 3) the string-substring relationship between the names of the two GO terms connected by the edge. RGFinder then constructs a minimum spanning tree of GO graph based on these weights. In the framework of RGFinder, the set S' is annotated to the GO terms located at the lowest convergences of the subtree of the minimum spanning tree that passes through the GO terms annotating set S. We evaluated RGFinder experimentally and compared it with four gene set enrichment systems. Results showed marked improvement.

  1. Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

    Science.gov (United States)

    2013-01-01

    Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize

  2. Influence of hexavalent chromium on lactate-enriched Hanford groundwater microbial communities.

    Energy Technology Data Exchange (ETDEWEB)

    Somenahally, Anil C [ORNL; Mosher, Jennifer J [ORNL; Yuan, Tong [University of Oklahoma; Podar, Mircea [ORNL; Phelps, Tommy Joe [ORNL; Brown, Steven D [ORNL; Yang, Zamin Koo [ORNL; Hazen, Terry C [ORNL; Arkin, Adam [Lawrence Berkeley National Laboratory (LBNL); Palumbo, Anthony Vito [ORNL; Zhou, Jizhong [University of Oklahoma; Elias, Dwayne A [ORNL

    2013-01-01

    Microbial reduction and immobilization of chromate (Cr(VI)) is a plausible bioremediation strategy. However, higher Cr(VI) concentrations may impose stress on native Cr-reducing communities. We sought to determine if Cr(VI) would influence the lactate enriched native microbial community structure and function in groundwater from the Cr contaminated site at Hanford, WA. Steady state continuous flow bioreactors were amended with lactate and Cr(VI) (0.0, 0.1 and 3.0 mg/L). Microbial growth, metabolites, Cr(VI) concentrations, 16S rRNA gene sequences and GeoChip based functional gene composition in bioreactors were monitored for 15 weeks. Temporal trends and some differences in growth, metabolite profiles, and community composition were observed, largely between Low-Cr and High-Cr bioreactors. In both High-Cr and Low-Cr bioreactors, Cr(VI) was reduced in the bioreactors. With lactate enrichment, the native communities did not significantly differ between Cr concentrations. Native bacterial communities were diverse, whereas after lactate enrichment, Pelosinus spp., and Sporotalea spp., were the most predominant groups in all bioreactors. Similarly, the Archaea diversity significantly decreased from Methanosaeta (35%), Methanosarcina (17%), Halobacteriales (12%), Methanoregula (8%) and others, to mostly Methanosarcina spp. (95%) after lactate enrichment. Composition of several key functional genes was distinct in Low-Cr bioreactors compared to High-Cr. Among the Cr resistant probes (chrA), Burkholderia vietnamiensis, Comamonas testosterone and Ralstonia pickettii proliferated in Cr amended bioreactors. In-situ fermentative conditions facilitated Cr(VI) reduction, and as a result the 3.0 mg/L Cr(VI) did not appear to give chromate reducing strains a competitive advantage for proliferation or for increasing Cr-reduction.

  3. SSHscreen and SSHdb, generic software for microarray based gene discovery: application to the stress response in cowpea

    Directory of Open Access Journals (Sweden)

    Oelofse Dean

    2010-04-01

    Full Text Available Abstract Background Suppression subtractive hybridization is a popular technique for gene discovery from non-model organisms without an annotated genome sequence, such as cowpea (Vigna unguiculata (L. Walp. We aimed to use this method to enrich for genes expressed during drought stress in a drought tolerant cowpea line. However, current methods were inefficient in screening libraries and management of the sequence data, and thus there was a need to develop software tools to facilitate the process. Results Forward and reverse cDNA libraries enriched for cowpea drought response genes were screened on microarrays, and the R software package SSHscreen 2.0.1 was developed (i to normalize the data effectively using spike-in control spot normalization, and (ii to select clones for sequencing based on the calculation of enrichment ratios with associated statistics. Enrichment ratio 3 values for each clone showed that 62% of the forward library and 34% of the reverse library clones were significantly differentially expressed by drought stress (adjusted p value 88% of the clones in both libraries were derived from rare transcripts in the original tester samples, thus supporting the notion that suppression subtractive hybridization enriches for rare transcripts. A set of 118 clones were chosen for sequencing, and drought-induced cowpea genes were identified, the most interesting encoding a late embryogenesis abundant Lea5 protein, a glutathione S-transferase, a thaumatin, a universal stress protein, and a wound induced protein. A lipid transfer protein and several components of photosynthesis were down-regulated by the drought stress. Reverse transcriptase quantitative PCR confirmed the enrichment ratio values for the selected cowpea genes. SSHdb, a web-accessible database, was developed to manage the clone sequences and combine the SSHscreen data with sequence annotations derived from BLAST and Blast2GO. The self-BLAST function within SSHdb grouped

  4. WaveSeq: a novel data-driven method of detecting histone modification enrichments using wavelets.

    Directory of Open Access Journals (Sweden)

    Apratim Mitra

    Full Text Available BACKGROUND: Chromatin immunoprecipitation followed by next-generation sequencing is a genome-wide analysis technique that can be used to detect various epigenetic phenomena such as, transcription factor binding sites and histone modifications. Histone modification profiles can be either punctate or diffuse which makes it difficult to distinguish regions of enrichment from background noise. With the discovery of histone marks having a wide variety of enrichment patterns, there is an urgent need for analysis methods that are robust to various data characteristics and capable of detecting a broad range of enrichment patterns. RESULTS: To address these challenges we propose WaveSeq, a novel data-driven method of detecting regions of significant enrichment in ChIP-Seq data. Our approach utilizes the wavelet transform, is free of distributional assumptions and is robust to diverse data characteristics such as low signal-to-noise ratios and broad enrichment patterns. Using publicly available datasets we showed that WaveSeq compares favorably with other published methods, exhibiting high sensitivity and precision for both punctate and diffuse enrichment regions even in the absence of a control data set. The application of our algorithm to a complex histone modification data set helped make novel functional discoveries which further underlined its utility in such an experimental setup. CONCLUSIONS: WaveSeq is a highly sensitive method capable of accurate identification of enriched regions in a broad range of data sets. WaveSeq can detect both narrow and broad peaks with a high degree of accuracy even in low signal-to-noise ratio data sets. WaveSeq is also suited for application in complex experimental scenarios, helping make biologically relevant functional discoveries.

  5. Environmental Enrichment Mitigates Deficits after Repetitive Mild Traumatic Brain Injury.

    Science.gov (United States)

    Liu, Xixia; Qiu, Jianhua; Alcon, Sasha; Hashim, Jumana; Meehan, William P; Mannix, Rebekah

    2017-08-15

    Although environmental enrichment has been shown to improve functional and histologic outcomes in pre-clinical moderate-to-severe traumatic brain injury (TBI), there are a paucity of pre-clinical data regarding enrichment strategies in the setting of repetitive mild traumatic brain injury (rmTBI). Given the vast numbers of athletes and those in the military who sustain rmTBI, the mounting evidence of the long-term and progressive sequelae of rmTBI, and the lack of targeted therapies to mitigate these sequelae, successful enrichment interventions in rmTBI could have large public health significance. Here, we evaluated enrichment strategies in an established pre-clinical rmTBI model. Seventy-one male C57BL/6 mice were randomized to two different housing conditions, environmental enrichment (EE) or normal condition (NC), then subjected to rmTBI injury (seven injuries in 9 days) or sham injury (anesthesia only). Functional outcomes in all four groups (NC-TBI, EE-TBI, NC-sham, and EE-sham) were assessed by motor, exploratory/anxiety, and mnemonic behavioral tests. At the synaptic level, N-methyl d-aspartate receptor (NMDAR) subunit expression of phosphorylated glutamate receptor 1 (GluR1), phosphorylated Ca 2+ /calmodulin-dependent protein kinase II (CaMKII), and calpain were evaluated by western blot. Compared to injured NC-TBI mice, EE-TBI mice had improved memory and decreased anxiety and exploratory activity post-injury. Treatment with enrichment also corresponded to normal NMDAR subunit expression, decreased GluR1 phosphorylation, decreased phosphorylated CaMKII, and normal calpain expression post-rmTBI. These data suggest that enrichment strategies may improve functional outcomes and mitigate synaptic changes post-rmTBI. Given that enrichment strategies are feasible in the clinical setting, particularly for athletes and soldiers for whom the risk of repetitive injury is greatest, these data suggest that clinical trials may be warranted.

  6. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

    Science.gov (United States)

    Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

    2018-02-09

    Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.

  7. Scaling proprioceptor gene transcription by retrograde NT3 signaling.

    Directory of Open Access Journals (Sweden)

    Jun Lee

    Full Text Available Cell-type specific intrinsic programs instruct neuronal subpopulations before target-derived factors influence later neuronal maturation. Retrograde neurotrophin signaling controls neuronal survival and maturation of dorsal root ganglion (DRG sensory neurons, but how these potent signaling pathways intersect with transcriptional programs established at earlier developmental stages remains poorly understood. Here we determine the consequences of genetic alternation of NT3 signaling on genome-wide transcription programs in proprioceptors, an important sensory neuron subpopulation involved in motor reflex behavior. We find that the expression of many proprioceptor-enriched genes is dramatically altered by genetic NT3 elimination, independent of survival-related activities. Combinatorial analysis of gene expression profiles with proprioceptors isolated from mice expressing surplus muscular NT3 identifies an anticorrelated gene set with transcriptional levels scaled in opposite directions. Voluntary running experiments in adult mice further demonstrate the maintenance of transcriptional adjustability of genes expressed by DRG neurons, pointing to life-long gene expression plasticity in sensory neurons.

  8. Identification of differentially expressed genes and biological pathways in bladder cancer

    Science.gov (United States)

    Tang, Fucai; He, Zhaohui; Lei, Hanqi; Chen, Yuehan; Lu, Zechao; Zeng, Guohua; Wang, Hangtao

    2018-01-01

    The purpose of the present study was to identify key genes and investigate the related molecular mechanisms of bladder cancer (BC) progression. From the Gene Expression Omnibus database, the gene expression dataset GSE7476 was downloaded, which contained 43 BC samples and 12 normal bladder tissues. GSE7476 was analyzed to screen the differentially expressed genes (DEGs). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses were performed for the DEGs using the DAVID database, and a protein-protein interaction (PPI) network was then constructed using Cytoscape software. The results of the GO analysis showed that the upregulated DEGs were significantly enriched in cell division, nucleoplasm and protein binding, while the downregulated DEGs were significantly enriched in ‘extracellular matrix organization’, ‘proteinaceous extracellular matrix’ and ‘heparin binding’. The results of the KEGG pathway analysis showed that the upregulated DEGs were significantly enriched in the ‘cell cycle’, whereas the downregulated DEGs were significantly enriched in ‘complement and coagulation cascades’. JUN, cyclin-dependent kinase 1, FOS, PCNA, TOP2A, CCND1 and CDH1 were found to be hub genes in the PPI network. Sub-networks revealed that these gene were enriched in significant pathways, including the ‘cell cycle’ signaling pathway and ‘PI3K-Akt signaling pathway’. In summary, the present study identified DEGs and key target genes in the progression of BC, providing potential molecular targets and diagnostic biomarkers for the treatment of BC. PMID:29532898

  9. Gene expression profiles in asbestos-exposed epithelial and mesothelial lung cell lines

    Directory of Open Access Journals (Sweden)

    Kaski Samuel

    2007-03-01

    Full Text Available Abstract Background Asbestos has been shown to cause chromosomal damage and DNA aberrations. Exposure to asbestos causes many lung diseases e.g. asbestosis, malignant mesothelioma, and lung cancer, but the disease-related processes are still largely unknown. We exposed the human cell lines A549, Beas-2B and Met5A to crocidolite asbestos and determined time-dependent gene expression profiles by using Affymetrix arrays. The hybridization data was analyzed by using an algorithm specifically designed for clustering of short time series expression data. A canonical correlation analysis was applied to identify correlations between the cell lines, and a Gene Ontology analysis method for the identification of enriched, differentially expressed biological processes. Results We recognized a large number of previously known as well as new potential asbestos-associated genes and biological processes, and identified chromosomal regions enriched with genes potentially contributing to common responses to asbestos in these cell lines. These include genes such as the thioredoxin domain containing gene (TXNDC and the potential tumor suppressor, BCL2/adenovirus E1B 19kD-interacting protein gene (BNIP3L, GO-terms such as "positive regulation of I-kappaB kinase/NF-kappaB cascade" and "positive regulation of transcription, DNA-dependent", and chromosomal regions such as 2p22, 9p13, and 14q21. We present the complete data sets as Additional files. Conclusion This study identifies several interesting targets for further investigation in relation to asbestos-associated diseases.

  10. A microarray analysis of sex- and gonad-biased gene expression in the zebrafish: Evidence for masculinization of the transcriptome

    Directory of Open Access Journals (Sweden)

    Mo Qianxing

    2009-12-01

    Full Text Available Abstract Background In many taxa, males and females are very distinct phenotypically, and these differences often reflect divergent selective pressures acting on the sexes. Phenotypic sexual dimorphism almost certainly reflects differing patterns of gene expression between the sexes, and microarray studies have documented widespread sexually dimorphic gene expression. Although the evolutionary significance of sexual dimorphism in gene expression remains unresolved, these studies have led to the formulation of a hypothesis that male-driven evolution has resulted in the masculinization of animal transcriptomes. Here we use a microarray assessment of sex- and gonad-biased gene expression to test this hypothesis in zebrafish. Results By using zebrafish Affymetrix microarrays to compare gene expression patterns in male and female somatic and gonadal tissues, we identified a large number of genes (5899 demonstrating differences in transcript abundance between male and female Danio rerio. Under conservative statistical significance criteria, all sex-biases in gene expression were due to differences between testes and ovaries. Male-enriched genes were more abundant than female-enriched genes, and expression bias for male-enriched genes was greater in magnitude than that for female-enriched genes. We also identified a large number of genes demonstrating elevated transcript abundance in testes and ovaries relative to male body and female body, respectively. Conclusion Overall our results support the hypothesis that male-biased evolutionary pressures have resulted in male-biased patterns of gene expression. Interestingly, our results seem to be at odds with a handful of other microarray-based studies of sex-specific gene expression patterns in zebrafish. However, ours was the only study designed to address this specific hypothesis, and major methodological differences among studies could explain the discrepancies. Regardless, all of these studies agree

  11. Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans

    Science.gov (United States)

    Iraola, Gregorio; Vazquez, Gustavo; Spangenberg, Lucía; Naya, Hugo

    2012-01-01

    Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes, for both known human pathogenic and non-pathogenic strains; this information is now available to investigate genetic features that determine pathogenic phenotypes in bacteria. In this work we determined presence/absence patterns of different virulence-related genes among more than finished bacterial genomes from both human pathogenic and non-pathogenic strains, belonging to different taxonomic groups (i.e: Actinobacteria, Gammaproteobacteria, Firmicutes, etc.). An accuracy of 95% using a cross-fold validation scheme with in-fold feature selection is obtained when classifying human pathogens and non-pathogens. A reduced subset of highly informative genes () is presented and applied to an external validation set. The statistical model was implemented in the BacFier v1.0 software (freely available at ), that displays not only the prediction (pathogen/non-pathogen) and an associated probability for pathogenicity, but also the presence/absence vector for the analyzed genes, so it is possible to decipher the subset of virulence genes responsible for the classification on the analyzed genome. Furthermore, we discuss the biological relevance for bacterial pathogenesis of the core set of genes, corresponding to eight functional categories, all with evident and documented association with the phenotypes of interest. Also, we analyze which functional categories of virulence genes were more distinctive for pathogenicity in each taxonomic group, which seems to be a completely new kind of information and could lead to important evolutionary conclusions. PMID:22916122

  12. Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification

    Science.gov (United States)

    2018-01-01

    One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets were published, these are usually in very poor agreement with each other, and very few of the genes proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in the cancer genome atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes. PMID:29470520

  13. Genes Underlying Positive Influence Of Prenatal Environmental ...

    African Journals Online (AJOL)

    Genes Underlying Positive Influence Of Prenatal Environmental Enrichment And ... Prenatal environmental enrichment (EE) has been proven to positively affect but ... Conclusion: The negative-positive prenatal effect could contribute to altered ...

  14. Adaptive Roles of SSY1 and SIR3 During Cycles of Growth and Starvation in Saccharomyces cerevisiae Populations Enriched for Quiescent or Nonquiescent Cells.

    Science.gov (United States)

    Wloch-Salamon, Dominika M; Tomala, Katarzyna; Aggeli, Dimitra; Dunn, Barbara

    2017-06-07

    Over its evolutionary history, Saccharomyces cerevisiae has evolved to be well-adapted to fluctuating nutrient availability. In the presence of sufficient nutrients, yeast cells continue to proliferate, but upon starvation haploid yeast cells enter stationary phase and differentiate into nonquiescent (NQ) and quiescent (Q) cells. Q cells survive stress better than NQ cells and show greater viability when nutrient-rich conditions are restored. To investigate the genes that may be involved in the differentiation of Q and NQ cells, we serially propagated yeast populations that were enriched for either only Q or only NQ cell types over many repeated growth-starvation cycles. After 30 cycles (equivalent to 300 generations), each enriched population produced a higher proportion of the enriched cell type compared to the starting population, suggestive of adaptive change. We also observed differences in each population's fitness suggesting possible tradeoffs: clones from NQ lines were better adapted to logarithmic growth, while clones from Q lines were better adapted to starvation. Whole-genome sequencing of clones from Q- and NQ-enriched lines revealed mutations in genes involved in the stress response and survival in limiting nutrients ( ECM21 , RSP5 , MSN1 , SIR4 , and IRA2 ) in both Q and NQ lines, but also differences between the two lines: NQ line clones had recurrent independent mutations affecting the Ssy1p-Ptr3p-Ssy5p (SPS) amino acid sensing pathway, while Q line clones had recurrent, independent mutations in SIR3 and FAS1 Our results suggest that both sets of enriched-cell type lines responded to common, as well as distinct, selective pressures. Copyright © 2017 Wloch-Salamon et al.

  15. Pathway Distiller - multisource biological pathway consolidation.

    Science.gov (United States)

    Doderer, Mark S; Anguiano, Zachry; Suresh, Uthra; Dashnamoorthy, Ravi; Bishop, Alexander J R; Chen, Yidong

    2012-01-01

    One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into pathway concepts essential. This will facilitate unbiased, comprehensive yet streamlined analysis of experiments that result in large gene sets. After gene set enrichment finds representative pathways for large gene sets, pathways are consolidated into representative pathway concepts. Three complementary, but different methods of pathway consolidation are explored. Enrichment Consolidation combines the set of the pathways enriched for the signature gene list through iterative combining of enriched pathways with other pathways with similar signature gene sets; Weighted Consolidation utilizes a Protein-Protein Interaction network based gene-weighting approach that finds clusters of both enriched and non-enriched pathways limited to the experiments' resultant gene list; and finally the de novo Consolidation method uses several measurements of pathway similarity, that finds static pathway clusters independent of any given experiment. We demonstrate that the three consolidation methods provide unified yet different functional insights of a resultant gene set derived from a genome-wide profiling experiment. Results from the methods are presented, demonstrating their applications in biological studies and comparing with a pathway web-based framework that also combines several pathway databases. Additionally a web-based consolidation framework that encompasses all three methods discussed in this paper, Pathway Distiller (http://cbbiweb.uthscsa.edu/PathwayDistiller), is established to allow

  16. Identification of a core set of rhizobial infection genes using data from single cell-types

    Directory of Open Access Journals (Sweden)

    Da-Song eChen

    2015-07-01

    Full Text Available Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia. More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4’,4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.

  17. Genetic engineering of syringyl-enriched lignin in plants

    Science.gov (United States)

    Chiang, Vincent Lee; Li, Laigeng

    2004-11-02

    The present invention relates to a novel DNA sequence, which encodes a previously unidentified lignin biosynthetic pathway enzyme, sinapyl alcohol dehydrogenase (SAD) that regulates the biosynthesis of syringyl lignin in plants. Also provided are methods for incorporating this novel SAD gene sequence or substantially similar sequences into a plant genome for genetic engineering of syringyl-enriched lignin in plants.

  18. Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data.

    Science.gov (United States)

    Lee, Hyeonjeong; Shin, Miyoung

    2017-01-01

    The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into

  19. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

    Science.gov (United States)

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

    2016-11-30

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.

  20. Oncogenic driver genes and the inflammatory microenvironment dictate liver tumor phenotype

    DEFF Research Database (Denmark)

    Matter, Matthias S; Marquardt, Jens U; Andersen, Jesper B

    2016-01-01

    The majority of hepatocellular carcinoma (HCC) develops in the background of chronic liver inflammation caused by viral hepatitis and alcoholic or non-alcoholic steatohepatitis. However, the impact of different types of chronic inflammatory microenvironments on the phenotypes of tumors generated...... with transcriptome profiles from human HCCs further demonstrated that AKT-CAT tumors generated in the context of chronic liver inflammation showed enrichment of poor prognosis gene sets or decrease of good prognosis gene sets. In contrast, DDC had a more subtle effect on AKT-NRAS(G12V) tumors and primarily enhanced...... by distinct oncogenes is largely unresolved. To address this issue, we generated murine liver tumors by constitutively active AKT-1 (AKT) and β-catenin (CAT) followed by induction of chronic liver inflammation by 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC) and carbon tetrachloride (CCl4 ). Also...

  1. Smoking-induced gene expression changes in the bronchial airway are reflected in nasal and buccal epithelium

    Directory of Open Access Journals (Sweden)

    Zhang Xiaohui

    2008-05-01

    Full Text Available Abstract Background Cigarette smoking is a leading cause of preventable death and a significant cause of lung cancer and chronic obstructive pulmonary disease. Prior studies have demonstrated that smoking creates a field of molecular injury throughout the airway epithelium exposed to cigarette smoke. We have previously characterized gene expression in the bronchial epithelium of never smokers and identified the gene expression changes that occur in the mainstem bronchus in response to smoking. In this study, we explored relationships in whole-genome gene expression between extrathorcic (buccal and nasal and intrathoracic (bronchial epithelium in healthy current and never smokers. Results Using genes that have been previously defined as being expressed in the bronchial airway of never smokers (the "normal airway transcriptome", we found that bronchial and nasal epithelium from non-smokers were most similar in gene expression when compared to other epithelial and nonepithelial tissues, with several antioxidant, detoxification, and structural genes being highly expressed in both the bronchus and nose. Principle component analysis of previously defined smoking-induced genes from the bronchus suggested that smoking had a similar effect on gene expression in nasal epithelium. Gene set enrichment analysis demonstrated that this set of genes was also highly enriched among the genes most altered by smoking in both nasal and buccal epithelial samples. The expression of several detoxification genes was commonly altered by smoking in all three respiratory epithelial tissues, suggesting a common airway-wide response to tobacco exposure. Conclusion Our findings support a relationship between gene expression in extra- and intrathoracic airway epithelial cells and extend the concept of a smoking-induced field of injury to epithelial cells that line the mouth and nose. This relationship could potentially be utilized to develop a non-invasive biomarker for

  2. A Targeted Enrichment Strategy for Massively Parallel Sequencing of Angiosperm Plastid Genomes

    Directory of Open Access Journals (Sweden)

    Gregory W. Stull

    2013-02-01

    Full Text Available Premise of the study: We explored a targeted enrichment strategy to facilitate rapid and low-cost next-generation sequencing (NGS of numerous complete plastid genomes from across the phylogenetic breadth of angiosperms. Methods and Results: A custom RNA probe set including the complete sequences of 22 previously sequenced eudicot plastomes was designed to facilitate hybridization-based targeted enrichment of eudicot plastid genomes. Using this probe set and an Agilent SureSelect targeted enrichment kit, we conducted an enrichment experiment including 24 angiosperms (22 eudicots, two monocots, which were subsequently sequenced on a single lane of the Illumina GAIIx with single-end, 100-bp reads. This approach yielded nearly complete to complete plastid genomes with exceptionally high coverage (mean coverage: 717×, even for the two monocots. Conclusions: Our enrichment experiment was highly successful even though many aspects of the capture process employed were suboptimal. Hence, significant improvements to this methodology are feasible. With this general approach and probe set, it should be possible to sequence more than 300 essentially complete plastid genomes in a single Illumina GAIIx lane (achieving 50× mean coverage. However, given the complications of pooling numerous samples for multiplex sequencing and the limited number of barcodes (e.g., 96 available in commercial kits, we recommend 96 samples as a current practical maximum for multiplex plastome sequencing. This high-throughput approach should facilitate large-scale plastid genome sequencing at any level of phylogenetic diversity in angiosperms.

  3. Filter-Adapted Fluorescent In Situ Hybridization (FA-FISH) for Filtration-Enriched Circulating Tumor Cells.

    Science.gov (United States)

    Oulhen, Marianne; Pailler, Emma; Faugeroux, Vincent; Farace, Françoise

    2017-01-01

    Circulating tumor cells (CTCs) may represent an easily accessible source of tumor material to assess genetic aberrations such as gene-rearrangements or gene-amplifications and screen cancer patients eligible for targeted therapies. As the number of CTCs is a critical parameter to identify such biomarkers, we developed fluorescent in situ hybridization (FISH) for CTCs enriched on filters (filter-adapted-FISH, FA-FISH). Here, we describe the FA-FISH protocol, the combination of immunofluorescent staining (DAPI/CD45) and FA-FISH techniques, as well as the semi-automated microscopy method that we developed to improve the feasibility and reliability of FISH analyses in filtration-enriched CTC.

  4. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  5. Experimental Validation of a Permeability Model for Enrichment Membranes

    International Nuclear Information System (INIS)

    Orellano, Pablo; Brasnarof, Daniel; Florido Pablo

    2003-01-01

    An experimental loop with a real scale diffuser, in a single enrichment-stage configuration, was operated with air at different process conditions, in order to characterize the membrane permeability.Using these experimental data, an analytical geometric-and-morphologic-based model was validated.It is conclude that a new set of independent measurements, i.e. enrichment, is necessary in order to fully characterize diffusers, because of its internal parameters are not univocally determinated with permeability experimental data only

  6. Genes and co-expression modules common to drought and bacterial stress responses in Arabidopsis and rice.

    Directory of Open Access Journals (Sweden)

    Rafi Shaik

    Full Text Available Plants are simultaneously exposed to multiple stresses resulting in enormous changes in the molecular landscape within the cell. Identification and characterization of the synergistic and antagonistic components of stress response mechanisms contributing to the cross talk between stresses is of high priority to explore and enhance multiple stress responses. To this end, we performed meta-analysis of drought (abiotic, bacterial (biotic stress response in rice and Arabidopsis by analyzing a total of 386 microarray samples belonging to 20 microarray studies and identified approximately 3100 and 900 DEGs in rice and Arabidopsis, respectively. About 38.5% (1214 and 28.7% (272 DEGs were common to drought and bacterial stresses in rice and Arabidopsis, respectively. A majority of these common DEGs showed conserved expression status in both stresses. Gene ontology enrichment analysis clearly demarcated the response and regulation of various plant hormones and related biological processes. Fatty acid metabolism and biosynthesis of alkaloids were upregulated and, nitrogen metabolism and photosynthesis was downregulated in both stress conditions. WRKY transcription family genes were highly enriched in all upregulated gene sets while 'CO-like' TF family showed inverse relationship of expression between drought and bacterial stresses. Weighted gene co-expression network analysis divided DEG sets into multiple modules that show high co-expression and identified stress specific hub genes with high connectivity. Detection of consensus modules based on DEGs common to drought and bacterial stress revealed 9 and 4 modules in rice and Arabidopsis, respectively, with conserved and reversed co-expression patterns.

  7. Gene Ontology Consortium: going forward.

    Science.gov (United States)

    2015-01-01

    The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Identification of the Key Genes and Pathways in Esophageal Carcinoma.

    Science.gov (United States)

    Su, Peng; Wen, Shiwang; Zhang, Yuefeng; Li, Yong; Xu, Yanzhao; Zhu, Yonggang; Lv, Huilai; Zhang, Fan; Wang, Mingbo; Tian, Ziqiang

    2016-01-01

    Objective . Esophageal carcinoma (EC) is a frequently common malignancy of gastrointestinal cancer in the world. This study aims to screen key genes and pathways in EC and elucidate the mechanism of it. Methods . 5 microarray datasets of EC were downloaded from Gene Expression Omnibus. Differentially expressed genes (DEGs) were screened by bioinformatics analysis. Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, and protein-protein interaction (PPI) network construction were performed to obtain the biological roles of DEGs in EC. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to verify the expression level of DEGs in EC. Results . A total of 1955 genes were filtered as DEGs in EC. The upregulated genes were significantly enriched in cell cycle and the downregulated genes significantly enriched in Endocytosis. PPI network displayed CDK4 and CCT3 were hub proteins in the network. The expression level of 8 dysregulated DEGs including CDK4, CCT3, THSD4, SIM2, MYBL2, CENPF, CDCA3, and CDKN3 was validated in EC compared to adjacent nontumor tissues and the results were matched with the microarray analysis. Conclusion . The significantly DEGs including CDK4, CCT3, THSD4, and SIM2 may play key roles in tumorigenesis and development of EC involved in cell cycle and Endocytosis.

  9. Identification of the Key Genes and Pathways in Esophageal Carcinoma

    Directory of Open Access Journals (Sweden)

    Peng Su

    2016-01-01

    Full Text Available Objective. Esophageal carcinoma (EC is a frequently common malignancy of gastrointestinal cancer in the world. This study aims to screen key genes and pathways in EC and elucidate the mechanism of it. Methods. 5 microarray datasets of EC were downloaded from Gene Expression Omnibus. Differentially expressed genes (DEGs were screened by bioinformatics analysis. Gene Ontology (GO enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG enrichment, and protein-protein interaction (PPI network construction were performed to obtain the biological roles of DEGs in EC. Quantitative real-time polymerase chain reaction (qRT-PCR was used to verify the expression level of DEGs in EC. Results. A total of 1955 genes were filtered as DEGs in EC. The upregulated genes were significantly enriched in cell cycle and the downregulated genes significantly enriched in Endocytosis. PPI network displayed CDK4 and CCT3 were hub proteins in the network. The expression level of 8 dysregulated DEGs including CDK4, CCT3, THSD4, SIM2, MYBL2, CENPF, CDCA3, and CDKN3 was validated in EC compared to adjacent nontumor tissues and the results were matched with the microarray analysis. Conclusion. The significantly DEGs including CDK4, CCT3, THSD4, and SIM2 may play key roles in tumorigenesis and development of EC involved in cell cycle and Endocytosis.

  10. Genomic determinants of sporulation in Bacilli and Clostridia: towards the minimal set of sporulation-specific genes.

    Science.gov (United States)

    Galperin, Michael Y; Mekhedov, Sergei L; Puigbo, Pere; Smirnov, Sergey; Wolf, Yuri I; Rigden, Daniel J

    2012-11-01

    Three classes of low-G+C Gram-positive bacteria (Firmicutes), Bacilli, Clostridia and Negativicutes, include numerous members that are capable of producing heat-resistant endospores. Spore-forming firmicutes include many environmentally important organisms, such as insect pathogens and cellulose-degrading industrial strains, as well as human pathogens responsible for such diseases as anthrax, botulism, gas gangrene and tetanus. In the best-studied model organism Bacillus subtilis, sporulation involves over 500 genes, many of which are conserved among other bacilli and clostridia. This work aimed to define the genomic requirements for sporulation through an analysis of the presence of sporulation genes in various firmicutes, including those with smaller genomes than B. subtilis. Cultivable spore-formers were found to have genomes larger than 2300 kb and encompass over 2150 protein-coding genes of which 60 are orthologues of genes that are apparently essential for sporulation in B. subtilis. Clostridial spore-formers lack, among others, spoIIB, sda, spoVID and safA genes and have non-orthologous displacements of spoIIQ and spoIVFA, suggesting substantial differences between bacilli and clostridia in the engulfment and spore coat formation steps. Many B. subtilis sporulation genes, particularly those encoding small acid-soluble spore proteins and spore coat proteins, were found only in the family Bacillaceae, or even in a subset of Bacillus spp. Phylogenetic profiles of sporulation genes, compiled in this work, confirm the presence of a common sporulation gene core, but also illuminate the diversity of the sporulation processes within various lineages. These profiles should help further experimental studies of uncharacterized widespread sporulation genes, which would ultimately allow delineation of the minimal set(s) of sporulation-specific genes in Bacilli and Clostridia. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.

  11. Uranium enrichment

    International Nuclear Information System (INIS)

    1990-01-01

    This report looks at the following issues: How much Soviet uranium ore and enriched uranium are imported into the United States and what is the extent to which utilities flag swap to disguise these purchases? What are the U.S.S.R.'s enriched uranium trading practices? To what extent are utilities required to return used fuel to the Soviet Union as part of the enriched uranium sales agreement? Why have U.S. utilities ended their contracts to buy enrichment services from DOE?

  12. A survey of genomic studies supports association of circadian clock genes with bipolar disorder spectrum illnesses and lithium response.

    Directory of Open Access Journals (Sweden)

    Michael J McCarthy

    Full Text Available Circadian rhythm abnormalities in bipolar disorder (BD have led to a search for genetic abnormalities in circadian "clock genes" associated with BD. However, no significant clock gene findings have emerged from genome-wide association studies (GWAS. At least three factors could account for this discrepancy: complex traits are polygenic, the organization of the clock is more complex than previously recognized, and/or genetic risk for BD may be shared across multiple illnesses. To investigate these issues, we considered the clock gene network at three levels: essential "core" clock genes, upstream circadian clock modulators, and downstream clock controlled genes. Using relaxed thresholds for GWAS statistical significance, we determined the rates of clock vs. control genetic associations with BD, and four additional illnesses that share clinical features and/or genetic risk with BD (major depression, schizophrenia, attention deficit/hyperactivity. Then we compared the results to a set of lithium-responsive genes. Associations with BD-spectrum illnesses and lithium-responsiveness were both enriched among core clock genes but not among upstream clock modulators. Associations with BD-spectrum illnesses and lithium-responsiveness were also enriched among pervasively rhythmic clock-controlled genes but not among genes that were less pervasively rhythmic or non-rhythmic. Our analysis reveals previously unrecognized associations between clock genes and BD-spectrum illnesses, partly reconciling previously discordant results from past GWAS and candidate gene studies.

  13. The multicellularity genes of dictyostelid social amoebas

    Science.gov (United States)

    Glöckner, Gernot; Lawal, Hajara M.; Felder, Marius; Singh, Reema; Singer, Gail; Weijer, Cornelis J.; Schaap, Pauline

    2016-01-01

    The evolution of multicellularity enabled specialization of cells, but required novel signalling mechanisms for regulating cell differentiation. Early multicellular organisms are mostly extinct and the origins of these mechanisms are unknown. Here using comparative genome and transcriptome analysis across eight uni- and multicellular amoebozoan genomes, we find that 80% of proteins essential for the development of multicellular Dictyostelia are already present in their unicellular relatives. This set is enriched in cytosolic and nuclear proteins, and protein kinases. The remaining 20%, unique to Dictyostelia, mostly consists of extracellularly exposed and secreted proteins, with roles in sensing and recognition, while several genes for synthesis of signals that induce cell-type specialization were acquired by lateral gene transfer. Across Dictyostelia, changes in gene expression correspond more strongly with phenotypic innovation than changes in protein functional domains. We conclude that the transition to multicellularity required novel signals and sensors rather than novel signal processing mechanisms. PMID:27357338

  14. Ginger and turmeric expressed sequence tags identify signature genes for rhizome identity and development and the biosynthesis of curcuminoids, gingerols and terpenoids

    Science.gov (United States)

    2013-01-01

    Background Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric. Results In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols. Conclusion A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific

  15. Selected nondestructive assay instrumentation for an international safeguards system at uranium enrichment plants

    International Nuclear Information System (INIS)

    Tape, J.W.; Baker, M.P.; Strittmatter, R.; Jain, M.; Evans, M.L.

    1979-01-01

    A selected set of nondestructive assay instruments for an international safeguards system at uranium enrichment plants is currently under development. These instruments are of three types: in-line enrichment meters for feed, product, and tails streams; area radiation monitors for direct detection of high-enriched uranium production, and an enrichment meter for spent alumina trap material. The current status of the development of each of these instruments is discussed, with supporting data, as well as the role each would play in a total international safeguards system. 5 figures

  16. Coverage and characteristics of the Affymetrix GeneChip Human Mapping 100K SNP set.

    Directory of Open Access Journals (Sweden)

    2006-05-01

    Full Text Available Improvements in technology have made it possible to conduct genome-wide association mapping at costs within reach of academic investigators, and experiments are currently being conducted with a variety of high-throughput platforms. To provide an appropriate context for interpreting results of such studies, we summarize here results of an investigation of one of the first of these technologies to be publicly available, the Affymetrix GeneChip Human Mapping 100K set of single nucleotide polymorphisms (SNPs. In a systematic analysis of the pattern and distribution of SNPs in the Mapping 100K set, we find that SNPs in this set are undersampled from coding regions (both nonsynonymous and synonymous and oversampled from regions outside genes, relative to SNPs in the overall HapMap database. In addition, we utilize a novel multilocus linkage disequilibrium (LD coefficient based on information content (analogous to the information content scores commonly used for linkage mapping that is equivalent to the familiar measure r2 in the special case of two loci. Using this approach, we are able to summarize for any subset of markers, such as the Affymetrix Mapping 100K set, the information available for association mapping in that subset, relative to the information available in the full set of markers included in the HapMap, and highlight circumstances in which this multilocus measure of LD provides substantial additional insight about the haplotype structure in a region over pairwise measures of LD.

  17. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  18. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  19. Study of Transients in an Enrichment Closed Loop

    International Nuclear Information System (INIS)

    Fernandino, M.

    2002-06-01

    In the present thesis a mathematic model is presented in order to describe the dynamic behavior inside a closed enrichment loop, the latter representing a single stage of an uranium gaseous diffusion enrichment cascade.The analytical model is turned into a numerical model, and implemented through a computational code.For the verification of the model, measurements were taken in an experimental circuit using air as the process fluid.This circuit was instrumented so as to register its characteristic thermohydraulic variables.The measured transients were simulated, comparing the numerical results with the experimental measurements.A good agreement between the characteristic setting times and the thermohydraulic parameters evolution was observed.Besides, other transients of two species separation were numerically analyzed, including setting times of each magnitude, behavior of each one of them during different transients, and redistribution of concentrations

  20. Bacterial Community Profiling of H2/CO2 or Formate-Utilizing Acetogens Enriched from Diverse Ecosystems

    Science.gov (United States)

    Han, R.; Zhang, L.; Fu, B.; Liu, H.

    2014-12-01

    Synthetic gases are usually generated from either cellulosic agricultural waste combustion or industrial release and could be subsequently transformed into acetate, ethanol, and/or butyrate by homoacetogenic bacteria, which commonly possess reductive acetyl-CoA synthesis pathway. Homoacetogen-based syngas fermentation technology provides an alternative solution to link greenhouse gas emission control and cellulosic solid waste treatment with biofuels production. The objective of our current project is to hunt for homoacetogens with capabilities of highly efficiently converting syngases to chemical solvents. In this study, we evaluated homoacetogens population dynamics during enrichments and pinpointed dominant homoacetogens representing diverse ecosystems enriched by different substrates. We enriched homoacetogens from four different samples including waste activate sludge, freshwater sediment, anaerobic methanogenic sludge, and cow manure using H2/CO2 (4:1) or formate as substrate for homoacetogen enrichment. Along with the formyltetrahydrofolate synthetase (FTHFS) gene (fhs gene)-specific real time qPCR assay and Terminal Restriction Fragment Length Polymorphism (T-RFLP) analysis, 16S rRNA based 454 high-throughput pyrosequencing was applied to reveal the population dynamic and community structure during enrichment from different origins. Enrichment of homoacetogenic populations coincided with accumulations of short chain fatty acids such as acetate and butyrate. 454 high-throughput pyrosequencing revealed Firmicutes and Spirochaetes populations became dominant while the overall microbial diversity decreased after enrichment. The most abundant sequences among the four origins belonged to the following phyla: Firmicutes, Spirochaetes, Proteobacteria, and Bacteroidetes, accounting for 62.1%-99.1% of the total reads. The major putative homoacetogenic species enriched on H2/CO2 or formate belonged to Clostridium spp., Acetobacterium spp., Acetoanaerobium spp

  1. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.; Medvedeva, Yulia; Baeck, Pia M.; Hegde, Shubhada R.; Mande, Shekhar C.; Makeev, Vsevolod J.

    2013-01-01

    interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set

  2. Bioinformatics analysis of RNA-seq data revealed critical genes in colon adenocarcinoma.

    Science.gov (United States)

    Xi, W-D; Liu, Y-J; Sun, X-B; Shan, J; Yi, L; Zhang, T-T

    2017-07-01

    RNA-seq data of colon adenocarcinoma (COAD) were analyzed with bioinformatics tools to discover critical genes in the disease. Relevant small molecule drugs, transcription factors (TFs) and microRNAs (miRNAs) were also investigated. RNA-seq data of COAD were downloaded from The Cancer Genome Atlas (TCGA). Differential analysis was performed with package edgeR. False positive discovery (FDR) 1 were set as the cut-offs to screen out differentially expressed genes (DEGs). Gene coexpression network was constructed with package Ebcoexpress. GO enrichment analysis was performed for the DEGs in the gene coexpression network with DAVID. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was also performed for the genes with KOBASS 2.0. Modules were identified with MCODE of Cytoscape. Relevant small molecules drugs were predicted by Connectivity map. Relevant miRNAs and TFs were searched by WebGestalt. A total of 457 DEGs, including 255 up-regulated and 202 down-regulated genes, were identified from 437 COAD and 39 control samples. A gene coexpression network was constructed containing 40 DEGs and 101 edges. The genes were mainly associated with collagen fibril organization, extracellular matrix organization and translation. Two modules were identified from the gene coexpression network, which were implicated in muscle contraction and extracellular matrix organization, respectively. Several critical genes were disclosed, such as MYH11, COL5A2 and ribosomal proteins. Nine relevant small molecule drugs were identified, such as scriptaid and STOCK1N-35874. Accordingly, a total of 17 TFs and 10 miRNAs related to COAD were acquired, such as ETS2, NFAT, AP4, miR-124A, MiR-9, miR-96 and let-7. Several critical genes and relevant drugs, TFs and miRNAs were revealed in COAD. These findings could advance the understanding of the disease and benefit therapy development.

  3. Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease.

    Science.gov (United States)

    Azuaje, Francisco; Zheng, Huiru; Camargo, Anyela; Wang, Haiying

    2011-08-01

    The discovery of novel disease biomarkers is a crucial challenge for translational bioinformatics. Demonstration of both their classification power and reproducibility across independent datasets are essential requirements to assess their potential clinical relevance. Small datasets and multiplicity of putative biomarker sets may explain lack of predictive reproducibility. Studies based on pathway-driven discovery approaches have suggested that, despite such discrepancies, the resulting putative biomarkers tend to be implicated in common biological processes. Investigations of this problem have been mainly focused on datasets derived from cancer research. We investigated the predictive and functional concordance of five methods for discovering putative biomarkers in four independently-generated datasets from the cardiovascular disease domain. A diversity of biosignatures was identified by the different methods. However, we found strong biological process concordance between them, especially in the case of methods based on gene set analysis. With a few exceptions, we observed lack of classification reproducibility using independent datasets. Partial overlaps between our putative sets of biomarkers and the primary studies exist. Despite the observed limitations, pathway-driven or gene set analysis can predict potentially novel biomarkers and can jointly point to biomedically-relevant underlying molecular mechanisms. Copyright © 2011 Elsevier Inc. All rights reserved.

  4. Gas-phase UF6 enrichment monitor for enrichment plant safeguards

    International Nuclear Information System (INIS)

    Strittmatter, R.B.; Tape, J.W.

    1980-03-01

    An in-line enrichment monitor is being developed to provide real-time enrichment data for the gas-phase UF 6 feed stream of an enrichment plant. The nondestructive gamma-ray assay method can be used to determine the enrichment of natural UF 6 with a relative precision of better than 1% for a wide range of pressures

  5. Immune-related genetic enrichment in frontotemporal dementia: An analysis of genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Iris Broce

    2018-01-01

    derived 5. Functionally, we found that the expression of FTD-immune pleiotropic genes (particularly within the HLA region is altered in postmortem brain tissue from patients with FTD and is enriched in microglia/macrophages compared to other central nervous system cell types. The main study limitation is that the results represent only clinically diagnosed individuals. Also, given the complex interconnectedness of the HLA region, we were not able to define the specific gene or genes on Chr 6 responsible for our pleiotropic signal.We show immune-mediated genetic enrichment specifically in FTD, particularly within the HLA region. Our genetic results suggest that for a subset of patients, immune dysfunction may contribute to FTD risk. These findings have potential implications for clinical trials targeting immune dysfunction in patients with FTD.

  6. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Pugalendhi Ganesh Kumar

    Full Text Available This study describes a novel approach to reducing the challenges of highly nonlinear multiclass gene expression values for cancer diagnosis. To build a fruitful system for cancer diagnosis, in this study, we introduced two levels of gene selection such as filtering and embedding for selection of potential genes and the most relevant genes associated with cancer, respectively. The filter procedure was implemented by developing a fuzzy rough set (FR-based method for redefining the criterion function of f-information (FI to identify the potential genes without discretizing the continuous gene expression values. The embedded procedure is implemented by means of a water swirl algorithm (WSA, which attempts to optimize the rule set and membership function required to classify samples using a fuzzy-rule-based multiclassification system (FRBMS. Two novel update equations are proposed in WSA, which have better exploration and exploitation abilities while designing a self-learning FRBMS. The efficiency of our new approach was evaluated on 13 multicategory and 9 binary datasets of cancer gene expression. Additionally, the performance of the proposed FRFI-WSA method in designing an FRBMS was compared with existing methods for gene selection and optimization such as genetic algorithm (GA, particle swarm optimization (PSO, and artificial bee colony algorithm (ABC on all the datasets. In the global cancer map with repeated measurements (GCM_RM dataset, the FRFI-WSA showed the smallest number of 16 most relevant genes associated with cancer using a minimal number of 26 compact rules with the highest classification accuracy (96.45%. In addition, the statistical validation used in this study revealed that the biological relevance of the most relevant genes associated with cancer and their linguistics detected by the proposed FRFI-WSA approach are better than those in the other methods. The simple interpretable rules with most relevant genes and effectively

  7. Subtype-Specific Genes that Characterize Subpopulations of Callosal Projection Neurons in Mouse Identify Molecularly Homologous Populations in Macaque Cortex.

    Science.gov (United States)

    Fame, Ryann M; Dehay, Colette; Kennedy, Henry; Macklis, Jeffrey D

    2017-03-01

    Callosal projection neurons (CPN) interconnect the neocortical hemispheres via the corpus callosum and are implicated in associative integration of multimodal information. CPN have undergone differential evolutionary elaboration, leading to increased diversity of cortical neurons-and more extensive and varied connections in neocortical gray and white matter-in primates compared with rodents. In mouse, distinct sets of genes are enriched in discrete subpopulations of CPN, indicating the molecular diversity of rodent CPN. Elements of rodent CPN functional and organizational diversity might thus be present in the further elaborated primate cortex. We address the hypothesis that genes controlling mouse CPN subtype diversity might reflect molecular patterns shared among mammals that arose prior to the divergence of rodents and primates. We find that, while early expression of the examined CPN-enriched genes, and postmigratory expression of these CPN-enriched genes in deep layers are highly conserved (e.g., Ptn, Nnmt, Cited2, Dkk3), in contrast, the examined genes expressed by superficial layer CPN show more variable levels of conservation (e.g., EphA3, Chn2). These results suggest that there has been evolutionarily differential retraction and elaboration of superficial layer CPN subpopulations between mouse and macaque, with independent derivation of novel populations in primates. Together, these data inform future studies regarding CPN subpopulations that are unique to primates and rodents, and indicate putative evolutionary relationships. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Immune-related genetic enrichment in frontotemporal dementia: An analysis of genome-wide association studies.

    Science.gov (United States)

    Broce, Iris; Karch, Celeste M; Wen, Natalie; Fan, Chun C; Wang, Yunpeng; Tan, Chin Hong; Kouri, Naomi; Ross, Owen A; Höglinger, Günter U; Muller, Ulrich; Hardy, John; Momeni, Parastoo; Hess, Christopher P; Dillon, William P; Miller, Zachary A; Bonham, Luke W; Rabinovici, Gil D; Rosen, Howard J; Schellenberg, Gerard D; Franke, Andre; Karlsen, Tom H; Veldink, Jan H; Ferrari, Raffaele; Yokoyama, Jennifer S; Miller, Bruce L; Andreassen, Ole A; Dale, Anders M; Desikan, Rahul S; Sugrue, Leo P

    2018-01-01

    Converging evidence suggests that immune-mediated dysfunction plays an important role in the pathogenesis of frontotemporal dementia (FTD). Although genetic studies have shown that immune-associated loci are associated with increased FTD risk, a systematic investigation of genetic overlap between immune-mediated diseases and the spectrum of FTD-related disorders has not been performed. Using large genome-wide association studies (GWASs) (total n = 192,886 cases and controls) and recently developed tools to quantify genetic overlap/pleiotropy, we systematically identified single nucleotide polymorphisms (SNPs) jointly associated with FTD-related disorders-namely, FTD, corticobasal degeneration (CBD), progressive supranuclear palsy (PSP), and amyotrophic lateral sclerosis (ALS)-and 1 or more immune-mediated diseases including Crohn disease, ulcerative colitis (UC), rheumatoid arthritis (RA), type 1 diabetes (T1D), celiac disease (CeD), and psoriasis. We found up to 270-fold genetic enrichment between FTD and RA, up to 160-fold genetic enrichment between FTD and UC, up to 180-fold genetic enrichment between FTD and T1D, and up to 175-fold genetic enrichment between FTD and CeD. In contrast, for CBD and PSP, only 1 of the 6 immune-mediated diseases produced genetic enrichment comparable to that seen for FTD, with up to 150-fold genetic enrichment between CBD and CeD and up to 180-fold enrichment between PSP and RA. Further, we found minimal enrichment between ALS and the immune-mediated diseases tested, with the highest levels of enrichment between ALS and RA (up to 20-fold). For FTD, at a conjunction false discovery rate enriched in microglia/macrophages compared to other central nervous system cell types. The main study limitation is that the results represent only clinically diagnosed individuals. Also, given the complex interconnectedness of the HLA region, we were not able to define the specific gene or genes on Chr 6 responsible for our pleiotropic signal. We

  9. Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

    Directory of Open Access Journals (Sweden)

    Clive H Glover

    2006-11-01

    Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42 showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.

  10. The cure: design and evaluation of a crowdsourcing game for gene selection for breast cancer survival prediction.

    Science.gov (United States)

    Good, Benjamin M; Loguercio, Salvatore; Griffith, Obi L; Nanis, Max; Wu, Chunlei; Su, Andrew I

    2014-07-29

    Molecular signatures for predicting breast cancer prognosis could greatly improve care through personalization of treatment. Computational analyses of genome-wide expression datasets have identified such signatures, but these signatures leave much to be desired in terms of accuracy, reproducibility, and biological interpretability. Methods that take advantage of structured prior knowledge (eg, protein interaction networks) show promise in helping to define better signatures, but most knowledge remains unstructured. Crowdsourcing via scientific discovery games is an emerging methodology that has the potential to tap into human intelligence at scales and in modes unheard of before. The main objective of this study was to test the hypothesis that knowledge linking expression patterns of specific genes to breast cancer outcomes could be captured from players of an open, Web-based game. We envisioned capturing knowledge both from the player's prior experience and from their ability to interpret text related to candidate genes presented to them in the context of the game. We developed and evaluated an online game called The Cure that captured information from players regarding genes for use as predictors of breast cancer survival. Information gathered from game play was aggregated using a voting approach, and used to create rankings of genes. The top genes from these rankings were evaluated using annotation enrichment analysis, comparison to prior predictor gene sets, and by using them to train and test machine learning systems for predicting 10 year survival. Between its launch in September 2012 and September 2013, The Cure attracted more than 1000 registered players, who collectively played nearly 10,000 games. Gene sets assembled through aggregation of the collected data showed significant enrichment for genes known to be related to key concepts such as cancer, disease progression, and recurrence. In terms of the predictive accuracy of models trained using this

  11. INSIGHTS INTO PRE-ENRICHMENT OF STAR CLUSTERS AND SELF-ENRICHMENT OF DWARF GALAXIES FROM THEIR INTRINSIC METALLICITY DISPERSIONS

    International Nuclear Information System (INIS)

    Leaman, Ryan

    2012-01-01

    Star clusters are known to have smaller intrinsic metallicity spreads than dwarf galaxies due to their shorter star formation timescales. Here we use individual spectroscopic [Fe/H] measurements of stars in 19 Local Group dwarf galaxies, 13 Galactic open clusters, and 49 globular clusters to show that star cluster and dwarf galaxy linear metallicity distributions are binomial in form, with all objects showing strong correlations between their mean linear metallicity Z-bar and intrinsic spread in metallicity σ(Z) 2 . A plot of σ(Z) 2 versus Z-bar shows that the correlated relationships are offset for the dwarf galaxies from the star clusters. The common binomial nature of these linear metallicity distributions can be explained with a simple inhomogeneous chemical evolution model, where the star cluster and dwarf galaxy behavior in the σ(Z) 2 - Z-bar diagram is reproduced in terms of the number of enrichment events, covering fraction, and intrinsic size of the enriched regions. The inhomogeneity of the self-enrichment sets the slope for the observed dwarf galaxy σ(Z) 2 - Z-bar correlation. The offset of the star cluster sequence from that of the dwarf galaxies is due to pre-enrichment, and the slope of the star cluster sequence represents the remnant signature of the self-enriched history of their host galaxies. The offset can be used to separate star clusters from dwarf galaxies without a priori knowledge of their luminosity or dynamical mass. The application of the inhomogeneous model to the σ(Z) 2 - Z-bar relationship provides a numerical formalism to connect the self-enrichment and pre-enrichment between star clusters and dwarf galaxies using physically motivated chemical enrichment parameters. Therefore we suggest that the σ(Z) 2 - Z-bar relationship can provide insight into what drives the efficiency of star formation and chemical evolution in galaxies, and is an important prediction for galaxy simulation models to reproduce.

  12. Selection and validation of a set of reliable reference genes for quantitative sod gene expression analysis in C. elegans

    Directory of Open Access Journals (Sweden)

    Vandesompele Jo

    2008-01-01

    Full Text Available Abstract Background In the nematode Caenorhabditis elegans the conserved Ins/IGF-1 signaling pathway regulates many biological processes including life span, stress response, dauer diapause and metabolism. Detection of differentially expressed genes may contribute to a better understanding of the mechanism by which the Ins/IGF-1 signaling pathway regulates these processes. Appropriate normalization is an essential prerequisite for obtaining accurate and reproducible quantification of gene expression levels. The aim of this study was to establish a reliable set of reference genes for gene expression analysis in C. elegans. Results Real-time quantitative PCR was used to evaluate the expression stability of 12 candidate reference genes (act-1, ama-1, cdc-42, csq-1, eif-3.C, mdh-1, gpd-2, pmp-3, tba-1, Y45F10D.4, rgs-6 and unc-16 in wild-type, three Ins/IGF-1 pathway mutants, dauers and L3 stage larvae. After geNorm analysis, cdc-42, pmp-3 and Y45F10D.4 showed the most stable expression pattern and were used to normalize 5 sod expression levels. Significant differences in mRNA levels were observed for sod-1 and sod-3 in daf-2 relative to wild-type animals, whereas in dauers sod-1, sod-3, sod-4 and sod-5 are differentially expressed relative to third stage larvae. Conclusion Our findings emphasize the importance of accurate normalization using stably expressed reference genes. The methodology used in this study is generally applicable to reliably quantify gene expression levels in the nematode C. elegans using quantitative PCR.

  13. Oxygen and tissue culture affect placental gene expression.

    Science.gov (United States)

    Brew, O; Sullivan, M H F

    2017-07-01

    Placental explant culture is an important model for studying placental development and functions. We investigated the differences in placental gene expression in response to tissue culture, atmospheric and physiologic oxygen concentrations. Placental explants were collected from normal term (38-39 weeks of gestation) placentae with no previous uterine contractile activity. Placental transcriptomic expressions were evaluated with GeneChip ® Human Genome U133 Plus 2.0 arrays (Affymetrix). We uncovered sub-sets of genes that regulate response to stress, induction of apoptosis programmed cell death, mis-regulation of cell growth, proliferation, cell morphogenesis, tissue viability, and protection from apoptosis in cultured placental explants. We also identified a sub-set of genes with highly unstable pattern of expression after exposure to tissue culture. Tissue culture irrespective of oxygen concentration induced dichotomous increase in significant gene expression and increased enrichment of significant pathways and transcription factor targets (TFTs) including HIF1A. The effect was exacerbated by culture at atmospheric oxygen concentration, where further up-regulation of TFTs including PPARA, CEBPD, HOXA9 and down-regulated TFTs such as JUND/FOS suggest intrinsic heightened key biological and metabolic mechanisms such as glucose use, lipid biosynthesis, protein metabolism; apoptosis, inflammatory responses; and diminished trophoblast proliferation, differentiation, invasion, regeneration, and viability. These findings demonstrate that gene expression patterns differ between pre-culture and cultured explants, and the gene expression of explants cultured at atmospheric oxygen concentration favours stressed, pro-inflammatory and increased apoptotic transcriptomic response. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Phylogenetic and functional diversity within toluene-degrading, sulphate-reducing consortia enriched from a contaminated aquifer.

    Science.gov (United States)

    Kuppardt, Anke; Kleinsteuber, Sabine; Vogt, Carsten; Lüders, Tillmann; Harms, Hauke; Chatzinotas, Antonis

    2014-08-01

    Three toluene-degrading microbial consortia were enriched under sulphate-reducing conditions from different zones of a benzene, toluene, ethylbenzene and xylenes (BTEX) plume of two connected contaminated aquifers. Two cultures were obtained from a weakly contaminated zone of the lower aquifer, while one culture originated from the highly contaminated upper aquifer. We hypothesised that the different habitat characteristics are reflected by distinct degrader populations. Degradation of toluene with concomitant production of sulphide was demonstrated in laboratory microcosms and the enrichment cultures were phylogenetically characterised. The benzylsuccinate synthase alpha-subunit (bssA) marker gene, encoding the enzyme initiating anaerobic toluene degradation, was targeted to characterise the catabolic diversity within the enrichment cultures. It was shown that the hydrogeochemical parameters in the different zones of the plume determined the microbial composition of the enrichment cultures. Both enrichment cultures from the weakly contaminated zone were of a very similar composition, dominated by Deltaproteobacteria with the Desulfobulbaceae (a Desulfopila-related phylotype) as key players. Two different bssA sequence types were found, which were both affiliated to genes from sulphate-reducing Deltaproteobacteria. In contrast, the enrichment culture from the highly contaminated zone was dominated by Clostridia with a Desulfosporosinus-related phylotype as presumed key player. A distinct bssA sequence type with high similarity to other recently detected sequences from clostridial toluene degraders was dominant in this culture. This work contributes to our understanding of the niche partitioning between degrader populations in distinct compartments of BTEX-contaminated aquifers.

  15. Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data

    Directory of Open Access Journals (Sweden)

    de los Reyes Benildo G

    2008-04-01

    Full Text Available Abstract Background Integrating data from multiple global assays and curated databases is essential to understand the spatio-temporal interactions within cells. Different experiments measure cellular processes at various widths and depths, while databases contain biological information based on established facts or published data. Integrating these complementary datasets helps infer a mutually consistent transcriptional regulatory network (TRN with strong similarity to the structure of the underlying genetic regulatory modules. Decomposing the TRN into a small set of recurring regulatory patterns, called network motifs (NM, facilitates the inference. Identifying NMs defined by specific transcription factors (TF establishes the framework structure of a TRN and allows the inference of TF-target gene relationship. This paper introduces a computational framework for utilizing data from multiple sources to infer TF-target gene relationships on the basis of NMs. The data include time course gene expression profiles, genome-wide location analysis data, binding sequence data, and gene ontology (GO information. Results The proposed computational framework was tested using gene expression data associated with cell cycle progression in yeast. Among 800 cell cycle related genes, 85 were identified as candidate TFs and classified into four previously defined NMs. The NMs for a subset of TFs are obtained from literature. Support vector machine (SVM classifiers were used to estimate NMs for the remaining TFs. The potential downstream target genes for the TFs were clustered into 34 biologically significant groups. The relationships between TFs and potential target gene clusters were examined by training recurrent neural networks whose topologies mimic the NMs to which the TFs are classified. The identified relationships between TFs and gene clusters were evaluated using the following biological validation and statistical analyses: (1 Gene set enrichment

  16. Gene expression changes in the course of normal brain aging are sexually dimorphic

    Science.gov (United States)

    Berchtold, Nicole C.; Cribbs, David H.; Coleman, Paul D.; Rogers, Joseph; Head, Elizabeth; Kim, Ronald; Beach, Tom; Miller, Carol; Troncoso, Juan; Trojanowski, John Q.; Zielke, H. Ronald; Cotman, Carl W.

    2008-01-01

    Gene expression profiles were assessed in the hippocampus, entorhinal cortex, superior-frontal gyrus, and postcentral gyrus across the lifespan of 55 cognitively intact individuals aged 20–99 years. Perspectives on global gene changes that are associated with brain aging emerged, revealing two overarching concepts. First, different regions of the forebrain exhibited substantially different gene profile changes with age. For example, comparing equally powered groups, 5,029 probe sets were significantly altered with age in the superior-frontal gyrus, compared with 1,110 in the entorhinal cortex. Prominent change occurred in the sixth to seventh decades across cortical regions, suggesting that this period is a critical transition point in brain aging, particularly in males. Second, clear gender differences in brain aging were evident, suggesting that the brain undergoes sexually dimorphic changes in gene expression not only in development but also in later life. Globally across all brain regions, males showed more gene change than females. Further, Gene Ontology analysis revealed that different categories of genes were predominantly affected in males vs. females. Notably, the male brain was characterized by global decreased catabolic and anabolic capacity with aging, with down-regulated genes heavily enriched in energy production and protein synthesis/transport categories. Increased immune activation was a prominent feature of aging in both sexes, with proportionally greater activation in the female brain. These data open opportunities to explore age-dependent changes in gene expression that set the balance between neurodegeneration and compensatory mechanisms in the brain and suggest that this balance is set differently in males and females, an intriguing idea. PMID:18832152

  17. A comprehensive family-based replication study of schizophrenia genes

    DEFF Research Database (Denmark)

    Aberg, Karolina A; Liu, Youfang; Bukszár, Jozsef

    2013-01-01

     768 control subjects from 6 databases and, after quality control 6298 individuals (including 3286 cases) from 1811 nuclear families. MAIN OUTCOMES AND MEASURES Case-control status for SCZ. RESULTS Replication results showed a highly significant enrichment of SNPs with small P values. Of the SNPs...... in an independent family-based replication study that, after quality control, consisted of 8107 SNPs. SETTING Linkage meta-analysis, brain transcriptome meta-analysis, candidate gene database, OMIM, relevant mouse studies, and expression quantitative trait locus databases. PATIENTS We included 11 185 cases and 10...

  18. Improved detection of Burkholderia pseudomallei from non-blood clinical specimens using enrichment culture and PCR: narrowing diagnostic gap in resource-constrained settings.

    Science.gov (United States)

    Tellapragada, Chaitanya; Shaw, Tushar; D'Souza, Annet; Eshwara, Vandana Kalwaje; Mukhopadhyay, Chiranjay

    2017-07-01

    To evaluate the diagnostic utility of enrichment culture and PCR for improved case detection rates of non-bacteraemic form of melioidosis in limited resource settings. Clinical specimens (n = 525) obtained from patients presenting at a tertiary care hospital of South India with clinical symptoms suggestive of community-acquired pneumonia, lower respiratory tract infections, superficial or internal abscesses, chronic skin ulcers and bone or joint infections were tested for the presence of Burkholderia pseudomallei using conventional culture (CC), enrichment culture (EC) and PCR. Sensitivity, specificity, positive and negative predictive values of CC and PCR were initially deduced using EC as the gold standard method. Further, diagnostic accuracies of all the three methods were analysed using Bayesian latent class modelling (BLCM). Detection rates of B. pseudomallei using CC, EC and PCR were 3.8%, 5.3% and 6%, respectively. Diagnostic sensitivities and specificities of CC and PCR were 71.4, 98.4% and 100 and 99.4%, respectively in comparison with EC as the gold standard test. With Bayesian latent class modelling, EC and PCR demonstrated sensitivities of 98.7 and 99.3%, respectively, while CC showed a sensitivity of 70.3% for detection of B. pseudomallei. An increase of 1.6% (95% CI: 1.08-4.32%) in the case detection rate of melioidosis was observed in the study population when EC and/or PCR were used in adjunct to the conventional culture technique. Our study findings underscore the diagnostic superiority of enrichment culture and/or PCR over conventional microbiological culture for improved case detection of melioidosis from non-blood clinical specimens. © 2017 John Wiley & Sons Ltd.

  19. Identification of Novel Gene Targets and Putative Regulators of Arsenic-Associated DNA Methylation in Human Urothelial Cells and Bladder Cancer

    Science.gov (United States)

    Rager, Julia E.; Miller, Sloane; Tulenko, Samantha E.; Smeester, Lisa; Ray, Paul D.; Yosim, Andrew; Currier, Jenna M.; Ishida, María C.; González-Horta, Maria del Carmen; Sánchez-Ramírez, Blanca; Ballinas-Casarrubias, Lourdes; Gutiérrez-Torres, Daniela S.; Drobná, Zuzana; Del Razo, Luz M.; García-Vargas, Gonzalo G.; Kim, William Y.; Zhou, Yi-Hui; Wright, Fred A.; Stýblo, Miroslav; Fry, Rebecca C.

    2016-01-01

    There is strong epidemiologic evidence linking chronic exposure to inorganic arsenic (iAs) to a myriad of adverse health effects, including cancer of the bladder. The present study set out to identify DNA methylation patterns associated with iAs and its metabolites in exfoliated urothelial cells (EUCs) that originate primarily from the urinary bladder, one of the targets of arsenic (As)-induced carcinogenesis. Genome-wide, gene-specific promoter DNA methylation levels were assessed in EUCs from 46 residents of Chihuahua, Mexico, and the relationship was examined between promoter methylation profiles and the intracellular concentrations of total As (tAs) and As species. A set of 49 differentially methylated genes was identified with increased promoter methylation associated with EUC tAs, iAs, and/or monomethylated As (MMAs) enriched for their roles in metabolic disease and cancer. Notably, no genes had differential methylation associated with EUC dimethylated As (DMAs), suggesting that DMAs may influence DNA methylation-mediated urothelial cell responses to a lesser extent than iAs or MMAs. Further analysis showed that 22 of the 49 As-associated genes (45%) are also differentially methylated in bladder cancer tissue identified using The Cancer Genome Atlas repository. Both the As- and cancer-associated genes are enriched for the binding sites of common transcription factors known to play roles in carcinogenesis, demonstrating a novel potential mechanistic link between iAs exposure and bladder cancer. PMID:26039340

  20. Identification of transcriptional factors and key genes in primary osteoporosis by DNA microarray.

    Science.gov (United States)

    Xie, Wengui; Ji, Lixin; Zhao, Teng; Gao, Pengfei

    2015-05-09

    A number of genes have been identified to be related with primary osteoporosis while less is known about the comprehensive interactions between regulating genes and proteins. We aimed to identify the differentially expressed genes (DEGs) and regulatory effects of transcription factors (TFs) involved in primary osteoporosis. The gene expression profile GSE35958 was obtained from Gene Expression Omnibus database, including 5 primary osteoporosis and 4 normal bone tissues. The differentially expressed genes between primary osteoporosis and normal bone tissues were identified by the same package in R language. The TFs of these DEGs were predicted with the Essaghir A method. DAVID (The Database for Annotation, Visualization and Integrated Discovery) was applied to perform the GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis of DEGs. After analyzing regulatory effects, a regulatory network was built between TFs and the related DEGs. A total of 579 DEGs was screened, including 310 up-regulated genes and 269 down-regulated genes in primary osteoporosis samples. In GO terms, more up-regulated genes were enriched in transcription regulator activity, and secondly in transcription factor activity. A total 10 significant pathways were enriched in KEGG analysis, including colorectal cancer, Wnt signaling pathway, Focal adhesion, and MAPK signaling pathway. Moreover, total 7 TFs were enriched, of which CTNNB1, SP1, and TP53 regulated most up-regulated DEGs. The discovery of the enriched TFs might contribute to the understanding of the mechanism of primary osteoporosis. Further research on genes and TFs related to the WNT signaling pathway and MAPK pathway is urgent for clinical diagnosis and directing treatment of primary osteoporosis.

  1. Symbiont modulates expression of specific gene categories in Angomonas deanei

    Directory of Open Access Journals (Sweden)

    Luciana Loureiro Penha

    Full Text Available Trypanosomatids are parasites that cause disease in humans, animals, and plants. Most are non-pathogenic and some harbor a symbiotic bacterium. Endosymbiosis is part of the evolutionary process of vital cell functions such as respiration and photosynthesis. Angomonas deanei is an example of a symbiont-containing trypanosomatid. In this paper, we sought to investigate how symbionts influence host cells by characterising and comparing the transcriptomes of the symbiont-containing A. deanei (wild type and the symbiont-free aposymbiotic strains. The comparison revealed that the presence of the symbiont modulates several differentially expressed genes. Empirical analysis of differential gene expression showed that 216 of the 7625 modulated genes were significantly changed. Finally, gene set enrichment analysis revealed that the largest categories of genes that downregulated in the absence of the symbiont were those involved in oxidation-reduction process, ATP hydrolysis coupled proton transport and glycolysis. In contrast, among the upregulated gene categories were those involved in proteolysis, microtubule-based movement, and cellular metabolic process. Our results provide valuable information for dissecting the mechanism of endosymbiosis in A. deanei.

  2. Domestication rewired gene expression and nucleotide diversity patterns in tomato.

    Science.gov (United States)

    Sauvage, Christopher; Rau, Andrea; Aichholz, Charlotte; Chadoeuf, Joël; Sarah, Gautier; Ruiz, Manuel; Santoni, Sylvain; Causse, Mathilde; David, Jacques; Glémin, Sylvain

    2017-08-01

    Plant domestication has led to considerable phenotypic modifications from wild species to modern varieties. However, although changes in key traits have been well documented, less is known about the underlying molecular mechanisms, such as the reduction of molecular diversity or global gene co-expression patterns. In this study, we used a combination of gene expression and population genetics in wild and crop tomato to decipher the footprints of domestication. We found a set of 1729 differentially expressed genes (DEG) between the two genetic groups, belonging to 17 clusters of co-expressed DEG, suggesting that domestication affected not only individual genes but also regulatory networks. Five co-expression clusters were enriched in functional terms involving carbohydrate metabolism or epigenetic regulation of gene expression. We detected differences in nucleotide diversity between the crop and wild groups specific to DEG. Our study provides an extensive profiling of the rewiring of gene co-expression induced by the domestication syndrome in one of the main crop species. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  3. Impact of methoxyacetic acid on mouse Leydig cell gene expression

    Directory of Open Access Journals (Sweden)

    Waxman David J

    2010-06-01

    Full Text Available Abstract Background Methoxyacetic acid (MAA is the active metabolite of the widely used industrial chemical ethylene glycol monomethyl ether, which is associated with various developmental and reproductive toxicities, including neural toxicity, blood and immune disorders, limb degeneration and testicular toxicity. Testicular toxicity is caused by degeneration of germ cells in association with changes in gene expression in both germ cells and Sertoli cells of the testis. This study investigates the impact of MAA on gene expression in testicular Leydig cells, which play a critical role in germ cell survival and male reproductive function. Methods Cultured mouse TM3 Leydig cells were treated with MAA for 3, 8, and 24 h and changes in gene expression were monitored by genome-wide transcriptional profiling. Results A total of 3,912 MAA-responsive genes were identified. Ingenuity Pathway analysis identified reproductive system disease, inflammatory disease and connective tissue disorder as the top biological functions affected by MAA. The MAA-responsive genes were classified into 1,366 early responders, 1,387 mid-responders, and 1,138 late responders, based on the time required for MAA to elicit a response. Analysis of enriched functional clusters for each subgroup identified 106 MAA early response genes involved in transcription regulation, including 32 genes associated with developmental processes. 60 DNA-binding proteins responded to MAA rapidly but transiently, and may contribute to the downstream effects of MAA seen for many mid and late response genes. Genes within the phosphatidylinositol/phospholipase C/calcium signaling pathway, whose activity is required for potentiation of nuclear receptor signaling by MAA, were also enriched in the set of early MAA response genes. In contrast, many of the genes responding to MAA at later time points encode membrane proteins that contribute to cell adhesion and membrane signaling. Conclusions These findings

  4. High-solids enrichment of thermophilic microbial communities and their enzymes on bioenergy feedstocks

    Energy Technology Data Exchange (ETDEWEB)

    Reddy, A. P.; Allgaier, M.; Singer, S.W.; Hazen, T.C.; Simmons, B.A.; Hugenholtz, P.; VanderGheynst, J.S.

    2011-04-01

    Thermophilic microbial communities that are active in a high-solids environment offer great potential for the discovery of industrially relevant enzymes that efficiently deconstruct bioenergy feedstocks. In this study, finished green waste compost was used as an inoculum source to enrich microbial communities and associated enzymes that hydrolyze cellulose and hemicellulose during thermophilic high-solids fermentation of the bioenergy feedstocks switchgrass and corn stover. Methods involving the disruption of enzyme and plant cell wall polysaccharide interactions were developed to recover xylanase and endoglucanase activity from deconstructed solids. Xylanase and endoglucanase activity increased by more than a factor of 5, upon four successive enrichments on switchgrass. Overall, the changes for switchgrass were more pronounced than for corn stover; solids reduction between the first and second enrichments increased by a factor of four for switchgrass while solids reduction remained relatively constant for corn stover. Amplicon pyrosequencing analysis of small-subunit ribosomal RNA genes recovered from enriched samples indicated rapid changes in the microbial communities between the first and second enrichment with the simplified communities achieved by the third enrichment. The results demonstrate a successful approach for enrichment of unique microbial communities and enzymes active in a thermophilic high-solids environment.

  5. The Dynamics of Visual Art Dialogues: Experiences to Be Used in Hospital Settings with Visual Art Enrichment

    Directory of Open Access Journals (Sweden)

    Britt-Maj Wikström

    2011-01-01

    Full Text Available Objectives. Given that hospitals have environmental enrichment with paintings and visual art arrangement, it would be meaningful to develop and document how hospital art could be used by health professionals. Methods. The study was undertaken at an art site in Sweden. During 1-hour sessions, participants (=20 get together in an art gallery every second week five times. Results. According to the participants a new value was perceived. From qualitative analyses, three themes appear: raise association, mentally present, and door-opener. In addition 72% of the participants reported makes me happy and gives energy and inspiration, and 52% reported that dialogues increase inspiration, make you involved, and stimulate curiosity. Conclusion. The present study supported the view that visual art dialogue could be used by health care professionals in a structured manner and that meaningful art stimulation, related to a person’s experiences, could be of importance for the patients. Implementing art dialogues in hospital settings could be a fruitful working tool for nurses, a complementary manner of patient communication.

  6. Uranium enrichment

    International Nuclear Information System (INIS)

    Rae, H.K.; Melvin, J.G.

    1988-06-01

    Canada is the world's largest producer and exporter of uranium, most of which is enriched elsewhere for use as fuel in LWRs. The feasibility of a Canadian uranium-enrichment enterprise is therefore a perennial question. Recent developments in uranium-enrichment technology, and their likely impacts on separative work supply and demand, suggest an opportunity window for Canadian entry into this international market. The Canadian opportunity results from three particular impacts of the new technologies: 1) the bulk of the world's uranium-enrichment capacity is in gaseous diffusion plants which, because of their large requirements for electricity (more than 2000 kW·h per SWU), are vulnerable to competition from the new processes; 2) the decline in enrichment costs increases the economic incentive for the use of slightly-enriched uranium (SEU) fuel in CANDU reactors, thus creating a potential Canadian market; and 3) the new processes allow economic operation on a much smaller scale, which drastically reduces the investment required for market entry and is comparable with the potential Canadian SEU requirement. The opportunity is not open-ended. By the end of the century the enrichment supply industry will have adapted to the new processes and long-term customer/supplier relationships will have been established. In order to seize the opportunity, Canada must become a credible supplier during this century

  7. Identification of estrogen target genes during zebrafish embryonic development through transcriptomic analysis.

    Directory of Open Access Journals (Sweden)

    Ruixin Hao

    Full Text Available Estrogen signaling is important for vertebrate embryonic development. Here we have used zebrafish (Danio rerio as a vertebrate model to analyze estrogen signaling during development. Zebrafish embryos were exposed to 1 µM 17β-estradiol (E2 or vehicle from 3 hours to 4 days post fertilization (dpf, harvested at 1, 2, 3 and 4 dpf, and subjected to RNA extraction for transcriptome analysis using microarrays. Differentially expressed genes by E2-treatment were analyzed with hierarchical clustering followed by biological process and tissue enrichment analysis. Markedly distinct sets of genes were up and down-regulated by E2 at the four different time points. Among these genes, only the well-known estrogenic marker vtg1 was co-regulated at all time points. Despite this, the biological functional categories targeted by E2 were relatively similar throughout zebrafish development. According to knowledge-based tissue enrichment, estrogen responsive genes were clustered mainly in the liver, pancreas and brain. This was in line with the developmental dynamics of estrogen-target tissues that were visualized using transgenic zebrafish containing estrogen responsive elements driving the expression of GFP (Tg(5xERE:GFP. Finally, the identified embryonic estrogen-responsive genes were compared to already published estrogen-responsive genes identified in male adult zebrafish (Gene Expression Omnibus database. The expressions of a few genes were co-regulated by E2 in both embryonic and adult zebrafish. These could potentially be used as estrogenic biomarkers for exposure to estrogens or estrogenic endocrine disruptors in zebrafish. In conclusion, our data suggests that estrogen effects on early embryonic zebrafish development are stage- and tissue- specific.

  8. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.

    2013-01-01

    Identification of bacterial modulons from series of gene expression measurements on microarrays is a principal problem, especially relevant for inadequately studied but practically important species. Usage of a priori information on regulatory interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set of genes essential for a regulon is used to control modulon updating. Essential genes for a regulon were selected as a subset of regulon genes highly related by different measures to each other. Using Escherichia coli as a model, we studied how modulon identification depends on the data, including the microarray experiments set, the adopted relevance measure and the regulon itself. We have found that results of modulon identification are highly dependent on all parameters studied and thus the resulting modulon varies substantially depending on the identification procedure. Yet, modulons that were identified correctly displayed higher stability during iterations, which allows developing a procedure for reliable modulon identification in the case of less studied species where the known regulatory interactions are sparse. Copyright © 2013 Taylor & Francis.

  9. Early repositioning through compound set enrichment analysis: a knowledge-recycling strategy.

    Science.gov (United States)

    Temesi, Gergely; Bolgár, Bence; Arany, Adám; Szalai, Csaba; Antal, Péter; Mátyus, Péter

    2014-04-01

    Despite famous serendipitous drug repositioning success stories, systematic projects have not yet delivered the expected results. However, repositioning technologies are gaining ground in different phases of routine drug development, together with new adaptive strategies. We demonstrate the power of the compound information pool, the ever-growing heterogeneous information repertoire of approved drugs and candidates as an invaluable catalyzer in this transition. Systematic, computational utilization of this information pool for candidates in early phases is an open research problem; we propose a novel application of the enrichment analysis statistical framework for fusion of this information pool, specifically for the prediction of indications. Pharmaceutical consequences are formulated for a systematic and continuous knowledge recycling strategy, utilizing this information pool throughout the drug-discovery pipeline.

  10. A transcriptome-wide study on the microRNA- and the Argonaute 1-enriched small RNA-mediated regulatory networks involved in plant leaf senescence.

    Science.gov (United States)

    Qin, J; Ma, X; Yi, Z; Tang, Z; Meng, Y

    2016-03-01

    Leaf senescence is an important physiological process during the plant life cycle. However, systemic studies on the impact of microRNAs (miRNAs) on the expression of senescence-associated genes (SAGs) are lacking. Besides, whether other Argonaute 1 (AGO1)-enriched small RNAs (sRNAs) play regulatory roles in leaf senescence remains unclear. In this study, a total of 5,123 and 1,399 AGO1-enriched sRNAs, excluding miRNAs, were identified in Arabidopsis thaliana and rice (Oryza sativa), respectively. After retrieving SAGs from the Leaf Senescence Database, all of the AGO1-enriched sRNAs and the miRBase-registered miRNAs of these two plants were included for target identification. Supported by degradome signatures, 200 regulatory pairs involving 120 AGO1-enriched sRNAs and 40 SAGs, and 266 regulatory pairs involving 64 miRNAs and 42 SAGs were discovered in Arabidopsis. Moreover, 13 genes predicted to interact with some of the above-identified target genes at protein level were validated as regulated by 17 AGO1-enriched sRNAs and ten miRNAs in Arabidopsis. In rice, only one SAG was targeted by three AGO1-enriched sRNAs, and one SAG was targeted by miR395. However, five AGO1-enriched sRNAs were conserved between Arabidopsis and rice. Target genes conserved between the two plants were identified for three of the above five sRNAs, pointing to the conserved roles of these regulatory pairs in leaf senescence or other developmental procedures. Novel targets were discovered for three of the five AGO1-enriched sRNAs in rice, indicating species-specific functions of these sRNA-target pairs. These results could advance our understanding of the sRNA-involved molecular processes modulating leaf senescence. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.

  11. A compendium of canine normal tissue gene expression.

    Directory of Open Access Journals (Sweden)

    Joseph Briggs

    Full Text Available BACKGROUND: Our understanding of disease is increasingly informed by changes in gene expression between normal and abnormal tissues. The release of the canine genome sequence in 2005 provided an opportunity to better understand human health and disease using the dog as clinically relevant model. Accordingly, we now present the first genome-wide, canine normal tissue gene expression compendium with corresponding human cross-species analysis. METHODOLOGY/PRINCIPAL FINDINGS: The Affymetrix platform was utilized to catalogue gene expression signatures of 10 normal canine tissues including: liver, kidney, heart, lung, cerebrum, lymph node, spleen, jejunum, pancreas and skeletal muscle. The quality of the database was assessed in several ways. Organ defining gene sets were identified for each tissue and functional enrichment analysis revealed themes consistent with known physio-anatomic functions for each organ. In addition, a comparison of orthologous gene expression between matched canine and human normal tissues uncovered remarkable similarity. To demonstrate the utility of this dataset, novel canine gene annotations were established based on comparative analysis of dog and human tissue selective gene expression and manual curation of canine probeset mapping. Public access, using infrastructure identical to that currently in use for human normal tissues, has been established and allows for additional comparisons across species. CONCLUSIONS/SIGNIFICANCE: These data advance our understanding of the canine genome through a comprehensive analysis of gene expression in a diverse set of tissues, contributing to improved functional annotation that has been lacking. Importantly, it will be used to inform future studies of disease in the dog as a model for human translational research and provides a novel resource to the community at large.

  12. Nature versus nurture: A systematic approach to elucidate gene-environment interactions in the development of myopic refractive errors.

    Science.gov (United States)

    Miraldi Utz, Virginia

    2017-01-01

    Myopia is the most common eye disorder and major cause of visual impairment worldwide. As the incidence of myopia continues to rise, the need to further understand the complex roles of molecular and environmental factors controlling variation in refractive error is of increasing importance. Tkatchenko and colleagues applied a systematic approach using a combination of gene set enrichment analysis, genome-wide association studies, and functional analysis of a murine model to identify a myopia susceptibility gene, APLP2. Differential expression of refractive error was associated with time spent reading for those with low frequency variants in this gene. This provides support for the longstanding hypothesis of gene-environment interactions in refractive error development.

  13. Transcriptome analysis by GeneTrail revealed regulation of functional categories in response to alterations of iron homeostasis in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Lenhof Hans-Peter

    2011-05-01

    Full Text Available Abstract Background High-throughput technologies have opened new avenues to study biological processes and pathways. The interpretation of the immense amount of data sets generated nowadays needs to be facilitated in order to enable biologists to identify complex gene networks and functional pathways. To cope with this task multiple computer-based programs have been developed. GeneTrail is a freely available online tool that screens comparative transcriptomic data for differentially regulated functional categories and biological pathways extracted from common data bases like KEGG, Gene Ontology (GO, TRANSPATH and TRANSFAC. Additionally, GeneTrail offers a feature that allows screening of individually defined biological categories that are relevant for the respective research topic. Results We have set up GeneTrail for the use of Arabidopsis thaliana. To test the functionality of this tool for plant analysis, we generated transcriptome data of root and leaf responses to Fe deficiency and the Arabidopsis metal homeostasis mutant nas4x-1. We performed Gene Set Enrichment Analysis (GSEA with eight meaningful pairwise comparisons of transcriptome data sets. We were able to uncover several functional pathways including metal homeostasis that were affected in our experimental situations. Representation of the differentially regulated functional categories in Venn diagrams uncovered regulatory networks at the level of whole functional pathways. Over-Representation Analysis (ORA of differentially regulated genes identified in pairwise comparisons revealed specific functional plant physiological categories as major targets upon Fe deficiency and in nas4x-1. Conclusion Here, we obtained supporting evidence, that the nas4x-1 mutant was defective in metal homeostasis. It was confirmed that nas4x-1 showed Fe deficiency in roots and signs of Fe deficiency and Fe sufficiency in leaves. Besides metal homeostasis, biotic stress, root carbohydrate, leaf

  14. The precise regulation of different COR genes by individual CBF transcription factors in Arabidopsis thaliana.

    Science.gov (United States)

    Shi, Yihao; Huang, Jiaying; Sun, Tianshu; Wang, Xuefei; Zhu, Chenqi; Ai, Yuxi; Gu, Hongya

    2017-02-01

    The transcription factors CBF1/2/3 are reported to play a dominant role in the cold responsive network of Arabidopsis by directly regulating the expression levels of cold responsive (COR) genes. In this study, we obtained CRISPR/Cas9-mediated loss-of-function mutants of cbf1∼3. Over 3,000 COR genes identified by RNA-seq analysis showed a slight but significant change in their expression levels in the mutants compared to the wild-type plants after being treated at 4 °C for 12 h. The C-repeat (CRT) motif (5'-CCGAC-3') was enriched in promoters of genes that were up-regulated by CBF2 and CBF3 but not in promoters of genes up-regulated by CBF1. These data suggest that CBF2 and CBF3 play a more important role in directing the cold response by regulating different sets of downstream COR genes. More than 2/3 of COR genes were co-regulated by two or three CBFs and were involved mainly in cellular signal transduction and metabolic processes; less than 1/3 of the genes were regulated by one CBF, and those genes up-regulated were enriched in cold-related abiotic stress responses. Our results indicate that CBFs play an important role in the trade-off between cold tolerance and plant growth through the precise regulation of COR genes in the complicated transcriptional network. © 2016 The Authors. Journal of Integrative Plant Biology Published by John Wiley & Sons Australia, Ltd on behalf of Institute of Botany, Chinese Academy of Sciences.

  15. Advanced enrichment techniques

    International Nuclear Information System (INIS)

    Johnson, A.

    1988-01-01

    BNFL is in a unique position in that it has commercial experience of diffusion enrichment, and of centrifuge enrichment through its associate company Urenco. In addition BNFL is developing laser enrichment techniques as part of a UK development programme in this area. The paper describes the development programme which led to the introduction of competitive centrifuge enrichment technology by Urenco and discusses the areas where improvements have and will continue to be made in the centrifuge process. It also describes the laser development programme currently being undertaken in the UK. The paper concludes by discussing the relative merits of the various methods of uranium enrichment, with particular reference to the enrichment market likely to obtain over the rest of the century

  16. Advanced enrichment techniques

    International Nuclear Information System (INIS)

    Johnson, A.

    1987-01-01

    BNFL is in a unique position in that it has commercial experience of diffusion enrichment, and of centrifuge enrichment through its associate company Urenco. In addition BNFL is developing laser enrichment techniques as part of a UK development programme in this area. The paper describes the development programme which led to the introduction of competitive centrifuge enrichment technology by Urenco and discusses the areas where improvements have and will continue to be made in the centrifuge process. It also describes the laser development programme currently being undertaken in the UK. The paper concludes by discussing the relative merits of the various methods of uranium enrichment, with particular reference to the enrichment market likely to obtain over the rest of the century. (author)

  17. Understanding Epistatic Interactions between Genes Targeted by Non-coding Regulatory Elements in Complex Diseases

    Directory of Open Access Journals (Sweden)

    Min Kyung Sung

    2014-12-01

    Full Text Available Genome-wide association studies have proven the highly polygenic architecture of complex diseases or traits; therefore, single-locus-based methods are usually unable to detect all involved loci, especially when individual loci exert small effects. Moreover, the majority of associated single-nucleotide polymorphisms resides in non-coding regions, making it difficult to understand their phenotypic contribution. In this work, we studied epistatic interactions associated with three common diseases using Korea Association Resource (KARE data: type 2 diabetes mellitus (DM, hypertension (HT, and coronary artery disease (CAD. We showed that epistatic single-nucleotide polymorphisms (SNPs were enriched in enhancers, as well as in DNase I footprints (the Encyclopedia of DNA Elements [ENCODE] Project Consortium 2012, which suggested that the disruption of the regulatory regions where transcription factors bind may be involved in the disease mechanism. Accordingly, to identify the genes affected by the SNPs, we employed whole-genome multiple-cell-type enhancer data which discovered using DNase I profiles and Cap Analysis Gene Expression (CAGE. Assigned genes were significantly enriched in known disease associated gene sets, which were explored based on the literature, suggesting that this approach is useful for detecting relevant affected genes. In our knowledge-based epistatic network, the three diseases share many associated genes and are also closely related with each other through many epistatic interactions. These findings elucidate the genetic basis of the close relationship between DM, HT, and CAD.

  18. Recent adaptive events in human brain revealed by meta-analysis of positively selected genes.

    Directory of Open Access Journals (Sweden)

    Yue Huang

    Full Text Available BACKGROUND AND OBJECTIVES: Analysis of positively-selected genes can help us understand how human evolved, especially the evolution of highly developed cognitive functions. However, previous works have reached conflicting conclusions regarding whether human neuronal genes are over-represented among genes under positive selection. METHODS AND RESULTS: We divided positively-selected genes into four groups according to the identification approaches, compiling a comprehensive list from 27 previous studies. We showed that genes that are highly expressed in the central nervous system are enriched in recent positive selection events in human history identified by intra-species genomic scan, especially in brain regions related to cognitive functions. This pattern holds when different datasets, parameters and analysis pipelines were used. Functional category enrichment analysis supported these findings, showing that synapse-related functions are enriched in genes under recent positive selection. In contrast, immune-related functions, for instance, are enriched in genes under ancient positive selection revealed by inter-species coding region comparison. We further demonstrated that most of these patterns still hold even after controlling for genomic characteristics that might bias genome-wide identification of positively-selected genes including gene length, gene density, GC composition, and intensity of negative selection. CONCLUSION: Our rigorous analysis resolved previous conflicting conclusions and revealed recent adaptation of human brain functions.

  19. Juvenile psittacine environmental enrichment.

    Science.gov (United States)

    Simone-Freilicher, Elisabeth; Rupley, Agnes E

    2015-05-01

    Environmental enrichment is of great import to the emotional, intellectual, and physical development of the juvenile psittacine and their success in the human home environment. Five major types of enrichment include social, occupational, physical, sensory, and nutritional. Occupational enrichment includes exercise and psychological enrichment. Physical enrichment includes the cage and accessories and the external home environment. Sensory enrichment may be visual, auditory, tactile, olfactory, or taste oriented. Nutritional enrichment includes variations in appearance, type, and frequency of diet, and treats, novelty, and foraging. Two phases of the preadult period deserve special enrichment considerations: the development of autonomy and puberty. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Differential Effect of Active Smoking on Gene Expression in Male and Female Smokers

    Science.gov (United States)

    Paul, Sunirmal; Amundson, Sally A

    2015-01-01

    Smoking is the second leading cause of preventable death in the United States. Cohort epidemiological studies have demonstrated that women are more vulnerable to cigarette-smoking induced diseases than their male counterparts, however, the molecular basis of these differences has remained unknown. In this study, we explored if there were differences in the gene expression patterns between male and female smokers, and how these patterns might reflect different sex-specific responses to the stress of smoking. Using whole genome microarray gene expression profiling, we found that a substantial number of oxidant related genes were expressed in both male and female smokers, however, smoking-responsive genes did indeed differ greatly between male and female smokers. Gene set enrichment analysis (GSEA) against reference oncogenic signature gene sets identified a large number of oncogenic pathway gene-sets that were significantly altered in female smokers compared to male smokers. In addition, functional annotation with Ingenuity Pathway Analysis (IPA) identified smoking-correlated genes associated with biological functions in male and female smokers that are directly relevant to well-known smoking related pathologies. However, these relevant biological functions were strikingly overrepresented in female smokers compared to male smokers. IPA network analysis with the functional categories of immune and inflammatory response gene products suggested potential interactions between smoking response and female hormones. Our results demonstrate a striking dichotomy between male and female gene expression responses to smoking. This is the first genome-wide expression study to compare the sex-specific impacts of smoking at a molecular level and suggests a novel potential connection between sex hormone signaling and smoking-induced diseases in female smokers. PMID:25621181

  1. Uranium enrichment

    International Nuclear Information System (INIS)

    1989-01-01

    GAO was asked to address several questions concerning a number of proposed uranium enrichment bills introduced during the 100th Congress. The bill would have restructured the Department of Energy's uranium enrichment program as a government corporation to allow it to compete more effectively in the domestic and international markets. Some of GAO's findings discussed are: uranium market experts believe and existing market models show that the proposed DOE purchase of a $750 million of uranium from domestic producers may not significantly increase production because of large producer-held inventories; excess uranium enrichment production capacity exists throughout the world; therefore, foreign producers are expected to compete heavily in the United States throughout the 1990s as utilities' contracts with DOE expire; and according to a 1988 agreement between DOE's Offices of Nuclear Energy and Defense Programs, enrichment decommissioning costs, estimated to total $3.6 billion for planning purposes, will be shared by the commercial enrichment program and the government

  2. Isotope enrichment

    International Nuclear Information System (INIS)

    Lydtin, H-J.; Wilden, R.J.; Severin, P.J.W.

    1978-01-01

    The isotope enrichment method described is based on the recognition that, owing to mass diffusion and thermal diffusion in the conversion of substances at a heated substrate while depositing an element or compound onto the substrate, enrichment of the element, or a compound of the element, with a lighter isotope will occur. The cycle is repeated for as many times as is necessary to obtain the degree of enrichment required

  3. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  4. Network Expansion and Pathway Enrichment Analysis towards Biologically Significant Findings from Microarrays

    Directory of Open Access Journals (Sweden)

    Wu Xiaogang

    2012-06-01

    Full Text Available In many cases, crucial genes show relatively slight changes between groups of samples (e.g. normal vs. disease, and many genes selected from microarray differential analysis by measuring the expression level statistically are also poorly annotated and lack of biological significance. In this paper, we present an innovative approach - network expansion and pathway enrichment analysis (NEPEA for integrative microarray analysis. We assume that organized knowledge will help microarray data analysis in significant ways, and the organized knowledge could be represented as molecular interaction networks or biological pathways. Based on this hypothesis, we develop the NEPEA framework based on network expansion from the human annotated and predicted protein interaction (HAPPI database, and pathway enrichment from the human pathway database (HPD. We use a recently-published microarray dataset (GSE24215 related to insulin resistance and type 2 diabetes (T2D as case study, since this study provided a thorough experimental validation for both genes and pathways identified computationally from classical microarray analysis and pathway analysis. We perform our NEPEA analysis for this dataset based on the results from the classical microarray analysis to identify biologically significant genes and pathways. Our findings are not only consistent with the original findings mostly, but also obtained more supports from other literatures.

  5. Genome-wide methylation analysis identifies a core set of hypermethylated genes in CIMP-H colorectal cancer.

    Science.gov (United States)

    McInnes, Tyler; Zou, Donghui; Rao, Dasari S; Munro, Francesca M; Phillips, Vicky L; McCall, John L; Black, Michael A; Reeve, Anthony E; Guilford, Parry J

    2017-03-28

    Aberrant DNA methylation profiles are a characteristic of all known cancer types, epitomized by the CpG island methylator phenotype (CIMP) in colorectal cancer (CRC). Hypermethylation has been observed at CpG islands throughout the genome, but it is unclear which factors determine whether an individual island becomes methylated in cancer. DNA methylation in CRC was analysed using the Illumina HumanMethylation450K array. Differentially methylated loci were identified using Significance Analysis of Microarrays (SAM) and the Wilcoxon Signed Rank (WSR) test. Unsupervised hierarchical clustering was used to identify methylation subtypes in CRC. In this study we characterized the DNA methylation profiles of 94 CRC tissues and their matched normal counterparts. Consistent with previous studies, unsupervized hierarchical clustering of genome-wide methylation data identified three subtypes within the tumour samples, designated CIMP-H, CIMP-L and CIMP-N, that showed high, low and very low methylation levels, respectively. Differential methylation between normal and tumour samples was analysed at the individual CpG level, and at the gene level. The distribution of hypermethylation in CIMP-N tumours showed high inter-tumour variability and appeared to be highly stochastic in nature, whereas CIMP-H tumours exhibited consistent hypermethylation at a subset of genes, in addition to a highly variable background of hypermethylated genes. EYA4, TFPI2 and TLX1 were hypermethylated in more than 90% of all tumours examined. One-hundred thirty-two genes were hypermethylated in 100% of CIMP-H tumours studied and these were highly enriched for functions relating to skeletal system development (Bonferroni adjusted p value =2.88E-15), segment specification (adjusted p value =9.62E-11), embryonic development (adjusted p value =1.52E-04), mesoderm development (adjusted p value =1.14E-20), and ectoderm development (adjusted p value =7.94E-16). Our genome-wide characterization of DNA

  6. Identification and Characterization of Renal Cell Carcinoma Gene Markers

    Directory of Open Access Journals (Sweden)

    Louis S. Liou

    2007-01-01

    Full Text Available Microarray gene expression profiling has been used to distinguish histological subtypes of renal cell carcinoma (RCC, and consequently to identify specific tumor markers. The analytical procedures currently in use find sets of genes whose average differential expression across the two categories differ significantly. In general each of the markers thus identifi ed does not distinguish tumor from normal with 100% accuracy, although the group as a whole might be able to do so. For the purpose of developing a widely used economically viable diagnostic signature, however, large groups of genes are not likely to be useful. Here we use two different methods, one a support vector machine variant, and the other an exhaustive search, to reanalyze data previously generated in our Lab (Lenburg et al. 2003. We identify 158 genes, each having an expression level that is higher (lower in every tumor sample than in any normal sample, and each having a minimum differential expression across the two categorie at a signifi cance of 0.01. The set is highly enriched in cancer related genes (p = 1.6 × 10 – 12, containing 43 genes previously associated with either RCC or other types of cancer. Many of the biomarkers appear to be associated with the central alterations known to be required for cancer transformation. These include the oncogenes JAZF1, AXL, ABL2; tumor suppressors RASD1, PTPRO, TFAP2A, CDKN1C; and genes involved in proteolysis or cell-adhesion such as WASF2, and PAPPA.

  7. EURODIF: An enrichment plant for the present and beyond the year 2000

    International Nuclear Information System (INIS)

    Petit, J.F.; Barre, J.Y.

    1989-01-01

    EURODIF's George Besse uranium enrichment plant, which uses the gaseous diffusion process, was set up in France with European partners. It has the annual capacity to supply sufficient enriched uranium for 100 light water reactors of 900 MWe. The plant has been running for the last 10 years and its output is set to satisfy the market for enrichment, while making best use of the seasonal availability of electrical energy supplied by the EdF. In 80,000 hours of operation the plant has proved itself entirely satisfactory in terms of reliability, availability, safety and efficiency. From this, it can be predicted that, on the basis of current production, output can be maintained to beyond the year 2000. The improvement programme being undertaken at present will increase performance and flexibility and make the plant more competitive as it enters the market of the next decade

  8. Sexually Dimorphic Gene Expression Associated with Growth and Reproduction of Tongue Sole (Cynoglossus semilaevis) Revealed by Brain Transcriptome Analysis.

    Science.gov (United States)

    Wang, Pingping; Zheng, Min; Liu, Jian; Liu, Yongzhuang; Lu, Jianguo; Sun, Xiaowen

    2016-08-26

    In this study, we performed a comprehensive analysis of the transcriptome of one- and two-year-old male and female brains of Cynoglossus semilaevis by high-throughput Illumina sequencing. A total of 77,066 transcripts, corresponding to 21,475 unigenes, were obtained with a N50 value of 4349 bp. Of these unigenes, 33 genes were found to have significant differential expression and potentially associated with growth, from which 18 genes were down-regulated and 12 genes were up-regulated in two-year-old males, most of these genes had no significant differences in expression among one-year-old males and females and two-year-old females. A similar analysis was conducted to look for genes associated with reproduction; 25 genes were identified, among them, five genes were found to be down regulated and 20 genes up regulated in two-year-old males, again, most of the genes had no significant expression differences among the other three. The performance of up regulated genes in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was significantly different between two-year-old males and females. Males had a high gene expression in genetic information processing, while female's highly expressed genes were mainly enriched on organismal systems. Our work identified a set of sex-biased genes potentially associated with growth and reproduction that might be the candidate factors affecting sexual dimorphism of tongue sole, laying the foundation to understand the complex process of sex determination of this economic valuable species.

  9. Sexually Dimorphic Gene Expression Associated with Growth and Reproduction of Tongue Sole (Cynoglossus semilaevis Revealed by Brain Transcriptome Analysis

    Directory of Open Access Journals (Sweden)

    Pingping Wang

    2016-08-01

    Full Text Available In this study, we performed a comprehensive analysis of the transcriptome of one- and two-year-old male and female brains of Cynoglossus semilaevis by high-throughput Illumina sequencing. A total of 77,066 transcripts, corresponding to 21,475 unigenes, were obtained with a N50 value of 4349 bp. Of these unigenes, 33 genes were found to have significant differential expression and potentially associated with growth, from which 18 genes were down-regulated and 12 genes were up-regulated in two-year-old males, most of these genes had no significant differences in expression among one-year-old males and females and two-year-old females. A similar analysis was conducted to look for genes associated with reproduction; 25 genes were identified, among them, five genes were found to be down regulated and 20 genes up regulated in two-year-old males, again, most of the genes had no significant expression differences among the other three. The performance of up regulated genes in Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG pathway enrichment analysis was significantly different between two-year-old males and females. Males had a high gene expression in genetic information processing, while female’s highly expressed genes were mainly enriched on organismal systems. Our work identified a set of sex-biased genes potentially associated with growth and reproduction that might be the candidate factors affecting sexual dimorphism of tongue sole, laying the foundation to understand the complex process of sex determination of this economic valuable species.

  10. Identification and Validation of a New Set of Five Genes for Prediction of Risk in Early Breast Cancer

    Directory of Open Access Journals (Sweden)

    Giorgio Mustacchi

    2013-05-01

    Full Text Available Molecular tests predicting the outcome of breast cancer patients based on gene expression levels can be used to assist in making treatment decisions after consideration of conventional markers. In this study we identified a subset of 20 mRNA differentially regulated in breast cancer analyzing several publicly available array gene expression data using R/Bioconductor package. Using RTqPCR we evaluate 261 consecutive invasive breast cancer cases not selected for age, adjuvant treatment, nodal and estrogen receptor status from paraffin embedded sections. The biological samples dataset was split into a training (137 cases and a validation set (124 cases. The gene signature was developed on the training set and a multivariate stepwise Cox analysis selected five genes independently associated with DFS: FGF18 (HR = 1.13, p = 0.05, BCL2 (HR = 0.57, p = 0.001, PRC1 (HR = 1.51, p = 0.001, MMP9 (HR = 1.11, p = 0.08, SERF1a (HR = 0.83, p = 0.007. These five genes were combined into a linear score (signature weighted according to the coefficients of the Cox model, as: 0.125FGF18 − 0.560BCL2 + 0.409PRC1 + 0.104MMP9 − 0.188SERF1A (HR = 2.7, 95% CI = 1.9–4.0, p < 0.001. The signature was then evaluated on the validation set assessing the discrimination ability by a Kaplan Meier analysis, using the same cut offs classifying patients at low, intermediate or high risk of disease relapse as defined on the training set (p < 0.001. Our signature, after a further clinical validation, could be proposed as prognostic signature for disease free survival in breast cancer patients where the indication for adjuvant chemotherapy added to endocrine treatment is uncertain.

  11. Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies

    Science.gov (United States)

    Medina, Ignacio; Montaner, David; Bonifaci, Nuria; Pujana, Miguel Angel; Carbonell, José; Tarraga, Joaquin; Al-Shahrour, Fatima; Dopazo, Joaquin

    2009-01-01

    Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/ PMID:19502494

  12. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

    Science.gov (United States)

    Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano

    2013-01-01

    The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. Different

  13. Recruitment of PfSET2 by RNA polymerase II to variant antigen encoding loci contributes to antigenic variation in P. falciparum.

    Directory of Open Access Journals (Sweden)

    Uchechi E Ukaegbu

    2014-01-01

    Full Text Available Histone modifications are important regulators of gene expression in all eukaryotes. In Plasmodium falciparum, these epigenetic marks regulate expression of genes involved in several aspects of host-parasite interactions, including antigenic variation. While the identities and genomic positions of many histone modifications have now been cataloged, how they are targeted to defined genomic regions remains poorly understood. For example, how variant antigen encoding loci (var are targeted for deposition of unique histone marks is a mystery that continues to perplex the field. Here we describe the recruitment of an ortholog of the histone modifier SET2 to var genes through direct interactions with the C-terminal domain (CTD of RNA polymerase II. In higher eukaryotes, SET2 is a histone methyltransferase recruited by RNA pol II during mRNA transcription; however, the ortholog in P. falciparum (PfSET2 has an atypical architecture and its role in regulating transcription is unknown. Here we show that PfSET2 binds to the unphosphorylated form of the CTD, a property inconsistent with its recruitment during mRNA synthesis. Further, we show that H3K36me3, the epigenetic mark deposited by PfSET2, is enriched at both active and silent var gene loci, providing additional evidence that its recruitment is not associated with mRNA production. Over-expression of a dominant negative form of PfSET2 designed to disrupt binding to RNA pol II induced rapid var gene expression switching, confirming both the importance of PfSET2 in var gene regulation and a role for RNA pol II in its recruitment. RNA pol II is known to transcribe non-coding RNAs from both active and silent var genes, providing a possible mechanism by which it could recruit PfSET2 to var loci. This work unifies previous reports of histone modifications, the production of ncRNAs, and the promoter activity of var introns into a mechanism that contributes to antigenic variation by malaria parasites.

  14. The leukemia-specific fusion gene ETV6/RUNX1 perturbs distinct key biological functions primarily by gene repression.

    Directory of Open Access Journals (Sweden)

    Gerhard Fuka

    Full Text Available BACKGROUND: ETV6/RUNX1 (E/R (also known as TEL/AML1 is the most frequent gene fusion in childhood acute lymphoblastic leukemia (ALL and also most likely the crucial factor for disease initiation; its role in leukemia propagation and maintenance, however, remains largely elusive. To address this issue we performed a shRNA-mediated knock-down (KD of the E/R fusion gene and investigated the ensuing consequences on genome-wide gene expression patterns and deducible regulatory functions in two E/R-positive leukemic cell lines. FINDINGS: Microarray analyses identified 777 genes whose expression was substantially altered. Although approximately equal proportions were either up- (KD-UP or down-regulated (KD-DOWN, the effects on biological processes and pathways differed considerably. The E/R KD-UP set was significantly enriched for genes included in the "cell activation", "immune response", "apoptosis", "signal transduction" and "development and differentiation" categories, whereas in the E/R KD-DOWN set only the "PI3K/AKT/mTOR signaling" and "hematopoietic stem cells" categories became evident. Comparable expression signatures obtained from primary E/R-positive ALL samples underline the relevance of these pathways and molecular functions. We also validated six differentially expressed genes representing the categories "stem cell properties", "B-cell differentiation", "immune response", "cell adhesion" and "DNA damage" with RT-qPCR. CONCLUSION: Our analyses provide the first preliminary evidence that the continuous expression of the E/R fusion gene interferes with key regulatory functions that shape the biology of this leukemia subtype. E/R may thus indeed constitute the essential driving force for the propagation and maintenance of the leukemic process irrespective of potential consequences of associated secondary changes. Finally, these findings may also provide a valuable source of potentially attractive therapeutic targets.

  15. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Full Text Available Abstract Background Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs. Results A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 106 clones and average insert size of the clones was 2.1 kb. Totally 12,084 ESTs were sequenced to average high quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes which did not show similarity with any genes in the public database may encode for unique genes. Functional classification revealed unigenes related to broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding

  16. Chronic ethanol exposure produces time- and brain region-dependent changes in gene coexpression networks.

    Directory of Open Access Journals (Sweden)

    Elizabeth A Osterndorff-Kahanek

    Full Text Available Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY, nucleus accumbens (NAC, prefrontal cortex (PFC, and liver after four weekly cycles of chronic intermittent ethanol (CIE vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000 at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600. Within each region, there was little gene overlap across time (~20%. All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global 'rewiring' of coexpression systems involving glial and immune signaling as well as neuronal genes.

  17. Microbial Community Response of an Organohalide Respiring Enrichment Culture to Permanganate Oxidation.

    Science.gov (United States)

    Sutton, Nora B; Atashgahi, Siavash; Saccenti, Edoardo; Grotenhuis, Tim; Smidt, Hauke; Rijnaarts, Huub H M

    2015-01-01

    While in situ chemical oxidation is often used to remediate tetrachloroethene (PCE) contaminated locations, very little is known about its influence on microbial composition and organohalide respiration (OHR) activity. Here, we investigate the impact of oxidation with permanganate on OHR rates, the abundance of organohalide respiring bacteria (OHRB) and reductive dehalogenase (rdh) genes using quantitative PCR, and microbial community composition through sequencing of 16S rRNA genes. A PCE degrading enrichment was repeatedly treated with low (25 μmol), medium (50 μmol), or high (100 μmol) permanganate doses, or no oxidant treatment (biotic control). Low and medium treatments led to higher OHR rates and enrichment of several OHRB and rdh genes, as compared to the biotic control. Improved degradation rates can be attributed to enrichment of (1) OHRB able to also utilize Mn oxides as a terminal electron acceptor and (2) non-dechlorinating community members of the Clostridiales and Deltaproteobacteria possibly supporting OHRB by providing essential co-factors. In contrast, high permanganate treatment disrupted dechlorination beyond cis-dichloroethene and caused at least a 2-4 orders of magnitude reduction in the abundance of all measured OHRB and rdh genes, as compared to the biotic control. High permanganate treatments resulted in a notably divergent microbial community, with increased abundances of organisms affiliated with Campylobacterales and Oceanospirillales capable of dissimilatory Mn reduction, and decreased abundance of presumed supporters of OHRB. Although OTUs classified within the OHR-supportive order Clostridiales and OHRB increased in abundance over the course of 213 days following the final 100 μmol permanganate treatment, only limited regeneration of PCE dechlorination was observed in one of three microcosms, suggesting strong chemical oxidation treatments can irreversibly disrupt OHR. Overall, this detailed investigation into dose

  18. Construction and evaluation of normalized cDNA libraries enriched with full-length sequences for rapid discovery of new genes from Sisal (Agave sisalana Perr.) different developmental stages.

    Science.gov (United States)

    Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

    2012-10-12

    To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.

  19. The mitochondrial genomes of Atlas Geckos (Quedenfeldtia): mitogenome assembly from transcriptomes and anchored hybrid enrichment datasets

    OpenAIRE

    Lyra, Mariana L.; Joger, Ulrich; Schulte, Ulrich; Slimani, Tahar; El Mouden, El Hassan; Bouazza, Abdellah; Künzel, Sven; Lemmon, Alan R.; Moriarty Lemmon, Emily; Vences, Miguel

    2017-01-01

    The nearly complete mitogenomes of the two species of North African Atlas geckos, Quedenfeldtia moerens and Q. trachyblepharus were assembled from anchored hybrid enrichment data and RNAseq data. Congruent assemblies were obtained for four samples included in both datasets. We recovered the 13 protein-coding genes, 22 tRNA genes, and two rRNA genes for both species, including partial control region. The order of genes agrees with that of other geckos.

  20. Global gene expression in muscle from fasted/refed trout reveals up-regulation of genes promoting myofibre hypertrophy but not myofibre production.

    Science.gov (United States)

    Rescan, Pierre-Yves; Le Cam, Aurelie; Rallière, Cécile; Montfort, Jérôme

    2017-06-07

    Compensatory growth is a phase of rapid growth, greater than the growth rate of control animals, that occurs after a period of growth-stunting conditions. Fish show a capacity for compensatory growth after alleviation of dietary restriction, but the underlying cellular mechanisms are unknown. To learn more about the contribution of genes regulating hypertrophy (an increase in muscle fibre size) and hyperplasia (the generation of new muscle fibres) in the compensatory muscle growth response in fish, we used high-density microarray analysis to investigate the global gene expression in muscle of trout during a fasting-refeeding schedule and in muscle of control-fed trout displaying normal growth. The compensatory muscle growth signature, as defined by genes up-regulated in muscles of refed trout compared with control-fed trout, showed enrichment in functional categories related to protein biosynthesis and maturation, such as RNA processing, ribonucleoprotein complex biogenesis, ribosome biogenesis, translation and protein folding. This signature was also enriched in chromatin-remodelling factors of the protein arginine N-methyl transferase family. Unexpectedly, functional categories related to cell division and DNA replication were not inferred from the molecular signature of compensatory muscle growth, and this signature contained virtually none of the genes previously reported to be up-regulated in hyperplastic growth zones of the late trout embryo myotome and to potentially be involved in production of new myofibres, notably genes encoding myogenic regulatory factors, transmembrane receptors essential for myoblast fusion or myofibrillar proteins predominant in nascent myofibres. Genes promoting myofibre growth, but not myofibre formation, were up-regulated in muscles of refed trout compared with continually fed trout. This suggests that a compensatory muscle growth response, resulting from the stimulation of hypertrophy but not the stimulation of hyperplasia

  1. Gene expression profiling of prostate tissue identifies chromatin regulation as a potential link between obesity and lethal prostate cancer.

    Science.gov (United States)

    Ebot, Ericka M; Gerke, Travis; Labbé, David P; Sinnott, Jennifer A; Zadra, Giorgia; Rider, Jennifer R; Tyekucheva, Svitlana; Wilson, Kathryn M; Kelly, Rachel S; Shui, Irene M; Loda, Massimo; Kantoff, Philip W; Finn, Stephen; Vander Heiden, Matthew G; Brown, Myles; Giovannucci, Edward L; Mucci, Lorelei A

    2017-11-01

    Obese men are at higher risk of advanced prostate cancer and cancer-specific mortality; however, the biology underlying this association remains unclear. This study examined gene expression profiles of prostate tissue to identify biological processes differentially expressed by obesity status and lethal prostate cancer. Gene expression profiling was performed on tumor (n = 402) and adjacent normal (n = 200) prostate tissue from participants in 2 prospective cohorts who had been diagnosed with prostate cancer from 1982 to 2005. Body mass index (BMI) was calculated from the questionnaire immediately preceding cancer diagnosis. Men were followed for metastases or prostate cancer-specific death (lethal disease) through 2011. Gene Ontology biological processes differentially expressed by BMI were identified using gene set enrichment analysis. Pathway scores were computed by averaging the signal intensities of member genes. Odds ratios (ORs) for lethal prostate cancer were estimated with logistic regression. Among 402 men, 48% were healthy weight, 31% were overweight, and 21% were very overweight/obese. Fifteen gene sets were enriched in tumor tissue, but not normal tissue, of very overweight/obese men versus healthy-weight men; 5 of these were related to chromatin modification and remodeling (false-discovery rate 7, 41% vs 17%; P = 2 × 10 -4 ) and an increased risk of lethal disease that was independent of grade and stage (OR, 5.26; 95% confidence interval, 2.37-12.25). This study improves our understanding of the biology of aggressive prostate cancer and identifies a potential mechanistic link between obesity and prostate cancer death that warrants further study. Cancer 2017;123:4130-4138. © 2017 American Cancer Society. © 2017 American Cancer Society.

  2. Bridging cancer biology with the clinic: relative expression of a GRHL2-mediated gene-set pair predicts breast cancer metastasis.

    Directory of Open Access Journals (Sweden)

    Xinan Yang

    Full Text Available Identification and characterization of crucial gene target(s that will allow focused therapeutics development remains a challenge. We have interrogated the putative therapeutic targets associated with the transcription factor Grainy head-like 2 (GRHL2, a critical epithelial regulatory factor. We demonstrate the possibility to define the molecular functions of critical genes in terms of their personalized expression profiles, allowing appropriate functional conclusions to be derived. A novel methodology, relative expression analysis with gene-set pairs (RXA-GSP, is designed to explore the potential clinical utility of cancer-biology discovery. Observing that Grhl2-overexpression leads to increased metastatic potential in vitro, we established a model assuming Grhl2-induced or -inhibited genes confer poor or favorable prognosis respectively for cancer metastasis. Training on public gene expression profiles of 995 breast cancer patients, this method prioritized one gene-set pair (GRHL2, CDH2, FN1, CITED2, MKI67 versus CTNNB1 and CTNNA3 from all 2717 possible gene-set pairs (GSPs. The identified GSP significantly dichotomized 295 independent patients for metastasis-free survival (log-rank tested p = 0.002; severe empirical p = 0.035. It also showed evidence of clinical prognostication in another independent 388 patients collected from three studies (log-rank tested p = 3.3e-6. This GSP is independent of most traditional prognostic indicators, and is only significantly associated with the histological grade of breast cancer (p = 0.0017, a GRHL2-associated clinical character (p = 6.8e-6, Spearman correlation, suggesting that this GSP is reflective of GRHL2-mediated events. Furthermore, a literature review indicates the therapeutic potential of the identified genes. This research demonstrates a novel strategy to integrate both biological experiments and clinical gene expression profiles for extracting and elucidating the genomic

  3. Enrichment and physiological characterization of an anaerobic ammonium-oxidizing bacterium ‘ Candidatus Brocadia sapporoensis’

    KAUST Repository

    Narita, Yuko; Zhang, Lei; Kimura, Zen-ichiro; Ali, Muhammad; Fujii, Takao; Okabe, Satoshi

    2017-01-01

    Anaerobic ammonium-oxidation (anammox) is recognized as an important microbial process in the global nitrogen cycle and wastewater treatment. In this study, we successfully enriched a novel anammox bacterium affiliated with the genus ‘Candidatus Brocadia’ with high purity (>90%) in a membrane bioreactor (MBR). The enriched bacterium was distantly related to the hitherto characterized ‘Ca. Brocadia fulgida’ and ‘Ca. Brocadia sinica’ with 96% and 93% of 16S ribosomal RNA gene sequence identity, respectively. The bacterium exhibited the common structural features of anammox bacteria and the production of hydrazine in the presence of hydroxylamine under anoxic conditions. The temperature range of anammox activity was 20 − 45°C with a maximum activity at 37°C. The maximum specific growth rate (μmax) was determined to be 0.0082h−1 at 37°C, corresponding to a doubling time of 3.5 days. The half-saturation constant (KS) for nitrite was 5±2.5μM. The anammox activity was inhibited by nitrite with 11.6mM representing the 50% inhibitory concentration (IC50) but no significant inhibition was observed in the presence of formate and acetate. The major respiratory quinone was identified to be menaquinone-7 (MK-7). Comparative genome analysis revealed that the anammox bacterium enriched in present study shared nearly half of genes with ‘Ca. Brocadia sinica’ and ‘Ca. Brocadia fulgida’. The bacterium enriched in this study showed all known physiological characteristics of anammox bacteria and can be distinguished from the close relatives by its rRNA gene sequences. Therefore, we proposed the name ‘Ca. Brocadia sapporoensis’ sp. nov.

  4. Enrichment and physiological characterization of an anaerobic ammonium-oxidizing bacterium ‘ Candidatus Brocadia sapporoensis’

    KAUST Repository

    Narita, Yuko

    2017-08-18

    Anaerobic ammonium-oxidation (anammox) is recognized as an important microbial process in the global nitrogen cycle and wastewater treatment. In this study, we successfully enriched a novel anammox bacterium affiliated with the genus ‘Candidatus Brocadia’ with high purity (>90%) in a membrane bioreactor (MBR). The enriched bacterium was distantly related to the hitherto characterized ‘Ca. Brocadia fulgida’ and ‘Ca. Brocadia sinica’ with 96% and 93% of 16S ribosomal RNA gene sequence identity, respectively. The bacterium exhibited the common structural features of anammox bacteria and the production of hydrazine in the presence of hydroxylamine under anoxic conditions. The temperature range of anammox activity was 20 − 45°C with a maximum activity at 37°C. The maximum specific growth rate (μmax) was determined to be 0.0082h−1 at 37°C, corresponding to a doubling time of 3.5 days. The half-saturation constant (KS) for nitrite was 5±2.5μM. The anammox activity was inhibited by nitrite with 11.6mM representing the 50% inhibitory concentration (IC50) but no significant inhibition was observed in the presence of formate and acetate. The major respiratory quinone was identified to be menaquinone-7 (MK-7). Comparative genome analysis revealed that the anammox bacterium enriched in present study shared nearly half of genes with ‘Ca. Brocadia sinica’ and ‘Ca. Brocadia fulgida’. The bacterium enriched in this study showed all known physiological characteristics of anammox bacteria and can be distinguished from the close relatives by its rRNA gene sequences. Therefore, we proposed the name ‘Ca. Brocadia sapporoensis’ sp. nov.

  5. DOE hands over uranium enrichment duties to government corporation

    International Nuclear Information System (INIS)

    Simpson, J.

    1993-01-01

    In an effort to renew the United States' competitiveness in the world market for uranium enrichment services, the Department of Energy (DOE) is turning over control of its Paducah, KY, and Portsmouth, OH, enrichment facilities to a for-profit organization, the United States Enrichment Corp. (USEC), which was created by last year's Energy Policy Act. William H. Timbers, Jr., a former investment banker who was appointed acting CEO in March, said the Act's mandate will mean more competitive prices for enriched reactor fuel and greater responsiveness to utility customers. As a government corporation, USEC, with current annual revenues estimated at $1.5 billion, will no longer be part of the federal budget appropriations process, but will use business management techniques, set market-based prices for enriched uranium, and pay annual dividends to the US Treasury-its sole stockholder-from earnings. The goal is to finish privatizing the corporation within two years, and to sell its stock to investors for an estimated $1 to $3 billion. USEC's success will depend in part on developing short- and long-term marketing plants to help stanch the flow of enriched-uranium customers to foreign suppliers. (DOE already has received notice from a number of US utilities that they want to be let out of their long-term enrichment contracts as they expire over the next several years).USEC's plans likely will include exploring new joint ventures with other businesses in the nuclear fuel cycle-such as suppliers, fabricators, and converters-and offering a broader range of enrichment services than DOE provided. The corporation will have to be responsive to utilities on an individual basis

  6. DNA enrichment approaches to identify unauthorized genetically modified organisms (GMOs).

    Science.gov (United States)

    Arulandhu, Alfred J; van Dijk, Jeroen P; Dobnik, David; Holst-Jensen, Arne; Shi, Jianxin; Zel, Jana; Kok, Esther J

    2016-07-01

    With the increased global production of different genetically modified (GM) plant varieties, chances increase that unauthorized GM organisms (UGMOs) may enter the food chain. At the same time, the detection of UGMOs is a challenging task because of the limited sequence information that will generally be available. PCR-based methods are available to detect and quantify known UGMOs in specific cases. If this approach is not feasible, DNA enrichment of the unknown adjacent sequences of known GMO elements is one way to detect the presence of UGMOs in a food or feed product. These enrichment approaches are also known as chromosome walking or gene walking (GW). In recent years, enrichment approaches have been coupled with next generation sequencing (NGS) analysis and implemented in, amongst others, the medical and microbiological fields. The present review will provide an overview of these approaches and an evaluation of their applicability in the identification of UGMOs in complex food or feed samples.

  7. Pectinmethylesterases (PME) and pectinmethylesterase inhibitors (PMEI) enriched during phloem fiber development in flax (Linum usitatissimum).

    Science.gov (United States)

    Pinzon-Latorre, David; Deyholos, Michael K

    2014-01-01

    Flax phloem fibers achieve their length by intrusive-diffusive growth, which requires them to penetrate the extracellular matrix of adjacent cells. Fiber elongation therefore involves extensive remodelling of cell walls and middle lamellae, including modifying the degree and pattern of methylesterification of galacturonic acid (GalA) residues of pectin. Pectin methylesterases (PME) are important enzymes for fiber elongation as they mediate the demethylesterification of GalA in muro, in either a block-wise fashion or in a random fashion. Our objective was to identify PMEs and PMEIs that mediate phloem fiber elongation in flax. For this purpose, we measured transcript abundance of candidate genes at nine different stages of stem and fiber development and found sets of genes enriched during fiber elongation and maturation as well as during xylem development. We expressed one of the flax PMEIs in E. coli and demonstrated that it was able to inhibit most of the native PME activity in the upper portion of the flax stem. These results identify key genetic components of the intrusive growth process and define targets for fiber engineering and crop improvement.

  8. Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

    International Nuclear Information System (INIS)

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2008-01-01

    Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may

  9. Enriching Discovery Layers: A Product Comparison of Content Enrichment Services Syndetic Solutions and Content Café 2

    Directory of Open Access Journals (Sweden)

    Allison DaSilva

    2014-10-01

    Full Text Available A comparative analysis of content enrichment services, Syndetic Solutions and Content Café 2, was undertaken to explore which service would provide public library users with a superior online search and discovery experience through the enriched data elements offered, specifically looking at the cover image data element and exploring what factors impact the display of said element. A data-set of 250 items in five different formats, including books, CDs, DVDs, e-books, and video games, was searched in four North American public libraries’ discovery layers to compare the integration, extent, and quality of the cover image data element supplied by Syndetic Solutions and Content Café 2. Based on an analysis of the URLs, ISBNs, and UPCs for each of the 250 items, it was determined that the integration, and therefore the display, of the cover image data element was impacted by: (1 whether or not an ISBN or UPC was listed in the MARC bibliographic record; (2 which ISBN or UPC was listed, as items could potentially have more than one; (3 the inclusion of both ISBNs and UPCs in the record and the settings of the discovery tool; (4 the order in which the ISBNs or UPCs were listed in the record; and (5 whether or not Syndetic Solutions or Content Café 2 had the image data in its database at the time of the search. The quality of the cover image displayed was found to be impacted by the size requested by the library and the size of the image provided by the publisher. These findings may also have implications for the integration of other enriched data elements.

  10. Effects of Enrichment Presentation and Other Factors on Behavioral Welfare of Pantropical Spotted Dolphin (Stenella attenuata).

    Science.gov (United States)

    Perez, Barbara C; Mehrkam, Lindsay R; Foltz, Amanda R; Dorey, Nicole R

    2018-01-01

    Environmental enrichment is a crucial element of promoting welfare for animals in captivity. However, enrichment programs are not always formally evaluated for their efficacy. Furthermore, there is little empirical evidence of enrichment evaluation for species of small cetaceans in zoological settings. A wide range of variables may potentially influence enrichment efficacy and how it in turn affects behavior. The purpose of this study was to determine the most preferred environmental enrichment, and method of presentation, for a species that has not been well studied in captivity, the pantropical spotted dolphin (Stenella attenuata). In order to determine which enrichment items and method of presentation were most effective at eliciting enrichment interaction, we systematically examined how several variables of enrichment influenced enrichment interaction. The results suggested that presenting enrichment after training sessions influenced interaction with the enrichment. The results also indicated preference for enrichment type and a specific enrichment device. Finally, factors that influenced interaction were also found to influence aberrant behavior. The results support the premise that enrichment be "redefined" for each species and each individual.

  11. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    Directory of Open Access Journals (Sweden)

    Bendahmane Abdelhafid

    2011-05-01

    Full Text Available Abstract Background Melon (Cucumis melo, an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs and 3,073 single nucleotide polymorphisms (SNPs in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but

  12. Uranium enrichment plans

    International Nuclear Information System (INIS)

    Thomas, D.C.; Gagne, R.W.

    1978-01-01

    The following topics are covered: the status of the Government's existing uranium enrichment services contracts, natural uranium requirements based on the latest contract information, uncertainty in predicting natural uranium requirements based on uranium enrichment contracts, and domestic and foreign demand assumed in enrichment planning

  13. Comparative genomic analysis of Brucella abortus vaccine strain 104M reveals a set of candidate genes associated with its virulence attenuation.

    Science.gov (United States)

    Yu, Dong; Hui, Yiming; Zai, Xiaodong; Xu, Junjie; Liang, Long; Wang, Bingxiang; Yue, Junjie; Li, Shanhu

    2015-01-01

    The Brucella abortus strain 104M, a spontaneously attenuated strain, has been used as a vaccine strain in humans against brucellosis for 6 decades in China. Despite many studies, the molecular mechanisms that cause the attenuation are still unclear. Here, we determined the whole-genome sequence of 104M and conducted a comprehensive comparative analysis against the whole genome sequences of the virulent strain, A13334, and other reference strains. This analysis revealed a highly similar genome structure between 104M and A13334. The further comparative genomic analysis between 104M and A13334 revealed a set of genes missing in 104M. Some of these genes were identified to be directly or indirectly associated with virulence. Similarly, a set of mutations in the virulence-related genes was also identified, which may be related to virulence alteration. This study provides a set of candidate genes associated with virulence attenuation in B.abortus vaccine strain 104M.

  14. An improved cell recovery method for iron oxidizing bacterial (IOB) enrichments

    DEFF Research Database (Denmark)

    Yu, Ran; Graf, Joerg; Smets, Barth F.

    2008-01-01

    Two cell recovery methods for IOB enrichments were evaluated for DNA extraction and further PCR-based 16S rRNA gene clone library creation. One was a published method consisting of heating plus oxalic acid treatment and the other one was a new method based on enzymatic agarose digestion (using β...

  15. Enrichment and purification process of astragalosides and their anti ...

    African Journals Online (AJOL)

    2014-03-01

    Mar 1, 2014 ... water until there was no smell of ethanol and set aside. Enrichment and .... Detection of the effect of astragelosides on cell proliferation by. MTT assay6-7 ... relatively large leakage starting from 10 BV; mass con- centration of ...

  16. Alteration of gene expression by alcohol exposure at early neurulation.

    Science.gov (United States)

    Zhou, Feng C; Zhao, Qianqian; Liu, Yunlong; Goodlett, Charles R; Liang, Tiebing; McClintick, Jeanette N; Edenberg, Howard J; Li, Lang

    2011-02-21

    We have previously demonstrated that alcohol exposure at early neurulation induces growth retardation, neural tube abnormalities, and alteration of DNA methylation. To explore the global gene expression changes which may underline these developmental defects, microarray analyses were performed in a whole embryo mouse culture model that allows control over alcohol and embryonic variables. Alcohol caused teratogenesis in brain, heart, forelimb, and optic vesicle; a subset of the embryos also showed cranial neural tube defects. In microarray analysis (accession number GSM9545), adopting hypothesis-driven Gene Set Enrichment Analysis (GSEA) informatics and intersection analysis of two independent experiments, we found that there was a collective reduction in expression of neural specification genes (neurogenin, Sox5, Bhlhe22), neural growth factor genes [Igf1, Efemp1, Klf10 (Tieg), and Edil3], and alteration of genes involved in cell growth, apoptosis, histone variants, eye and heart development. There was also a reduction of retinol binding protein 1 (Rbp1), and de novo expression of aldehyde dehydrogenase 1B1 (Aldh1B1). Remarkably, four key hematopoiesis genes (glycophorin A, adducin 2, beta-2 microglobulin, and ceruloplasmin) were absent after alcohol treatment, and histone variant genes were reduced. The down-regulation of the neurospecification and the neurotrophic genes were further confirmed by quantitative RT-PCR. Furthermore, the gene expression profile demonstrated distinct subgroups which corresponded with two distinct alcohol-related neural tube phenotypes: an open (ALC-NTO) and a closed neural tube (ALC-NTC). Further, the epidermal growth factor signaling pathway and histone variants were specifically altered in ALC-NTO, and a greater number of neurotrophic/growth factor genes were down-regulated in the ALC-NTO than in the ALC-NTC embryos. This study revealed a set of genes vulnerable to alcohol exposure and genes that were associated with neural tube

  17. Identification of potential crucial genes associated with steroid-induced necrosis of femoral head based on gene expression profile.

    Science.gov (United States)

    Lin, Zhe; Lin, Yongsheng

    2017-09-05

    The aim of this study was to explore potential crucial genes associated with the steroid-induced necrosis of femoral head (SINFH) and to provide valid biological information for further investigation of SINFH. Gene expression profile of GSE26316, generated from 3 SINFH rat samples and 3 normal rat samples were downloaded from Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) were identified using LIMMA package. After functional enrichment analyses of DEGs, protein-protein interaction (PPI) network and sub-PPI network analyses were conducted based on the STRING database and cytoscape. In total, 59 up-regulated DEGs and 156 downregulated DEGs were identified. The up-regulated DEGs were mainly involved in functions about immunity (e.g. Fcer1A and Il7R), and the downregulated DEGs were mainly enriched in muscle system process (e.g. Tnni2, Mylpf and Myl1). The PPI network of DEGs consisted of 123 nodes and 300 interactions. Tnni2, Mylpf, and Myl1 were the top 3 outstanding genes based on both subgraph centrality and degree centrality evaluation. These three genes interacted with each other in the network. Furthermore, the significant network module was composed of 22 downregulated genes (e.g. Tnni2, Mylpf and Myl1). These genes were mainly enriched in functions like muscle system process. The DEGs related to the regulation of immune system process (e.g. Fcer1A and Il7R), and DEGs correlated with muscle system process (e.g. Tnni2, Mylpf and Myl1) may be closely associated with the progress of SINFH, which is still needed to be confirmed by experiments. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Other enrichment related contracts

    International Nuclear Information System (INIS)

    Hall, J.C.

    1978-01-01

    In addition to long-term enrichment contracts, DOE has other types of contracts: (1) short-term, fixed-commitment enrichment contract; (2) emergency sales agreement for enriched uranium; (3) feed material lease agreement; (4) enriched uranium storage agreement; and (5) feed material usage agreement

  19. Analysis of gene expression during odontogenic differentiation of cultured human dental pulp cells

    Directory of Open Access Journals (Sweden)

    Min-Seock Seo

    2012-08-01

    Full Text Available Objectives We analyzed gene-expression profiles after 14 day odontogenic induction of human dental pulp cells (DPCs using a DNA microarray and sought candidate genes possibly associated with mineralization. Materials and Methods Induced human dental pulp cells were obtained by culturing DPCs in odontogenic induction medium (OM for 14 day. Cells exposed to normal culture medium were used as controls. Total RNA was extracted from cells and analyzed by microarray analysis and the key results were confirmed selectively by reverse-transcriptase polymerase chain reaction (RT-PCR. We also performed a gene set enrichment analysis (GSEA of the microarray data. Results Six hundred and five genes among the 47,320 probes on the BeadChip differed by a factor of more than two-fold in the induced cells. Of these, 217 genes were upregulated, and 388 were down-regulated. GSEA revealed that in the induced cells, genes implicated in Apoptosis and Signaling by wingless MMTV integration (Wnt were significantly upregulated. Conclusions Genes implicated in Apoptosis and Signaling by Wnt are highly connected to the differentiation of dental pulp cells into odontoblast.

  20. The PR/SET Domain Zinc Finger Protein Prdm4 Regulates Gene Expression in Embryonic Stem Cells but Plays a Nonessential Role in the Developing Mouse Embryo

    Science.gov (United States)

    Bogani, Debora; Morgan, Marc A. J.; Nelson, Andrew C.; Costello, Ita; McGouran, Joanna F.; Kessler, Benedikt M.

    2013-01-01

    Prdm4 is a highly conserved member of the Prdm family of PR/SET domain zinc finger proteins. Many well-studied Prdm family members play critical roles in development and display striking loss-of-function phenotypes. Prdm4 functional contributions have yet to be characterized. Here, we describe its widespread expression in the early embryo and adult tissues. We demonstrate that DNA binding is exclusively mediated by the Prdm4 zinc finger domain, and we characterize its tripartite consensus sequence via SELEX (systematic evolution of ligands by exponential enrichment) and ChIP-seq (chromatin immunoprecipitation-sequencing) experiments. In embryonic stem cells (ESCs), Prdm4 regulates key pluripotency and differentiation pathways. Two independent strategies, namely, targeted deletion of the zinc finger domain and generation of a EUCOMM LacZ reporter allele, resulted in functional null alleles. However, homozygous mutant embryos develop normally and adults are healthy and fertile. Collectively, these results strongly suggest that Prdm4 functions redundantly with other transcriptional partners to cooperatively regulate gene expression in the embryo and adult animal. PMID:23918801

  1. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells

    International Nuclear Information System (INIS)

    Monticone, Massimiliano; Giaretti, Walter; Pfeffer, Ulrich; Daga, Antonio; Candiani, Simona; Romeo, Francesco; Mirisola, Valentina; Viaggi, Silvia; Melloni, Ilaria; Pedemonte, Simona; Zona, Gianluigi

    2012-01-01

    Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma) experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logic goal to impair the killing capacity of GBM cells by molecular targeting. We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type) or highly diffuse single tumor cell infiltration (HD-type). We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM). Real time RT-PCR, immunofluorescence and immunoblot analyses, were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode for proteins localized to the plasma membrane and 9 of these for proteins are involved in the process of cell adhesion. This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should be thus susceptible to an inhibition strategy by specific targeting. Massimiliano Monticone and Antonio Daga contributed equally to this work

  2. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells.

    Science.gov (United States)

    Monticone, Massimiliano; Daga, Antonio; Candiani, Simona; Romeo, Francesco; Mirisola, Valentina; Viaggi, Silvia; Melloni, Ilaria; Pedemonte, Simona; Zona, Gianluigi; Giaretti, Walter; Pfeffer, Ulrich; Castagnola, Patrizio

    2012-08-17

    Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma) experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logic goal to impair the killing capacity of GBM cells by molecular targeting.We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type) or highly diffuse single tumor cell infiltration (HD-type). We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM). Real time RT-PCR, immunofluorescence and immunoblot analyses, were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode for proteins localized to the plasma membrane and 9 of these for proteins are involved in the process of cell adhesion. This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should be thus susceptible to an inhibition strategy by specific targeting.Massimiliano Monticone and Antonio Daga contributed equally to this work.

  3. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects.

    Science.gov (United States)

    Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling; Wang, Xianhui; Kang, Le

    2017-06-01

    The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. © The Authors 2017. Published by Oxford University Press.

  4. On-Line Enrichment Monitor for UF{sub 6} Gas Centrifuge Enrichment Plant

    Energy Technology Data Exchange (ETDEWEB)

    Ianakiev, K. D.; Boyer, B.; Favalli, A.; Goda, J. M.; Hill, T.; Keller, C.; Lombardi, M.; Paffett, M.; MacArthur, D. W.; McCluskey, C.; Moss, C. E.; Parker, R.; Smith, M. K.; Swinhoe, M. T. [Los Alamos National Laboratory, Los Alamos (United States)

    2012-06-15

    This paper is a continuation of the Advanced Enrichment Monitoring Technology for UF{sub 6} Gas Centrifuge Enrichment Plant (GCEP) work, presented in the 2010 IAEA Safeguards Symposium. Here we will present the system architecture for a planned side-by-side field trial test of passive (186-keV line spectroscopy and pressure-based correction for UF{sub 6} gas density) and active (186-keV line spectroscopy and transmission measurement based correction for UF{sub 6} gas density) enrichment monitoring systems in URENCO's enrichment plant in Capenhurst. Because the pressure and transmission measurements of UF{sub 6} are complementary, additional information on the importance of the presence of light gases and the UF{sub 6} gas temperature can be obtained by cross-correlation between simultaneous measurements of transmission, pressure and 186-keV intensity. We will discuss the calibration issues and performance in the context of accurate, on-line enrichment measurement. It is hoped that a simple and accurate on-line enrichment monitor can be built using the UF{sub 6} gas pressure provided by the Operator, based on online mass spectrometer calibration, assuming a negligible (a small fraction of percent) contribution of wall deposits. Unaccounted-for wall deposits present at the initial calibration will lead to unwanted sensitivity to changes in theUF{sub 6} gas pressure and thus to error in the enrichment results. Because the accumulated deposits in the cascade header pipe have been identified as an issue for Go/No Go measurements with the Cascade Header Enrichment Monitor (CHEM) and Continuous Enrichment Monitor (CEMO), it is important to explore their effect. Therefore we present the expected uncertainty on enrichment measurements obtained by propagating the errors introduced by deposits, gas density, etc. and will discuss the options for a deposit correction during initial calibration of an On-Line Enrichment Monitor (OLEM).

  5. Hereditary cancer genes are highly susceptible to splicing mutations

    Science.gov (United States)

    Soemedi, Rachel; Maguire, Samantha; Murray, Michael F.; Monaghan, Sean F.

    2018-01-01

    Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5′ and 3′ splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77%) of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analyzing past reports suggest that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36%) of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing. PMID:29505604

  6. Hereditary cancer genes are highly susceptible to splicing mutations.

    Directory of Open Access Journals (Sweden)

    Christy L Rhine

    2018-03-01

    Full Text Available Substitutions that disrupt pre-mRNA splicing are a common cause of genetic disease. On average, 13.4% of all hereditary disease alleles are classified as splicing mutations mapping to the canonical 5' and 3' splice sites. However, splicing mutations present in exons and deeper intronic positions are vastly underreported. A recent re-analysis of coding mutations in exon 10 of the Lynch Syndrome gene, MLH1, revealed an extremely high rate (77% of mutations that lead to defective splicing. This finding is confirmed by extending the sampling to five other exons in the MLH1 gene. Further analysis suggests a more general phenomenon of defective splicing driving Lynch Syndrome. Of the 36 mutations tested, 11 disrupted splicing. Furthermore, analyzing past reports suggest that MLH1 mutations in canonical splice sites also occupy a much higher fraction (36% of total mutations than expected. When performing a comprehensive analysis of splicing mutations in human disease genes, we found that three main causal genes of Lynch Syndrome, MLH1, MSH2, and PMS2, belonged to a class of 86 disease genes which are enriched for splicing mutations. Other cancer genes were also enriched in the 86 susceptible genes. The enrichment of splicing mutations in hereditary cancers strongly argues for additional priority in interpreting clinical sequencing data in relation to cancer and splicing.

  7. The use of medium enriched uranium fuel for research reactors

    International Nuclear Information System (INIS)

    1979-01-01

    The evaluation described in the present paper concerns the use of medium enriched uranium fuel for our research reactors. The underlying assumptions set up for the evaluation are as follows: (1) At first, the use of alternative fuel should not affect, even to a small extent, research and development programs in nuclear energy utilization, which were described in the previous paper. Hence the use of lower enrichment fuel should not cause any reduction in reactor performances. (2) The fuel cycle cost for operating research reactors with alternative fuel, excepting R and D cost for such fuel, should not increase beyond an acceptable limit. (3) The use of alternative fuel should be satisfactory with respect to non-proliferation purposes, to the almost same degree as the use of 20% enriched uranium fuel

  8. Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

    Science.gov (United States)

    Yang, Hong; Lin, Shan; Cui, Jingru

    2014-02-10

    Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.

  9. Stress associated gene expression in blood cells is related to outcome in radiotherapy treated head and neck cancer patients

    International Nuclear Information System (INIS)

    Bøhn, Siv K; Blomhoff, Rune; Russnes, Kjell M; Sakhi, Amrit K; Thoresen, Magne; Holden, Marit; Moskaug, JanØ; Myhrstad, Mari C; Olstad, Ole K; Smeland, Sigbjørn

    2012-01-01

    We previously observed that a radiotherapy-induced biochemical response in plasma was associated with favourable outcome in head and neck squamous carcinoma cancer (HNSCC) patients. The aim of the present study was to compare stress associated blood cell gene expression between two sub-groups of HNSCC patients with different biochemical responses to radiotherapy. Out of 87 patients (histologically verified), 10 biochemical ‘responders’ having a high relative increase in plasma oxidative damage and a concomitant decrease in plasma antioxidants during radiotherapy and 10 ‘poor-responders’ were selected for gene-expression analysis and compared using gene set enrichment analysis. There was a significant induction of stress-relevant gene-sets in the responders following radiotherapy compared to the poor-responders. The relevance of the involvement of similar stress associated gene expression for HNSCC cancer and radioresistance was verified using two publicly available data sets of 42 HNSCC cases and 14 controls (GEO GSE6791), and radiation resistant and radiation sensitive HNSCC xenografts (E-GEOD-9716). Radiotherapy induces a systemic stress response, as revealed by induction of stress relevant gene expression in blood cells, which is associated to favourable outcome in a cohort of 87 HNSCC patients. Whether these changes in gene expression reflects a systemic effect or are biomarkers of the tumour micro-environmental status needs further study. Raw data are available at ArrayExpress under accession number E-MEXP-2460

  10. Standardized Environmental Enrichment Supports Enhanced Brain Plasticity in Healthy Rats and Prevents Cognitive Impairment in Epileptic Rats

    Science.gov (United States)

    Kouchi, Hayet Y.; Bodennec, Jacques; Morales, Anne; Georges, Béatrice; Bonnet, Chantal; Bouvard, Sandrine; Sloviter, Robert S.; Bezin, Laurent

    2013-01-01

    Environmental enrichment of laboratory animals influences brain plasticity, stimulates neurogenesis, increases neurotrophic factor expression, and protects against the effects of brain insult. However, these positive effects are not constantly observed, probably because standardized procedures of environmental enrichment are lacking. Therefore, we engineered an enriched cage (the Marlau™ cage), which offers: (1) minimally stressful social interactions; (2) increased voluntary exercise; (3) multiple entertaining activities; (4) cognitive stimulation (maze exploration), and (5) novelty (maze configuration changed three times a week). The maze, which separates food pellet and water bottle compartments, guarantees cognitive stimulation for all animals. Compared to rats raised in groups in conventional cages, rats housed in Marlau™ cages exhibited increased cortical thickness, hippocampal neurogenesis and hippocampal levels of transcripts encoding various genes involved in tissue plasticity and remodeling. In addition, rats housed in Marlau™ cages exhibited better performances in learning and memory, decreased anxiety-associated behaviors, and better recovery of basal plasma corticosterone level after acute restraint stress. Marlau™ cages also insure inter-experiment reproducibility in spatial learning and brain gene expression assays. Finally, housing rats in Marlau™ cages after severe status epilepticus at weaning prevents the cognitive impairment observed in rats subjected to the same insult and then housed in conventional cages. By providing a standardized enriched environment for rodents during housing, the Marlau™ cage should facilitate the uniformity of environmental enrichment across laboratories. PMID:23342033

  11. Standardized environmental enrichment supports enhanced brain plasticity in healthy rats and prevents cognitive impairment in epileptic rats.

    Directory of Open Access Journals (Sweden)

    Raafat P Fares

    Full Text Available Environmental enrichment of laboratory animals influences brain plasticity, stimulates neurogenesis, increases neurotrophic factor expression, and protects against the effects of brain insult. However, these positive effects are not constantly observed, probably because standardized procedures of environmental enrichment are lacking. Therefore, we engineered an enriched cage (the Marlau™ cage, which offers: (1 minimally stressful social interactions; (2 increased voluntary exercise; (3 multiple entertaining activities; (4 cognitive stimulation (maze exploration, and (5 novelty (maze configuration changed three times a week. The maze, which separates food pellet and water bottle compartments, guarantees cognitive stimulation for all animals. Compared to rats raised in groups in conventional cages, rats housed in Marlau™ cages exhibited increased cortical thickness, hippocampal neurogenesis and hippocampal levels of transcripts encoding various genes involved in tissue plasticity and remodeling. In addition, rats housed in Marlau™ cages exhibited better performances in learning and memory, decreased anxiety-associated behaviors, and better recovery of basal plasma corticosterone level after acute restraint stress. Marlau™ cages also insure inter-experiment reproducibility in spatial learning and brain gene expression assays. Finally, housing rats in Marlau™ cages after severe status epilepticus at weaning prevents the cognitive impairment observed in rats subjected to the same insult and then housed in conventional cages. By providing a standardized enriched environment for rodents during housing, the Marlau™ cage should facilitate the uniformity of environmental enrichment across laboratories.

  12. Multitarget Effects of Danqi Pill on Global Gene Expression Changes in Myocardial Ischemia

    Directory of Open Access Journals (Sweden)

    Qiyan Wang

    2018-01-01

    Full Text Available Danqi pill (DQP is a widely prescribed traditional Chinese medicine (TCM in the treatment of cardiovascular diseases. The objective of this study is to systematically characterize altered gene expression pattern induced by myocardial ischemia (MI in a rat model and to investigate the effects of DQP on global gene expression. Global mRNA expression was measured. Differentially expressed genes among the sham group, model group, and DQP group were analyzed. The gene ontology enrichment analysis and pathway analysis of differentially expressed genes were carried out. We quantified 10,813 genes. Compared with the sham group, expressions of 339 genes were upregulated and 177 genes were downregulated in the model group. The upregulated genes were enriched in extracellular matrix organization, response to wounding, and defense response pathways. Downregulated genes were enriched in fatty acid metabolism, pyruvate metabolism, PPAR signaling pathways, and so forth. This indicated that energy metabolic disorders occurred in rats with MI. In the DQP group, expressions of genes in the altered pathways were regulated back towards normal levels. DQP reversed expression of 313 of the 516 differentially expressed genes in the model group. This study provides insight into the multitarget mechanism of TCM in the treatment of complex diseases.

  13. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

    Science.gov (United States)

    Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

    2016-03-11

    Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

  14. Neoplastic and stromal cells contribute to an extracellular matrix gene expression profile defining a breast cancer subtype likely to progress.

    Directory of Open Access Journals (Sweden)

    Tiziana Triulzi

    Full Text Available We recently showed that differential expression of extracellular matrix (ECM genes delineates four subgroups of breast carcinomas (ECM1, -2, -3- and -4 with different clinical outcome. To further investigate the characteristics of ECM signature and its impact on tumor progression, we conducted unsupervised clustering analyses in 6 additional independent datasets of invasive breast tumors from different platforms for a total of 643 samples. Use of four different clustering algorithms identified ECM3 tumors as an independent group in all datasets tested. ECM3 showed a homogeneous gene pattern, consisting of 58 genes encoding 43 structural ECM proteins. From 26 to 41% of the cases were ECM3-enriched, and analysis of datasets relevant to gene expression in neoplastic or corresponding stromal cells showed that both stromal and breast carcinoma cells can coordinately express ECM3 genes. In in vitro experiments, β-estradiol induced ECM3 gene production in ER-positive breast carcinoma cell lines, whereas TGFβ induced upregulation of the genes leading to ECM3 gene classification, especially in ER-negative breast carcinoma cells and in fibroblasts. Multivariate analysis of distant metastasis-free survival in untreated breast tumor patients revealed a significant interaction between ECM3 and histological grade (p = 0.001. Cox models, estimated separately in grade I-II and grade III tumors, indicated a highly significant association between ECM3 and worse survival probability only in grade III tumors (HR = 3.0, 95% CI = 1.3-7.0, p = 0.0098. Gene Set Enrichment analysis of ECM3 compared to non-ECM3 tumors revealed significant enrichment of epithelial-mesenchymal transition (EMT genes in both grade I-II and grade III subsets of ECM3 tumors. Thus, ECM3 is a robust cluster that identifies breast carcinomas with EMT features but with accelerated metastatic potential only in the undifferentiated (grade III phenotype. These findings support the

  15. Microbial diversity of western Canadian subsurface coal beds and methanogenic coal enrichment cultures

    Energy Technology Data Exchange (ETDEWEB)

    Penner, Tara J.; Foght, Julia M. [Department of Biological Sciences, University of Alberta, Edmonton, Alberta (Canada); Budwill, Karen [Carbon and Energy Management, Alberta Innovates-Technology Futures, 250 Karl Clark Road, Edmonton, Alberta (Canada)

    2010-05-01

    Coalbed methane is an unconventional fuel source associated with certain coal seams. Biogenic methane can comprise a significant portion of the gas found in coal seams, yet the role of microbes in methanogenesis in situ is uncertain. The purpose of this study was to detect and identify major bacterial and archaeal species associated with coal sampled from sub-bituminous methane-producing coal beds in western Canada, and to examine the potential for methane biogenesis from coal. Enrichment cultures of coal samples were established to determine how nutrient amendment influenced the microbial community and methane production in the laboratory. 16S rRNA gene clone libraries were constructed using DNA extracted and amplified from uncultured coal samples and from methanogenic coal enrichment cultures. Libraries were screened using restriction fragment length polymorphism, and representative clones were sequenced. Most (> 50%) of the bacterial sequences amplified from uncultured coal samples were affiliated with Proteobacteria that exhibit nitrate reduction, nitrogen fixation and/or hydrogen utilization activities, including Pseudomonas, Thauera and Acidovorax spp., whereas enrichment cultures were dominated by Bacteroidetes, Clostridia and/or Lactobacillales. Archaeal 16S rRNA genes could not be amplified from uncultured coal, suggesting that methanogens are present in coal below the detection levels of our methods. However, enrichment cultures established with coal inocula produced significant volumes of methane and the archaeal clone libraries were dominated by sequences closely affiliated with Methanosarcina spp. Enrichment cultures incubated with coal plus organic nutrients produced more methane than either nutrient or coal supplements alone, implying that competent methanogenic consortia exist in coal beds but that nutrient limitations restrict their activity in situ. This report adds to the scant literature on coal bed microbiology and suggests how microbes may be

  16. MiRNA-TF-gene network analysis through ranking of biomolecules for multi-informative uterine leiomyoma dataset.

    Science.gov (United States)

    Mallik, Saurav; Maulik, Ujjwal

    2015-10-01

    Gene ranking is an important problem in bioinformatics. Here, we propose a new framework for ranking biomolecules (viz., miRNAs, transcription-factors/TFs and genes) in a multi-informative uterine leiomyoma dataset having both gene expression and methylation data using (statistical) eigenvector centrality based approach. At first, genes that are both differentially expressed and methylated, are identified using Limma statistical test. A network, comprising these genes, corresponding TFs from TRANSFAC and ITFP databases, and targeter miRNAs from miRWalk database, is then built. The biomolecules are then ranked based on eigenvector centrality. Our proposed method provides better average accuracy in hub gene and non-hub gene classifications than other methods. Furthermore, pre-ranked Gene set enrichment analysis is applied on the pathway database as well as GO-term databases of Molecular Signatures Database with providing a pre-ranked gene-list based on different centrality values for comparing among the ranking methods. Finally, top novel potential gene-markers for the uterine leiomyoma are provided. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network

    Directory of Open Access Journals (Sweden)

    Chen Xin

    2012-10-01

    Full Text Available Abstract Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN. Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable

  18. Enrichment and Molecular Analysis of Breast Cancer Disseminated Tumor Cells from Bone Marrow Using Microfiltration.

    Directory of Open Access Journals (Sweden)

    Sreeraj G Pillai

    Full Text Available Molecular characterization of disseminated tumor cells (DTCs in the bone marrow (BM of breast cancer (BC patients has been hindered by their rarity. To enrich for these cells using an antigen-independent methodology, we have evaluated a size-based microfiltration device in combination with several downstream biomarker assays.BM aspirates were collected from healthy volunteers or BC patients. Healthy BM was mixed with a specified number of BC cells to calculate recovery and fold enrichment by microfiltration. Specimens were pre-filtered using a 70 μm mesh sieve and the effluent filtered through CellSieve microfilters. Captured cells were analyzed by immunocytochemistry (ICC, FISH for HER-2/neu gene amplification status, and RNA in situ hybridization (RISH. Cells eluted from the filter were used for RNA isolation and subsequent qRT-PCR analysis for DTC biomarker gene expression.Filtering an average of 14×106 nucleated BM cells yielded approximately 17-21×103 residual BM cells. In the BC cell spiking experiments, an average of 87% (range 84-92% of tumor cells were recovered with approximately 170- to 400-fold enrichment. Captured BC cells from patients co-stained for cytokeratin and EpCAM, but not CD45 by ICC. RNA yields from 4 ml of patient BM after filtration averaged 135ng per 10 million BM cells filtered with an average RNA Integrity Number (RIN of 5.3. DTC-associated gene expression was detected by both qRT-PCR and RISH in filtered spiked or BC patient specimens but, not in control filtered normal BM.We have tested a microfiltration technique for enrichment of BM DTCs. DTC capture efficiency was shown to range from 84.3% to 92.1% with up to 400-fold enrichment using model BC cell lines. In patients, recovered DTCs can be identified and distinguished from normal BM cells using multiple antibody-, DNA-, and RNA-based biomarker assays.

  19. Uranium Enrichment, an overview

    International Nuclear Information System (INIS)

    Coates, J.H.

    1994-01-01

    This general presentation on uranium enrichment will be followed by lectures on more specific topics including descriptions of enrichment processes and assessments of the prevailing commercial and industrial situations. I shall therefore avoid as much as possible duplications with these other lectures, and rather dwell on: some theoretical aspects of enrichment in general, underlying the differences between statistical and selective processes, a review and comparison between enrichment processes, remarks of general order regarding applications, the proliferation potential of enrichment. It is noteworthy that enrichment: may occur twice in the LWR fuel cycle: first by enriching natural uranium, second by reenriching uranium recovered from reprocessing, must meet LWR requirements, and in particular higher assays required by high burn up fuel elements, bears on the structure of the entire front part of the fuel cycle, namely in the conversion/reconversion steps only involving UF 6 for the moment. (author). tabs., figs., 4 refs

  20. The gene regulatory network for breast cancer: Integrated regulatory landscape of cancer hallmarks

    Directory of Open Access Journals (Sweden)

    Frank eEmmert-Streib

    2014-02-01

    Full Text Available In this study, we infer the breast cancer gene regulatory network from gene expression data. This network is obtained from the application of the BC3Net inference algorithm to a large-scale gene expression data set consisting of $351$ patient samples. In order to elucidate the functional relevance of the inferred network, we are performing a Gene Ontology (GO analysis for its structural components. Our analysis reveals that most significant GO-terms we find for the breast cancer network represent functional modules of biological processes that are described by known cancer hallmarks, including translation, immune response, cell cycle, organelle fission, mitosis, cell adhesion, RNA processing, RNA splicing and response to wounding. Furthermore, by using a curated list of census cancer genes, we find an enrichment in these functional modules. Finally, we study cooperative effects of chromosomes based on information of interacting genes in the beast cancer network. We find that chromosome $21$ is most coactive with other chromosomes. To our knowledge this is the first study investigating the genome-scale breast cancer network.

  1. Conservation in Mammals of Genes Associated with Aggression-Related Behavioral Phenotypes in Honey Bees.

    Directory of Open Access Journals (Sweden)

    Hui Liu

    2016-06-01

    Full Text Available The emerging field of sociogenomics explores the relations between social behavior and genome structure and function. An important question is the extent to which associations between social behavior and gene expression are conserved among the Metazoa. Prior experimental work in an invertebrate model of social behavior, the honey bee, revealed distinct brain gene expression patterns in African and European honey bees, and within European honey bees with different behavioral phenotypes. The present work is a computational study of these previous findings in which we analyze, by orthology determination, the extent to which genes that are socially regulated in honey bees are conserved across the Metazoa. We found that the differentially expressed gene sets associated with alarm pheromone response, the difference between old and young bees, and the colony influence on soldier bees, are enriched in widely conserved genes, indicating that these differences have genomic bases shared with many other metazoans. By contrast, the sets of differentially expressed genes associated with the differences between African and European forager and guard bees are depleted in widely conserved genes, indicating that the genomic basis for this social behavior is relatively specific to honey bees. For the alarm pheromone response gene set, we found a particularly high degree of conservation with mammals, even though the alarm pheromone itself is bee-specific. Gene Ontology identification of human orthologs to the strongly conserved honey bee genes associated with the alarm pheromone response shows overrepresentation of protein metabolism, regulation of protein complex formation, and protein folding, perhaps associated with remodeling of critical neural circuits in response to alarm pheromone. We hypothesize that such remodeling may be an adaptation of social animals to process and respond appropriately to the complex patterns of conspecific communication essential for

  2. Conservation in Mammals of Genes Associated with Aggression-Related Behavioral Phenotypes in Honey Bees.

    Science.gov (United States)

    Liu, Hui; Robinson, Gene E; Jakobsson, Eric

    2016-06-01

    The emerging field of sociogenomics explores the relations between social behavior and genome structure and function. An important question is the extent to which associations between social behavior and gene expression are conserved among the Metazoa. Prior experimental work in an invertebrate model of social behavior, the honey bee, revealed distinct brain gene expression patterns in African and European honey bees, and within European honey bees with different behavioral phenotypes. The present work is a computational study of these previous findings in which we analyze, by orthology determination, the extent to which genes that are socially regulated in honey bees are conserved across the Metazoa. We found that the differentially expressed gene sets associated with alarm pheromone response, the difference between old and young bees, and the colony influence on soldier bees, are enriched in widely conserved genes, indicating that these differences have genomic bases shared with many other metazoans. By contrast, the sets of differentially expressed genes associated with the differences between African and European forager and guard bees are depleted in widely conserved genes, indicating that the genomic basis for this social behavior is relatively specific to honey bees. For the alarm pheromone response gene set, we found a particularly high degree of conservation with mammals, even though the alarm pheromone itself is bee-specific. Gene Ontology identification of human orthologs to the strongly conserved honey bee genes associated with the alarm pheromone response shows overrepresentation of protein metabolism, regulation of protein complex formation, and protein folding, perhaps associated with remodeling of critical neural circuits in response to alarm pheromone. We hypothesize that such remodeling may be an adaptation of social animals to process and respond appropriately to the complex patterns of conspecific communication essential for social organization.

  3. Efficient clinical-scale enrichment of lymphocytes for use in adoptive immunotherapy using a modified counterflow centrifugal elutriation program.

    Science.gov (United States)

    Powell, Daniel J; Brennan, Andrea L; Zheng, Zhaohui; Huynh, Hong; Cotte, Julio; Levine, Bruce L

    2009-01-01

    Clinical-scale lymphocyte enrichment from a leukapheresis product has been performed most routinely using costly magnetic bead separation systems that deplete monocytes, but this procedure may leave behind residual beads or antibodies in the enriched cell product. Counterflow centrifugal elutriation has been demonstrated previously to enrich monocytes efficiently for generation of dendritic cells. This study describes a modified elutriation procedure for efficient bead-free economical enrichment of lymphocytes from leukapheresis products from healthy donors and study subjects with human immunodeficiency virus (HIV) infection or malignancy. Modified program settings and conditions for the CaridianBCT Elutra device were investigated to optimize lymphocyte enrichment and recovery. Lymphocyte enrichment was measured using a novel approach utilizing cell sizing analysis on a Beckman Coulter Multisizer and confirmed by flow cytometry phenotypic analysis. Efficient enrichment and recovery of lymphocytes from leukapheresis cell products was achieved using modified elutriation settings for flow rate and fraction volume. Elutriation allowed for enrichment of larger numbers of lymphocytes compared with depletion of monocytes by bead adherence, with a trend toward increased lymphocyte purity and yield via elutriation, resulting in a substantial reduction in the cost of enrichment per cell. Importantly, significant lymphocyte enrichment could be accomplished using leukapheresis samples from healthy donors (n=12) or from study subjects with HIV infection (n=15) or malignancy (n=12). Clinical-scale closed-system elutriation can be performed efficiently for the selective enrichment of lymphocytes for immunotherapy protocols. This represents an improvement in cost, yield and purity over current methods that require the addition of monocyte-depleting beads.

  4. Preservation of bone mass and structure in hibernating black bears (Ursus americanus) through elevated expression of anabolic genes.

    Science.gov (United States)

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Øivind; Stewart, Nathan C; Chang, Celia; Wang, Haifang; Yan, Jun; Showe, Louise C; Showe, Michael K; Donahue, Seth W; Barnes, Brian M

    2012-06-01

    Physical inactivity reduces mechanical load on the skeleton, which leads to losses of bone mass and strength in non-hibernating mammalian species. Although bears are largely inactive during hibernation, they show no loss in bone mass and strength. To obtain insight into molecular mechanisms preventing disuse bone loss, we conducted a large-scale screen of transcriptional changes in trabecular bone comparing winter hibernating and summer non-hibernating black bears using a custom 12,800 probe cDNA microarray. A total of 241 genes were differentially expressed (P 1.4) in the ilium bone of bears between winter and summer. The Gene Ontology and Gene Set Enrichment Analysis showed an elevated proportion in hibernating bears of overexpressed genes in six functional sets of genes involved in anabolic processes of tissue morphogenesis and development including skeletal development, cartilage development, and bone biosynthesis. Apoptosis genes demonstrated a tendency for downregulation during hibernation. No coordinated directional changes were detected for genes involved in bone resorption, although some genes responsible for osteoclast formation and differentiation (Ostf1, Rab9a, and c-Fos) were significantly underexpressed in bone of hibernating bears. Elevated expression of multiple anabolic genes without induction of bone resorption genes, and the down regulation of apoptosis-related genes, likely contribute to the adaptive mechanism that preserves bone mass and structure through prolonged periods of immobility during hibernation.

  5. Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

    Science.gov (United States)

    Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

    2015-05-15

    The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

  6. R and D on laser uranium enrichment

    International Nuclear Information System (INIS)

    Anon.

    1986-01-01

    An AEC Advisory Committee on Uranium Enrichment has completed investigations into the actual condition of laser isotope separation. The working group set up for the purpose has issued a report on the series of investigations made on its development and measures for promoting it. The report says that the development of the process in Japan is at a fundamental stage. Noting that further efforts are needed before its future can be predicted, the report proposes a cource of research and development for the immediate future. For the atomic vapor laser isotope separation (AVLIS), government organizations are engaged in data base buildup and conducting basis engineering tests, and Japan Atomic Energy Research Institute will consider the re-enrichment of uranium recovered from reprocessing. Non-governmental unions of researchers will promote the combination of copper-vapor laser and dye laser. For the molecular laser isotope separation (MLIS), the Institute of Physical and Chemical Research will take up studies with the cooperation of the Power Reactor and Nuclear Fuel Development Corporation. In chapters covering the philosophy of laser uranium enrichment technology development, the report deals with its significance, actual conditions and tasks, and goals and measures for its promotion. (Nogami, K.)

  7. Comments on Smith Barney's uranium enrichment analysis

    International Nuclear Information System (INIS)

    Rezendes, V.S.

    1990-07-01

    In a May 1990 report, Smith Barney, Harris Upham and Co. concluded that DOE's uranium enrichment program should be restructured as a government corporation; all past costs have been recovered, and DOE's customers have been overcharged about $1.2 billion; the government should retain responsibility for environment and decommissioning costs associated with enriched uranium production before the corporation's formation; and at some future time the corporation could be sold to the private sector. This report agrees with Smith Barney's recommendation to restructure the enrichment program as a government corporation, but disagrees that DOE's customers have paid for all past costs. According to the author, Smith Barney did not identify the total environmental or decommissioning costs between the government and the corporation. Since these costs are largely undefined, but could amount to billions, Congress should immediately require the program to begin setting aside funds for these costs. DOE estimates that government purchases are responsible for 50 percent of the decommissioning costs; therefore, the government should share these costs by matching the corporation's fund contributions. This requirement should continue until the existing plants have been decommissioned

  8. Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection.

    LENUS (Irish Health Repository)

    Kenny, Elaine M

    2011-02-01

    Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.

  9. gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

    Science.gov (United States)

    Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J

    2017-05-01

    Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.

  10. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells

    Directory of Open Access Journals (Sweden)

    Monticone Massimiliano

    2012-08-01

    Full Text Available Abstract Background Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logic goal to impair the killing capacity of GBM cells by molecular targeting. We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type or highly diffuse single tumor cell infiltration (HD-type. Methods We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM. Real time RT-PCR, immunofluorescence and immunoblot analyses, were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Results Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode for proteins localized to the plasma membrane and 9 of these for proteins are involved in the process of cell adhesion. Conclusions This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should be thus susceptible to an inhibition strategy by specific targeting. Massimiliano Monticone and Antonio Daga contributed equally to this work

  11. BlueBerry Isolate, Pterostilbene, Functions as a Potential Anticancer Stem Cell Agent in Suppressing Irradiation-Mediated Enrichment of Hepatoma Stem Cells

    Directory of Open Access Journals (Sweden)

    Chi-Ming Lee

    2013-01-01

    Full Text Available For many malignancies, radiation therapy remains the second option only to surgery in terms of its curative potential. However, radiation-induced tumor cell death is limited by a number of factors, including the adverse response of the tumor microenvironment to the treatment and either intrinsic or acquired mechanisms of evasive resistance, and the existence of cancer stem cells (CSCs. In this study, we demonstrated that using different doses of irradiation led to the enrichment of CD133+ Mahlavu cells using flow cytometric method. Subsequently, CD133+ Mahlavu cells enriched by irradiation were characterized for their stemness gene expression, self-renewal, migration/invasion abilities, and radiation resistance. Having established irradiation-enriched CD133+ Mahlavu cells with CSC properties, we evaluated a phytochemical, pterostilbene (PT, found abundantly in blueberries, against irradiation-enriched CSCs. It was shown that PT treatment dose-dependently reduced the enrichment of CD133+ Mahlavu cells upon irradiation; PT treatment also prevented tumor sphere formation, reduced stemness gene expression, and suppressed invasion and migration abilities as well as increasing apoptosis of CD133+ Mahlavu CSCs. Based on our experimental data, pterostilbene could be used to prevent the enrichment of CD133+ hepatoma CSCs and should be considered for future clinical testing as a combined agent for HCC patients.

  12. Transcription factor control of growth rate dependent genes in Saccharomyces cerevisiae: A three factor design

    DEFF Research Database (Denmark)

    Fazio, Alessandro; Jewett, Michael Christopher; Daran-Lapujade, Pascale

    2008-01-01

    , such as Ace2 and Swi6, and stress response regulators, such as Yap1, were also shown to have significantly enriched target sets. Conclusion: Our work, which is the first genome-wide gene expression study to investigate specific growth rate and consider the impact of oxygen availability, provides a more......Background: Characterization of cellular growth is central to understanding living systems. Here, we applied a three-factor design to study the relationship between specific growth rate and genome-wide gene expression in 36 steady-state chemostat cultures of Saccharomyces cerevisiae. The three...... factors we considered were specific growth rate, nutrient limitation, and oxygen availability. Results: We identified 268 growth rate dependent genes, independent of nutrient limitation and oxygen availability. The transcriptional response was used to identify key areas in metabolism around which m...

  13. Environmental enrichment delays pup-induced maternal behavior in rats.

    Science.gov (United States)

    Mann, Phyllis E; Gervais, Kristen J

    2011-05-01

    Adult, virgin rats do not spontaneously display maternal behavior when exposed to foster pups. However, continuous daily exposure of the female to foster pups for about 5-7 days can induce a set of maternal behaviors similar to those shown by postpartum dams. Induction latencies depend upon a number of factors, including the stress and anxiety levels of the female. The goal of this study was to attempt to mitigate the likely stressfulness of being singly housed during testing by enriching the rat's home cage environment and to determine if the concomitant environmental change would alter the latency to express maternal behavior. In addition, the effect of varying the number of test pups used for testing was examined. Two groups of virgin Sprague-Dawley rats were first tested on the elevated plus maze after 1 week of exposure to either control (standard housing) or enriched conditions. One week later, maternal behavior testing began using one or three pups. Upon completion of maternal behavior testing, plasma corticosterone concentrations were determined following a mild stressor. The data indicate that enrichment tends to increase anxiety-like behaviors in the elevated plus maze. In addition, enrichment delayed the onset of maternal behavior irrespective of the number of test pups. There were no effects of environmental enrichment on plasma corticosterone levels following exposure to a stressor. These results indicate that what is considered a modestly enriched environment delays the expression of pup-oriented responses and does not apparently reduce stress or improve performance on all behavioral tasks. Copyright © 2011 Wiley Periodicals, Inc.

  14. Network Analysis of Human Genes Influencing Susceptibility to Mycobacterial Infections

    Science.gov (United States)

    Lipner, Ettie M.; Garcia, Benjamin J.; Strong, Michael

    2016-01-01

    Tuberculosis and nontuberculous mycobacterial infections constitute a high burden of pulmonary disease in humans, resulting in over 1.5 million deaths per year. Building on the premise that genetic factors influence the instance, progression, and defense of infectious disease, we undertook a systems biology approach to investigate relationships among genetic factors that may play a role in increased susceptibility or control of mycobacterial infections. We combined literature and database mining with network analysis and pathway enrichment analysis to examine genes, pathways, and networks, involved in the human response to Mycobacterium tuberculosis and nontuberculous mycobacterial infections. This approach allowed us to examine functional relationships among reported genes, and to identify novel genes and enriched pathways that may play a role in mycobacterial susceptibility or control. Our findings suggest that the primary pathways and genes influencing mycobacterial infection control involve an interplay between innate and adaptive immune proteins and pathways. Signaling pathways involved in autoimmune disease were significantly enriched as revealed in our networks. Mycobacterial disease susceptibility networks were also examined within the context of gene-chemical relationships, in order to identify putative drugs and nutrients with potential beneficial immunomodulatory or anti-mycobacterial effects. PMID:26751573

  15. Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation

    Science.gov (United States)

    Jaiswal, Deepika; Jezek, Meagan; Quijote, Jeremiah; Lum, Joanna; Choi, Grace; Kulkarni, Rushmie; Park, DoHwan; Green, Erin M.

    2017-01-01

    The conserved yeast histone methyltransferase Set1 targets H3 lysine 4 (H3K4) for mono, di, and trimethylation and is linked to active transcription due to the euchromatic distribution of these methyl marks and the recruitment of Set1 during transcription. However, loss of Set1 results in increased expression of multiple classes of genes, including genes adjacent to telomeres and middle sporulation genes, which are repressed under normal growth conditions because they function in meiotic progression and spore formation. The mechanisms underlying Set1-mediated gene repression are varied, and still unclear in some cases, although repression has been linked to both direct and indirect action of Set1, associated with noncoding transcription, and is often dependent on the H3K4me2 mark. We show that Set1, and particularly the H3K4me2 mark, are implicated in repression of a subset of middle sporulation genes during vegetative growth. In the absence of Set1, there is loss of the DNA-binding transcriptional regulator Sum1 and the associated histone deacetylase Hst1 from chromatin in a locus-specific manner. This is linked to increased H4K5ac at these loci and aberrant middle gene expression. These data indicate that, in addition to DNA sequence, histone modification status also contributes to proper localization of Sum1. Our results also show that the role for Set1 in middle gene expression control diverges as cells receive signals to undergo meiosis. Overall, this work dissects an unexplored role for Set1 in gene-specific repression, and provides important insights into a new mechanism associated with the control of gene expression linked to meiotic differentiation. PMID:29066473

  16. Comprehensive evaluation of disease- and trait-specific enrichment for eight functional elements among GWAS-identified variants.

    Science.gov (United States)

    Markunas, Christina A; Johnson, Eric O; Hancock, Dana B

    2017-07-01

    Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs P TissueDifference  = 1.28 × 10 -6 vs. enhancers P TissueDifference  = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.

  17. Uranium enrichment

    International Nuclear Information System (INIS)

    Mohrhauer, H.

    1982-01-01

    The separation of uranium isotopes in order to enrich the fuel for light water reactors with the light isotope U-235 is an important part of the nuclear fuel cycle. After the basic principals of isotope separation the gaseous diffusion and the centrifuge process are explained. Both these techniques are employed on an industrial scale. In addition a short review is given on other enrichment techniques which have been demonstrated at least on a laboratory scale. After some remarks on the present situation on the enrichment market the progress in the development and the industrial exploitation of the gas centrifuge process by the trinational Urenco-Centec organisation is presented. (orig.)

  18. United States uranium enrichment policies

    International Nuclear Information System (INIS)

    Roberts, R.W.

    1977-01-01

    ERDA's uranium enrichment program policies governing the manner in which ERDA's enrichment complex is being operated and expanded to meet customer requirements for separative work, research and development activities directed at providing technology alternatives for future enrichment capacity, and establishing the framework for additional domestic uranium enrichment capacity to meet the domestic and foreign nuclear industry's growing demand for enrichment services are considered. The ERDA enrichment complex consists of three gaseous diffusion plants located in Oak Ridge, Tennessee; Paducah, Kentucky; and Portsmouth, Ohio. Today, these plants provide uranium enrichment services for commercial nuclear power generation. These enrichment services are provided under contracts between the Government and the utility customers. ERDA's program involves a major pilot plant cascade, and pursues an advanced isotope separation technique for the late 1980's. That the United States must develop additional domestic uranium enrichment capacity is discussed

  19. Corexit 9500 Enhances Oil Biodegradation and Changes Active Bacterial Community Structure of Oil-Enriched Microcosms

    OpenAIRE

    Techtmann, Stephen M.; Zhuang, Mobing; Campo, Pablo; Holder, Edith; Elk, Michael; Hazen, Terry C.; Conmy, Robyn; Santo Domingo, Jorge W.

    2017-01-01

    To better understand the impacts of Corexit 9500 on the structure and activity levels of hydrocarbon-degrading microbial communities, we analyzed next-generation 16S rRNA gene sequencing libraries of hydrocarbon enrichments grown at 5 and 25°C using both DNA and RNA extracts as the sequencing templates. Oil biodegradation patterns in both 5 and 25°C enrichments were consistent with those reported in the literature (i.e., aliphatics were degraded faster than aromatics). Slight increases in bio...

  20. Glutamine-enriched enteral nutrition in very low-birth-weight infants

    NARCIS (Netherlands)

    van den Berg, Anemone; van Zwol, Annelies; Moll, Henriëtte A.; Fetter, Willem P. F.; van Elburg, Ruurd M.

    2007-01-01

    Objective: To determine the effect of glutamine-enriched enteral nutrition in very low- birth- weight infants on the incidence of allergic and infectious diseases during the first year of life. Design: Follow- up study. Setting: Tertiary care hospital. Participants: All surviving infants who

  1. A set of vectors for introduction of antibiotic resistance genes by in vitro Cre-mediated recombination

    Directory of Open Access Journals (Sweden)

    Vassetzky Yegor S

    2008-12-01

    Full Text Available Abstract Background Introduction of new antibiotic resistance genes in the plasmids of interest is a frequent task in molecular cloning practice. Classical approaches involving digestion with restriction endonucleases and ligation are time-consuming. Findings We have created a set of insertion vectors (pINS carrying genes that provide resistance to various antibiotics (puromycin, blasticidin and G418 and containing a loxP site. Each vector (pINS-Puro, pINS-Blast or pINS-Neo contains either a chloramphenicol or a kanamycin resistance gene and is unable to replicate in most E. coli strains as it contains a conditional R6Kγ replication origin. Introduction of the antibiotic resistance genes into the vector of interest is achieved by Cre-mediated recombination between the replication-incompetent pINS and a replication-competent target vector. The recombination mix is then transformed into E. coli and selected by the resistance marker (kanamycin or chloramphenicol present in pINS, which allows to recover the recombinant plasmids with 100% efficiency. Conclusion Here we propose a simple strategy that allows to introduce various antibiotic-resistance genes into any plasmid containing a replication origin, an ampicillin resistance gene and a loxP site.

  2. Blueprint for domestic uranium enrichment

    International Nuclear Information System (INIS)

    1981-01-01

    The AEC advisory committee on domestic production of uranium enrichment has studied for more than a year how to achieve the domestic enrichment of uranium by the construction and operation of a commercial enriching plant using centrifugal separation method, and the report was submitted to the Atomic Energy Commission on August 18, 1980. Japan has depended wholly on overseas services for her uranium enrichment needs, but the development of domestic enrichment has been carried on in parallel. The AEC decided to construct a uranium enrichment pilot plant using centrifuges, and it has been forwarded as a national project. The plant is operated by the Power Reactor and Nuclear Fuel Development Corp. since 1979. The capacity of the plant will be raised to approximately 75 ton SWU a year. The centrifuges already operated have provided the first delivery of fuel of about 1 ton for the ATR ''Fugen''. The demand-supply balance of uranium enrichment service, the significance of the domestic enrichment of uranium, the evaluation of uranium enrichment technology, the target for domestic enrichment plan, the measures to promote domestic uranium enrichment, and the promotion of the construction of a demonstration plant are reported. (Kako, I.)

  3. Inflammatory and mitochondrial gene expression data in GPER-deficient cardiomyocytes from male and female mice

    Directory of Open Access Journals (Sweden)

    Hao Wang

    2017-02-01

    Full Text Available We previously showed that cardiomyocyte-specific G protein-coupled estrogen receptor (GPER gene deletion leads to sex-specific adverse effects on cardiac structure and function; alterations which may be due to distinct differences in mitochondrial and inflammatory processes between sexes. Here, we provide the results of Gene Set Enrichment Analysis (GSEA based on the DNA microarray data from GPER-knockout versus GPER-intact (intact cardiomyocytes. This article contains complete data on the mitochondrial and inflammatory response-related gene expression changes that were significant in GPER knockout versus intact cardiomyocytes from adult male and female mice. The data are supplemental to our original research article “Cardiomyocyte-specific deletion of the G protein-coupled estrogen receptor (GPER leads to left ventricular dysfunction and adverse remodeling: a sex-specific gene profiling” (Wang et al., 2016 [1]. Data have been deposited to the Gene Expression Omnibus (GEO database repository with the dataset identifier GSE86843.

  4. Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

    Science.gov (United States)

    Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

    2018-01-01

    Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia and the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1

  5. The Use of Gene Ontology Term and KEGG Pathway Enrichment for Analysis of Drug Half-Life.

    Directory of Open Access Journals (Sweden)

    Yu-Hang Zhang

    Full Text Available A drug's biological half-life is defined as the time required for the human body to metabolize or eliminate 50% of the initial drug dosage. Correctly measuring the half-life of a given drug is helpful for the safe and accurate usage of the drug. In this study, we investigated which gene ontology (GO terms and biological pathways were highly related to the determination of drug half-life. The investigated drugs, with known half-lives, were analyzed based on their enrichment scores for associated GO terms and KEGG pathways. These scores indicate which GO terms or KEGG pathways the drug targets. The feature selection method, minimum redundancy maximum relevance, was used to analyze these GO terms and KEGG pathways and to identify important GO terms and pathways, such as sodium-independent organic anion transmembrane transporter activity (GO:0015347, monoamine transmembrane transporter activity (GO:0008504, negative regulation of synaptic transmission (GO:0050805, neuroactive ligand-receptor interaction (hsa04080, serotonergic synapse (hsa04726, and linoleic acid metabolism (hsa00591, among others. This analysis confirmed our results and may show evidence for a new method in studying drug half-lives and building effective computational methods for the prediction of drug half-lives.

  6. A systematic study on drug-response associated genes using baseline gene expressions of the Cancer Cell Line Encyclopedia

    Science.gov (United States)

    Liu, Xiaoming; Yang, Jiasheng; Zhang, Yi; Fang, Yun; Wang, Fayou; Wang, Jun; Zheng, Xiaoqi; Yang, Jialiang

    2016-03-01

    We have studied drug-response associated (DRA) gene expressions by applying a systems biology framework to the Cancer Cell Line Encyclopedia data. More than 4,000 genes are inferred to be DRA for at least one drug, while the number of DRA genes for each drug varies dramatically from almost 0 to 1,226. Functional enrichment analysis shows that the DRA genes are significantly enriched in genes associated with cell cycle and plasma membrane. Moreover, there might be two patterns of DRA genes between genders. There are significantly shared DRA genes between male and female for most drugs, while very little DRA genes tend to be shared between the two genders for a few drugs targeting sex-specific cancers (e.g., PD-0332991 for breast cancer and ovarian cancer). Our analyses also show substantial difference for DRA genes between young and old samples, suggesting the necessity of considering the age effects for personalized medicine in cancers. Lastly, differential module and key driver analyses confirm cell cycle related modules as top differential ones for drug sensitivity. The analyses also reveal the role of TSPO, TP53, and many other immune or cell cycle related genes as important key drivers for DRA network modules. These key drivers provide new drug targets to improve the sensitivity of cancer therapy.

  7. Evolution of closely linked gene pairs in vertebrate genomes

    NARCIS (Netherlands)

    Franck, E.; Hulsen, T.; Huynen, M.A.; Jong, de W.W.; Lunsen, N.H.; Madsen, O.

    2008-01-01

    The orientation of closely linked genes in mammalian genomes is not random: there are more head-to-head (h2h) gene pairs than expected. To understand the origin of this enrichment in h2h gene pairs, we have analyzed the phylogenetic distribution of gene pairs separated by less than 600 bp of

  8. Uranium enrichment. Enrichment processes

    International Nuclear Information System (INIS)

    Alexandre, M.; Quaegebeur, J.P.

    2009-01-01

    Despite the remarkable progresses made in the diversity and the efficiency of the different uranium enrichment processes, only two industrial processes remain today which satisfy all of enriched uranium needs: the gaseous diffusion and the centrifugation. This article describes both processes and some others still at the demonstration or at the laboratory stage of development: 1 - general considerations; 2 - gaseous diffusion: physical principles, implementation, utilisation in the world; 3 - centrifugation: principles, elementary separation factor, flows inside a centrifuge, modeling of separation efficiencies, mechanical design, types of industrial centrifuges, realisation of cascades, main characteristics of the centrifugation process; 4 - aerodynamic processes: vortex process, nozzle process; 5 - chemical exchange separation processes: Japanese ASAHI process, French CHEMEX process; 6 - laser-based processes: SILVA process, SILMO process; 7 - electromagnetic and ionic processes: mass spectrometer and calutron, ion cyclotron resonance, rotating plasmas; 8 - thermal diffusion; 9 - conclusion. (J.S.)

  9. Genome-wide strategies identify downstream target genes of chick connective tissue-associated transcription factors.

    Science.gov (United States)

    Orgeur, Mickael; Martens, Marvin; Leonte, Georgeta; Nassari, Sonya; Bonnin, Marie-Ange; Börno, Stefan T; Timmermann, Bernd; Hecht, Jochen; Duprez, Delphine; Stricker, Sigmar

    2018-03-29

    Connective tissues support organs and play crucial roles in development, homeostasis and fibrosis, yet our understanding of their formation is still limited. To gain insight into the molecular mechanisms of connective tissue specification, we selected five zinc-finger transcription factors - OSR1, OSR2, EGR1, KLF2 and KLF4 - based on their expression patterns and/or known involvement in connective tissue subtype differentiation. RNA-seq and ChIP-seq profiling of chick limb micromass cultures revealed a set of common genes regulated by all five transcription factors, which we describe as a connective tissue core expression set. This common core was enriched with genes associated with axon guidance and myofibroblast signature, including fibrosis-related genes. In addition, each transcription factor regulated a specific set of signalling molecules and extracellular matrix components. This suggests a concept whereby local molecular niches can be created by the expression of specific transcription factors impinging on the specification of local microenvironments. The regulatory network established here identifies common and distinct molecular signatures of limb connective tissue subtypes, provides novel insight into the signalling pathways governing connective tissue specification, and serves as a resource for connective tissue development. © 2018. Published by The Company of Biologists Ltd.

  10. Effects of normal saline and selenium-enriched hot spring water on experimentally induced rhinosinusitis in rats.

    Science.gov (United States)

    Kim, Dong-Hyun; Yeo, Sang Won

    2013-01-01

    This prospective, randomized, and controlled study examined the effects of normal saline and selenium-enriched hot spring water on experimentally induced rhinosinusitis in rats. The study comprised two control groups (untreated and saline-treated) and three experimental groups of Sprague Dawley rats. The experimental groups received an instillation of lipopolysaccharide (LPS) only, LPS+normal saline (LPS/saline), or LPS+selenium-enriched hot spring water (LPS/selenium). Histopathological changes were identified using hematoxylin-eosin staining. Leakage of exudate was identified using fluorescence microscopy. Microvascular permeability was measured using the Evans blue dye technique. Expression of the Muc5ac gene was measured using reverse transcription-polymerase chain reaction. Mucosal edema and expression of the Muc5ac gene were significantly lower in the LPS/saline group than in the LPS group. Microvascular permeability, mucosal edema, and expression of the Muc5ac gene were significantly lower in the LPS/selenium group than in the LPS group. Mucosal edema was similar in the LPS/selenium group and LPS/saline group, but capillary permeability and Muc5ac expression were lower in the LPS/selenium group. This study shows that normal saline and selenium-enriched hot spring water reduce inflammatory activity and mucus hypersecretion in LPS-induced rhinosinusitis in rats. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  11. Promotion of uranium enrichment business

    International Nuclear Information System (INIS)

    Kurushima, Morihiro

    1981-01-01

    The Committee on Nuclear Power has studied on the basic nuclear power policy, establishing its five subcommittees, entrusted by the Ministry of Nternational Trade and Industry. The results of examination by the subcommittee on uranium enrichment business are given along with a report in this connection by the Committee. In order to establish the nuclear fuel cycle, the aspect of uranium enrichment is essential. The uranium enrichment by centrifugal process has proceeded steadily in Power Reactor and Nuclear Fuel Development Corporation. The following matters are described: the need for domestic uranium enrichment, the outlook for overseas enrichment services and the schedule for establishing domestic enrichment business, the current state of technology development, the position of the prototype enrichment plant, the course to be taken to establish enrichment business the main organization operating the prototype and commercial plants, the system of supplying centrifuges, the domestic conversion of natural uranium the subsidies for uranium enrichment business. (J.P.N.)

  12. Prediction of epigenetically regulated genes in breast cancer cell lines

    Energy Technology Data Exchange (ETDEWEB)

    Loss, Leandro A; Sadanandam, Anguraj; Durinck, Steffen; Nautiyal, Shivani; Flaucher, Diane; Carlton, Victoria EH; Moorhead, Martin; Lu, Yontao; Gray, Joe W; Faham, Malek; Spellman, Paul; Parvin, Bahram

    2010-05-04

    Methylation of CpG islands within the DNA promoter regions is one mechanism that leads to aberrant gene expression in cancer. In particular, the abnormal methylation of CpG islands may silence associated genes. Therefore, using high-throughput microarrays to measure CpG island methylation will lead to better understanding of tumor pathobiology and progression, while revealing potentially new biomarkers. We have examined a recently developed high-throughput technology for measuring genome-wide methylation patterns called mTACL. Here, we propose a computational pipeline for integrating gene expression and CpG island methylation profles to identify epigenetically regulated genes for a panel of 45 breast cancer cell lines, which is widely used in the Integrative Cancer Biology Program (ICBP). The pipeline (i) reduces the dimensionality of the methylation data, (ii) associates the reduced methylation data with gene expression data, and (iii) ranks methylation-expression associations according to their epigenetic regulation. Dimensionality reduction is performed in two steps: (i) methylation sites are grouped across the genome to identify regions of interest, and (ii) methylation profles are clustered within each region. Associations between the clustered methylation and the gene expression data sets generate candidate matches within a fxed neighborhood around each gene. Finally, the methylation-expression associations are ranked through a logistic regression, and their significance is quantified through permutation analysis. Our two-step dimensionality reduction compressed 90% of the original data, reducing 137,688 methylation sites to 14,505 clusters. Methylation-expression associations produced 18,312 correspondences, which were used to further analyze epigenetic regulation. Logistic regression was used to identify 58 genes from these correspondences that showed a statistically signifcant negative correlation between methylation profles and gene expression in the

  13. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  14. Pectinmethylesterases (PME and pectinmethylesterase inhibitors (PMEI enriched during phloem fiber development in flax (Linum usitatissimum.

    Directory of Open Access Journals (Sweden)

    David Pinzon-Latorre

    Full Text Available Flax phloem fibers achieve their length by intrusive-diffusive growth, which requires them to penetrate the extracellular matrix of adjacent cells. Fiber elongation therefore involves extensive remodelling of cell walls and middle lamellae, including modifying the degree and pattern of methylesterification of galacturonic acid (GalA residues of pectin. Pectin methylesterases (PME are important enzymes for fiber elongation as they mediate the demethylesterification of GalA in muro, in either a block-wise fashion or in a random fashion. Our objective was to identify PMEs and PMEIs that mediate phloem fiber elongation in flax. For this purpose, we measured transcript abundance of candidate genes at nine different stages of stem and fiber development and found sets of genes enriched during fiber elongation and maturation as well as during xylem development. We expressed one of the flax PMEIs in E. coli and demonstrated that it was able to inhibit most of the native PME activity in the upper portion of the flax stem. These results identify key genetic components of the intrusive growth process and define targets for fiber engineering and crop improvement.

  15. Isolation of cowpea genes conferring drought tolerance ...

    African Journals Online (AJOL)

    The main objective of this study was to identify and isolate the genes conferring drought tolerance in cowpea. A cDNA library enriched for cowpea genes expressed specifically during responses to drought was constructed. A procedure called suppression subtractive hybridisation (SSH) was successfully employed to obtain ...

  16. Screening of potential biomarkers in uterine leiomyomas disease via gene expression profiling analysis.

    Science.gov (United States)

    Liu, Xuhui; Liu, Yanfei; Zhao, Jingrong; Liu, Yan

    2018-05-01

    The present study aimed to screen potential biomarkers for uterine leiomyomas disease, particularly target genes associated with the mediator of RNA polymerase II transcription subunit 12 (MED12) mutation. The microarray data of GSE30673, including 10 MED12 wild-type myometrium, 8 MED12 mutation leiomyoma and 2 MED12 wild-type leiomyoma samples, were downloaded from the Gene Expression Omnibus database. Compared with myometrium samples, differently-expressed genes (DEGs) in the MED12 mutation and wild-type leiomyoma samples were identified using the Limma package. The two sets of DEGs obtained were intersected to screen common DEGs. The DEGs in the MED12 mutation and wild-type leiomyoma samples, and common DEGs were defined as group A, B and C. Gene Ontology (GO) and pathway enrichment analyses were performed using the Database for Annotation, Visualization and Integrated Discovery online tool. Based on the Kyoto Encyclopedia of Genes and Genomes database, pathway relation networks were constructed. DEGs in GO terms and pathways were intersected to screen important DEGs. Subsequently, a gene co‑expression network was constructed and visualized using Cytoscape software. Reverse transcription‑quantitative polymerase chain reaction was used to detect the expression levels of important DEGs. A total of 1,258 DEGs in group A were screened, and enriched for extracellular matrix (ECM) organization and ECM‑receptor interaction. In addition, a total of 1,571 DEGs in group B were enriched for cell adhesion. Furthermore, 391 DEGs were involved in extracellular matrix organization. Pathway relation networks of group A, B and C were constructed with nodes of 48, 39, and 28, respectively. Finally, 135 important DEGs were obtained, including Acyl‑CoA synthetase medium‑chain family member 3, protein S (α) (PROS1) and F11 receptor. A gene co‑expression network with 68 nodes was constructed. The expression of caspase 1 (CASP1) and aldehyde dehydrogenase 1 family member

  17. PLANT HOMOLOGOUS TO PARAFIBROMIN is a component of the PAF1 complex and assists in regulating expression of genes within H3K27ME3-enriched chromatin.

    Science.gov (United States)

    Park, Sunchung; Oh, Sookyung; Ek-Ramos, Julissa; van Nocker, Steven

    2010-06-01

    The human Paf1 complex (Paf1C) subunit Parafibromin assists in mediating output from the Wingless/Int signaling pathway, and dysfunction of the encoding gene HRPT2 conditions specific cancer-related disease phenotypes. Here, we characterize the organismal and molecular roles of PLANT HOMOLOGOUS TO PARAFIBROMIN (PHP), the Arabidopsis (Arabidopsis thaliana) homolog of Parafibromin. PHP resides in an approximately 670-kD protein complex in nuclear extracts, and physically interacts with other known Paf1C-related proteins in vivo. In striking contrast to the developmental pleiotropy conferred by mutation in other plant Paf1C component genes in Arabidopsis, loss of PHP specifically conditioned accelerated phase transition from vegetative growth to flowering and resulted in misregulation of a very limited subset of genes that included the flowering repressor FLOWERING LOCUS C. Those genes targeted by PHP were distinguished from the bulk of Arabidopsis genes and other plant Paf1C targets by strong enrichment for trimethylation of lysine-27 on histone H3 (H3K27me3) within chromatin. These findings suggest that PHP is a component of a plant Paf1C protein in Arabidopsis, but has a more specialized role in modulating expression of a subset of Paf1C targets.

  18. From high enriched to low enriched uranium fuel in research reactors

    Energy Technology Data Exchange (ETDEWEB)

    Van Den Berghe, S.; Leenaers, A.; Koonen, E.; Moons, F.; Sannen, L. [Nuclear Materials Science Institute, SCK.CEN, Boeretang 200, B-2400 Mol (Belgium)

    2010-07-01

    Since the 1970's, global efforts have been going on to replace the high-enriched (>90% {sup 235}U), low-density UAlx research reactor fuel with high-density, low enriched (<20% {sup 235}U) replacements. This search is driven by the attempt to reduce the civil use of high-enriched material because of proliferation risks and terrorist threats. American initiatives, such as the Global Threat Reduction Initiative (GTRI) and the Reduced Enrichment for Research and Test Reactors (RERTR) program have triggered the development of reliable low-enriched fuel types for these reactors, which can replace the high enriched ones without loss of performance. Most success has presently been obtained with U{sub 3}Si{sub 2} dispersion fuel, which is currently used in many research reactors in the world. However, efforts to search for a replacement with even higher density, which will also allow the conversion of some high flux research reactors that currently cannot change to U{sub 3}Si{sub 2} (eg. BR2 in Belgium), have continued and are for the moment mainly directed towards the U(Mo) alloy fuel (7-10 w% Mo). This paper provides an overview of the past efforts and presents the current status of the U(Mo) development. (authors)

  19. From high enriched to low enriched uranium fuel in research reactors

    International Nuclear Information System (INIS)

    Van Den Berghe, S.; Leenaers, A.; Koonen, E.; Moons, F.; Sannen, L.

    2010-01-01

    Since the 1970's, global efforts have been going on to replace the high-enriched (>90% 235 U), low-density UAlx research reactor fuel with high-density, low enriched ( 235 U) replacements. This search is driven by the attempt to reduce the civil use of high-enriched material because of proliferation risks and terrorist threats. American initiatives, such as the Global Threat Reduction Initiative (GTRI) and the Reduced Enrichment for Research and Test Reactors (RERTR) program have triggered the development of reliable low-enriched fuel types for these reactors, which can replace the high enriched ones without loss of performance. Most success has presently been obtained with U 3 Si 2 dispersion fuel, which is currently used in many research reactors in the world. However, efforts to search for a replacement with even higher density, which will also allow the conversion of some high flux research reactors that currently cannot change to U 3 Si 2 (eg. BR2 in Belgium), have continued and are for the moment mainly directed towards the U(Mo) alloy fuel (7-10 w% Mo). This paper provides an overview of the past efforts and presents the current status of the U(Mo) development. (authors)

  20. Enrichment of provitamin A content in wheat (Triticum aestivum L.) by introduction of the bacterial carotenoid biosynthetic genes CrtB and CrtI.

    Science.gov (United States)

    Wang, Cheng; Zeng, Jian; Li, Yin; Hu, Wei; Chen, Ling; Miao, Yingjie; Deng, Pengyi; Yuan, Cuihong; Ma, Cheng; Chen, Xi; Zang, Mingli; Wang, Qiong; Li, Kexiu; Chang, Junli; Wang, Yuesheng; Yang, Guangxiao; He, Guangyuan

    2014-06-01

    Carotenoid content is a primary determinant of wheat nutritional value and affects its end-use quality. Wheat grains contain very low carotenoid levels and trace amounts of provitamin A content. In order to enrich the carotenoid content in wheat grains, the bacterial phytoene synthase gene (CrtB) and carotene desaturase gene (CrtI) were transformed into the common wheat cultivar Bobwhite. Expression of CrtB or CrtI alone slightly increased the carotenoid content in the grains of transgenic wheat, while co-expression of both genes resulted in a darker red/yellow grain phenotype, accompanied by a total carotenoid content increase of approximately 8-fold achieving 4.76 μg g(-1) of seed dry weight, a β-carotene increase of 65-fold to 3.21 μg g(-1) of seed dry weight, and a provitamin A content (sum of α-carotene, β-carotene, and β-cryptoxanthin) increase of 76-fold to 3.82 μg g(-1) of seed dry weight. The high provitamin A content in the transgenic wheat was stably inherited over four generations. Quantitative PCR analysis revealed that enhancement of provitamin A content in transgenic wheat was also a result of the highly coordinated regulation of endogenous carotenoid biosynthetic genes, suggesting a metabolic feedback regulation in the wheat carotenoid biosynthetic pathway. These transgenic wheat lines are not only valuable for breeding wheat varieties with nutritional benefits for human health but also for understanding the mechanism regulating carotenoid biosynthesis in wheat endosperm. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  1. Uranium enrichment by gas centrifuge

    International Nuclear Information System (INIS)

    Heriot, I.D.

    1988-01-01

    After recalling the physical principles and the techniques of centrifuge enrichment the report describes the centrifuge enrichment programmes of the various countries concerned and compares this technology with other enrichment technologies like gaseous diffusion, laser, aerodynamic devices and chemical processes. The centrifuge enrichment process is said to be able to replace with advantage the existing enrichment facilities in the short and medium term. Future prospects of the process are also described, like recycled uranium enrichment and economic improvements; research and development needs to achieve the economic prospects are also indicated. Finally the report takes note of the positive aspect of centrifuge enrichment as far as safeguards and nuclear safety are concerned. 27 figs, 113 refs

  2. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.

    Science.gov (United States)

    Wagner, Florian

    2015-01-01

    Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.

  3. Derived enriched uranium market

    International Nuclear Information System (INIS)

    Rutkowski, E.

    1996-01-01

    The potential impact on the uranium market of highly enriched uranium from nuclear weapons dismantling in the Russian Federation and the USA is analyzed. Uranium supply, conversion, and enrichment factors are outlined for each country; inventories are also listed. The enrichment component and conversion components are expected to cause little disruption to uranium markets. The uranium component of Russian derived enriched uranium hexafluoride is unresolved; US legislation places constraints on its introduction into the US market

  4. Identification of genes expressed in the hermaphrodite germ line of C. elegans using SAGE

    Science.gov (United States)

    Wang, Xin; Zhao, Yongjun; Wong, Kim; Ehlers, Peter; Kohara, Yuji; Jones, Steven J; Marra, Marco A; Holt, Robert A; Moerman, Donald G; Hansen, Dave

    2009-01-01

    Background Germ cells must progress through elaborate developmental stages from an undifferentiated germ cell to a fully differentiated gamete. Some of these stages include exiting mitosis and entering meiosis, progressing through the various stages of meiotic prophase, adopting either a male (sperm) or female (oocyte) fate, and completing meiosis. Additionally, many of the factors needed to drive embryogenesis are synthesized in the germ line. To increase our understanding of the genes that might be necessary for the formation and function of the germ line, we have constructed a SAGE library from hand dissected C. elegans hermaphrodite gonads. Results We found that 4699 genes, roughly 21% of all known C. elegans genes, are expressed in the adult hermaphrodite germ line. Ribosomal genes are highly expressed in the germ line; roughly four fold above their expression levels in the soma. We further found that 1063 of the germline-expressed genes have enriched expression in the germ line as compared to the soma. A comparison of these 1063 germline-enriched genes with a similar list of genes prepared using microarrays revealed an overlap of 460 genes, mutually reinforcing the two lists. Additionally, we identified 603 germline-enriched genes, supported by in situ expression data, which were not previously identified. We also found >4 fold enrichment for RNA binding proteins in the germ line as compared to the soma. Conclusion Using multiple technological platforms provides a more complete picture of global gene expression patterns. Genes involved in RNA metabolism are expressed at a significantly higher level in the germ line than the soma, suggesting a stronger reliance on RNA metabolism for control of the expression of genes in the germ line. Additionally, the number and expression level of germ line expressed genes on the X chromosome is lower than expected based on a random distribution. PMID:19426519

  5. Identification of genes expressed in the hermaphrodite germ line of C. elegans using SAGE

    Directory of Open Access Journals (Sweden)

    Holt Robert A

    2009-05-01

    Full Text Available Abstract Background Germ cells must progress through elaborate developmental stages from an undifferentiated germ cell to a fully differentiated gamete. Some of these stages include exiting mitosis and entering meiosis, progressing through the various stages of meiotic prophase, adopting either a male (sperm or female (oocyte fate, and completing meiosis. Additionally, many of the factors needed to drive embryogenesis are synthesized in the germ line. To increase our understanding of the genes that might be necessary for the formation and function of the germ line, we have constructed a SAGE library from hand dissected C. elegans hermaphrodite gonads. Results We found that 4699 genes, roughly 21% of all known C. elegans genes, are expressed in the adult hermaphrodite germ line. Ribosomal genes are highly expressed in the germ line; roughly four fold above their expression levels in the soma. We further found that 1063 of the germline-expressed genes have enriched expression in the germ line as compared to the soma. A comparison of these 1063 germline-enriched genes with a similar list of genes prepared using microarrays revealed an overlap of 460 genes, mutually reinforcing the two lists. Additionally, we identified 603 germline-enriched genes, supported by in situ expression data, which were not previously identified. We also found >4 fold enrichment for RNA binding proteins in the germ line as compared to the soma. Conclusion Using multiple technological platforms provides a more complete picture of global gene expression patterns. Genes involved in RNA metabolism are expressed at a significantly higher level in the germ line than the soma, suggesting a stronger reliance on RNA metabolism for control of the expression of genes in the germ line. Additionally, the number and expression level of germ line expressed genes on the X chromosome is lower than expected based on a random distribution.

  6. Identification of the Core Set of Carbon-Associated Genes in a Bioenergy Grassland Soil.

    Directory of Open Access Journals (Sweden)

    Adina Howe

    Full Text Available Despite the central role of soil microbial communities in global carbon (C cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the "core" set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP. Of 226,887 sequences associated with known enzymes involved in the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it only comprised 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to be comprised of functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. In soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.

  7. Transcriptional control in the segmentation gene network of Drosophila.

    Directory of Open Access Journals (Sweden)

    Mark D Schroeder

    2004-09-01

    Full Text Available The segmentation gene network of Drosophila consists of maternal and zygotic factors that generate, by transcriptional (cross- regulation, expression patterns of increasing complexity along the anterior-posterior axis of the embryo. Using known binding site information for maternal and zygotic gap transcription factors, the computer algorithm Ahab recovers known segmentation control elements (modules with excellent success and predicts many novel modules within the network and genome-wide. We show that novel module predictions are highly enriched in the network and typically clustered proximal to the promoter, not only upstream, but also in intronic space and downstream. When placed upstream of a reporter gene, they consistently drive patterned blastoderm expression, in most cases faithfully producing one or more pattern elements of the endogenous gene. Moreover, we demonstrate for the entire set of known and newly validated modules that Ahab's prediction of binding sites correlates well with the expression patterns produced by the modules, revealing basic rules governing their composition. Specifically, we show that maternal factors consistently act as activators and that gap factors act as repressors, except for the bimodal factor Hunchback. Our data suggest a simple context-dependent rule for its switch from repressive to activating function. Overall, the composition of modules appears well fitted to the spatiotemporal distribution of their positive and negative input factors. Finally, by comparing Ahab predictions with different categories of transcription factor input, we confirm the global regulatory structure of the segmentation gene network, but find odd skipped behaving like a primary pair-rule gene. The study expands our knowledge of the segmentation gene network by increasing the number of experimentally tested modules by 50%. For the first time, the entire set of validated modules is analyzed for binding site composition under a

  8. Alteration of gene expression by zinc oxide nanoparticles or zinc sulfate in vivo and comparison with in vitro data: A harmonious case.

    Science.gov (United States)

    Zhang, Wei-Dong; Zhao, Yong; Zhang, Hong-Fu; Wang, Shu-Kun; Hao, Zhi-Hui; Liu, Jing; Yuan, Yu-Qing; Zhang, Peng-Fei; Yang, Hong-Di; Shen, Wei; Li, Lan

    2016-08-01

    Granulosa cells (GCs) are those somatic cells closest to the female germ cell. GCs play a vital role in oocyte growth and development, and the oocyte is necessary for multiplication of a species. Zinc oxide (ZnO) nanoparticles (NPs) readily cross biologic barriers to be absorbed into biologic systems that make them promising candidates as food additives. The objective of the present investigation was to explore the impact of intact NPs on gene expression and the functional classification of altered genes in hen GCs in vivo, to compare the data from in vivo and in vitro studies, and finally to point out the adverse effects of ZnO NPs on the reproductive system. After a 24-week treatment, hen GCs were isolated and gene expression was quantified. Intact NPs were found in the ovary and other organs. Zn levels were similar in ZnO-NP-100 mg/kg- and ZnSO4-100 mg/kg-treated hen ovaries. ZnO-NP-100 mg/kg and ZnSO4-100 mg/kg regulated the expression of the same sets of genes, and they also altered the expression of different sets of genes individually. The number of genes altered by the ZnO-NP-100 mg/kg and ZnSO4-100 mg/kg treatments was different. Gene Ontology (GO) functional analysis reported that different results for the two treatments and, in Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, 12 pathways (out of the top 20 pathways) in each treatment were different. These results suggested that intact NPs and Zn(2+) had different effects on gene expression in GCs in vivo. In our recent publication, we noted that intact NPs and Zn(2+) differentially altered gene expression in GCs in vitro. However, GO functional classification and KEGG pathway enrichment analyses revealed close similarities for the changed genes in vivo and in vitro after ZnO NP treatment. Furthermore, close similarities were observed for the changed genes after ZnSO4 treatments in vivo and in vitro by GO functional classification and KEGG pathway enrichment analyses. Therefore

  9. Radiometric enrichment of nonradioactive ores

    International Nuclear Information System (INIS)

    Mokrousov, V.A.; Lileev, V.A.

    1979-01-01

    Considered are the methods of mineral enrichment based on the use of the radioation of various types. The physical essence of enrichment processes is presented, their classification is given. Described are the ore properties influencing the efficiency of radiometric enrichment, methods of the properties study and estimation of ore enrichment. New possibilities opened by radiometric enrichment in the technology of primary processing of mineral raw materials are elucidated. A considerable attention is paid to the main and auxiliary equipment for radiometric enrichment. The foundations of the safety engineering are presented in a brief form. Presented are also results of investigations and practical works in the field of enrichment of ores of non-ferrous, ferrous and non-metallic minerals with the help of radiometric methods

  10. PWR fuel of high enrichment with erbia and enriched gadolinia

    International Nuclear Information System (INIS)

    Bejmer, Klaes-Håkan; Malm, Christian

    2011-01-01

    Today standard PWR fuel is licensed for operation up to 65-70 MWd/kgU, which in most cases corresponds to an enrichment of more than 5 w/o "2"3"5U. Due to criticality safety reason of storage and transportation, only fuel up to 5 w/o "2"3"5U enrichment is so far used. New fuel storage installations and transportation casks are necessary investments before the reactivity level of the fresh fuel can be significantly increased. These investments and corresponding licensing work takes time, and in the meantime a solution that requires burnable poisons in all pellets of the fresh high-enriched fuel might be used. By using very small amounts of burnable absorber in every pellet the initial reactivity can be reduced to today's levels. This study presents core calculations with fuel assemblies enriched to almost 6 w/o "2"3"5U mixed with a small amount of erbia. Some of the assemblies also contain gadolinia. The results are compared to a reference case containing assemblies with 4.95 w/o "2"3"5U without erbia, utilizing only gadolinia as burnable poison. The comparison shows that the number of fresh fuel assemblies can be reduced by 21% (which increases the batch burnup by 24%) by utilizing the erbia fuel concept. However, increased cost of uranium due to higher enrichment is not fully compensated for by the cost gain due to the reduction of the number assemblies. Hence, the fuel cycle cost becomes slightly higher for the high enrichment erbia case than for the reference case. (author)

  11. The gene expression profile of resistant and susceptible Bombyx mori strains reveals cypovirus-associated variations in host gene transcript levels.

    Science.gov (United States)

    Guo, Rui; Wang, Simei; Xue, Renyu; Cao, Guangli; Hu, Xiaolong; Huang, Moli; Zhang, Yangqi; Lu, Yahong; Zhu, Liyuan; Chen, Fei; Liang, Zi; Kuang, Sulan; Gong, Chengliang

    2015-06-01

    High-throughput paired-end RNA sequencing (RNA-Seq) was performed to investigate the gene expression profile of a susceptible Bombyx mori strain, Lan5, and a resistant B. mori strain, Ou17, which were both orally infected with B. mori cypovirus (BmCPV) in the midgut. There were 330 and 218 up-regulated genes, while there were 147 and 260 down-regulated genes in the Lan5 and Ou17 strains, respectively. Gene ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment for differentially expressed genes (DEGs) were carried out. Moreover, gene interaction network (STRING) analyses were performed to analyze the relationships among the shared DEGs. Some of these genes were related and formed a large network, in which the genes for B. mori cuticular protein RR-2 motif 123 (BmCPR123) and the gene for B. mori DNA replication licensing factor Mcm2-like (BmMCM2) were key genes among the common up-regulated DEGs, whereas the gene for B. mori heat shock protein 20.1 (Bmhsp20.1) was the central gene among the shared down-regulated DEGs between Lan5 vs Lan5-CPV and Ou17 vs Ou17-CPV. These findings established a comprehensive database of genes that are differentially expressed in response to BmCPV infection between silkworm strains that differed in resistance to BmCPV and implied that these DEGs might be involved in B. mori immune responses against BmCPV infection.

  12. Integrated design of SIGMA uranium enrichment plants

    International Nuclear Information System (INIS)

    Rivarola, Martin E.; Brasnarof, Daniel O.

    1999-01-01

    In the present work, we describe a preliminary analysis of the design feedbacks in a Uranium Enrichment Plant, using the SIGMA concept. Starting from the result of this analysis, a computer code has been generated, which allows finding the optimal configurations of plants, for a fixed production rate. The computer code developed includes the model of the Thermohydraulic loop of a SIGMA module. The model contains numerical calculations of the main components of the circuit. During the calculations, the main components are dimensioned, for a posterior cost compute. The program also makes an estimation of the enrichment gain of the porous membrane, for each separation stage. Once the dimensions of the main components are known, using the enrichment cascade calculation, the capital and operation costs of the plant could be determined. At this point it is simple to calculate a leveled cost of the Separative Work Unit (SWU). A numerical optimizer is also included in the program. This optimizer finds the optimal cascade configuration, for a given set of design parameters. The whole-integrated program permits to investigate in detail the feedback in the component design. Therefore, the sensibility of the more relevant parameters can be computed, with respect of the economical variables of the plant. (author)

  13. Partial Least Squares Based Gene Expression Analysis in EBV- Positive and EBV-Negative Posttransplant Lymphoproliferative Disorders.

    Science.gov (United States)

    Wu, Sa; Zhang, Xin; Li, Zhi-Ming; Shi, Yan-Xia; Huang, Jia-Jia; Xia, Yi; Yang, Hang; Jiang, Wen-Qi

    2013-01-01

    Post-transplant lymphoproliferative disorder (PTLD) is a common complication of therapeutic immunosuppression after organ transplantation. Gene expression profile facilitates the identification of biological difference between Epstein-Barr virus (EBV) positive and negative PTLDs. Previous studies mainly implemented variance/regression analysis without considering unaccounted array specific factors. The aim of this study is to investigate the gene expression difference between EBV positive and negative PTLDs through partial least squares (PLS) based analysis. With a microarray data set from the Gene Expression Omnibus database, we performed PLS based analysis. We acquired 1188 differentially expressed genes. Pathway and Gene Ontology enrichment analysis identified significantly over-representation of dysregulated genes in immune response and cancer related biological processes. Network analysis identified three hub genes with degrees higher than 15, including CREBBP, ATXN1, and PML. Proteins encoded by CREBBP and PML have been reported to be interact with EBV before. Our findings shed light on expression distinction of EBV positive and negative PTLDs with the hope to offer theoretical support for future therapeutic study.

  14. Economical Feedback of Increasing Fuel Enrichment on Electricity Cost for VVER-1000

    Directory of Open Access Journals (Sweden)

    Mohammed Saad Dwiddar

    2015-08-01

    Full Text Available A methodology of evaluating the economics of the front-end nuclear fuel cycle with a price change sensitivity analysis for a VVER-1000 reactor core as a case study is presented. The effect of increasing the fuel enrichment and its corresponding reactor cycle length on the energy cost is investigated. The enrichment component was found to represent the highly expenses dynamic component affecting the economics of the front-end fuel cycle. Nevertheless, the increase of the fuel enrichment will increase the reactor cycle length, which will have a positive feedback on the electricity generation cost (cent/KWh. A long reactor operation time with a cheaper energy cost set the nuclear energy as a competitive alternative when compared with other energy sources.

  15. 77 FR 14838 - General Electric-Hitachi Global Laser Enrichment LLC, Commercial Laser-Based Uranium Enrichment...

    Science.gov (United States)

    2012-03-13

    ... Laser Enrichment LLC, Commercial Laser-Based Uranium Enrichment Facility, Wilmington, North Carolina... a license to General Electric-Hitachi Global Laser Enrichment LLC (GLE or the applicant) to authorize construction of a laser-based uranium enrichment facility and possession and use of byproduct...

  16. Comparative Transcriptomics Reveals Differential Gene Expression Related to Colletotrichum gloeosporioides Resistance in the Octoploid Strawberry

    Directory of Open Access Journals (Sweden)

    Feng Wang

    2017-05-01

    Full Text Available The strawberry is an important fruit worldwide; however, the development of the strawberry industry is limited by fungal disease. Anthracnose is caused by the pathogen Colletotrichum gloeosporioides and leads to large-scale losses in strawberry quality and production. However, the transcriptional response of strawberry to infection with C. gloeosporioides is poorly understood. In the present study, the strawberry leaf transcriptome of the ‘Yanli’ and ‘Benihoppe’ cultivars were deep sequenced via an RNA-seq analysis to study C. gloeosporioides resistance in strawberry. Among the sequences, differentially expressed genes were annotated with Gene Ontology terms and subjected to pathway enrichment analysis. Significant categories included defense, plant–pathogen interactions and flavonoid biosynthesis were identified. The comprehensive transcriptome data set provides molecular insight into C. gloeosporioides resistance genes in resistant and susceptible strawberry cultivars. Our findings can enhance breeding efforts in strawberry.

  17. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    Science.gov (United States)

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Expression profile analysis of long noncoding RNA in HER-2-enriched subtype breast cancer by next-generation sequencing and bioinformatics

    Directory of Open Access Journals (Sweden)

    Yang F

    2016-02-01

    Full Text Available Fan Yang, Shixu Lyu, Siyang Dong, Yehuan Liu, Xiaohua Zhang, Ouchen Wang Department of Surgical Oncology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, People’s Republic of China Background: Human epidermal growth factor receptor 2 (HER-2-enriched subtype breast cancer is associated with a more aggressive phenotype and shorter survival time. Long noncoding RNAs (LncRNAs have essential roles in tumorigenesis and occupy a central place in cancer progression. Notably, few studies have focused on the dysregulation of LncRNAs in the HER-2-enriched subtype breast cancer. In this study, we analyzed the expression profile of LncRNAs and mRNAs in this particular subtype of breast cancer. Methods: Seven pairs of HER-2-enriched subtype breast cancer and normal tissue were sequenced. We screened out differently expressed genes and measured the correlation of the expression levels of dysregulated LncRNAs and HER-2 by Pearson’s correlation coefficient analysis. Gene ontology analysis and pathway analysis were used to understand the biological roles of these differently expressed genes. Pathway act network and coexpression network were constructed. Results: More than 1,300 LncRNAs and 2,800 mRNAs, which were significantly differently expressed, were identified. Among these LncRNAs, AFAP1-AS1 was the most dysregulated LncRNA, while ORM2 was the most dysregulated mRNA. LOC100288637 had the highest positive correlation coefficient of 0.93 with HER-2, while RPL13P5 had the highest negative correlation coefficient of -0.87. The pathway act network showed that MAPK signaling pathway, PI3K-Akt signaling pathway, metabolic pathways, cell cycle, and regulation of actin cytoskeleton were highly related with HER-2-enriched subtype breast cancer. Coexpression network recognized LINC00636, LINC01405, ADARB2-AS1, ST8SIA6-AS1, LINC00511, and DPP10-AS1 as core genes. Conclusion: These results analyze the functions of LncRNAs and provide

  19. Simultaneous inference of phenotype-associated genes and relevant tissues from GWAS data via Bayesian integration of multiple tissue-specific gene networks.

    Science.gov (United States)

    Wu, Mengmeng; Lin, Zhixiang; Ma, Shining; Chen, Ting; Jiang, Rui; Wong, Wing Hung

    2017-12-01

    Although genome-wide association studies (GWAS) have successfully identified thousands of genomic loci associated with hundreds of complex traits in the past decade, the debate about such problems as missing heritability and weak interpretability has been appealing for effective computational methods to facilitate the advanced analysis of the vast volume of existing and anticipated genetic data. Towards this goal, gene-level integrative GWAS analysis with the assumption that genes associated with a phenotype tend to be enriched in biological gene sets or gene networks has recently attracted much attention, due to such advantages as straightforward interpretation, less multiple testing burdens, and robustness across studies. However, existing methods in this category usually exploit non-tissue-specific gene networks and thus lack the ability to utilize informative tissue-specific characteristics. To overcome this limitation, we proposed a Bayesian approach called SIGNET (Simultaneously Inference of GeNEs and Tissues) to integrate GWAS data and multiple tissue-specific gene networks for the simultaneous inference of phenotype-associated genes and relevant tissues. Through extensive simulation studies, we showed the effectiveness of our method in finding both associated genes and relevant tissues for a phenotype. In applications to real GWAS data of 14 complex phenotypes, we demonstrated the power of our method in both deciphering genetic basis and discovering biological insights of a phenotype. With this understanding, we expect to see SIGNET as a valuable tool for integrative GWAS analysis, thereby boosting the prevention, diagnosis, and treatment of human inherited diseases and eventually facilitating precision medicine.

  20. Rice Transcriptome Analysis to Identify Possible Herbicide Quinclorac Detoxification Genes

    Directory of Open Access Journals (Sweden)

    Wenying eXu

    2015-09-01

    Full Text Available Quinclorac is a highly selective auxin-type herbicide, and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world’s rice yield. The herbicide mode of action of quinclorac has been proposed and hormone interactions affect quinclorac signaling. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and environmental health problems.In this study, we used 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that genes of detoxification-related family genes were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate P450 families such as CYP81, CYP709C and CYP72A genes were universally induced by different herbicides. Some Arabidopsis genes for the same P450 family were up-regulated under quinclorac treatment.We conduct rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution.