WorldWideScience

Sample records for gene set enrichment

  1. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness of traditional methods is that they neglect the heterogeneity of gene expression across samples, which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severity of disease and to provide expression profiles of gene sets for individuals. We developed an application called IGSA that combines gene set enrichment with sample clustering. IGSA calculates gene set expression scores for each sample and uses an accumulating clustering strategy that lets samples gather into sets according to the progression of disease from mild to severe. We evaluated the performance of IGSA on gastric, pancreatic and ovarian cancer data sets. We also compared the results of IGSA in KEGG pathway enrichment with DAVID, GSEA, SPIA and ssGSEA, and analyzed the results of IGSA clustering with different similarity measurement methods. Notably, IGSA proved to be more sensitive and specific in finding significant pathways, and can indicate changes in pathways related to the severity of disease. In addition, IGSA provides a profile of significant gene sets for each sample.

  2. Ranking metrics in gene set enrichment analysis: do they matter?

    Science.gov (United States)

    Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

    2017-05-12

    There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner

  3. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  4. Constellation Map: Downstream visualization and interpretation of gene set enrichment results [version 1; referees: 2 approved]

    Directory of Open Access Journals (Sweden)

    Yan Tan

    2015-06-01

    Summary: Gene set enrichment analysis (GSEA) approaches are widely used to identify coordinately regulated genes associated with phenotypes of interest. Here, we present Constellation Map, a tool to visualize and interpret the results when enrichment analyses yield a long list of significantly enriched gene sets. Constellation Map identifies commonalities that explain the enrichment of multiple top-scoring gene sets and maps the relationships between them. Constellation Map can help investigators take full advantage of GSEA and facilitates the biological interpretation of enrichment results. Availability: Constellation Map is freely available as a GenePattern module at http://www.genepattern.org.

  5. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. 
Both

  6. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. 
    Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  7. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  8. Gene set of nuclear-encoded mitochondrial regulators is enriched for common inherited variation in obesity.

    Directory of Open Access Journals (Sweden)

    Nadja Knoll

    There are hints of an altered mitochondrial function in obesity. Nuclear-encoded genes are relevant for mitochondrial function (3 gene sets of known relevant pathways: (1) 16 nuclear regulators of mitochondrial genes, (2) 91 genes for oxidative phosphorylation and (3) 966 nuclear-encoded mitochondrial genes). Gene set enrichment analysis (GSEA) showed no association with type 2 diabetes mellitus in these gene sets. Here we performed a GSEA for the same gene sets for obesity. Genome-wide association study (GWAS) data from a case-control approach on 453 extremely obese children and adolescents and 435 lean adult controls were used for GSEA. For independent confirmation, we analyzed 705 obesity GWAS trios (extremely obese child and both biological parents) and a population-based GWAS sample (KORA F4, n = 1,743). A meta-analysis was performed on all three samples. In each sample, the distribution of significance levels between the respective gene set and those of all genes was compared using the leading-edge-fraction-comparison test (cut-offs between the 50th and 95th percentile of the set of all gene-wise corrected p-values) as implemented in the MAGENTA software. In the case-control sample, significant enrichment of associations with obesity was observed above the 50th percentile for the set of the 16 nuclear regulators of mitochondrial genes (p_GSEA,50 = 0.0103). This finding was not confirmed in the trios (p_GSEA,50 = 0.5991), but was confirmed in KORA (p_GSEA,50 = 0.0398). The meta-analysis again indicated a trend for enrichment (p_MAGENTA,50 = 0.1052, p_MAGENTA,75 = 0.0251). The GSEA revealed that weak association signals for obesity might be enriched in the gene set of 16 nuclear regulators of mitochondrial genes.
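The percentile cut-off comparison described in this record can be illustrated with a small permutation sketch. This is a simplified illustration under stated assumptions, not the MAGENTA implementation: the function name, its parameters, and the choice of uniformly random gene sets as the null model are all hypothetical, and real tools also correct gene scores for confounders such as gene size.

```python
import random


def leading_edge_fraction_test(gene_pvalues, gene_set, percentile=95,
                               n_perm=10000, seed=0):
    """Permutation p-value for an excess of highly significant genes in a set.

    gene_pvalues: dict mapping gene name -> gene-wise p-value (assumed input).
    gene_set:     iterable of gene names to test.
    percentile:   cut-off on the ranking of all genes by significance.
    """
    rng = random.Random(seed)
    genes = sorted(gene_pvalues)  # fixed order keeps sampling reproducible
    # Order p-values from least to most significant; the cut-off is the
    # p-value found at the requested percentile of that ordering.
    ordered = sorted(gene_pvalues.values(), reverse=True)
    idx = min(len(ordered) - 1, int(len(ordered) * percentile / 100))
    cutoff = ordered[idx]

    members = [g for g in gene_set if g in gene_pvalues]

    def count_hits(candidates):
        # Number of genes at least as significant as the cut-off.
        return sum(gene_pvalues[g] <= cutoff for g in candidates)

    observed = count_hits(members)
    # Null distribution: random gene sets of the same size (an assumption).
    exceed = sum(
        count_hits(rng.sample(genes, len(members))) >= observed
        for _ in range(n_perm)
    )
    return observed, (exceed + 1) / (n_perm + 1)
```

Calling the function with `percentile=50` and `percentile=95` would mirror the two ends of the cut-off range mentioned in the abstract.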

  9. Tracking difference in gene expression in a time-course experiment using gene set enrichment analysis.

    Directory of Open Access Journals (Sweden)

    Pui Shan Wong

    Fistulifera sp. strain JPCC DA0580 is a newly sequenced pennate diatom that is capable of simultaneously growing and accumulating lipids. This is a unique trait, not found in other related microalgae so far. It is able to accumulate between 40 and 60% of its cell weight in lipids, making it a strong candidate for the production of biofuel. To investigate this characteristic, we used RNA-Seq data gathered at four different times while Fistulifera sp. strain JPCC DA0580 was grown in oil-accumulating and non-oil-accumulating conditions. We then adapted gene set enrichment analysis (GSEA) to investigate the relationship between the difference in gene expression of 7,822 genes and metabolic functions in our data. We utilized information in the KEGG pathway database to create the gene sets and changed GSEA to use re-sampling so that data from the different time points could be included in the analysis. Our GSEA method identified photosynthesis, lipid synthesis and amino acid synthesis related pathways as processes that play a significant role in oil production and growth in Fistulifera sp. strain JPCC DA0580. In addition to GSEA, we visualized the results by creating a network of compounds and reactions, and plotted the expression data on top of the network. This made existing graph algorithms available to us, which we then used to calculate a path that metabolizes glucose into triacylglycerol (TAG) in the smallest number of steps. By visualizing the data this way, we observed a separate up-regulation of genes at different times instead of a concerted response. We also identified two metabolic paths that used fewer reactions than the one shown in KEGG and showed that the reactions were up-regulated during the experiment. The combination of analysis and visualization methods successfully analyzed time-course data, identified important metabolic pathways and provided new hypotheses for further research.
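The "smallest number of steps" path mentioned in this record is the kind of result a breadth-first search returns on an unweighted network. The sketch below is only an illustration: the abstract does not say which graph algorithm the authors used, and the adjacency-dictionary representation and function name are assumptions.

```python
from collections import deque


def shortest_path(graph, source, target):
    """Return one path with the fewest edges from source to target, or None.

    graph: dict mapping a compound to the compounds reachable from it in
           one reaction step (hypothetical representation).
    """
    parents = {source: None}  # also serves as the visited set
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            # Walk back through the parents to rebuild the path.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbour in graph.get(node, ()):
            if neighbour not in parents:
                parents[neighbour] = node
                queue.append(neighbour)
    return None
```

With a toy network such as `{"glucose": ["m1", "m2"], "m1": ["m3"], "m2": ["TAG"], "m3": ["TAG"]}`, where `m1` to `m3` are placeholder labels rather than real metabolites, the search returns the two-step route through `m2`.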

  10. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Background: The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results: We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion: Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.
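Term over-representation of the kind mentioned in this record is commonly assessed with a one-sided hypergeometric test. The sketch below shows that standard test; whether the authors used exactly this test is an assumption, and the function and parameter names are illustrative.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, selected, overlap):
    """Probability of seeing at least `overlap` annotated genes by chance.

    population: number of genes in the background
    annotated:  background genes carrying the term
    selected:   genes in the list under study
    overlap:    selected genes that carry the term
    """
    upper = min(annotated, selected)
    total = comb(population, selected)
    # Sum the upper tail of the hypergeometric distribution exactly,
    # using integer arithmetic until the final division.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, selected - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total
```

In practice one such p-value is computed per term, so the results need a multiple-testing correction before they are interpreted.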

  11. Gene-Based Analysis of Regionally Enriched Cortical Genes in GWAS Data Sets of Cognitive Traits and Psychiatric Disorders

    DEFF Research Database (Denmark)

    Ersland, Kari M; Christoforou, Andrea; Stansberg, Christine

    2012-01-01

    the regionally enriched cortical genes to mine a genome-wide association study (GWAS) of the Norwegian Cognitive NeuroGenetics (NCNG) sample of healthy adults for association to nine psychometric tests measures. In addition, we explored GWAS data sets for the serious psychiatric disorders schizophrenia (SCZ) (n...

  12. Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2015-12-01

    Background: The presence of diverse types of nanomaterials (NMs) in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2), carbon black (CB) or carbon nanotubes (CNTs) to determine the disease significance of these data-driven gene sets. Results: Biclusters representing inflammation (chemokine activity), DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS) and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032). The bicluster related to pulmonary fibrosis was enriched in studies investigating toxicity induced by CNTs and CB, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles. Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

  13. A cross-study gene set enrichment analysis identifies critical pathways in endometriosis

    Directory of Open Access Journals (Sweden)

    Bai Chunyan

    2009-09-01

    Background: Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods: We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results: We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion: The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.

  14. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    experimental gene sets, both for different global networks and for different types of interactions. Using examples of thyroid cancer and apoptosis networks, we have shown that the links over-represented in the analyzed network in comparison with the random ones make possible a biological interpretation of the original gene/protein sets. The FunGeneNet web tool for assessment of the functional enrichment of networks is available at http://www-bionet.sscc.ru/fungenenet/ .

  15. An Independent Filter for Gene Set Testing Based on Spectral Enrichment

    NARCIS (Netherlands)

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in

  16. NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

    Science.gov (United States)

    Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun

    2017-09-21

    High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of high-throughput experimental results, such as differentially expressed gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make it difficult for users to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen outperformed the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when checked against the curated literature. NetGen has been implemented in the R package CopTea, publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.

  17. Using OWL reasoning to support the generation of novel gene sets for enrichment analysis.

    Science.gov (United States)

    Osumi-Sutherland, David J; Ponta, Enrico; Courtot, Melanie; Parkinson, Helen; Badi, Laura

    2018-02-14

    The Gene Ontology (GO) consists of over 40,000 terms for biological processes, cell components and gene product activities linked into a graph structure by over 90,000 relationships. It has been used to annotate the functions and cellular locations of several million gene products. The graph structure is used by a variety of tools to group annotated genes into sets whose products share function or location. These gene sets are widely used to interpret the results of genomics experiments by assessing which sets are significantly over- or under-represented in results lists. F Hoffmann-La Roche Ltd. has developed a bespoke, manually maintained controlled vocabulary (RCV) for use in over-representation analysis. Many terms in this vocabulary group GO terms in novel ways that cannot easily be derived using the graph structure of the GO. For example, some RCV terms group GO terms by the cell, chemical or tissue type they refer to. Recent improvements in the content and formal structure of the GO make it possible to use logical queries in Web Ontology Language (OWL) to automatically map these cross-cutting classifications to sets of GO terms. We used this approach to automate mapping between RCV and GO, largely replacing the increasingly unsustainable manual mapping process. We then tested the utility of the resulting groupings for over-representation analysis. We successfully mapped 85% of RCV terms to logical OWL definitions and showed that these could be used to recapitulate and extend manual mappings between RCV terms and the sets of GO terms subsumed by them. We also show that gene sets derived from the resulting GO terms sets can be used to detect the signatures of cell and tissue types in whole genome expression data. The rich formal structure of the GO makes it possible to use reasoning to dynamically generate novel, biologically relevant groupings of GO terms. GO term groupings generated with this approach can be used in over-representation analysis to detect

  18. The Schizophrenia-Associated BRD1 Gene Regulates Behavior, Neurotransmission, and Expression of Schizophrenia Risk Enriched Gene Sets in Mice.

    Science.gov (United States)

    Qvist, Per; Christensen, Jane Hvarregaard; Vardya, Irina; Rajkumar, Anto Praveen; Mørk, Arne; Paternoster, Veerle; Füchtbauer, Ernst-Martin; Pallesen, Jonatan; Fryland, Tue; Dyrvig, Mads; Hauberg, Mads Engel; Lundsberg, Birgitte; Fejgin, Kim; Nyegaard, Mette; Jensen, Kimmo; Nyengaard, Jens Randel; Mors, Ole; Didriksen, Michael; Børglum, Anders Dupont

    2017-07-01

    The schizophrenia-associated BRD1 gene encodes a transcriptional regulator whose comprehensive chromatin interactome is enriched with schizophrenia risk genes. However, the biology underlying the disease association of BRD1 remains speculative. This study assessed the transcriptional drive of a schizophrenia-associated BRD1 risk variant in vitro. Accordingly, to examine the effects of reduced Brd1 expression, we generated a genetically modified Brd1 +/- mouse and subjected it to behavioral, electrophysiological, molecular, and integrative genomic analyses with focus on schizophrenia-relevant parameters. Brd1 +/- mice displayed cerebral histone H3K14 hypoacetylation and a broad range of behavioral changes with translational relevance to schizophrenia. These behaviors were accompanied by striatal dopamine/serotonin abnormalities and cortical excitation-inhibition imbalances involving loss of parvalbumin immunoreactive interneurons. RNA-sequencing analyses of cortical and striatal micropunches from Brd1 +/- and wild-type mice revealed differential expression of genes enriched for schizophrenia risk, including several schizophrenia genome-wide association study risk genes (e.g., calcium channel subunits [Cacna1c and Cacnb2], cholinergic muscarinic receptor 4 [Chrm4], dopamine receptor D 2 [Drd2], and transcription factor 4 [Tcf4]). Integrative analyses further found differentially expressed genes to cluster in functional networks and canonical pathways associated with mental illness and molecular signaling processes (e.g., glutamatergic, monoaminergic, calcium, cyclic adenosine monophosphate [cAMP], dopamine- and cAMP-regulated neuronal phosphoprotein 32 kDa [DARPP-32], and cAMP responsive element binding protein signaling [CREB]). Our study bridges the gap between genetic association and pathogenic effects and yields novel insights into the unfolding molecular changes in the brain of a new schizophrenia model that incorporates genetic risk at three levels: allelic

  19. Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods

    Science.gov (United States)

    Väremo, Leif; Nielsen, Jens; Nookaew, Intawat

    2013-01-01

    Gene set analysis (GSA) is used to elucidate genome-wide data, in particular transcriptome data. A multitude of methods have been proposed for this step of the analysis, and many of them have been compared and evaluated. Unfortunately, there is no consolidated opinion regarding which methods should be preferred, and the variety of available GSA software and implementations poses a difficulty for the end-user who wants to try out different methods. To address this, we have developed the R package Piano, which collects a range of GSA methods into the same system, for the benefit of the end-user. Furthermore, we refine the GSA workflow by using modifications of the gene-level statistics. This enables us to divide the resulting gene set P-values into three classes, describing different aspects of gene expression directionality at gene set level. We use our fully implemented workflow to investigate the impact of the individual components of GSA by using microarray and RNA-seq data. The results show that the evaluated methods are globally similar and the major separation correlates well with our defined directionality classes. As a consequence, we suggest using a consensus scoring approach based on multiple GSA runs. In combination with the directionality classes, this constitutes a more thorough basis for an enriched biological interpretation. PMID:23444143
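One simple way to picture a consensus score over multiple GSA runs is to rank the gene sets within each run and aggregate the ranks. The sketch below uses the median rank; this aggregation rule, the function name, and the input layout are assumptions made for illustration and do not reproduce the Piano package's own scoring.

```python
from statistics import median


def consensus_ranking(pvalues_by_method):
    """Aggregate gene set rankings from several GSA runs by median rank.

    pvalues_by_method: dict mapping method name -> {gene set: p-value}.
    Assumes every method reports the same gene sets.
    Returns (gene set, median rank) pairs, best consensus first.
    """
    ranks = {}
    for pvals in pvalues_by_method.values():
        # Rank 1 is the smallest p-value; ties are broken by name so the
        # result is deterministic.
        ordered = sorted(pvals, key=lambda gs: (pvals[gs], gs))
        for rank, gene_set in enumerate(ordered, start=1):
            ranks.setdefault(gene_set, []).append(rank)
    scores = {gs: median(r) for gs, r in ranks.items()}
    return sorted(scores.items(), key=lambda item: (item[1], item[0]))
```

The median is less sensitive than the mean to a single method that ranks a gene set very differently from the others, which is one reason to prefer it in a sketch like this.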

  20. Literature mining, gene-set enrichment and pathway analysis for target identification in Behçet's disease.

    Science.gov (United States)

    Wilson, Paul; Larminie, Christopher; Smith, Rona

    2016-01-01

    To use literature mining to catalogue Behçet's associated genes, and advanced computational methods to improve the understanding of the pathways and signalling mechanisms that lead to the typical clinical characteristics of Behçet's patients. To extend this technique to identify potential treatment targets for further experimental validation. Text mining methods combined with gene enrichment tools, pathway analysis and causal analysis algorithms. This approach identified 247 human genes associated with Behçet's disease and the resulting disease map, comprising 644 nodes and 19220 edges, captured important details of the relationships between these genes and their associated pathways, as described in diverse data repositories. Pathway analysis has identified how Behçet's associated genes are likely to participate in innate and adaptive immune responses. Causal analysis algorithms have identified a number of potential therapeutic strategies for further investigation. Computational methods have captured pertinent features of the prominent disease characteristics presented in Behçet's disease and have highlighted NOD2, ICOS and IL18 signalling as potential therapeutic strategies.

  1. Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome

    Directory of Open Access Journals (Sweden)

    Gaora Peadar Ó

    2010-10-01

    Background: Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Results: Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p ... Conclusion: Bi-directional gene set enrichment analysis more accurately reflects dynamic regulatory behaviour in biochemical pathways, and as such highlighted biologically relevant changes that were not detected using a traditional approach. In such cases where transcriptomic response to treatment is exceptionally large, canonical correlation analysis in conjunction with Fisher's exact test highlights the subset of pathways showing strongest correlation with the clinical markers of interest. In this case, we have identified selenoamino acid metabolism and steroid biosynthesis as key pathways mediating the observed relationship between metabolic health and high-CLA beef. These results indicate that this type of

  2. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  3. Redundancy control in pathway databases (ReCiPa): an application for improving gene-set enrichment analysis in Omics studies and "Big data" biology.

    Science.gov (United States)

    Vivar, Juan C; Pemu, Priscilla; McPherson, Ruth; Ghosh, Sujoy

    2013-08-01

    Unparalleled technological advances have fueled an explosive growth in the scope and scale of biological data and have propelled life sciences into the realm of "Big Data" that cannot be managed or analyzed by conventional approaches. Big Data in the life sciences are driven primarily via a diverse collection of 'omics'-based technologies, including genomics, proteomics, metabolomics, transcriptomics, metagenomics, and lipidomics. Gene-set enrichment analysis is a powerful approach for interrogating large 'omics' datasets, leading to the identification of biological mechanisms associated with observed outcomes. While several factors influence the results from such analysis, the impact from the contents of pathway databases is often under-appreciated. Pathway databases often contain variously named pathways that overlap with one another to varying degrees. Ignoring such redundancies during pathway analysis can lead to the designation of several pathways as being significant due to high content-similarity, rather than truly independent biological mechanisms. Statistically, such dependencies also result in correlated p values and overdispersion, leading to biased results. We investigated the level of redundancies in multiple pathway databases and observed large discrepancies in the nature and extent of pathway overlap. This prompted us to develop the application, ReCiPa (Redundancy Control in Pathway Databases), to control redundancies in pathway databases based on user-defined thresholds. Analysis of genomic and genetic datasets, using ReCiPa-generated overlap-controlled versions of KEGG and Reactome pathways, led to a reduction in redundancy among the top-scoring gene-sets and allowed for the inclusion of additional gene-sets representing possibly novel biological mechanisms. Using obesity as an example, bioinformatic analysis further demonstrated that gene-sets identified from overlap-controlled pathway databases show stronger evidence of prior association

  4. Pathway profiles based on gene-set enrichment analysis in the honey bee Apis mellifera under brood rearing-suppressed conditions.

    Science.gov (United States)

    Kim, Kyungmun; Kim, Ju Hyeon; Kim, Young Ho; Hong, Seong-Eui; Lee, Si Hyeock

    2018-01-01

    Perturbation of normal behaviors in honey bee colonies by any external factor can immediately reduce the colony's capacity for brood rearing, which can eventually lead to colony collapse. To investigate the effects of brood-rearing suppression on the biology of honey bee workers, gene-set enrichment analysis of the transcriptomes of worker bees with or without suppressed brood rearing was performed. When brood rearing was suppressed, pathways associated with both protein degradation and synthesis were simultaneously over-represented in both nurses and foragers, and their overall pathway representation profiles resembled those of normal foragers and nurses, respectively. Thus, obstruction of normal labor induced over-representation in pathways related with reshaping of worker bee physiology, suggesting that transition of labor is physiologically reversible. In addition, some genes associated with the regulation of neuronal excitability, cellular and nutritional stress and aggressiveness were over-expressed under brood rearing suppression perhaps to manage in-hive stress under unfavorable conditions. Copyright © 2017 Elsevier Inc. All rights reserved.

  5. Integrative set enrichment testing for multiple omics platforms

    Directory of Open Access Journals (Sweden)

    Poisson Laila M

    2011-11-01

    Background: Enrichment testing assesses the overall evidence of differential expression behavior of the elements within a defined set. When we have measured many molecular aspects, e.g. gene expression, metabolites, proteins, it is desirable to assess their differential tendencies jointly across platforms using an integrated set enrichment test. In this work we explore the properties of several methods for performing a combined enrichment test using gene expression and metabolomics as the motivating platforms. Results: Using two simulation models we explored the properties of several enrichment methods including two novel methods: the logistic regression 2-degree of freedom Wald test and the 2-dimensional permutation p-value for the sum-of-squared statistics test. In relation to their univariate counterparts we find that the joint tests can improve our ability to detect results that are marginal univariately. We also find that joint tests improve the ranking of associated pathways compared to their univariate counterparts. However, there is a risk of Type I error inflation with some methods and self-contained methods lose specificity when the sets are not representative of underlying association. Conclusions: In this work we show that consideration of data from multiple platforms, in conjunction with summarization via a priori pathway information, leads to increased power in detection of genomic associations with phenotypes.

  6. Witnessing stressful events induces glutamatergic synapse pathway alterations and gene set enrichment of positive EPSP regulation within the VTA of adult mice: An ontology based approach

    Science.gov (United States)

    Brewer, Jacob S.

    It is well known that exposure to severe stress increases the risk for developing mood disorders. Currently, the neurobiological and genetic mechanisms underlying the functional effects of psychological stress are poorly understood. A major obstacle to the study of psychological stress is the inability of current animal models of stress to distinguish between physical and psychological stressors. A novel paradigm recently developed by Warren et al. is able to tease apart the effects of physical and psychological stress in adult mice by allowing these mice to "witness" the social defeat of another mouse, thus removing confounding variables associated with physical stressors. Using this "witness" model of stress and RNA-Seq technology, the current study aims to study the genetic effects of psychological stress. After the mice witnessed the social defeat of another mouse, VTA tissue was extracted, sequenced, and analyzed for differential expression. Since genes often work together in complex networks, a pathway and gene ontology (GO) analysis was performed using data from the differential expression analysis. The pathway and GO analyses revealed a perturbation of the glutamatergic synapse pathway and an enrichment of positive excitatory post-synaptic potential regulation. This is consistent with the excitatory synapse theory of depression. Together these findings demonstrate a dysregulation of the mesolimbic reward pathway at the gene level as a result of psychological stress, potentially contributing to depressive-like behaviors.

  7. Gene set analysis using variance component tests.

    Science.gov (United States)

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases result jointly from alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network, and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analysis method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and the global test, in both simulations and a diabetes microarray dataset.

  8. Optimal set of selected uranium enrichments that minimizes blending consequences

    International Nuclear Information System (INIS)

    Nachlas, J.A.; Kurstedt, H.A. Jr.; Lobber, J.S. Jr.

    1977-01-01

    Identities, quantities, and costs associated with producing a set of selected enrichments and blending them to provide fuel for existing reactors are investigated using an optimization model constructed with appropriate constraints. Selected enrichments are required for either nuclear reactor fuel standardization or potential uranium enrichment alternatives such as the gas centrifuge. Using a mixed-integer linear program, the model minimizes present worth costs for a 39-product-enrichment reference case. For four ingredients, the marginal blending cost is only 0.18% of the total direct production cost. Natural uranium is not an optimal blending ingredient. Optimal values reappear in most sets of ingredient enrichments

  9. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. In particular, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...
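
    As an illustration of what a modified KS statistic can look like, the following is a minimal Python sketch of the widely used GSEA-style weighted running sum, in which each gene-set member contributes in proportion to its gene-level score raised to an exponent. This is an assumption chosen for illustration; the record above is truncated, so the exact statistic studied by the authors may differ. The function name and its parameters are hypothetical.

```python
def enrichment_score(ranked_genes, scores, gene_set, weight=1.0):
    """GSEA-style weighted KS running-sum statistic (illustrative sketch).

    ranked_genes: gene identifiers ordered from most to least significant.
    scores:       gene-level statistics, in the same order as ranked_genes.
    gene_set:     set of gene identifiers to test.
    weight:       exponent on |score|; 0 gives the classic unweighted KS form.
    """
    in_set = [g in gene_set for g in ranked_genes]
    n_hits = sum(in_set)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    # Weight of each hit; misses carry no hit weight.
    hit_weights = [abs(s) ** weight if hit else 0.0
                   for s, hit in zip(scores, in_set)]
    total = sum(hit_weights)
    if total == 0:
        raise ValueError("all gene-set members have zero weight")

    running, best = 0.0, 0.0
    for w, hit in zip(hit_weights, in_set):
        # Step up at gene-set members, step down elsewhere.
        running += w / total if hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running  # keep the largest deviation from zero
    return best
```

    With weight=0 every member contributes equally, which recovers the unweighted KS running sum; a larger weight lets the top-scoring members dominate the statistic.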

  10. Knowledge Enrichment Analysis for Human Tissue- Specific Genes Uncover New Biological Insights

    Directory of Open Access Journals (Sweden)

    Gong Xiu-Jun

    2012-06-01

    The expression and regulation of genes in different tissues are fundamental questions to be answered in biology. Knowledge enrichment analysis for tissue specific (TS) and housekeeping (HK) genes may help identify their roles in biological processes or diseases and gain new biological insights. In this paper, we performed the knowledge enrichment analysis for 17,343 genes in 84 human tissues using Gene Set Enrichment Analysis (GSEA) and Hypergeometric Analysis (HA) against three biological ontologies: Gene Ontology (GO), KEGG pathways and Disease Ontology (DO), respectively. The analysis results demonstrated that the functions of most gene groups are consistent with their tissue origins. Meanwhile, three interesting new associations for HK genes and the skeletal muscle tissue genes were found. Firstly, Hypergeometric analysis against the KEGG database for HK genes disclosed that three disease terms (Parkinson’s disease, Huntington’s disease, Alzheimer’s disease) are intensively enriched. Secondly, Hypergeometric analysis against the KEGG database for Skeletal Muscle tissue genes shows that two cardiac diseases, “Hypertrophic cardiomyopathy (HCM)” and “Arrhythmogenic right ventricular cardiomyopathy (ARVC)”, are heavily enriched, although they are considered to have no relationship with skeletal functions. Thirdly, “Prostate cancer” is intensively enriched in Hypergeometric analysis against the disease ontology (DO) for the Skeletal Muscle tissue genes, which is a highly unexpected finding.
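
    The hypergeometric analysis mentioned above reduces to a one-sided tail probability. A minimal Python sketch follows, assuming the standard over-representation formulation (the abstract does not give the authors' exact implementation); the function name and the counts in the usage lines are hypothetical, apart from the 17,343-gene background taken from the abstract.

```python
from math import comb


def hypergeom_enrichment_p(n_background, n_annotated, n_list, n_overlap):
    """One-sided hypergeometric p-value, P(X >= n_overlap).

    n_background: genes in the background (universe).
    n_annotated:  background genes annotated to the term or pathway.
    n_list:       genes in the list under study (e.g. tissue-specific genes).
    n_overlap:    genes in the list that are annotated to the term.
    """
    upper = min(n_annotated, n_list)
    favourable = sum(
        comb(n_annotated, i) * comb(n_background - n_annotated, n_list - i)
        for i in range(n_overlap, upper + 1)
    )
    return favourable / comb(n_background, n_list)


# Hypothetical counts: a 150-gene term, a 400-gene tissue list, 12 shared genes.
p_value = hypergeom_enrichment_p(17343, 150, 400, 12)
print(f"enrichment p-value: {p_value:.3g}")
```

    When many terms are tested, as with GO, KEGG and DO here, the resulting p-values would normally be adjusted for multiple testing before interpretation.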

  11. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

    Directory of Open Access Journals (Sweden)

    Steinfeld Israel

    2009-02-01

    Background: Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results. Results: GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). GOrilla employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousands of genes and thousands of GO terms on the order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms. Conclusion: GOrilla is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold-free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at: http://cbl-gorilla.cs.technion.ac.il
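
    The flexible-threshold idea can be sketched in a few lines of Python. The block below computes only the minimum hypergeometric (mHG) score, that is, the smallest hypergeometric tail probability over all cutoffs of the ranked list. It does not reproduce GOrilla's exact p-value computation, and because the score is a minimum over many cutoffs it is not itself a p-value. The function names are hypothetical.

```python
from math import comb


def hg_tail(n_total, n_ones, n_top, n_ones_top):
    """P(X >= n_ones_top) for a hypergeometric draw of n_top from n_total."""
    upper = min(n_ones, n_top)
    favourable = sum(
        comb(n_ones, i) * comb(n_total - n_ones, n_top - i)
        for i in range(n_ones_top, upper + 1)
    )
    return favourable / comb(n_total, n_top)


def mhg_score(labels):
    """Minimum hypergeometric score of a ranked 0/1 label vector.

    labels[i] is 1 if the gene at rank i carries the GO term, else 0.
    Returns (score, cutoff), where cutoff is the list prefix length that
    attains the minimum (0 if no prefix improves on a score of 1.0).
    """
    n_total, n_ones = len(labels), sum(labels)
    best_score, best_cutoff, ones_so_far = 1.0, 0, 0
    for cutoff, label in enumerate(labels, start=1):
        ones_so_far += label
        score = hg_tail(n_total, n_ones, cutoff, ones_so_far)
        if score < best_score:
            best_score, best_cutoff = score, cutoff
    return best_score, best_cutoff
```

    The returned cutoff plays the role of the data-driven target set: no explicit target/background split has to be supplied by the user.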

  12. Gene Ontology and KEGG Enrichment Analyses of Genes Related to Age-Related Macular Degeneration

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-01-01

    Identifying disease genes is one of the most important topics in biomedicine and may facilitate studies on the mechanisms underlying disease. Age-related macular degeneration (AMD) is a serious eye disease; it typically affects older adults and results in a loss of vision due to retina damage. In this study, we attempt to develop an effective method for distinguishing AMD-related genes. Gene ontology and KEGG enrichment analyses of known AMD-related genes were performed, and a classification system was established. In detail, each gene was encoded into a vector by extracting enrichment scores of the gene set, including it and its direct neighbors in STRING, and gene ontology terms or KEGG pathways. Then certain feature-selection methods, including minimum redundancy maximum relevance and incremental feature selection, were adopted to extract key features for the classification system. As a result, 720 GO terms and 11 KEGG pathways were deemed the most important factors for predicting AMD-related genes.

  13. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

    Science.gov (United States)

    Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

    2013-04-15

    System-wide profiling of genes and proteins in mammalian cells produces lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set library databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of upregulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interleukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy-to-use, intuitive, web-based enrichment analysis tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

  14. Separate enrichment analysis of pathways for up- and downregulated genes.

    Science.gov (United States)

    Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

    2014-03-06

    Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.
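
    The power argument can be made concrete with a toy calculation. The Python sketch below assumes a one-sided hypergeometric over-representation test as the enrichment method, which the abstract does not specify, and uses entirely hypothetical counts for a pathway whose differentially expressed members are mostly upregulated.

```python
from math import comb


def hypergeom_tail(n_genes, pathway_size, n_selected, n_overlap):
    """P(X >= n_overlap) when n_selected genes are drawn from n_genes."""
    upper = min(pathway_size, n_selected)
    favourable = sum(
        comb(pathway_size, i) * comb(n_genes - pathway_size, n_selected - i)
        for i in range(n_overlap, upper + 1)
    )
    return favourable / comb(n_genes, n_selected)


# Hypothetical counts, not taken from the paper.
N_GENES = 1000                 # genes measured
PATHWAY_SIZE = 50              # genes annotated to the pathway
N_UP, N_DOWN = 100, 100        # up- and downregulated DE genes
UP_IN_PATHWAY, DOWN_IN_PATHWAY = 12, 2   # imbalanced pathway membership

# Strategy 1: all DE genes analysed together.
p_all = hypergeom_tail(N_GENES, PATHWAY_SIZE, N_UP + N_DOWN,
                       UP_IN_PATHWAY + DOWN_IN_PATHWAY)
# Strategy 2: up- and downregulated genes analysed separately.
p_up = hypergeom_tail(N_GENES, PATHWAY_SIZE, N_UP, UP_IN_PATHWAY)
p_down = hypergeom_tail(N_GENES, PATHWAY_SIZE, N_DOWN, DOWN_IN_PATHWAY)

print(f"all DE genes: {p_all:.3g}  up only: {p_up:.3g}  down only: {p_down:.3g}")
```

    In this constructed case the pooled list dilutes the excess of upregulated pathway members with the depleted downregulated ones, so the up-only test yields the smaller p-value.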

  15. Enrichment of putative PAX8 target genes at serous epithelial ovarian cancer susceptibility loci

    DEFF Research Database (Denmark)

    Kar, Siddhartha P; Adler, Emily; Tyrer, Jonathan

    2017-01-01

    BACKGROUND: Genome-wide association studies (GWAS) have identified 18 loci associated with serous ovarian cancer (SOC) susceptibility but the biological mechanisms driving these findings remain poorly characterised. Germline cancer risk loci may be enriched for target genes of transcription factors...... (TFs) critical to somatic tumorigenesis. METHODS: All 615 TF-target sets from the Molecular Signatures Database were evaluated using gene set enrichment analysis (GSEA) and three GWAS for SOC risk: discovery (2196 cases/4396 controls), replication (7035 cases/21 693 controls; independent from discovery...... to interact with PAX8 in the literature to the PAX8-target set and applying an alternative to GSEA, interval enrichment, further confirmed this association (P=0.006). Fifteen of the 157 genes from this expanded PAX8 pathway were near eight loci associated with SOC risk at P

  16. Nociceptor-Enriched Genes Required for Normal Thermal Nociception

    Directory of Open Access Journals (Sweden)

    Ken Honjo

    2016-07-01

    Here, we describe a targeted reverse genetic screen for thermal nociception genes in Drosophila larvae. Using laser capture microdissection and microarray analyses of nociceptive and non-nociceptive neurons, we identified 275 nociceptor-enriched genes. We then tested the function of the enriched genes with nociceptor-specific RNAi and thermal nociception assays. Tissue-specific RNAi targeted against 14 genes caused insensitive thermal nociception while targeting of 22 genes caused hypersensitive thermal nociception. Previously uncategorized genes were named for heat resistance (i.e., boilerman, fire dancer, oven mitt, trivet, thawb, and bunker gear) or heat sensitivity (firelighter, black match, eucalyptus, primacord, jet fuel, detonator, gasoline, smoke alarm, and jetboil). Insensitive nociception phenotypes were often associated with severely reduced branching of nociceptor neurites and hyperbranched dendrites were seen in two of the hypersensitive cases. Many genes that we identified are conserved in mammals.

  17. Genes misregulated in C. elegans deficient in Dicer, RDE-4, or RDE-1 are enriched for innate immunity genes.

    Science.gov (United States)

    Welker, Noah C; Habig, Jeffrey W; Bass, Brenda L

    2007-07-01

    We describe the first microarray analysis of a whole animal containing a mutation in the Dicer gene. We used adult Caenorhabditis elegans and, to distinguish among different roles of Dicer, we also performed microarray analyses of animals with mutations in rde-4 and rde-1, which are involved in silencing by siRNA, but not miRNA. Surprisingly, we find that the X chromosome is greatly enriched for genes regulated by Dicer. Comparison of all three microarray data sets indicates the majority of Dicer-regulated genes are not dependent on RDE-4 or RDE-1, including the X-linked genes. However, all three data sets are enriched in genes important for innate immunity and, specifically, show increased expression of innate immunity genes.

  18. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    Science.gov (United States)

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional

  19. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

    Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses is dependent on the statistical method used. Methods for predicting the possible...

  20. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of the absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating characteristic curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.
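
    The mechanics of a gene-sampling test with and without the absolute gene statistic can be sketched as follows in Python. This is a minimal illustration under stated assumptions: the set statistic is taken to be the mean of the gene-level statistics, the null distribution comes from random gene sets of the same size, and the test is upper-tailed. The function name, its parameters and the toy data are hypothetical. The sketch shows only how the absolute statistic enters the calculation; it does not reproduce the false-positive-rate results reported above.

```python
import random


def gene_sampling_p(gene_stats, set_indices, use_abs=False, n_perm=2000, seed=0):
    """Upper-tail gene-sampling p-value for the mean statistic of a gene set.

    gene_stats:  gene-level statistics (e.g. t-statistics) for all genes.
    set_indices: positions of the gene-set members within gene_stats.
    use_abs:     if True, replace every gene statistic by its absolute value.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = [abs(s) for s in gene_stats] if use_abs else list(gene_stats)
    size = len(set_indices)
    observed = sum(values[i] for i in set_indices) / size

    exceed = 0
    for _ in range(n_perm):
        # Null model: a random gene set of the same size.
        sample = rng.sample(values, size)
        if sum(sample) / size >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_perm + 1)


# Toy data: a set with two strongly up and two strongly down genes.
toy_stats = [3.0, -3.0, 3.0, -3.0] + [0.1 * (-1) ** i for i in range(96)]
toy_set = [0, 1, 2, 3]
p_signed = gene_sampling_p(toy_stats, toy_set, use_abs=False)
p_absolute = gene_sampling_p(toy_stats, toy_set, use_abs=True)
print(f"signed: {p_signed:.3g}  absolute: {p_absolute:.3g}")
```

    In the toy data the signed mean of the set is zero, so the opposing changes cancel, whereas the absolute statistic registers the set as strongly altered.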

  1. Transcriptional profiles of supragranular-enriched genes associate with corticocortical network architecture in the human brain.

    Science.gov (United States)

    Krienen, Fenna M; Yeo, B T Thomas; Ge, Tian; Buckner, Randy L; Sherwood, Chet C

    2016-01-26

    The human brain is patterned with disproportionately large, distributed cerebral networks that connect multiple association zones in the frontal, temporal, and parietal lobes. The expansion of the cortical surface, along with the emergence of long-range connectivity networks, may be reflected in changes to the underlying molecular architecture. Using the Allen Institute's human brain transcriptional atlas, we demonstrate that genes particularly enriched in supragranular layers of the human cerebral cortex relative to mouse distinguish major cortical classes. The topography of transcriptional expression reflects large-scale brain network organization consistent with estimates from functional connectivity MRI and anatomical tracing in nonhuman primates. Microarray expression data for genes preferentially expressed in human upper layers (II/III), but enriched only in lower layers (V/VI) of mouse, were cross-correlated to identify molecular profiles across the cerebral cortex of postmortem human brains (n = 6). Unimodal sensory and motor zones have similar molecular profiles, despite being distributed across the cortical mantle. Sensory/motor profiles were anticorrelated with paralimbic and certain distributed association network profiles. Tests of alternative gene sets did not consistently distinguish sensory and motor regions from paralimbic and association regions: (i) genes enriched in supragranular layers in both humans and mice, (ii) genes cortically enriched in humans relative to nonhuman primates, (iii) genes related to connectivity in rodents, (iv) genes associated with human and mouse connectivity, and (v) 1,454 gene sets curated from known gene ontologies. Molecular innovations of upper cortical layers may be an important component in the evolution of long-range corticocortical projections.

  2. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows the use of all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis-driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates whether the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real-life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial, TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too, although they were found to be associated with the pneumococcal vaccine only in previous analyses. In our simulation study, TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  3. Statistical assessment of crosstalk enrichment between gene groups in biological networks.

    Science.gov (United States)

    McCormack, Theodore; Frings, Oliver; Alexeyenko, Andrey; Sonnhammer, Erik L L

    2013-01-01

    Analyzing groups of functionally coupled genes or proteins in the context of global interaction networks has become an important aspect of bioinformatic investigations. Assessing the statistical significance of crosstalk enrichment between or within groups of genes can be a valuable tool for functional annotation of experimental gene sets. Here we present CrossTalkZ, a statistical method and software to assess the significance of crosstalk enrichment between pairs of gene or protein groups in large biological networks. We demonstrate that the standard z-score is generally an appropriate and unbiased statistic. We further evaluate the ability of four different methods to reliably recover crosstalk within known biological pathways. We conclude that the methods preserving the second-order topological network properties perform best. Finally, we show how CrossTalkZ can be used to annotate experimental gene sets using known pathway annotations and that its performance at this task is superior to gene enrichment analysis (GEA). CrossTalkZ (available at http://sonnhammer.sbc.su.se/download/software/CrossTalkZ/) is implemented in C++, easy to use, fast, accepts various input file formats, and produces a number of statistics. These include z-score, p-value, false discovery rate, and a test of normality for the null distributions.
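
    The z-score mentioned in this abstract can be sketched generically as the observed number of links between two gene groups, standardized against link counts from randomized networks. This is a minimal sketch under stated assumptions: the function name `crosstalk_z` and the example counts are hypothetical, the null counts are assumed to be supplied by some network randomization procedure, and the one-sided p-value assumes the null distribution is approximately normal. It does not reproduce how CrossTalkZ itself randomizes networks.

```python
from math import erfc, sqrt
from statistics import mean, stdev


def crosstalk_z(observed_links, null_links):
    """Standardize an observed link count against a null sample.

    observed_links : number of links between the two groups in the real network
    null_links     : link counts for the same groups in randomized networks
    Returns (z, one-sided upper-tail p-value under a normal approximation).
    """
    mu = mean(null_links)
    sigma = stdev(null_links)  # sample standard deviation of the null counts
    z = (observed_links - mu) / sigma
    p = 0.5 * erfc(z / sqrt(2))
    return z, p


# Hypothetical counts: 10 observed links versus three randomized networks.
z, p = crosstalk_z(10, [4, 6, 8])
```

    In practice many more randomized networks would be needed for a stable estimate of the null mean and standard deviation.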

  4. Gene-ontology enrichment analysis in two independent family-based samples highlights biologically plausible processes for autism spectrum disorders.

    LENUS (Irish Health Repository)

    Anney, Richard J L

    2012-02-01

    Recent genome-wide association studies (GWAS) have implicated a range of genes from discrete biological pathways in the aetiology of autism. However, despite the strong influence of genetic factors, association studies have yet to identify statistically robust, replicated major effect genes or SNPs. We apply the principle of the SNP ratio test methodology described by O'Dushlaine et al. to over 2100 families from the Autism Genome Project (AGP). Using a two-stage design we examine association enrichment in 5955 unique gene-ontology classifications across four groupings based on two phenotypic and two ancestral classifications. Based on estimates from simulation we identify excess of association enrichment across all analyses. We observe enrichment in association for sets of genes involved in diverse biological processes, including pyruvate metabolism, transcription factor activation, cell-signalling and cell-cycle regulation. Both genes and processes that show enrichment have previously been examined in autistic disorders and offer biological plausibility to these findings.

  5. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene Ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable the learning of more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  6. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  7. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently. Results: We have developed gene set matrix analysis (GSMA) as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets) against entire distributions of gene expression changes (datasets) for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously. Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  8. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.
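
    For context, the Fisher-style baseline that this abstract contrasts with LEGO can be sketched as a one-sided hypergeometric test on the overlap between a gene list and a gene set. This is a minimal sketch of that standard baseline only, not of LEGO's network-based weighting; the function name `ora_p_value` and the example numbers are assumptions made for illustration.

```python
from math import comb


def ora_p_value(n_universe, n_set, n_hits, n_overlap):
    """Upper-tail hypergeometric p-value P(X >= n_overlap).

    n_universe : number of genes in the background
    n_set      : number of background genes that belong to the gene set
    n_hits     : number of genes in the query list (e.g. differentially expressed)
    n_overlap  : number of query genes that belong to the gene set
    """
    max_overlap = min(n_set, n_hits)
    total = comb(n_universe, n_hits)
    tail = sum(
        comb(n_set, k) * comb(n_universe - n_set, n_hits - k)
        for k in range(n_overlap, max_overlap + 1)
    )
    return tail / total


# Hypothetical example: 5 of 10 background genes are in the set,
# and all 5 query genes fall inside it.
p_full_overlap = ora_p_value(10, 5, 5, 5)
p_no_constraint = ora_p_value(10, 5, 5, 0)
```

    Every gene counts equally in this test, which is the simplification the abstract describes as reducing a pathway to a list of genes.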

  9. Gene set analysis for interpreting genetic studies

    DEFF Research Database (Denmark)

    Pers, Tune H

    2016-01-01

    Interpretation of genome-wide association study (GWAS) results is lagging behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...

  10. Integrative analysis of survival-associated gene sets in breast cancer.

    Science.gov (United States)

    Varn, Frederick S; Ung, Matthew H; Lou, Shao Ke; Cheng, Chao

    2015-03-12

    Patient gene expression information has recently become a clinical feature used to evaluate breast cancer prognosis. The emergence of prognostic gene sets that take advantage of these data has led to a rich library of information that can be used to characterize the molecular nature of a patient's cancer. Identifying robust gene sets that are consistently predictive of a patient's clinical outcome has become one of the main challenges in the field. We inputted our previously established BASE algorithm with patient gene expression data and gene sets from MSigDB to develop the gene set activity score (GSAS), a metric that quantitatively assesses a gene set's activity level in a given patient. We utilized this metric, along with patient time-to-event data, to perform survival analyses to identify the gene sets that were significantly correlated with patient survival. We then performed cross-dataset analyses to identify robust prognostic gene sets and to classify patients by metastasis status. Additionally, we created a gene set network based on component gene overlap to explore the relationship between gene sets derived from MSigDB. We developed a novel gene set based on this network's topology and applied the GSAS metric to characterize its role in patient survival. Using the GSAS metric, we identified 120 gene sets that were significantly associated with patient survival in all datasets tested. The gene overlap network analysis yielded a novel gene set enriched in genes shared by the robustly predictive gene sets. This gene set was highly correlated to patient survival when used alone. Most interestingly, removal of the genes in this gene set from the gene pool on MSigDB resulted in a large reduction in the number of predictive gene sets, suggesting a prominent role for these genes in breast cancer progression. The GSAS metric provided a useful medium by which we systematically investigated how gene sets from MSigDB relate to breast cancer patient survival. 
We used

  11. Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2006-01-01

    Background: Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results: We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM) with those in the probeset-level model (PSLM). Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many

  12. The Molecular Signatures Database (MSigDB) hallmark gene set collection.

    Science.gov (United States)

    Liberzon, Arthur; Birger, Chet; Thorvaldsdóttir, Helga; Ghandi, Mahmoud; Mesirov, Jill P; Tamayo, Pablo

    2015-12-23

    The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.

  13. Uniform approximation is more appropriate for Wilcoxon Rank-Sum Test in gene set analysis.

    Directory of Open Access Journals (Sweden)

    Zhide Fang

    Gene set analysis is widely used to facilitate biological interpretations in the analyses of differential expression from high throughput profiling data. The Wilcoxon Rank-Sum (WRS) test is one of the commonly used methods in gene set enrichment analysis. It compares the ranks of genes in a gene set against those of genes outside the gene set. This method is easy to implement and it eliminates the dichotomization of genes into significant and non-significant in a competitive hypothesis testing. Due to the large number of genes being examined, it is impractical to calculate the exact null distribution for the WRS test. Therefore, the normal distribution is commonly used as an approximation. However, as we demonstrate in this paper, the normal approximation is problematic when a gene set with a relatively small number of genes is tested against the large number of genes in the complementary set. In this situation, a uniform approximation is substantially more powerful, more accurate, and less intensive in computation. We demonstrate the advantage of the uniform approximations in Gene Ontology (GO) term analysis using simulations and real data sets.
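
    The commonly used normal approximation that this abstract examines can be sketched as follows: rank all genes, sum the ranks of the genes in the set, and standardize that sum with the known null mean and variance. This is a minimal sketch assuming no tied scores; the function name `wrs_normal_approx` and the example data are hypothetical, and the uniform approximation proposed in the paper is not reproduced here.

```python
from math import erfc, sqrt


def wrs_normal_approx(scores, in_set):
    """Wilcoxon rank-sum test of a gene set using the normal approximation.

    scores : per-gene scores, assumed to contain no ties
    in_set : booleans, True where the gene belongs to the gene set
    Returns (rank sum W, z-score, two-sided p-value).
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    rank = {gene: r + 1 for r, gene in enumerate(order)}  # ranks start at 1
    n1 = sum(in_set)               # genes inside the set
    n2 = len(scores) - n1          # genes in the complementary set
    w = sum(rank[i] for i in range(len(scores)) if in_set[i])
    mean_w = n1 * (n1 + n2 + 1) / 2
    var_w = n1 * n2 * (n1 + n2 + 1) / 12
    z = (w - mean_w) / sqrt(var_w)
    p = erfc(abs(z) / sqrt(2))     # two-sided normal tail probability
    return w, z, p


# Hypothetical example: the three set genes hold the three highest scores.
w, z, p = wrs_normal_approx(
    [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    [False, False, False, True, True, True],
)
```

    With only a handful of genes in the set, as in this toy input, the rank sum takes few distinct values, which is the small-set setting where the abstract reports the normal approximation to be problematic.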

  14. Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.

    Science.gov (United States)

    Springer, Mark S; Gatesy, John

    2018-02-26

    Summary coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset, the 'recombination ratchet', is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d'etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. 
If coalescent methods are powerful

  15. Mutation intolerant genes and targets of FMRP are enriched for nonsynonymous alleles in schizophrenia.

    Science.gov (United States)

    Leonenko, Ganna; Richards, Alexander L; Walters, James T; Pocklington, Andrew; Chambert, Kimberly; Al Eissa, Mariam M; Sharp, Sally I; O'Brien, Niamh L; Curtis, David; Bass, Nicholas J; McQuillin, Andrew; Hultman, Christina; Moran, Jennifer L; McCarroll, Steven A; Sklar, Pamela; Neale, Benjamin M; Holmans, Peter A; Owen, Michael J; Sullivan, Patrick F; O'Donovan, Michael C

    2017-10-01

    Risk of schizophrenia is conferred by alleles occurring across the full spectrum of frequencies from common SNPs of weak effect through to ultra rare alleles, some of which may be moderately to highly penetrant. Previous studies have suggested that some of the risk of schizophrenia is attributable to uncommon alleles represented on Illumina exome arrays. Here, we present the largest study of exomic variation in schizophrenia to date, using samples from the United Kingdom and Sweden (10,011 schizophrenia cases and 13,791 controls). Single variants, genes, and gene sets were analyzed for association with schizophrenia. No single variant or gene reached genome-wide significance. Among candidate gene sets, we found significant enrichment for rare alleles (minor allele frequency [MAF] schizophrenia by excluding a role for uncommon exomic variants (0.001 ≤ MAF ≤ 0.01) that confer a relatively large effect (odds ratio [OR] > 4). We also show risk alleles within this frequency range exist, but confer smaller effects and should be identified by larger studies. © 2017 Wiley Periodicals, Inc.

  16. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  17. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. 
These results, taken together, represent the first

  18. Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

    Directory of Open Access Journals (Sweden)

    Jesús Lascorz

    2011-01-01

    Background: A large number of gene expression profiling (GEP) studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9%) had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO) categories or Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.

  19. Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

    Science.gov (United States)

    Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

    2011-11-01

    Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.

  20. Benchmarking methods and data sets for ligand enrichment assessment in virtual screening.

    Science.gov (United States)

    Xia, Jie; Tilahun, Ermias Lemma; Reid, Terry-Elinor; Zhang, Liangren; Wang, Xiang Simon

    2015-01-01

    Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate ligand enrichments of VS approaches in the prospective (i.e. real-world) efforts. However, the intrinsic differences of benchmarking sets to the real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods as well as data sets and highlight three main types of biases found in benchmarking sets, i.e. "analogue bias", "artificial enrichment" and "false negative". In addition, we introduce our recent algorithm to build maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its implementations to three important human histone deacetylases (HDACs) isoforms, i.e. HDAC1, HDAC6 and HDAC8. The leave-one-out cross-validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased as measured by property matching, ROC curves and AUCs. Copyright © 2014 Elsevier Inc. All rights reserved.
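
    The abstract above evaluates benchmarking sets by ligand enrichment, ROC curves and AUCs. As a rough illustration of those two metrics (not the authors' algorithm for building the sets), the following minimal Python sketch computes an enrichment factor and a ROC AUC from a ranked screening list. The function names, the `fraction` parameter and the toy scores are assumptions made for this example only.

```python
def enrichment_factor(scores, labels, fraction=0.01):
    """Hit rate among the top-ranked `fraction` divided by the overall hit rate.

    scores: higher means "more likely active"; labels: 1 = active, 0 = decoy.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    n_top = max(1, round(fraction * len(ranked)))
    hits_top = sum(label for _, label in ranked[:n_top])
    return (hits_top / n_top) / (sum(labels) / len(ranked))


def roc_auc(scores, labels):
    """AUC as the probability that a random active outscores a random decoy.

    Ties contribute 0.5. Quadratic in list size, which is fine for a sketch.
    """
    actives = [s for s, label in zip(scores, labels) if label == 1]
    decoys = [s for s, label in zip(scores, labels) if label == 0]
    wins = sum((a > d) + 0.5 * (a == d) for a in actives for d in decoys)
    return wins / (len(actives) * len(decoys))


# Hypothetical toy screen: 3 actives among 10 compounds.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
toy_labels = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
toy_ef = enrichment_factor(toy_scores, toy_labels, fraction=0.2)
toy_auc = roc_auc(toy_scores, toy_labels)
```

    A biased benchmarking set (for example one with "artificial enrichment") would tend to inflate both numbers, which is why the set construction matters more than the metric code itself.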

  1. MAGMA: generalized gene-set analysis of GWAS data.

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  2. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  3. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless, there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start by detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  4. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
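
    The regression structure of the gene-set layer can be pictured with a heavily simplified Python sketch: gene p-values are converted to Z-scores, and the Z-scores are regressed on a gene-set membership indicator. This sketch assumes independent genes and no covariates, so it omits the linkage-disequilibrium handling and gene-level adjustments that the abstract credits the actual tool with; all function names and the toy p-values are hypothetical.

```python
from statistics import NormalDist, mean


def gene_z_scores(p_values):
    """Map gene-level p-values to Z-scores (small p gives large positive Z)."""
    return {gene: NormalDist().inv_cdf(1.0 - p) for gene, p in p_values.items()}


def competitive_set_effect(z_scores, gene_set):
    """OLS of Z on a 0/1 membership indicator.

    With a single binary predictor the slope is the difference in group means,
    and the t statistic is the pooled two-sample t. Genes are treated as
    independent here, which is an assumption of this sketch only.
    """
    inside = [z for g, z in z_scores.items() if g in gene_set]
    outside = [z for g, z in z_scores.items() if g not in gene_set]
    m_in, m_out = mean(inside), mean(outside)
    beta = m_in - m_out
    rss = sum((z - m_in) ** 2 for z in inside) + sum((z - m_out) ** 2 for z in outside)
    sigma2 = rss / (len(inside) + len(outside) - 2)
    se = (sigma2 * (1.0 / len(inside) + 1.0 / len(outside))) ** 0.5
    return beta, beta / se


# Hypothetical gene-level p-values; genes A and B form the tested set.
toy_p = {"A": 0.001, "B": 0.01, "C": 0.5, "D": 0.6, "E": 0.4, "F": 0.7}
toy_z = gene_z_scores(toy_p)
toy_beta, toy_t = competitive_set_effect(toy_z, {"A", "B"})
```

    Because the set effect is just a regression coefficient, continuous gene properties or several gene sets could be added as further predictors, which is the kind of generalization the abstract describes.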

  5. Diversity of reductive dehalogenase genes from environmental samples and enrichment cultures identified with degenerate primer PCR screens.

    Directory of Open Access Journals (Sweden)

    Laura Audrey Hug

    2013-11-01

    Reductive dehalogenases are the critical enzymes for anaerobic organohalide respiration, a microbial metabolic process that has been harnessed for bioremediation efforts to resolve chlorinated solvent contamination in groundwater and is implicated in the global halogen cycle. Reductive dehalogenase sequence diversity is informative for the dechlorination potential of the site or enrichment culture. A suite of degenerate PCR primers targeting a comprehensive curated set of reductive dehalogenase genes was designed and applied to twelve DNA samples extracted from contaminated and pristine sites, as well as six enrichment cultures capable of reducing chlorinated compounds to non-toxic end-products. The amplified gene products from four environmental sites and two enrichment cultures were sequenced using Illumina HiSeq, and the reductive dehalogenase complement of each sample determined. The results indicate that the diversity of the reductive dehalogenase gene family is much deeper than is currently accounted for: one-third of the translated proteins have less than 70% pairwise amino acid identity to database sequences. Approximately 60% of the sequenced reductive dehalogenase genes were broadly distributed, being identified in four or more samples, and often in previously sequenced genomes as well. In contrast, 17% of the sequenced reductive dehalogenases were unique, present in only a single sample and bearing less than 90% pairwise amino acid identity to any previously identified proteins. Many of the broadly distributed reductive dehalogenases are uncharacterized in terms of their substrate specificity, making these intriguing targets for further biochemical experimentation. Finally, comparison of samples from a contaminated site and an enrichment culture derived from the same site eight years prior allowed examination of the effect of the enrichment process.

  6. Digital gene expression profiling of flax (Linum usitatissimum L.) stem peel identifies genes enriched in fiber-bearing phloem tissue.

    Science.gov (United States)

    Guo, Yuan; Qiu, Caisheng; Long, Songhua; Chen, Ping; Hao, Dongmei; Preisner, Marta; Wang, Hui; Wang, Yufu

    2017-08-30

    To better understand the molecular mechanisms and gene expression characteristics associated with the development of bast fiber cells within flax stem phloem, the gene expression profiles of flax stem peels and leaves were screened using Illumina's Digital Gene Expression (DGE) analysis. Four DGE libraries (2 for stem peel and 2 for leaf), ranging from 6.7 to 9.2 million clean reads, were obtained, which produced 7.0 million and 6.8 million mapped reads for flax stem peel and leaf, respectively. By differential gene expression analysis, a total of 975 genes, of which 708 (73%) have protein-coding annotation, were identified as phloem-enriched genes putatively involved in the processes of polysaccharide and cell wall metabolism. Differentially expressed genes (DEGs) were validated using quantitative RT-PCR; the expression patterns of all nine genes determined by qRT-PCR agreed well with those obtained by sequencing analysis. Cluster and Gene Ontology (GO) analysis revealed that a large number of genes related to the metabolic process, catalytic activity and binding categories were expressed predominantly in the stem peels. The Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of the phloem-enriched genes suggested approximately 111 biological pathways. The large number of genes and pathways produced from DGE sequencing will expand our understanding of the complex molecular and cellular events in flax bast fiber development and provide a foundation for future studies on fiber development in other bast fiber crops. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Groundwater fluoride enrichment in an active rift setting: Central Kenya Rift case study

    Energy Technology Data Exchange (ETDEWEB)

    Olaka, Lydia A., E-mail: lydiaolaka@gmail.com [Department of Geology, University of Nairobi, P.O Box 30197, Nairobi (Kenya); Wilke, Franziska D.H. [Geoforschungs Zentrum, Telegrafenberg, 14473 Potsdam (Germany); Olago, Daniel O.; Odada, Eric O. [Department of Geology, University of Nairobi, P.O Box 30197, Nairobi (Kenya); Mulch, Andreas [Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325 Frankfurt (Germany); Institut für Geowissenschaften, Goethe Universität Frankfurt, Altenhöferallee 1, 60438 Frankfurt (Germany); Musolff, Andreas [UFZ-Helmholtz-Centre for Environmental Research, Department of Hydrogeology, Permoserstr. 15, 04318 Leipzig (Germany)

    2016-03-01

    Groundwater is used extensively in the Central Kenya Rift for domestic and agricultural demands. In these active rift settings groundwater can exhibit high fluoride levels. In order to address water security and reduce human exposure to high fluoride in drinking water, knowledge of the source and geochemical processes of enrichment is required. A study was therefore carried out within the Naivasha catchment (Kenya) to understand the genesis, enrichment and seasonal variations of fluoride in the groundwater. Rocks, rain, surface and groundwater sources were sampled for hydrogeochemical and isotopic investigations, and the data were statistically and geospatially analyzed. Water sources have variable fluoride concentrations between 0.02 and 75 mg/L; 73% exceed the health limit (1.5 mg/L) in both dry and wet seasons. F⁻ concentrations in rivers are lower (0.2–9.2 mg/L) than in groundwater (0.09–43.6 mg/L), while saline lake waters have the highest concentrations (0.27–75 mg/L). The higher values are confined to elevations below 2000 masl. Oxygen (δ¹⁸O) and hydrogen (δD) isotopic values range from −6.2 to +5.8‰ and −31.3 to +33.3‰, respectively; they are also highly variable in the rift floor, where they attain maximum values. Fluoride base levels in the precursor vitreous volcanic rocks are higher (between 3750 and 6000 ppm) in minerals such as cordierite and muscovite, while secondary minerals like illite and kaolinite have lower remnant fluoride (< 1000 ppm). Thus, geochemical F⁻ enrichment in regional groundwater is mainly due to a) rock alteration, i.e. through long residence times and natural discharge and/or enhanced leakages of deep seated geothermal water reservoirs, b) secondary concentration fortification of natural reservoirs through evaporation, through reduced recharge and/or enhanced abstraction and c) through additional enrichment of fluoride after volcanic emissions. The findings are useful to help improve water management

  8. Amygdala-enriched genes identified by microarray technology are restricted to specific amygdaloid subnuclei

    OpenAIRE

    Zirlinger, M.; Kreiman, Gabriel; Anderson, D. J.

    2001-01-01

    Microarray technology represents a potentially powerful method for identifying cell type- and regionally restricted genes expressed in the brain. Here we have combined a microarray analysis of differential gene expression among five selected brain regions, including the amygdala, cerebellum, hippocampus, olfactory bulb, and periaqueductal gray, with in situ hybridization. On average, 0.3% of the 34,000 genes interrogated were highly enriched in each of the five regions...

  9. Integrated Enrichment Analysis of Variants and Pathways in Genome-Wide Association Studies Indicates Central Role for IL-2 Signaling Genes in Type 1 Diabetes, and Cytokine Signaling Genes in Crohn's Disease

    Science.gov (United States)

    Carbonetto, Peter; Stephens, Matthew

    2013-01-01

    Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn's disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and “Measles” pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14

  10. Benchmarking Methods and Data Sets for Ligand Enrichment Assessment in Virtual Screening

    Science.gov (United States)

    Xia, Jie; Tilahun, Ermias Lemma; Reid, Terry-Elinor; Zhang, Liangren; Wang, Xiang Simon

    2014-01-01

    Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate ligand enrichments of VS approaches in the prospective (i.e. real-world) efforts. However, the intrinsic differences of benchmarking sets to the real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods as well as data sets and highlight three main types of biases found in benchmarking sets, i.e. “analogue bias”, “artificial enrichment” and “false negative”. In addition, we introduced our recent algorithm to build maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its implementations to three important human histone deacetylase (HDAC) isoforms, i.e. HDAC1, HDAC6 and HDAC8. The Leave-One-Out Cross-Validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased in terms of property matching, ROC curves and AUCs. PMID:25481478

  11. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  12. Principles for the organization of gene-sets.

    Science.gov (United States)

    Li, Wentian; Freudenberg, Jan; Oswald, Michaela

    2015-12-01

    A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ (geneanalytics.genecards.org), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including GeneCards® (the human gene database), MalaCards (the human diseases database), and PathCards (the biological pathways database). Expression-based analysis in GeneAnalytics relies on LifeMap Discovery® (the embryonic development and stem cells database), which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  14. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    Science.gov (United States)

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with a significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data from genome-wide association study (GWAS) and expression quantitative trait locus (eQTL) studies. GWAS summary data were derived from published studies of neuroticism, involving a total of 170,906 subjects. An eQTL dataset containing 927,753 eQTLs was obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted with the summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism-associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of the GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10⁻¹⁰), MGC57346 (p value=6.92×10⁻⁷), BLK (p value=1.01×10⁻⁶), XKR6 (p value=1.11×10⁻⁶), C17ORF69 (p value=1.12×10⁻⁶) and KIAA1267 (p value=4.00×10⁻⁶). Gene set enrichment analysis observed a significant association for the Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for studies of the genetic mechanism of neuroticism. Copyright © 2017. Published by Elsevier Inc.
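
    For readers unfamiliar with the GSEA step mentioned above, the following Python sketch shows the classic running-sum enrichment score computed over a ranked gene list. It is a generic illustration, not the pipeline used in the study: the ranking values, the `weight` exponent and the gene names are hypothetical, and the permutation step that turns a score into a false discovery rate is omitted.

```python
def enrichment_score(ranked_genes, gene_set, weight=1.0):
    """Running-sum enrichment score for one gene set.

    ranked_genes: list of (gene, score) pairs sorted by descending score.
    Walking down the list, the sum rises at set members (in proportion to
    |score| ** weight) and falls at non-members; the score returned is the
    largest deviation from zero, keeping its sign.
    Assumes at least one member with a nonzero score and at least one non-member.
    """
    hit_weights = [abs(s) ** weight if g in gene_set else 0.0 for g, s in ranked_genes]
    hit_total = sum(hit_weights)
    n_miss = sum(1 for g, _ in ranked_genes if g not in gene_set)
    running, extreme = 0.0, 0.0
    for (gene, _), w in zip(ranked_genes, hit_weights):
        if gene in gene_set:
            running += w / hit_total
        else:
            running -= 1.0 / n_miss
        if abs(running) > abs(extreme):
            extreme = running
    return extreme


# Hypothetical ranked list (for example, signed gene-level association statistics).
toy_ranking = [("g1", 3.0), ("g2", 2.0), ("g3", 1.0),
               ("g4", -1.0), ("g5", -2.0), ("g6", -3.0)]
es_top = enrichment_score(toy_ranking, {"g1", "g2"})
es_bottom = enrichment_score(toy_ranking, {"g5", "g6"})
```

    A set concentrated at the top of the ranking yields a positive score and one concentrated at the bottom a negative score; in practice the score is compared with a null distribution obtained by permutation before any FDR is reported.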

  15. A Bayesian variable selection procedure for ranking overlapping gene sets

    DEFF Research Database (Denmark)

    Skarman, Axel; Mahdi Shariati, Mohammad; Janss, Luc

    2012-01-01

    Background Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been de...

  16. Using targeted enrichment of nuclear genes to increase phylogenetic resolution in the neotropical rain forest genus Inga (Leguminosae: Mimosoideae)

    Directory of Open Access Journals (Sweden)

    James A Nicholls

    2015-09-01

    Evolutionary radiations are prominent and pervasive across many plant lineages in diverse geographical and ecological settings; in neotropical rainforests there is growing evidence suggesting that a significant fraction of species richness is the result of recent radiations. Understanding the evolutionary trajectories and mechanisms underlying these radiations demands much greater phylogenetic resolution than is currently available for these groups. The neotropical tree genus Inga (Leguminosae) is a good example, with ~300 extant species and a crown age of 2-10 MY, yet over 6 kb of plastid and nuclear DNA sequence data gives only poor phylogenetic resolution among species. Here we explore the use of larger-scale nuclear gene data obtained through targeted enrichment to increase phylogenetic resolution within Inga. Transcriptome data from three Inga species were used to select 264 nuclear loci for targeted enrichment and sequencing. Following quality control to remove probable paralogs from these sequence data, the final dataset comprised 259,313 bases from 194 loci for 24 accessions representing 22 Inga species and an outgroup (Zygia). Bayesian phylogenies reconstructed using either all loci concatenated or a subset of 60 loci in a gene-tree/species-tree approach yielded highly resolved phylogenies. We used coalescent approaches to show that the same targeted enrichment data also have significant power to discriminate among alternative within-species population histories in the widespread species I. umbellifera. In either application, targeted enrichment simplifies the informatics challenge of identifying orthologous loci associated with de novo genome sequencing. We conclude that targeted enrichment provides the large volumes of phylogenetically informative sequence data required to resolve relationships within recent plant species radiations, both at the species level and for within-species phylogeographic studies.

  17. Diversity of bacteria and glycosyl hydrolase family 48 genes in cellulolytic consortia enriched from thermophilic biocompost.

    Science.gov (United States)

    Izquierdo, Javier A; Sizova, Maria V; Lynd, Lee R

    2010-06-01

    The enrichment from nature of novel microbial communities with high cellulolytic activity is useful in the identification of novel organisms and novel functions that enhance the fundamental understanding of microbial cellulose degradation. In this work we identify predominant organisms in three cellulolytic enrichment cultures with thermophilic compost as an inoculum. Community structure based on 16S rRNA gene clone libraries featured extensive representation of clostridia from cluster III, with minor representation of clostridial clusters I and XIV and a novel Lutispora species cluster. Our studies reveal different levels of 16S rRNA gene diversity, ranging from 3 to 18 operational taxonomic units (OTUs), as well as variability in community membership across the three enrichment cultures. By comparison, glycosyl hydrolase family 48 (GHF48) diversity analyses revealed a narrower breadth of novel clostridial genes associated with cultured and uncultured cellulose degraders. The novel GHF48 genes identified in this study were related to the novel clostridia Clostridium straminisolvens and Clostridium clariflavum, with one cluster sharing as little as 73% sequence similarity with the closest known relative. In all, 14 new GHF48 gene sequences were added to the known diversity of 35 genes from cultured species.

  18. Targeted Enrichment of Large Gene Families for Phylogenetic Inference: Phylogeny and Molecular Evolution of Photosynthesis Genes in the Portullugo Clade (Caryophyllales).

    Science.gov (United States)

    Moore, Abigail J; Vos, Jurriaan M De; Hancock, Lillian P; Goolsby, Eric; Edwards, Erika J

    2018-05-01

    Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the "portullugo" (Caryophyllales), a moderately sized lineage of flowering plants (~2200 species) that includes the cacti and harbors many evolutionary transitions to C₄ and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C₄ and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C₄ and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75-218 loci across 74 taxa, with ~50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.

  19. Length bias correction in gene ontology enrichment analysis using logistic regression.

    Science.gov (United States)

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
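
    The adjustment described above can be illustrated with a small simulation in Python: category membership is modeled by logistic regression on a differential-expression (DE) score, once without and once with transcript length as a covariate. Everything in this sketch is an assumption made for illustration (the simulated data, the coefficient values, the function names and the plain Newton-Raphson fit); it is not the authors' code or data.

```python
import math
import random


def solve(a, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def fit_logistic(rows, y, n_iter=25):
    """Newton-Raphson logistic fit; an intercept column is prepended to rows."""
    X = [[1.0] + list(r) for r in rows]
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        grad = [0.0] * p
        hess = [[0.0] * p for _ in range(p)]
        for xi, yi in zip(X, y):
            mu = 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, xi))))
            for j in range(p):
                grad[j] += (yi - mu) * xi[j]
                for k in range(p):
                    hess[j][k] += mu * (1.0 - mu) * xi[j] * xi[k]
        beta = [b + s for b, s in zip(beta, solve(hess, grad))]
    return beta


# Simulated genes: membership depends on length only, while the DE score is
# correlated with length (the "length bias" situation described above).
random.seed(0)
length = [random.gauss(0, 1) for _ in range(2000)]          # standardized length
de_score = [0.8 * ln + 0.6 * random.gauss(0, 1) for ln in length]
in_category = [1 if random.random() < 1 / (1 + math.exp(1 - ln)) else 0
               for ln in length]

unadjusted = fit_logistic([[s] for s in de_score], in_category)
adjusted = fit_logistic([[s, ln] for s, ln in zip(de_score, length)], in_category)
```

    In this simulated setting, `unadjusted[1]` picks up a spurious association between the DE score and category membership, whereas `adjusted[1]` (the DE coefficient conditional on length) should shrink toward zero and `adjusted[2]` absorbs the length effect.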

  20. Differential Gene Expression Profiling of Enriched Human Spermatogonia after Short- and Long-Term Culture

    Directory of Open Access Journals (Sweden)

    Sabine Conrad

    2014-01-01

    This study aimed to provide a molecular signature for enriched adult human stem/progenitor spermatogonia during short-term (<2 weeks) and long-term culture (up to more than 14 months) in comparison to human testicular fibroblasts and human embryonic stem cells. Human spermatogonia were isolated by CD49f magnetic activated cell sorting and collagen−/laminin+ matrix binding from primary testis cultures obtained from ten adult men. For transcriptomic analysis, single spermatogonia-like cells were collected based on their morphology and dimensions using a micromanipulation system from the enriched germ cell cultures. Immunocytochemical, RT-PCR and microarray analyses revealed that the analyzed populations of cells were distinct at the molecular level. The germ- and pluripotency-associated genes and genes of differentiation/spermatogenesis pathway were highly expressed in enriched short-term cultured spermatogonia. After long-term culture, a proportion of cells retained and aggravated the “spermatogonial” gene expression profile with the expression of germ and pluripotency-associated genes, while in the majority of long-term cultured cells this molecular profile, typical for the differentiation pathway, was reduced and more genes related to the extracellular matrix production and attachment were expressed. The approach we provide here to study the molecular status of in vitro cultured spermatogonia may be important to optimize the culture conditions and to evaluate the germ cell plasticity in the future.

  1. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Background: Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results: We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions: Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  2. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1) is located in, and regulated by, the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  3. Clusters of Antibiotic Resistance Genes Enriched Together Stay Together in Swine Agriculture.

    Science.gov (United States)

    Johnson, Timothy A; Stedtfeld, Robert D; Wang, Qiong; Cole, James R; Hashsham, Syed A; Looft, Torey; Zhu, Yong-Guan; Tiedje, James M

    2016-04-12

    Antibiotic resistance is a worldwide health risk, but the influence of animal agriculture on the genetic context and enrichment of individual antibiotic resistance alleles remains unclear. Using quantitative PCR followed by amplicon sequencing, we quantified and sequenced 44 genes related to antibiotic resistance, mobile genetic elements, and bacterial phylogeny in microbiomes from U.S. laboratory swine and from swine farms from three Chinese regions. We identified highly abundant resistance clusters: groups of resistance and mobile genetic element alleles that cooccur. For example, the abundance of genes conferring resistance to six classes of antibiotics together with class 1 integrase and the abundance of IS6100-type transposons in three Chinese regions are directly correlated. These resistance cluster genes likely colocalize in microbial genomes in the farms. Resistance cluster alleles were dramatically enriched (up to 1 to 10% as abundant as 16S rRNA) and indicate that multidrug-resistant bacteria are likely the norm rather than an exception in these communities. This enrichment largely occurred independently of phylogenetic composition; thus, resistance clusters are likely present in many bacterial taxa. Furthermore, resistance clusters contain resistance genes that confer resistance to antibiotics independently of their particular use on the farms. Selection for these clusters is likely due to the use of only a subset of the broad range of chemicals to which the clusters confer resistance. The scale of animal agriculture and its wastes, the enrichment and horizontal gene transfer potential of the clusters, and the vicinity of large human populations suggest that managing this resistance reservoir is important for minimizing human risk. Agricultural antibiotic use results in clusters of cooccurring resistance genes that together confer resistance to multiple antibiotics. The use of a single antibiotic could select for an entire suite of resistance genes if

  4. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  5. DNMT1 is associated with cell cycle and DNA replication gene sets in diffuse large B-cell lymphoma.

    Science.gov (United States)

    Loo, Suet Kee; Ab Hamid, Suzina Sheikh; Musa, Mustaffa; Wong, Kah Keng

    2018-01-01

    Dysregulation of DNA (cytosine-5)-methyltransferase 1 (DNMT1) is associated with the pathogenesis of various types of cancer. It has been previously shown that DNMT1 is frequently expressed in diffuse large B-cell lymphoma (DLBCL), however its functions remain to be elucidated in the disease. In this study, we gene expression profiled (GEP) shRNA targeting DNMT1(shDNMT1)-treated germinal center B-cell-like DLBCL (GCB-DLBCL)-derived cell line (i.e. HT) compared with non-silencing shRNA (control shRNA)-treated HT cells. Independent gene set enrichment analysis (GSEA) performed using GEPs of shRNA-treated HT cells and primary GCB-DLBCL cases derived from two publicly-available datasets (i.e. GSE10846 and GSE31312) produced three separate lists of enriched gene sets for each gene sets collection from Molecular Signatures Database (MSigDB). Subsequent Venn analysis identified 268, 145 and six consensus gene sets from analyzing gene sets in C2 collection (curated gene sets), C5 sub-collection [gene sets from gene ontology (GO) biological process ontology] and Hallmark collection, respectively to be enriched in positive correlation with DNMT1 expression profiles in shRNA-treated HT cells, GSE10846 and GSE31312 datasets [false discovery rate (FDR) 0.8) with DNMT1 expression and significantly downregulated (log fold-change <-1.35; p<0.05) following DNMT1 silencing in HT cells. These results suggest the involvement of DNMT1 in the activation of cell cycle and DNA replication in DLBCL cells. Copyright © 2017 Elsevier GmbH. All rights reserved.
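
    The "Venn analysis" step in this abstract amounts to intersecting the lists of enriched gene sets from the three independent GSEA runs. The sketch below illustrates that idea only; the gene set names and q-values are placeholders invented for the example, the function names are hypothetical, and the FDR cutoff is passed in as a parameter because the threshold is not recoverable from the abstract text above.

```python
def enriched(results, fdr_cutoff):
    """Return the gene set names whose FDR q-value is below the cutoff."""
    return {name for name, fdr in results.items() if fdr < fdr_cutoff}

def consensus(result_tables, fdr_cutoff):
    """Gene sets enriched in every analysis (the centre of the Venn diagram)."""
    kept = [enriched(table, fdr_cutoff) for table in result_tables]
    return set.intersection(*kept)

# Placeholder names and q-values, invented purely for illustration.
cell_line = {"SET_A": 0.01, "SET_B": 0.20, "SET_C": 0.03}
cohort_1 = {"SET_A": 0.02, "SET_B": 0.01, "SET_C": 0.04}
cohort_2 = {"SET_A": 0.04, "SET_C": 0.30, "SET_D": 0.01}

# The 0.05 cutoff is an assumption made for this sketch.
shared = consensus([cell_line, cohort_1, cohort_2], fdr_cutoff=0.05)
print(sorted(shared))  # prints ['SET_A']
```

    Only gene sets that pass the cutoff in all three inputs survive, which is why requiring consensus across a cell line and two patient cohorts yields a much shorter list than any single analysis.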

  6. Discovery of cancer common and specific driver gene sets

    Science.gov (United States)

    2017-01-01

    Abstract Cancer is known as a disease mainly caused by gene alterations. Discovery of mutated driver pathways or gene sets is becoming an important step to understand molecular mechanisms of carcinogenesis. However, systematically investigating commonalities and specificities of driver gene sets among multiple cancer types is still a great challenge, but this investigation will undoubtedly benefit deciphering cancers and will be helpful for personalized therapy and precision medicine in cancer treatment. In this study, we propose two optimization models to de novo discover common driver gene sets among multiple cancer types (ComMDP) and specific driver gene sets of one certain or multiple cancer types to other cancers (SpeMDP), respectively. We first apply ComMDP and SpeMDP to simulated data to validate their efficiency. Then, we further apply these methods to 12 cancer types from The Cancer Genome Atlas (TCGA) and obtain several biologically meaningful driver pathways. As examples, we construct a common cancer pathway model for BRCA and OV, infer a complex driver pathway model for BRCA carcinogenesis based on common driver gene sets of BRCA with eight cancer types, and investigate specific driver pathways of the liquid cancer lymphoblastic acute myeloid leukemia (LAML) versus other solid cancer types. In these processes more candidate cancer genes are also found. PMID:28168295

  7. Composting-Like Conditions Are More Efficient for Enrichment and Diversity of Organisms Containing Cellulase-Encoding Genes than Submerged Cultures.

    Directory of Open Access Journals (Sweden)

    Senta Heiss-Blanquet

    Cost-effective biofuel production from lignocellulosic biomass depends on efficient degradation of the plant cell wall. One of the major obstacles for the development of a cost-efficient process is the lack of resistance of currently used fungal enzymes to harsh conditions such as high temperature. Adapted, thermophilic microbial communities provide a huge reservoir of potentially interesting lignocellulose-degrading enzymes for improvement of the cellulose hydrolysis step. In order to identify such enzymes, a leaf and wood chip compost was enriched on a mixture of thermo-chemically pretreated wheat straw, poplar and Miscanthus under thermophilic conditions, but in two different set-ups. Unexpectedly, metagenome sequencing revealed that incubation of the lignocellulosic substrate with compost as inoculum in a suspension culture resulted in an impoverishment of putative cellulase- and hemicellulase-encoding genes. However, mimicking composting conditions without liquid phase yielded a high number and diversity of glycoside hydrolase genes and an enrichment of genes encoding cellulose binding domains. These identified genes were most closely related to species from Actinobacteria, which seem to constitute important players of lignocellulose degradation under the applied conditions. The study highlights that subtle changes in an enrichment set-up can have an important impact on composition and functions of the microcosm. Composting-like conditions were found to be the most successful method for enrichment in species with high biomass degrading capacity.

  8. Geo-Enrichment and Semantic Enhancement of Metadata Sets to Augment Discovery in Geoportals

    Directory of Open Access Journals (Sweden)

    Bernhard Vockner

    2014-03-01

    Geoportals are established to function as main gateways to find, evaluate, and start “using” geographic information. Still, current geoportal implementations face problems in optimizing the discovery process due to semantic heterogeneity issues, which lead to low recall and low precision in text-based searches. Therefore, we propose an enhanced semantic discovery approach that supports multilingualism and information domain context. We present a workflow that enriches existing structured metadata with synonyms, toponyms, and translated terms derived from user-defined keywords based on multilingual thesauri and ontologies. To make the results easier to understand, we also provide automated translation capabilities for the resource metadata to support the user in conceiving the thematic content of the descriptive metadata, even if it has been documented using a language the user is not familiar with. In addition, to text-enable spatial filtering capabilities, we add location name keywords to metadata sets. These are based on the existing bounding box and shall tweak discovery scores when performing single text line queries. In order to improve the user’s search experience, we tailor faceted search strategies presenting an enhanced query interface for geo-metadata discovery that transparently leverages the underlying thesauri and ontologies.

  9. Microbial gene functions enriched in the Deepwater Horizon deep-sea oil plume

    Energy Technology Data Exchange (ETDEWEB)

    Lu, Z.; Deng, Y.; Nostrand, J.D. Van; He, Z.; Voordeckers, J.; Zhou, A.; Lee, Y.-J.; Mason, O.U.; Dubinsky, E.; Chavarria, K.; Tom, L.; Fortney, J.; Lamendella, R.; Jansson, J.K.; D'haeseleer, P.; Hazen, T.C.; Zhou, J.

    2011-06-15

    The Deepwater Horizon oil spill in the Gulf of Mexico is the deepest and largest offshore spill in U.S. history and its impacts on marine ecosystems are largely unknown. Here, we showed that the microbial community functional composition and structure were dramatically altered in a deep-sea oil plume resulting from the spill. A variety of metabolic genes involved in both aerobic and anaerobic hydrocarbon degradation were highly enriched in the plume compared to outside the plume, indicating a great potential for intrinsic bioremediation or natural attenuation in the deep-sea. Various other microbial functional genes relevant to carbon, nitrogen, phosphorus, sulfur and iron cycling, metal resistance, and bacteriophage replication were also enriched in the plume. Together, these results suggest that the indigenous marine microbial communities could play a significant role in biodegradation of oil spills in deep-sea environments.

  10. Screening Key Genes Associated with the Development and Progression of Non-small Cell Lung Cancer Based on Gene-enrichment Analysis and Meta-analysis

    Directory of Open Access Journals (Sweden)

    Wenwu HE

    2012-07-01

    Background and objective: Non-small cell lung cancer (NSCLC) is one of the most common malignant tumors; however, its causes are still not completely understood. This study was designed to screen the key genes and pathways related to NSCLC occurrence and development and to establish the scientific foundation for the genetic mechanisms and targeted therapy of NSCLC. Methods: Both gene set-enrichment analysis (GSEA) and meta-analysis (meta) were used to screen the critical pathways and genes that might be correlated with the development and progression of lung cancer at the transcription level. Results: Using the GSEA and meta methods, focal adhesion and regulation of actin cytoskeleton were determined to be the more prominent overlapping significant pathways. In the focal adhesion pathway, 31 genes were statistically significant (P<0.05), whereas in the regulation of actin cytoskeleton pathway, 32 genes were statistically significant (P<0.05). Conclusion: The focal adhesion and the regulation of actin cytoskeleton pathways might play important roles in the occurrence and development of NSCLC. Further studies are needed to determine the biological function for the positive genes.

  11. Mammalian transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes and are predicted to act as transcriptional activator hubs.

    Science.gov (United States)

    Joshi, Anagha

    2014-12-30

    Transcriptional hotspots are defined as genomic regions bound by multiple factors. They have been identified recently as cell type specific enhancers regulating developmentally essential genes in many species such as worm, fly and humans. The in-depth analysis of hotspots across multiple cell types in same species still remains to be explored and can bring new biological insights. We therefore collected 108 transcription-related factor (TF) ChIP sequencing data sets in ten murine cell types and classified the peaks in each cell type in three groups according to binding occupancy as singletons (low-occupancy), combinatorials (mid-occupancy) and hotspots (high-occupancy). The peaks in the three groups clustered largely according to the occupancy, suggesting priming of genomic loci for mid occupancy irrespective of cell type. We then characterized hotspots for diverse structural functional properties. The genes neighbouring hotspots had a small overlap with hotspot genes in other cell types and were highly enriched for cell type specific function. Hotspots were enriched for sequence motifs of key TFs in that cell type and more than 90% of hotspots were occupied by pioneering factors. Though we did not find any sequence signature in the three groups, the H3K4me1 binding profile had bimodal peaks at hotspots, distinguishing hotspots from mono-modal H3K4me1 singletons. In ES cells, differentially expressed genes after perturbation of activators were enriched for hotspot genes suggesting hotspots primarily act as transcriptional activator hubs. Finally, we proposed that ES hotspots might be under control of SetDB1 and not DNMT for silencing. Transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes. In ES cells, they are predicted to act as transcriptional activator hubs and might be under SetDB1 control for silencing.
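
    The three-way grouping of peaks by binding occupancy can be sketched as follows. The abstract does not state the occupancy cutoffs, so `low_max` and `high_min` below are hypothetical parameters, and the region and factor names are placeholders; this is an illustration of the grouping idea, not the author's pipeline.

```python
def count_occupancy(peaks):
    """Count distinct bound factors per region from (region, factor) pairs."""
    factors = {}
    for region, factor in peaks:
        factors.setdefault(region, set()).add(factor)
    return {region: len(bound) for region, bound in factors.items()}

def classify_regions(occupancy, low_max=1, high_min=5):
    """Label each region as singleton, combinatorial or hotspot.

    low_max and high_min are assumed cutoffs chosen for this example only.
    """
    labels = {}
    for region, n_factors in occupancy.items():
        if n_factors <= low_max:
            labels[region] = "singleton"
        elif n_factors >= high_min:
            labels[region] = "hotspot"
        else:
            labels[region] = "combinatorial"
    return labels

# Placeholder peak calls for one cell type: (genomic region, bound factor).
peaks = (
    [("region_1", "TF1")]
    + [("region_2", tf) for tf in ("TF1", "TF2", "TF3")]
    + [("region_3", tf) for tf in ("TF1", "TF2", "TF3", "TF4", "TF5", "TF6")]
)

labels = classify_regions(count_occupancy(peaks))
for region in sorted(labels):
    print(region, labels[region])
```

    Repeating the classification per cell type and comparing the resulting hotspot sets is one way to examine the small cross-cell-type overlap the abstract reports.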

  12. Enrichment of short interspersed transposable elements to embryonic stem cell-specific hypomethylated gene regions.

    Science.gov (United States)

    Muramoto, Hiroki; Yagi, Shintaro; Hirabayashi, Keiji; Sato, Shinya; Ohgane, Jun; Tanaka, Satoshi; Shiota, Kunio

    2010-08-01

    Embryonic stem cells (ESCs) have a distinctive epigenome, which includes their genome-wide DNA methylation modification status, as represented by the ESC-specific hypomethylation of tissue-dependent and differentially methylated regions (T-DMRs) of Pou5f1 and Nanog. Here, we conducted a genome-wide investigation of sequence characteristics associated with T-DMRs that were differentially methylated between ESCs and somatic cells, by focusing on transposable elements including short interspersed elements (SINEs), long interspersed elements (LINEs) and long terminal repeats (LTRs). We found that hypomethylated T-DMRs were predominantly present in SINE-rich/LINE-poor genomic loci. The enrichment for SINEs spread over 300 kb in cis and there existed SINE-rich genomic domains spreading continuously over 1 Mb, which contained multiple hypomethylated T-DMRs. The characterization of sequence information showed that the enriched SINEs were relatively CpG rich and belonged to specific subfamilies. A subset of the enriched SINEs were hypomethylated T-DMRs in ESCs at Dppa3 gene locus, although SINEs are overall methylated in both ESCs and the liver. In conclusion, we propose that SINE enrichment is the genomic property of regions harboring hypomethylated T-DMRs in ESCs, which is a novel aspect of the ESC-specific epigenomic information.

  13. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population.

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-06-02

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-01-01

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991

  15. APPRIS 2017: principal isoforms for multiple gene sets

    Science.gov (United States)

    Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

    2018-01-01

    The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main protein isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melanogaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475

  16. Evidence for intron length conservation in a set of mammalian genes associated with embryonic development

    LENUS (Irish Health Repository)

    2011-10-05

    Background: We carried out an analysis of intron length conservation across a diverse group of nineteen mammalian species. Motivated by recent research suggesting a role for time delays associated with intron transcription in gene expression oscillations required for early embryonic patterning, we searched for examples of genes that showed the most extreme conservation of total intron content in mammals. Results: Gene sets annotated as being involved in pattern specification in the early embryo or containing the homeobox DNA-binding domain were significantly enriched among genes with highly conserved intron content. We used ancestral sequences reconstructed with probabilistic models that account for insertion and deletion mutations to distinguish insertion and deletion events on lineages leading to human and mouse from their last common ancestor. Using a randomization procedure, we show that genes containing the homeobox domain show less change in intron content than expected, given the number of insertion and deletion events within their introns. Conclusions: Our results suggest selection for gene expression precision or the existence of additional development-associated genes for which transcriptional delay is functionally significant.

  17. Machine learning approaches to supporting the identification of photoreceptor-enriched genes based on expression data

    Directory of Open Access Journals (Sweden)

    Simpson David

    2006-03-01

    Background: Retinal photoreceptors are highly specialised cells, which detect light and are central to mammalian vision. Many retinal diseases occur as a result of inherited dysfunction of the rod and cone photoreceptor cells. Development and maintenance of photoreceptors requires appropriate regulation of the many genes specifically or highly expressed in these cells. Over the last decades, different experimental approaches have been developed to identify photoreceptor enriched genes. Recent progress in RNA analysis technology has generated large amounts of gene expression data relevant to retinal development. This paper assesses a machine learning methodology for supporting the identification of photoreceptor enriched genes based on expression data. Results: Based on the analysis of publicly-available gene expression data from the developing mouse retina generated by serial analysis of gene expression (SAGE), this paper presents a predictive methodology comprising several in silico models for detecting key complex features and relationships encoded in the data, which may be useful to distinguish genes in terms of their functional roles. In order to understand temporal patterns of photoreceptor gene expression during retinal development, a two-way cluster analysis was firstly performed. By clustering SAGE libraries, a hierarchical tree reflecting relationships between developmental stages was obtained. By clustering SAGE tags, a more comprehensive expression profile for photoreceptor cells was revealed. To demonstrate the usefulness of machine learning-based models in predicting functional associations from the SAGE data, three supervised classification models were compared. The results indicated that a relatively simple instance-based model (KStar model) performed significantly better than relatively more complex algorithms, e.g. neural networks. To deal with the problem of functional class imbalance occurring in the dataset, two data re

  18. GeneTopics - interpretation of gene sets via literature-driven topic models

    Science.gov (United States)

    2013-01-01

    Background: Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods: Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results: We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions: Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly

  19. The Resistome of Farmed Fish Feces Contributes to the Enrichment of Antibiotic Resistance Genes in Sediments below Baltic Sea Fish Farms.

    Science.gov (United States)

    Muziasari, Windi I; Pitkänen, Leena K; Sørum, Henning; Stedtfeld, Robert D; Tiedje, James M; Virta, Marko

    2016-01-01

    Our previous studies showed that particular antibiotic resistance genes (ARGs) were enriched locally in sediments below fish farms in the Northern Baltic Sea, Finland, even when the selection pressure from antibiotics was negligible. We assumed that a constant influx of farmed fish feces could be the plausible source of the ARGs enriched in the farm sediments. In the present study, we analyzed the composition of the antibiotic resistome from the intestinal contents of 20 fish from the Baltic Sea farms. We used a high-throughput method, WaferGen qPCR array with 364 primer sets to detect and quantify ARGs, mobile genetic elements (MGE), and the 16S rRNA gene. Despite a considerably wide selection of qPCR primer sets, only 28 genes were detected in the intestinal contents. The detected genes were ARGs encoding resistance to sulfonamide (sul1), trimethoprim (dfrA1), tetracycline [tet(32), tetM, tetO, tetW], aminoglycoside (aadA1, aadA2), chloramphenicol (catA1), and efflux-pumps resistance genes (emrB, matA, mefA, msrA). The detected genes also included class 1 integron-associated genes (intI1, qacEΔ1) and transposases (tnpA). Importantly, most of the detected genes were the same genes enriched in the farm sediments. This preliminary study suggests that feces from farmed fish contribute to the ARG enrichment in farm sediments despite the lack of contemporaneous antibiotic treatments at the farms. We observed that the intestinal contents of individual farmed fish had their own resistome compositions. Our result also showed that the total relative abundances of transposases and tet genes were significantly correlated (p = 0.001, R² = 0.71). In addition, we analyzed the mucosal skin and gill filament resistomes of the farmed fish but only one multidrug-efflux resistance gene (emrB) was detected. To our knowledge, this is the first study reporting the resistome of farmed fish using a culture-independent method. Determining the possible sources of

  20. Expressed sequence enrichment for candidate gene analysis of citrus tristeza virus resistance.

    Science.gov (United States)

    Bernet, G P; Bretó, M P; Asins, M J

    2004-02-01

    Several studies have reported markers linked to a putative resistance gene from Poncirus trifoliata ( Ctv-R) located at linkage group 4 that confers resistance against one of the most important citrus pathogens, citrus tristeza virus (CTV). To be successful in both marker-assisted selection and transformation experiments, its accurate mapping is needed. Several factors may affect its localization, among them two are considered here: the definition of resistance and the genetic background of progeny. Two progenies derived from P. trifoliata, by self-pollination and by crossing with sour orange ( Citrus aurantium), a citrus rootstock well-adapted to arid and semi-arid areas, were used for linkage group-4 marker enrichment. Two new methodologies were used to enrich this region with expressed sequences. The enrichment of group 4 resulted in the fusion of several C. aurantium linkage groups. The new one A(7+3+4) is now saturated with 48 markers including expressed sequences. Surprisingly, sour orange was as resistant to the CTV isolate tested as was P. trifoliata, and three hybrids that carry Ctv-R, as deduced from its flanking markers, are susceptible to CTV. The new linkage maps were used to map Ctv-R under the hypothesis of monogenic inheritance. Its position on linkage group 4 of P. trifoliata differs from the location previously reported in other progenies. The genetic analysis of virus-plant interaction in the family derived from C. aurantium after a CTV chronic infection showed the segregation of five types of interaction, which is not compatible with the hypothesis of a single gene controlling resistance. Two major issues are discussed: another type of genetic analysis of CTV resistance is needed to avoid the assumption of monogenic inheritance, and transferring Ctv-R from P. trifoliata to sour orange might not avoid the CTV decline of sweet orange trees.

  1. Enriched expression of the ciliopathy gene Ick in cell proliferating regions of adult mice.

    Science.gov (United States)

    Tsutsumi, Ryotaro; Chaya, Taro; Furukawa, Takahisa

    2018-04-07

    Cilia are essential for sensory and motile functions across species. In humans, ciliary dysfunction causes "ciliopathies", which show severe developmental abnormalities in various tissues. Several missense mutations in intestinal cell kinase (ICK) gene lead to endocrine-cerebro-osteodysplasia syndrome or short rib-polydactyly syndrome, lethal recessive developmental ciliopathies. We and others previously reported that Ick-deficient mice exhibit neonatal lethality with developmental defects. Mechanistically, Ick regulates intraflagellar transport and cilia length at ciliary tips. Although Ick plays important roles during mammalian development, roles of Ick at the adult stage are poorly understood. In the current study, we investigated the Ick gene expression in adult mouse tissues. RT-PCR analysis showed that Ick is ubiquitously expressed, with enrichment in the retina, brain, lung, intestine, and reproductive system. In the adult brain, we found that Ick expression is enriched in the walls of the lateral ventricle, in the rostral migratory stream of the olfactory bulb, and in the subgranular zone of the hippocampal dentate gyrus by in situ hybridization analysis. We also observed that Ick staining pattern is similar to pachytene spermatocyte to spermatid markers in the mature testis and to an intestinal stem cell marker in the adult small intestine. These results suggest that Ick is expressed in proliferating regions in the adult mouse brain, testis, and intestine. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. Pathway-Enriched Gene Signature Associated with 53BP1 Response to PARP Inhibition in Triple-Negative Breast Cancer.

    Science.gov (United States)

    Hassan, Saima; Esch, Amanda; Liby, Tiera; Gray, Joe W; Heiser, Laura M

    2017-12-01

    Effective treatment of patients with triple-negative (ER-negative, PR-negative, HER2-negative) breast cancer remains a challenge. Although PARP inhibitors are being evaluated in clinical trials, biomarkers are needed to identify patients who will most benefit from anti-PARP therapy. We determined the responses of three PARP inhibitors (veliparib, olaparib, and talazoparib) in a panel of eight triple-negative breast cancer cell lines. Therapeutic responses and cellular phenotypes were elucidated using high-content imaging and quantitative immunofluorescence to assess markers of DNA damage (53BP1) and apoptosis (cleaved PARP). We determined the pharmacodynamic changes as percentage of cells positive for 53BP1, mean number of 53BP1 foci per cell, and percentage of cells positive for cleaved PARP. Inspired by traditional dose-response measures of cell viability, an EC50 value was calculated for each cellular phenotype and each PARP inhibitor. The EC50 values for both 53BP1 metrics strongly correlated with IC50 values for each PARP inhibitor. Pathway enrichment analysis identified a set of DNA repair and cell cycle-associated genes that were associated with 53BP1 response following PARP inhibition. The overall accuracy of our 63 gene set in predicting response to olaparib in seven breast cancer patient-derived xenograft tumors was 86%. In triple-negative breast cancer patients who had not received anti-PARP therapy, the predicted response rate of our gene signature was 45%. These results indicate that 53BP1 is a biomarker of response to anti-PARP therapy in the laboratory, and our DNA damage response gene signature may be used to identify patients who are most likely to respond to PARP inhibition. Mol Cancer Ther; 16(12); 2892-901. ©2017 American Association for Cancer Research.

  3. Using Goal-Setting Strategies To Enrich the Practicum and Internship Experiences of Beginning Counselors.

    Science.gov (United States)

    Curtis, Russell C.

    2000-01-01

    Goal setting can be an effective way to help beginning counselors focus on important developmental issues. This article argues that counselors and supervisors must consider issues related to goal-setting theory and understand the process by which goals are set so that optimal learning experiences are created. (Author/MKA)

  4. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions.

    Science.gov (United States)

    Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade

    2015-11-14

    FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.

  5. Neuron-Enriched Gene Expression Patterns are Regionally Anti-Correlated with Oligodendrocyte-Enriched Patterns in the Adult Mouse and Human Brain.

    Science.gov (United States)

    Tan, Powell Patrick Cheng; French, Leon; Pavlidis, Paul

    2013-01-01

    An important goal in neuroscience is to understand gene expression patterns in the brain. The recent availability of comprehensive and detailed expression atlases for mouse and human creates opportunities to discover global patterns and perform cross-species comparisons. Recently we reported that the major source of variation in gene transcript expression in the adult normal mouse brain can be parsimoniously explained as reflecting regional variation in glia to neuron ratios, and is correlated with degree of connectivity and location in the brain along the anterior-posterior axis. Here we extend this investigation to two gene expression assays of adult normal human brains that consisted of over 300 brain region samples, and perform comparative analyses of brain-wide expression patterns to the mouse. We performed principal components analysis (PCA) on the regional gene expression of the adult human brain to identify the expression pattern that has the largest variance. As in the mouse, we observed that the first principal component is composed of two anti-correlated patterns enriched in oligodendrocyte and neuron markers respectively. However, we also observed interesting discordant patterns between the two species. For example, a few mouse neuron markers show expression patterns that are more correlated with the human oligodendrocyte-enriched pattern and vice-versa. In conclusion, our work provides insights into human brain function and evolution by probing global relationships between regional cell type marker expression patterns in the human and mouse brain.
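
    The PCA step described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic, the marker names are hypothetical, and the first principal component is found by plain power iteration so that only the standard library is needed. Each synthetic "region" gets a random neuron fraction, and the two marker groups rise and fall in opposite directions with it.

```python
import random

def first_pc(rows, iterations=100):
    """First principal component (unit vector of gene loadings) by power iteration.

    rows: list of samples (brain regions), each a list of gene expression values.
    """
    n_genes = len(rows[0])
    means = [sum(col) / len(rows) for col in zip(*rows)]
    centered = [[x - m for x, m in zip(row, means)] for row in rows]
    v = [float(j + 1) for j in range(n_genes)]  # arbitrary non-zero start vector
    for _ in range(iterations):
        # Multiply by X^T X without forming the covariance matrix explicitly.
        proj = [sum(x * w for x, w in zip(row, v)) for row in centered]
        v = [sum(p * row[j] for p, row in zip(proj, centered)) for j in range(n_genes)]
        norm = sum(w * w for w in v) ** 0.5
        v = [w / norm for w in v]
    return v

# Synthetic data (assumption): expression driven by a per-region neuron fraction f.
rng = random.Random(0)
genes = ["neuron_marker_1", "neuron_marker_2", "oligo_marker_1", "oligo_marker_2"]
rows = []
for _ in range(30):
    f = rng.random()
    rows.append([
        10 * f + rng.gauss(0, 0.5),        # neuron markers rise with f
        8 * f + rng.gauss(0, 0.5),
        10 * (1 - f) + rng.gauss(0, 0.5),  # oligodendrocyte markers fall with f
        9 * (1 - f) + rng.gauss(0, 0.5),
    ])

loadings = first_pc(rows)
for name, w in zip(genes, loadings):
    print(f"{name}: {w:+.2f}")
```

    In this toy setting the neuron and oligodendrocyte markers receive loadings of opposite sign on the first component, which is the kind of anti-correlated pattern the abstract reports; the overall sign of a principal component is arbitrary.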

  6. Early repositioning through compound set enrichment analysis: a knowledge-recycling strategy.

    Science.gov (United States)

    Temesi, Gergely; Bolgár, Bence; Arany, Adám; Szalai, Csaba; Antal, Péter; Mátyus, Péter

    2014-04-01

    Despite famous serendipitous drug repositioning success stories, systematic projects have not yet delivered the expected results. However, repositioning technologies are gaining ground in different phases of routine drug development, together with new adaptive strategies. We demonstrate the power of the compound information pool, the ever-growing heterogeneous information repertoire of approved drugs and candidates as an invaluable catalyzer in this transition. Systematic, computational utilization of this information pool for candidates in early phases is an open research problem; we propose a novel application of the enrichment analysis statistical framework for fusion of this information pool, specifically for the prediction of indications. Pharmaceutical consequences are formulated for a systematic and continuous knowledge recycling strategy, utilizing this information pool throughout the drug-discovery pipeline.
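
    The abstract does not say which enrichment statistic the authors use. As a hedged illustration only, one common form of enrichment analysis is the hypergeometric over-representation test, sketched below with hypothetical counts (a pool of 1000 compounds, 50 of them annotated with some indication, and a candidate set of 20 compounds containing 5 annotated ones).

```python
from math import comb

def hypergeom_enrichment_p(population, annotated, selected, overlap):
    """P(X >= overlap) when `selected` items are drawn without replacement
    from `population` items, of which `annotated` carry the label of interest."""
    max_overlap = min(annotated, selected)
    total = comb(population, selected)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, selected - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / total

# Hypothetical counts, chosen only to illustrate the calculation.
p_value = hypergeom_enrichment_p(population=1000, annotated=50, selected=20, overlap=5)
print(f"p = {p_value:.3g}")
```

    With an expected overlap of only one compound by chance (20 × 50 / 1000), observing five gives a small tail probability. In practice such p-values would also be corrected for the number of labels tested.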

  7. Improving Gene Therapy Efficiency through the Enrichment of Human Hematopoietic Stem Cells.

    Science.gov (United States)

    Masiuk, Katelyn E; Brown, Devin; Laborada, Jennifer; Hollis, Roger P; Urbinati, Fabrizia; Kohn, Donald B

    2017-09-06

    Lentiviral vector (LV)-based hematopoietic stem cell (HSC) gene therapy is becoming a promising clinical strategy for the treatment of genetic blood diseases. However, the current approach of modifying 1 × 10^8 to 1 × 10^9 CD34+ cells per patient requires large amounts of LV, which is expensive and technically challenging to produce at clinical scale. Modification of bulk CD34+ cells uses LV inefficiently, because the majority of CD34+ cells are short-term progenitors with a limited post-transplant lifespan. Here, we utilized a clinically relevant, immunomagnetic bead (IB)-based method to purify CD34+CD38- cells from human bone marrow (BM) and mobilized peripheral blood (mPB). IB purification of CD34+CD38- cells enriched severe combined immune deficiency (SCID) repopulating cell (SRC) frequency an additional 12-fold beyond standard CD34+ purification and did not affect gene marking of long-term HSCs. Transplant of purified CD34+CD38- cells led to delayed myeloid reconstitution, which could be rescued by the addition of non-transduced CD38+ cells. Importantly, LV modification and transplantation of IB-purified CD34+CD38- cells/non-modified CD38+ cells into immune-deficient mice achieved long-term gene-marked engraftment comparable with modification of bulk CD34+ cells, while utilizing ∼7-fold less LV. Thus, we demonstrate a translatable method to improve the clinical and commercial viability of gene therapy for genetic blood cell diseases. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  8. Community Composition of Nitrous Oxide-Related Genes in Salt Marsh Sediments Exposed to Nitrogen Enrichment.

    Science.gov (United States)

    Angell, John H; Peng, Xuefeng; Ji, Qixing; Craick, Ian; Jayakumar, Amal; Kearns, Patrick J; Ward, Bess B; Bowen, Jennifer L

    2018-01-01

    Salt marshes provide many key ecosystem services that have tremendous ecological and economic value. One critical service is the removal of fixed nitrogen from coastal waters, which limits the negative effects of eutrophication resulting from increased nutrient supply. Nutrient enrichment of salt marsh sediments results in higher rates of nitrogen cycling and, commonly, a concurrent increase in the flux of nitrous oxide, an important greenhouse gas. Little is known, however, regarding controls on the microbial communities that contribute to nitrous oxide fluxes in marsh sediments. To address this disconnect, we generated profiles of microbial communities and communities of micro-organisms containing specific nitrogen cycling genes that encode several enzymes ( amoA, norB, nosZ) related to nitrous oxide flux from salt marsh sediments. We hypothesized that communities of microbes responsible for nitrogen transformations will be structured by nitrogen availability. Taxa that respond positively to high nitrogen inputs may be responsible for the elevated rates of nitrogen cycling processes measured in fertilized sediments. Our data show that, with the exception of ammonia-oxidizing archaea, the community composition of organisms involved in the production and consumption of nitrous oxide was altered under nutrient enrichment. These results suggest that previously measured rates of nitrous oxide production and consumption are likely the result of changes in community structure, not simply changes in microbial activity.

  9. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions

    DEFF Research Database (Denmark)

    Luo, Yonglun; Friis, Jenny Blechingberg; Fernandes, Ana Miguel

    2015-01-01

    Background FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins... at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. Conclusions The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.

  10. Enrichment of human hematopoietic stem/progenitor cells facilitates transduction for stem cell gene therapy.

    Science.gov (United States)

    Baldwin, Kismet; Urbinati, Fabrizia; Romero, Zulema; Campo-Fernandez, Beatriz; Kaufman, Michael L; Cooper, Aaron R; Masiuk, Katelyn; Hollis, Roger P; Kohn, Donald B

    2015-05-01

    Autologous hematopoietic stem cell (HSC) gene therapy for sickle cell disease has the potential to treat this illness without the major immunological complications associated with allogeneic transplantation. However, transduction efficiency by β-globin lentiviral vectors using CD34-enriched cell populations is suboptimal and large vector production batches may be needed for clinical trials. Transducing a cell population more enriched for HSC could greatly reduce vector needs and, potentially, increase transduction efficiency. CD34(+) /CD38(-) cells, comprising ∼1%-3% of all CD34(+) cells, were isolated from healthy cord blood CD34(+) cells by fluorescence-activated cell sorting and transduced with a lentiviral vector expressing an antisickling form of beta-globin (CCL-β(AS3) -FB). Isolated CD34(+) /CD38(-) cells were able to generate progeny over an extended period of long-term culture (LTC) compared to the CD34(+) cells and required up to 40-fold less vector for transduction compared to bulk CD34(+) preparations containing an equivalent number of CD34(+) /CD38(-) cells. Transduction of isolated CD34(+) /CD38(-) cells was comparable to CD34(+) cells measured by quantitative PCR at day 14 with reduced vector needs, and average vector copy/cell remained higher over time for LTC initiated from CD34(+) /38(-) cells. Following in vitro erythroid differentiation, HBBAS3 mRNA expression was similar in cultures derived from CD34(+) /CD38(-) cells or unfractionated CD34(+) cells. In vivo studies showed equivalent engraftment of transduced CD34(+) /CD38(-) cells when transplanted in competition with 100-fold more CD34(+) /CD38(+) cells. This work provides initial evidence for the beneficial effects from isolating human CD34(+) /CD38(-) cells to use significantly less vector and potentially improve transduction for HSC gene therapy. © 2015 AlphaMed Press.

  11. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

    Science.gov (United States)

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-03-01

    comprehensive gene data set of sex pheromone biosynthesis and degradation enzyme related genes in DBM created by genome- and transcriptome-wide identification, characterization and expression profiling. Our findings provide a basis to better understand the function of genes with tissue enriched expression. The results also provide information on the genes involved in sex pheromone biosynthesis and degradation, and may be useful to identify potential gene targets for pest control strategies by disrupting the insect-insect communication using pheromone-based behavioral antagonists.

  12. Omega-3 Fatty Acid Enriched Chevon (Goat Meat) Lowers Plasma Cholesterol Levels and Alters Gene Expressions in Rats

    Directory of Open Access Journals (Sweden)

    Mahdi Ebrahimi

    2014-01-01

    In this study, control chevon (goat meat) and omega-3 fatty acid enriched chevon were obtained from goats fed a 50% oil palm frond diet and commercial goat concentrate for 100 days, respectively. Goats fed the 50% oil palm frond diet contained high amounts of α-linolenic acid (ALA) in their meat compared to goats fed the control diet. The chevon was then used to prepare two types of pellets (control or enriched chevon) that were then fed to twenty-male-four-month-old Sprague-Dawley rats (n=10 in each group) for 12 weeks to evaluate their effects on plasma cholesterol levels, tissue fatty acids, and gene expression. There was a significant increase in ALA and docosahexaenoic acid (DHA) in the muscle tissues and liver of the rats fed the enriched chevon compared with the control group. Plasma cholesterol also decreased (P<0.05) in rats fed the enriched chevon compared to the control group. The rat pellets containing enriched chevon significantly upregulated the key transcription factor PPAR-γ and downregulated SREBP-1c expression relative to the control group. The results showed that the omega-3 fatty acid enriched chevon increased the omega-3 fatty acids in the rat tissues and altered PPAR-γ and SREBP-1c genes expression.

  13. Omega-3 fatty acid enriched chevon (goat meat) lowers plasma cholesterol levels and alters gene expressions in rats.

    Science.gov (United States)

    Ebrahimi, Mahdi; Rajion, Mohamed Ali; Meng, Goh Yong; Soleimani Farjam, Abdoreza

    2014-01-01

    In this study, control chevon (goat meat) and omega-3 fatty acid enriched chevon were obtained from goats fed a 50% oil palm frond diet and commercial goat concentrate for 100 days, respectively. Goats fed the 50% oil palm frond diet contained high amounts of α-linolenic acid (ALA) in their meat compared to goats fed the control diet. The chevon was then used to prepare two types of pellets (control or enriched chevon) that were then fed to twenty-male-four-month-old Sprague-Dawley rats (n = 10 in each group) for 12 weeks to evaluate their effects on plasma cholesterol levels, tissue fatty acids, and gene expression. There was a significant increase in ALA and docosahexaenoic acid (DHA) in the muscle tissues and liver of the rats fed the enriched chevon compared with the control group. Plasma cholesterol also decreased (P < 0.05) in rats fed the enriched chevon compared to the control group. The rat pellets containing enriched chevon significantly upregulated the key transcription factor PPAR-γ and downregulated SREBP-1c expression relative to the control group. The results showed that the omega-3 fatty acid enriched chevon increased the omega-3 fatty acids in the rat tissues and altered PPAR-γ and SREBP-1c genes expression.

  14. Enrichment of deleterious variants of mitochondrial DNA polymerase gene (POLG1) in bipolar disorder.

    Science.gov (United States)

    Kasahara, Takaoki; Ishiwata, Mizuho; Kakiuchi, Chihiro; Fuke, Satoshi; Iwata, Nakao; Ozaki, Norio; Kunugi, Hiroshi; Minabe, Yoshio; Nakamura, Kazuhiko; Iwata, Yasuhide; Fujii, Kumiko; Kanba, Shigenobu; Ujike, Hiroshi; Kusumi, Ichiro; Kataoka, Muneko; Matoba, Nana; Takata, Atsushi; Iwamoto, Kazuya; Yoshikawa, Takeo; Kato, Tadafumi

    2017-08-01

    Rare missense variants, which likely account for a substantial portion of the genetic 'dark matter' for a common complex disease, are challenging because the impacts of variants on disease development are difficult to substantiate. This study aimed to examine the impacts of amino acid substitution variants in the POLG1 found in bipolar disorder, as an example and proof of concept, in three different modalities of assessment: in silico predictions, in vitro biochemical assays, and clinical evaluation. We then tested whether deleterious variants in POLG1 contributed to the genetics of bipolar disorder. We searched for variants in the POLG1 gene in 796 Japanese patients with bipolar disorder and 767 controls and comprehensively investigated all 23 identified variants in the three modalities of assessment. POLG1 encodes mitochondrial DNA polymerase and is one of the causative genes for a Mendelian-inheritance mitochondrial disease, which is occasionally accompanied by mood disorders. The healthy control data from the Tohoku Medical Megabank Organization were also employed. Although the frequency of carriers of deleterious variants varied from one method to another, every assessment achieved the same conclusion that deleterious POLG1 variants were significantly enriched in the variants identified in patients with bipolar disorder compared to those in controls. Together with mitochondrial dysfunction in bipolar disorder, the present results suggested deleterious POLG1 variants as a credible risk for the multifactorial disease. © 2016 The Authors. Psychiatry and Clinical Neurosciences published by John Wiley & Sons Australia, Ltd on behalf of Japanese Society of Psychiatry and Neurology.

  15. Neonicotinoid Insecticides Alter the Gene Expression Profile of Neuron-Enriched Cultures from Neonatal Rat Cerebellum

    Directory of Open Access Journals (Sweden)

    Junko Kimura-Kuroda

    2016-10-01

    Neonicotinoids are considered safe because of their low affinities to mammalian nicotinic acetylcholine receptors (nAChRs) relative to insect nAChRs. However, because of the importance of nAChRs in mammalian brain development, there remains a need to establish the safety of chronic neonicotinoid exposures with regards to children’s health. Here we examined the effects of long-term (14 days) and low-dose (1 μM) exposure of neuron-enriched cultures from neonatal rat cerebellum to nicotine and two neonicotinoids: acetamiprid and imidacloprid. Immunocytochemistry revealed no differences in the number or morphology of immature neurons or glial cells in any group versus untreated control cultures. However, a slight disturbance in Purkinje cell dendritic arborization was observed in the exposed cultures. Next we performed transcriptome analysis on total RNAs using microarrays, and identified significant differential expression (p < 0.05, q < 0.05, ≥1.5 fold) between control cultures and nicotine-, acetamiprid-, or imidacloprid-exposed cultures in 34, 48, and 67 genes, respectively. Common to all exposed groups were nine genes essential for neurodevelopment, suggesting that chronic neonicotinoid exposure alters the transcriptome of the developing mammalian brain in a similar way to nicotine exposure. Our results highlight the need for further careful investigations into the effects of neonicotinoids in the developing mammalian brain.

  16. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  17. Approaching the axiomatic enrichment of the Gene Ontology from a lexical perspective.

    Science.gov (United States)

    Quesada-Martínez, Manuel; Mikroyannidi, Eleni; Fernández-Breis, Jesualdo Tomás; Stevens, Robert

    2015-09-01

    The main goal of this work is to measure how lexical regularities in biomedical ontology labels can be used for the automatic creation of formal relationships between classes, and to evaluate the results of applying our approach to the Gene Ontology (GO). In recent years, we have developed a method for the lexical analysis of regularities in biomedical ontology labels, and we showed that the labels can present a high degree of regularity. In this work, we extend our method with a cross-products extension (CPE) metric, which estimates the potential interest of a specific regularity for axiomatic enrichment in the lexical analysis, using information on exact matches in external ontologies. The GO consortium recently enriched the GO by using so-called cross-product extensions. Cross-products are generated by establishing axioms that relate a given GO class with classes from the GO or other biomedical ontologies. We apply our method to the GO and study how its lexical analysis can identify and reconstruct the cross-products that are defined by the GO consortium. The labels of the GO classes are highly regular in lexical terms, and the exact matches with labels of external ontologies affect 80% of the GO classes. The CPE metric reveals that 31.48% of the classes that exhibit regularities have fragments that are classes in the two external ontologies selected for our experiment, namely, the Cell Ontology and the Chemical Entities of Biological Interest ontology, and 18.90% of them are fully decomposable into smaller parts. Our results show that the CPE metric permits our method to detect GO cross-product extensions with a mean recall of 62% and a mean precision of 28%. The study is completed with an analysis of false positives to explain this precision value. We think that our results support the claim that our lexical approach can contribute to the axiomatic enrichment of biomedical ontologies and that it can provide new insights into the engineering of

  18. Alteration of synaptic activity-regulating genes underlying functional improvement by long-term exposure to an enriched environment in the adult brain.

    Science.gov (United States)

    Lee, Min-Young; Yu, Ji Hea; Kim, Ji Yeon; Seo, Jung Hwa; Park, Eun Sook; Kim, Chul Hoon; Kim, Hyongbum; Cho, Sung-Rae

    2013-01-01

    Housing animals in an enriched environment (EE) enhances behavioral function. However, the mechanism underlying this EE-mediated functional improvement and the resultant changes in gene expression have yet to be elucidated. We attempted to investigate the underlying mechanisms associated with long-term exposure to an EE by evaluating gene expression patterns. We housed 6-week-old CD-1 (ICR) mice in standard cages or an EE comprising a running wheel, novel objects, and social interaction for 2 months. Motor and cognitive performances were evaluated using the rotarod test and passive avoidance test, and gene expression profile was investigated in the cerebral hemispheres using microarray and gene set enrichment analysis (GSEA). In behavioral assessment, an EE significantly enhanced rotarod performance and short-term working memory. Microarray analysis revealed that genes associated with neuronal activity were significantly altered by an EE. GSEA showed that genes involved in synaptic transmission and postsynaptic signal transduction were globally upregulated, whereas those associated with reuptake by presynaptic neurotransmitter transporters were downregulated. In particular, both microarray and GSEA demonstrated that EE exposure increased opioid signaling, acetylcholine release cycle, and postsynaptic neurotransmitter receptors but decreased Na+ / Cl- -dependent neurotransmitter transporters, including dopamine transporter Slc6a3 in the brain. Western blotting confirmed that SLC6A3, DARPP32 (PPP1R1B), and P2RY12 were largely altered in a region-specific manner. An EE enhanced motor and cognitive function through the alteration of synaptic activity-regulating genes, improving the efficient use of neurotransmitters and synaptic plasticity by the upregulation of genes associated with postsynaptic receptor activity and downregulation of presynaptic reuptake by neurotransmitter transporters.

  19. Microarray analysis identifies a common set of cellular genes modulated by different HCV replicon clones

    Directory of Open Access Journals (Sweden)

    Gerosolimo Germano

    2008-06-01

    Background: Hepatitis C virus (HCV) RNA synthesis and protein expression affect cell homeostasis by modulation of gene expression. The impact of HCV replication on global cell transcription has not been fully evaluated. Thus, we analysed the expression profiles of different clones of human hepatoma-derived Huh-7 cells carrying a self-replicating HCV RNA which express all viral proteins (HCV replicon system). Results: First, we compared the expression profile of HCV replicon clone 21-5 with both the Huh-7 parental cells and the 21-5 cured (21-5c) cells. In the latter, the HCV RNA has been eliminated by IFN-α treatment. To confirm the data, we also analyzed microarray results from both the 21-5 and two other HCV replicon clones, 22-6 and 21-7, compared to the Huh-7 cells. The study was carried out using the Applied Biosystems (AB) Human Genome Survey Microarray v1.0, which provides 31,700 probes that correspond to 27,868 human genes. Microarray analysis revealed a specific transcriptional program induced by HCV in replicon cells with respect to both IFN-α-cured and Huh-7 cells. From the original datasets of differentially expressed genes, we selected by Venn diagrams a final list of 38 genes modulated by HCV in all clones. Most of the 38 genes have never been described before and showed high fold-change associated with significant p-value, strongly supporting data reliability. Classification of the 38 genes by Panther System identified functional categories that were significantly enriched in this gene set, such as histones and ribosomal proteins as well as extracellular matrix and intracellular protein traffic. The dataset also included new genes involved in lipid metabolism, extracellular matrix and cytoskeletal network, which may be critical for HCV replication and pathogenesis. Conclusion: Our data provide a comprehensive analysis of alterations in gene expression induced by HCV replication and reveal modulation of new genes potentially useful
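
    The Venn-diagram selection described in this record amounts to intersecting the lists of differentially expressed genes obtained from each comparison. A minimal sketch, assuming each comparison yields a set of gene symbols; the gene names and the helper `common_genes` are placeholders and not data from the study.

```python
from functools import reduce


def common_genes(gene_lists):
    """Return the genes present in every list (the centre of the Venn diagram)."""
    sets = [set(genes) for genes in gene_lists]
    if not sets:
        return set()
    return reduce(set.intersection, sets)


# Placeholder differentially expressed genes per replicon clone vs. Huh-7.
de_by_clone = {
    "21-5": {"GENE_A", "GENE_B", "GENE_C"},
    "22-6": {"GENE_B", "GENE_C", "GENE_D"},
    "21-7": {"GENE_C", "GENE_B", "GENE_E"},
}

shared = common_genes(de_by_clone.values())
print(sorted(shared))
```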

  20. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer.

    Science.gov (United States)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC. Copyright © 2013 Elsevier Inc. All rights reserved.

  1. Enriching the ECSI model using brand strength in the retail setting

    Directory of Open Access Journals (Sweden)

    Paraskevi Sarantidou

    2017-10-01

    Purpose - The purpose of this paper is to investigate the role of the retailer’s brand strength as a potential predictor of loyalty. It also examines the role of customer satisfaction (CS) in the retailer’s loyalty as well as its impact on the retailer’s brand strength. Design/methodology/approach - The study was conducted in the grocery context and in a market under recession using the European Customer Satisfaction Index (ECSI) model. Data were collected through a telephone survey from 2,000 participants responsible for the household grocery shopping with a quota of 250 respondents from each of the leading grocery retailers in Greece. A formative measurement model was developed and the collected data were analyzed using partial least squares path modeling. Findings - The findings revealed that the strength of the retailer’s brand and CS influence retail loyalty and that brand strength mediates the effect of CS on loyalty. Results also suggested that the expectations and the perceptions toward the retailer’s product offering are the most important drivers of CS and loyalty. Thus, the study has proved the importance of the functional store attributes to CS and loyalty in the grocery store setting. Originality/value - Research examining the suitability of the ECSI model in the grocery setting and in a market under economic crisis is scarce. This paper addresses these shortcomings by examining a customer loyalty model which incorporates the brand strength construct and investigates the role of brand strength as a potential predictor of loyalty as well as the role of CS in the brand strength and loyalty.

  2. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    International Nuclear Information System (INIS)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-01-01

    Highlights:
    • Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors.
    • Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer.
    • In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer.
    • Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients.

    Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.

  3. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    Energy Technology Data Exchange (ETDEWEB)

    Pandi, Narayanan Sathiya, E-mail: sathiyapandi@gmail.com; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Highlights:
    • Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors.
    • Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer.
    • In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer.
    • Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients.

    Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.

  4. Identification and functional analysis of endothelial tip cell-enriched genes.

    Science.gov (United States)

    del Toro, Raquel; Prahst, Claudia; Mathivet, Thomas; Siegfried, Geraldine; Kaminker, Joshua S; Larrivee, Bruno; Breant, Christiane; Duarte, Antonio; Takakura, Nobuyuki; Fukamizu, Akiyoshi; Penninger, Josef; Eichmann, Anne

    2010-11-11

    Sprouting of developing blood vessels is mediated by specialized motile endothelial cells localized at the tips of growing capillaries. Following behind the tip cells, endothelial stalk cells form the capillary lumen and proliferate. Expression of the Notch ligand Delta-like-4 (Dll4) in tip cells suppresses tip cell fate in neighboring stalk cells via Notch signaling. In DLL4(+/-) mouse mutants, most retinal endothelial cells display morphologic features of tip cells. We hypothesized that these mouse mutants could be used to isolate tip cells and so to determine their genetic repertoire. Using transcriptome analysis of retinal endothelial cells isolated from DLL4(+/-) and wild-type mice, we identified 3 clusters of tip cell-enriched genes, encoding extracellular matrix degrading enzymes, basement membrane components, and secreted molecules. Secreted molecules endothelial-specific molecule 1, angiopoietin 2, and apelin bind to cognate receptors on endothelial stalk cells. Knockout mice and zebrafish morpholino knockdown of apelin showed delayed angiogenesis and reduced proliferation of stalk cells expressing the apelin receptor APJ. Thus, tip cells may regulate angiogenesis via matrix remodeling, production of basement membrane, and release of secreted molecules, some of which regulate stalk cell behavior.

  5. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene.

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  6. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Directory of Open Access Journals (Sweden)

    Monica Casucci

    2018-03-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  7. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting. PMID:29619024

  8. Identification of Genes Enriched in GnRH Neurons by Translating Ribosome Affinity Purification and RNAseq in Mice.

    Science.gov (United States)

    Burger, Laura L; Vanacker, Charlotte; Phumsatitpong, Chayarndorn; Wagenmaker, Elizabeth R; Wang, Luhong; Olson, David P; Moenter, Suzanne M

    2018-04-01

    Gonadotropin-releasing hormone (GnRH) neurons are a nexus of fertility regulation. We used translating ribosome affinity purification coupled with RNA sequencing to examine messenger RNAs of GnRH neurons in adult intact and gonadectomized (GDX) male and female mice. GnRH neuron ribosomes were tagged with green fluorescent protein (GFP) and GFP-labeled polysomes isolated by immunoprecipitation, producing one RNA fraction enhanced for GnRH neuron transcripts and one RNA fraction depleted. Complementary DNA libraries were created from each fraction and 50-base, paired-end sequencing done and differential expression (enhanced fraction/depleted fraction) determined with a threshold of >1.5- or <0.66-fold (false discovery rate P ≤ 0.05). A core of ∼840 genes was differentially expressed in GnRH neurons in all treatments, including enrichment for Gnrh1 (∼40-fold), and genes critical for GnRH neuron and/or gonadotrope development. In contrast, non-neuronal transcripts were not enriched or were de-enriched. Several epithelial markers were also enriched, consistent with the olfactory epithelial origins of GnRH neurons. Interestingly, many synaptic transmission pathways were de-enriched, in accordance with relatively low innervation of GnRH neurons. The most striking difference between intact and GDX mice of both sexes was a marked downregulation of genes associated with oxidative phosphorylation and upregulation of glucose transporters in GnRH neurons from GDX mice. This may suggest that GnRH neurons switch to an alternate fuel to increase adenosine triphosphate production in the absence of negative feedback when GnRH release is elevated. Knowledge of the GnRH neuron translatome and its regulation can guide functional studies and can be extended to disease states, such as polycystic ovary syndrome.
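
    The differential-expression rule stated in this record (enhanced/depleted fold change >1.5 or <0.66, with false discovery rate P ≤ 0.05) can be written as a small classifier. This is a sketch of the thresholding step only, not the authors' pipeline; the function name `classify_enrichment`, its parameter names, and the label strings are assumptions made for illustration.

```python
def classify_enrichment(fold_change, fdr, up=1.5, down=0.66, alpha=0.05):
    """Label a gene from its enhanced/depleted fold change and FDR-adjusted P value.

    Thresholds default to the values quoted in the abstract; the labels are
    illustrative choices, not terminology fixed by the study.
    """
    if fdr > alpha:
        return "not significant"
    if fold_change > up:
        return "enriched"
    if fold_change < down:
        return "de-enriched"
    return "unchanged"


# Illustrative inputs: a ~40-fold enrichment, as reported for Gnrh1,
# plus made-up values covering the other branches.
print(classify_enrichment(40.0, 0.001))  # enriched
print(classify_enrichment(0.5, 0.01))    # de-enriched
print(classify_enrichment(1.2, 0.01))    # unchanged
print(classify_enrichment(3.0, 0.20))    # not significant
```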

  9. Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    International Nuclear Information System (INIS)

    Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

    2007-01-01

    Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients

  10. A novel CpG island set identifies tissue-specific methylation at developmental gene loci.

    Directory of Open Access Journals (Sweden)

    Robert Illingworth

    2008-01-01

    CpG islands (CGIs) are dense clusters of CpG sequences that punctuate the CpG-deficient human genome and associate with many gene promoters. As CGIs also differ from bulk chromosomal DNA by their frequent lack of cytosine methylation, we devised a CGI enrichment method based on nonmethylated CpG affinity chromatography. The resulting library was sequenced to define a novel human blood CGI set that includes many that are not detected by current algorithms. Approximately half of CGIs were associated with annotated gene transcription start sites, the remainder being intra- or intergenic. Using an array representing over 17,000 CGIs, we established that 6%-8% of CGIs are methylated in genomic DNA of human blood, brain, muscle, and spleen. Inter- and intragenic CGIs are preferentially susceptible to methylation. CGIs showing tissue-specific methylation were overrepresented at numerous genetic loci that are essential for development, including HOX and PAX family members. The findings enable a comprehensive analysis of the roles played by CGI methylation in normal and diseased human tissues.

  11. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  12. Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

    Directory of Open Access Journals (Sweden)

    Bharti Arvind K

    2008-12-01

    Background: Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results: A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion: MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of

  13. The Dynamics of Visual Art Dialogues: Experiences to Be Used in Hospital Settings with Visual Art Enrichment

    Directory of Open Access Journals (Sweden)

    Britt-Maj Wikström

    2011-01-01

    Objectives. Given that hospitals have environmental enrichment with paintings and visual art arrangement, it would be meaningful to develop and document how hospital art could be used by health professionals. Methods. The study was undertaken at an art site in Sweden. During 1-hour sessions, participants (n = 20) met in an art gallery every second week, five times in total. Results. According to the participants, a new value was perceived. From qualitative analyses, three themes appeared: raise association, mentally present, and door-opener. In addition, 72% of the participants reported "makes me happy and gives energy and inspiration", and 52% reported that dialogues increase inspiration, make you involved, and stimulate curiosity. Conclusion. The present study supported the view that visual art dialogue could be used by health care professionals in a structured manner and that meaningful art stimulation, related to a person’s experiences, could be of importance for the patients. Implementing art dialogues in hospital settings could be a fruitful working tool for nurses, a complementary manner of patient communication.

  14. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior.

    Science.gov (United States)

    Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J

    2016-08-01

    In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene-set

  15. Metagenomic survey of methanesulfonic acid (MSA) catabolic genes in an Atlantic Ocean surface water sample and in a partial enrichment

    Directory of Open Access Journals (Sweden)

    Ana C. Henriques

    2016-10-01

    Methanesulfonic acid (MSA) is a relevant intermediate of the biogeochemical cycle of sulfur, and environmental microorganisms assume an important role in the mineralization of this compound. Several methylotrophic bacterial strains able to grow on MSA have been isolated from soil or marine water, and two conserved operons, msmABCD coding for MSA monooxygenase and msmEFGH coding for a transport system, have been repeatedly encountered in most of these strains. Homologous sequences have also been amplified directly from the environment or observed in marine metagenomic data, but these showed a base composition (G + C content) very different from their counterparts from cultivated bacteria. The aim of this study was to understand which microorganisms within the coastal surface oceanic microflora responded to MSA as a nutrient and how the community evolved in the early phases of an enrichment, by means of metagenome and gene-targeted amplicon sequencing. From the phylogenetic point of view, the community shifted significantly, with the disappearance of all signals related to the Archaea, the Pelagibacteraceae and phylum SAR406, and an increase in methylotroph-harboring taxa, accompanied by other groups so far not known to comprise methylotrophs, such as the Hyphomonadaceae. At the functional level, the abundance of several genes related to sulfur metabolism and methylotrophy increased during the enrichment, and the allelic distribution of gene msmA, diagnostic for MSA monooxygenase, altered considerably. Even more dramatic was the disappearance of the MSA import-related gene msmE, which suggests that alternative transporters must be present in the enriched community and illustrates the inadequacy of msmE as an ecofunctional marker for MSA degradation at sea.

  16. Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.

    Science.gov (United States)

    Auerbach, Raymond K; Chen, Bin; Butte, Atul J

    2013-08-01

    Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. Contact: abutte@stanford.edu. Supplementary material is available at Bioinformatics online.
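
    The abstract above does not say which statistic the tool uses to call a transcription factor "enriched" in a gene list. As a minimal sketch only, assuming a one-sided hypergeometric over-representation test (a common choice for this kind of question, not a confirmed detail of the tool), the calculation could look like this. The function name and all parameter names are hypothetical.

```python
from math import comb


def enrichment_p_value(universe: int, annotated: int, list_size: int, overlap: int) -> float:
    """One-sided hypergeometric tail probability P(X >= overlap).

    universe  -- number of genes in the background set (assumed input)
    annotated -- background genes bound by the transcription factor
    list_size -- number of genes in the user's query list
    overlap   -- query genes that are also bound by the factor
    """
    upper = min(annotated, list_size)
    total = comb(universe, list_size)
    # Sum the probability of every overlap at least as large as the observed one.
    tail = sum(
        comb(annotated, i) * comb(universe - annotated, list_size - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy numbers, chosen only for illustration.
    print(enrichment_p_value(universe=20000, annotated=500, list_size=100, overlap=10))
```

    In practice one such p-value would be computed per transcription factor and then corrected for multiple testing; that step is omitted here.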

  17. Direct cloning from enrichment cultures, a reliable strategy for isolation of complete operons and genes from microbial consortia.

    Science.gov (United States)

    Entcheva, P; Liebl, W; Johann, A; Hartsch, T; Streit, W R

    2001-01-01

    Enrichment cultures of microbial consortia enable the diverse metabolic and catabolic activities of these populations to be studied on a molecular level and to be explored as potential sources for biotechnology processes. We have used a combined approach of enrichment culture and direct cloning to construct cosmid libraries with large (>30-kb) inserts from microbial consortia. Enrichment cultures were inoculated with samples from five environments, and high amounts of avidin were added to the cultures to favor growth of biotin-producing microbes. DNA was extracted from three of these enrichment cultures and used to construct cosmid libraries; each library consisted of between 6,000 and 35,000 clones, with an average insert size of 30 to 40 kb. The inserts contained a diverse population of genomic DNA fragments isolated from the consortia organisms. These three libraries were used to complement the Escherichia coli biotin auxotrophic strain ATCC 33767 Delta(bio-uvrB). Initial screens resulted in the isolation of seven different complementing cosmid clones, carrying biotin biosynthesis operons. Biotin biosynthesis capabilities and growth under defined conditions of four of these clones were studied. Biotin measured in the different culture supernatants ranged from 42 to 3,800 pg/ml/optical density unit. Sequencing the identified biotin synthesis genes revealed high similarities to bio operons from gram-negative bacteria. In addition, random sequencing identified other interesting open reading frames, as well as two operons, the histidine utilization operon (hut), and the cluster of genes involved in biosynthesis of molybdopterin cofactors in bacteria (moaABCDE).

  18. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior

    NARCIS (Netherlands)

    J. Windhorst (Judith); V. Mileva-Seitz (Viara); R.C.A. Rippe (Ralph C.A.); H.W. Tiemeier (Henning); V.W.V. Jaddoe (Vincent); F.C. Verhulst (Frank); M.H. van IJzendoorn (Rien); M.J. Bakermans-Kranenburg (Marian)

    2016-01-01

    Background: In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and

  19. Toxoplasmosis and Polygenic Disease Susceptibility Genes: Extensive Toxoplasma gondii Host/Pathogen Interactome Enrichment in Nine Psychiatric or Neurological Disorders

    Directory of Open Access Journals (Sweden)

    C. J. Carter

    2013-01-01

    Toxoplasma gondii is not only implicated in schizophrenia and related disorders, but also in Alzheimer's or Parkinson's disease, cancer, cardiac myopathies, and autoimmune disorders. During its life cycle, the pathogen interacts with ~3000 host genes or proteins. Susceptibility genes for multiple sclerosis, Alzheimer's disease, schizophrenia, bipolar disorder, depression, childhood obesity, Parkinson's disease, attention deficit hyperactivity disorder, and autism, but not anorexia or chronic fatigue, are highly enriched in the human arm of this interactome, and 18% (ADHD) to 33% (MS) of the susceptibility genes relate to it. The signalling pathways involved in the susceptibility gene/interactome overlaps are relatively specific and relevant to each disease, suggesting a means whereby susceptibility genes could orient the attentions of a single pathogen towards disruption of the specific pathways that together contribute (positively or negatively) to the endophenotypes of different diseases. Conditional protein knockdown, orchestrated by T. gondii proteins or antibodies binding to those of the host (pathogen-derived autoimmunity), and metabolite exchange may contribute to this disruption. Susceptibility genes may thus be related to the causes and influencers of disease, rather than (and as well as) to the disease itself.

  20. Genome-Wide Association Studies Suggest Limited Immune Gene Enrichment in Schizophrenia Compared to 5 Autoimmune Diseases

    DEFF Research Database (Denmark)

    Pouget, Jennie G; Gonçalves, Vanessa F; Spain, Sarah L

    2016-01-01

    There has been intense debate over the immunological basis of schizophrenia, and the potential utility of adjunct immunotherapies. The major histocompatibility complex is consistently the most powerful region of association in genome-wide association studies (GWASs) of schizophrenia and has been...... in immune genes contributes to schizophrenia. We show that there is no enrichment of immune loci outside of the MHC region in the largest genetic study of schizophrenia conducted to date, in contrast to 5 diseases of known immune origin. Among 108 regions of the genome previously associated...

  1. Transcriptome and Gene Ontology (GO) Enrichment Analysis Reveals Genes Involved in Biotin Metabolism That Affect L-Lysine Production in Corynebacterium glutamicum.

    Science.gov (United States)

    Kim, Hong-Il; Kim, Jong-Hyeon; Park, Young-Jin

    2016-03-09

    Corynebacterium glutamicum is widely used for amino acid production. In the present study, 543 genes showed a significant change in their mRNA expression levels in L-lysine-producing C. glutamicum ATCC21300 compared with the wild-type C. glutamicum ATCC13032. Among these 543 differentially expressed genes (DEGs), 28 genes were up- or downregulated. In addition, 454 DEGs were functionally enriched and categorized based on BLAST sequence homologies and gene ontology (GO) annotations using the Blast2GO software. Interestingly, NCgl0071 (bioB, encoding biotin synthase) was expressed at levels ~20-fold higher in the L-lysine-producing ATCC21300 strain than in the wild-type ATCC13032 strain. Five other genes involved in biotin metabolism or transport, namely NCgl2515 (bioA, encoding adenosylmethionine-8-amino-7-oxononanoate aminotransferase), NCgl2516 (bioD, encoding dithiobiotin synthetase), NCgl1883, NCgl1884, and NCgl1885, were also expressed at significantly higher levels in the L-lysine-producing ATCC21300 strain than in the wild-type ATCC13032 strain, as determined using both next-generation RNA sequencing and quantitative real-time PCR analysis. When we disrupted the bioB gene in C. glutamicum ATCC21300, L-lysine production decreased by approximately 76%, and the three genes involved in biotin transport (NCgl1883, NCgl1884, and NCgl1885) were significantly downregulated. These results will help to improve our understanding of C. glutamicum for industrial amino acid production.

  2. Candidate genes for COPD in two large data sets.

    Science.gov (United States)

    Bakke, P S; Zhu, G; Gulsvik, A; Kong, X; Agusti, A G N; Calverley, P M A; Donner, C F; Levy, R D; Make, B J; Paré, P D; Rennard, S I; Vestbo, J; Wouters, E F M; Anderson, W; Lomas, D A; Silverman, E K; Pillai, S G

    2011-02-01

    Lack of reproducibility of findings has been a criticism of genetic association studies on complex diseases, such as chronic obstructive pulmonary disease (COPD). We selected 257 polymorphisms of 16 genes with reported or potential relationships to COPD and genotyped these variants in a case-control study that included 953 COPD cases and 956 control subjects. We explored the association of these polymorphisms to three COPD phenotypes: a COPD binary phenotype and two quantitative traits (post-bronchodilator forced expiratory volume in 1 s (FEV₁) % predicted and FEV₁/forced vital capacity (FVC)). The polymorphisms significantly associated to these phenotypes in this first study were tested in a second, family-based study that included 635 pedigrees with 1,910 individuals. Significant associations to the binary COPD phenotype in both populations were seen for STAT1 (rs13010343) and NFKBIB/SIRT2 (rs2241704) (p<0.05). Single-nucleotide polymorphisms rs17467825 and rs1155563 of the GC gene were significantly associated with FEV₁ % predicted and FEV₁/FVC, respectively, in both populations (p<0.05). This study has replicated associations to COPD phenotypes in the STAT1, NFKBIB/SIRT2 and GC genes in two independent populations, the associations of the former two genes representing novel findings.

  3. Cardiovascular risk and lifestyle habits of consumers of a phytosterol-enriched yogurt in a real-life setting.

    Science.gov (United States)

    Paillard, F; Bruckert, E; Naelten, G; Picard, P; van Ganse, E

    2015-06-01

    Data on the characteristics of consumers of phytosterol-enriched products and modalities of consumption are rare. An observational study evaluating the lifestyle characteristics and cardiovascular risk (CVR) profile of phytosterol-enriched yogurt consumers was performed in France. Subjects were recruited from general practitioners via electronic medical records. Data were obtained from 358 consumers and 422 nonconsumers with 519 subject questionnaires (243 consumers, 276 nonconsumers; 67% response). Consumers had more cardiovascular risk factors than nonconsumers (2.0 ± 1.5 versus 1.6 ± 1.4; P Phytosterol-enriched yogurt intake conformed to recommendations in two-thirds of consumers and was mainly consumed because of concerns over cholesterol levels and CVR. The higher cardiovascular disease risk profile of phytosterol-enriched yogurt consumers corresponds to a population for whom European guidelines recommend lifestyle changes to manage cholesterol. The coherence of the data in terms of risk factors, adherence to lifestyle recommendations and the consumption of phytosterol-enriched yogurt conforming to recommendations reflects a health-conscious consumer population. © 2014 The British Dietetic Association Ltd.

  4. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    Gene expression data are high-dimensional with small sample sizes, which makes the classification task challenging, so feature (gene) selection is needed. Selecting meaningful and relevant genes for a classifier not only decreases computational time and cost but also improves classification performance. However, most existing feature selection methods suffer from problems such as lack of robustness and validation issues. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods: We used a leukemia gene expression dataset [1]. The effectiveness of the selected features was evaluated with four classification methods: support vector machines, k-nearest neighbor, random forest, and linear discriminant analysis. The method evaluates the importance and relevance of each gene cluster by summing the expression levels of the genes belonging to that cluster. A gene cluster is considered important if it satisfies conditions that depend on thresholds and a percentage; otherwise it is eliminated. Results: Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a); after applying our feature selection methodology we ended up with 1117 specific genes discriminating two classes of leukemia (Fig. 15b). Applying the same method with more stringent conditions (a higher positive and a lower negative threshold) reduced the number to 58 genes, which were tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions: The feature selection method gave good results with minimum classification error. Our heat map shows a distinct pattern of the refined genes discriminating between the two classes of leukemia.
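
    The cluster-scoring step described above can be sketched as follows. The abstract gives only the outline (sum expression per cluster, then keep or eliminate by thresholds and a percentage), so the exact rule here is an assumption: a cluster is kept when the fraction of samples whose summed score is at or beyond a positive or negative threshold reaches a minimum fraction. All function and parameter names are hypothetical.

```python
from typing import Dict, List


def cluster_scores(expression: Dict[str, List[float]], genes: List[str]) -> List[float]:
    """Sum the expression of all genes in one cluster, sample by sample."""
    n_samples = len(next(iter(expression.values())))
    return [sum(expression[g][s] for g in genes) for s in range(n_samples)]


def select_clusters(
    expression: Dict[str, List[float]],
    clusters: Dict[str, List[str]],
    pos_threshold: float,
    neg_threshold: float,
    min_fraction: float,
) -> List[str]:
    """Keep clusters whose summed score is extreme in enough samples.

    ASSUMED rule (not given in the abstract): a sample counts as extreme when
    its cluster score is >= pos_threshold or <= neg_threshold, and a cluster is
    kept when at least min_fraction of the samples are extreme.
    """
    kept = []
    for cluster_id, genes in clusters.items():
        scores = cluster_scores(expression, genes)
        extreme = sum(1 for x in scores if x >= pos_threshold or x <= neg_threshold)
        if extreme / len(scores) >= min_fraction:
            kept.append(cluster_id)
    return kept


if __name__ == "__main__":
    # Toy data: four samples, four genes, two gene clusters.
    expr = {
        "g1": [2.0, 2.5, -0.1, 3.0],
        "g2": [1.5, 1.0, 0.2, 2.0],
        "g3": [0.1, -0.2, 0.0, 0.1],
        "g4": [-0.1, 0.3, 0.1, -0.2],
    }
    groups = {"A": ["g1", "g2"], "B": ["g3", "g4"]}
    print(select_clusters(expr, groups, 3.0, -3.0, 0.5))
```

    Raising the positive threshold and lowering the negative one makes the filter stricter, which mirrors the reduction from 1117 to 58 genes reported in the abstract.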

  5. Microbial functional genes enriched in the Xiangjiang River sediments with heavy metal contamination.

    Science.gov (United States)

    Jie, Shiqi; Li, Mingming; Gan, Min; Zhu, Jianyu; Yin, Huaqun; Liu, Xueduan

    2016-08-08

    The Xiangjiang River (Hunan, China) has been contaminated with heavy metals for several decades by surrounding factories. However, little is known about the influence of a gradient of heavy metal contamination on the diversity and structure of microbial functional genes in sediment. To better understand the impact of heavy metal contamination on the microbial community, a comprehensive functional gene array (GeoChip 5.0) was used to study the functional gene structure, composition, diversity and metabolic potential of the microbial community at three heavy metal polluted sites of the Xiangjiang River. A total of 25595 functional genes involved in different biogeochemical processes were detected across the three sites, and different diversities and structures of microbial functional genes were observed. The analysis of gene overlap, unique genes, and various diversity indices indicated a significant correlation between the level of heavy metal contamination and functional diversity. Many resistance genes related to various metals, such as copper, arsenic, chromium and mercury, were detected. The results indicated a significantly higher abundance of genes involved in metal resistance, such as the cueo, mer, metc, merb, tehb and terc genes, as well as sulfate reduction genes (dsr), at the site with the most serious heavy metal contamination. With regard to the relationship between environmental variables and microbial functional structure, S, Cu, Cd, Hg and Cr were the dominant factors shaping the microbial distribution pattern at the three sites. This study suggests that a high level of heavy metal contamination resulted in higher functional diversity and a higher abundance of metal resistance genes. These variations therefore contribute significantly to the resistance, resilience and stability of the microbial community subjected to the gradient of heavy metal contamination in the Xiangjiang River.

  6. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants.

    Science.gov (United States)

    Zhu, Xinyu; Chen, Caoyi; Wang, Baohua

    2012-04-01

    Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we performed a comprehensive evolutionary analysis of this gene family in 13 major representatives of green plants. A novel clustering (here named the cpTrx clade), which included the previously resolved III-1, III-2, and III-4 orthologous groups, was identified. Our analysis showed that plant Trx proteins possess a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were integrated early into the primordial SET-PostSET domain organization of the cpTrx clade. We suggest that the PostSET domain was lost in some members of the III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from Physcomitrella patens (moss) were identified; supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions are described, contributing to the design of further experimental studies.

  7. Annotating gene sets by mining large literature collections with protein networks.

    Science.gov (United States)

    Wang, Sheng; Ma, Jianzhu; Yu, Michael Ku; Zheng, Fan; Huang, Edward W; Han, Jiawei; Peng, Jian; Ideker, Trey

    2018-01-01

    Analysis of patient genomes and transcriptomes routinely recognizes new gene sets associated with human disease. Here we present an integrative natural language processing system which infers common functions for a gene set through automatic mining of the scientific literature with biological networks. This system links genes with associated literature phrases and combines these links with protein interactions in a single heterogeneous network. Multiscale functional annotations are inferred based on network distances between phrases and genes and then visualized as an ontology of biological concepts. To evaluate this system, we predict functions for gene sets representing known pathways and find that our approach achieves substantial improvement over the conventional text-mining baseline method. Moreover, our system discovers novel annotations for gene sets or pathways without previously known functions. Two case studies demonstrate how the system is used in discovery of new cancer-related pathways with ontological annotations.

  8. Laser capture microdissection of enriched populations of neurons or single neurons for gene expression analysis after traumatic brain injury.

    Science.gov (United States)

    Boone, Deborah R; Sell, Stacy L; Hellmich, Helen Lee

    2013-04-10

    Long-term cognitive disability after TBI is associated with injury-induced neurodegeneration in the hippocampus, a region in the medial temporal lobe that is critical for learning, memory and executive function. Hence our studies focus on gene expression analysis of specific neuronal populations in distinct subregions of the hippocampus. The technique of laser capture microdissection (LCM), introduced in 1996 by Emmert-Buck, et al., has allowed for significant advances in gene expression analysis of single cells and enriched populations of cells from heterogeneous tissues such as the mammalian brain, which contains thousands of functional cell types. We use LCM and a well-established rat model of traumatic brain injury (TBI) to investigate the molecular mechanisms that underlie the pathogenesis of TBI. Following fluid-percussion TBI, brains are removed at pre-determined times post-injury, immediately frozen on dry ice, and prepared for sectioning in a cryostat. The rat brains can be embedded in OCT and sectioned immediately, or stored for several months at -80 °C before sectioning for laser capture microdissection. Additionally, we use LCM to study the effects of TBI on circadian rhythms. For this, we capture neurons from the suprachiasmatic nuclei, which contain the master clock of the mammalian brain. Here, we demonstrate the use of LCM to obtain single identified neurons (injured and degenerating, Fluoro-Jade-positive, or uninjured, Fluoro-Jade-negative) and enriched populations of hippocampal neurons for subsequent gene expression analysis by real-time PCR and/or whole-genome microarrays. These LCM-enabled studies have revealed that the selective vulnerability of anatomically distinct regions of the rat hippocampus is reflected in the different gene expression profiles of different populations of neurons obtained by LCM from these distinct regions. The results from our single-cell studies, where we compare the transcriptional profiles of dying and adjacent surviving

  9. The Gene Ontology Differs in Bursa of Fabricius Between Two Breeds of Ducks Post Hatching by Enriching the Differentially Expressed Genes

    Directory of Open Access Journals (Sweden)

    H Liu

    The bursa of Fabricius (BF) is the central humoral immune organ unique to birds. The present study investigated possible differences at the molecular level between two duck breeds. Digital gene expression profiling (DGE) technology was used to identify the differentially expressed genes (DEGs) in the BF between the Jianchang and Nonghua-P strains of ducks. DGE data identified 195 DEGs in the bursa. Gene Ontology (GO) analysis suggested that the DEGs were mainly enriched in metabolic pathways and ribosome components. Pathway analysis identified the spliceosome, RNA transport, RNA degradation process, Jak-STAT signaling pathway, TNF signaling pathway and B cell receptor signaling pathway. The results indicated that the main difference in the BF between the two duck strains was in the capabilities of protein formation and B cell development. These data reveal the main divergence in the BF at the molecular level between genetically different duck breeds and may help in carrying out molecular breeding programs in poultry in the future.

  10. Flavanol-Enriched Cocoa Powder Alters the Intestinal Microbiota, Tissue and Fluid Metabolite Profiles, and Intestinal Gene Expression in Pigs.

    Science.gov (United States)

    Jang, Saebyeol; Sun, Jianghao; Chen, Pei; Lakshman, Sukla; Molokin, Aleksey; Harnly, James M; Vinyard, Bryan T; Urban, Joseph F; Davis, Cindy D; Solano-Aguilar, Gloria

    2016-04-01

    Consumption of cocoa-derived polyphenols has been associated with several health benefits; however, their effects on the intestinal microbiome and related features of host intestinal health are not adequately understood. The objective of this study was to determine the effects of eating flavanol-enriched cocoa powder on the composition of the gut microbiota, tissue metabolite profiles, and intestinal immune status. Male pigs (5 mo old, 28 kg mean body weight) were supplemented with 0, 2.5, 10, or 20 g flavanol-enriched cocoa powder/d for 27 d. Metabolites in serum, urine, the proximal colon contents, liver, and adipose tissue; bacterial abundance in the intestinal contents and feces; and intestinal tissue gene expression of inflammatory markers and Toll-like receptors (TLRs) were then determined. O-methyl-epicatechin-glucuronide conjugates dose-dependently increased (Pcocoa powder. The concentration of 3-hydroxyphenylpropionic acid isomers in urine decreased as the dose of cocoa powder fed to pigs increased (75-85%,Pcocoa powder/d, respectively. Moreover, consumption of cocoa powder reduced TLR9 gene expression in ileal Peyer's patches (67-80%,Pcocoa powder/d compared with pigs not supplemented with cocoa powder. This study demonstrates that consumption of cocoa powder by pigs can contribute to gut health by enhancing the abundance of Lactobacillus and Bifidobacterium species and modulating markers of localized intestinal immunity. © 2016 American Society for Nutrition.

  11. Science Teaching Experiences in Informal Settings: One Way to Enrich the Preparation Program for Preservice Science Teachers

    Science.gov (United States)

    Hsu, Pei-Ling

    2016-01-01

    The high attrition rate of new science teachers demonstrates the urgent need to incorporate effective practices in teacher preparation programs to better equip preservice science teachers. The purpose of the study is to demonstrate a way to enrich preservice science teachers' preparation by incorporating informal science teaching practice into…

  12. Two new loci and gene sets related to sex determination and cancer progression are associated with susceptibility to testicular germ cell tumor.

    Science.gov (United States)

    Kristiansen, Wenche; Karlsson, Robert; Rounge, Trine B; Whitington, Thomas; Andreassen, Bettina K; Magnusson, Patrik K; Fosså, Sophie D; Adami, Hans-Olov; Turnbull, Clare; Haugen, Trine B; Grotmol, Tom; Wiklund, Fredrik

    2015-07-15

    Genome-wide association (GWA) studies have reported 19 distinct susceptibility loci for testicular germ cell tumor (TGCT). A GWA study for TGCT was performed by genotyping 610 240 single-nucleotide polymorphisms (SNPs) in 1326 cases and 6687 controls from Sweden and Norway. No novel genome-wide significant associations were observed in this discovery stage. We put forward 27 SNPs from 15 novel regions and 12 SNPs previously reported, for replication in 710 case-parent triads and 289 cases and 290 controls. Predefined biological pathways and processes, in addition to a custom-built sex-determination gene set, were subject to enrichment analyses using Meta-Analysis Gene Set Enrichment of Variant Associations (MAGENTA) and Improved Gene Set Enrichment Analysis for Genome-wide Association Study (i-GSEA4GWAS). In the combined meta-analysis, we observed genome-wide significant association for rs7501939 on chromosome 17q12 (OR = 0.78, 95% CI = 0.72-0.84, P = 1.1 × 10^-9) and rs2195987 on chromosome 19p12 (OR = 0.76, 95% CI: 0.69-0.84, P = 3.2 × 10^-8). The marker rs7501939 on chromosome 17q12 is located in an intron of the HNF1B gene, encoding a member of the homeodomain-containing superfamily of transcription factors. The sex-determination gene set (false discovery rate, FDRM cancer and apoptosis, was associated with TGCT (FDR utero are implicated in the development of TGCT. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  13. Improving enrichment of circulating fetal DNA for genetic testing: size fractionation followed by whole gene amplification.

    Science.gov (United States)

    Jorgez, Carolina J; Bischoff, Farideh Z

    2009-01-01

    Among the pitfalls of using cell-free fetal DNA in plasma for prenatal diagnosis is quality of the recovered DNA fragments and concomitant presence of maternal DNA (>95%). Our objective is to provide alternative methods for achieving enrichment and high-quality fetal DNA from plasma. Cell-free DNA from 31 pregnant women and 18 controls (10 males and 8 females) were size separated using agarose gel electrophoresis. DNA fragments of 100-300, 500-700 and 1,500-2,000 bp were excised and extracted, followed by whole genome amplification (WGA) of recovered fragments. Levels of beta-globin and DYS1 were measured. Distribution of beta-globin size fragments was similar among pregnant women and controls. Among control male cases, distribution of size fragments was the same for both beta-globin and DYS1. Among maternal cases confirmed to be male, the smallest size fragment (100-300 bp) accounted for nearly 50% (39.76 +/- 17.55%) of the recovered DYS1-DNA (fetal) and only 10% (10.40 +/- 6.49%) of beta-globin (total) DNA. After WGA of plasma fragments from pregnant women, DYS1 sequence amplification was best observed when using the 100-300 bp fragments as template. Combination of electrophoresis for size separation and WGA led to enriched fetal DNA from plasma. This novel combination of strategies is more likely to permit universal clinical applications of cell-free fetal DNA. Copyright 2009 S. Karger AG, Basel.

  14. Improving functional modules discovery by enriching interaction networks with gene profiles

    KAUST Repository

    Salem, Saeed; Alroobi, Rami; Banitaan, Shadi; Seridi, Loqmane; Aljarah, Ibrahim; Brewer, James

    2013-01-01

    networks. We demonstrate the effectiveness of CLARM on Yeast and Human interaction datasets, and gene expression and molecular function profiles. Experiments on these real datasets show that the CLARM approach is competitive to well established functional

  15. Transcription factor binding site enrichment analysis predicts drivers of altered gene expression in nonalcoholic steatohepatitis

    Czech Academy of Sciences Publication Activity Database

    Lake, A.D.; Chaput, A.L.; Novák, Petr; Cherrington, N.J.; Smith, C.L.

    2016-01-01

    Vol. 122, December 15 (2016), pp. 62-71. ISSN 0006-2952. Institutional support: RVO:60077344. Keywords: Transcription factor; Liver; Gene expression; Bioinformatics. Subject RIV: CE - Biochemistry. Impact factor: 4.581 (year: 2016).

  16. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues

    International Nuclear Information System (INIS)

    Ye, Mao; Sun, Mingming; Feng, Yanfang; Wan, Jinzhong; Xie, Shanni; Tian, Da; Zhao, Yu; Wu, Jun; Hu, Feng; Li, Huixin; Jiang, Xin

    2016-01-01

    Highlights: • Biochar can prevent soil sulfonamides from accumulating in lettuce tissues. • ARB enrichment in lettuce tissues decreased significantly after biochar amendment. • The impedance effect of biochar addition on soil ARGs was also quite effective. • Biochar application can be a practical strategy to protect vegetable safety. Abstract: Vegetables grown in antibiotic-polluted soil with a high abundance of antibiotic-resistant genes (ARGs) pose a potential threat to human health through the food chain, so it is urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivated with biochar amendment exhibited the greatest soil SA dissipation as well as a significant improvement in lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA-resistant bacteria nor sul genes were detected in new leaves. This is the first study to demonstrate that biochar amendment can be a practical strategy to protect the safety of lettuce grown in SA-polluted soil rich in ARB and ARGs.

  17. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues

    Energy Technology Data Exchange (ETDEWEB)

    Ye, Mao [State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008 (China); Sun, Mingming [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Feng, Yanfang, E-mail: fengyanfang@163.com [Institute of Agricultural Resources and Environment, Jiangsu Academy of Agricultural Sciences, Nanjing 210014 (China); Wan, Jinzhong [Nanjing Institute of Environmental Science, Ministry of Environmental Protection of China, Nanjing 210042 (China); Xie, Shanni; Tian, Da [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Zhao, Yu [Collaborative Innovation Center of Advanced Microstructures, Jiangsu Provincial Key Laboratory of Photonic and Electronic Materials, School of Electronic Science and Engineering, Nanjing University, Nanjing 210093 (China); Wu, Jun; Hu, Feng; Li, Huixin [Soil Ecology Lab, College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing 210095 (China); Jiang, Xin, E-mail: Jiangxin@issas.ac.cn [State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008 (China)

    2016-05-15

    Highlights: • Biochar can prevent soil sulfonamides from accumulating in lettuce tissues. • ARB enrichment in lettuce tissues decreased significantly after biochar amendment. • Impedance effect of biochar addition on soil ARGs was also quite effective. • Biochar application can be a practical strategy to protect vegetable safety. - Abstract: Considering the potential threat of vegetables growing in antibiotic-polluted soil with high abundance of antibiotic-resistant genes (ARGs) against human health through the food chain, it is thus urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivation with biochar amendment exhibited the greatest soil SA dissipation as well as the significant improvement of lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA resistant bacteria nor sul genes were detected in new leaves. It was the first study to demonstrate that biochar amendment can be a practical strategy to protect lettuce safety growing in SA-polluted soil with rich ARB and ARGs.

  18. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, D.A. van; Goeman, J.J.; Jong, E. de; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    BACKGROUND: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  19. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, van D.A.M.; Goeman, J.J.; Jong, de E.; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  20. The null hypothesis of GSEA, and a novel statistical model for competitive gene set analysis

    DEFF Research Database (Denmark)

    Debrabant, Birgit

    2017-01-01

    MOTIVATION: Competitive gene set analysis intends to assess whether a specific set of genes is more associated with a trait than the remaining genes. However, the statistical models assumed to date to underlie these methods do not enable a clear-cut formulation of the competitive null hypothesis. This is a major handicap to the interpretation of results obtained from a gene set analysis. RESULTS: This work presents a hierarchical statistical model based on the notion of dependence measures, which overcomes this problem. The two levels of the model naturally reflect the modular structure of many gene set analysis methods. We apply the model to show that the popular GSEA method, which recently has been claimed to test the self-contained null hypothesis, actually tests the competitive null if the weight parameter is zero. However, for this result to hold strictly, the choice of the dependence measures...
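    The role of the weight parameter mentioned above can be illustrated with the running-sum enrichment score as it is commonly described for GSEA. The sketch below is not the hierarchical model from this record; the function name `enrichment_score` and its arguments are hypothetical. With `weight=0` every gene in the set contributes equally, so the statistic depends only on the positions of the set members in the ranking (a Kolmogorov-Smirnov-style statistic).

```python
from typing import Sequence, Set


def enrichment_score(ranked_genes: Sequence[str],
                     scores: Sequence[float],
                     gene_set: Set[str],
                     weight: float = 1.0) -> float:
    """Running-sum enrichment score over a ranked gene list (illustrative sketch).

    ranked_genes: gene identifiers, sorted by decreasing ranking metric.
    scores:       ranking metric for each gene, in the same order.
    gene_set:     genes belonging to the set under test.
    weight:       exponent applied to |score| for set members; 0 ignores scores.
    """
    hits = [g in gene_set for g in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    # Step size for each set member; non-members get 0 here.
    hit_weights = [abs(s) ** weight if h else 0.0 for s, h in zip(scores, hits)]
    total = sum(hit_weights)
    if total == 0:
        raise ValueError("all set members have zero score; use weight=0")

    running = 0.0
    best = 0.0
    for w, h in zip(hit_weights, hits):
        # Step up at set members, step down at every other gene.
        running += w / total if h else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best
```

    For example, `enrichment_score(["a", "b", "c", "d"], [4, 3, 2, 1], {"a", "b"}, weight=0)` reaches its maximum after the second gene, and changing the scores does not alter the result as long as the ranking is unchanged.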

  1. Enrichment of HP1a on Drosophila chromosome 4 genes creates an alternate chromatin structure critical for regulation in this heterochromatic domain.

    Directory of Open Access Journals (Sweden)

    Nicole C Riddle

    2012-09-01

    Chromatin environments differ greatly within a eukaryotic genome, depending on expression state, chromosomal location, and nuclear position. In genomic regions characterized by high repeat content and high gene density, chromatin structure must silence transposable elements but permit expression of embedded genes. We have investigated one such region, chromosome 4 of Drosophila melanogaster. Using chromatin-immunoprecipitation followed by microarray (ChIP-chip) analysis, we examined enrichment patterns of 20 histone modifications and 25 chromosomal proteins in S2 and BG3 cells, as well as the changes in several marks resulting from mutations in key proteins. Active genes on chromosome 4 are distinct from those in euchromatin or pericentric heterochromatin: while there is a depletion of silencing marks at the transcription start sites (TSSs), HP1a and H3K9me3, but not H3K9me2, are enriched strongly over gene bodies. Intriguingly, genes on chromosome 4 are less frequently associated with paused polymerase. However, when the chromatin is altered by depleting HP1a or POF, the RNA pol II enrichment patterns of many chromosome 4 genes shift, showing a significant decrease over gene bodies but not at TSSs, accompanied by lower expression of those genes. Chromosome 4 genes have a low incidence of TRL/GAGA factor binding sites and a low T(m) downstream of the TSS, characteristics that could contribute to a low incidence of RNA polymerase pausing. Our data also indicate that EGG and POF jointly regulate H3K9 methylation and promote HP1a binding over gene bodies, while HP1a targeting and H3K9 methylation are maintained at the repeats by an independent mechanism. The HP1a-enriched, POF-associated chromatin structure over the gene bodies may represent one type of adaptation for genes embedded in repetitive DNA.

  2. Genome-wide survey and developmental expression mapping of zebrafish SET domain-containing genes.

    Directory of Open Access Journals (Sweden)

    Xiao-Jian Sun

    SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs) of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development.

  3. Effect of biochar amendment on the control of soil sulfonamides, antibiotic-resistant bacteria, and gene enrichment in lettuce tissues.

    Science.gov (United States)

    Ye, Mao; Sun, Mingming; Feng, Yanfang; Wan, Jinzhong; Xie, Shanni; Tian, Da; Zhao, Yu; Wu, Jun; Hu, Feng; Li, Huixin; Jiang, Xin

    2016-05-15

    Considering the potential threat of vegetables growing in antibiotic-polluted soil with high abundance of antibiotic-resistant genes (ARGs) against human health through the food chain, it is thus urgent to develop novel control technology to ensure vegetable safety. In the present work, pot experiments were conducted in lettuce cultivation to assess the impedance effect of biochar amendment on soil sulfonamides (SAs), antibiotic-resistant bacteria (ARB), and ARG enrichment in lettuce tissues. After 100 days of cultivation, lettuce cultivation with biochar amendment exhibited the greatest soil SA dissipation as well as the significant improvement of lettuce growth indices, with residual soil SAs mainly existing as the tightly bound fraction. Moreover, the SA contents in roots and new/old leaves were reduced by one to two orders of magnitude compared to those without biochar amendment. In addition, isolate counts for SA-resistant bacterial endophytes in old leaves and sul gene abundances in roots and old leaves also decreased significantly after biochar application. However, neither SA resistant bacteria nor sul genes were detected in new leaves. It was the first study to demonstrate that biochar amendment can be a practical strategy to protect lettuce safety growing in SA-polluted soil with rich ARB and ARGs. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. GOMA: functional enrichment analysis tool based on GO modules

    Institute of Scientific and Technical Information of China (English)

    Qiang Huang; Ling-Yun Wu; Yong Wang; Xiang-Sun Zhang

    2013-01-01

    Analyzing the function of gene sets is a critical step in interpreting the results of high-throughput experiments in systems biology. A variety of enrichment analysis tools have been developed in recent years, but most output a long list of significantly enriched terms that are often redundant, making it difficult to extract the most meaningful functions. In this paper, we present GOMA, a novel enrichment analysis method based on the new concept of enriched functional Gene Ontology (GO) modules. With this method, we systematically revealed functional GO modules, i.e., groups of functionally similar GO terms, via an optimization model and then ranked them by enrichment scores. Our new method simplifies enrichment analysis results by reducing redundancy, thereby preventing inconsistent enrichment results among functionally similar terms and providing more biologically meaningful results.

  5. Gene set analysis: limitations in popular existing methods and proposed improvements.

    Science.gov (United States)

    Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

    2014-10-01

    Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-value that requires large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistics is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
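    The dependence of empirical P-values on the number of permutations can be sketched as follows. This is not the mGSZ method: it is a generic gene-permutation test on the mean score of a set, and the names `empirical_p_value`, `gene_scores` and `n_perm` are hypothetical. The smallest P-value such a test can report is 1 / (n_perm + 1), which is why resolving very small P-values empirically requires many permutations.

```python
import random
from statistics import mean
from typing import Dict, Set


def empirical_p_value(gene_scores: Dict[str, float],
                      gene_set: Set[str],
                      n_perm: int = 999,
                      seed: int = 0) -> float:
    """One-sided gene-permutation P-value for the mean score of a gene set.

    Random gene sets of the same size are drawn from all scored genes, and the
    P-value is the fraction of draws whose mean is at least the observed mean,
    with the usual +1 correction so that the result is never exactly zero.
    """
    genes = list(gene_scores)
    members = [g for g in genes if g in gene_set]
    if not members:
        raise ValueError("gene set shares no genes with the scored genes")

    observed = mean(gene_scores[g] for g in members)
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible

    as_extreme = 0
    for _ in range(n_perm):
        sampled = rng.sample(genes, len(members))
        if mean(gene_scores[g] for g in sampled) >= observed:
            as_extreme += 1

    return (1 + as_extreme) / (1 + n_perm)
```

    With `n_perm=999` the result cannot fall below 0.001, whatever the data; an asymptotic approximation to the null distribution is one way around that floor.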

  6. CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

    Science.gov (United States)

    Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

    2018-03-01

    In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and

  7. Reduced expression of brain-enriched microRNAs in glioblastomas permits targeted regulation of a cell death gene.

    Directory of Open Access Journals (Sweden)

    Rebecca L Skalsky

    Glioblastoma is a highly aggressive malignant tumor involving glial cells in the human brain. We used high-throughput sequencing to comprehensively profile the small RNAs expressed in glioblastoma and non-tumor brain tissues. MicroRNAs (miRNAs) made up the large majority of small RNAs, and we identified over 400 different cellular pre-miRNAs. No known viral miRNAs were detected in any of the samples analyzed. Cluster analysis revealed several miRNAs that were significantly down-regulated in glioblastomas, including miR-128, miR-124, miR-7, miR-139, miR-95, and miR-873. Post-transcriptional editing was observed for several miRNAs, including the miR-376 family, miR-411, miR-381, and miR-379. Using the deep sequencing information, we designed a lentiviral vector expressing a cell suicide gene, the herpes simplex virus thymidine kinase (HSV-TK) gene, under the regulation of a miRNA, miR-128, that was found to be enriched in non-tumor brain tissue yet down-regulated in glioblastomas. Glioblastoma cells transduced with this vector were selectively killed when cultured in the presence of ganciclovir. Using an in vitro model to recapitulate expression of brain-enriched miRNAs, we demonstrated that neuronally differentiated SH-SY5Y cells transduced with the miRNA-regulated HSV-TK vector are protected from killing by expression of endogenous miR-128. Together, these results provide an in-depth analysis of miRNA dysregulation in glioblastoma and demonstrate the potential utility of these data in the design of miRNA-regulated therapies for the treatment of brain cancers.

  8. Improving functional modules discovery by enriching interaction networks with gene profiles

    KAUST Repository

    Salem, Saeed

    2013-05-01

    Recent advances in proteomic and transcriptomic technologies resulted in the accumulation of vast amounts of high-throughput data that span multiple biological processes and characteristics in different organisms. Much of the data come in the form of interaction networks and mRNA expression arrays. An important task in systems biology is functional module discovery, where the goal is to uncover well-connected sub-networks (modules). These discovered modules help to unravel the underlying mechanisms of the observed biological processes. While most of the existing module discovery methods use only the interaction data, in this work we propose CLARM, which discovers biological modules by incorporating gene profile data with protein-protein interaction networks. We demonstrate the effectiveness of CLARM on Yeast and Human interaction datasets, and gene expression and molecular function profiles. Experiments on these real datasets show that the CLARM approach is competitive with well-established functional module discovery methods.

  9. Gene and miRNA expression signature of Lewis lung carcinoma LLC1 cells in extracellular matrix enriched microenvironment

    International Nuclear Information System (INIS)

    Stankevicius, Vaidotas; Vasauskas, Gintautas; Bulotiene, Danute; Butkyte, Stase; Jarmalaite, Sonata; Rotomskis, Ricardas; Suziedelis, Kestutis

    2016-01-01

    The extracellular matrix (ECM), one of the key components of tumor microenvironment, has a tremendous impact on cancer development and highly influences tumor cell features. ECM affects vital cellular functions such as cell differentiation, migration, survival and proliferation. Gene and protein expression levels are regulated in cell-ECM interaction dependent manner as well. The rate of unsuccessful clinical trials, based on cell culture research models lacking the ECM microenvironment, indicates the need for alternative models and determines the shift to three-dimensional (3D) laminin rich ECM models, better simulating tissue organization. Recognized advantages of 3D models suggest the development of new anticancer treatment strategies. This is among the most promising directions of 3D cell cultures application. However, detailed analysis at the molecular level of 2D/3D cell cultures and tumors in vivo is still needed to elucidate cellular pathways most promising for the development of targeted therapies. In order to elucidate which biological pathways are altered during microenvironmental shift we have analyzed whole genome mRNA and miRNA expression differences in LLC1 cells cultured in 2D or 3D culture conditions. In our study we used DNA microarrays for whole genome analysis of mRNA and miRNA expression differences in LLC1 cells cultivated in 2D or 3D culture conditions. Next, we indicated the most common enriched functional categories using KEGG pathway enrichment analysis. Finally, we validated the microarray data by quantitative PCR in LLC1 cells cultured under 2D or 3D conditions or LLC1 tumors implanted in experimental animals. Microarray gene expression analysis revealed that 1884 genes and 77 miRNAs were significantly altered in LLC1 cells after 48 h cell growth under 2D and ECM based 3D cell growth conditions. Pathway enrichment results indicated metabolic pathway, MAP kinase, cell adhesion and immune response as the most significantly altered

  10. Identification of Multiple Dehalogenase Genes Involved in Tetrachloroethene-to-Ethene Dechlorination in a Dehalococcoides-Dominated Enrichment Culture

    Directory of Open Access Journals (Sweden)

    Mohamed Ismaeil

    2017-01-01

    Chloroethenes (CEs) are widespread groundwater toxicants that are reductively dechlorinated to nontoxic ethene (ETH) by members of Dehalococcoides. This study established a Dehalococcoides-dominated enrichment culture (designated “YN3”) that dechlorinates tetrachloroethene (PCE) to ETH with high dechlorination activity, that is, complete dechlorination of 800 μM PCE to ETH within 14 days in the presence of Dehalococcoides species at 5.7±1.9×10^7 copies of 16S rRNA gene/mL. The metagenome of YN3 harbored 18 rdhA genes (designated YN3rdhA1–18) encoding the catalytic subunit of reductive dehalogenase (RdhA), four of which were suggested to be involved in PCE-to-ETH dechlorination based on significant increases in their transcription in response to CE addition. The predicted proteins for two of these four genes, YN3RdhA8 and YN3RdhA16, showed 94% and 97% of amino acid similarity with PceA and VcrA, which are well known to dechlorinate PCE to trichloroethene (TCE) and TCE to ETH, respectively. The other two rdhAs, YN3rdhA6 and YN3rdhA12, which were never proved as rdhA for CEs, showed particularly high transcription upon addition of vinyl chloride (VC), with 75±38 and 16±8.6 mRNA copies per gene, respectively, suggesting their possible functions as novel VC-reductive dehalogenases. Moreover, metagenome data indicated the presence of three coexisting bacterial species, including novel species of the genus Bacteroides, which might promote CE dechlorination by Dehalococcoides.

  11. Identification of a conserved set of upregulated genes in mouse skeletal muscle hypertrophy and regrowth.

    Science.gov (United States)

    Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J

    2015-01-01

    The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.
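    The selection threshold stated above (≥2-fold increase or ≥50% decrease compared with control muscle) amounts to a simple ratio test. The sketch below assumes linear-scale, non-negative expression values and a positive control value; the function name `classify_change` is hypothetical and is not taken from the study.

```python
def classify_change(treated: float, control: float) -> str:
    """Label a gene by its treated/control expression ratio.

    "up"        ratio >= 2.0  (at least a 2-fold increase)
    "down"      ratio <= 0.5  (at least a 50% decrease)
    "unchanged" otherwise
    """
    if control <= 0 or treated < 0:
        raise ValueError("expects linear-scale values with a positive control")

    ratio = treated / control
    if ratio >= 2.0:
        return "up"
    if ratio <= 0.5:
        return "down"
    return "unchanged"
```

    On log2-transformed data the same thresholds correspond to a difference of at least +1 or at most -1 between treated and control values.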

  12. Dimethylated H3K27 Is a Repressive Epigenetic Histone Mark in the Protist Entamoeba histolytica and Is Significantly Enriched in Genes Silenced via the RNAi Pathway

    Science.gov (United States)

    Foda, Bardees M.; Singh, Upinder

    2015-01-01

    RNA interference (RNAi) is a fundamental biological process that plays a crucial role in regulation of gene expression in many organisms. Transcriptional gene silencing (TGS) is one of the important nuclear roles of RNAi. Our previous data show that Entamoeba histolytica has a robust RNAi pathway that links to TGS via Argonaute 2-2 (Ago2-2) associated 27-nucleotide small RNAs with 5′-polyphosphate termini. Here, we report the first repressive histone mark to be identified in E. histolytica, dimethylation of H3K27 (H3K27Me2), and demonstrate that it is enriched at genes that are silenced by RNAi-mediated TGS. An RNAi-silencing trigger can induce H3K27Me2 deposits at both episomal and chromosomal loci, mediating gene silencing. Our data support two phases of RNAi-mediated TGS: an active silencing phase where the RNAi trigger is present and both H3K27Me2 and Ago2-2 concurrently enrich at chromosomal loci; and an established silencing phase in which the RNAi trigger is removed, but gene silencing with H3K27Me2 enrichment persist independently of Ago2-2 deposition. Importantly, some genes display resistance to chromosomal silencing despite induction of functional small RNAs. In those situations, the RNAi-triggering plasmid that is maintained episomally gets partially silenced and has H3K27Me2 enrichment, but the chromosomal copy displays no repressive histone enrichment. Our data are consistent with a model in which H3K27Me2 is a repressive histone modification, which is strongly associated with transcriptional repression. This is the first example of an epigenetic histone modification that functions to mediate RNAi-mediated TGS in the deep-branching eukaryote E. histolytica. PMID:26149683

  13. Dimethylated H3K27 Is a Repressive Epigenetic Histone Mark in the Protist Entamoeba histolytica and Is Significantly Enriched in Genes Silenced via the RNAi Pathway.

    Science.gov (United States)

    Foda, Bardees M; Singh, Upinder

    2015-08-21

    RNA interference (RNAi) is a fundamental biological process that plays a crucial role in regulation of gene expression in many organisms. Transcriptional gene silencing (TGS) is one of the important nuclear roles of RNAi. Our previous data show that Entamoeba histolytica has a robust RNAi pathway that links to TGS via Argonaute 2-2 (Ago2-2) associated 27-nucleotide small RNAs with 5'-polyphosphate termini. Here, we report the first repressive histone mark to be identified in E. histolytica, dimethylation of H3K27 (H3K27Me2), and demonstrate that it is enriched at genes that are silenced by RNAi-mediated TGS. An RNAi-silencing trigger can induce H3K27Me2 deposits at both episomal and chromosomal loci, mediating gene silencing. Our data support two phases of RNAi-mediated TGS: an active silencing phase where the RNAi trigger is present and both H3K27Me2 and Ago2-2 concurrently enrich at chromosomal loci; and an established silencing phase in which the RNAi trigger is removed, but gene silencing with H3K27Me2 enrichment persist independently of Ago2-2 deposition. Importantly, some genes display resistance to chromosomal silencing despite induction of functional small RNAs. In those situations, the RNAi-triggering plasmid that is maintained episomally gets partially silenced and has H3K27Me2 enrichment, but the chromosomal copy displays no repressive histone enrichment. Our data are consistent with a model in which H3K27Me2 is a repressive histone modification, which is strongly associated with transcriptional repression. This is the first example of an epigenetic histone modification that functions to mediate RNAi-mediated TGS in the deep-branching eukaryote E. histolytica. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  14. Genome-wide Anaplasma phagocytophilum AnkA-DNA interactions are enriched in intergenic regions and gene promoters and correlate with infection-induced differential gene expression.

    Directory of Open Access Journals (Sweden)

    J Stephen Dumler

    2016-09-01

    Anaplasma phagocytophilum, an obligate intracellular prokaryote, infects neutrophils and alters cardinal functions via reprogrammed transcription. Large contiguous regions of neutrophil chromosomes are differentially expressed during infection. Secreted A. phagocytophilum effector AnkA transits into the neutrophil or granulocyte nucleus to complex with DNA in heterochromatin across all chromosomes. AnkA binds to gene promoters to dampen cis-transcription and also has features of matrix attachment region (MAR)-binding proteins that regulate three-dimensional chromatin architecture and coordinate transcriptional programs encoded in topologically-associated chromatin domains. We hypothesize that identification of additional AnkA binding sites will better delineate how A. phagocytophilum infection results in reprogramming of the neutrophil genome. Using AnkA-binding ChIP-seq, we showed that AnkA binds broadly throughout all chromosomes in a reproducible pattern, especially at: (i) intergenic regions predicted to be matrix attachment regions (MARs); (ii) within predicted lamina-associated domains; and (iii) at promoters ≤3,000 bp upstream of transcriptional start sites. These findings provide genome-wide support for AnkA as a regulator of cis-gene transcription. Moreover, the dominant mark of AnkA in distal intergenic regions known to be AT-enriched, coupled with frequent enrichment in the nuclear lamina, provides strong support for its role as a MAR-binding protein and genome re-organizer. AnkA must be considered a prime candidate to promote neutrophil reprogramming and subsequent functional changes that belie improved microbial fitness and pathogenicity.

  15. DOSE RESPONSE FROM HIGH THROUGHPUT GENE EXPRESSION STUDIES AND THE INFLUENCE OF TIME AND CELL LINE ON INFERRED MODE OF ACTION BY ONTOLOGIC ENRICHMENT (SOT)

    Science.gov (United States)

    Gene expression with ontologic enrichment and connectivity mapping tools is widely used to infer modes of action (MOA) for therapeutic drugs. Despite progress in high-throughput (HT) genomic systems, strategies suitable to identify industrial chemical MOA are needed. The L1000 is...

  16. Mechanism-based biomarker gene sets for glutathione depletion-related hepatotoxicity in rats

    International Nuclear Information System (INIS)

    Gao Weihua; Mizukawa, Yumiko; Nakatsu, Noriyuki; Minowa, Yosuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

    2010-01-01

    Chemical-induced glutathione depletion is thought to be caused by two types of toxicological mechanisms: PHO-type glutathione depletion [glutathione conjugated with chemicals such as phorone (PHO) or diethyl maleate (DEM)], and BSO-type glutathione depletion [i.e., glutathione synthesis inhibited by chemicals such as L-buthionine-sulfoximine (BSO)]. In order to identify mechanism-based biomarker gene sets for glutathione depletion in rat liver, male SD rats were treated with various chemicals including PHO (40, 120 and 400 mg/kg), DEM (80, 240 and 800 mg/kg), BSO (150, 450 and 1500 mg/kg), and bromobenzene (BBZ, 10, 100 and 300 mg/kg). Liver samples were taken 3, 6, 9 and 24 h after administration and examined for hepatic glutathione content, physiological and pathological changes, and gene expression changes using Affymetrix GeneChip Arrays. To identify differentially expressed probe sets in response to glutathione depletion, we focused on the following two courses of events for the two types of mechanisms of glutathione depletion: a) gene expression changes occurring simultaneously in response to glutathione depletion, and b) gene expression changes after glutathione was depleted. The gene expression profiles of the identified probe sets for the two types of glutathione depletion differed markedly at times during and after glutathione depletion, whereas Srxn1 was markedly increased for both types as glutathione was depleted, suggesting that Srxn1 is a key molecule in oxidative stress related to glutathione. The extracted probe sets were refined and verified using various compounds including 13 additional positive or negative compounds, and they established two useful marker sets. One contained three probe sets (Akr7a3, Trib3 and Gstp1) that could detect conjugation-type glutathione depletors any time within 24 h after dosing, and the other contained 14 probe sets that could detect glutathione depletors by any mechanism. These two sets, with appropriate scoring

  17. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Science.gov (United States)

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the

  18. Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

    Directory of Open Access Journals (Sweden)

    Nilotpal Chowdhury

    Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations.
This methodology seems to yield new and

  19. Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

    Science.gov (United States)

    Chowdhury, Nilotpal; Sapru, Shantanu

    2015-01-01

    Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. 
This methodology seems to yield new and interesting
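
    The abstract above ranks per-gene Cox regression coefficients and passes the ranking to GSEA. As a rough illustration of that step, the sketch below computes the weighted running-sum enrichment score that GSEA uses for a pre-ranked list. It is not the authors' pipeline: the function name, the gene names, the scores, and the weight exponent p=1 are assumptions made for the example, and the permutation-based significance testing that normally follows is omitted.

```python
def enrichment_score(ranked_genes, scores, gene_set, p=1.0):
    """Running-sum enrichment score for a pre-ranked gene list.

    ranked_genes: gene identifiers sorted by the ranking metric
                  (here, hypothetically, Cox regression coefficients).
    scores:       the metric values, in the same order.
    gene_set:     the set of genes being tested.
    p:            weight exponent applied to |score| for genes in the set.
    """
    in_set = [g in gene_set for g in ranked_genes]
    n_hit = sum(in_set)
    n_miss = len(ranked_genes) - n_hit
    if n_hit == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    hit_total = sum(abs(s) ** p for s, h in zip(scores, in_set) if h)
    if hit_total == 0:
        raise ValueError("all genes in the set have a zero score")

    running, best = 0.0, 0.0
    for s, h in zip(scores, in_set):
        if h:
            running += abs(s) ** p / hit_total   # step up at a set member
        else:
            running -= 1.0 / n_miss              # step down otherwise
        if abs(running) > abs(best):
            best = running                       # keep the largest deviation from zero
    return best


# Hypothetical ranking: positive coefficients first, negative last.
genes = ["g1", "g2", "g3", "g4"]
coefs = [2.0, 1.0, -1.0, -2.0]
print(enrichment_score(genes, coefs, {"g1", "g2"}))
print(enrichment_score(genes, coefs, {"g3", "g4"}))
```

    A positive score means the set is concentrated at the top of the ranking and a negative score means it is concentrated at the bottom, which is how a set can be positively associated with metastasis in one subtype and negatively in another.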

  20. Germanium enrichment in supergene settings: evidence from the Cristal nonsulfide Zn prospect, Bongará district, northern Peru

    Science.gov (United States)

    Mondillo, Nicola; Arfè, Giuseppe; Herrington, Richard; Boni, Maria; Wilkinson, Clara; Mormone, Angela

    2018-02-01

    Supergene nonsulfide ores form from the weathering of sulfide mineralization. Given the geochemical affinity of Ge to Si⁴⁺ and Fe³⁺, weathering of Ge-bearing sulfides could potentially lead to Ge enrichments in silicate and Fe-oxy-hydroxide minerals, although bulk rock Ge concentrations in supergene nonsulfide deposits are rarely reported. Here, we present the results of an investigation into Ge concentrations and deportment in the Cristal supergene Zn nonsulfide prospect (Bongará, northern Peru), which formed from the weathering of a preexisting Mississippi Valley-type (MVT) sulfide deposit. Material examined in this study originates from drillcore recovered from oxidized Zn-rich bodies 15-20 m thick, containing 5-45 wt% Zn and Ge concentrations 100 ppm. Microanalysis and laser ablation-ICP-MS show that precursor sphalerite is rich in both Fe (mean Fe = 8.19 wt%) and Ge (mean Ge = 142 ppm). Using the mineral geothermometer GGIMFis (geothermometer for Ga, Ge, In, Mn, and Fe in sphalerite) proposed by Frenzel et al. (Ore Geol Rev 76:52-78, 2016), sphalerite trace element data from the Cristal prospect suggest a possible formation temperature (T_GGIMFis) of 225 ± 50 °C, anomalously high for a MVT deposit. Germanium concentrations measured in both goethite (mean values 100 to 229 ppm, max 511 ppm) and hemimorphite (mean values 39 to 137 ppm, max 258 ppm) are similar to concentrations measured in hypogene sphalerite. Additionally, the Ge concentrations recorded in bulk rock analyses of sphalerite-bearing and oxidized samples are also similar. A persistent warm-humid climate is interpreted for the region, resulting in the development of an oxidation zone favoring the formation of abundant Zn hydrosilicates and Fe hydroxides, both able to incorporate Ge in their crystal structure. In this scenario, Ge has been prevented from dispersion during the weathering of the Ge-bearing sulfide bodies and remains in the resultant nonsulfide ore.

  1. Accurate Gene Expression-Based Biodosimetry Using a Minimal Set of Human Gene Transcripts

    Energy Technology Data Exchange (ETDEWEB)

    Tucker, James D., E-mail: jtucker@biology.biosci.wayne.edu [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Joiner, Michael C. [Department of Radiation Oncology, Wayne State University, Detroit, Michigan (United States); Thomas, Robert A.; Grever, William E.; Bakhmutsky, Marina V. [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Chinkhota, Chantelle N.; Smolinski, Joseph M. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States); Divine, George W. [Department of Public Health Sciences, Henry Ford Hospital, Detroit, Michigan (United States); Auner, Gregory W. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States)

    2014-03-15

    Purpose: Rapid and reliable methods for conducting biological dosimetry are a necessity in the event of a large-scale nuclear event. Conventional biodosimetry methods lack the speed, portability, ease of use, and low cost required for triaging numerous victims. Here we address this need by showing that polymerase chain reaction (PCR) on a small number of gene transcripts can provide accurate and rapid dosimetry. The low cost and relative ease of PCR compared with existing dosimetry methods suggest that this approach may be useful in mass-casualty triage situations. Methods and Materials: Human peripheral blood from 60 adult donors was acutely exposed to cobalt-60 gamma rays at doses of 0 (control) to 10 Gy. mRNA expression levels of 121 selected genes were obtained 0.5, 1, and 2 days after exposure by reverse-transcriptase real-time PCR. Optimal dosimetry at each time point was obtained by stepwise regression of dose received against individual gene transcript expression levels. Results: Only 3 to 4 different gene transcripts, ASTN2, CDKN1A, GDF15, and ATM, are needed to explain ≥0.87 of the variance (R²). Receiver-operator characteristics, a measure of sensitivity and specificity, of 0.98 for these statistical models were achieved at each time point. Conclusions: The actual and predicted radiation doses agree very closely up to 6 Gy. Dosimetry at 8 and 10 Gy shows some effect of saturation, thereby slightly diminishing the ability to quantify higher exposures. Analyses of these gene transcripts may be advantageous for use in a field-portable device designed to assess exposures in mass casualty situations or in clinical radiation emergencies.

  2. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene (http://www.cbio.uct.ac.za/indygene), a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.
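
    The reduction described above, a gene list with no pairwise paralogy relationship above a similarity threshold, can be illustrated with a simple greedy filter. This is only a sketch under stated assumptions: the abstract does not describe how Indygene chooses which paralog to keep, so the first-come rule, the function name, the gene names, and the similarity values below are all hypothetical.

```python
def remove_paralogs(genes, similarity, threshold):
    """Greedily keep genes so that no kept pair exceeds the similarity threshold.

    genes:      list of gene identifiers, in order of preference.
    similarity: dict mapping frozenset({gene_a, gene_b}) to a sequence
                similarity in [0, 1]; missing pairs are treated as 0.
    threshold:  pairs with similarity above this value count as paralogs.
    """
    kept = []
    for gene in genes:
        # Keep the gene only if it is not a paralog of anything already kept.
        if all(similarity.get(frozenset((gene, k)), 0.0) <= threshold for k in kept):
            kept.append(gene)
    return kept


# Hypothetical input: A1 and A2 are paralogs, B is unrelated.
sims = {frozenset(("A1", "A2")): 0.9}
print(remove_paralogs(["A1", "A2", "B"], sims, threshold=0.7))
```

    Because the filter is order-dependent, sorting the input by some preference (for example, strength of differential expression) decides which member of a paralog group survives.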

  3. The Use of Gene Ontology Term and KEGG Pathway Enrichment for Analysis of Drug Half-Life.

    Directory of Open Access Journals (Sweden)

    Yu-Hang Zhang

    A drug's biological half-life is defined as the time required for the human body to metabolize or eliminate 50% of the initial drug dosage. Correctly measuring the half-life of a given drug is helpful for the safe and accurate usage of the drug. In this study, we investigated which gene ontology (GO) terms and biological pathways were highly related to the determination of drug half-life. The investigated drugs, with known half-lives, were analyzed based on their enrichment scores for associated GO terms and KEGG pathways. These scores indicate which GO terms or KEGG pathways the drug targets. The feature selection method, minimum redundancy maximum relevance, was used to analyze these GO terms and KEGG pathways and to identify important GO terms and pathways, such as sodium-independent organic anion transmembrane transporter activity (GO:0015347), monoamine transmembrane transporter activity (GO:0008504), negative regulation of synaptic transmission (GO:0050805), neuroactive ligand-receptor interaction (hsa04080), serotonergic synapse (hsa04726), and linoleic acid metabolism (hsa00591), among others. This analysis confirmed our results and may show evidence for a new method in studying drug half-lives and building effective computational methods for the prediction of drug half-lives.

  4. Multiplex real-time PCR for detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL genes from selective enrichments from animals and retail meat.

    Directory of Open Access Journals (Sweden)

    Valeria Velasco

    The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL) genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat). The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus), mecA (associated with methicillin resistance) and PVL (virulence factor), and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68-0.88 (from substantial to almost perfect agreement) and 0.29-0.77 (from fair to substantial agreement) for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0-0.49 (from no agreement beyond that expected by chance to moderate agreement) for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA

  5. Multiplex Real-Time PCR for Detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL) Genes from Selective Enrichments from Animals and Retail Meat

    Science.gov (United States)

    Velasco, Valeria; Sherwood, Julie S.; Rojas-García, Pedro P.; Logue, Catherine M.

    2014-01-01

    The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL) genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat). The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus), mecA (associated with methicillin resistance) and PVL (virulence factor), and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68–0.88 (from substantial to almost perfect agreement) and 0.29–0.77 (from fair to substantial agreement) for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0–0.49 (from no agreement beyond that expected by chance to moderate agreement) for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA) using
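
    The kappa statistic quoted above measures agreement between two methods beyond what chance alone would produce. A minimal sketch of Cohen's kappa for two binary methods follows. The cell counts in the example are invented for illustration, since the abstract reports only marginal totals such as 95/234 and 111/234, and the function name is an assumption.

```python
def cohen_kappa(both_pos, a_only, b_only, both_neg):
    """Cohen's kappa for two binary methods from a 2x2 agreement table.

    both_pos: samples positive by both methods
    a_only:   positive by method A only
    b_only:   positive by method B only
    both_neg: negative by both methods
    """
    n = both_pos + a_only + b_only + both_neg
    if n == 0:
        raise ValueError("no samples")
    observed = (both_pos + both_neg) / n           # observed agreement
    a_rate = (both_pos + a_only) / n               # positive rate of method A
    b_rate = (both_pos + b_only) / n               # positive rate of method B
    expected = a_rate * b_rate + (1 - a_rate) * (1 - b_rate)  # chance agreement
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Invented counts for 100 samples, not taken from the study.
print(cohen_kappa(both_pos=40, a_only=10, b_only=5, both_neg=45))
```

    Values near 0 indicate agreement no better than chance and values near 1 indicate almost perfect agreement, matching the verbal scale used in the abstract.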

  6. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Directory of Open Access Journals (Sweden)

    Hettne Kristina M

    2013-01-01

    Abstract Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate-corrected p-values Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.

  7. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets.

    Science.gov (United States)

    Khan, Aziz; Mathelier, Anthony

    2017-05-31

    A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .
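
    The pairwise comparison of multiple gene lists described above can be sketched in a few lines. The code below is not Intervene's implementation or interface: it is a plain-Python illustration that reports, for every pair of named sets, the intersection size and the Jaccard index. The choice of Jaccard as the similarity measure, the function name, and the example sets are assumptions.

```python
from itertools import combinations


def pairwise_intersections(named_sets):
    """Return {(name_a, name_b): (intersection_size, jaccard)} for all pairs.

    named_sets: dict mapping a set name to a set of gene identifiers.
    """
    result = {}
    for a, b in combinations(named_sets, 2):
        shared = named_sets[a] & named_sets[b]
        union = named_sets[a] | named_sets[b]
        jaccard = len(shared) / len(union) if union else 0.0
        result[(a, b)] = (len(shared), jaccard)
    return result


# Hypothetical gene lists from three experiments.
lists = {
    "x": {"g1", "g2", "g3"},
    "y": {"g2", "g3", "g4"},
    "z": {"g5"},
}
for pair, (size, jaccard) in pairwise_intersections(lists).items():
    print(pair, size, round(jaccard, 2))
```

    The resulting pair-to-score mapping is the kind of matrix that a clustered heat map would display; for genomic regions rather than gene names, set membership would have to be replaced by interval overlap.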

  8. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    K.M. Hettne (Kristina); J. Boorsma (Jeffrey); D.A.M. van Dartel (Dorien A M); J.J. Goeman (Jelle); E.C. de Jong (Esther); A.H. Piersma (Aldert); R.H. Stierum (Rob); J. Kleinjans (Jos); J.A. Kors (Jan)

    2013-01-01

    Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with

  9. Improved detection of Burkholderia pseudomallei from non-blood clinical specimens using enrichment culture and PCR: narrowing diagnostic gap in resource-constrained settings.

    Science.gov (United States)

    Tellapragada, Chaitanya; Shaw, Tushar; D'Souza, Annet; Eshwara, Vandana Kalwaje; Mukhopadhyay, Chiranjay

    2017-07-01

    To evaluate the diagnostic utility of enrichment culture and PCR for improved case detection rates of non-bacteraemic form of melioidosis in limited resource settings. Clinical specimens (n = 525) obtained from patients presenting at a tertiary care hospital of South India with clinical symptoms suggestive of community-acquired pneumonia, lower respiratory tract infections, superficial or internal abscesses, chronic skin ulcers and bone or joint infections were tested for the presence of Burkholderia pseudomallei using conventional culture (CC), enrichment culture (EC) and PCR. Sensitivity, specificity, positive and negative predictive values of CC and PCR were initially deduced using EC as the gold standard method. Further, diagnostic accuracies of all the three methods were analysed using Bayesian latent class modelling (BLCM). Detection rates of B. pseudomallei using CC, EC and PCR were 3.8%, 5.3% and 6%, respectively. Diagnostic sensitivities and specificities of CC and PCR were 71.4, 98.4% and 100 and 99.4%, respectively in comparison with EC as the gold standard test. With Bayesian latent class modelling, EC and PCR demonstrated sensitivities of 98.7 and 99.3%, respectively, while CC showed a sensitivity of 70.3% for detection of B. pseudomallei. An increase of 1.6% (95% CI: 1.08-4.32%) in the case detection rate of melioidosis was observed in the study population when EC and/or PCR were used in adjunct to the conventional culture technique. Our study findings underscore the diagnostic superiority of enrichment culture and/or PCR over conventional microbiological culture for improved case detection of melioidosis from non-blood clinical specimens. © 2017 John Wiley & Sons Ltd.
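
    Sensitivity, specificity, and the positive and negative predictive values mentioned above all derive from a 2x2 table of test results against a reference standard. The sketch below shows those standard definitions. The counts are invented for illustration and are not the study's data, and the function name is an assumption; the Bayesian latent class modelling step is not covered.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures of a test against a reference standard.

    tp: test positive, reference positive
    fp: test positive, reference negative
    fn: test negative, reference positive
    tn: test negative, reference negative
    """
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("a row or column of the 2x2 table is empty")
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases the test detects
        "specificity": tn / (tn + fp),  # share of non-cases the test clears
        "ppv": tp / (tp + fp),          # probability a positive result is a true case
        "npv": tn / (tn + fn),          # probability a negative result is a non-case
    }


# Invented counts for 100 specimens.
for name, value in diagnostic_metrics(tp=20, fp=5, fn=5, tn=70).items():
    print(f"{name}: {value:.3f}")
```

    Predictive values depend on how common the condition is among the tested specimens, so they do not transfer between populations the way sensitivity and specificity do.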

  10. Enrichment of CD44 in basal-type breast cancer correlates with EMT, cancer stem cell gene profile, and prognosis

    Directory of Open Access Journals (Sweden)

    Xu HX

    2016-01-01

    Hanxiao Xu,1 Yijun Tian,1 Xun Yuan,1 Yu Liu,2 Hua Wu,1 Qian Liu,1 Gen Sheng Wu,3,4 Kongming Wu1 1Department of Oncology, 2Department of Geriatrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, People’s Republic of China; 3Department of Oncology, 4Department of Pathology, Karmanos Cancer Institute, Wayne State University School of Medicine, Detroit, MI, USA Abstract: Cluster of differentiation 44 (CD44) is a transmembrane glycoprotein that serves as the receptor for the extracellular matrix component hyaluronic acid. CD44 has been reported to play key roles in cell proliferation, motility, and survival, but its role in breast cancer remains controversial. In this study, we conducted a meta-analysis. A total of 23 published Gene Expression Omnibus databases were included to evaluate the association between CD44 mRNA expression and clinicopathological characteristics or prognosis of the patients with breast cancer. Our analysis revealed that CD44 expression was associated with clinicopathological features, including the histological grade, estrogen receptor status, progesterone receptor status, and human epidermal growth factor receptor-2 status. Higher levels of CD44 expression were observed in the basal subtype of breast cancer both at the mRNA and protein levels (odds ratio [OR] =2.08, 95% confidence interval [CI]: 1.72–2.52; OR =2.11, 95% CI: 1.67–2.68). Patients with CD44 overexpression exhibited significantly worse overall survival (hazard ratio =1.27; 95% CI: 1.04–1.55). Whole gene profile analysis revealed that CD44 expression was enriched in basal-type breast cancer and correlated with epithelial–mesenchymal transition and cancer stem cell gene profiles. In summary, our analyses indicated that CD44 potentially might be a prognostic marker for breast cancer and thus can serve as a therapeutic target for basal-type breast cancer. Keywords: breast cancer, CD44, survival prediction, meta

  11. Optimal structural inference of signaling pathways from unordered and overlapping gene sets.

    Science.gov (United States)

    Acharya, Lipi R; Judeh, Thair; Wang, Guangdi; Zhu, Dongxiao

    2012-02-15

    A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a 'search and score' network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and

  12. Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

    Directory of Open Access Journals (Sweden)

    Clive H Glover

    2006-11-01

    Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC) lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42) showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.

  13. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder

    DEFF Research Database (Denmark)

    Naaijen, Jill; Bralten, Janita; Poelmans, Geert

    2017-01-01

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analysis for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms...

  15. Selection and validation of a set of reliable reference genes for quantitative sod gene expression analysis in C. elegans

    Directory of Open Access Journals (Sweden)

    Vandesompele Jo

    2008-01-01

    Full Text Available Background: In the nematode Caenorhabditis elegans the conserved Ins/IGF-1 signaling pathway regulates many biological processes including life span, stress response, dauer diapause and metabolism. Detection of differentially expressed genes may contribute to a better understanding of the mechanism by which the Ins/IGF-1 signaling pathway regulates these processes. Appropriate normalization is an essential prerequisite for obtaining accurate and reproducible quantification of gene expression levels. The aim of this study was to establish a reliable set of reference genes for gene expression analysis in C. elegans. Results: Real-time quantitative PCR was used to evaluate the expression stability of 12 candidate reference genes (act-1, ama-1, cdc-42, csq-1, eif-3.C, mdh-1, gpd-2, pmp-3, tba-1, Y45F10D.4, rgs-6 and unc-16) in wild-type, three Ins/IGF-1 pathway mutants, dauers and L3 stage larvae. After geNorm analysis, cdc-42, pmp-3 and Y45F10D.4 showed the most stable expression pattern and were used to normalize 5 sod expression levels. Significant differences in mRNA levels were observed for sod-1 and sod-3 in daf-2 relative to wild-type animals, whereas in dauers sod-1, sod-3, sod-4 and sod-5 are differentially expressed relative to third stage larvae. Conclusion: Our findings emphasize the importance of accurate normalization using stably expressed reference genes. The methodology used in this study is generally applicable to reliably quantify gene expression levels in the nematode C. elegans using quantitative PCR.
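
    The normalization step described in this abstract can be illustrated with a minimal sketch. It assumes the common geNorm-style convention of dividing a target gene's relative quantity by the geometric mean of the reference genes' relative quantities; the function names and all numeric values below are hypothetical and are not taken from the study.

```python
from statistics import geometric_mean


def normalization_factor(ref_quantities):
    """Geometric mean of the reference genes' relative quantities for one sample."""
    return geometric_mean(ref_quantities)


def normalize(target_quantity, ref_quantities):
    """Express a target gene's relative quantity per unit of the reference factor."""
    return target_quantity / normalization_factor(ref_quantities)


# Hypothetical relative quantities for one sample: three reference genes
# (standing in for cdc-42, pmp-3 and Y45F10D.4) and one target gene.
refs = [1.0, 2.0, 4.0]
target = 8.0
print(normalize(target, refs))
```

    The geometric mean is used rather than the arithmetic mean so that a single highly abundant reference gene does not dominate the factor.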

  16. Uranium enrichment. Enrichment processes

    International Nuclear Information System (INIS)

    Alexandre, M.; Quaegebeur, J.P.

    2009-01-01

    Despite the remarkable progress made in the diversity and efficiency of uranium enrichment processes, only two industrial processes remain today that satisfy all enriched uranium needs: gaseous diffusion and centrifugation. This article describes both processes and some others still at the demonstration or laboratory stage of development: 1 - general considerations; 2 - gaseous diffusion: physical principles, implementation, utilisation in the world; 3 - centrifugation: principles, elementary separation factor, flows inside a centrifuge, modeling of separation efficiencies, mechanical design, types of industrial centrifuges, realisation of cascades, main characteristics of the centrifugation process; 4 - aerodynamic processes: vortex process, nozzle process; 5 - chemical exchange separation processes: Japanese ASAHI process, French CHEMEX process; 6 - laser-based processes: SILVA process, SILMO process; 7 - electromagnetic and ionic processes: mass spectrometer and calutron, ion cyclotron resonance, rotating plasmas; 8 - thermal diffusion; 9 - conclusion. (J.S.)

  17. Mining tissue specificity, gene connectivity and disease association to reveal a set of genes that modify the action of disease causing genes

    Directory of Open Access Journals (Sweden)

    Reverter Antonio

    2008-09-01

    Full Text Available Background: The tissue specificity of gene expression has been linked to a number of significant outcomes including level of expression, and differential rates of polymorphism, evolution and disease association. Recent studies have also shown the importance of exploring differential gene connectivity and sequence conservation in the identification of disease-associated genes. However, no study relates gene interactions with tissue specificity and disease association. Methods: We adopted an a priori approach making as few assumptions as possible to analyse the interplay among gene-gene interactions with tissue specificity and its subsequent likelihood of association with disease. We mined three large datasets comprising expression data drawn from massively parallel signature sequencing across 32 tissues, describing a set of 55,606 true positive interactions for 7,197 genes, and microarray expression results generated during the profiling of systemic inflammation, from which 126,543 interactions among 7,090 genes were reported. Results: Amongst the myriad of complex relationships identified between expression, disease, connectivity and tissue specificity, some interesting patterns emerged. These include elevated rates of expression and network connectivity in housekeeping and disease-associated tissue-specific genes. We found that disease-associated genes are more likely to show tissue specific expression and most frequently interact with other disease genes. Using the thresholds defined in these observations, we develop a guilt-by-association algorithm and discover a group of 112 non-disease annotated genes that predominantly interact with disease-associated genes, impacting on disease outcomes. Conclusion: We conclude that parameters such as tissue specificity and network connectivity can be used in combination to identify a group of genes, not previously confirmed as disease causing, that are involved in interactions with disease causing

  18. Can survival prediction be improved by merging gene expression data sets?

    Directory of Open Access Journals (Sweden)

    Haleh Yasrebi

    Full Text Available BACKGROUND: High-throughput gene expression profiling technologies, generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC) and hazard ratio as performance measures, we see overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a) the diversity of microarray platforms used, (b) the heterogeneity of patient cohorts, (c) the heterogeneity of breast cancer disease, (d) substantial variation of time to death or relapse, (e) the reduced number of genes in the merged data

  19. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.; Medvedeva, Yulia; Baeck, Pia M.; Hegde, Shubhada R.; Mande, Shekhar C.; Makeev, Vsevolod J.

    2013-01-01

    interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set

  20. Chronic vitamin A-enriched diet feeding regulates hypercholesterolaemia through transcriptional regulation of reverse cholesterol transport pathway genes in obese rat model of WNIN/GR-Ob strain

    Directory of Open Access Journals (Sweden)

    Shanmugam M Jeyakumar

    2016-01-01

    Full Text Available Background & objectives: Hepatic scavenger receptor class B1 (SR-B1), a high-density lipoprotein (HDL) receptor, is involved in the selective uptake of HDL-associated esterified cholesterol (EC), thereby regulating cholesterol homoeostasis and improving reverse cholesterol transport. Previously, we reported in euglycaemic obese rats (WNIN/Ob strain) that feeding of a vitamin A-enriched diet normalized hypercholesterolaemia, possibly through a hepatic SR-B1-mediated pathway. This study aimed to test whether it would be possible to normalize hypercholesterolaemia in a glucose-intolerant obese rat model (WNIN/GR/Ob) through a similar mechanism by feeding an identical vitamin A-enriched diet. Methods: In this study, 30 wk old male lean and obese rats of WNIN/GR-Ob strain were divided into two groups and received either stock diet or vitamin A-enriched diet (2.6 mg or 129 mg vitamin A/kg diet) for 14 wk. Blood and other tissues were collected for various biochemical analyses. Results: Chronic vitamin A-enriched diet feeding decreased hypercholesterolaemia and normalized abnormally elevated plasma HDL-cholesterol (HDL-C) levels in obese rats as compared to stock diet-fed obese groups. Further, decreased free cholesterol (FC) and increased esterified cholesterol (EC) contents of plasma cholesterol were observed, which were reflected in a higher EC to FC ratio in vitamin A-enriched diet-fed obese rats. However, neither lecithin-cholesterol acyltransferase (LCAT) activity of plasma nor its expression (both gene and protein) in the liver was altered. On the contrary, hepatic cholesterol levels significantly increased in vitamin A-enriched diet-fed obese rats. Hepatic SR-B1 expression (both mRNA and protein) remained unaltered among groups. Vitamin A-enriched diet-fed obese rats showed a significant increase in hepatic low-density lipoprotein receptor mRNA levels, while the expression of genes involved in HDL synthesis, namely, ATP-binding cassette protein 1 (ABCA1) and

  1. Shrinkage covariance matrix approach based on robust trimmed mean in gene sets detection

    Science.gov (United States)

    Karjanto, Suryaefiza; Ramli, Norazan Mohamed; Ghani, Nor Azura Md; Aripin, Rasimah; Yusop, Noorezatty Mohd

    2015-02-01

    Microarray technology involves placing an orderly arrangement of thousands of gene sequences in a grid on a suitable surface. The technology has enabled novel discoveries since its development and has attracted increasing attention among researchers. Its widespread use is largely due to its ability to analyze thousands of genes simultaneously, in a massively parallel manner, in one experiment. Hence, it provides valuable knowledge on gene interaction and function. A microarray data set typically consists of tens of thousands of genes (variables) from just dozens of samples due to various constraints. Therefore, the sample covariance matrix in Hotelling's T² statistic is not positive definite and becomes singular, so it cannot be inverted. In this research, Hotelling's T² statistic is combined with a shrinkage approach as an alternative estimator of the covariance matrix to detect significant gene sets. The shrinkage covariance matrix overcomes the singularity problem by converting an unbiased estimator into an improved biased estimator of the covariance matrix. A robust trimmed mean is integrated into the shrinkage matrix to reduce the influence of outliers and consequently increase its efficiency. The performance of the proposed method is measured using several simulation designs. The results are expected to outperform existing techniques in many tested conditions.
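
    The way shrinkage removes the singularity can be written out in a short worked form. This is a generic sketch, not the authors' estimator: the symbols below (sample covariance S, shrinkage target T, intensity λ, group sizes n_1 and n_2, group mean vectors) are assumptions introduced for illustration, T is assumed to be a diagonal matrix with positive entries, and the robust trimmed-mean component mentioned in the abstract is not reproduced.

```latex
\[
S^{*} = \lambda T + (1-\lambda) S, \qquad 0 < \lambda \le 1,
\]
\[
T^{2} = \frac{n_1 n_2}{n_1 + n_2}\,
        (\bar{x}_1 - \bar{x}_2)^{\top} (S^{*})^{-1} (\bar{x}_1 - \bar{x}_2).
\]
```

    Because (1-λ)S is positive semidefinite and λT is positive definite under these assumptions, their sum S* is positive definite and therefore invertible, even when the number of genes exceeds the number of samples.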

  2. Gene Sets for Utilization of Primary and Secondary Nutrition Supplies in the Distal Gut of Endangered Iberian Lynx

    Science.gov (United States)

    Alcaide, María; Messina, Enzo; Richter, Michael; Bargiela, Rafael; Peplies, Jörg; Huws, Sharon A.; Newbold, Charles J.; Golyshin, Peter N.; Simón, Miguel A.; López, Guillermo; Yakimov, Michail M.; Ferrer, Manuel

    2012-01-01

    Recent studies have indicated the existence of an extensive trans-genomic trans-mural co-metabolism between gut microbes and animal hosts that is diet-, host phylogeny- and provenance-influenced. Here, we analyzed the biodiversity at the level of small subunit rRNA gene sequence and the metabolic composition of 18 Mbp of consensus metagenome sequences and activity characteristics of bacterial intra-cellular extracts, in wild Iberian lynx (Lynx pardinus) fecal samples. Bacterial signatures (14.43% of all of the Firmicutes reads and 6.36% of total reads) related to the uncultured anaerobic commensals Anaeroplasma spp., which are typically found in ovine and bovine rumen, were first identified. The lynx gut was further characterized by an over-representation of ‘presumptive’ aquaporin aqpZ genes and genes encoding ‘active’ lysosomal-like digestive enzymes that are possibly needed to acquire glycerol, sugars and amino acids from glycoproteins, glyco(amino)lipids, glyco(amino)glycans and nucleoside diphosphate sugars. Lynx gut was highly enriched (28% of the total glycosidases) in genes encoding α-amylase and related enzymes, although it exhibited low rate of enzymatic activity indicative of starch degradation. The preponderance of β-xylosidase activity in protein extracts further suggests lynx gut microbes being most active for the metabolism of β-xylose containing plant N-glycans, although β-xylosidases sequences constituted only 1.5% of total glycosidases. These collective and unique bacterial, genetic and enzymatic activity signatures suggest that the wild lynx gut microbiota not only harbors gene sets underpinning sugar uptake from primary animal tissues (with the monotypic dietary profile of the wild lynx consisting of 80–100% wild rabbits) but also for the hydrolysis of prey-derived plant biomass. Although, the present investigation corresponds to a single sample and some of the statements should be considered qualitative, the data most likely

  3. Gene sets for utilization of primary and secondary nutrition supplies in the distal gut of endangered Iberian lynx.

    Directory of Open Access Journals (Sweden)

    María Alcaide

    Full Text Available Recent studies have indicated the existence of an extensive trans-genomic trans-mural co-metabolism between gut microbes and animal hosts that is diet-, host phylogeny- and provenance-influenced. Here, we analyzed the biodiversity at the level of small subunit rRNA gene sequence and the metabolic composition of 18 Mbp of consensus metagenome sequences and activity characteristics of bacterial intra-cellular extracts, in wild Iberian lynx (Lynx pardinus) fecal samples. Bacterial signatures (14.43% of all of the Firmicutes reads and 6.36% of total reads) related to the uncultured anaerobic commensals Anaeroplasma spp., which are typically found in ovine and bovine rumen, were first identified. The lynx gut was further characterized by an over-representation of 'presumptive' aquaporin aqpZ genes and genes encoding 'active' lysosomal-like digestive enzymes that are possibly needed to acquire glycerol, sugars and amino acids from glycoproteins, glyco(amino)lipids, glyco(amino)glycans and nucleoside diphosphate sugars. Lynx gut was highly enriched (28% of the total glycosidases) in genes encoding α-amylase and related enzymes, although it exhibited low rate of enzymatic activity indicative of starch degradation. The preponderance of β-xylosidase activity in protein extracts further suggests lynx gut microbes being most active for the metabolism of β-xylose containing plant N-glycans, although β-xylosidases sequences constituted only 1.5% of total glycosidases. These collective and unique bacterial, genetic and enzymatic activity signatures suggest that the wild lynx gut microbiota not only harbors gene sets underpinning sugar uptake from primary animal tissues (with the monotypic dietary profile of the wild lynx consisting of 80-100% wild rabbits) but also for the hydrolysis of prey-derived plant biomass. Although, the present investigation corresponds to a single sample and some of the statements should be considered qualitative, the data most likely

  4. Network-based functional enrichment

    Directory of Open Access Journals (Sweden)

    Poirel Christopher L

    2011-11-01

    Full Text Available Background: Many methods have been developed to infer and reason about molecular interaction networks. These approaches often yield networks with hundreds or thousands of nodes and up to an order of magnitude more edges. It is often desirable to summarize the biological information in such networks. A very common approach is to use gene function enrichment analysis for this task. A major drawback of this method is that it ignores information about the edges in the network being analyzed, i.e., it treats the network simply as a set of genes. In this paper, we introduce a novel method for functional enrichment that explicitly takes network interactions into account. Results: Our approach naturally generalizes Fisher’s exact test, a gene set-based technique. Given a function of interest, we compute the subgraph of the network induced by genes annotated to this function. We use the sequence of sizes of the connected components of this sub-network to estimate its connectivity. We estimate the statistical significance of the connectivity empirically by a permutation test. We present three applications of our method: (i) determine which functions are enriched in a given network, (ii) given a network and an interesting sub-network of genes within that network, determine which functions are enriched in the sub-network, and (iii) given two networks, determine the functions for which the connectivity improves when we merge the second network into the first. Through these applications, we show that our approach is a natural alternative to network clustering algorithms. Conclusions: We presented a novel approach to functional enrichment that takes into account the pairwise relationships among genes annotated by a particular function. Each of the three applications discovers highly relevant functions. We used our methods to study biological data from three different organisms. Our results demonstrate the wide applicability of our methods. Our algorithms are
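
    The procedure described in this abstract (induced subgraph, connected-component sizes, permutation test) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' algorithm: the connectivity statistic (fraction of annotated genes in the largest component), the null model (random gene sets of equal size drawn from the network) and all function names are hypothetical choices made here.

```python
import random


def component_sizes(adj, genes):
    """Sizes of the connected components of the subgraph induced by `genes`, largest first."""
    genes = set(genes)
    seen = set()
    sizes = []
    for start in genes:
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        size = 0
        while stack:  # depth-first search restricted to the induced subgraph
            node = stack.pop()
            size += 1
            for neighbor in adj.get(node, ()):
                if neighbor in genes and neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        sizes.append(size)
    return sorted(sizes, reverse=True)


def connectivity(adj, genes):
    """Assumed statistic: fraction of the gene set lying in its largest component."""
    sizes = component_sizes(adj, genes)
    return sizes[0] / len(set(genes)) if sizes else 0.0


def permutation_p_value(adj, annotated, n_perm=1000, seed=0):
    """Empirical p-value: how often a random gene set is at least as connected."""
    rng = random.Random(seed)
    nodes = sorted(adj)
    annotated = [g for g in annotated if g in adj]
    observed = connectivity(adj, annotated)
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(nodes, len(annotated))
        if connectivity(adj, sample) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy network given as an adjacency dictionary.
toy = {
    "a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"},
    "e": {"f"}, "f": {"e"}, "g": set(), "h": set(),
}
print(permutation_p_value(toy, ["a", "b", "c", "d"], n_perm=200))
```

    Unlike a set-based test, the statistic changes when edges among the annotated genes are added or removed, which is the property the abstract emphasizes.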

  5. Coverage and characteristics of the Affymetrix GeneChip Human Mapping 100K SNP set.

    Directory of Open Access Journals (Sweden)

    2006-05-01

    Full Text Available Improvements in technology have made it possible to conduct genome-wide association mapping at costs within reach of academic investigators, and experiments are currently being conducted with a variety of high-throughput platforms. To provide an appropriate context for interpreting results of such studies, we summarize here results of an investigation of one of the first of these technologies to be publicly available, the Affymetrix GeneChip Human Mapping 100K set of single nucleotide polymorphisms (SNPs). In a systematic analysis of the pattern and distribution of SNPs in the Mapping 100K set, we find that SNPs in this set are undersampled from coding regions (both nonsynonymous and synonymous) and oversampled from regions outside genes, relative to SNPs in the overall HapMap database. In addition, we utilize a novel multilocus linkage disequilibrium (LD) coefficient based on information content (analogous to the information content scores commonly used for linkage mapping) that is equivalent to the familiar measure r² in the special case of two loci. Using this approach, we are able to summarize for any subset of markers, such as the Affymetrix Mapping 100K set, the information available for association mapping in that subset, relative to the information available in the full set of markers included in the HapMap, and highlight circumstances in which this multilocus measure of LD provides substantial additional insight about the haplotype structure in a region over pairwise measures of LD.
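
    For reference, the familiar two-locus measure that the multilocus coefficient reduces to can be written as below. This is the standard pairwise definition, with notation chosen here for illustration (p_A and p_B are allele frequencies at the two loci and p_AB is the frequency of the haplotype carrying both alleles); the multilocus coefficient itself is not defined in the abstract and is not reproduced.

```latex
\[
D = p_{AB} - p_A\, p_B, \qquad
r^{2} = \frac{D^{2}}{p_A (1 - p_A)\; p_B (1 - p_B)}.
\]
```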

  6. Gene set-based module discovery in the breast cancer transcriptome

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2009-02-01

    Full Text Available Background: Although microarray-based studies have revealed a global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which are defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods have not been applied to cancer transcriptome data. Results: In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtype of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2) is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells. Conclusion: These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

  7. Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans

    Science.gov (United States)

    Iraola, Gregorio; Vazquez, Gustavo; Spangenberg, Lucía; Naya, Hugo

    2012-01-01

    Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes, for both known human pathogenic and non-pathogenic strains; this information is now available to investigate genetic features that determine pathogenic phenotypes in bacteria. In this work we determined presence/absence patterns of different virulence-related genes among more than finished bacterial genomes from both human pathogenic and non-pathogenic strains, belonging to different taxonomic groups (i.e: Actinobacteria, Gammaproteobacteria, Firmicutes, etc.). An accuracy of 95% using a cross-fold validation scheme with in-fold feature selection is obtained when classifying human pathogens and non-pathogens. A reduced subset of highly informative genes () is presented and applied to an external validation set. The statistical model was implemented in the BacFier v1.0 software (freely available at ), that displays not only the prediction (pathogen/non-pathogen) and an associated probability for pathogenicity, but also the presence/absence vector for the analyzed genes, so it is possible to decipher the subset of virulence genes responsible for the classification on the analyzed genome. Furthermore, we discuss the biological relevance for bacterial pathogenesis of the core set of genes, corresponding to eight functional categories, all with evident and documented association with the phenotypes of interest. Also, we analyze which functional categories of virulence genes were more distinctive for pathogenicity in each taxonomic group, which seems to be a completely new kind of information and could lead to important evolutionary conclusions. PMID:22916122

  8. Identification of a core set of rhizobial infection genes using data from single cell-types

    Directory of Open Access Journals (Sweden)

    Da-Song Chen

    2015-07-01

    Full Text Available Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia. More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4’,4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.

  9. Expression map of a complete set of gustatory receptor genes in chemosensory organs of Bombyx mori.

    Science.gov (United States)

    Guo, Huizhen; Cheng, Tingcai; Chen, Zhiwei; Jiang, Liang; Guo, Youbing; Liu, Jianqiu; Li, Shenglong; Taniai, Kiyoko; Asaoka, Kiyoshi; Kadono-Okuda, Keiko; Arunkumar, Kallare P; Wu, Jiaqi; Kishino, Hirohisa; Zhang, Huijie; Seth, Rakesh K; Gopinathan, Karumathil P; Montagné, Nicolas; Jacquin-Joly, Emmanuelle; Goldsmith, Marian R; Xia, Qingyou; Mita, Kazuei

    2017-03-01

    Most lepidopteran species are herbivores, and interaction with host plants affects their gene expression and behavior as well as their genome evolution. Gustatory receptors (Grs) are expected to mediate host plant selection, feeding, oviposition and courtship behavior. However, due to their high diversity, sequence divergence and extremely low level of expression it has been difficult to identify precisely a complete set of Grs in Lepidoptera. By manual annotation and BAC sequencing, we improved annotation of 43 gene sequences compared with previously reported Grs in the most studied lepidopteran model, the silkworm, Bombyx mori, and identified 7 new tandem copies of BmGr30 on chromosome 7, bringing the total number of BmGrs to 76. Among these, we mapped 68 genes to chromosomes in a newly constructed chromosome distribution map and 8 genes to scaffolds; we also found new evidence for large clusters of BmGrs, especially from the bitter receptor family. RNA-seq analysis of diverse BmGr expression patterns in chemosensory organs of larvae and adults enabled us to draw a precise organ specific map of BmGr expression. Interestingly, most of the clustered genes were expressed in the same tissues and more than half of the genes were expressed in larval maxillae, larval thoracic legs and adult legs. For example, BmGr63 showed high expression levels in all organs in both larval and adult stages. By contrast, some genes showed expression limited to specific developmental stages or organs and tissues. BmGr19 was highly expressed in larval chemosensory organs (especially antennae and thoracic legs), the single exon genes BmGr53 and BmGr67 were expressed exclusively in larval tissues, the BmGr27-BmGr31 gene cluster on chr7 displayed a high expression level limited to adult legs and the candidate CO2 receptor BmGr2 was highly expressed in adult antennae, where few other Grs were expressed. Transcriptional analysis of the Grs in B. mori provides a valuable new reference for

  10. Identification of the Core Set of Carbon-Associated Genes in a Bioenergy Grassland Soil.

    Directory of Open Access Journals (Sweden)

    Adina Howe

    Despite the central role of soil microbial communities in global carbon (C) cycling, little is known about soil microbial community structure and even less about their metabolic pathways. Efforts to characterize soil communities often focus on identifying differences in gene content across environmental gradients, but an alternative question is what genes are similar in soils. These genes may indicate critical species or potential functions that are required in all soils. Here we identified the "core" set of C cycling sequences widely present in multiple soil metagenomes from a fertilized prairie (FP). Of 226,887 sequences associated with known enzymes involved in the synthesis, metabolism, and transport of carbohydrates, 843 were identified to be consistently prevalent across four replicate soil metagenomes. This core metagenome was functionally and taxonomically diverse, representing five enzyme classes and 99 enzyme families within the CAZy database. Though it comprised only 0.4% of all CAZy-associated genes identified in FP metagenomes, the core was found to comprise functions similar to those within cumulative soils. The FP CAZy-associated core sequences were present in multiple publicly available soil metagenomes and most similar to soils sharing geographic proximity. In soil ecosystems, where high diversity remains a key challenge for metagenomic investigations, these core genes represent a subset of critical functions necessary for carbohydrate metabolism, which can be targeted to evaluate important C fluxes in these and other similar soils.
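
    The idea of a core set shared by all replicates can be illustrated as a set intersection. The following is a minimal sketch, assuming each replicate metagenome has already been reduced to a set of sequence identifiers. The identifiers and the `core_set` helper are hypothetical, and the abstract does not state how the authors defined "consistently prevalent", so this is not their pipeline.

```python
from functools import reduce


def core_set(replicates):
    """Return the identifiers present in every replicate (hypothetical helper)."""
    if not replicates:
        return set()
    # Intersect the replicates pairwise, left to right.
    return reduce(set.intersection, (set(r) for r in replicates))


# Made-up identifiers standing in for CAZy-associated sequences
# observed in four replicate metagenomes.
replicates = [
    {"GH13_a", "GT2_b", "CE4_c", "GH5_d"},
    {"GH13_a", "GT2_b", "GH5_d"},
    {"GH13_a", "GT2_b", "CE4_c"},
    {"GH13_a", "GT2_b", "PL1_e"},
]
core = core_set(replicates)
```

    A strict intersection is the simplest possible definition; an abundance or prevalence threshold per replicate would be a natural refinement.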

  11. The enrichment of TATA box and the scarcity of depleted proximal nucleosome in the promoters of duplicated yeast genes.

    Science.gov (United States)

    Kim, Yuseob; Lee, Jang H; Babbitt, Gregory A

    2010-01-01

    Population genetic theory of gene duplication suggests that the preservation of duplicate copies requires functional divergence upon duplication. Genes that can be readily modified to produce new gene expression patterns may thus be duplicated often. In yeast, genes exhibit dichotomous expression patterns based on their promoter architectures. The expression of genes that contain TATA box or occupied proximal nucleosome (OPN) tends to be variable and respond to external signals. On the other hand, genes without TATA box or with depleted proximal nucleosome (DPN) are expressed constitutively. We find that recent duplicates in the yeast genome are heavily biased to be TATA box containing genes and not to be DPN genes. This suggests that variably expressed genes, due to the functional organization in their promoters, have higher duplicability than constitutively expressed genes.

  12. Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

    Science.gov (United States)

    Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

    2017-07-12

    The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.

  13. Enrichment of G2/M cell cycle phase in human pluripotent stem cells enhances HDR-mediated gene repair with customizable endonucleases.

    Science.gov (United States)

    Yang, Diane; Scavuzzo, Marissa A; Chmielowiec, Jolanta; Sharp, Robert; Bajic, Aleksandar; Borowiak, Malgorzata

    2016-02-18

    Efficient gene editing is essential to fully utilize human pluripotent stem cells (hPSCs) in regenerative medicine. Custom endonuclease-based gene targeting involves two mechanisms of DNA repair: homology directed repair (HDR) and non-homologous end joining (NHEJ). HDR is the preferred mechanism for common applications such as knock-in, knock-out or precise mutagenesis, but remains inefficient in hPSCs. Here, we demonstrate that synchronizing hPSCs in G2/M phase with ABT increases on-target gene editing, defined as correct targeting cassette integration, 3 to 6 fold. We observed improved efficiency using ZFNs, TALENs, two CRISPR/Cas9, and CRISPR/Cas9 nickase to target five genes in three hPSC lines: three human embryonic stem cell lines, neural progenitors and diabetic iPSCs. Reversible synchronization has no effect on pluripotency or differentiation. The increase in on-target gene editing is locus-independent and specific to the cell cycle phase as G2/M phase enriched cells show a 6-fold increase in targeting efficiency compared to cells in G1 phase. Concurrently inhibiting NHEJ with SCR7 does not increase HDR or improve gene targeting efficiency further, indicating that HR is the major DNA repair mechanism after G2/M phase arrest. The approach outlined here makes gene editing in hPSCs a more viable tool for disease modeling, regenerative medicine and cell-based therapies.

  14. Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease.

    Science.gov (United States)

    Azuaje, Francisco; Zheng, Huiru; Camargo, Anyela; Wang, Haiying

    2011-08-01

    The discovery of novel disease biomarkers is a crucial challenge for translational bioinformatics. Demonstration of both their classification power and reproducibility across independent datasets are essential requirements to assess their potential clinical relevance. Small datasets and multiplicity of putative biomarker sets may explain lack of predictive reproducibility. Studies based on pathway-driven discovery approaches have suggested that, despite such discrepancies, the resulting putative biomarkers tend to be implicated in common biological processes. Investigations of this problem have been mainly focused on datasets derived from cancer research. We investigated the predictive and functional concordance of five methods for discovering putative biomarkers in four independently-generated datasets from the cardiovascular disease domain. A diversity of biosignatures was identified by the different methods. However, we found strong biological process concordance between them, especially in the case of methods based on gene set analysis. With a few exceptions, we observed lack of classification reproducibility using independent datasets. Partial overlaps between our putative sets of biomarkers and the primary studies exist. Despite the observed limitations, pathway-driven or gene set analysis can predict potentially novel biomarkers and can jointly point to biomedically-relevant underlying molecular mechanisms. Copyright © 2011 Elsevier Inc. All rights reserved.

  15. Genome-Wide Gene Set Analysis for Identification of Pathways Associated with Alcohol Dependence

    Science.gov (United States)

    Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.

    2013-01-01

    It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047
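
    One simple way to aggregate per-SNP evidence into a single gene set statistic is Fisher's combined probability test. The sketch below is an illustration only: the abstract does not say which GSA statistic was used, the function name is hypothetical, and Fisher's method assumes independent tests, which SNPs in linkage disequilibrium violate, so in practice the null distribution is usually obtained by permutation.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution
    with 2k degrees of freedom under the null hypothesis.
    """
    k = len(p_values)
    if k == 0:
        raise ValueError("need at least one p-value")
    half = -sum(math.log(p) for p in p_values)  # equals X / 2
    # Closed-form chi-square survival function for even df = 2k:
    # exp(-X/2) * sum_{i=0}^{k-1} (X/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return math.exp(-half) * total


# Hypothetical per-SNP p-values for one pathway.
set_p = fisher_combined_p([0.01, 0.02, 0.03])
```

    With a single p-value the function returns that p-value unchanged, which is a quick sanity check on the closed form.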

  16. Cardiac-enriched BAF chromatin-remodeling complex subunit Baf60c regulates gene expression programs essential for heart development and function

    Directory of Open Access Journals (Sweden)

    Xin Sun

    2018-01-01

    How chromatin-remodeling complexes modulate gene networks to control organ-specific properties is not well understood. For example, Baf60c (Smarcd3) encodes a cardiac-enriched subunit of the SWI/SNF-like BAF chromatin complex, but its role in heart development is not fully understood. We found that constitutive loss of Baf60c leads to embryonic cardiac hypoplasia and pronounced cardiac dysfunction. Conditional deletion of Baf60c in cardiomyocytes resulted in postnatal dilated cardiomyopathy with impaired contractile function. Baf60c regulates a gene expression program that includes genes encoding contractile proteins, modulators of sarcomere function, and cardiac metabolic genes. Many of the genes deregulated in Baf60c null embryos are targets of the MEF2/SRF co-factor Myocardin (MYOCD). In a yeast two-hybrid screen, we identified MYOCD as a BAF60c interacting factor; we showed that BAF60c and MYOCD directly and functionally interact. We conclude that Baf60c is essential for coordinating a program of gene expression that regulates the fundamental functional properties of cardiomyocytes.

  17. Enrichment of conserved synaptic activity-responsive element in neuronal genes predicts a coordinated response of MEF2, CREB and SRF.

    Directory of Open Access Journals (Sweden)

    Fernanda M Rodríguez-Tornos

    A unique synaptic activity-responsive element (SARE) sequence, composed of the consensus binding sites for SRF, MEF2 and CREB, is necessary for control of transcriptional upregulation of the Arc gene in response to synaptic activity. We hypothesize that this sequence is a broad mechanism that regulates gene expression in response to synaptic activation and during plasticity; and that analysis of SARE-containing genes could identify molecular mechanisms involved in brain disorders. To search for conserved SARE sequences in the mammalian genome, we used the SynoR in silico tool, and found the SARE cluster predominantly in the regulatory regions of genes expressed specifically in the nervous system; most were related to neural development and homeostatic maintenance. Two of these SARE sequences were tested in luciferase assays and proved to promote transcription in response to neuronal activation. Supporting the predictive capacity of our candidate list, up-regulation of several SARE-containing genes in response to neuronal activity was validated using external data and also experimentally using primary cortical neurons and quantitative real time RT-PCR. The list of SARE-containing genes includes several linked to mental retardation and cognitive disorders, and is significantly enriched in genes that encode mRNA targeted by FMRP (fragile X mental retardation protein). Our study thus supports the idea that SARE sequences are relevant transcriptional regulatory elements that participate in plasticity. In addition, it offers a comprehensive view of how activity-responsive transcription factors coordinate their actions and increase the selectivity of their targets. Our data suggest that analysis of SARE-containing genes will reveal yet-undescribed pathways of synaptic plasticity and additional candidate genes disrupted in mental disease.

  18. Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

    Directory of Open Access Journals (Sweden)

    Karacali Bilge

    2007-10-01

    Background: Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal) samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction errors of profiles identified with supervised univariate feature selection algorithms were compared to those of profiles selected randomly from (a) all genes on the microarray platform and (b) a list of known disease-related genes (a priori selection). We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results: Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection; however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across datasets to samples profiled on the same microarray platform. Conclusion: Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine

  19. MADS goes genomic in conifers: towards determining the ancestral set of MADS-box genes in seed plants.

    Science.gov (United States)

    Gramzow, Lydia; Weilandt, Lisa; Theißen, Günter

    2014-11-01

    MADS-box genes comprise a gene family coding for transcription factors. This gene family expanded greatly during land plant evolution such that the number of MADS-box genes ranges from one or two in green algae to around 100 in angiosperms. Given the crucial functions of MADS-box genes for nearly all aspects of plant development, the expansion of this gene family probably contributed to the increasing complexity of plants. However, the expansion of MADS-box genes during one important step of land plant evolution, namely the origin of seed plants, remains poorly understood due to the previous lack of whole-genome data for gymnosperms. The newly available genome sequences of Picea abies, Picea glauca and Pinus taeda were used to identify the complete set of MADS-box genes in these conifers. In addition, MADS-box genes were identified in the growing number of transcriptomes available for gymnosperms. With these datasets, phylogenies were constructed to determine the ancestral set of MADS-box genes of seed plants and to infer the ancestral functions of these genes. Type I MADS-box genes are under-represented in gymnosperms and only a minimum of two Type I MADS-box genes have been present in the most recent common ancestor (MRCA) of seed plants. In contrast, a large number of Type II MADS-box genes were found in gymnosperms. The MRCA of extant seed plants probably possessed at least 11-14 Type II MADS-box genes. In gymnosperms two duplications of Type II MADS-box genes were found, such that the MRCA of extant gymnosperms had at least 14-16 Type II MADS-box genes. The implied ancestral set of MADS-box genes for seed plants shows simplicity for Type I MADS-box genes and remarkable complexity for Type II MADS-box genes in terms of phylogeny and putative functions. The analysis of transcriptome data reveals that gymnosperm MADS-box genes are expressed in a great variety of tissues, indicating diverse roles of MADS-box genes for the development of gymnosperms. This study is

  20. Evaluation of endogenous control genes for gene expression studies across multiple tissues and in the specific sets of fat- and muscle-type samples of the pig.

    Science.gov (United States)

    Gu, Y R; Li, M Z; Zhang, K; Chen, L; Jiang, A A; Wang, J Y; Li, X W

    2011-08-01

    To normalize a set of quantitative real-time PCR (q-PCR) data, it is essential to determine an optimal number/set of housekeeping genes, as the abundance of housekeeping genes can vary across tissues or cells during different developmental stages, or even under certain environmental conditions. In this study, of the 20 commonly used endogenous control genes, 13, 18 and 17 genes exhibited credible stability in 56 different tissues, 10 types of adipose tissue and five types of muscle tissue, respectively. Our analysis clearly showed that three optimal housekeeping genes are adequate for an accurate normalization, which correlated well with the theoretical optimal number (r ≥ 0.94). In terms of economical and experimental feasibility, we recommend the use of the three most stable housekeeping genes for calculating the normalization factor. Based on our results, the three most stable housekeeping genes in all analysed samples (TOP2B, HSPCB and YWHAZ) are recommended for accurate normalization of q-PCR data. We also suggest that two different sets of housekeeping genes are appropriate for 10 types of adipose tissue (the HSPCB, ALDOA and GAPDH genes) and five types of muscle tissue (the TOP2B, HSPCB and YWHAZ genes), respectively. Our report will serve as a valuable reference for other studies aimed at measuring tissue-specific mRNA abundance in porcine samples. © 2011 Blackwell Verlag GmbH.
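
    A normalization factor built from several housekeeping genes is commonly taken as the geometric mean of their relative quantities in each sample. The sketch below assumes that convention; the abstract does not spell out the formula, and the quantities shown are made-up values for illustration.

```python
import math


def normalization_factor(ref_quantities):
    """Geometric mean of the reference-gene relative quantities for one sample."""
    values = list(ref_quantities)
    if not values:
        raise ValueError("need at least one reference gene")
    return math.exp(sum(math.log(q) for q in values) / len(values))


# Hypothetical relative quantities of the three recommended reference genes
# in a single sample.
sample_refs = {"TOP2B": 2.0, "HSPCB": 4.0, "YWHAZ": 1.0}
nf = normalization_factor(sample_refs.values())

# A target gene's relative quantity is divided by the factor.
target_quantity = 6.0  # hypothetical
normalized_target = target_quantity / nf
```

    The geometric mean is preferred over the arithmetic mean here because it is less sensitive to a single reference gene with an outlying abundance.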

  1. Comprehensive ecosystem model-data synthesis using multiple data sets at two temperate forest free-air CO2 enrichment experiments: Model performance at ambient CO2 concentration

    Science.gov (United States)

    Walker, Anthony P.; Hanson, Paul J.; De Kauwe, Martin G.; Medlyn, Belinda E.; Zaehle, Sönke; Asao, Shinichi; Dietze, Michael; Hickler, Thomas; Huntingford, Chris; Iversen, Colleen M.; Jain, Atul; Lomas, Mark; Luo, Yiqi; McCarthy, Heather; Parton, William J.; Prentice, I. Colin; Thornton, Peter E.; Wang, Shusen; Wang, Ying-Ping; Warlind, David; Weng, Ensheng; Warren, Jeffrey M.; Woodward, F. Ian; Oren, Ram; Norby, Richard J.

    2014-05-01

    Free-air CO2 enrichment (FACE) experiments provide a remarkable wealth of data which can be used to evaluate and improve terrestrial ecosystem models (TEMs). In the FACE model-data synthesis project, 11 TEMs were applied to two decadelong FACE experiments in temperate forests of the southeastern U.S.—the evergreen Duke Forest and the deciduous Oak Ridge Forest. In this baseline paper, we demonstrate our approach to model-data synthesis by evaluating the models' ability to reproduce observed net primary productivity (NPP), transpiration, and leaf area index (LAI) in ambient CO2 treatments. Model outputs were compared against observations using a range of goodness-of-fit statistics. Many models simulated annual NPP and transpiration within observed uncertainty. We demonstrate, however, that high goodness-of-fit values do not necessarily indicate a successful model, because simulation accuracy may be achieved through compensating biases in component variables. For example, transpiration accuracy was sometimes achieved with compensating biases in leaf area index and transpiration per unit leaf area. Our approach to model-data synthesis therefore goes beyond goodness-of-fit to investigate the success of alternative representations of component processes. Here we demonstrate this approach by comparing competing model hypotheses determining peak LAI. Of three alternative hypotheses—(1) optimization to maximize carbon export, (2) increasing specific leaf area with canopy depth, and (3) the pipe model—the pipe model produced peak LAI closest to the observations. This example illustrates how data sets from intensive field experiments such as FACE can be used to reduce model uncertainty despite compensating biases by evaluating individual model assumptions.

  2. PLANT HOMOLOGOUS TO PARAFIBROMIN is a component of the PAF1 complex and assists in regulating expression of genes within H3K27ME3-enriched chromatin.

    Science.gov (United States)

    Park, Sunchung; Oh, Sookyung; Ek-Ramos, Julissa; van Nocker, Steven

    2010-06-01

    The human Paf1 complex (Paf1C) subunit Parafibromin assists in mediating output from the Wingless/Int signaling pathway, and dysfunction of the encoding gene HRPT2 conditions specific cancer-related disease phenotypes. Here, we characterize the organismal and molecular roles of PLANT HOMOLOGOUS TO PARAFIBROMIN (PHP), the Arabidopsis (Arabidopsis thaliana) homolog of Parafibromin. PHP resides in an approximately 670-kD protein complex in nuclear extracts, and physically interacts with other known Paf1C-related proteins in vivo. In striking contrast to the developmental pleiotropy conferred by mutation in other plant Paf1C component genes in Arabidopsis, loss of PHP specifically conditioned accelerated phase transition from vegetative growth to flowering and resulted in misregulation of a very limited subset of genes that included the flowering repressor FLOWERING LOCUS C. Those genes targeted by PHP were distinguished from the bulk of Arabidopsis genes and other plant Paf1C targets by strong enrichment for trimethylation of lysine-27 on histone H3 (H3K27me3) within chromatin. These findings suggest that PHP is a component of a plant Paf1C protein in Arabidopsis, but has a more specialized role in modulating expression of a subset of Paf1C targets.

  3. Glutamatergic and GABAergic gene sets in attention-deficit/hyperactivity disorder: association to overlapping traits in ADHD and autism.

    Science.gov (United States)

    Naaijen, J; Bralten, J; Poelmans, G; Glennon, J C; Franke, B; Buitelaar, J K

    2017-01-10

    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) often co-occur. Both are highly heritable; however, it has been difficult to discover genetic risk variants. Glutamate and GABA are the main excitatory and inhibitory neurotransmitters in the brain; their balance is essential for proper brain development and functioning. In this study we investigated the role of glutamate and GABA genetics in ADHD severity, autism symptom severity and inhibitory performance, based on gene set analysis, an approach to investigate multiple genetic variants simultaneously. Common variants within glutamatergic and GABAergic genes were investigated using the MAGMA software in an ADHD case-only sample (n=931), in which we assessed ASD symptoms and response inhibition on a Stop task. Gene set analyses for ADHD symptom severity, divided into inattention and hyperactivity/impulsivity symptoms, autism symptom severity and inhibition were performed using principal component regression analyses. Subsequently, gene-wide association analyses were performed. The glutamate gene set showed an association with severity of hyperactivity/impulsivity (P=0.009), which was robust to correcting for genome-wide association levels. The GABA gene set showed nominally significant association with inhibition (P=0.04), but this did not survive correction for multiple comparisons. None of the single-gene or single-variant associations was significant on its own. By analyzing multiple genetic variants within candidate gene sets together, we were able to find genetic associations supporting the involvement of excitatory and inhibitory neurotransmitter systems in ADHD and ASD symptom severity in ADHD.

  4. Meta-analysis of Drosophila circadian microarray studies identifies a novel set of rhythmically expressed genes.

    Directory of Open Access Journals (Sweden)

    Kevin P Keegan

    2007-11-01

    Five independent groups have reported microarray studies that identify dozens of rhythmically expressed genes in the fruit fly Drosophila melanogaster. Limited overlap among the lists of discovered genes makes it difficult to determine which, if any, exhibit truly rhythmic patterns of expression. We reanalyzed data from all five reports and found two sources for the observed discrepancies: the use of different expression pattern detection algorithms and underlying variation among the datasets. To improve upon the methods originally employed, we developed a new analysis that involves compilation of all existing data, application of identical transformation and standardization procedures followed by ANOVA-based statistical prescreening, and three separate classes of post hoc analysis: cross-correlation to various cycling waveforms, autocorrelation, and a previously described fast Fourier transform-based technique. Permutation-based statistical tests were used to derive significance measures for all post hoc tests. We find that application of our method, most notably the ANOVA prescreening procedure, significantly reduces the false discovery rate relative to that observed among the results of the original five reports while maintaining desirable statistical power. We identify a set of 81 cycling transcripts previously found in one or more of the original reports as well as a novel set of 133 transcripts not found in any of the original studies. We introduce a novel analysis method that compensates for variability observed among the original five Drosophila circadian array reports. Based on the statistical fidelity of our meta-analysis results, and the results of our initial validation experiments (quantitative RT-PCR), we predict many of our newly found genes to be bona fide cyclers, and suggest that they may lead to new insights into the pathways through which clock mechanisms regulate behavioral rhythms.
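
    The combination of an autocorrelation score with a permutation-based significance measure can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the function names, the 4-hour sampling interval, the 24-hour lag, and the synthetic cosine series are all hypothetical.

```python
import math
import random


def autocorr(series, lag):
    """Sample autocorrelation of a series at the given lag."""
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series)
    if var == 0:
        return 0.0
    cov = sum((series[i] - mean) * (series[i + lag] - mean) for i in range(n - lag))
    return cov / var


def permutation_p(series, lag, n_perm=1000, seed=0):
    """One-sided permutation p-value: how often shuffled time points
    give an autocorrelation at least as large as the observed one."""
    rng = random.Random(seed)
    observed = autocorr(series, lag)
    shuffled = list(series)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if autocorr(shuffled, lag) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


# Synthetic transcript sampled every 4 h for 48 h with a 24 h period;
# a lag of 6 samples therefore corresponds to 24 h.
series = [math.cos(2 * math.pi * k / 6) for k in range(12)]
r24 = autocorr(series, 6)
p24 = permutation_p(series, 6)
```

    Shuffling time points destroys any temporal structure while keeping the distribution of expression values, which is why the shuffled scores serve as a null reference.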

  5. Flavanol-Enriched Cocoa Powder Alters the Intestinal Microbiota, Tissue and Fluid Metabolite Profiles, and Intestinal Gene Expression in Pigs

    Science.gov (United States)

    Jang, Saebyeol; Sun, Jianghao; Chen, Pei; Lakshman, Sukla; Molokin, Aleksey; Harnly, James M; Vinyard, Bryan T; Urban, Joseph F; Davis, Cindy D; Solano-Aguilar, Gloria

    2016-01-01

    Background: Consumption of cocoa-derived polyphenols has been associated with several health benefits; however, their effects on the intestinal microbiome and related features of host intestinal health are not adequately understood. Objective: The objective of this study was to determine the effects of eating flavanol-enriched cocoa powder on the composition of the gut microbiota, tissue metabolite profiles, and intestinal immune status. Methods: Male pigs (5 mo old, 28 kg mean body weight) were supplemented with 0, 2.5, 10, or 20 g flavanol-enriched cocoa powder/d for 27 d. Metabolites in serum, urine, the proximal colon contents, liver, and adipose tissue; bacterial abundance in the intestinal contents and feces; and intestinal tissue gene expression of inflammatory markers and Toll-like receptors (TLRs) were then determined. Results: O-methyl-epicatechin-glucuronide conjugates dose-dependently increased (P cocoa powder. The concentration of 3-hydroxyphenylpropionic acid isomers in urine decreased as the dose of cocoa powder fed to pigs increased (75–85%, P cocoa powder/d, respectively. Moreover, consumption of cocoa powder reduced TLR9 gene expression in ileal Peyer’s patches (67–80%, P cocoa powder/d compared with pigs not supplemented with cocoa powder. Conclusion: This study demonstrates that consumption of cocoa powder by pigs can contribute to gut health by enhancing the abundance of Lactobacillus and Bifidobacterium species and modulating markers of localized intestinal immunity. PMID:26936136

  6. An ancient dental gene set governs development and continuous regeneration of teeth in sharks.

    Science.gov (United States)

    Rasch, Liam J; Martin, Kyle J; Cooper, Rory L; Metscher, Brian D; Underwood, Charlie J; Fraser, Gareth J

    2016-07-15

    The evolution of oral teeth is considered a major contributor to the overall success of jawed vertebrates. This is especially apparent in cartilaginous fishes including sharks and rays, which develop elaborate arrays of highly specialized teeth, organized in rows and retain the capacity for life-long regeneration. Perpetual regeneration of oral teeth has been either lost or highly reduced in many other lineages including important developmental model species, so cartilaginous fishes are uniquely suited for deep comparative analyses of tooth development and regeneration. Additionally, sharks and rays can offer crucial insights into the characters of the dentition in the ancestor of all jawed vertebrates. Despite this, tooth development and regeneration in chondrichthyans is poorly understood and remains virtually uncharacterized from a developmental genetic standpoint. Using the emerging chondrichthyan model, the catshark (Scyliorhinus spp.), we characterized the expression of genes homologous to those known to be expressed during stages of early dental competence, tooth initiation, morphogenesis, and regeneration in bony vertebrates. We have found that expression patterns of several genes from Hh, Wnt/β-catenin, Bmp and Fgf signalling pathways indicate deep conservation over ~450 million years of tooth development and regeneration. We describe how these genes participate in the initial emergence of the shark dentition and how they are redeployed during regeneration of successive tooth generations. We suggest that at the dawn of the vertebrate lineage, teeth (i) were most likely continuously regenerative structures, and (ii) utilised a core set of genes from members of key developmental signalling pathways that were instrumental in creating a dental legacy redeployed throughout vertebrate evolution. These data lay the foundation for further experimental investigations utilizing the unique regenerative capacity of chondrichthyan models to answer evolutionary

  7. Isotope enrichment

    International Nuclear Information System (INIS)

    Lydtin, H-J.; Wilden, R.J.; Severin, P.J.W.

    1978-01-01

    The isotope enrichment method described is based on the recognition that, owing to mass diffusion and thermal diffusion in the conversion of substances at a heated substrate while depositing an element or compound onto the substrate, enrichment of the element, or a compound of the element, with a lighter isotope will occur. The cycle is repeated as many times as necessary to obtain the required degree of enrichment.

  8. Development of a set of SNP markers present in expressed genes of the apple.

    Science.gov (United States)

    Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

    2008-11-01

    Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequence tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR amplification was performed using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5 map, using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.

  9. Uranium enrichment

    International Nuclear Information System (INIS)

    1990-01-01

    This report looks at the following issues: How much Soviet uranium ore and enriched uranium are imported into the United States and what is the extent to which utilities flag swap to disguise these purchases? What are the U.S.S.R.'s enriched uranium trading practices? To what extent are utilities required to return used fuel to the Soviet Union as part of the enriched uranium sales agreement? Why have U.S. utilities ended their contracts to buy enrichment services from DOE?

  10. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  11. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes

    DEFF Research Database (Denmark)

    de Jong, Simone; Boks, Marco P M; Fuller, Tova F

    2012-01-01

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood...... of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co......, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes...

  12. Uranium enrichment

    International Nuclear Information System (INIS)

    1989-01-01

    GAO was asked to address several questions concerning a number of proposed uranium enrichment bills introduced during the 100th Congress. The bills would have restructured the Department of Energy's uranium enrichment program as a government corporation to allow it to compete more effectively in the domestic and international markets. Some of GAO's findings discussed are: uranium market experts believe and existing market models show that the proposed DOE purchase of $750 million of uranium from domestic producers may not significantly increase production because of large producer-held inventories; excess uranium enrichment production capacity exists throughout the world; therefore, foreign producers are expected to compete heavily in the United States throughout the 1990s as utilities' contracts with DOE expire; and according to a 1988 agreement between DOE's Offices of Nuclear Energy and Defense Programs, enrichment decommissioning costs, estimated to total $3.6 billion for planning purposes, will be shared by the commercial enrichment program and the government

  13. Transcriptome-wide selection of a reliable set of reference genes for gene expression studies in potato cyst nematodes (Globodera spp.).

    Science.gov (United States)

    Sabeh, Michael; Duceppe, Marc-Olivier; St-Arnaud, Marc; Mimee, Benjamin

    2018-01-01

    Relative gene expression analyses by qRT-PCR (quantitative reverse transcription PCR) require an internal control to normalize the expression data of genes of interest and eliminate the unwanted variation introduced by sample preparation. A perfect reference gene should have a constant expression level under all the experimental conditions. However, the same few housekeeping genes selected from the literature or successfully used in previous unrelated experiments are often routinely used in new conditions without proper validation of their stability across treatments. The advent of RNA-Seq and the availability of public datasets for numerous organisms are opening the way to finding better reference genes for expression studies. Globodera rostochiensis is a plant-parasitic nematode that is particularly yield-limiting for potato. The aim of our study was to identify a reliable set of reference genes to study G. rostochiensis gene expression. Gene expression levels from an RNA-Seq database were used to identify putative reference genes and were validated with qRT-PCR analysis. Three genes, GR, PMP-3, and aaRS, were found to be very stable within the experimental conditions of this study and are proposed as reference genes for future work.

  14. A Meta-Analysis of Multiple Matched Copy Number and Transcriptomics Data Sets for Inferring Gene Regulatory Relationships

    Science.gov (United States)

    Newton, Richard; Wernisch, Lorenz

    2014-01-01

    Inferring gene regulatory relationships from observational data is challenging. Manipulation and intervention is often required to unravel causal relationships unambiguously. However, gene copy number changes, as they frequently occur in cancer cells, might be considered natural manipulation experiments on gene expression. An increasing number of data sets on matched array comparative genomic hybridisation and transcriptomics experiments from a variety of cancer pathologies are becoming publicly available. Here we explore the potential of a meta-analysis of thirty such data sets. The aim of our analysis was to assess the potential of in silico inference of trans-acting gene regulatory relationships from this type of data. We found sufficient correlation signal in the data to infer gene regulatory relationships, with interesting similarities between data sets. A number of genes had highly correlated copy number and expression changes in many of the data sets and we present predicted potential trans-acted regulatory relationships for each of these genes. The study also investigates to what extent heterogeneity between cell types and between pathologies determines the number of statistically significant predictions available from a meta-analysis of experiments. PMID:25148247

  15. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Background: Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with the objective of isolating as many functional genes as possible from J. curcas by large-scale sequencing of expressed sequence tags (ESTs). Results: A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 10⁶ clones and the average insert size of the clones was 2.1 kb. In total, 12,084 ESTs were sequenced to an average high-quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes, which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes, which did not show similarity with any genes in the public database, may encode unique genes. Functional classification revealed unigenes related to a broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions: The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding
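
    The counts reported in this abstract can be cross-checked with simple arithmetic. The sketch below is not the authors' pipeline; it assumes the usual convention that unigenes are contigs plus singletons, and the function and variable names are hypothetical.

```python
def summarize_assembly(contigs, singletons, ests_in_contigs):
    """Derive summary figures from EST assembly counts.

    Assumes unigenes = contigs + singletons, and that every sequenced EST
    is either part of a contig or a singleton.
    """
    return {
        "unigenes": contigs + singletons,
        "total_ests": ests_in_contigs + singletons,
        "mean_ests_per_contig": ests_in_contigs / contigs,
    }


# Figures taken from the abstract above.
summary = summarize_assembly(contigs=2258, singletons=4751, ests_in_contigs=7333)

# BLASTX annotation classes reported in the abstract.
annotation = {
    "known_genes": 3982,
    "unknown_hypothetical_putative": 2836,
    "no_similarity": 191,
}

print(summary["unigenes"])       # contigs + singletons
print(summary["total_ests"])     # should match the 12,084 sequenced ESTs
print(sum(annotation.values()))  # should match the unigene count
```

    Under these assumptions the three annotation classes add up to the reported unigene total, and the ESTs in contigs plus the singletons add up to the reported number of sequenced ESTs.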

  16. Classification of Non-Small Cell Lung Cancer Using Significance Analysis of Microarray-Gene Set Reduction Algorithm

    Directory of Open Access Journals (Sweden)

    Lei Zhang

    2016-01-01

    Among non-small cell lung cancers (NSCLC), adenocarcinoma (AC) and squamous cell carcinoma (SCC) are the two major histological subtypes, accounting for roughly 40% and 30% of all lung cancer cases, respectively. Since AC and SCC differ in their cell of origin, location within the lung, and growth pattern, they are considered distinct diseases. Gene expression signatures have been demonstrated to be an effective tool for distinguishing AC and SCC. Gene set analysis is regarded as irrelevant to the identification of gene expression signatures. Nevertheless, we found that one specific gene set analysis method, significance analysis of microarray-gene set reduction (SAMGSR), can be adopted directly to select relevant features and to construct gene expression signatures. In this study, we applied SAMGSR to an NSCLC gene expression dataset. When compared with several novel feature selection algorithms, for example, LASSO, SAMGSR has equivalent or better performance in terms of predictive ability and model parsimony. Therefore, SAMGSR is indeed a feature selection algorithm. Additionally, we applied SAMGSR to AC and SCC subtypes separately to discriminate their respective stages, that is, stage II versus stage I. Few overlaps between these two resulting gene signatures illustrate that AC and SCC are technically distinct diseases. Therefore, stratified analyses on subtypes are recommended when diagnostic or prognostic signatures of these two NSCLC subtypes are constructed.

  17. Uranium enrichment

    International Nuclear Information System (INIS)

    Rae, H.K.; Melvin, J.G.

    1988-06-01

    Canada is the world's largest producer and exporter of uranium, most of which is enriched elsewhere for use as fuel in LWRs. The feasibility of a Canadian uranium-enrichment enterprise is therefore a perennial question. Recent developments in uranium-enrichment technology, and their likely impacts on separative work supply and demand, suggest an opportunity window for Canadian entry into this international market. The Canadian opportunity results from three particular impacts of the new technologies: 1) the bulk of the world's uranium-enrichment capacity is in gaseous diffusion plants which, because of their large requirements for electricity (more than 2000 kW·h per SWU), are vulnerable to competition from the new processes; 2) the decline in enrichment costs increases the economic incentive for the use of slightly-enriched uranium (SEU) fuel in CANDU reactors, thus creating a potential Canadian market; and 3) the new processes allow economic operation on a much smaller scale, which drastically reduces the investment required for market entry and is comparable with the potential Canadian SEU requirement. The opportunity is not open-ended. By the end of the century the enrichment supply industry will have adapted to the new processes and long-term customer/supplier relationships will have been established. In order to seize the opportunity, Canada must become a credible supplier during this century

  18. Gene Set Analyses of Genome-Wide Association Studies on 49 Quantitative Traits Measured in a Single Genetic Epidemiology Dataset

    Directory of Open Access Journals (Sweden)

    Jihye Kim

    2013-09-01

    Gene set analysis is a powerful tool for interpreting a genome-wide association study result and is gaining popularity these days. Comparison of the gene sets obtained for a variety of traits measured from a single genetic epidemiology dataset may give insights into the biological mechanisms underlying these traits. Based on the previously published single nucleotide polymorphism (SNP) genotype data on 8,842 individuals enrolled in the Korea Association Resource project, we performed a series of systematic genome-wide association analyses for 49 quantitative traits of basic epidemiological, anthropometric, or blood chemistry parameters. Each analysis result was subjected to subsequent gene set analyses based on Gene Ontology (GO) terms using gene set analysis software, GSA-SNP, identifying a set of GO terms significantly associated to each trait (pcorr < 0.05). Pairwise comparison of the traits in terms of the semantic similarity in their GO sets revealed surprising cases where phenotypically uncorrelated traits showed high similarity in terms of biological pathways. For example, the pH level was related to 7 other traits that showed low phenotypic correlations with it. A literature survey implies that these traits may be regulated partly by common pathways that involve neuronal or nerve systems.

  19. Uranium enrichment

    International Nuclear Information System (INIS)

    Mohrhauer, H.

    1982-01-01

    The separation of uranium isotopes in order to enrich the fuel for light water reactors with the light isotope U-235 is an important part of the nuclear fuel cycle. After the basic principles of isotope separation, the gaseous diffusion and centrifuge processes are explained. Both these techniques are employed on an industrial scale. In addition, a short review is given of other enrichment techniques which have been demonstrated at least on a laboratory scale. After some remarks on the present situation in the enrichment market, the progress in the development and the industrial exploitation of the gas centrifuge process by the trinational Urenco-Centec organisation is presented. (orig.)

  20. Enriched whole genome sequencing identified compensatory mutations in the RNA polymerase gene of rifampicin-resistant Mycobacterium leprae strains.

    Science.gov (United States)

    Lavania, Mallika; Singh, Itu; Turankar, Ravindra P; Gupta, Anuj Kumar; Ahuja, Madhvi; Pathak, Vinay; Sengupta, Utpal

    2018-01-01

    Despite more than three decades of multidrug therapy (MDT), leprosy remains a major public health issue in several endemic countries, including India. The emergence of drug resistance in Mycobacterium leprae (M. leprae) is a cause of concern and poses a threat to the leprosy-control program, which might ultimately dampen the achievement of the elimination program of the country. Rifampicin resistance in clinical strains of M. leprae is supposed to arise from harboring bacterial strains with mutations in the 81-bp rifampicin resistance determining region (RRDR) of the rpoB gene. However, complete dynamics of rifampicin resistance are not explained only by this mutation in leprosy strains. To understand the role of other compensatory mutations and transmission dynamics of drug-resistant leprosy, a genome-wide sequencing of 11 M. leprae strains - comprising five rifampicin-resistant strains, five sensitive strains, and one reference strain - was done in this study. We observed the presence of compensatory mutations in two rifampicin-resistant strains in rpoC and mmpL7 genes, along with rpoB, that may additionally be responsible for conferring resistance in those strains. Our findings support the role for compensatory mutation(s) in RNA polymerase gene(s), resulting in rifampicin resistance in relapsed leprosy patients.

  1. Transposable elements are enriched within or in close proximity to xenobiotic-metabolizing cytochrome P450 genes

    Directory of Open Access Journals (Sweden)

    Li Xianchun

    2007-03-01

    Background: Transposons, i.e. transposable elements (TEs), are the major internal spontaneous mutation agents for the variability of eukaryotic genomes. To address the general issue of whether transposons mediate genomic changes in environment-adaptation genes, we scanned two alleles per each of the six xenobiotic-metabolizing Helicoverpa zea cytochrome P450 loci, including CYP6B8, CYP6B27, CYP321A1, CYP321A2, CYP9A12v3 and CYP9A14, for the presence of transposon insertions by genome walking and sequence analysis. We also scanned thirteen Drosophila melanogaster P450 genes for TE insertions by in silico mapping and literature search. Results: Twelve novel transposons, including LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), MITEs (miniature inverted-repeat transposable elements), one full-length transib-like transposon, and one full-length Tc1-like DNA transposon, are identified from the alleles of the six H. zea P450 genes. The twelve transposons are inserted into the 5' flanking region, 3' flanking region, exon, or intron of the six environment-adaptation P450 genes. In D. melanogaster, seven out of the eight Drosophila P450s (CYP4E2, CYP6A2, CYP6A8, CYP6A9, CYP6G1, CYP6W1, CYP12A4, CYP12D1) implicated in insecticide resistance are associated with a variety of transposons. By contrast, all the five Drosophila P450s (CYP302A1, CYP306A1, CYP307A1, CYP314A1 and CYP315A1) involved in ecdysone biosynthesis and developmental regulation are free of TE insertions. Conclusion: These results indicate that TEs are selectively retained within or in close proximity to xenobiotic-metabolizing P450 genes.

  2. RaMP: A Comprehensive Relational Database of Metabolomics Pathways for Pathway Enrichment Analysis of Genes and Metabolites.

    Science.gov (United States)

    Zhang, Bofei; Hu, Senyang; Baskin, Elizabeth; Patt, Andrew; Siddiqui, Jalal K; Mathé, Ewy A

    2018-02-22

    The value of metabolomics in translational research is undeniable, and metabolomics data are increasingly generated in large cohorts. The functional interpretation of disease-associated metabolites though is difficult, and the biological mechanisms that underlie cell type or disease-specific metabolomics profiles are oftentimes unknown. To help fully exploit metabolomics data and to aid in its interpretation, analysis of metabolomics data with other complementary omics data, including transcriptomics, is helpful. To facilitate such analyses at a pathway level, we have developed RaMP (Relational database of Metabolomics Pathways), which combines biological pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG), Reactome, WikiPathways, and the Human Metabolome DataBase (HMDB). To the best of our knowledge, an off-the-shelf, public database that maps genes and metabolites to biochemical/disease pathways and can readily be integrated into other existing software is currently lacking. For consistent and comprehensive analysis, RaMP enables batch and complex queries (e.g., list all metabolites involved in glycolysis and lung cancer), can readily be integrated into pathway analysis tools, and supports pathway overrepresentation analysis given a list of genes and/or metabolites of interest. For usability, we have developed a RaMP R package (https://github.com/Mathelab/RaMP-DB), including a user-friendly RShiny web application, that supports basic simple and batch queries, pathway overrepresentation analysis given a list of genes or metabolites of interest, and network visualization of gene-metabolite relationships. The package also includes the raw database file (mysql dump), thereby providing a stand-alone downloadable framework for public use and integration with other tools. In addition, the Python code needed to recreate the database on another system is also publicly available (https://github.com/Mathelab/RaMP-BackEnd). Updates for databases in RaMP will be
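
    Pathway overrepresentation analysis, as mentioned in this abstract, is commonly framed as a one-sided hypergeometric test. The sketch below illustrates that general technique only; the abstract does not state which test RaMP uses, and the function name and the toy numbers are assumptions made for illustration.

```python
from math import comb


def overrepresentation_pvalue(hits, background, pathway_size, query_size):
    """One-sided hypergeometric p-value, P(X >= hits).

    background   -- number of analytes (genes/metabolites) in the universe
    pathway_size -- number of background analytes annotated to the pathway
    query_size   -- number of analytes in the user's list of interest
    hits         -- number of query analytes that fall in the pathway
    """
    max_hits = min(pathway_size, query_size)
    total = comb(background, query_size)
    # Sum the probability of seeing `hits` or more pathway members by chance.
    favourable = sum(
        comb(pathway_size, i) * comb(background - pathway_size, query_size - i)
        for i in range(hits, max_hits + 1)
    )
    return favourable / total


# Toy example (hypothetical numbers): 10 analytes in the universe, 3 of them
# in the pathway, a query list of 4, of which 2 are pathway members.
p = overrepresentation_pvalue(hits=2, background=10, pathway_size=3, query_size=4)
print(p)
```

    In practice the p-values for many pathways would also need a multiple-testing correction, which this sketch leaves out.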

  3. A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements.

    Directory of Open Access Journals (Sweden)

    Eugeny A Elisaphenko

    2008-06-01

    X-chromosome inactivation, which occurs in female eutherian mammals, is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons, including those with simple tandem repeats detectable in their structure, have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.

  4. GSHR, a Web-Based Platform Provides Gene Set-Level Analyses of Hormone Responses in Arabidopsis

    Directory of Open Access Journals (Sweden)

    Xiaojuan Ran

    2018-01-01

    Phytohormones regulate diverse aspects of plant growth and environmental responses. Recent high-throughput technologies have promoted a more comprehensive profiling of genes regulated by different hormones. However, these omics data generally result in large gene lists that make it challenging to interpret the data and extract insights into biological significance. With the rapid accumulation of these large-scale experiments, especially the transcriptomic data available in public databases, a means of using this information to explore the transcriptional networks is needed. Different platforms have different architectures and designs, and even similar studies using the same platform may obtain data with large variances because of the highly dynamic and flexible effects of plant hormones; this makes it difficult to make comparisons across different studies and platforms. Here, we present a web server providing gene set-level analyses of Arabidopsis thaliana hormone responses. GSHR collected 333 RNA-seq and 1,205 microarray datasets from the Gene Expression Omnibus, characterizing transcriptomic changes in Arabidopsis in response to phytohormones including abscisic acid, auxin, brassinosteroids, cytokinins, ethylene, gibberellins, jasmonic acid, salicylic acid, and strigolactones. These data were further processed and organized into 1,368 gene sets regulated by different hormones or hormone-related factors. By comparing input gene lists to these gene sets, GSHR helped to identify gene sets from the input gene list regulated by different phytohormones or related factors. Together, GSHR links prior information regarding transcriptomic changes induced by hormones and related factors to newly generated data and facilitates cross-study and cross-platform comparisons; this helps facilitate the mining of biologically significant information from large-scale datasets. The GSHR is freely available at http://bioinfo.sibs.ac.cn/GSHR/.

  5. Enriched whole genome sequencing identified compensatory mutations in the RNA polymerase gene of rifampicin-resistant Mycobacterium leprae strains

    Directory of Open Access Journals (Sweden)

    Lavania M

    2018-01-01

    Mallika Lavania,1 Itu Singh,1 Ravindra P Turankar,1 Anuj Kumar Gupta,2 Madhvi Ahuja,1 Vinay Pathak,1 Utpal Sengupta1; 1Stanley Browne Laboratory, The Leprosy Mission Trust India, TLM Community Hospital Nand Nagari, 2Agilent Technologies India Pvt Ltd, Jasola District Centre, New Delhi, India. Abstract: Despite more than three decades of multidrug therapy (MDT), leprosy remains a major public health issue in several endemic countries, including India. The emergence of drug resistance in Mycobacterium leprae (M. leprae) is a cause of concern and poses a threat to the leprosy-control program, which might ultimately dampen the achievement of the elimination program of the country. Rifampicin resistance in clinical strains of M. leprae is supposed to arise from harboring bacterial strains with mutations in the 81-bp rifampicin resistance determining region (RRDR) of the rpoB gene. However, complete dynamics of rifampicin resistance are not explained only by this mutation in leprosy strains. To understand the role of other compensatory mutations and transmission dynamics of drug-resistant leprosy, a genome-wide sequencing of 11 M. leprae strains – comprising five rifampicin-resistant strains, five sensitive strains, and one reference strain – was done in this study. We observed the presence of compensatory mutations in two rifampicin-resistant strains in rpoC and mmpL7 genes, along with rpoB, that may additionally be responsible for conferring resistance in those strains. Our findings support the role for compensatory mutation(s) in RNA polymerase gene(s), resulting in rifampicin resistance in relapsed leprosy patients. Keywords: leprosy, rifampicin resistance, compensatory mutations, next generation sequencing, relapsed, MDT, India

  6. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    Science.gov (United States)

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  7. Performance of single and concatenated sets of mitochondrial genes at inferring metazoan relationships relative to full mitogenome data.

    Directory of Open Access Journals (Sweden)

    Justin C Havird

    Mitochondrial (mt) genes are some of the most popular and widely-utilized genetic loci in phylogenetic studies of metazoan taxa. However, their linked nature has raised questions on whether using the entire mitogenome for phylogenetics is overkill (at best) or pseudoreplication (at worst). Moreover, no studies have addressed the comparative phylogenetic utility of mitochondrial genes across individual lineages within the entire Metazoa. To comment on the phylogenetic utility of individual mt genes as well as concatenated subsets of genes, we analyzed mitogenomic data from 1865 metazoan taxa in 372 separate lineages spanning genera to subphyla. Specifically, phylogenies inferred from these datasets were statistically compared to ones generated from all 13 mt protein-coding (PC) genes (i.e., the "supergene" set) to determine which single genes performed "best" at, and the minimum number of genes required to, recover the "supergene" topology. Surprisingly, the popular marker COX1 performed poorest, while ND5, ND4, and ND2 were most likely to reproduce the "supergene" topology. Averaged across all lineages, the longest ∼2 mt PC genes were sufficient to recreate the "supergene" topology, although this average increased to ∼5 genes for datasets with 40 or more taxa. Furthermore, concatenation of the three "best" performing mt PC genes outperformed that of the three longest mt PC genes (i.e., ND5, COX1, and ND4). Taken together, while not all mt PC genes are equally interchangeable in phylogenetic studies of the metazoans, some subset can serve as a proxy for the 13 mt PC genes. However, the exact number and identity of these genes is specific to the lineage in question and cannot be applied indiscriminately across the Metazoa.

  8. Enrichment of provitamin A content in wheat (Triticum aestivum L.) by introduction of the bacterial carotenoid biosynthetic genes CrtB and CrtI.

    Science.gov (United States)

    Wang, Cheng; Zeng, Jian; Li, Yin; Hu, Wei; Chen, Ling; Miao, Yingjie; Deng, Pengyi; Yuan, Cuihong; Ma, Cheng; Chen, Xi; Zang, Mingli; Wang, Qiong; Li, Kexiu; Chang, Junli; Wang, Yuesheng; Yang, Guangxiao; He, Guangyuan

    2014-06-01

    Carotenoid content is a primary determinant of wheat nutritional value and affects its end-use quality. Wheat grains contain very low carotenoid levels and trace amounts of provitamin A content. In order to enrich the carotenoid content in wheat grains, the bacterial phytoene synthase gene (CrtB) and carotene desaturase gene (CrtI) were transformed into the common wheat cultivar Bobwhite. Expression of CrtB or CrtI alone slightly increased the carotenoid content in the grains of transgenic wheat, while co-expression of both genes resulted in a darker red/yellow grain phenotype, accompanied by a total carotenoid content increase of approximately 8-fold achieving 4.76 μg g⁻¹ of seed dry weight, a β-carotene increase of 65-fold to 3.21 μg g⁻¹ of seed dry weight, and a provitamin A content (sum of α-carotene, β-carotene, and β-cryptoxanthin) increase of 76-fold to 3.82 μg g⁻¹ of seed dry weight. The high provitamin A content in the transgenic wheat was stably inherited over four generations. Quantitative PCR analysis revealed that enhancement of provitamin A content in transgenic wheat was also a result of the highly coordinated regulation of endogenous carotenoid biosynthetic genes, suggesting a metabolic feedback regulation in the wheat carotenoid biosynthetic pathway. These transgenic wheat lines are not only valuable for breeding wheat varieties with nutritional benefits for human health but also for understanding the mechanism regulating carotenoid biosynthesis in wheat endosperm. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  9. A set of genes previously implicated in the hypoxia response might be an important modulator in the rat ear tissue response to mechanical stretch

    Directory of Open Access Journals (Sweden)

    Orgill Dennis

    2007-11-01

    Background: Wounds are increasingly important in our aging societies. Pathologies such as diabetes predispose patients to chronic wounds that can cause pain, infection, and amputation. The vacuum assisted closure device shows remarkable outcomes in wound healing. Its mechanism of action is unclear despite several hypotheses advanced. We previously hypothesized that micromechanical forces can heal wounds. To understand better the biological response of soft tissue to forces, rat ears in vivo were stretched and their gene expression patterns over time obtained. The absolute enrichment (AE) algorithm, which obtains a combined up- and down-regulated picture of the expression analysis, was implemented. Results: With the use of AE, the hypoxia gene set was the most important at a highly significant level. A co-expression network analysis showed that important co-regulated members of the hypoxia pathway include a glucose transporter (slc2a8), heme oxygenase, and nitric oxide synthase 2, among others. Conclusion: It appears that the hypoxia pathway may be an important modulator of response of soft tissue to forces. This finding gives us insights not only into the underlying biology, but also into clinical interventions that could be designed to mimic within wounded tissue the effects of forces without all the negative effects that forces themselves create.

  10. Hydrothermal, biogenic, and seawater components in metalliferous black shales of the Brooks Range, Alaska: Synsedimentary metal enrichment in a carbonate ramp setting

    Science.gov (United States)

    Slack, John F.; Selby, David; Dumoulin, Julie A.

    2015-01-01

    Trace element and Os isotope data for Lisburne Group metalliferous black shales of Middle Mississippian (early Chesterian) age in the Brooks Range of northern Alaska suggest that metals were sourced chiefly from local seawater (including biogenic detritus) but also from externally derived hydrothermal fluids. These black shales are interbedded with phosphorites and limestones in sequences 3 to 35 m thick; deposition occurred mainly on a carbonate ramp during intermittent upwelling under varying redox conditions, from suboxic to anoxic to sulfidic. Deposition of the black shales at ~335 Ma was broadly contemporaneous with sulfide mineralization in the Red Dog and Drenchwater Zn-Pb-Ag deposits, which formed in a distal marginal basin. Relative to the composition of average black shale, the metalliferous black shales (n = 29) display large average enrichment factors (>10) for Zn (10.1), Cd (11.0), and Ag (20.1). Small enrichments (>2– ... seawater. Such moderate enrichments, which are common in other metalliferous black shales, suggest wholly marine sources (seawater and biogenic material) for these metals, given similar trends for enrichment factors in organic-rich sediments of modern upwelling zones on the Namibian, Peruvian, and Chilean shelves. The largest enrichment factors for Zn and Ag are much higher (1.4 × 10⁷ and 2.9 × 10⁷, respectively), consistent with an appreciable hydrothermal component. Other metals such as Cu, Pb, and Tl that are concentrated in several black shale samples, and are locally abundant in the Red Dog and Drenchwater Zn-Pb-Ag deposits, may have a partly hydrothermal origin but this cannot be fully established with the available data. Enrichments in Cr (up to 7.8 × 10⁶) are attributed to marine and not hydrothermal processes. The presence in some samples of large enrichments in Eu (up to 6.1 × 10⁷) relative to modern seawater and of small positive Eu anomalies (Eu/Eu* up to 1.12) are considered unrelated to hydrothermal activity, instead

  11. Comprehensive Analysis of MILE Gene Expression Data Set Advances Discovery of Leukaemia Type and Subtype Biomarkers.

    Science.gov (United States)

    Labaj, Wojciech; Papiez, Anna; Polanski, Andrzej; Polanska, Joanna

    2017-03-01

    Large collections of data in studies on cancer such as leukaemia provoke the necessity of applying tailored analysis algorithms to ensure supreme information extraction. In this work, a custom-fit pipeline is demonstrated for thorough investigation of the voluminous MILE gene expression data set. Three analyses are accomplished, each for gaining a deeper understanding of the processes underlying leukaemia types and subtypes. First, the main disease groups are tested for differential expression against the healthy control as in a standard case-control study. Here, the basic knowledge on molecular mechanisms is confirmed quantitatively and by literature references. Second, pairwise comparison testing is performed for juxtaposing the main leukaemia types among each other. In this case by means of the Dice coefficient similarity measure the general relations are pointed out. Moreover, lists of candidate main leukaemia group biomarkers are proposed. Finally, with this approach being successful, the third analysis provides insight into all of the studied subtypes, followed by the emergence of four leukaemia subtype biomarkers. In addition, the class enhanced DEG signature obtained on the basis of novel pipeline processing leads to significantly better classification power of multi-class data classifiers. The developed methodology consisting of batch effect adjustment, adaptive noise and feature filtration coupled with adequate statistical testing and biomarker definition proves to be an effective approach towards knowledge discovery in high-throughput molecular biology experiments.
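
The pairwise comparison step in the abstract above relies on the Dice coefficient. As a rough illustration only, the sketch below computes the standard Dice similarity, 2|A ∩ B| / (|A| + |B|), between lists of differentially expressed genes. The gene identifiers, the type labels, and the function name are hypothetical placeholders, not taken from the MILE study, and the authors' actual pipeline may differ.

```python
def dice_coefficient(genes_a, genes_b):
    """Dice similarity of two gene lists: 2|A ∩ B| / (|A| + |B|)."""
    a, b = set(genes_a), set(genes_b)
    if not a and not b:
        return 1.0  # convention chosen here: two empty lists count as identical
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical DEG lists, one per leukaemia type (placeholder gene IDs).
deg_lists = {
    "type_A": {"G1", "G2", "G3", "G4"},
    "type_B": {"G3", "G4", "G5"},
    "type_C": {"G6"},
}

# Compare every pair of types once.
types = sorted(deg_lists)
for i, first in enumerate(types):
    for second in types[i + 1:]:
        score = dice_coefficient(deg_lists[first], deg_lists[second])
        print(f"{first} vs {second}: {score:.3f}")
```

A score near 1 indicates that two types share most of their differentially expressed genes, while a score of 0 indicates no overlap.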

  12. Quantitative modeling of gene networks of biological systems using fuzzy Petri nets and fuzzy sets

    Directory of Open Access Journals (Sweden)

    Raed I. Hamed

    2018-01-01

    Quantitative modeling of biological systems has become an essential computational approach in the design of novel biological systems and the analysis of existing ones. However, kinetic data that describe the system's dynamics must be known in order to obtain relevant results with conventional modeling techniques. Such data are often hard or even impossible to obtain. Here, we present a quantitative fuzzy logic modeling approach that can cope with unknown kinetic data and thus produce relevant results even when the kinetic data are incomplete or only vaguely defined. Moreover, the approach can be combined with existing state-of-the-art quantitative modeling techniques and applied only in specific parts of the system, i.e., where the data are missing. The case study of the approach proposed in this paper is performed on a nine-gene model. We propose a kind of fuzzy Petri net (FPN) model based on fuzzy sets to handle the quantitative modeling of biological systems. Tests show that the model is feasible and quite effective for knowledge representation and reasoning in fuzzy expert systems.

  13. The Genome of the Generalist Plant Pathogen Fusarium avenaceum Is Enriched with Genes Involved in Redox, Signaling and Secondary Metabolism

    Science.gov (United States)

    Lysøe, Erik; Harris, Linda J.; Walkowiak, Sean; Subramaniam, Rajagopal; Divon, Hege H.; Riiser, Even S.; Llorens, Carlos; Gabaldón, Toni; Kistler, H. Corby; Jonkers, Wilfried; Kolseth, Anna-Karin; Nielsen, Kristian F.; Thrane, Ulf; Frandsen, Rasmus J. N.

    2014-01-01

    Fusarium avenaceum is a fungus commonly isolated from soil and associated with a wide range of host plants. We present here three genome sequences of F. avenaceum, one isolated from barley in Finland and two from spring and winter wheat in Canada. The sizes of the three genomes range from 41.6–43.1 MB, with 13217–13445 predicted protein-coding genes. Whole-genome analysis showed that the three genomes are highly syntenic, and share >95% gene orthologs. Comparative analysis to other sequenced Fusaria shows that F. avenaceum has a very large potential for producing secondary metabolites, with between 75 and 80 key enzymes belonging to the polyketide, non-ribosomal peptide, terpene, alkaloid and indole-diterpene synthase classes. In addition to known metabolites from F. avenaceum, fuscofusarin and JM-47 were detected for the first time in this species. Many protein families are expanded in F. avenaceum, such as transcription factors, and proteins involved in redox reactions and signal transduction, suggesting evolutionary adaptation to a diverse and cosmopolitan ecology. We found that 20% of all predicted proteins were considered to be secreted, supporting a life in the extracellular space during interaction with plant hosts. PMID:25409087

  14. Cloning and characterization of a novel human zinc finger gene, hKid3, from a C2H2-ZNF enriched human embryonic cDNA library

    International Nuclear Information System (INIS)

    Gao Li; Sun Chong; Qiu Hongling; Liu Hui; Shao Huanjie; Wang Jun; Li Wenxin

    2004-01-01

    To investigate the zinc finger genes involved in human embryonic development, we constructed a C2H2-ZNF enriched human embryonic cDNA library, from which a novel human gene named hKid3 was identified. The hKid3 cDNA encodes a 554 amino acid protein with an amino-terminal KRAB domain and 11 carboxyl-terminal C2H2 zinc finger motifs. Northern blot analysis indicates that two hKid3 transcripts of 6 and 8.5 kb are expressed in human fetal brain and kidney. The 6 kb transcript can also be detected in human adult brain, heart, and skeletal muscle, while the 8.5 kb transcript appears to be embryo-specific. GFP-fused hKid3 protein is localized to nuclei and the ZF domain is necessary and sufficient for nuclear localization. To explore the DNA-binding specificity of hKid3, an oligonucleotide library was selected by a GST fusion protein of the hKid3 ZF domain, and the consensus core sequence 5'-CCAC-3' was evaluated by competitive electrophoretic mobility shift assay. Moreover, the KRAB domain of hKid3 exhibits transcription repressor activity when tested in a GAL4 fusion protein assay. These results indicate that hKid3 may function as a transcription repressor with a regulated expression pattern during human development of brain and kidney.

  15. Mountain pine beetles colonizing historical and naive host trees are associated with a bacterial community highly enriched in genes contributing to terpene metabolism.

    Science.gov (United States)

    Adams, Aaron S; Aylward, Frank O; Adams, Sandye M; Erbilgin, Nadir; Aukema, Brian H; Currie, Cameron R; Suen, Garret; Raffa, Kenneth F

    2013-06-01

    The mountain pine beetle, Dendroctonus ponderosae, is a subcortical herbivore native to western North America that can kill healthy conifers by overcoming host tree defenses, which consist largely of high terpene concentrations. The mechanisms by which these beetles contend with toxic compounds are not well understood. Here, we explore a component of the hypothesis that beetle-associated bacterial symbionts contribute to the ability of D. ponderosae to overcome tree defenses by assisting with terpene detoxification. Such symbionts may facilitate host tree transitions during range expansions currently being driven by climate change. For example, this insect has recently breached the historical geophysical barrier of the Canadian Rocky Mountains, providing access to naïve tree hosts and unprecedented connectivity to eastern forests. We use culture-independent techniques to describe the bacterial community associated with D. ponderosae beetles and their galleries from their historical host, Pinus contorta, and their more recent host, hybrid P. contorta-Pinus banksiana. We show that these communities are enriched with genes involved in terpene degradation compared with other plant biomass-processing microbial communities. These pine beetle microbial communities are dominated by members of the genera Pseudomonas, Rahnella, Serratia, and Burkholderia, and the majority of genes involved in terpene degradation belong to these genera. Our work provides the first metagenome of bacterial communities associated with a bark beetle and is consistent with a potential microbial contribution to detoxification of tree defenses needed to survive the subcortical environment.

  16. The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin.

    Science.gov (United States)

    Brown, David A; Di Cerbo, Vincenzo; Feldmann, Angelika; Ahn, Jaewoo; Ito, Shinsuke; Blackledge, Neil P; Nakayama, Manabu; McClellan, Michael; Dimitrova, Emilia; Turberfield, Anne H; Long, Hannah K; King, Hamish W; Kriaucionis, Skirmantas; Schermelleh, Lothar; Kutateladze, Tatiana G; Koseki, Haruhiko; Klose, Robert J

    2017-09-05

    Chromatin modifications and the promoter-associated epigenome are important for the regulation of gene expression. However, the mechanisms by which chromatin-modifying complexes are targeted to the appropriate gene promoters in vertebrates and how they influence gene expression have remained poorly defined. Here, using a combination of live-cell imaging and functional genomics, we discover that the vertebrate SET1 complex is targeted to actively transcribed gene promoters through CFP1, which engages in a form of multivalent chromatin reading that involves recognition of non-methylated DNA and histone H3 lysine 4 trimethylation (H3K4me3). CFP1 defines SET1 complex occupancy on chromatin, and its multivalent interactions are required for the SET1 complex to place H3K4me3. In the absence of CFP1, gene expression is perturbed, suggesting that normal targeting and function of the SET1 complex are central to creating an appropriately functioning vertebrate promoter-associated epigenome. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  17. The SET1 Complex Selects Actively Transcribed Target Genes via Multivalent Interaction with CpG Island Chromatin

    Directory of Open Access Journals (Sweden)

    David A. Brown

    2017-09-01

    Chromatin modifications and the promoter-associated epigenome are important for the regulation of gene expression. However, the mechanisms by which chromatin-modifying complexes are targeted to the appropriate gene promoters in vertebrates and how they influence gene expression have remained poorly defined. Here, using a combination of live-cell imaging and functional genomics, we discover that the vertebrate SET1 complex is targeted to actively transcribed gene promoters through CFP1, which engages in a form of multivalent chromatin reading that involves recognition of non-methylated DNA and histone H3 lysine 4 trimethylation (H3K4me3). CFP1 defines SET1 complex occupancy on chromatin, and its multivalent interactions are required for the SET1 complex to place H3K4me3. In the absence of CFP1, gene expression is perturbed, suggesting that normal targeting and function of the SET1 complex are central to creating an appropriately functioning vertebrate promoter-associated epigenome.

  18. Genomic determinants of sporulation in Bacilli and Clostridia: towards the minimal set of sporulation-specific genes.

    Science.gov (United States)

    Galperin, Michael Y; Mekhedov, Sergei L; Puigbo, Pere; Smirnov, Sergey; Wolf, Yuri I; Rigden, Daniel J

    2012-11-01

    Three classes of low-G+C Gram-positive bacteria (Firmicutes), Bacilli, Clostridia and Negativicutes, include numerous members that are capable of producing heat-resistant endospores. Spore-forming firmicutes include many environmentally important organisms, such as insect pathogens and cellulose-degrading industrial strains, as well as human pathogens responsible for such diseases as anthrax, botulism, gas gangrene and tetanus. In the best-studied model organism Bacillus subtilis, sporulation involves over 500 genes, many of which are conserved among other bacilli and clostridia. This work aimed to define the genomic requirements for sporulation through an analysis of the presence of sporulation genes in various firmicutes, including those with smaller genomes than B. subtilis. Cultivable spore-formers were found to have genomes larger than 2300 kb and encompass over 2150 protein-coding genes of which 60 are orthologues of genes that are apparently essential for sporulation in B. subtilis. Clostridial spore-formers lack, among others, spoIIB, sda, spoVID and safA genes and have non-orthologous displacements of spoIIQ and spoIVFA, suggesting substantial differences between bacilli and clostridia in the engulfment and spore coat formation steps. Many B. subtilis sporulation genes, particularly those encoding small acid-soluble spore proteins and spore coat proteins, were found only in the family Bacillaceae, or even in a subset of Bacillus spp. Phylogenetic profiles of sporulation genes, compiled in this work, confirm the presence of a common sporulation gene core, but also illuminate the diversity of the sporulation processes within various lineages. These profiles should help further experimental studies of uncharacterized widespread sporulation genes, which would ultimately allow delineation of the minimal set(s) of sporulation-specific genes in Bacilli and Clostridia. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.

  19. Isotope enrichment

    International Nuclear Information System (INIS)

    Garbuny, M.

    1979-01-01

    The invention discloses a method for deriving, from a starting material including an element having a plurality of isotopes, derived material enriched in one isotope of the element. The starting material is deposited on a substrate at less than a critical submonatomic surface density, typically less than 10¹⁶ atoms per square centimeter. The deposit is then selectively irradiated by a laser (maser or electronic oscillator) beam with monochromatic coherent radiation resonant with the one isotope, causing the material including the one isotope to escape from the substrate. The escaping enriched material is then collected. Where the element has two isotopes, one of which is to be collected, the deposit may be irradiated with radiation resonant with the other isotope, and the residual material enriched in the one isotope may be evaporated from the substrate and collected.

  20. Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression

    Directory of Open Access Journals (Sweden)

    Calvo-Dmgz D.

    2012-12-01

    DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. This large amount of gene expression data has been used by researchers seeking diagnosis of diseases like cancer using machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the last decade. This work integrates explicit biological knowledge, provided as gene sets, into the classification process by means of Variable Precision Rough Set Theory (VPRS). The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification which is able to incorporate prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then we apply rough set theory to select the most promising supergenes and to derive a set of easily interpretable classification rules. The proposed model is evaluated over three breast cancer microarray datasets, obtaining successful results compared to classical classification techniques. The experimental results show that there are no significant differences between our model and classical techniques, but that our model is able to provide a biologically interpretable explanation of how it classifies new samples.

  1. Platform dependence of inference on gene-wise and gene-set involvement in human lung development

    Directory of Open Access Journals (Sweden)

    Kho Alvin T

    2009-06-01

    Background: With the recent development of microarray technologies, the comparability of gene expression data obtained from different platforms poses an important problem. We evaluated two widely used platforms, Affymetrix U133 Plus 2.0 and the Illumina HumanRef-8 v2 Expression Bead Chips, for comparability in a biological system in which changes may be subtle, namely fetal lung tissue as a function of gestational age. Results: We performed the comparison via sequence-based probe matching between the two platforms. "Significance grouping" was defined as a measure of comparability. Using both expression correlation and significance grouping as measures of comparability, we demonstrated that despite overall cross-platform differences at the single gene level, increased correlation between the two platforms was found in genes with higher expression level, higher probe overlap, and lower p-value. We also demonstrated that biological function as determined via KEGG pathways or GO categories is more consistent across platforms than single gene analysis. Conclusion: We conclude that while the comparability of the platforms at the single gene level may be increased by increasing sample size, they are highly comparable ontologically even for subtle differences in a relatively small sample size. Biologically relevant inference should therefore be reproducible across laboratories using different platforms.

  2. Hydrothermal, biogenic, and seawater components in metalliferous black shales of the Brooks Range, Alaska: Synsedimentary metal enrichment in a carbonate ramp setting

    Science.gov (United States)

    Slack, John F.; Selby, David; Dumoulin, Julie A.

    2015-01-01

    Trace element and Os isotope data for Lisburne Group metalliferous black shales of Middle Mississippian (early Chesterian) age in the Brooks Range of northern Alaska suggest that metals were sourced chiefly from local seawater (including biogenic detritus) but also from externally derived hydrothermal fluids. These black shales are interbedded with phosphorites and limestones in sequences 3 to 35 m thick; deposition occurred mainly on a carbonate ramp during intermittent upwelling under varying redox conditions, from suboxic to anoxic to sulfidic. Deposition of the black shales at ~335 Ma was broadly contemporaneous with sulfide mineralization in the Red Dog and Drenchwater Zn-Pb-Ag deposits, which formed in a distal marginal basin. Relative to the composition of average black shale, the metalliferous black shales (n = 29) display large average enrichment factors (>10) for Zn (10.1), Cd (11.0), and Ag (20.1). Small enrichments (>2– ... rare earth elements except Ce, Nd, and Sm. A detailed stratigraphic profile over 23 m in the Skimo Creek area (central Brooks Range) indicates that samples from at and near the top of the section, which accumulated during a period of major upwelling and is broadly correlative with the stratigraphic levels of the Red Dog and Drenchwater Zn-Pb-Ag deposits, have the highest Zn/TOC (total organic carbon), Cu/TOC, and Tl/TOC ratios for calculated marine fractions (no detrital component) of these three metals. Average authigenic (detrital-free) contents of Mo, V, U, Ni, Cu, Cd, Pb, Ge, Re, Se, As, Sb, Tl, Pd, and Au show enrichment factors of 4.3 × 10³ to 1.2 × 10⁶ relative to modern seawater. Such moderate enrichments, which are common in other metalliferous black shales, suggest wholly marine sources (seawater and biogenic material) for these metals, given similar trends for enrichment factors in organic-rich sediments of modern upwelling zones on the Namibian, Peruvian, and Chilean shelves. The largest enrichment factors for Zn and Ag

  3. Determining Semantically Related Significant Genes.

    Science.gov (United States)

    Taha, Kamal

    2014-01-01

    GO relations embody some aspects of existence dependency. If GO term x is existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of GO term y are usually functionally and semantically related to the genes annotated with the function of GO term x. A large number of gene set enrichment analysis methods have been developed in recent years. However, most of these methods overlook the structural dependencies between GO terms in the GO graph because they do not consider the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term x cannot be existence-dependent on GO term y if x and y have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiments to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.

  4. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Pugalendhi Ganesh Kumar

    This study describes a novel approach to reducing the challenges of highly nonlinear multiclass gene expression values for cancer diagnosis. To build a fruitful system for cancer diagnosis, in this study, we introduced two levels of gene selection, filtering and embedding, for selection of potential genes and the most relevant genes associated with cancer, respectively. The filter procedure was implemented by developing a fuzzy rough set (FR)-based method for redefining the criterion function of f-information (FI) to identify the potential genes without discretizing the continuous gene expression values. The embedded procedure is implemented by means of a water swirl algorithm (WSA), which attempts to optimize the rule set and membership function required to classify samples using a fuzzy-rule-based multiclassification system (FRBMS). Two novel update equations are proposed in WSA, which have better exploration and exploitation abilities while designing a self-learning FRBMS. The efficiency of our new approach was evaluated on 13 multicategory and 9 binary datasets of cancer gene expression. Additionally, the performance of the proposed FRFI-WSA method in designing an FRBMS was compared with existing methods for gene selection and optimization such as genetic algorithm (GA), particle swarm optimization (PSO), and artificial bee colony algorithm (ABC) on all the datasets. In the global cancer map with repeated measurements (GCM_RM) dataset, the FRFI-WSA showed the smallest number of 16 most relevant genes associated with cancer using a minimal number of 26 compact rules with the highest classification accuracy (96.45%). In addition, the statistical validation used in this study revealed that the biological relevance of the most relevant genes associated with cancer and their linguistics detected by the proposed FRFI-WSA approach are better than those in the other methods. The simple interpretable rules with most relevant genes and effectively

  5. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.

    2013-01-01

    Identification of bacterial modulons from series of gene expression measurements on microarrays is a principal problem, especially relevant for inadequately studied but practically important species. Usage of a priori information on regulatory interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set of genes essential for a regulon is used to control modulon updating. Essential genes for a regulon were selected as a subset of regulon genes highly related by different measures to each other. Using Escherichia coli as a model, we studied how modulon identification depends on the data, including the microarray experiments set, the adopted relevance measure and the regulon itself. We have found that results of modulon identification are highly dependent on all parameters studied and thus the resulting modulon varies substantially depending on the identification procedure. Yet, modulons that were identified correctly displayed higher stability during iterations, which allows developing a procedure for reliable modulon identification in the case of less studied species where the known regulatory interactions are sparse. Copyright © 2013 Taylor & Francis.
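
The iterative procedure described in the abstract above (a seed regulon grown with similarly expressed genes, with essential genes controlling the update) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not specify the relevance measure or the exact control rule, so the Pearson correlation against the mean profile, the 0.8 threshold, the rule "reject any update that drops an essential gene", and all names and toy values are hypothetical choices, not the authors' method.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def mean_profile(expr, genes):
    """Average expression profile over the given genes."""
    columns = zip(*(expr[g] for g in genes))
    return [sum(col) / len(col) for col in columns]


def grow_modulon(expr, seed, essential, threshold=0.8, max_iter=20):
    """Iteratively update a seed gene set with genes correlated to its mean profile.

    Assumed control rule: an update is rejected if it would drop an essential gene.
    """
    modulon = set(seed)
    for _ in range(max_iter):
        reference = mean_profile(expr, sorted(modulon))
        updated = {g for g, profile in expr.items()
                   if pearson(profile, reference) >= threshold}
        if not set(essential) <= updated:
            break  # update would lose an essential gene; keep the current set
        if updated == modulon:
            break  # converged
        modulon = updated
    return modulon


# Toy expression matrix: gene -> profile across four hypothetical experiments.
toy_expr = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.5],
    "g3": [1.1, 2.1, 2.9, 4.2],   # co-expressed with the seed
    "g4": [4.0, 3.0, 2.0, 1.0],   # anti-correlated
    "g5": [1.0, 3.0, 1.0, 3.0],   # weakly related
}
print(sorted(grow_modulon(toy_expr, seed={"g1", "g2"}, essential={"g1"})))
```

How many iterations the set takes to stop changing is one simple way to look at the stability that the abstract reports for correctly identified modulons.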

  6. Differentially Expressed Genes in Endometrium and Corpus Luteum of Holstein Cows Selected for High and Low Fertility Are Enriched for Sequence Variants Associated with Fertility.

    Science.gov (United States)

    Moore, Stephen G; Pryce, Jennie E; Hayes, Ben J; Chamberlain, Amanda J; Kemper, Kathryn E; Berry, Donagh P; McCabe, Matt; Cormican, Paul; Lonergan, Pat; Fair, Trudee; Butler, Stephen T

    2016-01-01

    Despite the importance of fertility in humans and livestock, there has been little success dissecting the genetic basis of fertility. Our hypothesis was that genes differentially expressed in the endometrium and corpus luteum on Day 13 of the estrous cycle between cows with either good or poor genetic merit for fertility would be enriched for genetic variants associated with fertility. We combined a unique genetic model of fertility (cattle that have been selected for high and low fertility and show substantial difference in fertility) with gene expression data from these cattle and genome-wide association study (GWAS) results in ∼20,000 cattle to identify quantitative trait loci (QTL) regions and sequence variants associated with genetic variation in fertility. Two hundred and forty-five QTL regions and 17 sequence variants associated primarily with prostaglandin F2alpha, steroidogenesis, mRNA processing, energy status, and immune-related processes were identified. Ninety-three of the QTL regions were validated by two independent GWAS, with signals for fertility detected primarily on chromosomes 18, 5, 7, 8, and 29. Plausible causative mutations were identified, including one missense variant significantly associated with fertility and predicted to affect the protein function of EIF4EBP3. The results of this study enhance our understanding of 1) the contribution of the endometrium and corpus luteum transcriptome to phenotypic fertility differences and 2) the genetic architecture of fertility in dairy cattle. Including these variants in predictions of genomic breeding values may improve the rate of genetic gain for this critical trait. © 2016 by the Society for the Study of Reproduction, Inc.

  7. Liver-Enriched Gene 1, a Glycosylated Secretory Protein, Binds to FGFR and Mediates an Anti-stress Pathway to Protect Liver Development in Zebrafish.

    Directory of Open Access Journals (Sweden)

    Minjie Hu

    2016-02-01

    Unlike mammals and birds, teleost fish undergo external embryogenesis, and therefore their embryos are constantly challenged by stresses from their living environment. These stresses, when they become too harsh, cause arrest of cell proliferation, abnormal cell death or senescence. Such organisms have had to evolve a sophisticated anti-stress mechanism to protect the process of embryogenesis/organogenesis. However, very few signaling molecule(s) mediating such activity have been identified. liver-enriched gene 1 (leg1) is an uncharacterized gene that encodes a novel secretory protein containing a single domain, DUF781 (domain of unknown function 781), that is well conserved in vertebrates. In the zebrafish genome, there are two copies of leg1, namely leg1a and leg1b. leg1a and leg1b are closely linked on chromosome 20 and share high homology, but are differentially expressed. In this report, we generated two leg1a mutant alleles using the TALEN technique, then characterized liver development in the mutants. We show that a leg1a mutant exhibits a stress-dependent small liver phenotype that can be prevented by chemicals blocking the production of reactive oxygen species. Further studies reveal that Leg1a binds to FGFR3 and mediates a novel anti-stress pathway to protect liver development through enhancing Erk activity. More importantly, we show that the binding of Leg1a to FGFR relies on the glycosylation at the 70th asparagine (Asn70 or N70), and that mutating Asn70 to Ala70 compromised Leg1's function in liver development. Therefore, Leg1 plays a unique role in protecting liver development under different stress conditions by serving as a secreted signaling molecule/modulator.

  8. Effects of long-term environmental enrichment on anxiety, memory, hippocampal plasticity and overall brain gene expression in C57BL6 mice

    Directory of Open Access Journals (Sweden)

    Melanie Hüttenrauch

    2016-08-01

    There is ample evidence that physical activity exerts positive effects on a variety of brain functions by facilitating neuroprotective processes and influencing neuroplasticity. Accordingly, numerous studies have shown that continuous exercise can successfully diminish or prevent the pathology of neurodegenerative diseases such as Alzheimer’s disease in transgenic mouse models. However, the long-term effect of physical activity on brain health of aging wildtype (WT) mice has not yet been studied in detail. Here, we show that prolonged physical and cognitive stimulation, mediated by an enriched environment (EE) paradigm for a duration of eleven months, leads to reduced anxiety and improved spatial reference memory in C57BL6 WT mice. While the number of CA1 pyramidal neurons remained unchanged between standard housed (SH) and EE mice, the number of dentate gyrus (DG) neurons, as well as the CA1 and DG volume, were significantly increased in EE mice. A whole-brain deep sequencing transcriptome analysis, carried out to better understand the molecular mechanisms underlying the observed effects, revealed an up-regulation of a variety of genes upon EE, mainly associated with synaptic plasticity and transcription regulation. The present findings corroborate the impact of continuous physical activity as a potential prospective route in the prevention of age-related cognitive decline and neurodegenerative disorders.

  9. Uranium enrichment

    International Nuclear Information System (INIS)

    1991-08-01

    This paper reports that in 1990 the Department of Energy began a two-year project to illustrate the technical and economic feasibility of a new uranium enrichment technology, the atomic vapor laser isotope separation (AVLIS) process. GAO believes that completing the AVLIS demonstration project will provide valuable information about the technical viability and cost of building an AVLIS plant and will keep future plant construction options open. However, Congress should be aware that DOE still needs to adequately demonstrate AVLIS with full-scale equipment and develop convincing cost projections. Program activities, such as the plant-licensing process, that must be completed before a plant is built, could take many years. Further, an updated and expanded uranium enrichment analysis will be needed before any decision is made about building an AVLIS plant. GAO, which has long supported legislation that would restructure DOE's uranium enrichment program as a government corporation, encourages DOE's goal of transferring AVLIS to the corporation. This could reduce the government's financial risk and help ensure that the decision to build an AVLIS plant is based on commercial concerns. DOE, however, has no alternative plans should the government corporation not be formed. Further, by curtailing a planned public access program, which would have given private firms an opportunity to learn about the technology during the demonstration project, DOE may limit its ability to transfer AVLIS to the private sector.

  10. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses

    OpenAIRE

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-01-01

    Background: Female moths synthesize species-specific sex pheromone components and release them to attract male moths, which depend on a precise sex pheromone chemosensory system to locate females. Two types of genes involved in the sex pheromone biosynthesis and degradation pathways play essential roles in this important moth behavior. To understand the function of genes in the sex pheromone pathway, this study investigated the genome-wide and digital gene expression of sex pheromone biosynthesi...

  11. Gene-set analysis based on the pharmacological profiles of drugs to identify repurposing opportunities in schizophrenia.

    Science.gov (United States)

    de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome

    2016-08-01

    Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected pneratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.

  12. Identification and Validation of a New Set of Five Genes for Prediction of Risk in Early Breast Cancer

    Directory of Open Access Journals (Sweden)

    Giorgio Mustacchi

    2013-05-01

    Molecular tests predicting the outcome of breast cancer patients based on gene expression levels can be used to assist in making treatment decisions after consideration of conventional markers. In this study we identified a subset of 20 mRNAs differentially regulated in breast cancer by analyzing several publicly available array gene expression data sets using R/Bioconductor packages. Using RT-qPCR we evaluated 261 consecutive invasive breast cancer cases, not selected for age, adjuvant treatment, nodal and estrogen receptor status, from paraffin embedded sections. The biological samples dataset was split into a training set (137 cases) and a validation set (124 cases). The gene signature was developed on the training set, and a multivariate stepwise Cox analysis selected five genes independently associated with disease-free survival (DFS): FGF18 (HR = 1.13, p = 0.05), BCL2 (HR = 0.57, p = 0.001), PRC1 (HR = 1.51, p = 0.001), MMP9 (HR = 1.11, p = 0.08) and SERF1A (HR = 0.83, p = 0.007). These five genes were combined into a linear score (signature) weighted according to the coefficients of the Cox model, as: 0.125FGF18 − 0.560BCL2 + 0.409PRC1 + 0.104MMP9 − 0.188SERF1A (HR = 2.7, 95% CI = 1.9–4.0, p < 0.001). The signature was then evaluated on the validation set, assessing the discrimination ability by a Kaplan-Meier analysis using the same cut-offs, classifying patients at low, intermediate or high risk of disease relapse, as defined on the training set (p < 0.001). Our signature, after further clinical validation, could be proposed as a prognostic signature for disease-free survival in breast cancer patients where the indication for adjuvant chemotherapy added to endocrine treatment is uncertain.
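    The weighted linear score quoted in the abstract above can be sketched in a few lines. This is only an illustration under stated assumptions: the coefficients are the ones given in the abstract, but the expression scale (here an arbitrary normalized value per gene) and the two risk cut-offs are hypothetical, since the abstract does not report them.

```python
# Cox-model coefficients as quoted in the abstract.
COEFFICIENTS = {
    "FGF18": 0.125,
    "BCL2": -0.560,
    "PRC1": 0.409,
    "MMP9": 0.104,
    "SERF1A": -0.188,
}


def signature_score(expression):
    """Return the weighted linear score for one patient.

    `expression` maps gene name -> normalized expression value.
    The normalization scale is an assumption; the abstract does not state it.
    """
    missing = sorted(set(COEFFICIENTS) - set(expression))
    if missing:
        raise KeyError(f"missing expression values for: {missing}")
    return sum(coef * expression[gene] for gene, coef in COEFFICIENTS.items())


def risk_group(score, low_cut, high_cut):
    """Assign low / intermediate / high risk.

    `low_cut` and `high_cut` are hypothetical parameters: the study defined
    its cut-offs on the training set, but their values are not in the abstract.
    """
    if low_cut > high_cut:
        raise ValueError("low_cut must not exceed high_cut")
    if score < low_cut:
        return "low"
    if score < high_cut:
        return "intermediate"
    return "high"
```

    A higher score corresponds to higher predicted relapse risk because genes with hazard ratios above 1 carry positive weights and those below 1 carry negative weights.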

  13. Monte Carlo analyses of TRX slightly enriched uranium-H2O critical experiments with ENDF/B-IV and related data sets (AWBA Development Program)

    International Nuclear Information System (INIS)

    Hardy, J. Jr.

    1977-12-01

    Four H2O-moderated, slightly-enriched-uranium critical experiments were analyzed by Monte Carlo methods with ENDF/B-IV data. These were simple metal-rod lattices comprising Cross Section Evaluation Working Group (CSEWG) thermal reactor benchmarks TRX-1 through TRX-4. Generally good agreement with experiment was obtained for calculated integral parameters: the epi-thermal/thermal ratio of U238 capture (rho28) and of U235 fission (delta25), the ratio of U238 capture to U235 fission (CR*), and the ratio of U238 fission to U235 fission (delta28). Full-core Monte Carlo calculations for two lattices showed good agreement with cell Monte Carlo-plus-multigroup P_l leakage corrections. Newly measured parameters for the low energy resonances of U238 significantly improved rho28. In comparison with other CSEWG analyses, the strong correlation between k_eff and rho28 suggests that U238 resonance capture is the major problem encountered in analyzing these lattices.

  14. The PR/SET Domain Zinc Finger Protein Prdm4 Regulates Gene Expression in Embryonic Stem Cells but Plays a Nonessential Role in the Developing Mouse Embryo

    Science.gov (United States)

    Bogani, Debora; Morgan, Marc A. J.; Nelson, Andrew C.; Costello, Ita; McGouran, Joanna F.; Kessler, Benedikt M.

    2013-01-01

    Prdm4 is a highly conserved member of the Prdm family of PR/SET domain zinc finger proteins. Many well-studied Prdm family members play critical roles in development and display striking loss-of-function phenotypes. Prdm4 functional contributions have yet to be characterized. Here, we describe its widespread expression in the early embryo and adult tissues. We demonstrate that DNA binding is exclusively mediated by the Prdm4 zinc finger domain, and we characterize its tripartite consensus sequence via SELEX (systematic evolution of ligands by exponential enrichment) and ChIP-seq (chromatin immunoprecipitation-sequencing) experiments. In embryonic stem cells (ESCs), Prdm4 regulates key pluripotency and differentiation pathways. Two independent strategies, namely, targeted deletion of the zinc finger domain and generation of a EUCOMM LacZ reporter allele, resulted in functional null alleles. However, homozygous mutant embryos develop normally and adults are healthy and fertile. Collectively, these results strongly suggest that Prdm4 functions redundantly with other transcriptional partners to cooperatively regulate gene expression in the embryo and adult animal. PMID:23918801

  15. Identification of a set of endogenous reference genes for miRNA expression studies in Parkinson's disease blood samples.

    Science.gov (United States)

    Serafin, Alice; Foco, Luisa; Blankenburg, Hagen; Picard, Anne; Zanigni, Stefano; Zanon, Alessandra; Pramstaller, Peter P; Hicks, Andrew A; Schwienbacher, Christine

    2014-10-10

    Research on microRNAs (miRNAs) is becoming an increasingly attractive field, as these small RNA molecules are involved in several physiological functions and diseases. To date, only a few studies have assessed the expression of blood miRNAs related to Parkinson's disease (PD) using microarray and quantitative real-time PCR (qRT-PCR). Measuring miRNA expression involves normalization of qRT-PCR data using endogenous reference genes for calibration, but their choice remains a delicate problem with serious impact on the resulting expression levels. The aim of the present study was to evaluate the suitability of a set of commonly used small RNAs as normalizers and to identify which of these might be considered reliable reference genes in qRT-PCR expression analyses on PD blood samples. The commonly used reference genes snoRNA RNU24, snRNA RNU6B, snoRNA Z30 and miR-103a-3p were selected from the literature. We then analyzed the effect of using these genes as reference, alone or in any possible combination, on the measured expression levels of the target genes miR-30b-5p and miR-29a-3p, which have been previously reported to be deregulated in PD blood samples. We identified RNU24 and Z30 as a reliable and stable pair of reference genes in PD blood samples.
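    To make concrete why the choice of reference genes changes the measured expression level, here is a minimal sketch of reference-gene normalization using the widely used 2^-ΔΔCt calculation. It is an assumption that this particular calculation matches the study's pipeline; the abstract only says that qRT-PCR data are normalized against endogenous reference genes. The sketch also assumes roughly 100% amplification efficiency, and all Ct values shown in the usage note are invented for illustration.

```python
from statistics import mean


def delta_ct(target_ct, reference_cts):
    """Delta-Ct: target Ct minus the mean Ct of the chosen reference genes.

    Averaging Ct values corresponds to taking the geometric mean of the
    underlying reference quantities.
    """
    if not reference_cts:
        raise ValueError("at least one reference gene Ct is required")
    return target_ct - mean(reference_cts)


def fold_change(target_case, refs_case, target_control, refs_control):
    """Relative expression (case vs. control) by the 2^-ddCt calculation.

    Assumes about 100% amplification efficiency for every assay.
    """
    ddct = delta_ct(target_case, refs_case) - delta_ct(target_control, refs_control)
    return 2.0 ** (-ddct)
```

    For example, with hypothetical Ct values, `fold_change(25, [20, 22], 26, [20, 22])` normalizes a target against a pair of reference genes such as RNU24 and Z30; swapping in a reference gene whose Ct itself shifts between groups would change the result, which is the problem the abstract describes.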

  16. A set of vectors for introduction of antibiotic resistance genes by in vitro Cre-mediated recombination

    Directory of Open Access Journals (Sweden)

    Vassetzky Yegor S

    2008-12-01

    Background: Introduction of new antibiotic resistance genes into plasmids of interest is a frequent task in molecular cloning practice. Classical approaches involving digestion with restriction endonucleases and ligation are time-consuming. Findings: We have created a set of insertion vectors (pINS) carrying genes that provide resistance to various antibiotics (puromycin, blasticidin and G418) and containing a loxP site. Each vector (pINS-Puro, pINS-Blast or pINS-Neo) contains either a chloramphenicol or a kanamycin resistance gene and is unable to replicate in most E. coli strains as it contains a conditional R6Kγ replication origin. Introduction of the antibiotic resistance genes into the vector of interest is achieved by Cre-mediated recombination between the replication-incompetent pINS and a replication-competent target vector. The recombination mix is then transformed into E. coli and selected by the resistance marker (kanamycin or chloramphenicol) present in pINS, which allows recovery of the recombinant plasmids with 100% efficiency. Conclusion: Here we propose a simple strategy that allows the introduction of various antibiotic-resistance genes into any plasmid containing a replication origin, an ampicillin resistance gene and a loxP site.

  17. Pathway enrichment analysis approach based on topological structure and updated annotation of pathway.

    Science.gov (United States)

    Yang, Qian; Wang, Shuyuan; Dai, Enyu; Zhou, Shunheng; Liu, Dianming; Liu, Haizhou; Meng, Qianqian; Jiang, Bin; Jiang, Wei

    2017-08-16

    Pathway enrichment analysis has been widely used to identify cancer risk pathways, and contributes to elucidating the mechanism of tumorigenesis. However, most of the existing approaches use outdated pathway information and neglect the complex gene interactions within pathways. Here, we first briefly reviewed the existing widely used pathway enrichment analysis approaches, and then proposed a novel topology-based pathway enrichment analysis (TPEA) method, which integrates topological properties and global upstream/downstream positions of genes in pathways. We compared TPEA with four widely used pathway enrichment analysis tools, namely database for annotation, visualization and integrated discovery (DAVID), gene set enrichment analysis (GSEA), centrality-based pathway enrichment (CePa) and signaling pathway impact analysis (SPIA), by analyzing six gene expression profiles of three tumor types (colorectal cancer, thyroid cancer and endometrial cancer). As a result, we identified several well-known cancer risk pathways that could not be obtained by the existing tools, and the results of TPEA were more stable than those of the other tools in analyzing different data sets of the same cancer. Ultimately, we developed an R package to implement TPEA, which can update KEGG pathway information online and is available at the Comprehensive R Archive Network (CRAN): https://cran.r-project.org/web/packages/TPEA/. © The Author 2017. Published by Oxford University Press. All rights reserved.

  18. Genome-wide methylation analysis identifies a core set of hypermethylated genes in CIMP-H colorectal cancer.

    Science.gov (United States)

    McInnes, Tyler; Zou, Donghui; Rao, Dasari S; Munro, Francesca M; Phillips, Vicky L; McCall, John L; Black, Michael A; Reeve, Anthony E; Guilford, Parry J

    2017-03-28

    Aberrant DNA methylation profiles are a characteristic of all known cancer types, epitomized by the CpG island methylator phenotype (CIMP) in colorectal cancer (CRC). Hypermethylation has been observed at CpG islands throughout the genome, but it is unclear which factors determine whether an individual island becomes methylated in cancer. DNA methylation in CRC was analysed using the Illumina HumanMethylation450K array. Differentially methylated loci were identified using Significance Analysis of Microarrays (SAM) and the Wilcoxon Signed Rank (WSR) test. Unsupervised hierarchical clustering was used to identify methylation subtypes in CRC. In this study we characterized the DNA methylation profiles of 94 CRC tissues and their matched normal counterparts. Consistent with previous studies, unsupervised hierarchical clustering of genome-wide methylation data identified three subtypes within the tumour samples, designated CIMP-H, CIMP-L and CIMP-N, that showed high, low and very low methylation levels, respectively. Differential methylation between normal and tumour samples was analysed at the individual CpG level, and at the gene level. The distribution of hypermethylation in CIMP-N tumours showed high inter-tumour variability and appeared to be highly stochastic in nature, whereas CIMP-H tumours exhibited consistent hypermethylation at a subset of genes, in addition to a highly variable background of hypermethylated genes. EYA4, TFPI2 and TLX1 were hypermethylated in more than 90% of all tumours examined. One hundred and thirty-two genes were hypermethylated in 100% of CIMP-H tumours studied, and these were highly enriched for functions relating to skeletal system development (Bonferroni adjusted p value = 2.88E-15), segment specification (adjusted p value = 9.62E-11), embryonic development (adjusted p value = 1.52E-04), mesoderm development (adjusted p value = 1.14E-20), and ectoderm development (adjusted p value = 7.94E-16). Our genome-wide characterization of DNA

  19. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells

    Directory of Open Access Journals (Sweden)

    Monticone Massimiliano

    2012-08-01

    Background: Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma) experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logical goal to impair the killing capacity of GBM cells by molecular targeting. We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type) or highly diffuse single tumor cell infiltration (HD-type). Methods: We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM). Real time RT-PCR, immunofluorescence and immunoblot analyses were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Results: Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode proteins localized to the plasma membrane and 9 of these encode proteins involved in the process of cell adhesion. Conclusions: This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should thus be susceptible to an inhibition strategy by specific targeting. Massimiliano Monticone and Antonio Daga contributed equally to this work.

  20. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells.

    Science.gov (United States)

    Monticone, Massimiliano; Daga, Antonio; Candiani, Simona; Romeo, Francesco; Mirisola, Valentina; Viaggi, Silvia; Melloni, Ilaria; Pedemonte, Simona; Zona, Gianluigi; Giaretti, Walter; Pfeffer, Ulrich; Castagnola, Patrizio

    2012-08-17

    Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma) experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logical goal to impair the killing capacity of GBM cells by molecular targeting. We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type) or highly diffuse single tumor cell infiltration (HD-type). We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM). Real time RT-PCR, immunofluorescence and immunoblot analyses were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode proteins localized to the plasma membrane and 9 of these encode proteins involved in the process of cell adhesion. This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should thus be susceptible to an inhibition strategy by specific targeting. Massimiliano Monticone and Antonio Daga contributed equally to this work.

  1. Identification of a novel set of genes reflecting different in vivo invasive patterns of human GBM cells

    International Nuclear Information System (INIS)

    Monticone, Massimiliano; Giaretti, Walter; Pfeffer, Ulrich; Daga, Antonio; Candiani, Simona; Romeo, Francesco; Mirisola, Valentina; Viaggi, Silvia; Melloni, Ilaria; Pedemonte, Simona; Zona, Gianluigi

    2012-01-01

    Most patients affected by Glioblastoma multiforme (GBM, grade IV glioma) experience a recurrence of the disease because of the spreading of tumor cells beyond surgical boundaries. Unveiling mechanisms causing this process is a logical goal to impair the killing capacity of GBM cells by molecular targeting. We noticed that our long-term GBM cultures, established from different patients, may display two categories/types of growth behavior in an orthotopic xenograft model: expansion of the tumor mass and formation of tumor branches/nodules (nodular like, NL-type) or highly diffuse single tumor cell infiltration (HD-type). We determined by DNA microarrays the gene expression profiles of three NL-type and three HD-type long-term GBM cultures. Subsequently, individual genes with different expression levels between the two groups were identified using Significance Analysis of Microarrays (SAM). Real time RT-PCR, immunofluorescence and immunoblot analyses were performed for a selected subgroup of regulated gene products to confirm the results obtained by the expression analysis. Here, we report the identification of a set of 34 differentially expressed genes in the two types of GBM cultures. Twenty-three of these genes encode proteins localized to the plasma membrane and 9 of these encode proteins involved in the process of cell adhesion. This study suggests the participation in the diffuse infiltrative/invasive process of GBM cells within the CNS of a novel set of genes coding for membrane-associated proteins, which should thus be susceptible to an inhibition strategy by specific targeting. Massimiliano Monticone and Antonio Daga contributed equally to this work.

  2. Genetic investigation of 100 heart genes in sudden unexplained death victims in a forensic setting

    DEFF Research Database (Denmark)

    Christiansen, Sofie Lindgren; Hertz, Christin Løth; Ferrero, Laura

    2016-01-01

    indicate that broad genetic investigation of SUD victims increases the diagnostic outcome, and the investigation should comprise genes involved in both cardiomyopathies and cardiac channelopathies. European Journal of Human Genetics advance online publication, 21 September 2016; doi:10.1038/ejhg.2016.118....

  3. Comparative genomics identification of a novel set of temporally regulated hedgehog target genes in the retina.

    Science.gov (United States)

    McNeill, Brian; Perez-Iratxeta, Carol; Mazerolle, Chantal; Furimsky, Marosh; Mishina, Yuji; Andrade-Navarro, Miguel A; Wallace, Valerie A

    2012-03-01

    The hedgehog (Hh) signaling pathway is involved in numerous developmental and adult processes with many links to cancer. In vertebrates, the activity of the Hh pathway is mediated primarily through three Gli transcription factors (Gli1, 2 and 3) that can serve as transcriptional activators or repressors. The identification of Gli target genes is essential for the understanding of the Hh-mediated processes. We used a comparative genomics approach using the mouse and human genomes to identify 390 genes that contained conserved Gli binding sites. RT-qPCR validation of 46 target genes in E14.5 and P0.5 retinal explants revealed that Hh pathway activation resulted in the modulation of 30 of these targets, 25 of which demonstrated a temporal regulation. Further validation revealed that the expression of Bok, FoxA1, Sox8 and Wnt7a was dependent upon Sonic Hh (Shh) signaling in the retina and their regulation is under positive and negative controls by Gli2 and Gli3, respectively. We also show using chromatin immunoprecipitation that Gli2 binds to the Sox8 promoter, suggesting that Sox8 is an Hh-dependent direct target of Gli2. Finally, we demonstrate that the Hh pathway also modulates the expression of Sox9 and Sox10, which together with Sox8 make up the SoxE group. Previously, it has been shown that Hh and SoxE group genes promote Müller glial cell development in the retina. Our data are consistent with the possibility for a role of SoxE group genes downstream of Hh signaling on Müller cell development. Crown Copyright © 2012. Published by Elsevier Inc. All rights reserved.

  4. The imprinted brain: how genes set the balance between autism and psychosis.

    Science.gov (United States)

    Badcock, Christopher

    2011-06-01

    The imprinted brain theory proposes that autism spectrum disorder (ASD) represents a paternal bias in the expression of imprinted genes. This is reflected in a preference for mechanistic cognition and in the corresponding mentalistic deficits symptomatic of ASD. Psychotic spectrum disorder (PSD) would correspondingly result from an imbalance in favor of maternal and/or X-chromosome gene expression. If differences in gene expression were reflected locally in the human brain as mouse models and other evidence suggests they are, ASD would represent not so much an 'extreme male brain' as an extreme paternal one, with PSD correspondingly representing an extreme maternal brain. To the extent that copy number variation resembles imprinting and aneuploidy in nullifying or multiplying the expression of particular genes, it has been found to conform to the diametric model of mental illness peculiar to the imprinted brain theory. The fact that nongenetic factors such as nutrition in pregnancy can mimic and/or interact with imprinted gene expression suggests that the theory might even be able to explain the notable effect of maternal starvation on the risk of PSD - not to mention the 'autism epidemic' of modern affluent societies. Finally, the theory suggests that normality represents balanced cognition, and that genius is an extraordinary extension of cognitive configuration in both mentalistic and mechanistic directions. Were it to be proven correct, the imprinted brain theory would represent one of the biggest single advances in our understanding of the mind and of mental illness that has ever taken place, and would revolutionize psychiatric diagnosis, prevention and treatment - not to mention our understanding of epigenomics.

  5. Serum 25-Hydroxyvitamin D Levels, phosphoprotein enriched in diabetes gene product (PED/PEA-15) and leptin-to-adiponectin ratio in women with PCOS.

    Science.gov (United States)

    Savastano, Silvia; Valentino, Rossella; Di Somma, Carolina; Orio, Francesco; Pivonello, Claudia; Passaretti, Federica; Brancato, Valentina; Formisano, Pietro; Colao, Annamaria; Beguinot, Francesco; Tarantino, Giovanni

    2011-11-23

    Polycystic ovary syndrome (PCOS) is frequently associated with hypovitaminosis D. Vitamin D is endowed with pleiotropic effects, including effects on insulin resistance (IR) and apoptotic pathways. Disruption of the complex mechanism that regulates ovarian apoptosis has been reported in PCOS. Phosphoprotein enriched in diabetes gene product (PED/PEA-15), an anti-apoptotic protein involved in type 2 diabetes mellitus (T2DM), is overexpressed in PCOS women, independently of obesity. Leptin-to-adiponectin ratio (L/A) is a biomarker of IR and low-grade inflammation in PCOS. The aim of the study was to investigate the levels of 25-hydroxy vitamin D (25(OH)D) and L/A, in association with PED/PEA-15 protein abundance, in both lean and overweight/obese (o/o) women with PCOS. PED/PEA-15 protein abundance and circulating levels of 25(OH)D, L/A, sex hormone-binding globulin, and testosterone were evaluated in 90 untreated PCOS patients (25 ± 4 yrs; range 18-34) and 40 healthy controls of comparable age and BMI, from the same geographical area. FAI (free androgen index) and the homeostasis model assessment of insulin resistance (HoMA-IR) index were calculated. In o/o PCOS, 25(OH)D levels were significantly lower, and L/A values were significantly higher than in lean PCOS (p ...). Lower 25(OH)D and higher L/A were associated with PED/PEA-15 protein abundance in PCOS, suggesting their involvement in the ovarian imbalance between pro- and anti-apoptotic mechanisms, with high L/A and insulin and low 25(OH)D levels as the main determinants of PED/PEA-15 protein variability. Further studies, also involving different apoptotic pathways or inflammatory cytokines and granulosa cells, are mandatory to better define the possible bidirectional relationships between 25(OH)D, PED/PEA-15 protein abundance, leptin and adiponectin in PCOS pathogenesis.
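    The abstract above mentions three derived indices (HoMA-IR, FAI and L/A) without giving their formulas. The sketch below uses the conventional definitions; that the study used exactly these formulas and units is an assumption, and the function and parameter names are chosen here for illustration only.

```python
def homa_ir(fasting_insulin_uU_ml, fasting_glucose_mmol_l):
    """Conventional HOMA-IR: insulin (microU/mL) x glucose (mmol/L) / 22.5."""
    return fasting_insulin_uU_ml * fasting_glucose_mmol_l / 22.5


def free_androgen_index(total_testosterone_nmol_l, shbg_nmol_l):
    """Conventional FAI: 100 x total testosterone / SHBG, both in nmol/L."""
    if shbg_nmol_l <= 0:
        raise ValueError("SHBG must be positive")
    return 100.0 * total_testosterone_nmol_l / shbg_nmol_l


def leptin_adiponectin_ratio(leptin, adiponectin):
    """L/A ratio; the caller must keep the units consistent across subjects."""
    if adiponectin <= 0:
        raise ValueError("adiponectin must be positive")
    return leptin / adiponectin
```

    With made-up inputs, `homa_ir(10, 4.5)` gives 2.0 and `free_androgen_index(2.0, 40.0)` gives 5.0; these numbers only illustrate the arithmetic and are not taken from the study.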

  6. Serum 25-Hydroxyvitamin D Levels, phosphoprotein enriched in diabetes gene product (PED/PEA-15) and leptin-to-adiponectin ratio in women with PCOS

    Directory of Open Access Journals (Sweden)

    Savastano Silvia

    2011-11-01

    Background: Polycystic ovary syndrome (PCOS) is frequently associated with hypovitaminosis D. Vitamin D is endowed with pleiotropic effects, including insulin resistance (IR) and apoptotic pathway. Disruption of the complex mechanism that regulated ovarian apoptosis has been reported in PCOS. Phosphoprotein enriched in diabetes gene product (PED/PEA-15), an anti-apoptotic protein involved in type 2 diabetes mellitus (T2DM), is overexpressed in PCOS women, independently of obesity. Leptin-to-adiponectin ratio (L/A) is a biomarker of IR and low-grade inflammation in PCOS. The aim of the study was to investigate the levels of 25-hydroxy vitamin D (25(OH)D), and L/A, in association with PED/PEA-15 protein abundance, in both lean and overweight/obese (o/o) women with PCOS. Patients and Methods: PED/PEA-15 protein abundance and circulating levels of 25(OH)D, L/A, sex hormone-binding globulin, and testosterone were evaluated in 90 untreated PCOS patients (25 ± 4 yrs; range 18-34) and 40 healthy controls age and BMI comparable, from the same geographical area. FAI (free androgen index) and the homeostasis model assessment of insulin resistance (HoMA-IR) index were calculated. Results: In o/o PCOS, 25(OH)D levels were significantly lower, and L/A values were significantly higher than in lean PCOS (p Conclusions: Lower 25(OH)D and higher L/A were associated to PED/PEA-15 protein abundance in PCOS, suggesting their involvement in the ovarian imbalance between pro-and anti-apoptotic mechanisms, with high L/A and insulin and low 25(OH)D levels as the main determinants of PED/PEA-15 protein variability. Further studies, involving also different apoptotic pathways or inflammatory cytokines and granulosa cells are mandatory to better define the possible bidirectional relationships between 25(OH)D, PED/PEA-15 protein abundance, leptin and adiponectin in PCOS pathogenesis.

  7. Feeding glycerol-enriched yeast culture improves performance, energy status, and heat shock protein gene expression of lactating Holstein cows under heat stress.

    Science.gov (United States)

    Liu, J; Ye, G; Zhou, Y; Liu, Y; Zhao, L; Liu, Y; Chen, X; Huang, D; Liao, S F; Huang, K

    2014-06-01

    This study was conducted to evaluate the effects of supplemental common yeast culture (CY) and glycerol-enriched yeast culture (GY) on performance, plasma metabolites, antioxidant status, and heat shock protein 70 (HSP70) mRNA expression in lactating Holstein cows under heat stress. During summer months, 30 healthy multiparous lactating cows (parity 3.25 ± 0.48; 60 ± 13 d in milk [DIM]; 648 ± 57 kg BW; an average milk yield of 33.8 ± 1.6 kg/d) were blocked by parity, previous milk yield, and DIM and randomly allocated to 3 dietary treatments: no supplemental yeast culture (Control), 1 L/d of CY (33.1 g yeast) per cow, and 2 L/d of GY (153.2 g glycerol and 31.6 g yeast) per cow. During the 60-d experiment, values of air temperature and relative humidity inside the barn were recorded hourly every 3 d to calculate temperature-humidity index (THI). Weekly rectal temperatures (RT) and respiration rates and daily DMI and milk yield were recorded for all cows. Milk and blood samples were taken twice monthly, and BW and BCS were obtained on d 0 and 60. In this experiment, THI values indicated cows experienced a moderate heat stress. Cows supplemented with CY and GY had greater yields of milk, energy-corrected milk and milk fat, and milk fat percent but lower HSP70 mRNA expression in peripheral blood lymphocytes than Control cows (P cows. In conclusion, either CY or GY supplementation partially mitigated the negative effects of heat stress on performance and HSP70 mRNA expression of lactating cows, and GY supplementation provided additional improvements in energy status and HSP70 gene expression of lactating cows.

  8. An 80-gene set to predict response to preoperative chemoradiotherapy for rectal cancer by principle component analysis.

    Science.gov (United States)

    Empuku, Shinichiro; Nakajima, Kentaro; Akagi, Tomonori; Kaneko, Kunihiko; Hijiya, Naoki; Etoh, Tsuyoshi; Shiraishi, Norio; Moriyama, Masatsugu; Inomata, Masafumi

    2016-05-01

    Preoperative chemoradiotherapy (CRT) for locally advanced rectal cancer not only improves the postoperative local control rate, but also induces downstaging. However, it has not been established how to individually select patients who receive effective preoperative CRT. The aim of this study was to identify a predictor of response to preoperative CRT for locally advanced rectal cancer. This study is additional to our multicenter phase II study evaluating the safety and efficacy of preoperative CRT using oral fluorouracil (UMIN ID: 03396). From April, 2009 to August, 2011, 26 biopsy specimens obtained prior to CRT were analyzed by cyclopedic microarray analysis. Response to CRT was evaluated according to a histological grading system using surgically resected specimens. To decide on the number of genes for dividing into responder and non-responder groups, we statistically analyzed the data using a dimension reduction method, principal component analysis. Of the 26 cases, 11 were responders and 15 non-responders. No significant difference was found in clinical background data between the two groups. We determined that the optimal number of genes for the prediction of response was 80 of 40,000 and the functions of these genes were analyzed. When comparing non-responders with responders, genes expressed at a high level functioned in alternative splicing, whereas those expressed at a low level functioned in the septin complex. Thus, an 80-gene expression set that predicts response to preoperative CRT for locally advanced rectal cancer was identified using a novel statistical method.
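
    The abstract does not describe how the 80 genes were drawn from the principal component analysis. As a rough sketch of one plausible approach (an assumption, not the authors' procedure), genes can be ranked by the absolute size of their loadings on the leading component. The function name `top_genes_by_pca`, the synthetic data and its effect size are all hypothetical.

```python
import numpy as np


def top_genes_by_pca(expr, n_genes=80, n_components=1):
    """Rank genes by absolute loading on the leading principal components.

    expr is a (samples x genes) matrix; returns the indices of the
    n_genes genes with the largest summed absolute loadings.
    """
    centered = expr - expr.mean(axis=0)
    # Rows of vt are the principal axes, i.e. per-gene loadings.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    weight = np.abs(vt[:n_components]).sum(axis=0)
    return np.argsort(weight)[::-1][:n_genes]


# Synthetic stand-in for the data: 26 samples (11 responders, 15
# non-responders) and 1000 genes, of which the first 80 differ by group.
rng = np.random.default_rng(0)
responder = np.array([1] * 11 + [0] * 15)
expr = rng.normal(size=(26, 1000))
expr[:, :80] += 5.0 * responder[:, None]

selected = top_genes_by_pca(expr, n_genes=80)
recovered = int(np.sum(selected < 80))
print(f"{recovered} of the 80 planted genes were selected")
```

    On real microarray data the leading component need not track treatment response, so a selection like this would still have to be checked against the histological grading.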

  9. Enrichment of target sequences for next-generation sequencing applications in research and diagnostics.

    Science.gov (United States)

    Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter

    2014-02-01

    Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled about the requirement to take decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.

  10. Uranium enrichment

    International Nuclear Information System (INIS)

    1991-11-01

    This paper analyzes under four different scenarios the adequacy of a $500 million annual deposit into a fund to pay for the cost of cleaning up the Department of Energy's (DOE) three aging uranium enrichment plants. These plants are located in Oak Ridge, Tennessee; Paducah, Kentucky; and Portsmouth, Ohio. In summary the following was found: A fixed annual $500 million deposit made into a cleanup fund would not be adequate to cover total expected cleanup costs, nor would it be adequate to cover expected decontamination and decommissioning (D and D) costs. A $500 million annual deposit indexed to an inflation rate would likely be adequate to pay for all expected cleanup costs, including D and D costs, remedial action, and depleted uranium costs

  11. 16S rRNA gene-based molecular analysis of mat-forming and accompanying bacteria covering organically-enriched marine sediments underlying a salmon farm in Southern Chile (Calbuco Island)

    OpenAIRE

    Aranda, Carlos; Paredes, Javier; Valenzuela, Cristian; Lam, Phyllis; Guillou, Laure

    2010-01-01

    The mat-forming bacteria covering organic matter-enriched and anoxic marine sediments underlying a salmon farm in Southern Chile were examined using 16S rRNA gene phylogenies. This mat was absent in the sea bed outside the direct influence of the farm (360 m outside fish cages). Based on nearly complete 16S rRNA gene sequences (~1500 bp), mat-forming filamentous cells were identified as the sulphur-oxidizing and putatively dissimilative nitrate-reducing Beggiatoa spp., being closely related (up...

  12. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects.

    Science.gov (United States)

    Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling; Wang, Xianhui; Kang, Le

    2017-06-01

    The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. © The Authors 2017. Published by Oxford University Press.

  13. The impact of ACE gene polymorphism on the incidence and phenotype of sarcoidosis in rural and urban settings.

    Science.gov (United States)

    Kieszko, Robert; Krawczyk, Paweł; Powrózek, Tomasz; Szudy-Szczyrek, Aneta; Szczyrek, Michał; Homa, Iwona; Daniluk, Jadwiga; Milanowski, Janusz

    2016-12-01

    Sarcoidosis is a multisystem granulomatous disease of unknown etiology. Current theory on the etiology of this disease involves participation of genetic factors and unknown antigens present in the patients' environment. The aim of the study was to evaluate the prevalence of different polymorphic forms of the ACE gene in healthy individuals and sarcoidosis patients, and to estimate the risk of sarcoidosis in carriers of different ACE genotypes living in rural and urban settings. The study group included 180 patients with pulmonary sarcoidosis. Assessment of the disease was based on clinical features, laboratory and imaging examinations, as well as bronchoscopy with bronchoalveolar lavage (BAL). ACE gene polymorphism was examined in DNA isolated from peripheral blood or BAL fluid (BALF) leukocytes. Incidence of sarcoidosis was not influenced by gender, age or place of residence of the patients. There were no differences in the frequency of particular genotypes in patients with sarcoidosis and in healthy individuals. The risk of disease did not depend on the ACE gene polymorphism. There were no differences in the frequencies of the different genotypes and alleles of the ACE gene in patients with sarcoidosis divided by gender, age and place of residence or by clinical manifestation of sarcoidosis. Our results do not support the previous concept which suggested a higher incidence of sarcoidosis in individuals living in rural areas and in carriers of selected ACE genotypes. It is possible that this is related to the changing environment of rural areas, increasing urbanization and pollution.

  14. The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.)

    Directory of Open Access Journals (Sweden)

    Byregowda Munishamappa

    2010-03-01

    .8% in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR) motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8%) markers with an average of four alleles per marker and an average polymorphic information content (PIC) value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs) in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS) assay. Conclusion: The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding.
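
    The polymorphic information content (PIC) reported above is a function of a marker's allele frequencies. A minimal sketch follows, assuming the widely used Botstein-style definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; the abstract does not state which formula the authors applied, and the function name and example frequencies are illustrative only.

```python
from itertools import combinations


def polymorphic_information_content(freqs):
    """Compute PIC for one marker from its allele frequencies.

    Assumes PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical markers: one with two equally common alleles and one with
# four equally common alleles.
pic_two_alleles = polymorphic_information_content([0.5, 0.5])
pic_four_alleles = polymorphic_information_content([0.25, 0.25, 0.25, 0.25])
print(pic_two_alleles, pic_four_alleles)
```

    Under this definition a marker with several alleles at unequal frequencies can still score well below the equal-frequency maximum, which is consistent with an average of four alleles giving a modest average PIC.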

  15. Construction and evaluation of normalized cDNA libraries enriched with full-length sequences for rapid discovery of new genes from Sisal (Agave sisalana Perr.) different developmental stages.

    Science.gov (United States)

    Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

    2012-10-12

    To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.

  16. Candidate genes for chronic obstructive pulmonary disease in two large data sets

    DEFF Research Database (Denmark)

    Bakke, P S; Zhu, G; Gulsvik, A

    2011-01-01

    Lack of reproducibility of findings has been a criticism of genetic association studies in complex diseases like chronic obstructive pulmonary disease (COPD). We selected 257 polymorphisms of 16 genes with reported or potential relationships to COPD and genotyped these variants in a case-control study which included 953 COPD cases and 956 control subjects. We explored the association of these polymorphisms to three COPD phenotypes: a COPD binary phenotype and two quantitative traits (post bronchodilator FEV1 in percent predicted and FEV1/FVC). The polymorphisms significantly associated to these phenotypes in this first study were tested in a second, family based, study that included 635 pedigrees with 1910 individuals. Significant associations to the binary COPD phenotype in both populations were seen for STAT1 (rs13010343) and NFKBIB/SIRT2 (rs2241704) (p

  17. twzPEA: A Topology and Working Zone Based Pathway Enrichment Analysis Framework

    Science.gov (United States)

    Sensitive detection of involvement and adaptation of key signaling, regulatory, and metabolic pathways holds the key to deciphering molecular mechanisms such as those in the biomass-to-biofuel conversion process in yeast. Typical gene set enrichment analyses often do not use topology information in...

  18. Environmental enrichment for aquatic animals.

    Science.gov (United States)

    Corcoran, Mike

    2015-05-01

    Aquatic animals are the most popular pets in the United States based on the number of owned pets. They are popular display animals and are increasingly used in research settings. Enrichment of captive animals is an important element of zoo and laboratory medicine. The importance of enrichment for aquatic animals has been slower in implementation. For a long time, there was debate over whether or not fish were able to experience pain or form long-term memories. As that debate has reduced and the consciousness of more aquatic animals is accepted, the need to discuss enrichment for these animals has increased. Copyright © 2015 Elsevier Inc. All rights reserved.

  19. South Australia, uranium enrichment

    International Nuclear Information System (INIS)

    1976-02-01

    The Report sets out the salient data relating to the establishment of a uranium processing centre at Redcliff in South Australia. It is conceived as a major development project for the Commonwealth, the South Australian Government and Australian Industry comprising the refining and enrichment of uranium produced from Australian mines. Using the data currently available in respect of markets, demand, technology and possible financial return from overseas sales, the project could be initiated immediately with hexafluoride production, followed rapidly in stages by enrichment production using the centrifuge process. A conceptual development plan is presented, involving a growth pattern that would be closely synchronised with the mining and production of yellowcake. The proposed development is presented in the form of an eight-and-a-half-year programme. Costs in this Report are based on 1975 values, unless otherwise stated. (Author)

  20. RBiomirGS: an all-in-one miRNA gene set analysis solution featuring target mRNA mapping and expression profile integration

    Directory of Open Access Journals (Sweden)

    Jing Zhang

    2018-01-01

    Background: With the continuous discovery of microRNA’s (miRNA) association with a wide range of biological and cellular processes, expression profile-based functional characterization of such post-transcriptional regulation is crucial for revealing its significance behind particular phenotypes. Profound advancement in bioinformatics has been made to enable in depth investigation of miRNA’s role in regulating cellular and molecular events, resulting in a huge quantity of software packages covering different aspects of miRNA functional analysis. Therefore, an all-in-one software solution is in demand for a comprehensive yet highly efficient workflow. Here we present RBiomirGS, an R package for a miRNA gene set (GS) analysis. Methods: The package utilizes multiple databases for target mRNA mapping, estimates miRNA effect on the target mRNAs through miRNA expression profile and conducts a logistic regression-based GS enrichment. Additionally, human ortholog Entrez ID conversion functionality is included for target mRNAs. Results: By incorporating all the core steps into one package, RBiomirGS eliminates the need for switching between different software packages. The modular structure of RBiomirGS enables various access points to the analysis, with which users can choose the most relevant functionalities for their workflow. Conclusions: With RBiomirGS, users are able to assess the functional significance of the miRNA expression profile under the corresponding experimental condition by minimal input and intervention. Accordingly, RBiomirGS encompasses an all-in-one solution for miRNA GS analysis. RBiomirGS is available on GitHub (http://github.com/jzhangc/RBiomirGS). More information including instruction and examples can be found on website (http://kenstoreylab.com/?page_id=2865).
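
    The idea behind a logistic regression-based gene set enrichment can be sketched as follows. This is not RBiomirGS code and does not use its API; it assumes one common formulation in which gene set membership (0/1) is regressed on a per-gene score, so a positive slope suggests the set is enriched among high-scoring genes. The function name, the synthetic scores and the simulation parameters are hypothetical.

```python
import numpy as np


def logistic_gs_enrichment(scores, in_set, n_iter=25):
    """Fit membership ~ intercept + score by Newton's method.

    Returns the slope and its Wald z statistic.
    """
    x = np.column_stack([np.ones(len(scores)), scores])
    y = np.asarray(in_set, dtype=float)
    beta = np.zeros(2)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(x @ beta)))
        w = p * (1.0 - p)
        hessian = x.T @ (x * w[:, None])
        beta = beta + np.linalg.solve(hessian, x.T @ (y - p))
    # Standard error from the inverse information matrix at the optimum.
    p = 1.0 / (1.0 + np.exp(-(x @ beta)))
    w = p * (1.0 - p)
    cov = np.linalg.inv(x.T @ (x * w[:, None]))
    slope = beta[1]
    return slope, slope / np.sqrt(cov[1, 1])


# Synthetic example: 500 genes whose chance of belonging to the set
# rises with their score.
rng = np.random.default_rng(1)
scores = rng.normal(size=500)
membership_prob = 1.0 / (1.0 + np.exp(-(-2.0 + 1.5 * scores)))
in_set = rng.random(500) < membership_prob

slope, z_stat = logistic_gs_enrichment(scores, in_set)
print(f"slope={slope:.2f}, z={z_stat:.2f}")
```

    In a miRNA setting the per-gene score would come from the estimated miRNA effect on each target mRNA; how that score is built is specific to the package and is not reproduced here.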

  1. Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation

    Science.gov (United States)

    Jaiswal, Deepika; Jezek, Meagan; Quijote, Jeremiah; Lum, Joanna; Choi, Grace; Kulkarni, Rushmie; Park, DoHwan; Green, Erin M.

    2017-01-01

    The conserved yeast histone methyltransferase Set1 targets H3 lysine 4 (H3K4) for mono, di, and trimethylation and is linked to active transcription due to the euchromatic distribution of these methyl marks and the recruitment of Set1 during transcription. However, loss of Set1 results in increased expression of multiple classes of genes, including genes adjacent to telomeres and middle sporulation genes, which are repressed under normal growth conditions because they function in meiotic progression and spore formation. The mechanisms underlying Set1-mediated gene repression are varied, and still unclear in some cases, although repression has been linked to both direct and indirect action of Set1, associated with noncoding transcription, and is often dependent on the H3K4me2 mark. We show that Set1, and particularly the H3K4me2 mark, are implicated in repression of a subset of middle sporulation genes during vegetative growth. In the absence of Set1, there is loss of the DNA-binding transcriptional regulator Sum1 and the associated histone deacetylase Hst1 from chromatin in a locus-specific manner. This is linked to increased H4K5ac at these loci and aberrant middle gene expression. These data indicate that, in addition to DNA sequence, histone modification status also contributes to proper localization of Sum1. Our results also show that the role for Set1 in middle gene expression control diverges as cells receive signals to undergo meiosis. Overall, this work dissects an unexplored role for Set1 in gene-specific repression, and provides important insights into a new mechanism associated with the control of gene expression linked to meiotic differentiation. PMID:29066473

  2. A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database

    Directory of Open Access Journals (Sweden)

    Tripputi Mark

    2006-10-01

    Background: Many of the most popular pre-processing methods for Affymetrix expression arrays, such as RMA, gcRMA, and PLIER, simultaneously analyze data across a set of predetermined arrays to improve precision of the final measures of expression. One problem associated with these algorithms is that expression measurements for a particular sample are highly dependent on the set of samples used for normalization and results obtained by normalization with a different set may not be comparable. A related problem is that an organization producing and/or storing large amounts of data in a sequential fashion will need to either re-run the pre-processing algorithm every time an array is added or store them in batches that are pre-processed together. Furthermore, pre-processing of large numbers of arrays requires loading all the feature-level data into memory which is a difficult task even with modern computers. We utilize a scheme that produces all the information necessary for pre-processing using a very large training set that can be used for summarization of samples outside of the training set. All subsequent pre-processing tasks can be done on an individual array basis. We demonstrate the utility of this approach by defining a new version of the Robust Multi-chip Averaging (RMA) algorithm which we refer to as refRMA. Results: We assess performance based on multiple sets of samples processed over HG U133A Affymetrix GeneChip® arrays. We show that the refRMA workflow, when used in conjunction with a large, biologically diverse training set, results in the same general characteristics as that of RMA in its classic form when comparing overall data structure, sample-to-sample correlation, and variation. Further, we demonstrate that the refRMA workflow and reference set can be robustly applied to naïve organ types and to benchmark data where its performance indicates respectable results. Conclusion: Our results indicate that a biologically diverse
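
    The core idea of processing each array against a fixed reference, rather than against whichever arrays happen to share its batch, can be illustrated with quantile normalization to a frozen reference distribution. This is a simplified sketch of that general idea only, not the refRMA algorithm, which also covers background correction and probe-set summarization; the function names and synthetic intensities are hypothetical.

```python
import numpy as np


def build_reference(training):
    """Average the sorted intensities of all training arrays.

    training is an (arrays x probes) matrix; the result is one reference
    quantile per rank, computed once and then stored.
    """
    return np.sort(training, axis=1).mean(axis=0)


def normalize_to_reference(array, reference):
    """Map one new array onto the stored reference quantiles by rank."""
    ranks = np.argsort(np.argsort(array))
    return reference[ranks]


rng = np.random.default_rng(2)
training = rng.lognormal(mean=0.0, sigma=1.0, size=(50, 1000))
reference = build_reference(training)

# A new array with a different overall intensity scale is processed on
# its own, without reloading the training arrays.
new_array = rng.lognormal(mean=0.5, sigma=1.0, size=1000)
normalized = normalize_to_reference(new_array, reference)
```

    Because the reference is computed once, adding an array later does not change the values already produced for earlier arrays, which is the comparability property the abstract is concerned with.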

  3. New generation enrichment monitoring technology for gas centrifuge enrichment plants

    International Nuclear Information System (INIS)

    Ianakiev, Kiril D.; Alexandrov, Boian S.; Boyer, Brian D.; Hill, Thomas R.; Macarthur, Duncan W.; Marks, Thomas; Moss, Calvin E.; Sheppard, Gregory A.; Swinhoe, Martyn T.

    2008-01-01

    The continuous enrichment monitor, developed and fielded in the 1990s by the International Atomic Energy Agency, provided a go-no-go capability to distinguish between UF6 containing low enriched (approximately 4% 235U) and highly enriched (above 20% 235U) uranium. This instrument used the 22-keV line from a 109Cd source as a transmission source to achieve a high sensitivity to the UF6 gas absorption. The 1.27-yr half-life required that the source be periodically replaced and the instrument recalibrated. The instrument's functionality and accuracy were limited by the fact that measured gas density and gas pressure were treated as confidential facility information. The modern safeguarding of a gas centrifuge enrichment plant producing low-enriched UF6 product aims toward a more quantitative flow and enrichment monitoring concept that sets new standards for accuracy, stability, and confidence. An instrument must be accurate enough to detect the diversion of a significant quantity of material, have virtually zero false alarms, and protect the operator's proprietary process information. We discuss a new concept for advanced gas enrichment assay measurement technology. This design concept eliminates the need for the periodic replacement of a radioactive source as well as the need for maintenance by experts. Some initial experimental results will be presented.
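
    The need to replace the 109Cd source periodically follows from ordinary exponential decay. A minimal sketch, assuming only the 1.27-yr half-life quoted above; the function name and the chosen time points are illustrative:

```python
def remaining_fraction(elapsed_years, half_life_years=1.27):
    """Fraction of the initial source activity left after elapsed_years."""
    return 0.5 ** (elapsed_years / half_life_years)


# Activity left after one, two and three half-lives.
fractions = [remaining_fraction(n * 1.27) for n in (1, 2, 3)]
print(fractions)
```

    A weaker source means fewer transmitted counts per unit time, so the statistical precision of the measurement degrades unless the source is renewed and the instrument recalibrated.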

  4. A new sequence data set of SSU rRNA gene for Scleractinia and its phylogenetic and ecological applications

    KAUST Repository

    Arrigoni, Roberto; Vacherie, Benoît; Benzoni, Francesca; Stefani, Fabrizio; Karsenti, Eric; Jaillon, Olivier; Not, Fabrice; Nunes, Flavia; Payri, Claude; Wincker, Patrick; Barbe, Valérie

    2016-01-01

    Scleractinian corals (i.e. hard corals) play a fundamental role in building and maintaining coral reefs, one of the most diverse ecosystems on Earth. Nevertheless, their phylogenies remain largely unresolved and little is known about dispersal and survival of their planktonic larval phase. The small subunit ribosomal RNA (SSU rRNA) is a commonly used gene for DNA barcoding in several metazoans, and small variable regions of SSU rRNA are widely adopted as barcode marker to investigate marine plankton community structure worldwide. Here, we provide a large sequence data set of the complete SSU rRNA gene from 298 specimens, representing all known extant reef coral families and a total of 106 genera. The secondary structure was extremely conserved within the order with few exceptions due to insertions or deletions occurring in the variable regions. Remarkable differences in SSU rRNA length and base composition were detected between and within acroporids (Acropora, Montipora, Isopora and Alveopora) compared to other corals. The V4 and V9 regions seem to be promising barcode loci because variation at commonly used barcode primer binding sites was extremely low, while their levels of divergence allowed families and genera to be distinguished. A time-calibrated phylogeny of Scleractinia is provided, and mutation rate heterogeneity is demonstrated across main lineages. The use of this data set as a valuable reference for investigating aspects of ecology, biology, molecular taxonomy and evolution of scleractinian corals is discussed.

  5. Transcriptional differences between normal and glioma-derived glial progenitor cells identify a core set of dysregulated genes.

    Science.gov (United States)

    Auvergne, Romane M; Sim, Fraser J; Wang, Su; Chandler-Militello, Devin; Burch, Jaclyn; Al Fanek, Yazan; Davis, Danielle; Benraiss, Abdellatif; Walter, Kevin; Achanta, Pragathi; Johnson, Mahlon; Quinones-Hinojosa, Alfredo; Natesan, Sridaran; Ford, Heide L; Goldman, Steven A

    2013-06-27

    Glial progenitor cells (GPCs) are a potential source of malignant gliomas. We used A2B5-based sorting to extract tumorigenic GPCs from human gliomas spanning World Health Organization grades II-IV. Messenger RNA profiling identified a cohort of genes that distinguished A2B5+ glioma tumor progenitor cells (TPCs) from A2B5+ GPCs isolated from normal white matter. A core set of genes and pathways was substantially dysregulated in A2B5+ TPCs, which included the transcription factor SIX1 and its principal cofactors, EYA1 and DACH2. Small hairpin RNAi silencing of SIX1 inhibited the expansion of glioma TPCs in vitro and in vivo, suggesting a critical and unrecognized role of the SIX1-EYA1-DACH2 system in glioma genesis or progression. By comparing the expression patterns of glioma TPCs with those of normal GPCs, we have identified a discrete set of pathways by which glial tumorigenesis may be better understood and more specifically targeted. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  6. A new sequence data set of SSU rRNA gene for Scleractinia and its phylogenetic and ecological applications

    KAUST Repository

    Arrigoni, Roberto

    2016-11-27

    Scleractinian corals (i.e. hard corals) play a fundamental role in building and maintaining coral reefs, one of the most diverse ecosystems on Earth. Nevertheless, their phylogenies remain largely unresolved and little is known about dispersal and survival of their planktonic larval phase. The small subunit ribosomal RNA (SSU rRNA) is a commonly used gene for DNA barcoding in several metazoans, and small variable regions of SSU rRNA are widely adopted as barcode marker to investigate marine plankton community structure worldwide. Here, we provide a large sequence data set of the complete SSU rRNA gene from 298 specimens, representing all known extant reef coral families and a total of 106 genera. The secondary structure was extremely conserved within the order with few exceptions due to insertions or deletions occurring in the variable regions. Remarkable differences in SSU rRNA length and base composition were detected between and within acroporids (Acropora, Montipora, Isopora and Alveopora) compared to other corals. The V4 and V9 regions seem to be promising barcode loci because variation at commonly used barcode primer binding sites was extremely low, while their levels of divergence allowed families and genera to be distinguished. A time-calibrated phylogeny of Scleractinia is provided, and mutation rate heterogeneity is demonstrated across main lineages. The use of this data set as a valuable reference for investigating aspects of ecology, biology, molecular taxonomy and evolution of scleractinian corals is discussed.

  7. Reconstruction of gene regulatory modules from RNA silencing of IFN-α modulators: experimental set-up and inference method.

    Science.gov (United States)

    Grassi, Angela; Di Camillo, Barbara; Ciccarese, Francesco; Agnusdei, Valentina; Zanovello, Paola; Amadori, Alberto; Finesso, Lorenzo; Indraccolo, Stefano; Toffolo, Gianna Maria

    2016-03-12

    Inference of gene regulation from expression data may help to unravel regulatory mechanisms involved in complex diseases or in the action of specific drugs. A challenging task for many researchers working in the field of systems biology is to build up an experiment with a limited budget and produce a dataset suitable to reconstruct putative regulatory modules worthy of biological validation. Here, we focus on small-scale gene expression screens and we introduce a novel experimental set-up and a customized method of analysis to make inference on regulatory modules starting from genetic perturbation data, e.g. knockdown and overexpression data. To illustrate the utility of our strategy, it was applied to produce and analyze a dataset of quantitative real-time RT-PCR data, in which interferon-α (IFN-α) transcriptional response in endothelial cells is investigated by RNA silencing of two candidate IFN-α modulators, STAT1 and IFIH1. A putative regulatory module was reconstructed by our method, revealing an intriguing feed-forward loop, in which STAT1 regulates IFIH1 and they both negatively regulate IFNAR1. STAT1 regulation of IFNAR1 was experimentally validated at the protein level. A detailed description of the experimental set-up and of the analysis procedure is reported, with the intent of inspiring other scientists who want to carry out similar experiments to reconstruct gene regulatory modules starting from perturbations of possible regulators. Application of our approach to the study of IFN-α transcriptional response modulators in endothelial cells has led to many interesting novel findings and new biological hypotheses worthy of validation.

  8. Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification

    Science.gov (United States)

    2018-01-01

    One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets were published, these are usually in very poor agreement with each other, and very few of the genes proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in the cancer genome atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes. PMID:29470520
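    The effect described in this abstract, random gene sets appearing predictive because a hidden sub-class drives both expression and outcome, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy data, the helper names `pearson` and `random_set_hit_rate`, and above all the use of a plain Pearson correlation as a stand-in for a proper survival analysis. It is not the procedure used in the study.

```python
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def random_set_hit_rate(expr, outcome, set_size, n_sets=200, threshold=0.8, seed=0):
    """Fraction of random gene sets whose mean expression tracks the outcome.

    expr maps gene name -> per-patient expression values (hypothetical layout).
    A set counts as a hit when |correlation| >= threshold; this is a crude
    stand-in for a survival test, chosen only to keep the sketch short.
    """
    rng = random.Random(seed)
    genes = sorted(expr)
    hits = 0
    for _ in range(n_sets):
        chosen = rng.sample(genes, set_size)
        scores = [sum(expr[g][i] for g in chosen) / set_size
                  for i in range(len(outcome))]
        if abs(pearson(scores, outcome)) >= threshold:
            hits += 1
    return hits / n_sets


# Toy cohort: a hidden two-level sub-class shifts every gene and the outcome.
subclass = [i % 2 for i in range(10)]
outcome = [5.0 * s + 0.1 * i for i, s in enumerate(subclass)]
expr = {
    f"gene{g}": [s + 0.01 * ((7 * i + g) % 5) for i, s in enumerate(subclass)]
    for g in range(50)
}
confounded_rate = random_set_hit_rate(expr, outcome, set_size=5)
```

    In this toy cohort every gene carries the sub-class signal, so any random selection of genes tracks the outcome. Repeating the calculation within each sub-class separately is one way to see how sub-classification could reduce the effect.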

  9. Juvenile psittacine environmental enrichment.

    Science.gov (United States)

    Simone-Freilicher, Elisabeth; Rupley, Agnes E

    2015-05-01

    Environmental enrichment is of great import to the emotional, intellectual, and physical development of the juvenile psittacine and their success in the human home environment. Five major types of enrichment include social, occupational, physical, sensory, and nutritional. Occupational enrichment includes exercise and psychological enrichment. Physical enrichment includes the cage and accessories and the external home environment. Sensory enrichment may be visual, auditory, tactile, olfactory, or taste oriented. Nutritional enrichment includes variations in appearance, type, and frequency of diet, and treats, novelty, and foraging. Two phases of the preadult period deserve special enrichment considerations: the development of autonomy and puberty. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Optimization to the Culture Conditions for Phellinus Production with Regression Analysis and Gene-Set Based Genetic Algorithm

    Science.gov (United States)

    Li, Zhongwei; Xin, Yuezhen; Wang, Xun; Sun, Beibei; Xia, Shengyu; Li, Hui

    2016-01-01

    Phellinus is a kind of fungus known as one of the elemental components in drugs used to prevent cancers. With the purpose of finding optimized culture conditions for Phellinus production in the laboratory, many single-factor experiments were carried out and a large amount of experimental data was generated. In this work, we use the data collected from these experiments for regression analysis, obtaining a mathematical model for predicting Phellinus production. Subsequently, a gene-set based genetic algorithm is developed to optimize the values of the parameters involved in culture conditions, including inoculum size, pH value, initial liquid volume, temperature, seed age, fermentation time, and rotation speed. The optimized parameter values are in accordance with biological experimental results, which indicates that our method has good predictive ability for culture-condition optimization. PMID:27610365
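    The general idea of optimizing culture parameters against a fitted model with a genetic algorithm can be sketched as follows. This is a generic real-valued genetic algorithm written for illustration only, not the gene-set based variant the authors developed. The surrogate `hypothetical_yield`, its optimum, and the parameter bounds are invented stand-ins for the regression model, which the abstract does not give.

```python
import random


def genetic_search(fitness, bounds, pop_size=30, generations=60,
                   mutation_rate=0.2, seed=0):
    """Maximize `fitness` over box-constrained real parameters."""
    rng = random.Random(seed)

    def random_individual():
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    def crossover(a, b):
        # Uniform crossover: each parameter comes from one parent at random.
        return [x if rng.random() < 0.5 else y for x, y in zip(a, b)]

    def mutate(ind):
        # Gaussian perturbation, clipped back into the allowed range.
        out = []
        for x, (lo, hi) in zip(ind, bounds):
            if rng.random() < mutation_rate:
                x += rng.gauss(0.0, 0.1 * (hi - lo))
            out.append(min(hi, max(lo, x)))
        return out

    population = [random_individual() for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        elite = population[: pop_size // 2]  # keep the better half unchanged
        children = [mutate(crossover(rng.choice(elite), rng.choice(elite)))
                    for _ in range(pop_size - len(elite))]
        population = elite + children
    return max(population, key=fitness)


def hypothetical_yield(params):
    """Invented surrogate model: peak at 28 degrees C and pH 6."""
    temperature, ph = params
    return -((temperature - 28.0) ** 2 + (ph - 6.0) ** 2)


# Hypothetical search ranges: temperature in degrees C, then pH.
bounds = [(20.0, 35.0), (4.0, 8.0)]
best = genetic_search(hypothetical_yield, bounds)
```

    Keeping the better half of each generation unchanged means the best fitness never decreases from one generation to the next. The remaining culture parameters listed in the abstract would simply add further entries to `bounds`.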

  11. Other enrichment related contracts

    International Nuclear Information System (INIS)

    Hall, J.C.

    1978-01-01

    In addition to long-term enrichment contracts, DOE has other types of contracts: (1) short-term, fixed-commitment enrichment contract; (2) emergency sales agreement for enriched uranium; (3) feed material lease agreement; (4) enriched uranium storage agreement; and (5) feed material usage agreement

  12. Comparative genomic analysis of Brucella abortus vaccine strain 104M reveals a set of candidate genes associated with its virulence attenuation.

    Science.gov (United States)

    Yu, Dong; Hui, Yiming; Zai, Xiaodong; Xu, Junjie; Liang, Long; Wang, Bingxiang; Yue, Junjie; Li, Shanhu

    2015-01-01

    The Brucella abortus strain 104M, a spontaneously attenuated strain, has been used as a vaccine strain in humans against brucellosis for 6 decades in China. Despite many studies, the molecular mechanisms that cause the attenuation are still unclear. Here, we determined the whole-genome sequence of 104M and conducted a comprehensive comparative analysis against the whole genome sequences of the virulent strain, A13334, and other reference strains. This analysis revealed a highly similar genome structure between 104M and A13334. The further comparative genomic analysis between 104M and A13334 revealed a set of genes missing in 104M. Some of these genes were identified to be directly or indirectly associated with virulence. Similarly, a set of mutations in the virulence-related genes was also identified, which may be related to virulence alteration. This study provides a set of candidate genes associated with virulence attenuation in B. abortus vaccine strain 104M.

  13. Enrichment of antibiotic resistance genes in soil receiving composts derived from swine manure, yard wastes, or food wastes, and evidence for multiyear persistence of swine Clostridium spp.

    Science.gov (United States)

    Scott, Andrew; Tien, Yuan-Ching; Drury, Craig F; Reynolds, W Daniel; Topp, Edward

    2018-03-01

    The impact of amendment with swine manure compost (SMC), yard waste compost (YWC), or food waste compost (FWC) on the abundance of antibiotic resistance genes in soil was evaluated. Following a commercial-scale application of the composts in a field experiment, soils were sampled periodically for a decade, and archived air-dried. Soil DNA was extracted and gene targets quantified by qPCR. Compared with untreated control soil, all 3 amendment types increased the abundance of gene targets for up to 4 years postapplication. The abundance of several gene targets was much higher in soil amended with SMC than in soil receiving either YWC or FWC. The gene target ermB remained higher in the SMC treatment for a decade postapplication. Clostridia were significantly more abundant in the SMC-amended soil throughout the decade following application. Eight percent of Clostridium spp. isolates from the SMC treatment carried ermB. Overall, addition of organic amendments to soils has the potential to increase the abundance of antibiotic resistance genes. Amendments of fecal origin, such as SMC, will in addition entrain bacteria carrying antibiotic resistance genes. Environmentally recalcitrant clostridia, and the antibiotic resistance genes that they carry, will persist for many years under field conditions following the application of SMC.

  14. Environmental enrichment increases transcriptional and epigenetic differentiation between mouse dorsal and ventral dentate gyrus.

    Science.gov (United States)

    Zhang, Tie-Yuan; Keown, Christopher L; Wen, Xianglan; Li, Junhao; Vousden, Dulcie A; Anacker, Christoph; Bhattacharyya, Urvashi; Ryan, Richard; Diorio, Josie; O'Toole, Nicholas; Lerch, Jason P; Mukamel, Eran A; Meaney, Michael J

    2018-01-19

    Early life experience influences stress reactivity and mental health through effects on cognitive-emotional functions that are, in part, linked to gene expression in the dorsal and ventral hippocampus. The hippocampal dentate gyrus (DG) is a major site for experience-dependent plasticity associated with sustained transcriptional alterations, potentially mediated by epigenetic modifications. Here, we report comprehensive DNA methylome, hydroxymethylome and transcriptome data sets from mouse dorsal and ventral DG. We find genome-wide transcriptional and methylation differences between dorsal and ventral DG, including at key developmental transcriptional factors. Peripubertal environmental enrichment increases hippocampal volume and enhances dorsal DG-specific differences in gene expression. Enrichment also enhances dorsal-ventral differences in DNA methylation, including at binding sites of the transcription factor NeuroD1, a regulator of adult neurogenesis. These results indicate a dorsal-ventral asymmetry in transcription and methylation that parallels well-known functional and anatomical differences, and that may be enhanced by environmental enrichment.

  15. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Directory of Open Access Journals (Sweden)

    Alamar Santiago

    2009-09-01

    Background: Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results: We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion: The new

  16. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-01-01

    Background: Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results: We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion: The new EST collection denotes an

  17. Development of a versatile enrichment analysis tool reveals associations between the maternal brain and mental health disorders, including autism

    Science.gov (United States)

    2013-01-01

    Background: A recent study of lateral septum (LS) suggested a large number of autism-related genes with altered expression in the postpartum state. However, formally testing the findings for enrichment of autism-associated genes proved to be problematic with existing software. Many gene-disease association databases have been curated which are not currently incorporated in popular, full-featured enrichment tools, and the use of custom gene lists in these programs can be difficult to perform and interpret. As a simple alternative, we have developed the Modular Single-set Enrichment Test (MSET), a minimal tool that enables one to easily evaluate expression data for enrichment of any conceivable gene list of interest. Results: The MSET approach was validated by testing several publicly available expression data sets for expected enrichment in areas of autism, attention deficit hyperactivity disorder (ADHD), and arthritis. Using nine independent, unique autism gene lists extracted from association databases and two recent publications, a striking consensus of enrichment was detected within gene expression changes in LS of postpartum mice. A network of 160 autism-related genes was identified, representing developmental processes such as synaptic plasticity, neuronal morphogenesis, and differentiation. Additionally, maternal LS displayed enrichment for genes associated with bipolar disorder, schizophrenia, ADHD, and depression. Conclusions: The transition to motherhood includes the most fundamental social bonding event in mammals and features naturally occurring changes in sociability. Some individuals with autism, schizophrenia, or other mental health disorders exhibit impaired social traits. Genes involved in these deficits may also contribute to elevated sociability in the maternal brain. To date, this is the first study to show a significant, quantitative link between the maternal brain and mental health disorders using large scale gene expression data. Thus, the
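    Testing expression results for enrichment of a single custom gene list, as this abstract describes, can be sketched with a simple randomization test. The abstract does not give MSET's internals, so the code below is an assumption-laden illustration rather than the tool's actual algorithm. The function name `randomization_enrichment`, its parameters, and the toy gene names are all hypothetical.

```python
import random


def randomization_enrichment(background, hits, gene_list, n_iter=2000, seed=0):
    """Empirical p-value for the overlap between `hits` and `gene_list`.

    background: all genes measured in the experiment
    hits:       genes called differentially expressed
    gene_list:  custom list of interest (e.g. disease-associated genes)
    """
    rng = random.Random(seed)
    universe = sorted(set(background))
    universe_set = set(universe)
    target = set(gene_list) & universe_set
    hit_set = set(hits) & universe_set
    observed = len(hit_set & target)

    # Draw random "hit lists" of the same size and count how often they
    # overlap the target list at least as much as the real one does.
    at_least = 0
    for _ in range(n_iter):
        sample = rng.sample(universe, len(hit_set))
        if sum(g in target for g in sample) >= observed:
            at_least += 1

    # Add-one correction keeps the estimate strictly above zero.
    p_value = (at_least + 1) / (n_iter + 1)
    return observed, p_value


# Toy example with made-up gene identifiers.
background = [f"g{i}" for i in range(1000)]
gene_list = [f"g{i}" for i in range(50)]
hits = [f"g{i}" for i in range(20)] + [f"g{i}" for i in range(500, 520)]
observed, p_value = randomization_enrichment(background, hits, gene_list)
```

    Sampling from the measured background rather than from the whole genome matters here: genes that were never assayed cannot appear among the hits, so including them would overstate the enrichment.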

  18. Uranium enriched granites in Sweden

    International Nuclear Information System (INIS)

    Wilson, M.R.; Aakerblom, G.

    1980-01-01

    Granites with uranium contents higher than normal occur in a variety of geological settings in the Swedish Precambrian, and represent a variety of granite types and ages. They may have been generated by (1) the anatexis of continental crust or (2) processes occurring at a much greater depth. They commonly show enrichment in F, Sn, W and/or Mo. Only in one case is an important uranium mineralization thought to be directly related to a uranium-enriched granite, while the majority of epigenetic uranium mineralizations with economic potential are related to hydrothermal processes in areas where the bedrock is regionally uranium-enhanced. (Authors)

  19. The acute effects on duodenal gene expression in healthy men following consumption of a low-fat meal enriched with theobromine or fat.

    Science.gov (United States)

    Smolders, Lotte; Mensink, Ronald P; Boekschoten, Mark V; de Ridder, Rogier J J; Plat, Jogchum

    2018-01-26

    Increasing apoA-I synthesis may improve HDL functionality and lower CVD risk. As theobromine and fat increase fasting apoA-I concentrations, and the intestine is involved in apoA-I production, the acute effects of both were studied on duodenal gene transcription to better understand underlying mechanisms. In this crossover study, 8 healthy men received once a low fat (LF) meal, a LF meal plus theobromine (850 mg), or a high fat (HF) meal. Five hours after meal intake duodenal biopsies were taken for microarray analysis. Theobromine and HF consumption did not change duodenal apoA-I expression. Theobromine did not change gene expression related to lipid and cholesterol metabolism, whereas those related to glycogen/glucose breakdown were downregulated. HF consumption increased gene expression related to lipid and cholesterol uptake and transport, and to glucose storage, while it decreased those related to glucose uptake. Furthermore, genes related to inflammation were upregulated, but inflammation markers in plasma were not changed. In healthy men, acute theobromine and fat consumption did not change duodenal apoA-I mRNA, but inhibited expression of genes related to glucose metabolism. Furthermore, HF intake activated in the duodenum expression of genes related to lipid and cholesterol metabolism and to inflammation.

  20. Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies

    Science.gov (United States)

    Medina, Ignacio; Montaner, David; Bonifaci, Nuria; Pujana, Miguel Angel; Carbonell, José; Tarraga, Joaquin; Al-Shahrour, Fatima; Dopazo, Joaquin

    2009-01-01

    Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/ PMID:19502494

  1. A MultiSite Gateway™ vector set for the functional analysis of genes in the model Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Nagels Durand Astrid

    2012-09-01

    Background: Recombinatorial cloning using the Gateway™ technology has been the method of choice for high-throughput omics projects, resulting in the availability of entire ORFeomes in Gateway™ compatible vectors. The MultiSite Gateway™ system allows combining multiple genetic fragments such as promoter, ORF and epitope tag in one single reaction. To date, this technology has not been accessible in the yeast Saccharomyces cerevisiae, one of the most widely used experimental systems in molecular biology, due to the lack of appropriate destination vectors. Results: Here, we present a set of three-fragment MultiSite Gateway™ destination vectors that have been developed for gene expression in S. cerevisiae and that allow the assembly of any promoter, open reading frame, epitope tag arrangement in combination with any of four auxotrophic markers and three distinct replication mechanisms. As an example of its applicability, we used yeast three-hybrid to provide evidence for the assembly of a ternary complex of plant proteins involved in jasmonate signalling and consisting of the JAZ, NINJA and TOPLESS proteins. Conclusion: Our vectors make MultiSite Gateway™ cloning accessible in S. cerevisiae and implement a fast and versatile cloning method for the high-throughput functional analysis of (heterologous) proteins in one of the most widely used model organisms for molecular biology research.

  2. Report of the Subcommittee on Domestic Uranium Enrichment

    International Nuclear Information System (INIS)

    1981-01-01

    A report by the Subcommittee on Domestic Uranium Enrichment to the Atomic Energy Commission is described. It covers the path of domestic uranium enrichment by the centrifugal process up to commercial production and reviews the current situation in this field. Domestic uranium enrichment is important for securing a stable enrichment service, establishing a sound fuel cycle, and other purposes. As the future target, production around the year 2000 is set at a minimum of 3,000 tons SWU per year. The business of uranium enrichment, which is now being developed by the Power Reactor and Nuclear Fuel Development Corporation, is to be carried out by private enterprise. The contents are as follows: demand and supply balance of uranium enrichment service, significance of domestic uranium enrichment, evaluation of centrifugal uranium enrichment technology, the target of domestic uranium enrichment, the policy of domestic uranium enrichment promotion. (J.P.N.)

  3. Derived enriched uranium market

    International Nuclear Information System (INIS)

    Rutkowski, E.

    1996-01-01

    The potential impact on the uranium market of highly enriched uranium from nuclear weapons dismantling in the Russian Federation and the USA is analyzed. Uranium supply, conversion, and enrichment factors are outlined for each country; inventories are also listed. The enrichment component and conversion components are expected to cause little disruption to uranium markets. The uranium component of Russian derived enriched uranium hexafluoride is unresolved; US legislation places constraints on its introduction into the US market

  4. Uranium enrichment plans

    International Nuclear Information System (INIS)

    Thomas, D.C.; Gagne, R.W.

    1978-01-01

    The following topics are covered: the status of the Government's existing uranium enrichment services contracts, natural uranium requirements based on the latest contract information, uncertainty in predicting natural uranium requirements based on uranium enrichment contracts, and domestic and foreign demand assumed in enrichment planning

  5. Post-prandial effects of hazelnut-enriched high fat meal on LDL oxidative status, oxidative and inflammatory gene expression of healthy subjects: a randomized trial.

    Science.gov (United States)

    Di Renzo, L; Merra, G; Botta, R; Gualtieri, P; Manzo, A; Perrone, M A; Mazza, M; Cascapera, S; De Lorenzo, A

    2017-04-01

    Postprandial oxidative stress is characterized by an increased susceptibility of the organism towards oxidative damage after consumption of a meal rich in lipids and/or carbohydrates. Micronutrients modulate the immune system and exert a protective action by reducing low-density lipoproteins oxidation (ox-LDL) via induction of antioxidant enzymes. The clinical study was a randomized and cross-over trial, conducted through the CONSORT flowchart. We evaluated the gene expression of 103 genes related to oxidative stress (HOSp) and human inflammasome pathways (HIp), and ox-LDL level at fasting and after 40 g raw "Tonda Gentile delle Langhe" hazelnut consumption, in association with a McDonald's® Meal (McDM) in 22 healthy human volunteers. Ox-LDL levels significantly increased comparing no dietary treatment (NDT) vs. McDM, and decreased comparing McDM vs. McDM + H (p<0.05). The percentages of significant genes expressed after each dietary treatment were as follows: (A) NDT vs. McDM: 3.88% HIp and 17.48% HOSp; (B) NDT vs. McDM + H: 17.48% HIp and 23.30% HOSp; (C) McDM vs. McDM + H: 17.48% HIp and 33.98% HOSp. Hazelnut consumption reduced postprandial risk factors of atherosclerosis, such as ox-LDL, and the expression of inflammation and oxidative stress related genes. Chronic studies on larger populations are necessary before definitive conclusions can be drawn.

  6. The acute effects on duodenal gene expression in healthy men following consumption of a low-fat meal enriched with theobromine or fat

    NARCIS (Netherlands)

    Smolders, Lotte; Mensink, Ronald P.; Boekschoten, Mark V.; Ridder, De Rogier J.J.; Plat, Jogchum

    2018-01-01

    Increasing apoA-I synthesis may improve HDL functionality and lower CVD risk. As theobromine and fat increase fasting apoA-I concentrations, and the intestine is involved in apoA-I production, the acute effects of both were studied on duodenal gene transcription to better understand underlying

  7. The map-1 gene family in root-knot nematodes, Meloidogyne spp.: a set of taxonomically restricted genes specific to clonal species.

    Directory of Open Access Journals (Sweden)

    Iva Tomalova

    Taxonomically restricted genes (TRGs), i.e., genes that are restricted to a limited subset of phylogenetically related organisms, may be important in adaptation. In parasitic organisms, TRG-encoded proteins are possible determinants of the specificity of host-parasite interactions. In the root-knot nematode (RKN) Meloidogyne incognita, the map-1 gene family encodes expansin-like proteins that are secreted into plant tissues during parasitism, thought to act as effectors to promote successful root infection. MAP-1 proteins exhibit a modular architecture, with variable number and arrangement of 58- and 13-aa domains in their central part. Here, we address the evolutionary origins of this gene family using a combination of bioinformatics and molecular biology approaches. Map-1 genes were solely identified in one single member of the phylum Nematoda, i.e., the genus Meloidogyne, and not detected in any other nematode, thus indicating that the map-1 gene family is indeed a TRG family. A phylogenetic analysis of the distribution of map-1 genes in RKNs further showed that these genes are specifically present in species that reproduce by mitotic parthenogenesis, with the exception of M. floridensis, and could not be detected in RKNs reproducing by either meiotic parthenogenesis or amphimixis. These results highlight the divergence between mitotic and meiotic RKN species as a critical transition in the evolutionary history of these parasites. Analysis of the sequence conservation and organization of repeated domains in map-1 genes suggests that gene duplication(s) together with domain loss/duplication have contributed to the evolution of the map-1 family, and that some strong selection mechanism may be acting upon these genes to maintain their functional role(s) in the specificity of the plant-RKN interactions.

  8. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records

    DEFF Research Database (Denmark)

    Jiang, Li; Edwards, Stefan M.; Thomsen, Bo

    2014-01-01

    from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text...

  9. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting.

    Science.gov (United States)

    Zhao, Wei; Ware, Erin B; He, Zihuai; Kardia, Sharon L R; Faul, Jessica D; Smith, Jennifer A

    2017-09-29

    Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index)-associated genetic loci identified through large-scale genome-wide association studies (GWAS) only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms) modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS). In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs) within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS). Childhood socioeconomic status (parental education) was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488) by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA) ( p = 0.07).

  10. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting

    Directory of Open Access Journals (Sweden)

    Wei Zhao

    2017-09-01

    Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index)-associated genetic loci identified through large-scale genome-wide association studies (GWAS) only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms) modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS). In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs) within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS). Childhood socioeconomic status (parental education) was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488) by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA) (p = 0.07).

  11. Uranium Enrichment, an overview

    International Nuclear Information System (INIS)

    Coates, J.H.

    1994-01-01

    This general presentation on uranium enrichment will be followed by lectures on more specific topics including descriptions of enrichment processes and assessments of the prevailing commercial and industrial situations. I shall therefore avoid as much as possible duplications with these other lectures, and rather dwell on: some theoretical aspects of enrichment in general, underlying the differences between statistical and selective processes, a review and comparison between enrichment processes, remarks of general order regarding applications, the proliferation potential of enrichment. It is noteworthy that enrichment: may occur twice in the LWR fuel cycle: first by enriching natural uranium, second by reenriching uranium recovered from reprocessing, must meet LWR requirements, and in particular higher assays required by high burn up fuel elements, bears on the structure of the entire front part of the fuel cycle, namely in the conversion/reconversion steps only involving UF 6 for the moment. (author). tabs., figs., 4 refs

  12. Differential gene expression in granulosa cells from polycystic ovary syndrome patients with and without insulin resistance: identification of susceptibility gene sets through network analysis.

    Science.gov (United States)

    Kaur, Surleen; Archer, Kellie J; Devi, M Gouri; Kriplani, Alka; Strauss, Jerome F; Singh, Rita

    2012-10-01

    Polycystic ovary syndrome (PCOS) is a heterogeneous, genetically complex, endocrine disorder of uncertain etiology in women. Our aim was to compare the gene expression profiles in stimulated granulosa cells of PCOS women with and without insulin resistance vs. matched controls. This study included 12 normal ovulatory women (controls), 12 women with PCOS without evidence for insulin resistance (PCOS non-IR), and 16 women with insulin resistance (PCOS-IR) undergoing in vitro fertilization. Granulosa cell gene expression profiling was accomplished using Affymetrix Human Genome-U133 arrays. Differentially expressed genes were classified according to gene ontology using ingenuity pathway analysis tools. Microarray results for selected genes were confirmed by real-time quantitative PCR. A total of 211 genes were differentially expressed in PCOS non-IR and PCOS-IR granulosa cells (fold change≥1.5; P≤0.001) vs. matched controls. Diabetes mellitus and inflammation genes were significantly increased in PCOS-IR patients. Real-time quantitative PCR confirmed higher expression of NCF2 (2.13-fold), TCF7L2 (1.92-fold), and SERPINA1 (5.35-fold). Increased expression of inflammation genes ITGAX (3.68-fold) and TAB2 (1.86-fold) was confirmed in PCOS non-IR. Different cardiometabolic disease genes were differentially expressed in the two groups. Decreased expression of CAV1 (-3.58-fold) in PCOS non-IR and SPARC (-1.88-fold) in PCOS-IR was confirmed. Differential expression of genes involved in TGF-β signaling (IGF2R, increased; and HAS2, decreased), and oxidative stress (TXNIP, increased) was confirmed in both groups. Microarray analysis demonstrated differential expression of genes linked to diabetes mellitus, inflammation, cardiovascular diseases, and infertility in the granulosa cells of PCOS women with and without insulin resistance. Because these dysregulated genes are also involved in oxidative stress, lipid metabolism, and insulin signaling, we hypothesize that these

  13. Interaction between dopamine D2 receptor genotype and parental rule-setting in adolescent alcohol use: evidence for a gene-parenting interaction.

    NARCIS (Netherlands)

    Zwaluw, C.S. van der; Engels, R.C.E.M.; Vermulst, A.A.; Franke, B.; Buitelaar, J.K.; Verkes, R.J.; Scholte, R.H.

    2010-01-01

    Association studies investigating the link between the dopamine D2 receptor gene (DRD2) and alcohol (mis)use have shown inconsistent results. This may be due to lack of attention for environmental factors. High levels of parental rule-setting are associated with lower levels of adolescent alcohol

  14. Bridging cancer biology with the clinic: relative expression of a GRHL2-mediated gene-set pair predicts breast cancer metastasis.

    Directory of Open Access Journals (Sweden)

    Xinan Yang

    Identification and characterization of crucial gene target(s) that will allow focused therapeutics development remains a challenge. We have interrogated the putative therapeutic targets associated with the transcription factor Grainy head-like 2 (GRHL2), a critical epithelial regulatory factor. We demonstrate the possibility to define the molecular functions of critical genes in terms of their personalized expression profiles, allowing appropriate functional conclusions to be derived. A novel methodology, relative expression analysis with gene-set pairs (RXA-GSP), is designed to explore the potential clinical utility of cancer-biology discovery. Observing that Grhl2-overexpression leads to increased metastatic potential in vitro, we established a model assuming Grhl2-induced or -inhibited genes confer poor or favorable prognosis respectively for cancer metastasis. Training on public gene expression profiles of 995 breast cancer patients, this method prioritized one gene-set pair (GRHL2, CDH2, FN1, CITED2, MKI67 versus CTNNB1 and CTNNA3) from all 2717 possible gene-set pairs (GSPs). The identified GSP significantly dichotomized 295 independent patients for metastasis-free survival (log-rank tested p = 0.002; severe empirical p = 0.035). It also showed evidence of clinical prognostication in another independent 388 patients collected from three studies (log-rank tested p = 3.3e-6). This GSP is independent of most traditional prognostic indicators, and is only significantly associated with the histological grade of breast cancer (p = 0.0017), a GRHL2-associated clinical character (p = 6.8e-6, Spearman correlation), suggesting that this GSP is reflective of GRHL2-mediated events. Furthermore, a literature review indicates the therapeutic potential of the identified genes. This research demonstrates a novel strategy to integrate both biological experiments and clinical gene expression profiles for extracting and elucidating the genomic

  15. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records

    DEFF Research Database (Denmark)

    Jiang, Li; Edwards, Stefan M.; Thomsen, Bo

    2014-01-01

    Background: Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic... from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining...

  16. Selection and validation of a set of reliable reference genes for quantitative RT-PCR studies in the brain of the Cephalopod Mollusc Octopus vulgaris

    Directory of Open Access Journals (Sweden)

    Biffali Elio

    2009-07-01

    Background: Quantitative real-time polymerase chain reaction (RT-qPCR) is valuable for studying the molecular events underlying physiological and behavioral phenomena. Normalization of real-time PCR data is critical for a reliable mRNA quantification. Here we identify reference genes to be utilized in RT-qPCR experiments to normalize and monitor the expression of target genes in the brain of the cephalopod mollusc Octopus vulgaris, an invertebrate. Such an approach is novel for this taxon and of advantage in future experiments given the complexity of the behavioral repertoire of this species when compared with its relatively simple neural organization. Results: We chose 16S and 18S rRNA, actB, EEF1A, tubA and ubi as candidate reference genes (housekeeping genes, HKG). The expression of 16S and 18S was highly variable and did not meet the requirements of candidate HKG. The expression of the other genes was almost stable and uniform among samples. We analyzed the expression of HKG in two different sets of animals using tissues taken from the central nervous system (brain parts) and mantle (here considered as control tissue) by BestKeeper, geNorm and NormFinder. We found that HKG expressions differed considerably with respect to brain area and octopus samples in an HKG-specific manner. However, when the mantle is treated as control tissue and the entire central nervous system is considered, NormFinder revealed tubA and ubi as the most suitable HKG pair. These two genes were utilized to evaluate the relative expression of the genes FoxP, creb, dat and TH in O. vulgaris. Conclusion: We analyzed the expression profiles of some genes here identified for O. vulgaris by applying RT-qPCR analysis for the first time in cephalopods. We validated candidate reference genes and found the expression of ubi and tubA to be the most appropriate to evaluate the expression of target genes in the brain of different octopuses. Our results also underline the

  17. Effect of bioaugmentation by cellulolytic bacteria enriched from sheep rumen on methane production from wheat straw.

    Science.gov (United States)

    Ozbayram, E Gozde; Kleinsteuber, Sabine; Nikolausz, Marcell; Ince, Bahar; Ince, Orhan

    2017-08-01

    The aim of this study was to determine the potential of bioaugmentation with cellulolytic rumen microbiota to enhance the anaerobic digestion of lignocellulosic feedstock. An anaerobic cellulolytic culture was enriched from sheep rumen fluid using wheat straw as substrate under mesophilic conditions. To investigate the effects of bioaugmentation on methane production from straw, the enrichment culture was added to batch reactors in proportions of 2% (Set-1) and 4% (Set-2) of the microbial cell number of the standard inoculum slurry. The methane production in the bioaugmented reactors was higher than in the control reactors. After 30 days of batch incubation, the average methane yield was 154 mL_N CH4 g_VS^-1 in the control reactors. Addition of 2% enrichment culture did not enhance methane production, whereas in Set-2 the methane yield was increased by 27%. The bacterial communities were examined by 454 amplicon sequencing of 16S rRNA genes, while terminal restriction fragment length polymorphism (T-RFLP) fingerprinting of mcrA genes was applied to analyze the methanogenic communities. The results highlighted that relative abundances of Ruminococcaceae and Lachnospiraceae increased during the enrichment. However, Cloacamonaceae, which were abundant in the standard inoculum, dominated the bacterial communities of all batch reactors. T-RFLP profiles revealed that Methanobacteriales were predominant in the rumen fluid, whereas the enrichment culture was dominated by Methanosarcinales. In the batch reactors, the most abundant methanogens were affiliated to Methanobacteriales and Methanomicrobiales. Our results suggest that bioaugmentation with sheep rumen enrichment cultures can enhance the performance of digesters treating lignocellulosic feedstock. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Sex hormones and gene expression signatures in peripheral blood from postmenopausal women - the NOWAC postgenome study

    Directory of Open Access Journals (Sweden)

    Rylander Charlotta

    2011-03-01

    Background: Postmenopausal hormone therapy (HT) influences endogenous hormone concentrations and increases the risk of breast cancer. Gene expression profiling may reveal the mechanisms behind this relationship. Our objective was to explore potential associations between sex hormones and gene expression in whole blood from a population-based, random sample of postmenopausal women. Methods: Gene expression, as measured by the Applied Biosystems microarray platform, was compared between hormone therapy (HT) users and non-users and between high and low hormone plasma concentrations using both gene-wise analysis and gene set analysis. Gene sets found to be associated with HT use were further analysed for enrichment in functional clusters and network predictions. The gene expression matrix included 285 samples and 16185 probes and was adjusted for significant technical variables. Results: Gene-wise analysis revealed several genes significantly associated with different types of HT use. The functional cluster analyses provided limited information on these genes. Gene set analysis revealed 22 gene sets that were enriched between high and low estradiol concentration (HT-users excluded). Among these were seven oestrogen related gene sets, including our gene list associated with systemic estradiol use, which thereby represents a novel oestrogen signature. Seven gene sets were related to immune response. Among the 15 gene sets enriched for progesterone, 11 overlapped with estradiol. No significant gene expression patterns were found for testosterone, follicle stimulating hormone (FSH) or sex hormone binding globulin (SHBG). Conclusions: Distinct gene expression patterns associated with sex hormones are detectable in a random group of postmenopausal women, as demonstrated by the finding of a novel oestrogen signature.
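
    The abstract above does not say which statistic was used to test gene sets for enrichment. As a generic illustration only, and not the authors' method, a one-sided hypergeometric over-representation test can be sketched as follows. The function name and all counts are hypothetical toy values.

```python
from math import comb


def hypergeom_enrichment_p(overlap, universe, set_size, hits):
    """One-sided over-representation p-value, P(X >= overlap).

    universe: number of genes measured in total
    set_size: number of genes in the gene set of interest
    hits:     number of genes called significant
    overlap:  number of significant genes that belong to the gene set
    """
    lowest = max(overlap, 0)
    highest = min(set_size, hits)
    total = comb(universe, hits)
    # math.comb returns 0 when the lower argument exceeds the upper one,
    # so impossible overlaps contribute nothing to the sum.
    tail = sum(
        comb(set_size, i) * comb(universe - set_size, hits - i)
        for i in range(lowest, highest + 1)
    )
    return tail / total


# Toy numbers (assumed for illustration): 20 genes measured, a gene set
# of 5 genes, 4 significant genes, 2 of which fall inside the gene set.
p_value = hypergeom_enrichment_p(overlap=2, universe=20, set_size=5, hits=4)
print(f"p = {p_value:.4f}")
```

    In practice such p-values would still need correction for multiple testing across all gene sets examined.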

  19. A tandem sequence motif acts as a distance-dependent enhancer in a set of genes involved in translation by binding the proteins NonO and SFPQ

    Directory of Open Access Journals (Sweden)

    Roepcke Stefan

    2011-12-01

    Background: Bioinformatic analyses of expression control sequences in promoters of co-expressed or functionally related genes enable the discovery of common regulatory sequence motifs that might be involved in co-ordinated gene expression. By studying promoter sequences of the human ribosomal protein genes we recently identified a novel highly specific Localized Tandem Sequence Motif (LTSM). In this work we sought to identify additional genes and LTSM-binding proteins to elucidate potential regulatory mechanisms. Results: Genome-wide analyses allowed finding a considerable number of additional LTSM-positive genes, the products of which are involved in translation, among them, translation initiation and elongation factors, and 5S rRNA. Electromobility shift assays then showed specific signals demonstrating the binding of protein complexes to LTSM in ribosomal protein gene promoters. Pull-down assays with LTSM-containing oligonucleotides and subsequent mass spectrometric analysis identified the related multifunctional nucleotide binding proteins NonO and SFPQ in the binding complex. Functional characterization then revealed that LTSM enhances the transcriptional activity of the promoters depending on the distance from the transcription start site. Conclusions: Our data demonstrate the power of bioinformatic analyses for the identification of biologically relevant sequence motifs. LTSM and the here found LTSM-binding proteins NonO and SFPQ were discovered through a synergistic combination of bioinformatic and biochemical methods and are regulators of the expression of a set of genes of the translational apparatus in a distance-dependent manner.

  20. Taxonomically Different Co-Microsymbionts of a Relict Legume, Oxytropis popoviana, Have Complementary Sets of Symbiotic Genes and Together Increase the Efficiency of Plant Nodulation.

    Science.gov (United States)

    Safronova, Vera I; Belimov, Andrey A; Sazanova, Anna L; Chirak, Elizaveta R; Verkhozina, Alla V; Kuznetsova, Irina G; Andronov, Evgeny E; Puhalsky, Jan V; Tikhonovich, Igor A

    2018-06-20

    Ten rhizobial strains were isolated from root nodules of a relict legume Oxytropis popoviana Peschkova. For identification of the isolates, sequencing of rrs, the internal transcribed spacer region, and housekeeping genes recA, glnII, and rpoB was used. Nine fast-growing isolates were Mesorhizobium-related; eight strains were identified as M. japonicum and one isolate belonged to M. kowhaii. The only slow-growing isolate was identified as a Bradyrhizobium sp. Two strains, M. japonicum Opo-242 and Bradyrhizobium sp. strain Opo-243, were isolated from the same nodule. Symbiotic genes of these isolates were searched throughout the whole-genome sequences. The common nodABC genes and other symbiotic genes required for plant nodulation and nitrogen fixation were present in the isolate Opo-242. Strain Opo-243 did not contain the principal nod, nif, and fix genes; however, five genes (nodP, nodQ, nifL, nolK, and noeL) affecting the specificity of plant-rhizobia interactions but absent in isolate Opo-242 were detected. Strain Opo-243 could not induce nodules but significantly accelerated the root nodule formation after coinoculation with isolate Opo-242. Thus, we demonstrated that taxonomically different strains of the archaic symbiotic system can be co-microsymbionts infecting the same nodule and promoting the nodulation process due to complementary sets of symbiotic genes.

  1. Identification and Construction of Combinatory Cancer Hallmark-Based Gene Signature Sets to Predict Recurrence and Chemotherapy Benefit in Stage II Colorectal Cancer.

    Science.gov (United States)

    Gao, Shanwu; Tibiche, Chabane; Zou, Jinfeng; Zaman, Naif; Trifiro, Mark; O'Connor-McCourt, Maureen; Wang, Edwin

    2016-01-01

    Decisions regarding adjuvant therapy in patients with stage II colorectal cancer (CRC) have been among the most challenging and controversial in oncology over the past 20 years. To develop robust combinatory cancer hallmark-based gene signature sets (CSS sets) that more accurately predict prognosis and identify a subset of patients with stage II CRC who could gain survival benefits from adjuvant chemotherapy. Thirteen retrospective studies of patients with stage II CRC who had clinical follow-up and adjuvant chemotherapy were analyzed. Respective totals of 162 and 843 patients from 2 and 11 independent cohorts were used as the discovery and validation cohorts, respectively. A total of 1005 patients with stage II CRC were included in the 13 cohorts. Among them, 84 of 416 patients in 3 independent cohorts received fluorouracil-based adjuvant chemotherapy. Identification of CSS sets to predict relapse-free survival and identify a subset of patients with stage II CRC who could gain substantial survival benefits from fluorouracil-based adjuvant chemotherapy. Eight cancer hallmark-based gene signatures (30 genes each) were identified and used to construct CSS sets for determining prognosis. The CSS sets were validated in 11 independent cohorts of 767 patients with stage II CRC who did not receive adjuvant chemotherapy. The CSS sets accurately stratified patients into low-, intermediate-, and high-risk groups. Five-year relapse-free survival rates were 94%, 78%, and 45%, respectively, representing 60%, 28%, and 12% of patients with stage II disease. The 416 patients with CSS set-defined high-risk stage II CRC who received fluorouracil-based adjuvant chemotherapy showed a substantial gain in survival benefits from the treatment (ie, recurrence reduced by 30%-40% in 5 years). The CSS sets substantially outperformed other prognostic predictors of stage II CRC. They are more accurate and robust for prognostic predictions and facilitate the identification of patients with stage

  2. Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome.

    Science.gov (United States)

    Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I

    2017-08-01

    The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1 Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0 Gbp represented in 195,170 scaffolds with an N50 of 60,284 bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally, we report the finding of a novel cytochrome P450 CYP41 homolog that shows similar protein folding structures to CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.
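
    The N50 reported above is a standard assembly contiguity statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. A minimal sketch of the usual definition follows. The scaffold lengths are hypothetical toy values, not data from the Rmi v2.0 assembly, and the function name is an assumption for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    # Walk from the longest scaffold down until half the assembly is covered.
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly: total length 300, so half is 150.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # prints 70, because 80 + 70 reaches 150
```

    Because N50 depends only on the length distribution, it says nothing about assembly correctness, so it is normally read alongside measures such as total assembled length and gene content.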

  3. IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis.

    Science.gov (United States)

    Zhang, Fan; Drabier, Renee

    2012-01-01

    Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, and there is an urgent need for a better way to accurately interpret and distill such large amounts of data. Extensive pathway and network analysis allow for the discovery of highly significant pathways from a set of disease vs. healthy samples in the NGS and GWAS. Knowledge of activation of these processes will lead to elucidation of the complex biological pathways affected by drug treatment, to patient stratification studies of new and existing drug treatments, and to understanding the underlying anti-cancer drug effects. There are approximately 141 biological human pathway resources as of Jan 2012 according to the Pathguide database. However, most currently available resources do not contain disease, drug or organ specificity information such as disease-pathway, drug-pathway, and organ-pathway associations. Systematically integrating pathway, disease, drug and organ specificity together becomes increasingly crucial for understanding the interrelationships between signaling, metabolic and regulatory pathway, drug action, disease susceptibility, and organ specificity from high-throughput omics data (genomics, transcriptomics, proteomics and metabolomics). We designed the Integrated Pathway Analysis Database for Systematic Enrichment Analysis (IPAD, http://bioinfo.hsc.unt.edu/ipad), defining inter-association between pathway, disease, drug and organ specificity, based on six criteria: 1) comprehensive pathway coverage; 2) gene/protein to pathway/disease/drug/organ association; 3) inter-association between pathway, disease, drug, and organ; 4) multiple and quantitative measurement of enrichment and inter-association; 5) assessment of enrichment and inter-association analysis with the context of the existing biological knowledge and a "gold standard" constructed from reputable and reliable sources; and 6) cross-linking of

  4. Effects of maternal dietary selenium (Se-enriched yeast) on testis development, testosterone level and testicular steroidogenesis-related gene expression of their male kids in Taihang Black Goats.

    Science.gov (United States)

    Shi, Lei; Song, Ruigao; Yao, Xiaolei; Duan, Yunli; Ren, Youshe; Zhang, Chunxiang; Yue, Wenbin; Lei, Fulin

    2018-07-01

    To investigate the effects of maternal dietary selenium (Se-enriched yeast) on testis development, testosterone level and steroidogenesis-related gene expression in testis of their male kids, selected pregnant Taihang Black Goats were randomly allotted to four treatment groups. They were fed the basal gestation and lactation diets supplemented with 0 (control), 0.5, 2.0 and 4.0 mg of Se/kg DM. Thirty days after weaning, testes were collected from the kids. After the morphological development status of testis was examined, tissue samples were collected for analyzing testosterone concentration and histological parameters. Testosterone synthesis-related genes were detected using real-time PCR. Localization and quantification of androgen receptor (AR) in testis of goats were determined by immunohistochemical and western blot analysis. The results show that Se supplementation in the diet of dams led to higher (p ... kids. Excessive Se (4.0 mg/kg) can inhibit the development of testis by decreasing testicular weight and volume. The density of spermatogenic cells and Leydig cells in the Se treatment groups was significantly (p ... kids by modulating testosterone synthesis in goats. More attention should be given to the potential role of maternal nutrition in improving reproductive performance of their offspring. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Advanced enrichment techniques

    International Nuclear Information System (INIS)

    Johnson, A.

    1988-01-01

    BNFL is in a unique position in that it has commercial experience of diffusion enrichment, and of centrifuge enrichment through its associate company Urenco. In addition BNFL is developing laser enrichment techniques as part of a UK development programme in this area. The paper describes the development programme which led to the introduction of competitive centrifuge enrichment technology by Urenco and discusses the areas where improvements have and will continue to be made in the centrifuge process. It also describes the laser development programme currently being undertaken in the UK. The paper concludes by discussing the relative merits of the various methods of uranium enrichment, with particular reference to the enrichment market likely to obtain over the rest of the century

  6. Advanced enrichment techniques

    International Nuclear Information System (INIS)

    Johnson, A.

    1987-01-01

    BNFL is in a unique position in that it has commercial experience of diffusion enrichment, and of centrifuge enrichment through its associate company Urenco. In addition BNFL is developing laser enrichment techniques as part of a UK development programme in this area. The paper describes the development programme which led to the introduction of competitive centrifuge enrichment technology by Urenco and discusses the areas where improvements have and will continue to be made in the centrifuge process. It also describes the laser development programme currently being undertaken in the UK. The paper concludes by discussing the relative merits of the various methods of uranium enrichment, with particular reference to the enrichment market likely to obtain over the rest of the century. (author)

  7. AnovArray: a set of SAS macros for the analysis of variance of gene expression data

    Directory of Open Access Journals (Sweden)

    Renard Jean-Paul

    2005-06-01

    Background: Analysis of variance is a powerful approach to identify differentially expressed genes in a complex experimental design for microarray and macroarray data. The advantage of the ANOVA model is the possibility to evaluate multiple sources of variation in an experiment. Results: AnovArray is a package implementing ANOVA for gene expression data using SAS® statistical software. The originality of the package is 1) to quantify the different sources of variation on all genes together, 2) to provide a quality control of the model, and 3) to propose two models for a gene's variance estimation and to perform a correction for multiple comparisons. Conclusion: AnovArray is freely available at http://www-mig.jouy.inra.fr/stat/AnovArray and requires only SAS® statistical software.

  8. Uranium enrichment: an overview

    International Nuclear Information System (INIS)

    Cazalet, J.

    1995-01-01

    This paper is a general presentation of uranium enrichment processes and assessments of the prevailing commercial and industrial situations. It gives first some theoretical aspects of enrichment in general and explains the differences between statistical and selective processes in particular. Then a review of the different processes is made with a comparison between them. Finally, some general remarks concerning applications are given and the risks of proliferation related to enrichment are mentioned. (J.S.). 4 refs., 5 figs., 8 tabs

  9. The enrichment secondary market

    International Nuclear Information System (INIS)

    Einbund, D.R.

    1986-01-01

    This paper addresses two topics: the background to the present status of the enrichment secondary market and the future outlook of the secondary market in enrichment services, and the viability of the nuclear fuel brokerage industry. These two topics are inevitably connected, as most secondary market activity, not only in enrichment but also in natural uranium, has traditionally been conducted with the participation of brokers. Therefore, the author interrelates these topics

  10. Wakame and Nori in restructured meats included in cholesterol-enriched diets affect the antioxidant enzyme gene expressions and activities in Wistar rats.

    Science.gov (United States)

    Moreira, Adriana Schultz; González-Torres, Laura; Olivero-David, Raul; Bastida, Sara; Benedi, Juana; Sánchez-Muniz, Francisco J

    2010-09-01

    The effects of diets including restructured meats (RM) containing Wakame or Nori on total liver glutathione status, and several antioxidant enzyme gene expressions and activities were tested. Six groups of ten male growing Wistar rats each were fed a mix of 85% AIN-93 M diet and 15% freeze-dried RM for 35 days. The control group (C) consumed control RM, the Wakame (W) and the Nori (N) groups, RM with 5% Wakame and 5% Nori, respectively. Animals on added cholesterol diets (CC, CW, and CN) consumed their corresponding basal diets supplemented with cholesterol (2%) and cholic acid (0.4%). Alga and dietary cholesterol significantly interact (P Nori-RM is a hypocholesterolemic food while Wakame-RM is an antioxidant food. This should be taken into account when including this kind of RM as potential functional foods in humans.

  11. Robust de novo pathway enrichment with KeyPathwayMiner 5 [version 1; referees: 2 approved]

    Directory of Open Access Journals (Sweden)

    Nicolas Alcaraz

    2016-06-01

    Full Text Available Identifying functional modules or novel active pathways, recently termed de novo pathway enrichment, is a computational systems biology challenge that has gained much attention during the last decade. Given a large biological interaction network, KeyPathwayMiner extracts connected subnetworks that are enriched for differentially active entities from a series of molecular profiles encoded as binary indicator matrices. Since interaction networks constantly evolve, an important question is how robust the extracted results are when the network is modified. We enable users to study this effect through several network perturbation techniques and over a range of perturbation degrees. In addition, users may now provide a gold-standard set to determine how enriched extracted pathways are with relevant genes compared to randomized versions of the original network.
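
    The robustness question raised above can be illustrated with a minimal sketch. It assumes random edge removal as the perturbation and uses the largest connected component of "active" nodes as a stand-in for subnetwork extraction. Neither is taken from the abstract: KeyPathwayMiner's actual perturbation techniques and extraction algorithm are not described here, and the toy network, node names and function names are hypothetical.

```python
import random


def remove_edges(edges, fraction, rng):
    """Return a copy of the edge set with roughly `fraction` of edges removed."""
    ordered = sorted(edges)  # fixed order so a seeded rng is reproducible
    n_keep = len(ordered) - round(fraction * len(ordered))
    return set(rng.sample(ordered, n_keep))


def largest_active_component(edges, active):
    """Largest connected component of the subgraph induced by active nodes.

    This is a simplified stand-in, not the KeyPathwayMiner algorithm.
    """
    adj = {node: set() for node in active}
    for u, v in edges:
        if u in active and v in active:
            adj[u].add(v)
            adj[v].add(u)
    seen, best = set(), set()
    for start in sorted(adj):
        if start in seen:
            continue
        comp, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(adj[node] - comp)
        seen |= comp
        if len(comp) > len(best):
            best = comp
    return best


def jaccard(a, b):
    """Jaccard similarity of two node sets (1.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical toy network and set of differentially active nodes.
edges = {("A", "B"), ("B", "C"), ("C", "D"), ("E", "F")}
active = {"A", "B", "C", "E", "F"}
baseline = largest_active_component(edges, active)

rng = random.Random(0)
for degree in (0.0, 0.25, 0.5):
    perturbed = remove_edges(edges, degree, rng)
    result = largest_active_component(perturbed, active)
    print(degree, sorted(result), jaccard(baseline, result))
```

    Repeating the loop over many random seeds and averaging the Jaccard similarity per perturbation degree would give a simple robustness curve.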

  12. Heat Stress and Lipopolysaccharide Stimulation of Chicken Macrophage-Like Cell Line Activates Expression of Distinct Sets of Genes.

    Directory of Open Access Journals (Sweden)

    Anna Slawinska

    Full Text Available Acute heat stress requires immediate adjustment of the stressed individual to sudden changes of ambient temperatures. Chickens are particularly sensitive to heat stress due to development of insufficient physiological mechanisms to mitigate its effects. One of the symptoms of heat stress is endotoxemia that results from release of the lipopolysaccharide (LPS) from the guts. Heat-related cytotoxicity is mitigated by the innate immune system, which is comprised mostly of phagocytic cells such as monocytes and macrophages. The objective of this study was to analyze the molecular responses of the chicken macrophage-like HD11 cell line to combined heat stress and lipopolysaccharide treatment in vitro. The cells were heat-stressed and then allowed a temperature-recovery period, during which the gene expression was investigated. LPS was added to the cells to mimic the heat-stress-related endotoxemia. Semi high-throughput gene expression analysis was used to study a gene panel comprised of heat shock proteins, stress-related genes, signaling molecules and immune response genes. HD11 cell line responded to heat stress with increased mRNA abundance of the HSP25, HSPA2 and HSPH1 chaperones as well as DNAJA4 and DNAJB6 co-chaperones. The anti-apoptotic gene BAG3 was also highly up-regulated, providing evidence that the cells expressed pro-survival processes. The immune response of the HD11 cell line to LPS in the heat stress environment (up-regulation of CCL4, CCL5, IL1B, IL8 and iNOS) was higher than in thermoneutral conditions. However, the peak in the transcriptional regulation of the immune genes was after two hours of temperature-recovery. Therefore, we propose the potential influence of the extracellular heat shock proteins not only in mitigating effects of abiotic stress but also in triggering the higher level of the immune responses. Finally, use of correlation networks for the data analysis aided in discovering subtle differences in the gene

  13. Sexual and asexual oogenesis require the expression of unique and shared sets of genes in the insect Acyrthosiphon pisum

    Directory of Open Access Journals (Sweden)

    Gallot Aurore

    2012-02-01

    Full Text Available Abstract Background Although sexual reproduction is dominant within eukaryotes, asexual reproduction is widespread and has evolved independently as a derived trait in almost all major taxa. How asexuality evolved in sexual organisms is unclear. Aphids, such as Acyrthosiphon pisum, alternate between asexual and sexual reproductive means, as the production of parthenogenetic viviparous females or sexual oviparous females and males varies in response to seasonal photoperiodism. Consequently, sexual and asexual development in aphids can be analyzed simultaneously in genetically identical individuals. Results We compared the transcriptomes of aphid embryos in the stages of development during which the trajectory of oogenesis is determined for producing sexual or asexual gametes. This study design aimed at identifying genes involved in the onset of the divergent mechanisms that result in the sexual or asexual phenotype. We detected 33 genes that were differentially transcribed in sexual and asexual embryos. Functional annotation by gene ontology (GO) showed a biological signature of oogenesis, cell cycle regulation, epigenetic regulation and RNA maturation. In situ hybridizations demonstrated that 16 of the differentially-transcribed genes were specifically expressed in germ cells and/or oocytes of asexual and/or sexual ovaries, and therefore may contribute to aphid oogenesis. We categorized these 16 genes by their transcription patterns in the two types of ovaries; they were: (i) expressed during sexual and asexual oogenesis; (ii) expressed during sexual and asexual oogenesis but with different localizations; or (iii) expressed only during sexual or asexual oogenesis. Conclusions Our results show that asexual and sexual oogenesis in aphids share common genetic programs but diverge by adapting specificities in their respective gene expression profiles in germ cells and oocytes.

  14. Uranium enrichment plans

    International Nuclear Information System (INIS)

    Gagne, R.W.; Thomas, D.C.

    1977-01-01

    The status of existing uranium enrichment contracts in the US is reviewed and expected natural uranium requirements for existing domestic uranium enrichment contracts are evaluated. Uncertainty in natural uranium requirements associated with requirements-type and fixed-commitment type contracts is discussed along with implementation of variable tails assay

  15. gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

    Science.gov (United States)

    Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J

    2017-05-01

    Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.
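
    The role of an effective number of tests in family-wise error rate control can be sketched as follows. This minimal example assumes an effective number of tests is already available and only converts it into per-test significance thresholds. The estimators derived by the authors are not given in the abstract and are not reproduced here; the function name and the example values are hypothetical.

```python
def fwer_thresholds(alpha, m_eff):
    """Per-test thresholds for a family-wise error rate `alpha`.

    `m_eff` is an effective number of independent tests, assumed given.
    Returns (Bonferroni-style threshold, Sidak-style threshold).
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    if m_eff < 1:
        raise ValueError("m_eff must be at least 1")
    bonferroni = alpha / m_eff
    sidak = 1.0 - (1.0 - alpha) ** (1.0 / m_eff)
    return bonferroni, sidak


# Hypothetical scan: 1,000 overlapping gene sets behaving like 400 independent tests.
nominal = fwer_thresholds(0.05, 1000)
effective = fwer_thresholds(0.05, 400)
print(nominal, effective)
```

    Because overlapping gene sets yield correlated test statistics, the effective number of tests is at most the nominal number, so the resulting thresholds are less stringent than a naive correction over all sets.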

  16. Modeling of Transients in an Enrichment Circuit

    International Nuclear Information System (INIS)

    Fernandino, Maria; Delmastro, Dario; Brasnarof, Daniel

    2003-01-01

    In the present work a mathematical model is presented in order to describe the dynamic behavior inside a closed enrichment loop, the latter representing a single stage of a uranium gaseous diffusion enrichment cascade. The analytical model is turned into a numerical model, and implemented through a computational code. Transients of two species separation were numerically analyzed, including setting times of each magnitude, behavior of each one of them during different transients, and redistribution of concentrations along the closed loop

  17. Enrichment of Acinetobacter spp. from food samples.

    Science.gov (United States)

    Carvalheira, Ana; Ferreira, Vânia; Silva, Joana; Teixeira, Paula

    2016-05-01

    Relatively little is known about the role of foods in the chain of transmission of acinetobacters and the occurrence of different Acinetobacter spp. in foods. Currently, there is no standard procedure to recover acinetobacters from food in order to gain insight into the food-related ecology and epidemiology of acinetobacters. This study aimed to assess whether enrichment in Dijkshoorn enrichment medium followed by plating in CHROMagar™ Acinetobacter medium is a useful method for the isolation of Acinetobacter spp. from foods. Recovery of six Acinetobacter species from food spiked with these organisms was compared for two selective enrichment media (Baumann's enrichment and Dijkshoorn's enrichment). Significantly (p enrichment. Next, the Dijkshoorn's enrichment followed by direct plating on CHROMagar™ Acinetobacter was applied to detect Acinetobacter spp. in different foods. Fourteen different presumptive acinetobacters were recovered and assumed to represent nine different strains on the basis of REP-PCR typing. Eight of these strains were identified by rpoB gene analysis as belonging to the species Acinetobacter johnsonii, Acinetobacter calcoaceticus, Acinetobacter guillouiae and Acinetobacter gandensis. It was not possible to identify the species level of one strain, which may suggest that it represents a distinct species. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Assessment of topoisomerase II-alpha gene status by dual color chromogenic in situ hybridization in a set of Iraqi patients with invasive breast carcinoma

    Directory of Open Access Journals (Sweden)

    Rasha Abd Alraouf Neama

    2017-01-01

    Full Text Available Background: The human epidermal growth factor receptor 2 (HER2) proto-oncogene is overexpressed or amplified in approximately 15%–25% of invasive breast cancers. Approximately 35% of HER2-amplified breast cancers have coamplification of the topoisomerase II-alpha (TOP2A) gene encoding an enzyme that is a major target of anthracyclines. Hence, the determination of genetic alteration (amplification or deletion) of both genes is considered as an important predictive factor that determines the response of breast cancer patients to treatment. The aims of this study are to determine TOP2A gene amplification status in a set of Iraqi patients with breast cancer that have had an equivocal (2+) and positive HER2/neu by immunohistochemistry (IHC) and to compare the results with estrogen receptor (ER) and progesterone receptor (PR) and HER2/neu status. Patients and Methods: A cross-sectional prospective study was done on 53 patients with invasive breast carcinoma. Twenty-six out of total 53 cases were positive HER2/neu (3+); the remaining 27 equivocal HER2-IHC (2+) cases were reanalyzed using dual-color chromogenic in situ hybridization (ZytoVision probe kit) for further identification of HER2/neu gene amplification. Using chromogenic in situ hybridization (CISH), TOP2A gene status determination was done for all cases. Results: There is a direct significant correlation between TOP2A gene amplification and HER2/neu positivity, P < 0.05, in that 15 (39.4%) out of 38 positive HER2/neu cases were associated with topoisomerase gene amplification. Regarding relation of topoisomerase gene to hormone receptor status (ER and PR), there was a significant negative relationship between the gene and ER receptor status. The higher level of gene amplification was noticed in ER and PR negative cases in about 13 (43.3%) and 14 (48.2%) for ER and PR, respectively. Conclusion: TOP2A gene status has a significantly positive correlation with HER2/neu status while it has a significantly negative

  19. Genetic variations in the CLNK gene and ZNF518B gene are associated with gout in case-control sample sets.

    Science.gov (United States)

    Jin, Tian-Bo; Ren, Yongchao; Shi, Xugang; Jiri, Mutu; He, Na; Feng, Tian; Yuan, Dongya; Kang, Longli

    2015-07-01

    A genome-wide association study of gout in European populations identified 12 genetic variants strongly associated with risk of gout, but it is unknown whether these variants are also associated with gout risk in Chinese populations. A total of 145 patients with gout and 310 healthy control patients were recruited for a case-control association study. Twelve SNPs of CLNK and ZNF518B gene were genotyped, and association analysis was performed. Odds ratios (ORs) with 95 % confidence intervals (CIs) were used to assess the association. Overall, we found four risk alleles for gout in patients: the allele "G" of rs2041215 and rs1686947 in the CLNK gene by dominant model (OR 1.66; 95 % CI 1.04-2.63; p = 0.031) (OR 2.19; 95 % CI 1.38-3.46; p = 0.001) and additive model (OR 1.39; 95 % CI 1.00-1.93; p = 0.049) (OR 1.67; 95 % CI 1.19-2.32; p = 0.003), respectively, and the allele "A" of rs10938799 and rs10016022 in ZNF518B gene by recessive model (OR 4.66; 95 % CI 1.44-15.09; p = 0.008) (OR 4.54; 95 % CI 1.23-16.76; p = 0.020). Further haplotype analysis showed that the TCATTCTGA haplotype of CLNK was more frequent among patients with gout (adjusted OR 0.48; 95 % CI 0.24-0.95; p = 0.036). Additionally, polymorphisms of rs2041215, rs10938799, and rs17467273 were also correlated with clinical pathological parameters. This study provides evidence for gout susceptibility genes, CLNK and ZNF518B, in a Chinese population, which may have potential as diagnostic and prognostic marker for gout patients.
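
    The odds ratios and 95 % confidence intervals reported above are the standard summary for a case-control 2 × 2 table. The following minimal sketch computes an odds ratio with a Wald-type interval on the log scale. The counts are invented for illustration and are not taken from the study, and the abstract does not state which interval method or covariate adjustment the authors used.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval for a 2x2 table.

    a: cases carrying the allele      b: cases not carrying it
    c: controls carrying the allele   d: controls not carrying it
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not data from the study.
or_value, ci_low, ci_high = odds_ratio_ci(40, 60, 30, 90)
print(f"OR {or_value:.2f}; 95% CI {ci_low:.2f}-{ci_high:.2f}")
```

    An interval that excludes 1 corresponds to a nominally significant association at the 5 % level under this approximation.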

  20. Effect of Morinda citrifolia (Noni)-Enriched Diet on Hepatic Heat Shock Protein and Lipid Metabolism-Related Genes in Heat Stressed Broiler Chickens

    Directory of Open Access Journals (Sweden)

    Joshua Flees

    2017-11-01

    Full Text Available Heat stress (HS) has been reported to alter fat deposition in broilers, however the underlying molecular mechanisms are not well-defined. The objectives of the current study were, therefore: (1) to determine the effects of acute (2 h) and chronic (3 weeks) HS on the expression of key molecular signatures involved in hepatic lipogenic and lipolytic programs, and (2) to assess if diet supplementation with dried Noni medicinal plant (0.2% of the diet) modulates these effects. Broilers (480 males, 1 d) were randomly assigned to 12 environmental chambers, subjected to two environmental conditions (heat stress, HS, 35°C vs. thermoneutral condition, TN, 24°C) and fed two diets (control vs. Noni) in a 2 × 2 factorial design. Feed intake and body weights were recorded, and blood and liver samples were collected at 2 h and 3 weeks post-heat exposure. HS depressed feed intake, reduced body weight, and up regulated the hepatic expression of heat shock protein HSP60, HSP70, HSP90 as well as key lipogenic proteins (fatty acid synthase, FASN; acetyl co-A carboxylase alpha, ACCα and ATP citrate lyase, ACLY). HS down regulated the hepatic expression of lipoprotein lipase (LPL) and hepatic triacylglycerol lipase (LIPC), but up-regulated ATGL. Although it did not affect growth performance, Noni supplementation regulated the hepatic expression of lipogenic proteins in a time- and gene-specific manner. Prior to HS, Noni increased ACLY and FASN in the acute and chronic experimental conditions, respectively. During acute HS, Noni increased ACCα, but reduced FASN and ACLY expression. Under chronic HS, Noni up regulated ACCα and FASN but it down regulated ACLY. In vitro studies, using chicken hepatocyte cell lines, showed that HS down-regulated the expression of ACCα, FASN, and ACLY. Treatment with quercetin, one bioactive ingredient in Noni, up-regulated the expression of ACCα, FASN, and ACLY under TN conditions, but it appeared to down-regulate ACCα and increase ACLY

  1. Effect of Morinda citrifolia (Noni)-Enriched Diet on Hepatic Heat Shock Protein and Lipid Metabolism-Related Genes in Heat Stressed Broiler Chickens.

    Science.gov (United States)

    Flees, Joshua; Rajaei-Sharifabadi, Hossein; Greene, Elizabeth; Beer, Lesleigh; Hargis, Billy M; Ellestad, Laura; Porter, Tom; Donoghue, Annie; Bottje, Walter G; Dridi, Sami

    2017-01-01

    Heat stress (HS) has been reported to alter fat deposition in broilers, however the underlying molecular mechanisms are not well-defined. The objectives of the current study were, therefore: (1) to determine the effects of acute (2 h) and chronic (3 weeks) HS on the expression of key molecular signatures involved in hepatic lipogenic and lipolytic programs, and (2) to assess if diet supplementation with dried Noni medicinal plant (0.2% of the diet) modulates these effects. Broilers (480 males, 1 d) were randomly assigned to 12 environmental chambers, subjected to two environmental conditions (heat stress, HS, 35°C vs. thermoneutral condition, TN, 24°C) and fed two diets (control vs. Noni) in a 2 × 2 factorial design. Feed intake and body weights were recorded, and blood and liver samples were collected at 2 h and 3 weeks post-heat exposure. HS depressed feed intake, reduced body weight, and up regulated the hepatic expression of heat shock protein HSP60, HSP70, HSP90 as well as key lipogenic proteins (fatty acid synthase, FASN; acetyl co-A carboxylase alpha, ACCα and ATP citrate lyase, ACLY). HS down regulated the hepatic expression of lipoprotein lipase (LPL) and hepatic triacylglycerol lipase (LIPC), but up-regulated ATGL. Although it did not affect growth performance, Noni supplementation regulated the hepatic expression of lipogenic proteins in a time- and gene-specific manner. Prior to HS, Noni increased ACLY and FASN in the acute and chronic experimental conditions, respectively. During acute HS, Noni increased ACCα, but reduced FASN and ACLY expression. Under chronic HS, Noni up regulated ACCα and FASN but it down regulated ACLY. In vitro studies, using chicken hepatocyte cell lines, showed that HS down-regulated the expression of ACCα, FASN, and ACLY. Treatment with quercetin, one bioactive ingredient in Noni, up-regulated the expression of ACCα, FASN, and ACLY under TN conditions, but it appeared to down-regulate ACCα and increase ACLY levels

  2. Genetic analysis and fine mapping of LH1 and LH2, a set of complementary genes controlling late heading in rice (Oryza sativa L.).

    Science.gov (United States)

    Liu, Shuang; Wang, Feng; Gao, Li Jun; Li, Jin Hua; Li, Rong Bai; Gao, Han Liang; Deng, Guo Fu; Yang, Jin Shui; Luo, Xiao Jin

    2012-12-01

    Heading date in rice (Oryza sativa L.) is a critical agronomic trait with a complex inheritance. To investigate the genetic basis and mechanism of gene interaction in heading date, we conducted genetic analysis on segregation populations derived from crosses among the indica cultivars Bo B, Yuefeng B and Baoxuan 2. A set of dominant complementary genes controlling late heading, designated LH1 and LH2, were detected by molecular marker mapping. Genetic analysis revealed that Baoxuan 2 contains both dominant genes, while Bo B and Yuefeng B each possess either LH1 or LH2. Using larger populations with segregant ratios of 3 : 1, we fine-mapped LH1 to a 63-kb region near the centromere of chromosome 7 flanked by markers RM5436 and RM8034, and LH2 to a 177-kb region on the short arm of chromosome 8 between flanking markers Indel22468-3 and RM25. Some candidate genes were identified through sequencing of Bo B and Yuefeng B in these target regions. Our work provides a solid foundation for further study on gene interaction in heading date and has application in marker-assisted breeding of photosensitive hybrid rice in China.
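
    Whether a segregating population fits a 3 : 1 ratio, as mentioned above, is commonly checked with a chi-square goodness-of-fit test. The sketch below computes the test statistic for an arbitrary expected ratio. The plant counts are hypothetical, and the abstract does not say which test the authors applied.

```python
def chi_square_for_ratio(observed, expected_ratio):
    """Chi-square goodness-of-fit statistic for counts against a ratio.

    observed:       counts per phenotype class, e.g. (late, early)
    expected_ratio: expected proportions in any scale, e.g. (3, 1)
    """
    if len(observed) != len(expected_ratio):
        raise ValueError("observed and expected_ratio must have equal length")
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical population: 305 late-heading and 95 early-heading plants.
statistic = chi_square_for_ratio((305, 95), (3, 1))

# 3.841 is the 5% critical value of chi-square with 1 degree of freedom.
fits_3_to_1 = statistic < 3.841
print(round(statistic, 3), fits_3_to_1)
```

    The same function can be called with (9, 7), the classical F2 expectation for two dominant complementary genes, using the same critical value because there are still two phenotype classes.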

  3. Gene

    Data.gov (United States)

    U.S. Department of Health & Human Services — Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes,...

  4. BiNChE: a web tool and library for chemical enrichment analysis based on the ChEBI ontology.

    Science.gov (United States)

    Moreno, Pablo; Beisken, Stephan; Harsha, Bhavana; Muthukrishnan, Venkatesh; Tudose, Ilinca; Dekker, Adriano; Dornfeldt, Stefanie; Taruttis, Franziska; Grosse, Ivo; Hastings, Janna; Neumann, Steffen; Steinbeck, Christoph

    2015-02-21

    Ontology-based enrichment analysis aids in the interpretation and understanding of large-scale biological data. Ontologies are hierarchies of biologically relevant groupings. Using ontology annotations, which link ontology classes to biological entities, enrichment analysis methods assess whether there is a significant over or under representation of entities for ontology classes. While many tools exist that run enrichment analysis for protein sets annotated with the Gene Ontology, there are only a few that can be used for small molecules enrichment analysis. We describe BiNChE, an enrichment analysis tool for small molecules based on the ChEBI Ontology. BiNChE displays an interactive graph that can be exported as a high-resolution image or in network formats. The tool provides plain, weighted and fragment analysis based on either the ChEBI Role Ontology or the ChEBI Structural Ontology. BiNChE aids in the exploration of large sets of small molecules produced within Metabolomics or other Systems Biology research contexts. The open-source tool provides easy and highly interactive web access to enrichment analysis with the ChEBI ontology tool and is additionally available as a standalone library.
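
    Over-representation of entities in an ontology class, as described above, is often assessed with a one-sided hypergeometric test. The sketch below is a generic illustration of that test using only the Python standard library. Whether BiNChE's plain analysis uses exactly this test is an assumption not confirmed by the abstract, and the counts in the example are invented.

```python
from math import comb


def hypergeom_upper_tail(k, K, n, N):
    """P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: entities in the background set
    K: background entities annotated to the ontology class
    n: entities in the study set
    k: study-set entities annotated to the class
    """
    if not (0 <= K <= N and 0 <= n <= N):
        raise ValueError("require 0 <= K <= N and 0 <= n <= N")
    start = max(k, 0)
    stop = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(start, stop + 1))
    return favourable / comb(N, n)


# Hypothetical example: 10 background molecules, 3 annotated to a class,
# 4 molecules in the study set, all 3 annotated ones among them.
p_value = hypergeom_upper_tail(3, 3, 4, 10)
print(p_value)
```

    The lower tail, P(X <= k), would test for under-representation in the same way. With many ontology classes tested at once, the resulting p-values would also need a multiple-testing correction.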

  5. Developments in uranium enrichment

    International Nuclear Information System (INIS)

    Mohrhauer, H.

    1995-01-01

    The enrichment services market is still characterized by overcapacities. While consumption worldwide will rise by some 15% to 39,000 t SWU/a over the next ten years, capacities amount to nearly 50,000 t SWU/a. The price for enrichment services probably has reached its all time low. Prices below U.S. $ 100/kg SWU are not likely to cover costs even of the economically most advanced enrichment processes. Urenco has prepared for the difficult enrichment business in the years to come by streamlining and cost cutting measures. The company intends to hold and increase its share of more than 10% in the world market. The uranium enrichment plant of Gronau will be expanded further. Expansion beyond 1000 t is subject to another permit being granted under the Atomic Energy Act, an application for which was filed in December 1994. Centrifuge technology is the superior enrichment technology, i.e., there is still considerable potential for further development. Construction of enrichment plants employing the centrifuge technology in the United States and in France is being pursued in various phases, from feasibility studies to licensing procedures. Before these plants could be implemented, however, considerable problems of organization would have to be solved, and the market would have to change greatly, respectively. The laser process, at the present time, does not seem to be able to develop into a major industrial competitor. (orig.)

  6. Poster: Observing change in crowded data sets in 3D space - Visualizing gene expression in human tissues

    KAUST Repository

    Rogowski, Marcin

    2013-03-01

    We have been confronted with a real-world problem of visualizing and observing change of gene expression between different human tissues. In this paper, we are presenting a universal representation space based on two-dimensional gel electrophoresis as opposed to force-directed layouts encountered most often in similar problems. We are discussing the methods we devised to make observing change more convenient in a 3D virtual reality environment. © 2013 IEEE.

  7. TRIGA low enrichment fuel

    International Nuclear Information System (INIS)

    Gietzen, A.

    1993-01-01

    Sixty TRIGA reactors have been sold and the earliest of these are now passing twenty years of operation. All of these reactors use the uranium zirconium hydride fuel (UZrH) which provides certain unique advantages arising out of its large prompt negative temperature coefficient, very low fission product release, and high temperature capability. Eleven of these sixty reactors are conversions from plate fuel to TRIGA fuel which were made as a result of these advantages. With only a few exceptions, TRIGA reactors have always used low-enriched uranium (LEU) fuel with an enrichment of 19.9%. The exceptions have either been converted from the standard low-enriched fuel to the 70% enriched FLIP fuel in order to achieve extended lifetime, or are higher powered reactors which were designed for long life using 93%-enriched uranium during the time when the use and export of highly enriched uranium (HEU) was not restricted. The advent of international policies focusing attention on nonproliferation and safeguards made the HEU fuels obsolete. General Atomic immediately undertook a development effort (nearly two years ago) in order to be in a position to comply with these policies for all future export sales and also to provide a low-enriched alternative to fully enriched plate-type fuels. This important work was subsequently partially supported by the U.S. Department of Energy. The laboratory and production tests have shown that higher uranium densities can be achieved to compensate for reducing the enrichment to 20%, and that the fuels maintain the characteristics of the very thoroughly proven standard TRIGA fuels. In May of 1978, General Atomic announced that these fuels were available for TRIGA reactors and for plate-type reactors with power levels up to 15 MW with General Atomic's standard commercial warranty

  9. Cosmetic applications of glucitol-core containing gallotannins from a proprietary phenolic-enriched red maple (Acer rubrum) leaves extract: inhibition of melanogenesis via down-regulation of tyrosinase and melanogenic gene expression in B16F10 melanoma cells.

    Science.gov (United States)

    Ma, Hang; Xu, Jialin; DaSilva, Nicholas A; Wang, Ling; Wei, Zhengxi; Guo, Liangran; Johnson, Shelby L; Lu, Wei; Xu, Jun; Gu, Qiong; Seeram, Navindra P

    2017-05-01

    The red maple (Acer rubrum) is a rich source of phenolic compounds which possess galloyl groups attached to different positions of a 1,5-anhydro-D-glucitol core. While these glucitol-core containing gallotannins (GCGs) have reported anti-oxidant and anti-glycative effects, they have not yet been evaluated for their cosmetic applications. Herein, the anti-tyrosinase and anti-melanogenic effects of a proprietary phenolic-enriched red maple leaves extract [Maplifa™; contains ca. 45% ginnalin A (GA) along with other GCGs] were investigated using enzyme and cellular assays. The GCGs showed anti-tyrosinase activity with IC50 values ranging from 101.4 to 1047.3 μM and their mechanism of tyrosinase inhibition (using GA as a representative GCG) was evaluated by chelating and computational/modeling studies. GA reduced melanin content in murine melanoma B16F10 cells by 79.1 and 56.7% (at non-toxic concentrations of 25 and 50 μM, respectively), and its mechanisms of anti-melanogenic effects were evaluated by using methods including fluorescent probe (DCF-DA), real-time PCR, and western blot experiments. These data indicated that GA was able to: (1) reduce the levels of reactive oxygen species, (2) down-regulate the expression of MITF, TYR, TRP-1, and TRP-2 gene levels in a time-dependent manner, and (3) significantly reduce protein expression of the TRP-2 gene. Therefore, the anti-melanogenic effects of red maple GCGs warrant further investigation of this proprietary natural product extract for potential cosmetic applications.

  10. Independent evolution of the core and accessory gene sets in the genus Neisseria: insights gained from the genome of Neisseria lactamica isolate 020-06

    Directory of Open Access Journals (Sweden)

    White Brian

    2010-11-01

    Full Text Available Abstract Background The genus Neisseria contains two important yet very different pathogens, N. meningitidis and N. gonorrhoeae, in addition to non-pathogenic species, of which N. lactamica is the best characterized. Genomic comparisons of these three bacteria will provide insights into the mechanisms and evolution of pathogenesis in this group of organisms, which are applicable to understanding these processes more generally. Results Non-pathogenic N. lactamica exhibits very similar population structure and levels of diversity to the meningococcus, whilst gonococci are essentially recent descendents of a single clone. All three species share a common core gene set estimated to comprise around 1190 CDSs, corresponding to about 60% of the genome. However, some of the nucleotide sequence diversity within this core genome is particular to each group, indicating that cross-species recombination is rare in this shared core gene set. Other than the meningococcal cps region, which encodes the polysaccharide capsule, relatively few members of the large accessory gene pool are exclusive to one species group, and cross-species recombination within this accessory genome is frequent. Conclusion The three Neisseria species groups represent coherent biological and genetic groupings which appear to be maintained by low rates of inter-species horizontal genetic exchange within the core genome. There is extensive evidence for exchange among positively selected genes and the accessory genome and some evidence of hitch-hiking of housekeeping genes with other loci. It is not possible to define a 'pathogenome' for this group of organisms and the disease causing phenotypes are therefore likely to be complex, polygenic, and different among the various disease-associated phenotypes observed.
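
    The distinction drawn above between a shared core gene set, an accessory gene pool and species-exclusive genes can be expressed with plain set operations. The sketch below assumes that orthologous genes have already been assigned common identifiers, which is the hard part in practice and is not shown. The gene identifiers are placeholders and do not reflect the actual Neisseria gene content.

```python
def partition_gene_sets(genomes):
    """Split gene content into core, accessory and genome-exclusive genes.

    genomes: mapping of genome name -> iterable of ortholog identifiers.
    Returns (core, accessory, exclusive), where exclusive maps each genome
    name to the genes found in that genome only.
    """
    if not genomes:
        raise ValueError("at least one genome is required")
    sets = {name: set(genes) for name, genes in genomes.items()}
    core = set.intersection(*sets.values())      # present in every genome
    pan = set.union(*sets.values())              # present in any genome
    accessory = pan - core
    exclusive = {}
    for name, genes in sets.items():
        others = set().union(*(g for other, g in sets.items() if other != name))
        exclusive[name] = genes - others
    return core, accessory, exclusive


# Placeholder identifiers for illustration only.
genomes = {
    "N. meningitidis": ["g1", "g2", "g3", "cps"],
    "N. gonorrhoeae": ["g1", "g2", "g4"],
    "N. lactamica": ["g1", "g2", "g3", "g4", "g5"],
}
core, accessory, exclusive = partition_gene_sets(genomes)
print(sorted(core), sorted(accessory), exclusive)
```

    Dividing the size of the core set by the gene count of one genome gives the kind of core fraction quoted in the abstract, where the core is estimated at about 60% of the genome.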

  11. UMD-USHbases: a comprehensive set of databases to record and analyse pathogenic mutations and unclassified variants in seven Usher syndrome causing genes.

    Science.gov (United States)

    Baux, David; Faugère, Valérie; Larrieu, Lise; Le Guédard-Méreuze, Sandie; Hamroun, Dalil; Béroud, Christophe; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

    2008-08-01

    Using the Universal Mutation Database (UMD) software, we have constructed "UMD-USHbases", a set of relational databases of nucleotide variations for seven genes involved in Usher syndrome (MYO7A, CDH23, PCDH15, USH1C, USH1G, USH3A and USH2A). Mutations in the Usher syndrome type I causing genes are also recorded in non-syndromic hearing loss cases and mutations in USH2A in non-syndromic retinitis pigmentosa. Usher syndrome provides a particular challenge for molecular diagnostics because of the clinical and molecular heterogeneity. As many mutations are missense changes, and all the genes also contain apparently non-pathogenic polymorphisms, well-curated databases are crucial for accurate interpretation of pathogenicity. Tools are provided to assess the pathogenicity of mutations, including conservation of amino acids and analysis of splice-sites. Reference amino acid alignments are provided. Apparently non-pathogenic variants in patients with Usher syndrome, at both the nucleotide and amino acid level, are included. The UMD-USHbases currently contain more than 2,830 entries including disease causing mutations, unclassified variants or non-pathogenic polymorphisms identified in over 938 patients. In addition to data collected from 89 publications, 15 novel mutations identified in our laboratory are recorded in MYO7A (6), CDH23 (8), or PCDH15 (1) genes. Information is given on the relative involvement of the seven genes, the number and distribution of variants in each gene. UMD-USHbases give access to a software package that provides specific routines and optimized multicriteria research and sorting tools. These databases should assist clinicians and geneticists seeking information about mutations responsible for Usher syndrome.

  12. Enriched pathways for major depressive disorder identified from a genome-wide association study.

    Science.gov (United States)

    Kao, Chung-Feng; Jia, Peilin; Zhao, Zhongming; Kuo, Po-Hsiu

    2012-11-01

    Major depressive disorder (MDD) has caused a substantial burden of disease worldwide with moderate heritability. Despite efforts through conducting numerous association studies and now, genome-wide association (GWA) studies, the success of identifying susceptibility loci for MDD has been limited, which is partially attributed to the complex nature of depression pathogenesis. A pathway-based analytic strategy to investigate the joint effects of various genes within specific biological pathways has emerged as a powerful tool for complex traits. The present study aimed to identify enriched pathways for depression using a GWA dataset for MDD. For each gene, we estimated its gene-wise p value using combined and minimum p value, separately. Canonical pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) and BioCarta were used. We employed four pathway-based analytic approaches (gene set enrichment analysis, hypergeometric test, sum-square statistic, sum-statistic). We adjusted for multiple testing using Benjamini & Hochberg's method to report significant pathways. We found 17 significantly enriched pathways for depression, which presented low-to-intermediate crosstalk. The top four pathways were long-term depression (p ≤ 1×10^-5), calcium signalling (p ≤ 6×10^-5), arrhythmogenic right ventricular cardiomyopathy (p ≤ 1.6×10^-4) and cell adhesion molecules (p ≤ 2.2×10^-4). In conclusion, our comprehensive pathway analyses identified promising pathways for depression that are related to neurotransmitter and neuronal systems, immune system and inflammatory response, which may be involved in the pathophysiological mechanisms underlying depression. We demonstrated that pathway enrichment analysis is promising to facilitate our understanding of complex traits through a deeper interpretation of GWA data. Application of this comprehensive analytic strategy in upcoming GWA data for depression could validate the findings reported in this study.
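
    Illustrative sketch (not taken from the study): two of the steps named in this abstract, the hypergeometric over-representation test and the Benjamini-Hochberg adjustment, can be written with the Python standard library alone. The function names and the toy numbers are assumptions made for illustration; they do not reproduce the authors' pipeline or data.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """Upper-tail probability P(X >= k) for a hypergeometric draw.

    N: genes in the background, K: genes in the pathway,
    n: genes flagged as significant, k: flagged genes that fall in the pathway.
    """
    upper = min(K, n)
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each adjusted value is
    # the minimum of p * m / rank over all ranks at or above its own.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvals[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Toy example (hypothetical counts): three pathways tested against
# a background of 1000 genes, 50 of which were flagged as significant.
toy_pathways = {"pathway_A": (40, 8), "pathway_B": (60, 4), "pathway_C": (25, 1)}
raw = [hypergeom_sf(k, 1000, K, 50) for K, k in toy_pathways.values()]
adj = benjamini_hochberg(raw)
for name, p, q in zip(toy_pathways, raw, adj):
    print(f"{name}: p={p:.3g}, BH-adjusted={q:.3g}")
```

    Exact integer binomials keep the sketch dependency-free; for genome-scale backgrounds a statistics library would normally be used instead.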

  13. Schizophrenia and vitamin D related genes could have been subject to latitude-driven adaptation

    Directory of Open Access Journals (Sweden)

    Monticelli Antonella

    2010-11-01

    Background: Many natural phenomena are directly or indirectly related to latitude. Living at different latitudes, indeed, has its consequences with being exposed to different climates, diets, light/dark cycles, etc. In humans, one of the best known examples of genetic traits following a latitudinal gradient is skin pigmentation. Nevertheless, also several diseases show latitudinal clines such as hypertension, cancer, dysmetabolic conditions, schizophrenia, Parkinson's disease and many more. Results: We investigated, for the first time on a wide genomic scale, the latitude-driven adaptation phenomena. In particular, we selected a set of genes showing signs of latitude-dependent population differentiation. The biological characterization of these genes showed enrichment for neural-related processes. In light of this, we investigated whether genes associated to neuropsychiatric diseases were enriched by Latitude-Related Genes (LRGs). We found a strong enrichment of LRGs in the set of genes associated to schizophrenia. In an attempt to try to explain this possible link between latitude and schizophrenia, we investigated their associations with vitamin D. We found in a set of vitamin D related genes a significant enrichment of both LRGs and of genes involved in schizophrenia. Conclusions: Our results suggest a latitude-driven adaptation for both schizophrenia and vitamin D related genes. In addition we confirm, at a molecular level, the link between schizophrenia and vitamin D. Finally, we discuss a model in which schizophrenia is, at least partly, a maladaptive by-product of latitude dependent adaptive changes in vitamin D metabolism.

  14. Schizophrenia and vitamin D related genes could have been subject to latitude-driven adaptation.

    Science.gov (United States)

    Amato, Roberto; Pinelli, Michele; Monticelli, Antonella; Miele, Gennaro; Cocozza, Sergio

    2010-11-11

    Many natural phenomena are directly or indirectly related to latitude. Living at different latitudes, indeed, has its consequences with being exposed to different climates, diets, light/dark cycles, etc. In humans, one of the best known examples of genetic traits following a latitudinal gradient is skin pigmentation. Nevertheless, also several diseases show latitudinal clines such as hypertension, cancer, dysmetabolic conditions, schizophrenia, Parkinson's disease and many more. We investigated, for the first time on a wide genomic scale, the latitude-driven adaptation phenomena. In particular, we selected a set of genes showing signs of latitude-dependent population differentiation. The biological characterization of these genes showed enrichment for neural-related processes. In light of this, we investigated whether genes associated to neuropsychiatric diseases were enriched by Latitude-Related Genes (LRGs). We found a strong enrichment of LRGs in the set of genes associated to schizophrenia. In an attempt to try to explain this possible link between latitude and schizophrenia, we investigated their associations with vitamin D. We found in a set of vitamin D related genes a significant enrichment of both LRGs and of genes involved in schizophrenia. Our results suggest a latitude-driven adaptation for both schizophrenia and vitamin D related genes. In addition we confirm, at a molecular level, the link between schizophrenia and vitamin D. Finally, we discuss a model in which schizophrenia is, at least partly, a maladaptive by-product of latitude dependent adaptive changes in vitamin D metabolism.

  15. Histone H4 Lys 20 methyltransferase SET8 promotes androgen receptor-mediated transcription activation in prostate cancer

    Energy Technology Data Exchange (ETDEWEB)

    Yao, Lushuai [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); University of Chinese Academy of Sciences, Beijing 100049 (China)]; Li, Yanyan; Du, Fengxia [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China)]; Han, Xiao [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China); University of Chinese Academy of Sciences, Beijing 100049 (China)]; Li, Xiaohua [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China)]; Niu, Yuanjie [Chawnshang Chang Sex Hormone Research Center, Tianjin Institute of Urology, Tianjin Medical University, Tianjin 300070 (China)]; Ren, Shancheng, E-mail: renshancheng@gmail.com [Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai 200433 (China)]; Sun, Yingli, E-mail: sunyl@big.ac.cn [Laboratory of Genome Variations and Precision Bio-Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101 (China)]

    2014-07-18

    Highlights: • Dihydrotestosterone stimulates H4K20me1 enrichment at the PSA promoter. • SET8 promotes AR-mediated transcription activation. • SET8 interacts with AR and promotes cell proliferation. - Abstract: Histone methylation status in different lysine residues has an important role in transcription regulation. The effect of H4K20 monomethylation (H4K20me1) on androgen receptor (AR)-mediated gene transcription remains unclear. Here we show that AR agonist stimulates the enrichment of H4K20me1 and SET8 at the promoter of AR target gene PSA in an AR dependent manner. Furthermore, SET8 is crucial for the transcription activation of PSA. Co-immunoprecipitation analyses demonstrate that SET8 interacts with AR. Therefore, we conclude that SET8 is involved in AR-mediated transcription activation, possibly through its interaction with AR and H4K20me1 modification.

  16. Alpha-gliadin genes from the A, B, and D genomes of wheat contain different sets of celiac disease epitopes

    Directory of Open Access Journals (Sweden)

    van Veelen Peter A

    2006-01-01

    Background: Bread wheat (Triticum aestivum) is an important staple food. However, wheat gluten proteins cause celiac disease (CD) in 0.5 to 1% of the general population. Among these proteins, the α-gliadins contain several peptides that are associated to the disease. Results: We obtained 230 distinct α-gliadin gene sequences from several diploid wheat species representing the ancestral A, B, and D genomes of the hexaploid bread wheat. The large majority of these sequences (87%) contained an internal stop codon. All α-gliadin sequences could be distinguished according to the genome of origin on the basis of sequence similarity, of the average length of the polyglutamine repeats, and of the differences in the presence of four peptides that have been identified as T cell stimulatory epitopes in CD patients through binding to HLA-DQ2/8. By sequence similarity, α-gliadins from the public database of hexaploid T. aestivum could be assigned directly to chromosome 6A, 6B, or 6D. T. monococcum (A genome) sequences, as well as those from chromosome 6A of bread wheat, almost invariably contained epitope glia-α9 and glia-α20, but never the intact epitopes glia-α and glia-α2. A number of sequences from T. speltoides, as well as a number of sequences from chromosome 6B of bread wheat, did not contain any of the four T cell epitopes screened for. The sequences from T. tauschii (D genome), as well as those from chromosome 6D of bread wheat, were found to contain all of these T cell epitopes in variable combinations per gene. The differences in epitope composition resulted mainly from point mutations. These substitutions appeared to be genome specific. Conclusion: Our analysis shows that α-gliadin sequences from the three genomes of bread wheat form distinct groups. The four known T cell stimulatory epitopes are distributed non-randomly across the sequences, indicating that the three genomes contribute differently to epitope content. A systematic

  17. The competitive enrichment market

    International Nuclear Information System (INIS)

    Parks, J.W.; Huffman, F.C.

    1984-01-01

    With the enactment of the "Private Ownership of Special Nuclear Materials Act" in 1964, the U.S. Government made provisions to enter into the uranium enrichment services business. Since nuclear power was in its infancy and the Government was promoting its growth as well as trying to help U.S. industry sell reactors overseas, the initial contracts (Requirements Contracts) for enrichment services placed most of the risks associated with the supplying of the services on the Government. Projections of nuclear power additions continued to grow and in 1972 the Atomic Energy Commission (AEC) stopped contracting under Requirements Contracts in order to study which mode of contracting best suited the commercial development of the industry. In mid-1973, the AEC introduced the Long-Term Fixed Commitment (LTFC) contract which shifted the risk to the customer. By mid-1974, AEC had contracts which completely used the enrichment capacity of its complex and refused to accept requests for additional contracts. This action further convinced European nations that they should continue to develop their own enrichment capacity and resulted in the EURODIF and URENCO projects. Before this time the U.S. supplied 100% of the world market for enriching services.

  18. Enrichment: Dealing with overcapacity

    International Nuclear Information System (INIS)

    Peterson, C.H.

    1989-01-01

    Today's surplus of enrichment capacity will continue until at least the end of this century. This will challenge the ingenuity of the separative work unit (SWU) suppliers as they attempt to keep market share and remain profitable in a very competitive marketplace. The utilities will be faced with attractive choices, but making the best choice will require careful analysis and increased attention to market factors. Current demand projections will probably prove too high to the extent that more reactors are canceled or delayed. The DOE has the vast majority of the unused capacity, so it will feel the most immediate impact of this large surplus in productive capacity. The DOE has responded to these market challenges by planning another reorganization of its enriching operations. Without a major agreement among the governments affected by the current surplus in enrichment capacity, the future will see lower prices, more competitive terms, and the gradual substitution of centrifuge or laser enrichment for the gaseous diffusion plants. The competition that is forcing the gaseous diffusion prices down to marginal cost will provide the long-term price basis for the enrichment industry
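
    Illustrative sketch (not part of the record): the separative work unit (SWU) mentioned in this abstract is conventionally computed from the standard value function V(x) = (2x - 1) ln(x / (1 - x)) together with a two-stream mass balance. The function names, the parameter names (xp, xf, xw for product, feed and tails assays) and the example assays below are assumptions chosen for illustration only.

```python
from math import log


def value_function(x):
    """Separative potential V(x) = (2x - 1) * ln(x / (1 - x)) for an assay 0 < x < 1."""
    return (2.0 * x - 1.0) * log(x / (1.0 - x))


def separative_work(product_kg, xp, xf, xw):
    """Return (feed_kg, tails_kg, swu) for an ideal enrichment mass balance.

    xp, xf, xw are the U-235 weight fractions of product, feed and tails.
    Mass balance: F = P + W and F*xf = P*xp + W*xw.
    """
    if not (0.0 < xw < xf <= xp < 1.0):
        raise ValueError("expected 0 < xw < xf <= xp < 1")
    feed_kg = product_kg * (xp - xw) / (xf - xw)
    tails_kg = feed_kg - product_kg
    swu = (product_kg * value_function(xp)
           + tails_kg * value_function(xw)
           - feed_kg * value_function(xf))
    return feed_kg, tails_kg, swu


# Hypothetical example: 1 kg of product at 3.6% from natural feed (0.711%)
# with tails at 0.3%. The assays are illustrative, not taken from the abstract.
feed, tails, swu = separative_work(1.0, 0.036, 0.00711, 0.003)
print(f"feed={feed:.2f} kg, tails={tails:.2f} kg, separative work={swu:.2f} SWU")
```

    Lowering the tails assay in this sketch reduces the feed required but raises the separative work, which is the trade-off between uranium supply and enrichment capacity that such market analyses turn on.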

  19. Laser and gas centrifuge enrichment

    Energy Technology Data Exchange (ETDEWEB)

    Heinonen, Olli [Senior Fellow, Belfer Center for Science and International Affairs, Harvard Kennedy School, Cambridge, Massachusetts (United States)

    2014-05-09

    Principles of uranium isotope enrichment using various laser and gas centrifuge techniques are briefly discussed. Examples on production of high enriched uranium are given. Concerns regarding the possibility of using low end technologies to produce weapons grade uranium are explained. Based on current assessments commercial enrichment services are able to cover the global needs of enriched uranium in the foreseeable future.

  20. Oxygen enrichment incineration

    International Nuclear Information System (INIS)

    Kim, Jeong Guk; Yang, Hee Chul; Park, Geun Il; Kim, Joon Hyung

    2000-10-01

    Oxygen enriched combustion technology has recently been used in waste incineration. To apply the oxygen enrichment on alpha-bearing waste incineration, which is being developed, a state-of-the-art review has been performed. The use of oxygen or oxygen-enriched air instead of air in incineration would result in increase of combustion efficiency and capacity, and reduction of off-gas product. Especially, the off-gas could be reduced below a quarter, which might reduce off-gas treatment facilities, and also increase an efficiency of off-gas treatment. However, the use of oxygen might also lead to local overheating and high nitrogen oxides (NOx) formation. To overcome these problems, an application of low NOx oxy-fuel burner and recycling of a part of off-gas to combustion chamber have been suggested.

  1. Oxygen enrichment incineration

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Jeong Guk; Yang, Hee Chul; Park, Geun Il; Kim, Joon Hyung

    2000-10-01

    Oxygen enriched combustion technology has recently been used in waste incineration. To apply the oxygen enrichment on alpha-bearing waste incineration, which is being developed, a state-of-the-art review has been performed. The use of oxygen or oxygen-enriched air instead of air in incineration would result in increase of combustion efficiency and capacity, and reduction of off-gas product. Especially, the off-gas could be reduced below a quarter, which might reduce off-gas treatment facilities, and also increase an efficiency of off-gas treatment. However, the use of oxygen might also lead to local overheating and high nitrogen oxides (NOx) formation. To overcome these problems, an application of low NOx oxy-fuel burner and recycling of a part of off-gas to combustion chamber have been suggested.

  2. Genetic variation in a microRNA-502 minding site in SET8 gene confers clinical outcome of non-small cell lung cancer in a Chinese population.

    Directory of Open Access Journals (Sweden)

    Jiali Xu

    BACKGROUND: Genetic variants may influence microRNA-target interaction by modulating their binding affinity, creating or destroying miRNA-binding sites. SET8, a member of the SET domain-containing methyltransferase family, has been implicated in a wide array of biological processes. METHODS: Using Taqman assay, we genotyped a polymorphism rs16917496 T>C within the miR-502 binding site in the 3'-untranslated region of the SET8 gene in 576 non-small cell lung cancer (NSCLC) patients. Functions of rs16917496 were investigated using luciferase activity assay and validated by immunostaining. RESULTS: Log-rank test and Cox regression indicated that the CC genotype was associated with a longer survival and a reduced risk of death for NSCLC [58.0 vs. 41.0 months, P = 0.031; hazard ratio = 0.44, 95% confidence interval: 0.26-0.74]. Further stepwise regression analysis suggested rs16917496 was an independently favorable factor for prognosis and the protective effect was more prominent in never smokers, patients without diabetes and patients who received chemotherapy. A significant interaction was observed between rs16917496 and smoking status in relation to NSCLC survival (PC located at miR-502 binding site contributes to NSCLC survival by altering SET8 expression through modulating miRNA-target interaction.

  3. Centrifuge enrichment program

    International Nuclear Information System (INIS)

    Astley, E.R.

    1976-01-01

    Exxon Nuclear has been active in privately funded research and development of centrifuge enrichment technology since 1972. In October of 1975, Exxon Nuclear submitted a proposal to design, construct, and operate a 3000-MT SWU/yr centrifuge enrichment plant, under the provisions of the proposed Nuclear Fuel Assurance Act of 1975. The U.S. Energy Research and Development Administration (ERDA) accepted the proposal as a basis for negotiation. It was proposed to build a 1000-MT SWU/yr demonstration increment to be operational in 1982; and after successful operation for about one year, expand the facilities into a 3000-MT SWU/yr plant. As part of the overall centrifuge enrichment plant, a dedicated centrifuge manufacturing plant would be constructed; sized to support the full 3000-MT SWU/yr plant. The selection of the centrifuge process by Exxon Nuclear was based on an extremely thorough evaluation of current and projected enrichment technology; results show that the technology is mature and the process will be cost effective. The substantial savings in energy (about 93%) from utilization of the centrifuge option rather than gaseous diffusion is a compelling argument. As part of this program, Exxon Nuclear has a large hardware R and D program, plus a prototype centrifuge manufacturing capability in Malta, New York. To provide a full-scale machine and limited cascade test capability, Exxon Nuclear is constructing a $4,000,000 Centrifuge Test Facility in Richland, Washington. This facility was to initiate operations in the Fall of 1976. Exxon Nuclear is convinced that the centrifuge enrichment process is the rational selection for emergence of a commercial enrichment industry

  4. HU participates in expression of a specific set of genes required for growth and survival at acidic pH in Escherichia coli.

    Science.gov (United States)

    Bi, Hongkai; Sun, Lianle; Fukamachi, Toshihiko; Saito, Hiromi; Kobayashi, Hiroshi

    2009-05-01

    The major histone-like Escherichia coli protein, HU, is composed of alpha and beta subunits respectively encoded by hupA and hupB in Escherichia coli. A mutant deficient in both hupA and hupB grew at a slightly slower rate than the wild type at pH 7.5. Growth of the mutant diminished with a decrease in pH, and no growth was observed at pH 4.6. Mutants of either hupA or hupB grew at all pH levels tested. The arginine-dependent survival at pH 2.5 was diminished approximately 60-fold by the deletion of both hupA and hupB, whereas the survival was slightly affected by the deletion of either hupA or hupB. The mRNA levels of adiA and adiC, which respectively encode arginine decarboxylase and arginine/agmatine antiporter, were low in the mutant deficient in both hupA and hupB. The deletion of both hupA and hupB had little effect on survival at pH 2.5 in the presence of glutamate or lysine, and expression of the genes for glutamate and lysine decarboxylases was not impaired by the deletion of the HU genes. These results suggest that HU regulates expression of the specific set of genes required for growth and survival in acidic environments.

  5. US enrichment reduction studies

    International Nuclear Information System (INIS)

    1979-06-01

    A major national program, the Reduced Enrichment Research and Test Reactor (RERTR) Program, is currently under way in the U.S., centered at the Argonne National Laboratory (ANL), to reduce the potential of research and test reactor fuels for increasing the proliferation of nuclear explosive devices. The main objective of the program is to provide the technical means by which the uranium enrichment to be used in these reactors can be reduced to less than 20% without significant economic and performance penalties. The criteria, basis and goals of the program are consistent with the results of a number of case studies which have been performed as part of the program

  6. Advanced uranium enrichment processes

    International Nuclear Information System (INIS)

    Clerc, M.; Plurien, P.

    1986-01-01

    Three advanced uranium enrichment processes are dealt with in the report: AVLIS (Atomic Vapour LASER Isotope Separation), MLIS (Molecular LASER Isotope Separation) and PSP (Plasma Separation Process). The description of the physical and technical features of the processes constitutes a major part of the report. It further presents comparisons with existing industrially used enrichment technologies, gives information on actual development programmes and budgets and ends with a chapter on perspectives and conclusions. An extensive bibliography of the relevant open literature is added to the different subjects discussed. The report was drawn up by the nuclear research Centre (CEA) Saclay on behalf of the Commission of the European Communities.

  7. Uranium Conversion & Enrichment

    Energy Technology Data Exchange (ETDEWEB)

    Karpius, Peter Joseph [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2017-02-06

    The isotopes of uranium that are found in nature, and hence in 'fresh' yellowcake, are not in relative proportions that are suitable for power or weapons applications. The goal of conversion then is to transform the U3O8 yellowcake into UF6. Conversion and enrichment of uranium is usually required to obtain material with enough 235U to be usable as fuel in a reactor or weapon. The cost, size, and complexity of practical conversion and enrichment facilities aid in nonproliferation by design.

  8. Hyb-Seq: Combining Target Enrichment and Genome Skimming for Plant Phylogenomics

    Directory of Open Access Journals (Sweden)

    Kevin Weitemier

    2014-08-01

    Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics.

  9. Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics

    Science.gov (United States)

    Weitemier, Kevin; Straub, Shannon C. K.; Cronn, Richard C.; Fishbein, Mark; Schmickl, Roswitha; McDonnell, Angela; Liston, Aaron

    2014-01-01

    • Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. • Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics. PMID:25225629

  10. An Enriching Community.

    Science.gov (United States)

    Holland, Nancy A.; Burroughs, Jean

    2001-01-01

    Successful school-community partnerships in Volusia (Florida) Public Schools are the results of marketing creatively, meeting community members' needs, and bringing the right people together. The 3-year old program now offers students of all ages an expanding list of enrichment classes on many subjects for a nominal fee. (MLH)

  11. Uranium enrichment techniques

    International Nuclear Information System (INIS)

    Hamdoun, N.A.

    2007-01-01

    This article includes an introduction about the isotopes of natural uranium, their existence and the difficulty of the separation between them. Then it goes to the details of a number of methods used to enrich uranium: Gaseous Diffusion method, Electromagnetic method, Jet method, Centrifugal method, Chemical method, Laser method and Plasma method.

  12. Requirements for enrichment tools

    NARCIS (Netherlands)

    Boer, A.; Winkels, R.; Trompper, M.

    2016-01-01

    This report gives a high level overview of requirements for Enrichment tools in the Openlaws.eu project. Openlaws.eu aims to initiate a platform and develop a vision for Big Open Legal Data (BOLD): an open framework for legislation, case law, and legal literature from across Europe.

  13. Enriching the Catalog

    Science.gov (United States)

    Tennant, Roy

    2004-01-01

    After decades of costly and time-consuming effort, nearly all libraries have completed the retrospective conversion of their card catalogs to electronic form. However, bibliographic systems still are really not much more than card catalogs on wheels. Enriched content that Amazon.com takes for granted--such as digitized tables of contents, cover…

  14. Availability of enrichment services

    International Nuclear Information System (INIS)

    Svenke, E.

    1977-01-01

    The report summarizes major uncertainties which are likely to influence future demands for uranium isotopic enrichment. Since for the next decade the development of nuclear power will be largely concerned with the increment in demand the timely need for enrichment capacity will be particularly sensitive to assumptions about growth rates. Existing worldwide capacity together with capacities under construction will be sufficient well into the 1980's. However, long decision and construction leadtime, uncertainty as to future demand as well as other factors, specifically high capital need, all of which entail financial risks, create hindrances to a timely development of increment. The adequacy of current technology is well demonstrated in plant operation and new technology is under way. Technology is, however, not freely available on a purely commercial basis. Commercial willingness, which anticipates a limited degree of financial risk, is requesting both long term back-up from the utilities that would parallel their firm decisions on the acquisition of nuclear power units, and a protective government umbrella. This situation depends on the symbiotic relationship that exists between the nuclear power generating organizations, the enrichment undertakings and the governments involved. The report accordingly stresses the need for a more cooperative approach and this, moreover, at the multinational level. There is otherwise a risk that proper resources and financing means will not be allocated to the enrichment sector. Export limitations that request the highest degree of industrial processing of nuclear fuel, i.e. the compulsory enrichment of natural uranium, do not serve the interests of overall industrial efficiency

  15. Promotion of uranium enrichment business

    International Nuclear Information System (INIS)

    Kurushima, Morihiro

    1981-01-01

    Entrusted by the Ministry of International Trade and Industry, the Committee on Nuclear Power has studied basic nuclear power policy through its five subcommittees. The results of the examination by the subcommittee on uranium enrichment business are given, along with a related report by the Committee. Uranium enrichment is essential to establishing the nuclear fuel cycle. Development of uranium enrichment by the centrifugal process has proceeded steadily at the Power Reactor and Nuclear Fuel Development Corporation. The following matters are described: the need for domestic uranium enrichment; the outlook for overseas enrichment services and the schedule for establishing domestic enrichment business; the current state of technology development; the position of the prototype enrichment plant; the course to be taken to establish enrichment business; the main organization operating the prototype and commercial plants; the system of supplying centrifuges; the domestic conversion of natural uranium; and the subsidies for uranium enrichment business. (J.P.N.)

  16. United States uranium enrichment policies

    International Nuclear Information System (INIS)

    Roberts, R.W.

    1977-01-01

    ERDA's uranium enrichment program policies governing the manner in which ERDA's enrichment complex is being operated and expanded to meet customer requirements for separative work, research and development activities directed at providing technology alternatives for future enrichment capacity, and establishing the framework for additional domestic uranium enrichment capacity to meet the domestic and foreign nuclear industry's growing demand for enrichment services are considered. The ERDA enrichment complex consists of three gaseous diffusion plants located in Oak Ridge, Tennessee; Paducah, Kentucky; and Portsmouth, Ohio. Today, these plants provide uranium enrichment services for commercial nuclear power generation. These enrichment services are provided under contracts between the Government and the utility customers. ERDA's program involves a major pilot plant cascade, and pursues an advanced isotope separation technique for the late 1980's. That the United States must develop additional domestic uranium enrichment capacity is discussed

  17. Uranium-enriched granites in Sweden

    International Nuclear Information System (INIS)

    Wilson, M.R.; Aakerblom, G.

    1980-01-01

    Granites with uranium contents higher than normal occur in a variety of geological settings in the Swedish Precambrian, and represent a variety of granite types and ages. They may have been generated by the anatexis of continental crust or processes occurring at a much greater depth. They commonly show enrichment in F, Sn, W and/or Mo. Only in one case is an important uranium mineralization thought to be directly related to a uranium-enriched granite, while the majority of epigenetic uranium mineralizations with economic potential are related to hydrothermal processes in areas where the bedrock is regionally uranium-enhanced. (author)

  18. Enrichment and Preservation of Architectural Knowledge

    DEFF Research Database (Denmark)

    Beetz, Jakob; Blümel, Ina; Dietze, Stefan

    2016-01-01

    In the context of the EU FP7 DURAARK project (2013–2016), inter-disciplinary methods, technologies and tools have been researched and developed, that support the Long Term Preservation of semantically enriched digital representations of built structures. The results of the research efforts include...... approaches of semi-automatically deriving building models from point cloud data sets acquired from laser scans and the integration and overlay of such representations with explicit Building Information Models (BIM). We introduce novel ways for the further semantic enrichment of such hybrid building models...

  19. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnostic methods and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from the Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349, and functional enrichment analysis was performed on the gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. A support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and the PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. In addition, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from the GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that were highly associated with ACP. The PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in the PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracies of the SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 was significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, consistent with the differentially expressed gene analysis. The SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis.

  20. Fine-scale linkage mapping reveals a small set of candidate genes influencing honey bee grooming behavior in response to Varroa mites.

    Directory of Open Access Journals (Sweden)

    Miguel E Arechavaleta-Velasco

    Populations of honey bees in North America have been experiencing high annual colony mortality for 15-20 years. Many apicultural researchers believe that introduced parasites called Varroa mites (V. destructor) are the most important factor in colony deaths. One important resistance mechanism that limits mite population growth in colonies is the ability of some lines of honey bees to groom mites from their bodies. To search for genes influencing this trait, we used an Illumina Bead Station genotyping array to determine the genotypes of several hundred worker bees at over a thousand single-nucleotide polymorphisms in a family that was apparently segregating for alleles influencing this behavior. Linkage analyses provided a genetic map with 1,313 markers anchored to genome sequence. Genotypes were analyzed for association with grooming behavior, measured as the time that individual bees took to initiate grooming after mites were placed on their thoraces. Quantitative-trait-locus interval mapping identified a single chromosomal region that was significant at the chromosome-wide level (p<0.05) on chromosome 5 with a LOD score of 2.72. The 95% confidence interval for quantitative trait locus location contained only 27 genes (honey bee official gene annotation set 2), including Atlastin, Ataxin and Neurexin-1 (AmNrx1), which have potential neurodevelopmental and behavioral effects. Atlastin and Ataxin homologs are associated with neurological diseases in humans. AmNrx1 codes for a presynaptic protein with many alternatively spliced isoforms. Neurexin-1 influences the growth, maintenance and maturation of synapses in the brain, as well as the type of receptors most prominent within synapses. Neurexin-1 has also been associated with autism spectrum disorder and schizophrenia in humans, and self-grooming behavior in mice.

  1. Thy1.2 driven expression of transgenic His₆-SUMO2 in the brain of mice alters a restricted set of genes.

    Science.gov (United States)

    Rossner, Moritz J; Tirard, Marilyn

    2014-08-05

    Protein SUMOylation is a post-translational protein modification with a key regulatory role in nerve cell development and function, but its function in mammals in vivo has only been studied cursorily. We generated two new transgenic mouse lines that express His6-tagged SUMO1 and SUMO2 driven by the Thy1.2 promoter. The brains of mice of the two lines express transgenic His6-SUMO peptides and conjugate them to substrates in vivo but cytoarchitecture and synaptic organization of adult transgenic mouse brains are indistinguishable from the wild-type situation. We investigated the impact of transgenic SUMO expression on gene transcription in the hippocampus by performing genome wide analyses using microarrays. Surprisingly, no changes were observed in Thy1.2::His6-SUMO1 transgenic mice and only a restricted set of genes were upregulated in Thy1.2::His6-SUMO2 mice. Among these, Penk1 (Preproenkephalin 1), which encodes Met-enkephalin neuropeptides, showed the highest degree of alteration. Accordingly, a significant increase in Met-enkephalin peptide levels in the hippocampus of Thy1.2::His6-SUMO2 was detected, but the expression levels and cellular localization of Met-enkephalin receptors were not changed. Thus, transgenic neuronal expression of His6-SUMO1 or His6-SUMO2 only induces very minor phenotypical changes in mice. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Future of uranium enrichment

    International Nuclear Information System (INIS)

    Hosmer, C.

    1981-01-01

    The increasing amount of separative work being done in government facilities to produce low-enriched uranium fuel for nuclear utilities again raises the question: should this business-type, industrial function be turned over to private industry? The idea is being looked at by the Reagan administration, but it faces problems of national security as well as problems arising from the unique nature of the business. This article suggests that a joint government-private venture combining enriching, reprocessing, and waste disposal could be the answer. Further, a separate entity using advanced laser technology to deplete existing uranium tails and lease them for fertile blankets in breeder reactors might earn substantial revenues to help reduce the national debt

  3. Clinicopathologic and gene expression parameters predict liver cancer prognosis

    International Nuclear Information System (INIS)

    Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

    2011-01-01

    The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome

  4. Combining target enrichment with barcode multiplexing for high throughput SNP discovery

    Directory of Open Access Journals (Sweden)

    Lunke Sebastian

    2010-11-01

    Background: The primary goal of genetic linkage analysis is to identify genes affecting a phenotypic trait. After localisation of the linkage region, efficient genetic dissection of the disease-linked loci requires that functional variants are identified across the loci. These functional variations are difficult to detect due to the extent of genetic diversity and, to date, incomplete cataloguing of the large number of variants present both within and between populations. Massively parallel sequencing platforms offer unprecedented capacity for variant discovery; however, the number of samples analysed is still limited by cost per sample. Some progress has been made in reducing the cost of resequencing, using either multiplexing methodologies or targeted enrichment technologies, which provide the ability to resequence genomic areas of interest rather than the full genome. Results: We developed a method that combines current multiplexing methodologies with a solution-based target enrichment method to further reduce the cost of resequencing where region-specific sequencing is required. Our multiplex/enrichment strategy produced high quality data with nominal reduction of sequencing depth. We undertook a genotyping study and were successful in the discovery of novel SNP alleles in all samples at uniplex, duplex and pentaplex levels. Conclusion: Our work describes the successful combination of a targeted enrichment method and index barcode multiplexing to reduce costs, time and labour associated with processing large sample sets. Furthermore, we have shown that the sequencing depth obtained is adequate for credible SNP genotyping analysis at uniplex, duplex and pentaplex levels.

  5. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  6. Modulation of microbial consortia enriched from different polluted environments during petroleum biodegradation.

    Science.gov (United States)

    Omrani, Rahma; Spini, Giulia; Puglisi, Edoardo; Saidane, Dalila

    2018-04-01

    Environmental microbial communities are key players in the bioremediation of hydrocarbon pollutants. Here we assessed changes in bacterial abundance and diversity during the degradation of Tunisian Zarzatine oil by four indigenous bacterial consortia enriched from a petroleum station soil, a refinery reservoir soil, a harbor sediment and seawater. The four consortia were found to efficiently degrade up to 92.0% of total petroleum hydrocarbons after 2 months of incubation. Illumina 16S rRNA gene sequencing revealed that the consortia enriched from soil and sediments were dominated by species belonging to the Pseudomonas and Acinetobacter genera, while in the seawater-derived consortia Dietzia, Fusobacterium and Mycoplana emerged as dominant genera. We identified a number of species whose relative abundances bloomed from small to high percentages: Dietzia daqingensis in the seawater microcosms, and three OTUs classified as Acinetobacter venetianus in both soil-derived microcosms and the sediment-derived microcosms. Functional analyses on degrading genes were conducted by comparing PCR results of the degrading genes alkB, ndoB, cat23, xylA and nidA1 with inferences obtained by PICRUSt analysis of 16S amplicon data: the two data sets were partly in agreement and suggest a relationship between the catabolic genes detected and the rate of biodegradation obtained. The work provides detailed insights into the modulation of bacterial communities involved in petroleum biodegradation and can provide useful information for in situ bioremediation of oil-related pollution.

  7. Set points, settling points and some alternative models: theoretical options to understand how genes and environments combine to regulate body adiposity

    Directory of Open Access Journals (Sweden)

    John R. Speakman

    2011-11-01

    The close correspondence between energy intake and expenditure over prolonged time periods, coupled with an apparent protection of the level of body adiposity in the face of perturbations of energy balance, has led to the idea that body fatness is regulated via mechanisms that control intake and energy expenditure. Two models have dominated the discussion of how this regulation might take place. The set point model is rooted in physiology, genetics and molecular biology, and suggests that there is an active feedback mechanism linking adipose tissue (stored energy) to intake and expenditure via a set point, presumably encoded in the brain. This model is consistent with many of the biological aspects of energy balance, but struggles to explain the many significant environmental and social influences on obesity, food intake and physical activity. More importantly, the set point model does not effectively explain the ‘obesity epidemic’ – the large increase in body weight and adiposity of a large proportion of individuals in many countries since the 1980s. An alternative model, called the settling point model, is based on the idea that there is passive feedback between the size of the body stores and aspects of expenditure. This model accommodates many of the social and environmental characteristics of energy balance, but struggles to explain some of the biological and genetic aspects. The shortcomings of these two models reflect their failure to address the gene-by-environment interactions that dominate the regulation of body weight. We discuss two additional models – the general intake model and the dual intervention point model – that address this issue and might offer better ways to understand how body fatness is controlled.

  8. Beta activity of enriched uranium

    International Nuclear Information System (INIS)

    Nambiar, P.P.V.J.; Ramachandran, V.

    1975-01-01

    Use of enriched uranium as reactor fuel necessitates its handling in various forms. For purposes of planning and organising radiation protection measures in enriched uranium handling facilities, it is necessary to have a basic knowledge of the radiation status of enriched uranium systems. The theoretical variations in beta activity and energy with U-235 enrichment are presented. Depletion is considered separately. Beta activity build up is also studied for two specific enrichments, in respect of which experimental values for specific alpha activity are available. (author)

  9. Blueprint for domestic uranium enrichment

    International Nuclear Information System (INIS)

    1981-01-01

    The AEC advisory committee on domestic production of uranium enrichment studied for more than a year how to achieve the domestic enrichment of uranium through the construction and operation of a commercial enrichment plant using the centrifugal separation method, and its report was submitted to the Atomic Energy Commission on August 18, 1980. Japan has depended wholly on overseas services for her uranium enrichment needs, but the development of domestic enrichment has been carried on in parallel. The AEC decided to construct a uranium enrichment pilot plant using centrifuges, and it has been advanced as a national project. The plant has been operated by the Power Reactor and Nuclear Fuel Development Corp. since 1979. The capacity of the plant will be raised to approximately 75 ton SWU a year. The centrifuges already in operation have provided the first delivery of about 1 ton of fuel for the ATR "Fugen". The demand-supply balance of uranium enrichment service, the significance of the domestic enrichment of uranium, the evaluation of uranium enrichment technology, the target for the domestic enrichment plan, the measures to promote domestic uranium enrichment, and the promotion of the construction of a demonstration plant are reported. (Kako, I.)

  10. Uranium enrichment by gas centrifuge

    International Nuclear Information System (INIS)

    Heriot, I.D.

    1988-01-01

    After recalling the physical principles and the techniques of centrifuge enrichment the report describes the centrifuge enrichment programmes of the various countries concerned and compares this technology with other enrichment technologies like gaseous diffusion, laser, aerodynamic devices and chemical processes. The centrifuge enrichment process is said to be able to replace with advantage the existing enrichment facilities in the short and medium term. Future prospects of the process are also described, like recycled uranium enrichment and economic improvements; research and development needs to achieve the economic prospects are also indicated. Finally the report takes note of the positive aspect of centrifuge enrichment as far as safeguards and nuclear safety are concerned. 27 figs, 113 refs

  11. Radiometric enrichment of nonradioactive ores

    International Nuclear Information System (INIS)

    Mokrousov, V.A.; Lileev, V.A.

    1979-01-01

    Methods of mineral enrichment based on the use of radiation of various types are considered. The physical essence of the enrichment processes is presented, and their classification is given. The ore properties influencing the efficiency of radiometric enrichment, the methods for studying these properties, and the estimation of ore enrichment are described. New possibilities opened up by radiometric enrichment in the technology of primary processing of mineral raw materials are elucidated. Considerable attention is paid to the main and auxiliary equipment for radiometric enrichment. The foundations of safety engineering are presented in brief. Results of investigations and practical work on the enrichment of ores of non-ferrous, ferrous and non-metallic minerals with the help of radiometric methods are also presented

  12. Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

    Science.gov (United States)

    He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

    2017-01-01

    The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, all of which comprised upregulated genes. Genes in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were predominantly enriched in pathways associated with ribosomes and protein translation. The X-box binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF
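
The thresholding step described in this abstract (log2 fold-change and false discovery rate cut-offs) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the study used R/limma, whereas here the per-gene p-values and log2 fold-changes are assumed to be already computed, the fold-change cut-off is assumed to apply to the absolute value, the false discovery rate is assumed to be a Benjamini-Hochberg adjustment, and the function names `benjamini_hochberg` and `select_degs` are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(genes, log2fc, pvalues, fc_cut=2.0, fdr_cut=0.01):
    """Keep genes with |log2FC| > fc_cut and adjusted p-value < fdr_cut.

    Assumption: the cut-off is applied to the absolute fold-change, so
    both up- and downregulated genes can pass.
    """
    fdr = benjamini_hochberg(pvalues)
    selected = []
    for gene, fc, q in zip(genes, log2fc, fdr):
        if abs(fc) > fc_cut and q < fdr_cut:
            selected.append((gene, "up" if fc > 0 else "down"))
    return selected
```

Calling `select_degs` with gene names, fold-changes and raw p-values returns the genes that pass both cut-offs, each labelled by direction of change.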

  13. A gene pathway analysis highlights the role of cellular adhesion molecules in multiple sclerosis susceptibility

    DEFF Research Database (Denmark)

    Damotte, V; Guillot-Noel, L; Patsopoulos, N A

    2014-01-01

    adhesion molecule (CAMs) biological pathway using Cytoscape software. This network is a strong candidate, as it is involved in the crossing of the blood-brain barrier by the T cells, an early event in MS pathophysiology, and is used as an efficient therapeutic target. We drew up a list of 76 genes...... in interaction with other genes as a group. Pathway analysis is an alternative way to highlight such group of genes. Using SNP association P-values from eight multiple sclerosis (MS) GWAS data sets, we performed a candidate pathway analysis for MS susceptibility by considering genes interacting in the cell...... belonging to the CAM network. We highlighted 64 networks enriched with CAM genes with low P-values. Filtering by a percentage of CAM genes up to 50% and rejecting enriched signals mainly driven by transcription factors, we highlighted five networks associated with MS susceptibility. One of them, constituted...

  14. CELF family RNA-binding protein UNC-75 regulates two sets of mutually exclusive exons of the unc-32 gene in neuron-specific manners in Caenorhabditis elegans.

    Directory of Open Access Journals (Sweden)

    Hidehito Kuroyanagi

    An enormous number of alternative pre-mRNA splicing patterns in multicellular organisms are coordinately defined by a limited number of regulatory proteins and cis elements. Mutually exclusive alternative splicing should be strictly regulated and is a challenging model for elucidating regulation mechanisms. Here we provide models of the regulation of two sets of mutually exclusive exons, 4a-4c and 7a-7b, of the Caenorhabditis elegans uncoordinated (unc)-32 gene, encoding the a subunit of V0 complex of vacuolar-type H(+)-ATPases. We visualize selection patterns of exon 4 and exon 7 in vivo by utilizing a trio and a pair of symmetric fluorescence splicing reporter minigenes, respectively, to demonstrate that they are regulated in tissue-specific manners. Genetic analyses reveal that RBFOX family RNA-binding proteins ASD-1 and FOX-1 and a UGCAUG stretch in intron 7b are involved in the neuron-specific selection of exon 7a. Through further forward genetic screening, we identify UNC-75, a neuron-specific CELF family RNA-binding protein of unknown function, as an essential regulator for the exon 7a selection. Electrophoretic mobility shift assays specify a short fragment in intron 7a as the recognition site for UNC-75 and demonstrate that UNC-75 specifically binds via its three RNA recognition motifs to the element including a UUGUUGUGUUGU stretch. The UUGUUGUGUUGU stretch in the reporter minigenes is actually required for the selection of exon 7a in the nervous system. We compare the amounts of partially spliced RNAs in the wild-type and unc-75 mutant backgrounds and raise a model for the mutually exclusive selection of unc-32 exon 7 by the RBFOX family and UNC-75. The neuron-specific selection of unc-32 exon 4b is also regulated by UNC-75 and the unc-75 mutation suppresses the Unc phenotype of the exon-4b-specific allele of unc-32 mutants. Taken together, UNC-75 is the neuron-specific splicing factor and regulates both sets of the mutually exclusive

  15. The world enrichment market

    International Nuclear Information System (INIS)

    Gunter, L.; McCants, C.; Rutkowski, E.

    1991-01-01

    The enrichment market can be divided into two periods: the near-term market (1991 to 1995) and the long-term market (1995 and beyond). The near-term market is characterized by limited unfilled requirements of 4% per year, to be supplied by national stockpiles and excess inventories. This low-cost material will be drawn down by about 1993, causing a subsequent price rise. As the price rises, primary supplier activity is expected to increase. In the near-term, two contracting activities are apparent: spot; and intermediate-term. The current spot market is expected to last until available low cost inventories are drawn down. Recently, in attempts to gain market share, suppliers have offered attractively priced intermediate-term (3 year) contracts for 1996 to 1998. While a small spot market will continue after 1995, it is anticipated that utilities will prefer a mix of medium- and long-term (5 to 10 year) contracts from primary suppliers for most of their enrichment requirements. As national stockpiles and utility inventories are consumed, low-cost supply available to the spot market is expected to diminish. Consequently, with little low-cost supply available, the only apparent source of material will be from primary suppliers, and the resulting competition over market share is expected to be intense. (author)

  16. Evaluating biomarkers for prognostic enrichment of clinical trials.

    Science.gov (United States)

    Kerr, Kathleen F; Roth, Jeremy; Zhu, Kehao; Thiessen-Philbrook, Heather; Meisner, Allison; Wilson, Francis Perry; Coca, Steven; Parikh, Chirag R

    2017-12-01

    A potential use of biomarkers is to assist in prognostic enrichment of clinical trials, where only patients at relatively higher risk for an outcome of interest are eligible for the trial. We investigated methods for evaluating biomarkers for prognostic enrichment. We identified five key considerations when considering a biomarker and a screening threshold for prognostic enrichment: (1) clinical trial sample size, (2) calendar time to enroll the trial, (3) total patient screening costs and the total per-patient trial costs, (4) generalizability of trial results, and (5) ethical evaluation of trial eligibility criteria. Items (1)-(3) are amenable to quantitative analysis. We developed the Biomarker Prognostic Enrichment Tool for evaluating biomarkers for prognostic enrichment at varying levels of screening stringency. We demonstrate that both modestly prognostic and strongly prognostic biomarkers can improve trial metrics using Biomarker Prognostic Enrichment Tool. Biomarker Prognostic Enrichment Tool is available as a webtool at http://prognosticenrichment.com and as a package for the R statistical computing platform. In some clinical settings, even biomarkers with modest prognostic performance can be useful for prognostic enrichment. In addition to the quantitative analysis provided by Biomarker Prognostic Enrichment Tool, investigators must consider the generalizability of trial results and evaluate the ethics of trial eligibility criteria.
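
The trade-off between considerations (1) and (3) above can be illustrated with a rough Python sketch. It is not the method implemented in the Biomarker Prognostic Enrichment Tool: it assumes a textbook two-proportion sample-size approximation, a fixed relative risk reduction from treatment, and hypothetical function names (`n_per_arm`, `patients_to_screen`). Enrolling only higher-risk patients raises the control-arm event rate, which lowers the required sample size, while the number of patients who must be screened grows as the eligible fraction shrinks.

```python
import math
from statistics import NormalDist


def n_per_arm(control_event_rate, relative_risk_reduction,
              alpha=0.05, power=0.80):
    """Approximate patients per arm for a two-arm trial with a binary outcome.

    Assumption: normal-approximation formula for comparing two proportions
    with a two-sided test; the tool's actual calculations may differ.
    """
    p1 = control_event_rate
    p2 = p1 * (1.0 - relative_risk_reduction)  # event rate under treatment
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_beta = NormalDist().inv_cdf(power)
    variance = p1 * (1.0 - p1) + p2 * (1.0 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / (p1 - p2) ** 2)


def patients_to_screen(total_enrolled, eligible_fraction):
    """Patients who must be screened so that total_enrolled are eligible."""
    return math.ceil(total_enrolled / eligible_fraction)
```

Comparing `n_per_arm` at a low and a high control-arm event rate, and then passing the resulting total to `patients_to_screen`, shows how a stricter screening threshold trades a smaller trial against more screening.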

  17. 47 CFR 1.2111 - Assignment or transfer of control: unjust enrichment.

    Science.gov (United States)

    2010-10-01

    ... enrichment. 1.2111 Section 1.2111 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL PRACTICE AND...: unjust enrichment. (a) Reporting requirement. An applicant seeking approval for a transfer of control or... an option to purchase; below market financing). (b) Unjust enrichment payment: set-aside. As...

  18. Modular enrichment measurement system for in-situ enrichment assay

    International Nuclear Information System (INIS)

    Stewart, J.P.

    1976-01-01

    A modular enrichment measurement system has been designed and is in operation within General Electric's Nuclear Fuel Fabrication Facility for the in-situ enrichment assay of uranium-bearing materials in process containers. This enrichment assay system, which is based on the "enrichment meter" concept, is an integral part of the site's enrichment control program and is used in the in-situ assay of the enrichment of uranium dioxide (UO2) powder in process containers (five-gallon pails). The assay system utilizes a commercially available modular counting system and a collimator designed for compatibility with process container transport lines and ease of operator access. The system has been upgraded to include a microprocessor-based controller to perform system operation functions and to provide data acquisition and processing functions. Standards have been fabricated and qualified for the enrichment assay of several types of uranium-bearing materials, including UO2 powders. The assay system has performed in excess of 20,000 enrichment verification measurements annually and has significantly contributed to the facility's enrichment control program.

  19. The AHL- and BDSF-dependent quorum sensing systems control specific and overlapping sets of genes in Burkholderia cenocepacia H111.

    Directory of Open Access Journals (Sweden)

    Nadine Schmid

    Quorum sensing in Burkholderia cenocepacia H111 involves two signalling systems that depend on different signal molecules, namely N-acyl homoserine lactones (AHLs) and the diffusible signal factor cis-2-dodecenoic acid (BDSF). Previous studies have shown that AHLs and BDSF control similar phenotypic traits, including biofilm formation, proteolytic activity and pathogenicity. In this study we mapped the BDSF stimulon by RNA-Seq and shotgun proteomics analysis. We demonstrate that a set of the identified BDSF-regulated genes or proteins are also controlled by AHLs, suggesting that the two regulons partially overlap. The detailed analysis of two mutually regulated operons, one encoding three lectins and the other one encoding the large surface protein BapA and its type I secretion machinery, revealed that both AHLs and BDSF are required for full expression, suggesting that the two signalling systems operate in parallel. In accordance with this, we show that both AHLs and BDSF are required for biofilm formation and protease production.

  20. Enrichment of boron 10

    International Nuclear Information System (INIS)

    Coutinho, C.M.M.; Rodrigues Filho, J.S.R.; Umeda, K.; Echternacht, M.V.

    1990-01-01

    An isotopic separation pilot plant with five ion exchange columns interconnected in series was designed and built at the IEN. The columns are charged with a strong anionic resin in its alkaline form. The boric acid solution is introduced into the separation columns until it reaches an absorbing zone length that is sufficient to obtain the desired boron-10 isotopic concentration. The movement of the boric acid absorbing zone is driven by the injection of a dilute hydrochloric acid solution, which replaces the boric acid throughout the columns. The absorbing zone equilibrium length is proportional to its total length. The enriched boron-10 and the depleted boron are located at the final boundary and at the initial position of the absorbing zones, respectively. (author)

  1. SNP-based pathway enrichment analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Potkin Steven G

    2011-04-01

    Background: Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS) to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs), have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can explain only a small fraction of the genetic risks associated with the complex diseases. New strategies and statistical approaches are needed to address this lack of explanation. One such approach is pathway analysis, which considers the genetic variations underlying a biological pathway jointly, rather than separately as in traditional GWAS. A critical challenge in pathway analysis is how to combine evidence of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs. Results: We describe a SNP-based pathway enrichment method for GWAS. The method consists of the following two main steps: (1) for a given pathway, using an adaptive truncated product statistic to identify all representative (potentially more than one) SNPs of each gene, calculating the average number of representative SNPs for the genes, then re-selecting the representative SNPs of genes in the pathway based on this number; and (2) ranking all selected SNPs by the significance of their statistical association with a trait of interest, and testing if the set of SNPs from a particular pathway is significantly enriched with high ranks using a weighted Kolmogorov-Smirnov test. 
We applied our method to two large genetically distinct GWAS data sets of schizophrenia, one
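Step (2) of the method described above can be illustrated with a short Python sketch of a weighted Kolmogorov-Smirnov running-sum statistic. It follows the general GSEA-style formulation, which is an assumption here: the abstract does not give the exact weighting the authors used, and the adaptive truncated product selection of step (1) is not reproduced. The function name, the SNP identifiers, and the scores are hypothetical.

```python
def weighted_ks_score(ranked_ids, scores, member_ids, weight=1.0):
    """Weighted KS enrichment score for a set within a ranked list.

    ranked_ids: SNP identifiers ordered from most to least significant.
    scores:     association scores in the same order (e.g. -log10 p).
    member_ids: identifiers belonging to the pathway being tested.
    Returns the signed running-sum deviation with the largest magnitude.
    """
    members = set(member_ids)
    hit_weights = [
        abs(score) ** weight if snp in members else 0.0
        for snp, score in zip(ranked_ids, scores)
    ]
    total_hit = sum(hit_weights)
    n_miss = sum(1 for snp in ranked_ids if snp not in members)
    if total_hit == 0 or n_miss == 0:
        raise ValueError("need at least one weighted hit and one miss")

    running, best = 0.0, 0.0
    for snp, hit_weight in zip(ranked_ids, hit_weights):
        if snp in members:
            running += hit_weight / total_hit  # step up, weighted by score
        else:
            running -= 1.0 / n_miss            # step down by a constant
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical example: pathway SNPs concentrated at the top of the ranking.
ranked = ["snp1", "snp2", "snp3", "snp4", "snp5", "snp6"]
assoc = [6.0, 5.0, 3.0, 2.0, 1.0, 0.5]
print(weighted_ks_score(ranked, assoc, {"snp1", "snp2"}))
```

In practice the significance of such a score is usually judged against a null distribution obtained by permutation; that step is omitted from this sketch.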

  2. Genes Underlying Positive Influence Of Prenatal Environmental ...

    African Journals Online (AJOL)

    Genes Underlying Positive Influence Of Prenatal Environmental Enrichment And ... Prenatal environmental enrichment (EE) has been proven to positively affect but ... Conclusion: The negative-positive prenatal effect could contribute to altered ...

  3. Thermal breeder fuel enrichment zoning

    International Nuclear Information System (INIS)

    Capossela, H.J.; Dwyer, J.R.; Luce, R.G.; McCoy, D.F.; Merriman, F.C.

    1992-01-01

    A method and apparatus for improving the performance of a thermal breeder reactor having regions of higher than average moderator concentration are disclosed. The fuel modules of the reactor core contain at least two different types of fuel elements, a high enrichment fuel element and a low enrichment fuel element. The two types of fuel elements are arranged in the fuel module with the low enrichment fuel elements located between the high moderator regions and the high enrichment fuel elements. Preferably, shim rods made of a fertile material are provided in selective regions for controlling the reactivity of the reactor by movement of the shim rods into and out of the reactor core. The moderation of neutrons adjacent the high enrichment fuel elements is preferably minimized as by reducing the spacing of the high enrichment fuel elements and/or using a moderator having a reduced moderating effect. 1 figure

  4. Advanced Neutron Source enrichment study

    International Nuclear Information System (INIS)

    Bari, R.A.; Ludewig, H.; Weeks, J.R.

    1996-01-01

    A study has been performed of the impact on performance of using low-enriched uranium (20% 235U) or medium-enriched uranium (35% 235U) as an alternative fuel for the Advanced Neutron Source, which was initially designed to use uranium enriched to 93% 235U. Higher fuel densities and larger volume cores were evaluated at the lower enrichments in terms of impact on neutron flux, safety, safeguards, technical feasibility, and cost. The feasibility of fabricating uranium silicide fuel at increasing material density was specifically addressed by a panel of international experts on research reactor fuels. The most viable alternative designs for the reactor at lower enrichments were identified and discussed. Several sensitivity analyses were performed to gain an understanding of the performance of the reactor at parametric values of power, fuel density, core volume, and enrichment that were interpolations between the boundary values imposed on the study or extrapolations from known technology.

  5. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of the transcriptome. Many human tissue/cell transcriptome datasets generated by RNA-Seq technology are available in public data resources. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is more closely related to physiological condition than to tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell and tissue types, respectively. To characterize the gene expression pattern in terms of expression level and variation, we first apply an improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (HK genes that are specific to the cancer group but not to the normal group) are expressed at higher levels and more variably in the cancer condition than in the normal condition. Cancer-associated HK genes tend to be AT-rich, are enriched in functions related to cell cycle regulation, and constitute some cancer signatures. The expression of large genes is also avoided in the cancer group. These studies will help us understand which patterns of gene expression differ among cell types, particularly in cancer. PMID:23382867

  6. Evaluation of the uranium enrichment demonstration plant project

    International Nuclear Information System (INIS)

    Sugitsue, Noritake

    2001-01-01

    This report evaluates the organizational system of the uranium enrichment business, based on the operation of the uranium enrichment demonstration plant. The evaluation found that, in uranium enrichment technology development and business, maintaining an organization that unites research, engineering, and operation is necessary at the industrialization stage, given the exceptional R and D cycle. Japan Nuclear Fuel Ltd. (JNFL) set up the Rokkashomura Aomori Uranium Enrichment Research and Development Center in November 2000, which put in place a system in which the company engages directly in engineering development. The results obtained there are expected to contribute to the firm establishment of Japan's uranium enrichment business. (author)

  7. Genomic Analysis Reveals Contrasting PIFq Contribution to Diurnal Rhythmic Gene Expression in PIF-Induced and -Repressed Genes.

    Science.gov (United States)

    Martin, Guiomar; Soy, Judit; Monte, Elena

    2016-01-01

    Members of the PIF quartet (PIFq; PIF1, PIF3, PIF4, and PIF5) collectively contribute to induce growth in Arabidopsis seedlings under short day (SD) conditions, specifically promoting elongation at dawn. Their action involves the direct regulation of growth-related and hormone-associated genes. However, a comprehensive definition of the PIFq-regulated transcriptome under SD is still lacking. We have recently shown that SD and free-running (LL) conditions correspond to "growth" and "no growth" conditions, respectively, correlating with greater abundance of PIF protein in SD. Here, we present a genomic analysis whereby we first define SD-regulated genes at dawn compared to LL in the wild type, followed by identification of those SD-regulated genes whose expression depends on the presence of PIFq. By using this sequential strategy, we have identified 349 PIF/SD-regulated genes, approximately 55% induced and 42% repressed by both SD and PIFq. Comparison with available databases indicates that PIF/SD-induced and PIF/SD-repressed sets are differently phased at dawn and mid-morning, respectively. In addition, we found that whereas rhythmicity of the PIF/SD-induced gene set is lost in LL, most PIF/SD-repressed genes keep their rhythmicity in LL, suggesting differential regulation of both gene sets by the circadian clock. Moreover, we also uncovered distinct overrepresented functions in the induced and repressed gene sets, in accord with previous studies in other examined PIF-regulated processes. Interestingly, promoter analyses showed that, whereas PIF/SD-induced genes are enriched in direct PIF targets, PIF/SD-repressed genes are mostly indirectly regulated by the PIFs and might be more enriched in ABA-regulated genes.

  8. Polymorphisms in sodium-dependent vitamin C transporter genes and plasma, aqueous humor and lens nucleus ascorbate concentrations in an ascorbate depleted setting.

    Science.gov (United States)

    Senthilkumari, Srinivasan; Talwar, Badri; Dharmalingam, Kuppamuthu; Ravindran, Ravilla D; Jayanthi, Ramamurthy; Sundaresan, Periasamy; Saravanan, Charu; Young, Ian S; Dangour, Alan D; Fletcher, Astrid E

    2014-07-01

    We have previously reported low concentrations of plasma ascorbate and low dietary vitamin C intake in the older Indian population and a strong inverse association of these with cataract. Little is known about ascorbate levels in aqueous humor and lens in populations habitually depleted of ascorbate and no studies in any setting have investigated whether genetic polymorphisms influence ascorbate levels in ocular tissues. Our objectives were to investigate relationships between ascorbate concentrations in plasma, aqueous humor and lens and whether these relationships are influenced by Single Nucleotide Polymorphisms (SNPs) in sodium-dependent vitamin C transporter genes (SLC23A1 and SLC23A2). We enrolled sixty patients (equal numbers of men and women, mean age 63 years) undergoing small incision cataract surgery in southern India. We measured ascorbate concentrations in plasma, aqueous humor and lens nucleus using high performance liquid chromatography. SLC23A1 SNPs (rs4257763, rs6596473) and SLC23A2 SNPs (rs1279683 and rs12479919) were genotyped using a TaqMan assay. Patients were interviewed for lifestyle factors which might influence ascorbate. Plasma vitamin C was normalized by a log10 transformation. Statistical analysis used linear regression with the slope of the within-subject associations estimated using beta (β) coefficients. The ascorbate concentrations (μmol/L) were: plasma ascorbate, median and inter-quartile range (IQR), 15.2 (7.8, 34.5), mean (SD) of aqueous humor ascorbate, 1074 (545) and lens nucleus ascorbate, 0.42 (0.16) (μmol/g lens nucleus wet weight). Minimum allele frequencies were: rs1279683 (0.28), rs12479919 (0.30), rs659647 (0.48). Decreasing concentrations of ocular ascorbate from the common to the rare genotype were observed for rs6596473 and rs12479919. The per allele difference in aqueous humor ascorbate for rs6596473 was -217 μmol/L, p humor ascorbate were higher for the GG genotype of rs6596473: GG, β = 1460 compared to

  9. Enriching Genomic Resources and Marker Development from Transcript Sequences of Jatropha curcas for Microgravity Studies

    Science.gov (United States)

    Tian, Wenlan; Paudel, Dev

    2017-01-01

    Jatropha (Jatropha curcas L.) is an economically important species with great potential for biodiesel production. To enrich the jatropha genomic databases and resources for microgravity studies, we sequenced and annotated the transcriptome of jatropha and developed SSR and SNP markers from the transcriptome sequences. In total, 1,714,433 raw reads with an average length of 441.2 nucleotides were generated. De novo assembly and clustering resulted in 115,611 uniquely assembled sequences (UASs), including 21,418 full-length cDNAs and 23,264 new jatropha transcript sequences. The whole set of UASs was fully annotated: 59,903 (51.81%) were assigned gene ontology (GO) terms, 12,584 (10.88%) had orthologs in Eukaryotic Orthologous Groups (KOG), and 8,822 (7.63%) were mapped to 317 pathways in six different categories in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The set also contained 3,588 putative transcription factors. From the UASs, 9,798 SSRs were discovered, with AG/CT as the most frequent (45.8%) SSR motif type. A further 38,693 SNPs were detected, of which 7,584 remained after filtering. This UAS set has enriched the current jatropha genomic databases and provided a large number of genetic markers, which can facilitate jatropha genetic improvement and many other genetic and biological studies. PMID:28154822

  10. Identification of key pathways and genes influencing prognosis in bladder urothelial carcinoma

    Directory of