WorldWideScience

Sample records for gene ontology pathways

  1. Gene Ontology

    Directory of Open Access Journals (Sweden)

    Gaston K. Mazandu

    2012-01-01

    Full Text Available The wide coverage and biological relevance of the Gene Ontology (GO, confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is a need for an efficient, scalable similarity measure for GO terms and GO-annotated proteins. While several GO similarity measures exist, none adequately addresses all issues surrounding the design and usage of the ontology. We introduce a new metric for measuring the distance between two GO terms using the intrinsic topology of the GO-DAG, thus enabling the measurement of functional similarities between proteins based on their GO annotations. We assess the performance of this metric using a ROC analysis on human protein-protein interaction datasets and correlation coefficient analysis on the selected set of protein pairs from the CESSM online tool. This metric achieves good performance compared to the existing annotation-based GO measures. We used this new metric to assess functional similarity between orthologues, and show that it is effective at determining whether orthologues are annotated with similar functions and identifying cases where annotation is inconsistent between orthologues.

  2. An ontology-driven semantic mashup of gene and biological pathway information: application to the domain of nicotine dependence.

    Science.gov (United States)

    Sahoo, Satya S; Bodenreider, Olivier; Rutter, Joni L; Skinner, Karen J; Sheth, Amit P

    2008-10-01

    This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base. We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries. Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins. Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces. RESOURCE PAGE: http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/

  3. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements.

    Science.gov (United States)

    Mi, Huaiyu; Huang, Xiaosong; Muruganujan, Anushya; Tang, Haiming; Mills, Caitlin; Kang, Diane; Thomas, Paul D

    2017-01-04

    The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past year, major improvements include a large expansion of classification information available in PANTHER, as well as significant enhancements to the analysis tools. Protein subfamily functional classifications have more than doubled due to progress of the Gene Ontology Phylogenetic Annotation Project. For human genes (as well as a few other organisms), PANTHER now also supports enrichment analysis using pathway classifications from the Reactome resource. The gene list enrichment tools include a new 'hierarchical view' of results, enabling users to leverage the structure of the classifications/ontologies; the tools also allow users to upload genetic variant data directly, rather than requiring prior conversion to a gene list. The updated coding single-nucleotide polymorphisms (SNP) scoring tool uses an improved algorithm. The hidden Markov model (HMM) search tools now use HMMER3, dramatically reducing search times and improving accuracy of E-value statistics. Finally, the PANTHER Tree-Attribute Viewer has been implemented in JavaScript, with new views for exploring protein sequence evolution. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. The Use of Gene Ontology Term and KEGG Pathway Enrichment for Analysis of Drug Half-Life.

    Directory of Open Access Journals (Sweden)

    Yu-Hang Zhang

    Full Text Available A drug's biological half-life is defined as the time required for the human body to metabolize or eliminate 50% of the initial drug dosage. Correctly measuring the half-life of a given drug is helpful for the safe and accurate usage of the drug. In this study, we investigated which gene ontology (GO terms and biological pathways were highly related to the determination of drug half-life. The investigated drugs, with known half-lives, were analyzed based on their enrichment scores for associated GO terms and KEGG pathways. These scores indicate which GO terms or KEGG pathways the drug targets. The feature selection method, minimum redundancy maximum relevance, was used to analyze these GO terms and KEGG pathways and to identify important GO terms and pathways, such as sodium-independent organic anion transmembrane transporter activity (GO:0015347, monoamine transmembrane transporter activity (GO:0008504, negative regulation of synaptic transmission (GO:0050805, neuroactive ligand-receptor interaction (hsa04080, serotonergic synapse (hsa04726, and linoleic acid metabolism (hsa00591, among others. This analysis confirmed our results and may show evidence for a new method in studying drug half-lives and building effective computational methods for the prediction of drug half-lives.

  5. An ontology-driven semantic mash-up of gene and biological pathway information: Application to the domain of nicotine dependence

    Science.gov (United States)

    Sahoo, Satya S.; Bodenreider, Olivier; Rutter, Joni L.; Skinner, Karen J.; Sheth, Amit P.

    2008-01-01

    Objectives This paper illustrates how Semantic Web technologies (especially RDF, OWL, and SPARQL) can support information integration and make it easy to create semantic mashups (semantically integrated resources). In the context of understanding the genetic basis of nicotine dependence, we integrate gene and pathway information and show how three complex biological queries can be answered by the integrated knowledge base. Methods We use an ontology-driven approach to integrate two gene resources (Entrez Gene and HomoloGene) and three pathway resources (KEGG, Reactome and BioCyc), for five organisms, including humans. We created the Entrez Knowledge Model (EKoM), an information model in OWL for the gene resources, and integrated it with the extant BioPAX ontology designed for pathway resources. The integrated schema is populated with data from the pathway resources, publicly available in BioPAX-compatible format, and gene resources for which a population procedure was created. The SPARQL query language is used to formulate queries over the integrated knowledge base to answer the three biological queries. Results Simple SPARQL queries could easily identify hub genes, i.e., those genes whose gene products participate in many pathways or interact with many other gene products. The identification of the genes expressed in the brain turned out to be more difficult, due to the lack of a common identification scheme for proteins. Conclusion Semantic Web technologies provide a valid framework for information integration in the life sciences. Ontology-driven integration represents a flexible, sustainable and extensible solution to the integration of large volumes of information. Additional resources, which enable the creation of mappings between information sources, are required to compensate for heterogeneity across namespaces. Resource page http://knoesis.wright.edu/research/lifesci/integration/structured_data/JBI-2008/ PMID:18395495

  6. Gene Ontology Consortium: going forward.

    Science.gov (United States)

    2015-01-01

    The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Witnessing stressful events induces glutamatergic synapse pathway alterations and gene set enrichment of positive EPSP regulation within the VTA of adult mice: An ontology based approach

    Science.gov (United States)

    Brewer, Jacob S.

    It is well known that exposure to severe stress increases the risk for developing mood disorders. Currently, the neurobiological and genetic mechanisms underlying the functional effects of psychological stress are poorly understood. Presenting a major obstacle to the study of psychological stress is the inability of current animal models of stress to distinguish between physical and psychological stressors. A novel paradigm recently developed by Warren et al., is able to tease apart the effects of physical and psychological stress in adult mice by allowing these mice to "witness," the social defeat of another mouse thus removing confounding variables associated with physical stressors. Using this 'witness' model of stress and RNA-Seq technology, the current study aims to study the genetic effects of psychological stress. After, witnessing the social defeat of another mouse, VTA tissue was extracted, sequenced, and analyzed for differential expression. Since genes often work together in complex networks, a pathway and gene ontology (GO) analysis was performed using data from the differential expression analysis. The pathway and GO analyzes revealed a perturbation of the glutamatergic synapse pathway and an enrichment of positive excitatory post-synaptic potential regulation. This is consistent with the excitatory synapse theory of depression. Together these findings demonstrate a dysregulation of the mesolimbic reward pathway at the gene level as a result of psychological stress potentially contributing to depressive like behaviors.

  8. Exploring autophagy with Gene Ontology

    Science.gov (United States)

    2018-01-01

    ABSTRACT Autophagy is a fundamental cellular process that is well conserved among eukaryotes. It is one of the strategies that cells use to catabolize substances in a controlled way. Autophagy is used for recycling cellular components, responding to cellular stresses and ridding cells of foreign material. Perturbations in autophagy have been implicated in a number of pathological conditions such as neurodegeneration, cardiac disease and cancer. The growing knowledge about autophagic mechanisms needs to be collected in a computable and shareable format to allow its use in data representation and interpretation. The Gene Ontology (GO) is a freely available resource that describes how and where gene products function in biological systems. It consists of 3 interrelated structured vocabularies that outline what gene products do at the biochemical level, where they act in a cell and the overall biological objectives to which their actions contribute. It also consists of ‘annotations’ that associate gene products with the terms. Here we describe how we represent autophagy in GO, how we create and define terms relevant to autophagy researchers and how we interrelate those terms to generate a coherent view of the process, therefore allowing an interoperable description of its biological aspects. We also describe how annotation of gene products with GO terms improves data analysis and interpretation, hence bringing a significant benefit to this field of study. PMID:29455577

  9. Ontology modeling for generation of clinical pathways

    Directory of Open Access Journals (Sweden)

    Jasmine Tehrani

    2012-12-01

    Full Text Available Purpose: Increasing costs of health care, fuelled by demand for high quality, cost-effective healthcare has drove hospitals to streamline their patient care delivery systems. One such systematic approach is the adaptation of Clinical Pathways (CP as a tool to increase the quality of healthcare delivery. However, most organizations still rely on are paper-based pathway guidelines or specifications, which have limitations in process management and as a result can influence patient safety outcomes. In this paper, we present a method for generating clinical pathways based on organizational semiotics by capturing knowledge from syntactic, semantic and pragmatic to social level. Design/methodology/approach: The proposed modeling approach to generation of CPs adopts organizational semiotics and enables the generation of semantically rich representation of CP knowledge. Semantic Analysis Method (SAM is applied to explicitly represent the semantics of the concepts, their relationships and patterns of behavior in terms of an ontology chart. Norm Analysis Method (NAM is adopted to identify and formally specify patterns of behavior and rules that govern the actions identified on the ontology chart. Information collected during semantic and norm analysis is integrated to guide the generation of CPs using best practice represented in BPMN thus enabling the automation of CP. Findings: This research confirms the necessity of taking into consideration social aspects in designing information systems and automating CP. The complexity of healthcare processes can be best tackled by analyzing stakeholders, which we treat as social agents, their goals and patterns of action within the agent network. Originality/value: The current modeling methods describe CPs from a structural aspect comprising activities, properties and interrelationships. However, these methods lack a mechanism to describe possible patterns of human behavior and the conditions under which the

  10. Bayesian assignment of gene ontology terms to gene expression experiments.

    Science.gov (United States)

    Sykacek, P

    2012-09-15

    Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Source code under GPL license is available from the author. peter.sykacek@boku.ac.at.

  11. Bayesian assignment of gene ontology terms to gene expression experiments

    Science.gov (United States)

    Sykacek, P.

    2012-01-01

    Motivation: Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. Results: This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Availability: Source code under GPL license is available from the author. Contact: peter.sykacek@boku.ac.at PMID:22962488

  12. Defining functional distances over Gene Ontology

    Directory of Open Access Journals (Sweden)

    del Pozo Angela

    2008-01-01

    Full Text Available Abstract Background A fundamental problem when trying to define the functional relationships between proteins is the difficulty in quantifying functional similarities, even when well-structured ontologies exist regarding the activity of proteins (i.e. 'gene ontology' -GO-. However, functional metrics can overcome the problems in the comparing and evaluating functional assignments and predictions. As a reference of proximity, previous approaches to compare GO terms considered linkage in terms of ontology weighted by a probability distribution that balances the non-uniform 'richness' of different parts of the Direct Acyclic Graph. Here, we have followed a different approach to quantify functional similarities between GO terms. Results We propose a new method to derive 'functional distances' between GO terms that is based on the simultaneous occurrence of terms in the same set of Interpro entries, instead of relying on the structure of the GO. The coincidence of GO terms reveals natural biological links between the GO functions and defines a distance model Df which fulfils the properties of a Metric Space. The distances obtained in this way can be represented as a hierarchical 'Functional Tree'. Conclusion The method proposed provides a new definition of distance that enables the similarity between GO terms to be quantified. Additionally, the 'Functional Tree' defines groups with biological meaning enhancing its utility for protein function comparison and prediction. Finally, this approach could be for function-based protein searches in databases, and for analysing the gene clusters produced by DNA array experiments.

  13. The Gene Ontology (GO) Cellular Component Ontology: integration with SAO (Subcellular Anatomy Ontology) and other recent developments

    Science.gov (United States)

    2013-01-01

    Background The Gene Ontology (GO) (http://www.geneontology.org/) contains a set of terms for describing the activity and actions of gene products across all kingdoms of life. Each of these activities is executed in a location within a cell or in the vicinity of a cell. In order to capture this context, the GO includes a sub-ontology called the Cellular Component (CC) ontology (GO-CCO). The primary use of this ontology is for GO annotation, but it has also been used for phenotype annotation, and for the annotation of images. Another ontology with similar scope to the GO-CCO is the Subcellular Anatomy Ontology (SAO), part of the Neuroscience Information Framework Standard (NIFSTD) suite of ontologies. The SAO also covers cell components, but in the domain of neuroscience. Description Recently, the GO-CCO was enriched in content and links to the Biological Process and Molecular Function branches of GO as well as to other ontologies. This was achieved in several ways. We carried out an amalgamation of SAO terms with GO-CCO ones; as a result, nearly 100 new neuroscience-related terms were added to the GO. The GO-CCO also contains relationships to GO Biological Process and Molecular Function terms, as well as connecting to external ontologies such as the Cell Ontology (CL). Terms representing protein complexes in the Protein Ontology (PRO) reference GO-CCO terms for their species-generic counterparts. GO-CCO terms can also be used to search a variety of databases. Conclusions In this publication we provide an overview of the GO-CCO, its overall design, and some recent extensions that make use of additional spatial information. One of the most recent developments of the GO-CCO was the merging in of the SAO, resulting in a single unified ontology designed to serve the needs of GO annotators as well as the specific needs of the neuroscience community. PMID:24093723

  14. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  15. Extracting Cross-Ontology Weighted Association Rules from Gene Ontology Annotations.

    Science.gov (United States)

    Agapito, Giuseppe; Milano, Marianna; Guzzi, Pietro Hiram; Cannataro, Mario

    2016-01-01

    Gene Ontology (GO) is a structured repository of concepts (GO Terms) that are associated to one or more gene products through a process referred to as annotation. The analysis of annotated data is an important opportunity for bioinformatics. There are different approaches of analysis, among those, the use of association rules (AR) which provides useful knowledge, discovering biologically relevant associations between terms of GO, not previously known. In a previous work, we introduced GO-WAR (Gene Ontology-based Weighted Association Rules), a methodology for extracting weighted association rules from ontology-based annotated datasets. We here adapt the GO-WAR algorithm to mine cross-ontology association rules, i.e., rules that involve GO terms present in the three sub-ontologies of GO. We conduct a deep performance evaluation of GO-WAR by mining publicly available GO annotated datasets, showing how GO-WAR outperforms current state of the art approaches.

  16. Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

    Science.gov (United States)

    Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

    2017-10-01

    Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.

  17. Fast gene ontology based clustering for microarray experiments.

    Science.gov (United States)

    Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

    2008-11-21

    Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  18. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO, which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s of infection. It can also aid in the discovery of genes associated with specific function(s for investigation as a novel vaccine or therapeutic targets.http://turing.ersa.edu.au/BacteriaGO.

  19. Determining the semantic similarities among Gene Ontology terms.

    Science.gov (United States)

    Taha, Kamal

    2013-05-01

    We present in this paper novel techniques that determine the semantic relationships among GeneOntology (GO) terms. We implemented these techniques in a prototype system called GoSE, which resides between user application and GO database. Given a set S of GO terms, GoSE would return another set S' of GO terms, where each term in S' is semantically related to each term in S. Most current research is focused on determining the semantic similarities among GO ontology terms based solely on their IDs and proximity to one another in the GO graph structure, while overlooking the contexts of the terms, which may lead to erroneous results. The context of a GO term T is the set of other terms, whose existence in the GO graph structure is dependent on T. We propose novel techniques that determine the contexts of terms based on the concept of existence dependency. We present a stack-based sort-merge algorithm employing these techniques for determining the semantic similarities among GO terms.We evaluated GoSE experimentally and compared it with three existing methods. The results of measuring the semantic similarities among genes in KEGG and Pfam pathways retrieved from the DBGET and Sanger Pfam databases, respectively, have shown that our method outperforms the other three methods in recall and precision.

  20. The representation of heart development in the gene ontology.

    Science.gov (United States)

    Khodiyar, Varsha K; Hill, David P; Howe, Doug; Berardini, Tanya Z; Tweedie, Susan; Talmud, Philippa J; Breckenridge, Ross; Bhattarcharya, Shoumo; Riley, Paul; Scambler, Peter; Lovering, Ruth C

    2011-06-01

    An understanding of heart development is critical in any systems biology approach to cardiovascular disease. The interpretation of data generated from high-throughput technologies (such as microarray and proteomics) is also essential to this approach. However, characterizing the role of genes in the processes underlying heart development and cardiovascular disease involves the non-trivial task of data analysis and integration of previous knowledge. The Gene Ontology (GO) Consortium provides structured controlled biological vocabularies that are used to summarize previous functional knowledge for gene products across all species. One aspect of GO describes biological processes, such as development and signaling. In order to support high-throughput cardiovascular research, we have initiated an effort to fully describe heart development in GO; expanding the number of GO terms describing heart development from 12 to over 280. This new ontology describes heart morphogenesis, the differentiation of specific cardiac cell types, and the involvement of signaling pathways in heart development. This work also aligns GO with the current views of the heart development research community and its representation in the literature. This extension of GO allows gene product annotators to comprehensively capture the genetic program leading to the developmental progression of the heart. This will enable users to integrate heart development data across species, resulting in the comprehensive retrieval of information about this subject. The revised GO structure, combined with gene product annotations, should improve the interpretation of data from high-throughput methods in a variety of cardiovascular research areas, including heart development, congenital cardiac disease, and cardiac stem cell research. Additionally, we invite the heart development community to contribute to the expansion of this important dataset for the benefit of future research in this area. Copyright © 2011

  1. The Representation of Heart Development in the Gene Ontology

    Science.gov (United States)

    Khodiyar, Varsha K.; Hill, David P.; Howe, Doug; Berardini, Tanya Z.; Tweedie, Susan; Talmud, Philippa J.; Breckenridge, Ross; Bhattarcharya, Shoumo; Riley, Paul; Scambler, Peter; Lovering, Ruth C.

    2012-01-01

    An understanding of heart development is critical in any systems biology approach to cardiovascular disease. The interpretation of data generated from high-throughput technologies (such as microarray and proteomics) is also essential to this approach. However, characterizing the role of genes in the processes underlying heart development and cardiovascular disease involves the non-trivial task of data analysis and integration of previous knowledge. The Gene Ontology (GO) Consortium provides structured controlled biological vocabularies that are used to summarize previous functional knowledge for gene products across all species. One aspect of GO describes biological processes, such as development and signaling. In order to support high-throughput cardiovascular research, we have initiated an effort to fully describe heart development in GO; expanding the number of GO terms describing heart development from 12 to over 280. This new ontology describes heart morphogenesis, the differentiation of specific cardiac cell types, and the involvement of signaling pathways in heart development and aligns GO with the current views of the heart development research community and its representation in the literature. This extension of GO allows gene product annotators to comprehensively capture the genetic program leading to the developmental progression of the heart. This will enable users to integrate heart development data across species, resulting in the comprehensive retrieval of information about this subject. The revised GO structure, combined with gene product annotations, should improve the interpretation of data from high-throughput methods in a variety of cardiovascular research areas, including heart development, congenital cardiac disease, and cardiac stem cell research. Additionally, we invite the heart development community to contribute to the expansion of this important dataset for the benefit of future research in this area. PMID:21419760

  2. Gene Ontology and KEGG Enrichment Analyses of Genes Related to Age-Related Macular Degeneration

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-01-01

    Full Text Available Identifying disease genes is one of the most important topics in biomedicine and may facilitate studies on the mechanisms underlying disease. Age-related macular degeneration (AMD is a serious eye disease; it typically affects older adults and results in a loss of vision due to retina damage. In this study, we attempt to develop an effective method for distinguishing AMD-related genes. Gene ontology and KEGG enrichment analyses of known AMD-related genes were performed, and a classification system was established. In detail, each gene was encoded into a vector by extracting enrichment scores of the gene set, including it and its direct neighbors in STRING, and gene ontology terms or KEGG pathways. Then certain feature-selection methods, including minimum redundancy maximum relevance and incremental feature selection, were adopted to extract key features for the classification system. As a result, 720 GO terms and 11 KEGG pathways were deemed the most important factors for predicting AMD-related genes.

  3. Integrating Ontological Knowledge and Textual Evidence in Estimating Gene and Gene Product Similarity

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Posse, Christian; Gopalan, Banu; Tratz, Stephen C.; Gregory, Michelle L.

    2006-06-08

    With the rising influence of the Gene On-tology, new approaches have emerged where the similarity between genes or gene products is obtained by comparing Gene Ontology code annotations associ-ated with them. So far, these approaches have solely relied on the knowledge en-coded in the Gene Ontology and the gene annotations associated with the Gene On-tology database. The goal of this paper is to demonstrate that improvements to these approaches can be obtained by integrating textual evidence extracted from relevant biomedical literature.

  4. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

    Science.gov (United States)

    Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

    2014-06-01

    With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.

  5. Aligning ontologies and integrating textual evidence for pathway analysis of microarray data

    Energy Technology Data Exchange (ETDEWEB)

    Gopalan, Banu; Posse, Christian; Sanfilippo, Antonio P.; Stenzel-Poore, Mary; Stevens, S.L.; Castano, Jose; Beagley, Nathaniel; Riensche, Roderick M.; Baddeley, Bob; Simon, R.P.; Pustejovsky, James

    2006-10-08

    Expression arrays are introducing a paradigmatic change in biology by shifting experimental approaches from single gene studies to genome-level analysis, monitoring the ex-pression levels of several thousands of genes in parallel. The massive amounts of data obtained from the microarray data needs to be integrated and interpreted to infer biological meaning within the context of information-rich pathways. In this paper, we present a methodology that integrates textual information with annotations from cross-referenced ontolo-gies to map genes to pathways in a semi-automated way. We illustrate this approach and compare it favorably to other tools by analyzing the gene expression changes underlying the biological phenomena related to stroke. Stroke is the third leading cause of death and a major disabler in the United States. Through years of study, researchers have amassed a significant knowledge base about stroke, and this knowledge, coupled with new technologies, is providing a wealth of new scientific opportunities. The potential for neu-roprotective stroke therapy is enormous. However, the roles of neurogenesis, angiogenesis, and other proliferative re-sponses in the recovery process following ischemia and the molecular mechanisms that lead to these processes still need to be uncovered. Improved annotation of genomic and pro-teomic data, including annotation of pathways in which genes and proteins are involved, is required to facilitate their interpretation and clinical application. While our approach is not aimed at replacing existing curated pathway databases, it reveals multiple hidden relationships that are not evident with the way these databases analyze functional groupings of genes from the Gene Ontology.

  6. Fast Gene Ontology based clustering for microarray experiments

    Directory of Open Access Journals (Sweden)

    Ovaska Kristian

    2008-11-01

    Full Text Available Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  7. Prediction of human protein function according to Gene Ontology categories

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Stærfeldt, Hans Henrik

    2003-01-01

    developed a method for prediction of protein function for a subset of classes from the Gene Ontology classification scheme. This subset includes several pharmaceutically interesting categories-transcription factors, receptors, ion channels, stress and immune response proteins, hormones and growth factors...

  8. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

    Science.gov (United States)

    Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

    2016-03-11

    Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

  9. Dovetailing biology and chemistry: integrating the Gene Ontology with the ChEBI chemical ontology

    Science.gov (United States)

    2013-01-01

    Background The Gene Ontology (GO) facilitates the description of the action of gene products in a biological context. Many GO terms refer to chemical entities that participate in biological processes. To facilitate accurate and consistent systems-wide biological representation, it is necessary to integrate the chemical view of these entities with the biological view of GO functions and processes. We describe a collaborative effort between the GO and the Chemical Entities of Biological Interest (ChEBI) ontology developers to ensure that the representation of chemicals in the GO is both internally consistent and in alignment with the chemical expertise captured in ChEBI. Results We have examined and integrated the ChEBI structural hierarchy into the GO resource through computationally-assisted manual curation of both GO and ChEBI. Our work has resulted in the creation of computable definitions of GO terms that contain fully defined semantic relationships to corresponding chemical terms in ChEBI. Conclusions The set of logical definitions using both the GO and ChEBI has already been used to automate aspects of GO development and has the potential to allow the integration of data across the domains of biology and chemistry. These logical definitions are available as an extended version of the ontology from http://purl.obolibrary.org/obo/go/extensions/go-plus.owl. PMID:23895341

  10. Classifying genes to the correct Gene Ontology Slim term in Saccharomyces cerevisiae using neighbouring genes with classification learning

    Directory of Open Access Journals (Sweden)

    Tsatsoulis Costas

    2010-05-01

    Full Text Available Abstract Background There is increasing evidence that gene location and surrounding genes influence the functionality of genes in the eukaryotic genome. Knowing the Gene Ontology Slim terms associated with a gene gives us insight into a gene's functionality by informing us how its gene product behaves in a cellular context using three different ontologies: molecular function, biological process, and cellular component. In this study, we analyzed if we could classify a gene in Saccharomyces cerevisiae to its correct Gene Ontology Slim term using information about its location in the genome and information from its nearest-neighbouring genes using classification learning. Results We performed experiments to establish that the MultiBoostAB algorithm using the J48 classifier could correctly classify Gene Ontology Slim terms of a gene given information regarding the gene's location and information from its nearest-neighbouring genes for training. Different neighbourhood sizes were examined to determine how many nearest neighbours should be included around each gene to provide better classification rules. Our results show that by just incorporating neighbour information from each gene's two-nearest neighbours, the percentage of correctly classified genes to their correct Gene Ontology Slim term for each ontology reaches over 80% with high accuracy (reflected in F-measures over 0.80 of the classification rules produced. Conclusions We confirmed that in classifying genes to their correct Gene Ontology Slim term, the inclusion of neighbour information from those genes is beneficial. Knowing the location of a gene and the Gene Ontology Slim information from neighbouring genes gives us insight into that gene's functionality. This benefit is seen by just including information from a gene's two-nearest neighbouring genes.

  11. Protein Annotation from Protein Interaction Networks and Gene Ontology

    OpenAIRE

    Nguyen, Cao D.; Gardiner, Katheleen J.; Cios, Krzysztof J.

    2011-01-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precis...

  12. [Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

    Science.gov (United States)

    Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

    2012-07-01

    In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.

  13. GOseek: a gene ontology search engine using enhanced keywords.

    Science.gov (United States)

    Taha, Kamal

    2013-01-01

    We propose in this paper a biological search engine called GOseek, which overcomes the limitation of current gene similarity tools. Given a set of genes, GOseek returns the most significant genes that are semantically related to the given genes. These returned genes are usually annotated to one of the Lowest Common Ancestors (LCA) of the Gene Ontology (GO) terms annotating the given genes. Most genes have several annotation GO terms. Therefore, there may be more than one LCA for the GO terms annotating the given genes. The LCA annotating the genes that are most semantically related to the given gene is the one that receives the most aggregate semantic contribution from the GO terms annotating the given genes. To identify this LCA, GOseek quantifies the contribution of the GO terms annotating the given genes to the semantics of their LCAs. That is, it encodes the semantic contribution into a numeric format. GOseek uses microarray experiment data to rank result genes based on their significance. We evaluated GOseek experimentally and compared it with a comparable gene prediction tool. Results showed marked improvement over the tool.

  14. Gene ontology based transfer learning for protein subcellular localization

    Directory of Open Access Journals (Sweden)

    Zhou Shuigeng

    2011-02-01

    Full Text Available Abstract Background Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as GO, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the GO terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology. Results In this paper, we propose a Gene Ontology Based Transfer Learning Model (GO-TLM for large-scale protein subcellular localization. The model transfers the signature-based homologous GO terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false GO terms that are resulted from evolutionary divergence. We derive three GO kernels from the three aspects of gene ontology to measure the GO similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for

  15. Correlating Information Contents of Gene Ontology Terms to Infer Semantic Similarity of Gene Products

    Directory of Open Access Journals (Sweden)

    Mingxin Gan

    2014-01-01

    Full Text Available Successful applications of the gene ontology to the inference of functional relationships between gene products in recent years have raised the need for computational methods to automatically calculate semantic similarity between gene products based on semantic similarity of gene ontology terms. Nevertheless, existing methods, though having been widely used in a variety of applications, may significantly overestimate semantic similarity between genes that are actually not functionally related, thereby yielding misleading results in applications. To overcome this limitation, we propose to represent a gene product as a vector that is composed of information contents of gene ontology terms annotated for the gene product, and we suggest calculating similarity between two gene products as the relatedness of their corresponding vectors using three measures: Pearson’s correlation coefficient, cosine similarity, and the Jaccard index. We focus on the biological process domain of the gene ontology and annotations of yeast proteins to study the effectiveness of the proposed measures. Results show that semantic similarity scores calculated using the proposed measures are more consistent with known biological knowledge than those derived using a list of existing methods, suggesting the effectiveness of our method in characterizing functional relationships between gene products.

  16. Protein annotation from protein interaction networks and Gene Ontology.

    Science.gov (United States)

    Nguyen, Cao D; Gardiner, Katheleen J; Cios, Krzysztof J

    2011-10-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precision and 60% recall versus 45% and 26% for Majority and 24% and 61% for χ²-statistics, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.

  17. Text Mining to Support Gene Ontology Curation and Vice Versa.

    Science.gov (United States)

    Ruch, Patrick

    2017-01-01

    In this chapter, we explain how text mining can support the curation of molecular biology databases dealing with protein functions. We also show how curated data can play a disruptive role in the developments of text mining methods. We review a decade of efforts to improve the automatic assignment of Gene Ontology (GO) descriptors, the reference ontology for the characterization of genes and gene products. To illustrate the high potential of this approach, we compare the performances of an automatic text categorizer and show a large improvement of +225 % in both precision and recall on benchmarked data. We argue that automatic text categorization functions can ultimately be embedded into a Question-Answering (QA) system to answer questions related to protein functions. Because GO descriptors can be relatively long and specific, traditional QA systems cannot answer such questions. A new type of QA system, so-called Deep QA which uses machine learning methods trained with curated contents, is thus emerging. Finally, future advances of text mining instruments are directly dependent on the availability of high-quality annotated contents at every curation step. Databases workflows must start recording explicitly all the data they curate and ideally also some of the data they do not curate.

  18. GOPET: A tool for automated predictions of Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Glatting Karl-Heinz

    2006-03-01

    Full Text Available Abstract Background Vast progress in sequencing projects has called for annotation on a large scale. A Number of methods have been developed to address this challenging task. These methods, however, either apply to specific subsets, or their predictions are not formalised, or they do not provide precise confidence values for their predictions. Description We recently established a learning system for automated annotation, trained with a broad variety of different organisms to predict the standardised annotation terms from Gene Ontology (GO. Now, this method has been made available to the public via our web-service GOPET (Gene Ontology term Prediction and Evaluation Tool. It supplies annotation for sequences of any organism. For each predicted term an appropriate confidence value is provided. The basic method had been developed for predicting molecular function GO-terms. It is now expanded to predict biological process terms. This web service is available via http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar Conclusion Our web service gives experimental researchers as well as the bioinformatics community a valuable sequence annotation device. Additionally, GOPET also provides less significant annotation data which may serve as an extended discovery platform for the user.

  19. Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new GO relationships.

    Science.gov (United States)

    Manda, Prashanti; McCarthy, Fiona; Bridges, Susan M

    2013-10-01

    The Gene Ontology (GO), a set of three sub-ontologies, is one of the most popular bio-ontologies used for describing gene product characteristics. GO annotation data containing terms from multiple sub-ontologies and at different levels in the ontologies is an important source of implicit relationships between terms from the three sub-ontologies. Data mining techniques such as association rule mining that are tailored to mine from multiple ontologies at multiple levels of abstraction are required for effective knowledge discovery from GO annotation data. We present a data mining approach, Multi-ontology data mining at All Levels (MOAL) that uses the structure and relationships of the GO to mine multi-ontology multi-level association rules. We introduce two interestingness measures: Multi-ontology Support (MOSupport) and Multi-ontology Confidence (MOConfidence) customized to evaluate multi-ontology multi-level association rules. We also describe a variety of post-processing strategies for pruning uninteresting rules. We use publicly available GO annotation data to demonstrate our methods with respect to two applications (1) the discovery of co-annotation suggestions and (2) the discovery of new cross-ontology relationships. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Improving Interpretation of Cardiac Phenotypes and Enhancing Discovery With Expanded Knowledge in the Gene Ontology.

    Science.gov (United States)

    Lovering, Ruth C; Roncaglia, Paola; Howe, Douglas G; Laulederkind, Stanley J F; Khodiyar, Varsha K; Berardini, Tanya Z; Tweedie, Susan; Foulger, Rebecca E; Osumi-Sutherland, David; Campbell, Nancy H; Huntley, Rachael P; Talmud, Philippa J; Blake, Judith A; Breckenridge, Ross; Riley, Paul R; Lambiase, Pier D; Elliott, Perry M; Clapp, Lucie; Tinker, Andrew; Hill, David P

    2018-02-01

    A systems biology approach to cardiac physiology requires a comprehensive representation of how coordinated processes operate in the heart, as well as the ability to interpret relevant transcriptomic and proteomic experiments. The Gene Ontology (GO) Consortium provides structured, controlled vocabularies of biological terms that can be used to summarize and analyze functional knowledge for gene products. In this study, we created a computational resource to facilitate genetic studies of cardiac physiology by integrating literature curation with attention to an improved and expanded ontological representation of heart processes in the Gene Ontology. As a result, the Gene Ontology now contains terms that comprehensively describe the roles of proteins in cardiac muscle cell action potential, electrical coupling, and the transmission of the electrical impulse from the sinoatrial node to the ventricles. Evaluating the effectiveness of this approach to inform data analysis demonstrated that Gene Ontology annotations, analyzed within an expanded ontological context of heart processes, can help to identify candidate genes associated with arrhythmic disease risk loci. We determined that a combination of curation and ontology development for heart-specific genes and processes supports the identification and downstream analysis of genes responsible for the spread of the cardiac action potential through the heart. Annotating these genes and processes in a structured format facilitates data analysis and supports effective retrieval of gene-centric information about cardiac defects. © 2018 The Authors.

  1. Codon bias and gene ontology in holometabolous and hemimetabolous insects.

    Science.gov (United States)

    Carlini, David B; Makowski, Matthew

    2015-12-01

    The relationship between preferred codon use (PCU), developmental mode, and gene ontology (GO) was investigated in a sample of nine insect species with sequenced genomes. These species were selected to represent two distinct modes of insect development, holometabolism and hemimetabolism, with an aim toward determining whether the differences in developmental timing concomitant with developmental mode would be mirrored by differences in PCU in their developmental genes. We hypothesized that the developmental genes of holometabolous insects should be under greater selective pressure for efficient translation, manifest as increased PCU, than those of hemimetabolous insects because holometabolism requires abundant protein expression over shorter time intervals than hemimetabolism, where proteins are required more uniformly in time. Preferred codon sets were defined for each species, from which the frequency of PCU for each gene was obtained. Although there were substantial differences in the genomic base composition of holometabolous and hemimetabolous insects, both groups exhibited a general preference for GC-ending codons, with the former group having higher PCU averaged across all genes. For each species, the biological process GO term for each gene was assigned that of its Drosophila homolog(s), and PCU was calculated for each GO term category. The top two GO term categories for PCU enrichment in the holometabolous insects were anatomical structure development and cell differentiation. The increased PCU in the developmental genes of holometabolous insects may reflect a general strategy to maximize the protein production of genes expressed in bursts over short time periods, e.g., heat shock proteins. J. Exp. Zool. (Mol. Dev. Evol.) 324B: 686-698, 2015. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

  2. Integration of the Gene Ontology into an object-oriented architecture

    Directory of Open Access Journals (Sweden)

    Zheng W Jim

    2005-05-01

    Full Text Available Abstract Background To standardize gene product descriptions, a formal vocabulary defined as the Gene Ontology (GO has been developed. GO terms have been categorized into biological processes, molecular functions, and cellular components. However, there is no single representation that integrates all the terms into one cohesive model. Furthermore, GO definitions have little information explaining the underlying architecture that forms these terms, such as the dynamic and static events occurring in a process. In contrast, object-oriented models have been developed to show dynamic and static events. A portion of the TGF-beta signaling pathway, which is involved in numerous cellular events including cancer, differentiation and development, was used to demonstrate the feasibility of integrating the Gene Ontology into an object-oriented model. Results Using object-oriented models we have captured the static and dynamic events that occur during a representative GO process, "transforming growth factor-beta (TGF-beta receptor complex assembly" (GO:0007181. Conclusion We demonstrate that the utility of GO terms can be enhanced by object-oriented technology, and that the GO terms can be integrated into an object-oriented model by serving as a basis for the generation of object functions and attributes.

  3. Networks in biological systems: An investigation of the Gene Ontology as an evolving network

    International Nuclear Information System (INIS)

    Coronnello, C; Tumminello, M; Micciche, S; Mantegna, R.N.

    2009-01-01

    Many biological systems can be described as networks where different elements interact, in order to perform biological processes. We introduce a network associated with the Gene Ontology. Specifically, we construct a correlation-based network where the vertices are the terms of the Gene Ontology and the link between each two terms is weighted on the basis of the number of genes that they have in common. We analyze a filtered network obtained from the correlation-based network and we characterize its evolution over different releases of the Gene Ontology.

  4. Representing virus-host interactions and other multi-organism processes in the Gene Ontology.

    Science.gov (United States)

    Foulger, R E; Osumi-Sutherland, D; McIntosh, B K; Hulo, C; Masson, P; Poux, S; Le Mercier, P; Lomax, J

    2015-07-28

    The Gene Ontology project is a collaborative effort to provide descriptions of gene products in a consistent and computable language, and in a species-independent manner. The Gene Ontology is designed to be applicable to all organisms but up to now has been largely under-utilized for prokaryotes and viruses, in part because of a lack of appropriate ontology terms. To address this issue, we have developed a set of Gene Ontology classes that are applicable to microbes and their hosts, improving both coverage and quality in this area of the Gene Ontology. Describing microbial and viral gene products brings with it the additional challenge of capturing both the host and the microbe. Recognising this, we have worked closely with annotation groups to test and optimize the GO classes, and we describe here a set of annotation guidelines that allow the controlled description of two interacting organisms. Building on the microbial resources already in existence such as ViralZone, UniProtKB keywords and MeGO, this project provides an integrated ontology to describe interactions between microbial species and their hosts, with mappings to the external resources above. Housing this information within the freely-accessible Gene Ontology project allows the classes and annotation structure to be utilized by a large community of biologists and users.

  5. OAHG: an integrated resource for annotating human genes with multi-level ontologies.

    Science.gov (United States)

    Cheng, Liang; Sun, Jie; Xu, Wanying; Dong, Lixiang; Hu, Yang; Zhou, Meng

    2016-10-05

    OAHG, an integrated resource, aims to establish a comprehensive functional annotation resource for human protein-coding genes (PCGs), miRNAs, and lncRNAs by multi-level ontologies involving Gene Ontology (GO), Disease Ontology (DO), and Human Phenotype Ontology (HPO). Many previous studies have focused on inferring putative properties and biological functions of PCGs and non-coding RNA genes from different perspectives. During the past several decades, a few of databases have been designed to annotate the functions of PCGs, miRNAs, and lncRNAs, respectively. A part of functional descriptions in these databases were mapped to standardize terminologies, such as GO, which could be helpful to do further analysis. Despite these developments, there is no comprehensive resource recording the function of these three important types of genes. The current version of OAHG, release 1.0 (Jun 2016), integrates three ontologies involving GO, DO, and HPO, six gene functional databases and two interaction databases. Currently, OAHG contains 1,434,694 entries involving 16,929 PCGs, 637 miRNAs, 193 lncRNAs, and 24,894 terms of ontologies. During the performance evaluation, OAHG shows the consistencies with existing gene interactions and the structure of ontology. For example, terms with more similar structure could be associated with more associated genes (Pearson correlation γ 2  = 0.2428, p < 2.2e-16).

  6. Gene Ontology annotation of the rice blast fungus, Magnaporthe oryzae

    Directory of Open Access Journals (Sweden)

    Deng Jixin

    2009-02-01

    Full Text Available Abstract Background Magnaporthe oryzae, the causal agent of blast disease of rice, is the most destructive disease of rice worldwide. The genome of this fungal pathogen has been sequenced and an automated annotation has recently been updated to Version 6 http://www.broad.mit.edu/annotation/genome/magnaporthe_grisea/MultiDownloads.html. However, a comprehensive manual curation remains to be performed. Gene Ontology (GO annotation is a valuable means of assigning functional information using standardized vocabulary. We report an overview of the GO annotation for Version 5 of M. oryzae genome assembly. Methods A similarity-based (i.e., computational GO annotation with manual review was conducted, which was then integrated with a literature-based GO annotation with computational assistance. For similarity-based GO annotation a stringent reciprocal best hits method was used to identify similarity between predicted proteins of M. oryzae and GO proteins from multiple organisms with published associations to GO terms. Significant alignment pairs were manually reviewed. Functional assignments were further cross-validated with manually reviewed data, conserved domains, or data determined by wet lab experiments. Additionally, biological appropriateness of the functional assignments was manually checked. Results In total, 6,286 proteins received GO term assignment via the homology-based annotation, including 2,870 hypothetical proteins. Literature-based experimental evidence, such as microarray, MPSS, T-DNA insertion mutation, or gene knockout mutation, resulted in 2,810 proteins being annotated with GO terms. Of these, 1,673 proteins were annotated with new terms developed for Plant-Associated Microbe Gene Ontology (PAMGO. In addition, 67 experiment-determined secreted proteins were annotated with PAMGO terms. Integration of the two data sets resulted in 7,412 proteins (57% being annotated with 1,957 distinct and specific GO terms. Unannotated proteins

  7. A methodology to migrate the gene ontology to a description logic environment using DAML+OIL.

    Science.gov (United States)

    Wroe, C J; Stevens, R; Goble, C A; Ashburner, M

    2003-01-01

    The Gene Ontology Next Generation Project (GONG) is developing a staged methodology to evolve the current representation of the Gene Ontology into DAML+OIL in order to take advantage of the richer formal expressiveness and the reasoning capabilities of the underlying description logic. Each stage provides a step level increase in formal explicit semantic content with a view to supporting validation, extension and multiple classification of the Gene Ontology. The paper introduces DAML+OIL and demonstrates the activity within each stage of the methodology and the functionality gained.

  8. Evaluating Functional Annotations of Enzymes Using the Gene Ontology.

    Science.gov (United States)

    Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal; Babbitt, Patricia C

    2017-01-01

    The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. From identifying the nearest well annotated homologue of a protein of interest to predicting where misannotation has occurred to knowing how confident you can be in the annotations assigned to those proteins is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene products, we focus here a describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.

  9. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    Science.gov (United States)

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  10. Multi-label literature classification based on the Gene Ontology graph

    Directory of Open Access Journals (Sweden)

    Lu Xinghua

    2008-12-01

    Full Text Available Abstract Background The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. Results In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Conclusion Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate

  11. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  12. Protein-Protein Interaction Network and Gene Ontology

    Science.gov (United States)

    Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah

    Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.

  13. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  14. A robust data-driven approach for gene ontology annotation.

    Science.gov (United States)

    Li, Yanpeng; Yu, Hong

    2014-01-01

    Gene ontology (GO) and GO annotation are important resources for biological information management and knowledge discovery, but the speed of manual annotation became a major bottleneck of database curation. BioCreative IV GO annotation task aims to evaluate the performance of system that automatically assigns GO terms to genes based on the narrative sentences in biomedical literature. This article presents our work in this task as well as the experimental results after the competition. For the evidence sentence extraction subtask, we built a binary classifier to identify evidence sentences using reference distance estimator (RDE), a recently proposed semi-supervised learning method that learns new features from around 10 million unlabeled sentences, achieving an F1 of 19.3% in exact match and 32.5% in relaxed match. In the post-submission experiment, we obtained 22.1% and 35.7% F1 performance by incorporating bigram features in RDE learning. In both development and test sets, RDE-based method achieved over 20% relative improvement on F1 and AUC performance against classical supervised learning methods, e.g. support vector machine and logistic regression. For the GO term prediction subtask, we developed an information retrieval-based method to retrieve the GO term most relevant to each evidence sentence using a ranking function that combined cosine similarity and the frequency of GO terms in documents, and a filtering method based on high-level GO classes. The best performance of our submitted runs was 7.8% F1 and 22.2% hierarchy F1. We found that the incorporation of frequency information and hierarchy filtering substantially improved the performance. In the post-submission evaluation, we obtained a 10.6% F1 using a simpler setting. Overall, the experimental analysis showed our approaches were robust in both the two tasks. © The Author(s) 2014. Published by Oxford University Press.

  15. Length bias correction in gene ontology enrichment analysis using logistic regression.

    Science.gov (United States)

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.

  16. Gene-ontology enrichment analysis in two independent family-based samples highlights biologically plausible processes for autism spectrum disorders.

    LENUS (Irish Health Repository)

    Anney, Richard J L

    2012-02-01

    Recent genome-wide association studies (GWAS) have implicated a range of genes from discrete biological pathways in the aetiology of autism. However, despite the strong influence of genetic factors, association studies have yet to identify statistically robust, replicated major effect genes or SNPs. We apply the principle of the SNP ratio test methodology described by O\\'Dushlaine et al to over 2100 families from the Autism Genome Project (AGP). Using a two-stage design we examine association enrichment in 5955 unique gene-ontology classifications across four groupings based on two phenotypic and two ancestral classifications. Based on estimates from simulation we identify excess of association enrichment across all analyses. We observe enrichment in association for sets of genes involved in diverse biological processes, including pyruvate metabolism, transcription factor activation, cell-signalling and cell-cycle regulation. Both genes and processes that show enrichment have previously been examined in autistic disorders and offer biologically plausibility to these findings.

  17. Aspergillus flavus Blast2GO gene ontology database: elevated growth temperature alters amino acid metabolism

    Science.gov (United States)

    The availability of a representative gene ontology (GO) database is a prerequisite for a successful functional genomics study. Using online Blast2GO resources we constructed a GO database of Aspergillus flavus. Of the predicted total 13,485 A. flavus genes 8,987 were annotated with GO terms. The mea...

  18. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    Science.gov (United States)

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  19. Semantic Mining based on graph theory and ontologies. Case Study: Cell Signaling Pathways

    Directory of Open Access Journals (Sweden)

    Carlos R. Rangel

    2016-08-01

    Full Text Available In this paper we use concepts from graph theory and cellular biology represented as ontologies, to carry out semantic mining tasks on signaling pathway networks. Specifically, the paper describes the semantic enrichment of signaling pathway networks. A cell signaling network describes the basic cellular activities and their interactions. The main contribution of this paper is in the signaling pathway research area, it proposes a new technique to analyze and understand how changes in these networks may affect the transmission and flow of information, which produce diseases such as cancer and diabetes. Our approach is based on three concepts from graph theory (modularity, clustering and centrality frequently used on social networks analysis. Our approach consists into two phases: the first uses the graph theory concepts to determine the cellular groups in the network, which we will call them communities; the second uses ontologies for the semantic enrichment of the cellular communities. The measures used from the graph theory allow us to determine the set of cells that are close (for example, in a disease, and the main cells in each community. We analyze our approach in two cases: TGF-ß and the Alzheimer Disease.

  20. Annotating activation/inhibition relationships to protein-protein interactions using gene ontology relations.

    Science.gov (United States)

    Yim, Soorin; Yu, Hasun; Jang, Dongjin; Lee, Doheon

    2018-04-11

    Signaling pathways can be reconstructed by identifying 'effect types' (i.e. activation/inhibition) of protein-protein interactions (PPIs). Effect types are composed of 'directions' (i.e. upstream/downstream) and 'signs' (i.e. positive/negative), thereby requiring directions as well as signs of PPIs to predict signaling events from PPI networks. Here, we propose a computational method for systemically annotating effect types to PPIs using relations between functional information of proteins. We used regulates, positively regulates, and negatively regulates relations in Gene Ontology (GO) to predict directions and signs of PPIs. These relations indicate both directions and signs between GO terms so that we can project directions and signs between relevant GO terms to PPIs. Independent test results showed that our method is effective for predicting both directions and signs of PPIs. Moreover, our method outperformed a previous GO-based method that did not consider the relations between GO terms. We annotated effect types to human PPIs and validated several highly confident effect types against literature. The annotated human PPIs are available in Additional file 2 to aid signaling pathway reconstruction and network biology research. We annotated effect types to PPIs by using regulates, positively regulates, and negatively regulates relations in GO. We demonstrated that those relations are effective for predicting not only signs, but also directions of PPIs. The usefulness of those relations suggests their potential applications to other types of interactions such as protein-DNA interactions.

  1. Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions.

    Science.gov (United States)

    Hur, Junguk; Özgür, Arzucan; Xiang, Zuoshuang; He, Yongqun

    2015-01-01

    Literature mining of gene-gene interactions has been enhanced by ontology-based name classifications. However, in biomedical literature mining, interaction keywords have not been carefully studied and used beyond a collection of keywords. In this study, we report the development of a new Interaction Network Ontology (INO) that classifies >800 interaction keywords and incorporates interaction terms from the PSI Molecular Interactions (PSI-MI) and Gene Ontology (GO). Using INO-based literature mining results, a modified Fisher's exact test was established to analyze significantly over- and under-represented enriched gene-gene interaction types within a specific area. Such a strategy was applied to study the vaccine-mediated gene-gene interactions using all PubMed abstracts. The Vaccine Ontology (VO) and INO were used to support the retrieval of vaccine terms and interaction keywords from the literature. INO is aligned with the Basic Formal Ontology (BFO) and imports terms from 10 other existing ontologies. Current INO includes 540 terms. In terms of interaction-related terms, INO imports and aligns PSI-MI and GO interaction terms and includes over 100 newly generated ontology terms with 'INO_' prefix. A new annotation property, 'has literature mining keywords', was generated to allow the listing of different keywords mapping to the interaction types in INO. Using all PubMed documents published as of 12/31/2013, approximately 266,000 vaccine-associated documents were identified, and a total of 6,116 gene-pairs were associated with at least one INO term. Out of 78 INO interaction terms associated with at least five gene-pairs of the vaccine-associated sub-network, 14 terms were significantly over-represented (i.e., more frequently used) and 17 under-represented based on our modified Fisher's exact test. These over-represented and under-represented terms share some common top-level terms but are distinct at the bottom levels of the INO hierarchy. The analysis of these

  2. A new measure for functional similarity of gene products based on Gene Ontology

    Directory of Open Access Journals (Sweden)

    Lengauer Thomas

    2006-06-01

    Full Text Available Abstract Background Gene Ontology (GO is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

  3. GO(vis), a gene ontology visualization tool based on multi-dimensional values.

    Science.gov (United States)

    Ning, Zi; Jiang, Zhenran

    2010-05-01

    Most of gene product similarity measurements concentrate on the information content of Gene Ontology (GO) terms or use a path-based similarity between GO terms, which may ignore other important information contained in the structure of the ontology. In our study, we integrate different GO similarity measure approaches to analyze the functional relationship of genes and gene products with a new triangle-based visualization tool called GO(Vis). The purpose of this tool is to demonstrate the effect of three important information factors when measuring the similarity between gene products. One advantage of this tool is that its important ratio can be adjusted to meet different measuring requirements according to the biological knowledge of each factor. The experimental results demonstrate that GO(Vis) can display diagrams of the functional relationship for gene products effectively.

  4. A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

    Science.gov (United States)

    Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

    2008-04-01

    Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.

  5. Approaching the axiomatic enrichment of the Gene Ontology from a lexical perspective.

    Science.gov (United States)

    Quesada-Martínez, Manuel; Mikroyannidi, Eleni; Fernández-Breis, Jesualdo Tomás; Stevens, Robert

    2015-09-01

    The main goal of this work is to measure how lexical regularities in biomedical ontology labels can be used for the automatic creation of formal relationships between classes, and to evaluate the results of applying our approach to the Gene Ontology (GO). In recent years, we have developed a method for the lexical analysis of regularities in biomedical ontology labels, and we showed that the labels can present a high degree of regularity. In this work, we extend our method with a cross-products extension (CPE) metric, which estimates the potential interest of a specific regularity for axiomatic enrichment in the lexical analysis, using information on exact matches in external ontologies. The GO consortium recently enriched the GO by using so-called cross-product extensions. Cross-products are generated by establishing axioms that relate a given GO class with classes from the GO or other biomedical ontologies. We apply our method to the GO and study how its lexical analysis can identify and reconstruct the cross-products that are defined by the GO consortium. The label of the classes of the GO are highly regular in lexical terms, and the exact matches with labels of external ontologies affect 80% of the GO classes. The CPE metric reveals that 31.48% of the classes that exhibit regularities have fragments that are classes into two external ontologies that are selected for our experiment, namely, the Cell Ontology and the Chemical Entities of Biological Interest ontology, and 18.90% of them are fully decomposable into smaller parts. Our results show that the CPE metric permits our method to detect GO cross-product extensions with a mean recall of 62% and a mean precision of 28%. The study is completed with an analysis of false positives to explain this precision value. We think that our results support the claim that our lexical approach can contribute to the axiomatic enrichment of biomedical ontologies and that it can provide new insights into the engineering of

  6. Muscle Research and Gene Ontology: New standards for improved data integration.

    Science.gov (United States)

    Feltrin, Erika; Campanaro, Stefano; Diehl, Alexander D; Ehler, Elisabeth; Faulkner, Georgine; Fordham, Jennifer; Gardin, Chiara; Harris, Midori; Hill, David; Knoell, Ralph; Laveder, Paolo; Mittempergher, Lorenza; Nori, Alessandra; Reggiani, Carlo; Sorrentino, Vincenzo; Volpe, Pompeo; Zara, Ivano; Valle, Giorgio; Deegan, Jennifer

    2009-01-29

    The Gene Ontology Project provides structured controlled vocabularies for molecular biology that can be used for the functional annotation of genes and gene products. In a collaboration between the Gene Ontology (GO) Consortium and the muscle biology community, we have made large-scale additions to the GO biological process and cellular component ontologies. The main focus of this ontology development work concerns skeletal muscle, with specific consideration given to the processes of muscle contraction, plasticity, development, and regeneration, and to the sarcomere and membrane-delimited compartments. Our aims were to update the existing structure to reflect current knowledge, and to resolve, in an accommodating manner, the ambiguity in the language used by the community. The updated muscle terminologies have been incorporated into the GO. There are now 159 new terms covering critical research areas, and 57 existing terms have been improved and reorganized to follow their usage in muscle literature. The revised GO structure should improve the interpretation of data from high-throughput (e.g. microarray and proteomic) experiments in the area of muscle science and muscle disease. We actively encourage community feedback on, and gene product annotation with these new terms. Please visit the Muscle Community Annotation Wiki http://wiki.geneontology.org/index.php/Muscle_Biology.

  7. Muscle Research and Gene Ontology: New standards for improved data integration

    Directory of Open Access Journals (Sweden)

    Nori Alessandra

    2009-01-01

    Full Text Available Abstract Background The Gene Ontology Project provides structured controlled vocabularies for molecular biology that can be used for the functional annotation of genes and gene products. In a collaboration between the Gene Ontology (GO Consortium and the muscle biology community, we have made large-scale additions to the GO biological process and cellular component ontologies. The main focus of this ontology development work concerns skeletal muscle, with specific consideration given to the processes of muscle contraction, plasticity, development, and regeneration, and to the sarcomere and membrane-delimited compartments. Our aims were to update the existing structure to reflect current knowledge, and to resolve, in an accommodating manner, the ambiguity in the language used by the community. Results The updated muscle terminologies have been incorporated into the GO. There are now 159 new terms covering critical research areas, and 57 existing terms have been improved and reorganized to follow their usage in muscle literature. Conclusion The revised GO structure should improve the interpretation of data from high-throughput (e.g. microarray and proteomic experiments in the area of muscle science and muscle disease. We actively encourage community feedback on, and gene product annotation with these new terms. Please visit the Muscle Community Annotation Wiki http://wiki.geneontology.org/index.php/Muscle_Biology.

  8. Gene ontology analysis of pairwise genetic associations in two genome-wide studies of sporadic ALS

    Directory of Open Access Journals (Sweden)

    Kim Nora

    2012-07-01

    Full Text Available Abstract Background It is increasingly clear that common human diseases have a complex genetic architecture characterized by both additive and nonadditive genetic effects. The goal of the present study was to determine whether patterns of both additive and nonadditive genetic associations aggregate in specific functional groups as defined by the Gene Ontology (GO. Results We first estimated all pairwise additive and nonadditive genetic effects using the multifactor dimensionality reduction (MDR method that makes few assumptions about the underlying genetic model. Statistical significance was evaluated using permutation testing in two genome-wide association studies of ALS. The detection data consisted of 276 subjects with ALS and 271 healthy controls while the replication data consisted of 221 subjects with ALS and 211 healthy controls. Both studies included genotypes from approximately 550,000 single-nucleotide polymorphisms (SNPs. Each SNP was mapped to a gene if it was within 500 kb of the start or end. Each SNP was assigned a p-value based on its strongest joint effect with the other SNPs. We then used the Exploratory Visual Analysis (EVA method and software to assign a p-value to each gene based on the overabundance of significant SNPs at the α = 0.05 level in the gene. We also used EVA to assign p-values to each GO group based on the overabundance of significant genes at the α = 0.05 level. A GO category was determined to replicate if that category was significant at the α = 0.05 level in both studies. We found two GO categories that replicated in both studies. The first, ‘Regulation of Cellular Component Organization and Biogenesis’, a GO Biological Process, had p-values of 0.010 and 0.014 in the detection and replication studies, respectively. The second, ‘Actin Cytoskeleton’, a GO Cellular Component, had p-values of 0.040 and 0.046 in the detection and replication studies, respectively. Conclusions Pathway

  9. Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.

    Science.gov (United States)

    Hur, Junguk; Özgür, Arzucan; He, Yongqun

    2017-03-14

    Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of

  10. GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data

    Directory of Open Access Journals (Sweden)

    Domont Gilberto B

    2009-02-01

    Full Text Available Abstract Background Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Results Here we present a new algorithm, termed GO Explorer (GOEx, that leverages the gene ontology (GO to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172. We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. Conclusion GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.

  11. GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data.

    Science.gov (United States)

    Carvalho, Paulo C; Fischer, Juliana Sg; Chen, Emily I; Domont, Gilberto B; Carvalho, Maria Gc; Degrave, Wim M; Yates, John R; Barbosa, Valmir C

    2009-02-24

    Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Here we present a new algorithm, termed GO Explorer (GOEx), that leverages the gene ontology (GO) to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172). We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.

  12. Exploring the key genes and pathways in enchondromas using a gene expression microarray.

    Science.gov (United States)

    Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

    2017-07-04

    Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway.Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.

  13. The mammalian adult neurogenesis gene ontology (MANGO provides a structural framework for published information on genes regulating adult hippocampal neurogenesis.

    Directory of Open Access Journals (Sweden)

    Rupert W Overall

    Full Text Available BACKGROUND: Adult hippocampal neurogenesis is not a single phenotype, but consists of a number of sub-processes, each of which is under complex genetic control. Interpretation of gene expression studies using existing resources often does not lead to results that address the interrelatedness of these processes. Formal structure, such as provided by ontologies, is essential in any field for comprehensive interpretation of existing knowledge but, until now, such a structure has been lacking for adult neurogenesis. METHODOLOGY/PRINCIPAL FINDINGS: We have created a resource with three components 1. A structured ontology describing the key stages in the development of adult hippocampal neural stem cells into functional granule cell neurons. 2. A comprehensive survey of the literature to annotate the results of all published reports on gene function in adult hippocampal neurogenesis (257 manuscripts covering 228 genes to the appropriate terms in our ontology. 3. An easy-to-use searchable interface to the resulting database made freely available online. The manuscript presents an overview of the database highlighting global trends such as the current bias towards research on early proliferative stages, and an example gene set enrichment analysis. A limitation of the resource is the current scope of the literature which, however, is growing by around 100 publications per year. With the ontology and database in place, new findings can be rapidly annotated and regular updates of the database will be made publicly available. CONCLUSIONS/SIGNIFICANCE: The resource we present allows relevant interpretation of gene expression screens in terms of defined stages of postnatal neuronal development. Annotation of genes by hand from the adult neurogenesis literature ensures the data are directly applicable to the system under study. We believe this approach could also serve as an example to other fields in a 'bottom-up' community effort complementing the already

  14. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network

    Directory of Open Access Journals (Sweden)

    Chen Xin

    2012-10-01

    Full Text Available Abstract Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN. Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable

  15. GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology

    OpenAIRE

    Caniza, Horacio; Romero, Alfonso E.; Heron, Samuel; Yang, Haixuan; Devoto, Alessandra; Frasca, Marco; Mesiti, Marco; Valentini, Giorgio; Paccanaro, Alberto

    2014-01-01

    Summary: We present GOssTo, the Gene Ontology semantic similarity Tool, a user-friendly software system for calculating semantic similarities between gene products according to the Gene Ontology. GOssTo is bundled with six semantic similarity measures, including both term- and graph-based measures, and has extension capabilities to allow the user to add new similarities. Importantly, for any measure, GOssTo can also calculate the Random Walk Contribution that has been shown to greatly improve...

  16. Gene Ontology Terms and Automated Annotation for Energy-Related Microbial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Mukhopadhyay, Biswarup [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States); Tyler, Brett M. [Oregon State Univ., Corvallis, OR (United States); Setubal, Joao [Univ. of Sao Paulo (Brazil); Murali, T. M. [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)

    2017-11-03

    Gene Ontology (GO) is one of the more widely used functional ontologies for describing gene functions at various levels. The project developed 660 GO terms for describing energy-related microbial processes and filled the known gaps in this area of the GO system, and then used these terms to describe functions of 179 genes to showcase the utilities of the new resources. It hosted a series of workshops and made presentations at key meetings to inform and train scientific community members on these terms and to receive inputs from them for the GO term generation efforts. The project has developed a website for storing and displaying the resources (http://www.mengo.biochem.vt.edu/). The outcome of the project was further disseminated through peer-reviewed publications and poster and seminar presentations.

  17. The Gene Ontology Differs in Bursa of Fabricius Between Two Breeds of Ducks Post Hatching by Enriching the Differentially Expressed Genes

    Directory of Open Access Journals (Sweden)

    H Liu

    Full Text Available ABSTRACT The bursa of Fabricius (BF is the central humoral immune organ unique to birds. The present study investigated the possible difference on a molecular level between two duck breeds. The digital gene expression profiling (DGE technology was used to enrich the differentially expressed genes (DEGs in BF between the Jianchang and Nonghua-P strains of ducks. DGE data identified 195 DEGs in the bursa. Gene Ontology (GO analysis suggested that DEGs were mainly enriched in the metabolic pathways and ribosome components. Pathways analysis identified the spliceosome, RNA transport, RNA degradation process, Jak-STAT signaling pathway, TNF signaling pathway and B cell receptor signaling pathway. The results indicated that the main difference in the BF between the two duck strains was in the capabilities of protein formation and B cell development. These data have revealed the main divergence in the BF on a molecular level between genetically different duck breeds and may help to perform molecular breeding programs in poultry in the future.

  18. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    Science.gov (United States)

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Is the crowd better as an assistant or a replacement in ontology engineering? An exploration through the lens of the Gene Ontology.

    Science.gov (United States)

    Mortensen, Jonathan M; Telis, Natalie; Hughey, Jacob J; Fan-Minogue, Hua; Van Auken, Kimberly; Dumontier, Michel; Musen, Mark A

    2016-04-01

    Biomedical ontologies contain errors. Crowdsourcing, defined as taking a job traditionally performed by a designated agent and outsourcing it to an undefined large group of people, provides scalable access to humans. Therefore, the crowd has the potential to overcome the limited accuracy and scalability found in current ontology quality assurance approaches. Crowd-based methods have identified errors in SNOMED CT, a large, clinical ontology, with an accuracy similar to that of experts, suggesting that crowdsourcing is indeed a feasible approach for identifying ontology errors. This work uses that same crowd-based methodology, as well as a panel of experts, to verify a subset of the Gene Ontology (200 relationships). Experts identified 16 errors, generally in relationships referencing acids and metals. The crowd performed poorly in identifying those errors, with an area under the receiver operating characteristic curve ranging from 0.44 to 0.73, depending on the methods configuration. However, when the crowd verified what experts considered to be easy relationships with useful definitions, they performed reasonably well. Notably, there are significantly fewer Google search results for Gene Ontology concepts than SNOMED CT concepts. This disparity may account for the difference in performance - fewer search results indicate a more difficult task for the worker. The number of Internet search results could serve as a method to assess which tasks are appropriate for the crowd. These results suggest that the crowd fits better as an expert assistant, helping experts with their verification by completing the easy tasks and allowing experts to focus on the difficult tasks, rather than an expert replacement. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Genetic Resources for Advanced Biofuel Production Described with the Gene Ontology

    Directory of Open Access Journals (Sweden)

    Trudy eTorto-Alalibo

    2014-10-01

    Full Text Available Dramatic increases in research in the area of microbial biofuel production coupled with high-throughput data generation on bioenergy-related microbes has led to a deluge of information in the scientific literature and in databases. Consolidating this information and making it easily accessible requires a unified vocabulary. The Gene Ontology (GO fulfills that requirement, as it is a well-developed structured vocabulary that describes the activities and locations of gene products in a consistent manner across all kingdoms of life. The Microbial Energy Gene Ontology (MENGO: http://www.mengo.biochem.vt.edu project is extending the GO to include new terms to describe microbial processes of interest to bioenergy production. Our effort has added over 600 bioenergy related terms to the Gene Ontology. These terms will aid in the comprehensive annotation of gene products from diverse energy-related microbial genomes. An area of microbial energy research that has received a lot of attention is microbial production of advanced biofuels. These include alcohols such as butanol, isopropanol, isobutanol, and fuels derived from fatty acids, isoprenoids, and polyhydroxyalkanoates. These fuels are superior to first generation biofuels (ethanol and biodiesel esterified from vegetable oil or animal fat, can be generated from non-food feedstock sources, can be used as supplements or substitutes for gasoline, diesel and jet fuels, and can be stored and distributed using existing infrastructure. Here we review the roles of genes associated with synthesis of advanced biofuels, and at the same time introduce the use of the GO to describe the functions of these genes in a standardized way.

  1. Identification of the Key Genes and Pathways in Esophageal Carcinoma.

    Science.gov (United States)

    Su, Peng; Wen, Shiwang; Zhang, Yuefeng; Li, Yong; Xu, Yanzhao; Zhu, Yonggang; Lv, Huilai; Zhang, Fan; Wang, Mingbo; Tian, Ziqiang

    2016-01-01

    Objective . Esophageal carcinoma (EC) is a frequently common malignancy of gastrointestinal cancer in the world. This study aims to screen key genes and pathways in EC and elucidate the mechanism of it. Methods . 5 microarray datasets of EC were downloaded from Gene Expression Omnibus. Differentially expressed genes (DEGs) were screened by bioinformatics analysis. Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, and protein-protein interaction (PPI) network construction were performed to obtain the biological roles of DEGs in EC. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to verify the expression level of DEGs in EC. Results . A total of 1955 genes were filtered as DEGs in EC. The upregulated genes were significantly enriched in cell cycle and the downregulated genes significantly enriched in Endocytosis. PPI network displayed CDK4 and CCT3 were hub proteins in the network. The expression level of 8 dysregulated DEGs including CDK4, CCT3, THSD4, SIM2, MYBL2, CENPF, CDCA3, and CDKN3 was validated in EC compared to adjacent nontumor tissues and the results were matched with the microarray analysis. Conclusion . The significantly DEGs including CDK4, CCT3, THSD4, and SIM2 may play key roles in tumorigenesis and development of EC involved in cell cycle and Endocytosis.

  2. Identification of the Key Genes and Pathways in Esophageal Carcinoma

    Directory of Open Access Journals (Sweden)

    Peng Su

    2016-01-01

    Full Text Available Objective. Esophageal carcinoma (EC is a frequently common malignancy of gastrointestinal cancer in the world. This study aims to screen key genes and pathways in EC and elucidate the mechanism of it. Methods. 5 microarray datasets of EC were downloaded from Gene Expression Omnibus. Differentially expressed genes (DEGs were screened by bioinformatics analysis. Gene Ontology (GO enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG enrichment, and protein-protein interaction (PPI network construction were performed to obtain the biological roles of DEGs in EC. Quantitative real-time polymerase chain reaction (qRT-PCR was used to verify the expression level of DEGs in EC. Results. A total of 1955 genes were filtered as DEGs in EC. The upregulated genes were significantly enriched in cell cycle and the downregulated genes significantly enriched in Endocytosis. PPI network displayed CDK4 and CCT3 were hub proteins in the network. The expression level of 8 dysregulated DEGs including CDK4, CCT3, THSD4, SIM2, MYBL2, CENPF, CDCA3, and CDKN3 was validated in EC compared to adjacent nontumor tissues and the results were matched with the microarray analysis. Conclusion. The significantly DEGs including CDK4, CCT3, THSD4, and SIM2 may play key roles in tumorigenesis and development of EC involved in cell cycle and Endocytosis.

  3. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  4. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  5. Pathway analysis of gene signatures predicting metastasis of node-negative primary breast cancer

    International Nuclear Information System (INIS)

    Yu, Jack X; Sieuwerts, Anieta M; Zhang, Yi; Martens, John WM; Smid, Marcel; Klijn, Jan GM; Wang, Yixin; Foekens, John A

    2007-01-01

    Published prognostic gene signatures in breast cancer have few genes in common. Here we provide a rationale for this observation by studying the prognostic power and the underlying biological pathways of different gene signatures. Gene signatures to predict the development of metastases in estrogen receptor-positive and estrogen receptor-negative tumors were identified using 500 re-sampled training sets and mapping to Gene Ontology Biological Process to identify over-represented pathways. The Global Test program confirmed that gene expression profilings in the common pathways were associated with the metastasis of the patients. The apoptotic pathway and cell division, or cell growth regulation and G-protein coupled receptor signal transduction, were most significantly associated with the metastatic capability of estrogen receptor-positive or estrogen-negative tumors, respectively. A gene signature derived of the common pathways predicted metastasis in an independent cohort. Mapping of the pathways represented by different published prognostic signatures showed that they share 53% of the identified pathways. We show that divergent gene sets classifying patients for the same clinical endpoint represent similar biological processes and that pathway-derived signatures can be used to predict prognosis. Furthermore, our study reveals that the underlying biology related to aggressiveness of estrogen receptor subgroups of breast cancer is quite different

  6. Delineation and interpretation of gene networks towards their effect in cellular physiology- a reverse engineering approach for the identification of critical molecular players, through the use of ontologies.

    Science.gov (United States)

    Moutselos, K; Maglogiannis, I; Chatziioannou, A

    2010-01-01

    Exploiting ontologies, provides clues regarding the involvement of certain molecular processes in the cellular phenotypic manifestation. However, identifying individual molecular actors (genes, proteins, etc.) for targeted biological validation in a generic, prioritized, fashion, based in objective measures of their effects in the cellular physiology, remains a challenge. In this work, a new meta-analysis algorithm is proposed for the holistic interpretation of the information captured in -omic experiments, that is showcased in a transcriptomic, dynamic, DNA microarray dataset, which examines the effect of mastic oil treatment in Lewis lung carcinoma cells. Through the use of the Gene Ontology this algorithm relates genes to specific cellular pathways and vice versa in order to further reverse engineer the critical role of specific genes, starting from the results of various statistical enrichment analyses. The algorithm is able to discriminate candidate hub-genes, implying critical biochemical cross-talk. Moreover, performance measures of the algorithm are derived, when evaluated with respect to the differential expression gene list of the dataset.

  7. Examination of tetrahydrobiopterin pathway genes in autism.

    Science.gov (United States)

    Schnetz-Boutaud, N C; Anderson, B M; Brown, K D; Wright, H H; Abramson, R K; Cuccaro, M L; Gilbert, J R; Pericak-Vance, M A; Haines, J L

    2009-11-01

    Autism is a complex disorder with a high degree of heritability and significant phenotypic and genotypic heterogeneity. Although candidate gene studies and genome-wide screens have failed to identify major causal loci associated with autism, numerous studies have proposed association with several variations in genes in the dopaminergic and serotonergic pathways. Because tetrahydrobiopterin (BH4) is the essential cofactor in the synthesis of these two neurotransmitters, we genotyped 25 SNPs in nine genes of the BH4 pathway in a total of 403 families. Significant nominal association was detected in the gene for 6-pyruvoyl-tetrahydropterin synthase, PTS (chromosome 11), with P = 0.009; this result was not restricted to an affected male-only subset. Multilocus interaction was detected in the BH4 pathway alone, but not across the serotonin, dopamine and BH4 pathways.

  8. Genes and (Common) Pathways Underlying Drug Addiction

    Science.gov (United States)

    Li, Chuan-Yun; Mao, Xizeng; Wei, Liping

    2008-01-01

    Drug addiction is a serious worldwide problem with strong genetic and environmental influences. Different technologies have revealed a variety of genes and pathways underlying addiction; however, each individual technology can be biased and incomplete. We integrated 2,343 items of evidence from peer-reviewed publications between 1976 and 2006 linking genes and chromosome regions to addiction by single-gene strategies, microrray, proteomics, or genetic studies. We identified 1,500 human addiction-related genes and developed KARG (http://karg.cbi.pku.edu.cn), the first molecular database for addiction-related genes with extensive annotations and a friendly Web interface. We then performed a meta-analysis of 396 genes that were supported by two or more independent items of evidence to identify 18 molecular pathways that were statistically significantly enriched, covering both upstream signaling events and downstream effects. Five molecular pathways significantly enriched for all four different types of addictive drugs were identified as common pathways which may underlie shared rewarding and addictive actions, including two new ones, GnRH signaling pathway and gap junction. We connected the common pathways into a hypothetical common molecular network for addiction. We observed that fast and slow positive feedback loops were interlinked through CAMKII, which may provide clues to explain some of the irreversible features of addiction. PMID:18179280

  9. Genes and (common pathways underlying drug addiction.

    Directory of Open Access Journals (Sweden)

    Chuan-Yun Li

    2008-01-01

    Full Text Available Drug addiction is a serious worldwide problem with strong genetic and environmental influences. Different technologies have revealed a variety of genes and pathways underlying addiction; however, each individual technology can be biased and incomplete. We integrated 2,343 items of evidence from peer-reviewed publications between 1976 and 2006 linking genes and chromosome regions to addiction by single-gene strategies, microrray, proteomics, or genetic studies. We identified 1,500 human addiction-related genes and developed KARG (http://karg.cbi.pku.edu.cn, the first molecular database for addiction-related genes with extensive annotations and a friendly Web interface. We then performed a meta-analysis of 396 genes that were supported by two or more independent items of evidence to identify 18 molecular pathways that were statistically significantly enriched, covering both upstream signaling events and downstream effects. Five molecular pathways significantly enriched for all four different types of addictive drugs were identified as common pathways which may underlie shared rewarding and addictive actions, including two new ones, GnRH signaling pathway and gap junction. We connected the common pathways into a hypothetical common molecular network for addiction. We observed that fast and slow positive feedback loops were interlinked through CAMKII, which may provide clues to explain some of the irreversible features of addiction.

  10. MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction

    Directory of Open Access Journals (Sweden)

    Kohlbacher Oliver

    2009-09-01

    Full Text Available Abstract Background Knowledge of subcellular localization of proteins is crucial to proteomics, drug target discovery and systems biology since localization and biological function are highly correlated. In recent years, numerous computational prediction methods have been developed. Nevertheless, there is still a need for prediction methods that show more robustness and higher accuracy. Results We extended our previous MultiLoc predictor by incorporating phylogenetic profiles and Gene Ontology terms. Two different datasets were used for training the system, resulting in two versions of this high-accuracy prediction method. One version is specialized for globular proteins and predicts up to five localizations, whereas a second version covers all eleven main eukaryotic subcellular localizations. In a benchmark study with five localizations, MultiLoc2 performs considerably better than other methods for animal and plant proteins and comparably for fungal proteins. Furthermore, MultiLoc2 performs clearly better when using a second dataset that extends the benchmark study to all eleven main eukaryotic subcellular localizations. Conclusion MultiLoc2 is an extensive high-performance subcellular protein localization prediction system. By incorporating phylogenetic profiles and Gene Ontology terms MultiLoc2 yields higher accuracies compared to its previous version. Moreover, it outperforms other prediction systems in two benchmarks studies. MultiLoc2 is available as user-friendly and free web-service, available at: http://www-bs.informatik.uni-tuebingen.de/Services/MultiLoc2.

  11. A-DaGO-Fun: an adaptable Gene Ontology semantic similarity-based functional analysis tool.

    Science.gov (United States)

    Mazandu, Gaston K; Chimusa, Emile R; Mbiyavanga, Mamana; Mulder, Nicola J

    2016-02-01

    Gene Ontology (GO) semantic similarity measures are being used for biological knowledge discovery based on GO annotations by integrating biological information contained in the GO structure into data analyses. To empower users to quickly compute, manipulate and explore these measures, we introduce A-DaGO-Fun (ADaptable Gene Ontology semantic similarity-based Functional analysis). It is a portable software package integrating all known GO information content-based semantic similarity measures and relevant biological applications associated with these measures. A-DaGO-Fun has the advantage not only of handling datasets from the current high-throughput genome-wide applications, but also allowing users to choose the most relevant semantic similarity approach for their biological applications and to adapt a given module to their needs. A-DaGO-Fun is freely available to the research community at http://web.cbio.uct.ac.za/ITGOM/adagofun. It is implemented in Linux using Python under free software (GNU General Public Licence). gmazandu@cbio.uct.ac.za or Nicola.Mulder@uct.ac.za Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  12. Exploring genes and pathways involved in migraine

    NARCIS (Netherlands)

    Eising, E.

    2017-01-01

    The research in this thesis was aimed at identifying genes and molecular pathways involved in migraine. To this end, two gene expression analyses were performed in brain tissue obtained from transgenic mouse models for familial hemiplegic migraine (FHM), a monogenic subtype of migraine with aura.

  13. Gene expression profiling in susceptible interaction of grapevine with its fungal pathogen Eutypa lata: Extending MapMan ontology for grapevine

    Directory of Open Access Journals (Sweden)

    Usadel Björn

    2009-08-01

    Full Text Available Abstract Background Whole genome transcriptomics analysis is a very powerful approach because it gives an overview of the activity of genes in certain cells or tissue types. However, biological interpretation of such results can be rather tedious. MapMan is a software tool that displays large datasets (e.g. gene expression data onto diagrams of metabolic pathways or other processes and thus enables easier interpretation of results. The grapevine (Vitis vinifera genome sequence has recently become available bringing a new dimension into associated research. Two microarray platforms were designed based on the TIGR Gene Index database and used in several physiological studies. Results To enable easy and effective visualization of those and further experiments, annotation of Vitis vinifera Gene Index (VvGI version 5 to MapMan ontology was set up. Due to specificities of grape physiology, we have created new pictorial representations focusing on three selected pathways: carotenoid pathway, terpenoid pathway and phenylpropanoid pathway, the products of these pathways being important for wine aroma, flavour and colour, as well as plant defence against pathogens. This new tool was validated on Affymetrix microarrays data obtained during berry ripening and it allowed the discovery of new aspects in process regulation. We here also present results on transcriptional profiling of grape plantlets after exposal to the fungal pathogen Eutypa lata using Operon microarrays including visualization of results with MapMan. The data show that the genes induced in infected plants, encode pathogenesis related proteins and enzymes of the flavonoid metabolism, which are well known as being responsive to fungal infection. Conclusion The extension of MapMan ontology to grapevine together with the newly constructed pictorial representations for carotenoid, terpenoid and phenylpropanoid metabolism provide an alternative approach to the analysis of grapevine gene expression

  14. GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.

    Science.gov (United States)

    Zhang, Song; Cao, Jing; Kong, Y Megan; Scheuermann, Richard H

    2010-04-01

    A typical approach for the interpretation of high-throughput experiments, such as gene expression microarrays, is to produce groups of genes based on certain criteria (e.g. genes that are differentially expressed). To gain more mechanistic insights into the underlying biology, overrepresentation analysis (ORA) is often conducted to investigate whether gene sets associated with particular biological functions, for example, as represented by Gene Ontology (GO) annotations, are statistically overrepresented in the identified gene groups. However, the standard ORA, which is based on the hypergeometric test, analyzes each GO term in isolation and does not take into account the dependence structure of the GO-term hierarchy. We have developed a Bayesian approach (GO-Bayes) to measure overrepresentation of GO terms that incorporates the GO dependence structure by taking into account evidence not only from individual GO terms, but also from their related terms (i.e. parents, children, siblings, etc.). The Bayesian framework borrows information across related GO terms to strengthen the detection of overrepresentation signals. As a result, this method tends to identify sets of closely related GO terms rather than individual isolated GO terms. The advantage of the GO-Bayes approach is demonstrated with a simulation study and an application example.

  15. The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability.

    Science.gov (United States)

    Diehl, Alexander D; Meehan, Terrence F; Bradford, Yvonne M; Brush, Matthew H; Dahdul, Wasila M; Dougall, David S; He, Yongqun; Osumi-Sutherland, David; Ruttenberg, Alan; Sarntivijai, Sirarat; Van Slyke, Ceri E; Vasilevsky, Nicole A; Haendel, Melissa A; Blake, Judith A; Mungall, Christopher J

    2016-07-04

    The Cell Ontology (CL) is an OBO Foundry candidate ontology covering the domain of canonical, natural biological cell types. Since its inception in 2005, the CL has undergone multiple rounds of revision and expansion, most notably in its representation of hematopoietic cells. For in vivo cells, the CL focuses on vertebrates but provides general classes that can be used for other metazoans, which can be subtyped in species-specific ontologies. Recent work on the CL has focused on extending the representation of various cell types, and developing new modules in the CL itself, and in related ontologies in coordination with the CL. For example, the Kidney and Urinary Pathway Ontology was used as a template to populate the CL with additional cell types. In addition, subtypes of the class 'cell in vitro' have received improved definitions and labels to provide for modularity with the representation of cells in the Cell Line Ontology and Reagent Ontology. Recent changes in the ontology development methodology for CL include a switch from OBO to OWL for the primary encoding of the ontology, and an increasing reliance on logical definitions for improved reasoning. The CL is now mandated as a metadata standard for large functional genomics and transcriptomics projects, and is used extensively for annotation, querying, and analyses of cell type specific data in sequencing consortia such as FANTOM5 and ENCODE, as well as for the NIAID ImmPort database and the Cell Image Library. The CL is also a vital component used in the modular construction of other biomedical ontologies-for example, the Gene Ontology and the cross-species anatomy ontology, Uberon, use CL to support the consistent representation of cell types across different levels of anatomical granularity, such as tissues and organs. The ongoing improvements to the CL make it a valuable resource to both the OBO Foundry community and the wider scientific community, and we continue to experience increased interest in the

  16. Characterization of differentially expressed genes involved in pathways associated with gastric cancer.

    Directory of Open Access Journals (Sweden)

    Hao Li

    Full Text Available To explore the patterns of gene expression in gastric cancer, a total of 26 paired gastric cancer and noncancerous tissues from patients were enrolled for gene expression microarray analyses. Limma methods were applied to analyze the data, and genes were considered to be significantly differentially expressed if the False Discovery Rate (FDR value was 2. Subsequently, Gene Ontology (GO categories were used to analyze the main functions of the differentially expressed genes. According to the Kyoto Encyclopedia of Genes and Genomes (KEGG database, we found pathways significantly associated with the differential genes. Gene-Act network and co-expression network were built respectively based on the relationships among the genes, proteins and compounds in the database. 2371 mRNAs and 350 lncRNAs considered as significantly differentially expressed genes were selected for the further analysis. The GO categories, pathway analyses and the Gene-Act network showed a consistent result that up-regulated genes were responsible for tumorigenesis, migration, angiogenesis and microenvironment formation, while down-regulated genes were involved in metabolism. These results of this study provide some novel findings on coding RNAs, lncRNAs, pathways and the co-expression network in gastric cancer which will be useful to guide further investigation and target therapy for this disease.

  17. Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network

    Science.gov (United States)

    2011-01-01

    Background Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. Results The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were

  18. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  19. Identification of key pathways and genes influencing prognosis in bladder urothelial carcinoma

    Directory of Open Access Journals (Sweden)

    Ning X

    2017-03-01

    Full Text Available Xin Ning, Yaoliang Deng Department of Urology, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Province, People’s Republic of China Background: Genomic profiling can be used to identify the predictive effect of genomic subsets for determining prognosis in bladder urothelial carcinoma (BUC after radical cystectomy. This study aimed to investigate potential gene and pathway markers associated with prognosis in BUC.Methods: A microarray dataset of BUC was obtained from The Cancer Genome Atlas database. Differentially expressed genes (DEGs were identified by DESeq of the R platform. Kaplan–Meier analysis was applied for prognostic markers. Key pathways and genes were identified using bioinformatics tools, such as gene set enrichment analysis, gene ontology, the Kyoto Encyclopedia of Genes and Genomes, gene multiple association network integration algorithm (GeneMANIA, Search Tool for the Retrieval of Interacting Genes/Proteins, and Molecular Complex Detection.Results: A comparative gene set enrichment analysis of tumor and adjacent normal tissues suggested BUC tumorigenesis resulted mainly from enrichment of cell cycle and DNA damage and repair-related biological processes and pathways, including TP53 and mitotic recombination. Two hundred and fifty-six genes were identified as potential prognosis-related DEGs. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analyses showed that the potential prognosis-related DEGs were enriched in angiogenesis, including the cyclic adenosine monophosphate biosynthetic process, cyclic guanosine monophosphate-protein kinase G, mitogen-activated protein kinase, Rap1, and phosphoinositide-3-kinase-AKT signaling pathway. Nine hub genes, TAGLN, ACTA2, MYH11, CALD1, MYLK, GEM, PRELP, TPM2, and OGN, were identified from the intersection of protein–protein interaction and GeneMANIA networks. Module analysis of protein–protein interaction and GeneMANIA networks mainly showed

  20. Gene dosage, expression, and ontology analysis identifies driver genes in the carcinogenesis and chemoradioresistance of cervical cancer.

    Directory of Open Access Journals (Sweden)

    Malin Lando

    2009-11-01

    Full Text Available Integrative analysis of gene dosage, expression, and ontology (GO data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1 and 13q (FAM48A, MED4 correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers.

  1. Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

    Science.gov (United States)

    Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

    2009-06-16

    microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.

  2. Transcriptome Analysis of Porcine PBMCs Reveals the Immune Cascade Response and Gene Ontology Terms Related to Cell Death and Fibrosis in the Progression of Liver Failure

    Directory of Open Access Journals (Sweden)

    YiMin Zhang

    2018-01-01

    Full Text Available Background. The key gene sets involved in the progression of acute liver failure (ALF, which has a high mortality rate, remain unclear. This study aims to gain a deeper understanding of the transcriptional response of peripheral blood mononuclear cells (PBMCs following ALF. Methods. ALF was induced by D-galactosamine (D-gal in a porcine model. PBMCs were separated at time zero (baseline group, 36 h (failure group, and 60 h (dying group after D-gal injection. Transcriptional profiling was performed using RNA sequencing and analysed using DAVID bioinformatics resources. Results. Compared with the baseline group, 816 and 1,845 differentially expressed genes (DEGs were identified in the failure and dying groups, respectively. A total of five and two gene ontology (GO term clusters were enriched in 107 GO terms in the failure group and 154 GO terms in the dying group. These GO clusters were primarily immune-related, including genes regulating the inflammasome complex and toll-like receptor signalling pathways. Specifically, GO terms related to cell death, including apoptosis, pyroptosis, and autophagy, and those related to fibrosis, coagulation dysfunction, and hepatic encephalopathy were enriched. Seven Kyoto Encyclopedia of Genes and Genomes (KEGG pathways, cytokine-cytokine receptor interaction, hematopoietic cell lineage, lysosome, rheumatoid arthritis, malaria, and phagosome and pertussis pathways were mapped for DEGs in the failure group. All of these seven KEGG pathways were involved in the 19 KEGG pathways mapped in the dying group. Conclusion. We found that the dramatic PBMC transcriptome changes triggered by ALF progression was predominantly related to immune responses. The enriched GO terms related to cell death, fibrosis, and so on, as indicated by PBMC transcriptome analysis, seem to be useful in elucidating potential key gene sets in the progression of ALF. A better understanding of these gene sets might be of preventive or

  3. Mood stabilizing drugs regulate transcription of immune, neuronal and metabolic pathway genes in Drosophila.

    Science.gov (United States)

    Herteleer, L; Zwarts, L; Hens, K; Forero, D; Del-Favero, J; Callaerts, P

    2016-05-01

    Lithium and valproate (VPA) are drugs used in the management of bipolar disorder. Even though they reportedly act on various pathways, the transcriptional targets relevant for disease mechanism and therapeutic effect remain unclear. Furthermore, multiple studies used lymphoblasts of bipolar patients as a cellular proxy, but it remains unclear whether peripheral cells provide a good readout for the effects of these drugs in the brain. We used Drosophila culture cells and adult flies to analyze the transcriptional effects of lithium and VPA and define mechanistic pathways. Transcriptional profiles were determined for Drosophila S2-cells and adult fly heads following lithium or VPA treatment. Gene ontology categories were identified using the DAVID functional annotation tool with a cut-off of p neuronal development, neuronal function, and metabolism. (i) Transcriptional effects of lithium and VPA in Drosophila S2 cells and heads show significant overlap. (ii) The overlap between transcriptional alterations in peripheral versus neuronal cells at the single gene level is negligible, but at the gene ontology and pathway level considerable overlap can be found. (iii) Lithium and VPA act on evolutionarily conserved pathways in Drosophila and mammalian models.

  4. Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

    Directory of Open Access Journals (Sweden)

    Jesús Lascorz

    2011-01-01

    Full Text Available Background: A large number of gene expression profiling (GEP studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9% had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO categories or Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.

  5. False positive reduction in protein-protein interaction predictions using gene ontology annotations

    Directory of Open Access Journals (Sweden)

    Lin Yen-Han

    2007-07-01

    Full Text Available Abstract Background Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Results Gene Ontology (GO annotations were used to reduce false positive protein-protein interactions (PPI pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Conclusion Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially

  6. Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

    Science.gov (United States)

    Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

    2012-01-01

    Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382

  7. Protein-Protein Interactions Prediction Based on Iterative Clique Extension with Gene Ontology Filtering

    Directory of Open Access Journals (Sweden)

    Lei Yang

    2014-01-01

    Full Text Available Cliques (maximal complete subnets in protein-protein interaction (PPI network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPI complement the data defection from biological experiments. However, clique-based predicting methods only depend on the topology of network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based method of prediction and gene ontology (GO annotations to overcome the shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.

  8. Susceptible genes and molecular pathways related to heavy ion irradiation in oral squamous cell carcinoma cells

    International Nuclear Information System (INIS)

    Fushimi, Kazuaki; Uzawa, Katsuhiro; Ishigami, Takashi; Yamamoto, Nobuharu; Kawata, Tetsuya; Shibahara, Takahiko; Ito, Hisao; Mizoe, Jun-etsu; Tsujii, Hirohiko; Tanzawa, Hideki

    2008-01-01

    Background and purpose: Heavy ion beams are high linear energy transfer (LET) radiation characterized by a higher relative biologic effectiveness than low LET radiation. The aim of the current study was to determine the difference of gene expression between heavy ion beams and X-rays in oral squamous cell carcinoma (OSCC)-derived cells. Materials and methods: The OSCC cells were irradiated with accelerated carbon or neon ion irradiation or X-rays using three different doses. We sought to identify genes the expression of which is affected by carbon and neon ion irradiation using Affymetrix GeneChip analysis. The identified genes were analyzed using the Ingenuity Pathway Analysis Tool to investigate the functional network and gene ontology. Changes in mRNA expression in the genes were assessed by real-time quantitative reverse transcriptase-polymerase chain reaction (qRT-PCR). Results: The microarray analysis identified 84 genes that were modulated by carbon and neon ion irradiation at all doses in OSCC cells. Among the genes, three genes (TGFBR2, SMURF2, and BMP7) and two genes (CCND1 and E2F3), respectively, were found to be involved in the transforming growth factor β-signaling pathway and cell cycle:G1/S checkpoint regulation pathway. The qRT-PCR data from the five genes after heavy ion irradiation were consistent with the microarray data (P < 0.01). Conclusion: Our findings should serve as a basis for global characterization of radiation-regulated genes and pathways in heavy ion-irradiated OSCC

  9. Amelogenesis Imperfecta; Genes, Proteins, and Pathways

    Directory of Open Access Journals (Sweden)

    Claire E. L. Smith

    2017-06-01

    Full Text Available Amelogenesis imperfecta (AI is the name given to a heterogeneous group of conditions characterized by inherited developmental enamel defects. AI enamel is abnormally thin, soft, fragile, pitted and/or badly discolored, with poor function and aesthetics, causing patients problems such as early tooth loss, severe embarrassment, eating difficulties, and pain. It was first described separately from diseases of dentine nearly 80 years ago, but the underlying genetic and mechanistic basis of the condition is only now coming to light. Mutations in the gene AMELX, encoding an extracellular matrix protein secreted by ameloblasts during enamel formation, were first identified as a cause of AI in 1991. Since then, mutations in at least eighteen genes have been shown to cause AI presenting in isolation of other health problems, with many more implicated in syndromic AI. Some of the encoded proteins have well documented roles in amelogenesis, acting as enamel matrix proteins or the proteases that degrade them, cell adhesion molecules or regulators of calcium homeostasis. However, for others, function is less clear and further research is needed to understand the pathways and processes essential for the development of healthy enamel. Here, we review the genes and mutations underlying AI presenting in isolation of other health problems, the proteins they encode and knowledge of their roles in amelogenesis, combining evidence from human phenotypes, inheritance patterns, mouse models, and in vitro studies. An LOVD resource (http://dna2.leeds.ac.uk/LOVD/ containing all published gene mutations for AI presenting in isolation of other health problems is described. We use this resource to identify trends in the genes and mutations reported to cause AI in the 270 families for which molecular diagnoses have been reported by 23rd May 2017. Finally we discuss the potential value of the translation of AI genetics to clinical care with improved patient pathways and

  10. Amelogenesis Imperfecta; Genes, Proteins, and Pathways.

    Science.gov (United States)

    Smith, Claire E L; Poulter, James A; Antanaviciute, Agne; Kirkham, Jennifer; Brookes, Steven J; Inglehearn, Chris F; Mighell, Alan J

    2017-01-01

    Amelogenesis imperfecta (AI) is the name given to a heterogeneous group of conditions characterized by inherited developmental enamel defects. AI enamel is abnormally thin, soft, fragile, pitted and/or badly discolored, with poor function and aesthetics, causing patients problems such as early tooth loss, severe embarrassment, eating difficulties, and pain. It was first described separately from diseases of dentine nearly 80 years ago, but the underlying genetic and mechanistic basis of the condition is only now coming to light. Mutations in the gene AMELX , encoding an extracellular matrix protein secreted by ameloblasts during enamel formation, were first identified as a cause of AI in 1991. Since then, mutations in at least eighteen genes have been shown to cause AI presenting in isolation of other health problems, with many more implicated in syndromic AI. Some of the encoded proteins have well documented roles in amelogenesis, acting as enamel matrix proteins or the proteases that degrade them, cell adhesion molecules or regulators of calcium homeostasis. However, for others, function is less clear and further research is needed to understand the pathways and processes essential for the development of healthy enamel. Here, we review the genes and mutations underlying AI presenting in isolation of other health problems, the proteins they encode and knowledge of their roles in amelogenesis, combining evidence from human phenotypes, inheritance patterns, mouse models, and in vitro studies. An LOVD resource (http://dna2.leeds.ac.uk/LOVD/) containing all published gene mutations for AI presenting in isolation of other health problems is described. We use this resource to identify trends in the genes and mutations reported to cause AI in the 270 families for which molecular diagnoses have been reported by 23rd May 2017. Finally we discuss the potential value of the translation of AI genetics to clinical care with improved patient pathways and speculate on the

  11. Closing the loop: from paper to protein annotation using supervised Gene Ontology classification.

    Science.gov (United States)

    Gobeill, Julien; Pasche, Emilie; Vishnyakova, Dina; Ruch, Patrick

    2014-01-01

    Gene function curation of the literature with Gene Ontology (GO) concepts is one particularly time-consuming task in genomics, and the help from bioinformatics is highly requested to keep up with the flow of publications. In 2004, the first BioCreative challenge already designed a task of automatic GO concepts assignment from a full text. At this time, results were judged far from reaching the performances required by real curation workflows. In particular, supervised approaches produced the most disappointing results because of lack of training data. Ten years later, the available curation data have massively grown. In 2013, the BioCreative IV GO task revisited the automatic GO assignment task. For this issue, we investigated the power of our supervised classifier, GOCat. GOCat computes similarities between an input text and already curated instances contained in a knowledge base to infer GO concepts. The subtask A consisted in selecting GO evidence sentences for a relevant gene in a full text. For this, we designed a state-of-the-art supervised statistical approach, using a naïve Bayes classifier and the official training set, and obtained fair results. The subtask B consisted in predicting GO concepts from the previous output. For this, we applied GOCat and reached leading results, up to 65% for hierarchical recall in the top 20 outputted concepts. Contrary to previous competitions, machine learning has this time outperformed standard dictionary-based approaches. Thanks to BioCreative IV, we were able to design a complete workflow for curation: given a gene name and a full text, this system is able to select evidence sentences for curation and to deliver highly relevant GO concepts. Contrary to previous competitions, machine learning this time outperformed dictionary-based systems. Observed performances are sufficient for being used in a real semiautomatic curation workflow. GOCat is available at http://eagl.unige.ch/GOCat/. http://eagl.unige.ch/GOCat4FT/.

  12. The Proteasix Ontology.

    Science.gov (United States)

    Arguello Casteleiro, Mercedes; Klein, Julie; Stevens, Robert

    2016-06-04

    The Proteasix Ontology (PxO) is an ontology that supports the Proteasix tool; an open-source peptide-centric tool that can be used to predict automatically and in a large-scale fashion in silico the proteases involved in the generation of proteolytic cleavage fragments (peptides) The PxO re-uses parts of the Protein Ontology, the three Gene Ontology sub-ontologies, the Chemical Entities of Biological Interest Ontology, the Sequence Ontology and bespoke extensions to the PxO in support of a series of roles: 1. To describe the known proteases and their target cleaveage sites. 2. To enable the description of proteolytic cleaveage fragments as the outputs of observed and predicted proteolysis. 3. To use knowledge about the function, species and cellular location of a protease and protein substrate to support the prioritisation of proteases in observed and predicted proteolysis. The PxO is designed to describe the biological underpinnings of the generation of peptides. The peptide-centric PxO seeks to support the Proteasix tool by separating domain knowledge from the operational knowledge used in protease prediction by Proteasix and to support the confirmation of its analyses and results. The Proteasix Ontology may be found at: http://bioportal.bioontology.org/ontologies/PXO . This ontology is free and open for use by everyone.

  13. Identification of mechanosensitive genes during skeletal development: alteration of genes associated with cytoskeletal rearrangement and cell signalling pathways.

    Science.gov (United States)

    Rolfe, Rebecca A; Nowlan, Niamh C; Kenny, Elaine M; Cormican, Paul; Morris, Derek W; Prendergast, Patrick J; Kelly, Daniel; Murphy, Paula

    2014-01-20

    Mechanical stimulation is necessary for regulating correct formation of the skeleton. Here we test the hypothesis that mechanical stimulation of the embryonic skeletal system impacts expression levels of genes implicated in developmentally important signalling pathways in a genome wide approach. We use a mutant mouse model with altered mechanical stimulation due to the absence of limb skeletal muscle (Splotch-delayed) where muscle-less embryos show specific defects in skeletal elements including delayed ossification, changes in the size and shape of cartilage rudiments and joint fusion. We used Microarray and RNA sequencing analysis tools to identify differentially expressed genes between muscle-less and control embryonic (TS23) humerus tissue. We found that 680 independent genes were down-regulated and 452 genes up-regulated in humeri from muscle-less Spd embryos compared to littermate controls (at least 2-fold; corrected p-value ≤0.05). We analysed the resulting differentially expressed gene sets using Gene Ontology annotations to identify significant enrichment of genes associated with particular biological processes, showing that removal of mechanical stimuli from muscle contractions affected genes associated with development and differentiation, cytoskeletal architecture and cell signalling. Among cell signalling pathways, the most strongly disturbed was Wnt signalling, with 34 genes including 19 pathway target genes affected. Spatial gene expression analysis showed that both a Wnt ligand encoding gene (Wnt4) and a pathway antagonist (Sfrp2) are up-regulated specifically in the developing joint line, while the expression of a Wnt target gene, Cd44, is no longer detectable in muscle-less embryos. The identification of 84 genes associated with the cytoskeleton that are down-regulated in the absence of muscle indicates a number of candidate genes that are both mechanoresponsive and potentially involved in mechanotransduction, converting a mechanical stimulus

  14. Creating a Structured Adverse Outcome Pathway Knowledgebase via Ontology-Based Annotations

    Science.gov (United States)

    The Adverse Outcome Pathway (AOP) framework is increasingly used to integrate data based on traditional and emerging toxicity testing paradigms. As the number of AOP descriptions has increased, so has the need to define the AOP in computable terms. Herein, we present a comprehens...

  15. ToxPlorerTM: A Comprehensive Knowledgebase of Toxicity Pathways Using Ontology-driven Information Extraction

    Science.gov (United States)

    Realizing the potential of pathway-based toxicity testing requires a fresh look at how we describe phenomena leading to adverse effects in vivo, how we assess them in vitro and how we extrapolate them in silico across chemicals, doses and species. We developed the ToxPlorer™ fram...

  16. Knowledge retrieval from PubMed abstracts and electronic medical records with the Multiple Sclerosis Ontology.

    Science.gov (United States)

    Malhotra, Ashutosh; Gündel, Michaela; Rajput, Abdul Mateen; Mevissen, Heinz-Theodor; Saiz, Albert; Pastor, Xavier; Lozano-Rubi, Raimundo; Martinez-Lapiscina, Elena H; Martinez-Lapsicina, Elena H; Zubizarreta, Irati; Mueller, Bernd; Kotelnikova, Ekaterina; Toldo, Luca; Hofmann-Apitius, Martin; Villoslada, Pablo

    2015-01-01

    In order to retrieve useful information from scientific literature and electronic medical records (EMR) we developed an ontology specific for Multiple Sclerosis (MS). The MS Ontology was created using scientific literature and expert review under the Protégé OWL environment. We developed a dictionary with semantic synonyms and translations to different languages for mining EMR. The MS Ontology was integrated with other ontologies and dictionaries (diseases/comorbidities, gene/protein, pathways, drug) into the text-mining tool SCAIView. We analyzed the EMRs from 624 patients with MS using the MS ontology dictionary in order to identify drug usage and comorbidities in MS. Testing competency questions and functional evaluation using F statistics further validated the usefulness of MS ontology. Validation of the lexicalized ontology by means of named entity recognition-based methods showed an adequate performance (F score = 0.73). The MS Ontology retrieved 80% of the genes associated with MS from scientific abstracts and identified additional pathways targeted by approved disease-modifying drugs (e.g. apoptosis pathways associated with mitoxantrone, rituximab and fingolimod). The analysis of the EMR from patients with MS identified current usage of disease modifying drugs and symptomatic therapy as well as comorbidities, which are in agreement with recent reports. The MS Ontology provides a semantic framework that is able to automatically extract information from both scientific literature and EMR from patients with MS, revealing new pathogenesis insights as well as new clinical information.

  17. PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data.

    Science.gov (United States)

    Hawkins, Troy; Chitale, Meghana; Luban, Stanislav; Kihara, Daisuke

    2009-02-15

    Protein function prediction is a central problem in bioinformatics, increasing in importance recently due to the rapid accumulation of biological data awaiting interpretation. Sequence data represents the bulk of this new stock and is the obvious target for consideration as input, as newly sequenced organisms often lack any other type of biological characterization. We have previously introduced PFP (Protein Function Prediction) as our sequence-based predictor of Gene Ontology (GO) functional terms. PFP interprets the results of a PSI-BLAST search by extracting and scoring individual functional attributes, searching a wide range of E-value sequence matches, and utilizing conventional data mining techniques to fill in missing information. We have shown it to be effective in predicting both specific and low-resolution functional attributes when sufficient data is unavailable. Here we describe (1) significant improvements to the PFP infrastructure, including the addition of prediction significance and confidence scores, (2) a thorough benchmark of performance and comparisons to other related prediction methods, and (3) applications of PFP predictions to genome-scale data. We applied PFP predictions to uncharacterized protein sequences from 15 organisms. Among these sequences, 60-90% could be annotated with a GO molecular function term at high confidence (>or=80%). We also applied our predictions to the protein-protein interaction network of the Malaria plasmodium (Plasmodium falciparum). High confidence GO biological process predictions (>or=90%) from PFP increased the number of fully enriched interactions in this dataset from 23% of interactions to 94%. Our benchmark comparison shows significant performance improvement of PFP relative to GOtcha, InterProScan, and PSI-BLAST predictions. This is consistent with the performance of PFP as the overall best predictor in both the AFP-SIG '05 and CASP7 function (FN) assessments. PFP is available as a web service at http

  18. Separate enrichment analysis of pathways for up- and downregulated genes.

    Science.gov (United States)

    Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

    2014-03-06

    Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.

  19. Gene expression profiles reveal key pathways and genes associated with neuropathic pain in patients with spinal cord injury.

    Science.gov (United States)

    He, Xijing; Fan, Liying; Wu, Zhongheng; He, Jiaxuan; Cheng, Bin

    2017-04-01

    Previous gene expression profiling studies of neuropathic pain (NP) following spinal cord injury (SCI) have predominantly been performed in animal models. The present study aimed to investigate gene alterations in patients with spinal cord injury and to further examine the mechanisms underlying NP following SCI. The GSE69901 gene expression profile was downloaded from the public Gene Expression Omnibus database. Samples of peripheral blood mononuclear cells (PBMCs) derived from 12 patients with intractable NP and 13 control patients without pain were analyzed to identify the differentially expressed genes (DEGs), followed by functional enrichment analysis and protein‑protein interaction (PPI) network construction. In addition, a transcriptional regulation network was constructed and functional gene clustering was performed. A total of 70 upregulated and 61 downregulated DEGs were identified in the PBMC samples from patients with NP. The upregulated and downregulated genes were significantly involved in different Gene Ontology terms and pathways, including focal adhesion, T cell receptor signaling pathway and mitochondrial function. Glycogen synthase kinase 3 β (GSK3B) was identified as a hub protein in the PPI network. In addition, ornithine decarboxylase 1 (ODC1) and ornithine aminotransferase (OAT) were regulated by additional transcription factors in the regulation network. GSK3B, OAT and ODC1 were significantly enriched in two functional gene clusters, the function of mitochondrial membrane and DNA binding. Focal adhesion and the T cell receptor signaling pathway may be significantly linked with NP, and GSK3B, OAT and ODC1 may be potential targets for the treatment of NP.

  20. Evolutionary rate patterns of the Gibberellin pathway genes

    Directory of Open Access Journals (Sweden)

    Zhang Fu-min

    2009-08-01

    Full Text Available Abstract Background Analysis of molecular evolutionary patterns of different genes within metabolic pathways allows us to determine whether these genes are subject to equivalent evolutionary forces and how natural selection shapes the evolution of proteins in an interacting system. Although previous studies found that upstream genes in the pathway evolved more slowly than downstream genes, the correlation between evolutionary rate and position of the genes in metabolic pathways as well as its implications in molecular evolution are still less understood. Results We sequenced and characterized 7 core structural genes of the gibberellin biosynthetic pathway from 8 representative species of the rice tribe (Oryzeae to address alternative hypotheses regarding evolutionary rates and patterns of metabolic pathway genes. We have detected significant rate heterogeneity among 7 GA pathway genes for both synonymous and nonsynonymous sites. Such rate variation is mostly likely attributed to differences of selection intensity rather than differential mutation pressures on the genes. Unlike previous argument that downstream genes in metabolic pathways would evolve more slowly than upstream genes, the downstream genes in the GA pathway did not exhibited the elevated substitution rate and instead, the genes that encode either the enzyme at the branch point (GA20ox or enzymes catalyzing multiple steps (KO, KAO and GA3ox in the pathway had the lowest evolutionary rates due to strong purifying selection. Our branch and codon models failed to detect signature of positive selection for any lineage and codon of the GA pathway genes. Conclusion This study suggests that significant heterogeneity of evolutionary rate of the GA pathway genes is mainly ascribed to differential constraint relaxation rather than the positive selection and supports the pathway flux theory that predicts that natural selection primarily targets enzymes that have the greatest control on fluxes.

  1. Identification of differentially expressed genes and biological pathways in bladder cancer

    Science.gov (United States)

    Tang, Fucai; He, Zhaohui; Lei, Hanqi; Chen, Yuehan; Lu, Zechao; Zeng, Guohua; Wang, Hangtao

    2018-01-01

    The purpose of the present study was to identify key genes and investigate the related molecular mechanisms of bladder cancer (BC) progression. From the Gene Expression Omnibus database, the gene expression dataset GSE7476 was downloaded, which contained 43 BC samples and 12 normal bladder tissues. GSE7476 was analyzed to screen the differentially expressed genes (DEGs). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses were performed for the DEGs using the DAVID database, and a protein-protein interaction (PPI) network was then constructed using Cytoscape software. The results of the GO analysis showed that the upregulated DEGs were significantly enriched in cell division, nucleoplasm and protein binding, while the downregulated DEGs were significantly enriched in ‘extracellular matrix organization’, ‘proteinaceous extracellular matrix’ and ‘heparin binding’. The results of the KEGG pathway analysis showed that the upregulated DEGs were significantly enriched in the ‘cell cycle’, whereas the downregulated DEGs were significantly enriched in ‘complement and coagulation cascades’. JUN, cyclin-dependent kinase 1, FOS, PCNA, TOP2A, CCND1 and CDH1 were found to be hub genes in the PPI network. Sub-networks revealed that these gene were enriched in significant pathways, including the ‘cell cycle’ signaling pathway and ‘PI3K-Akt signaling pathway’. In summary, the present study identified DEGs and key target genes in the progression of BC, providing potential molecular targets and diagnostic biomarkers for the treatment of BC. PMID:29532898

  2. DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.

    Science.gov (United States)

    Mazandu, Gaston K; Mulder, Nicola J

    2013-09-25

    The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.

  3. GGDonto ontology as a knowledge-base for genetic diseases and disorders of glycan metabolism and their causative genes.

    Science.gov (United States)

    Solovieva, Elena; Shikanai, Toshihide; Fujita, Noriaki; Narimatsu, Hisashi

    2018-04-18

    Inherited mutations in glyco-related genes can affect the biosynthesis and degradation of glycans and result in severe genetic diseases and disorders. The Glyco-Disease Genes Database (GDGDB), which provides information about these diseases and disorders as well as their causative genes, has been developed by the Research Center for Medical Glycoscience (RCMG) and released in April 2010. GDGDB currently provides information on about 80 genetic diseases and disorders caused by single-gene mutations in glyco-related genes. Many biomedical resources provide information about genetic disorders and genes involved in their pathogenesis, but resources focused on genetic disorders known to be related to glycan metabolism are lacking. With the aim of providing more comprehensive knowledge on genetic diseases and disorders of glycan biosynthesis and degradation, we enriched the content of the GDGDB database and improved the methods for data representation. We developed the Genetic Glyco-Diseases Ontology (GGDonto) and a RDF/SPARQL-based user interface using Semantic Web technologies. In particular, we represented the GGDonto content using Semantic Web languages, such as RDF, RDFS, SKOS, and OWL, and created an interactive user interface based on SPARQL queries. This user interface provides features to browse the hierarchy of the ontology, view detailed information on diseases and related genes, and find relevant background information. Moreover, it provides the ability to filter and search information by faceted and keyword searches. Focused on the molecular etiology, pathogenesis, and clinical manifestations of genetic diseases and disorders of glycan metabolism and developed as a knowledge-base for this scientific field, GGDonto provides comprehensive information on various topics, including links to aid the integration with other scientific resources. The availability and accessibility of this knowledge will help users better understand how genetic defects impact the

  4. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks

    Directory of Open Access Journals (Sweden)

    Mazo Ilya

    2007-07-01

    Full Text Available Abstract Background Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. Results We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology

  5. GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology.

    Science.gov (United States)

    Caniza, Horacio; Romero, Alfonso E; Heron, Samuel; Yang, Haixuan; Devoto, Alessandra; Frasca, Marco; Mesiti, Marco; Valentini, Giorgio; Paccanaro, Alberto

    2014-08-01

    We present GOssTo, the Gene Ontology semantic similarity Tool, a user-friendly software system for calculating semantic similarities between gene products according to the Gene Ontology. GOssTo is bundled with six semantic similarity measures, including both term- and graph-based measures, and has extension capabilities to allow the user to add new similarities. Importantly, for any measure, GOssTo can also calculate the Random Walk Contribution that has been shown to greatly improve the accuracy of similarity measures. GOssTo is very fast, easy to use, and it allows the calculation of similarities on a genomic scale in a few minutes on a regular desktop machine. alberto@cs.rhul.ac.uk GOssTo is available both as a stand-alone application running on GNU/Linux, Windows and MacOS from www.paccanarolab.org/gossto and as a web application from www.paccanarolab.org/gosstoweb. The stand-alone application features a simple and concise command line interface for easy integration into high-throughput data processing pipelines. © The Author 2014. Published by Oxford University Press.

  6. Prediction of novel target genes and pathways involved in irinotecan-resistant colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Precious Takondwa Makondi

    Full Text Available Acquired drug resistance to the chemotherapeutic drug irinotecan (the active metabolite of which is SN-38 is one of the significant obstacles in the treatment of advanced colorectal cancer (CRC. The molecular mechanism or targets mediating irinotecan resistance are still unclear. It is urgent to find the irinotecan response biomarkers to improve CRC patients' therapy.Genetic Omnibus Database GSE42387 which contained the gene expression profiles of parental and irinotecan-resistant HCT-116 cell lines was used. Differentially expressed genes (DEGs between parental and irinotecan-resistant cells, protein-protein interactions (PPIs, gene ontologies (GOs and pathway analysis were performed to identify the overall biological changes. The most common DEGs in the PPIs, GOs and pathways were identified and were validated clinically by their ability to predict overall survival and disease free survival. The gene-gene expression correlation and gene-resistance correlation was also evaluated in CRC patients using The Cancer Genomic Atlas data (TCGA.The 135 DEGs were identified of which 36 were upregulated and 99 were down regulated. After mapping the PPI networks, the GOs and the pathways, nine genes (GNAS, PRKACB, MECOM, PLA2G4C, BMP6, BDNF, DLG4, FGF2 and FGF9 were found to be commonly enriched. Signal transduction was the most significant GO and MAPK pathway was the most significant pathway. The five genes (FGF2, FGF9, PRKACB, MECOM and PLA2G4C in the MAPK pathway were all contained in the signal transduction and the levels of those genes were upregulated. The FGF2, FGF9 and MECOM expression were highly associated with CRC patients' survival rate but not PRKACB and PLA2G4C. In addition, FGF9 was also associated with irinotecan resistance and poor disease free survival. FGF2, FGF9 and PRKACB were positively correlated with each other while MECOM correlated positively with FGF9 and PLA2G4C, and correlated negatively with FGF2 and PRKACB after doing gene-gene

  7. Dysregulated Pathway Identification of Alzheimer's Disease Based on Internal Correlation Analysis of Genes and Pathways.

    Science.gov (United States)

    Kong, Wei; Mou, Xiaoyang; Di, Benteng; Deng, Jin; Zhong, Ruxing; Wang, Shuaiqun

    2017-11-20

    Dysregulated pathway identification is an important task which can gain insight into the underlying biological processes of disease. Current pathway-identification methods focus on a set of co-expression genes and single pathways and ignore the correlation between genes and pathways. The method proposed in this study, takes into account the internal correlations not only between genes but also pathways to identifying dysregulated pathways related to Alzheimer's disease (AD), the most common form of dementia. In order to find the significantly differential genes for AD, mutual information (MI) is used to measure interdependencies between genes other than expression valves. Then, by integrating the topology information from KEGG, the significant pathways involved in the feature genes are identified. Next, the distance correlation (DC) is applied to measure the pairwise pathway crosstalks since DC has the advantage of detecting nonlinear correlations when compared to Pearson correlation. Finally, the pathway pairs with significantly different correlations between normal and AD samples are known as dysregulated pathways. The molecular biology analysis demonstrated that many dysregulated pathways related to AD pathogenesis have been discovered successfully by the internal correlation detection. Furthermore, the insights of the dysregulated pathways in the development and deterioration of AD will help to find new effective target genes and provide important theoretical guidance for drug design. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  8. Signalign: An Ontology of DNA as Signal for Comparative Gene Structure Prediction Using Information-Coding-and-Processing Techniques.

    Science.gov (United States)

    Yu, Ning; Guo, Xuan; Gu, Feng; Pan, Yi

    2016-03-01

    Conventional character-analysis-based techniques in genome analysis manifest three main shortcomings-inefficiency, inflexibility, and incompatibility. In our previous research, a general framework, called DNA As X was proposed for character-analysis-free techniques to overcome these shortcomings, where X is the intermediates, such as digit, code, signal, vector, tree, graph network, and so on. In this paper, we further implement an ontology of DNA As Signal, by designing a tool named Signalign for comparative gene structure analysis, in which DNA sequences are converted into signal series, processed by modified method of dynamic time warping and measured by signal-to-noise ratio (SNR). The ontology of DNA As Signal integrates the principles and concepts of other disciplines including information coding theory and signal processing into sequence analysis and processing. Comparing with conventional character-analysis-based methods, Signalign can not only have the equivalent or superior performance, but also enrich the tools and the knowledge library of computational biology by extending the domain from character/string to diverse areas. The evaluation results validate the success of the character-analysis-free technique for improved performances in comparative gene structure prediction.

  9. Expression profiling and gene ontology analysis in fathead minnow (Pimephales promelas) liver following exposure to pulp and paper mill effluents

    Energy Technology Data Exchange (ETDEWEB)

    Costigan, Shannon L.; Werner, Julieta; Ouellet, Jacob D.; Hill, Lauren G. [Department of Biology, Lakehead University, 955 Oliver Road, Ontario P7B 5E1, (Canada); Law, R. David, E-mail: dlaw@lakeheadu.ca [Department of Biology, Lakehead University, 955 Oliver Road, Ontario P7B 5E1, (Canada)

    2012-10-15

    Many studies link pulp and paper mill effluent (PPME) exposure to adverse effects in fish populations present in the mill receiving environments. These impacts are often characteristic of endocrine disruption and may include impaired reproduction, development and survival. While these physiological endpoints are well-characterized, the molecular mechanisms causing them are not yet understood. To investigate changes in gene transcription induced by exposure to a PPME at several stages of treatment, male and female fathead minnows (FHMs) were exposed for 6 days to 25% (v/v) secondary (biologically) treated kraft effluent (TK) or 100% (v/v) combined mill outfall (CMO) from a mill producing both kraft pulp and newsprint. The gene expression changes in the livers of these fish were analyzed using a 22 K oligonucleotide microarray. Exposure to TK or CMO resulted in significant changes in the expression levels of 105 and 238 targets in male FHMs and 296 and 133 targets in females, respectively. Targets were then functionally analyzed using gene ontology tools to identify the biological processes in fish hepatocytes that were affected by exposure to PPME after its secondary treatment. Proteolysis was affected in female FHMs exposed to both TK and CMO. In male FHMs, no processes were affected by TK exposure, while sterol, isoprenoid, steroid and cholesterol biosynthesis and electron transport were up-regulated by CMO exposure. The results presented in this study indicate that short-term exposure to PPMEs affects the expression of reproduction-related genes in the livers of both male and female FHMs, and that secondary treatment of PPMEs may not neutralize all of their metabolic effects in fish. Gene ontology analysis of microarray data may enable identification of biological processes altered by toxicant exposure and thus provide an additional tool for monitoring the impact of PPMEs on fish populations.

  10. Expression profiling and gene ontology analysis in fathead minnow (Pimephales promelas) liver following exposure to pulp and paper mill effluents

    International Nuclear Information System (INIS)

    Costigan, Shannon L.; Werner, Julieta; Ouellet, Jacob D.; Hill, Lauren G.; Law, R. David

    2012-01-01

    Many studies link pulp and paper mill effluent (PPME) exposure to adverse effects in fish populations present in the mill receiving environments. These impacts are often characteristic of endocrine disruption and may include impaired reproduction, development and survival. While these physiological endpoints are well-characterized, the molecular mechanisms causing them are not yet understood. To investigate changes in gene transcription induced by exposure to a PPME at several stages of treatment, male and female fathead minnows (FHMs) were exposed for 6 days to 25% (v/v) secondary (biologically) treated kraft effluent (TK) or 100% (v/v) combined mill outfall (CMO) from a mill producing both kraft pulp and newsprint. The gene expression changes in the livers of these fish were analyzed using a 22 K oligonucleotide microarray. Exposure to TK or CMO resulted in significant changes in the expression levels of 105 and 238 targets in male FHMs and 296 and 133 targets in females, respectively. Targets were then functionally analyzed using gene ontology tools to identify the biological processes in fish hepatocytes that were affected by exposure to PPME after its secondary treatment. Proteolysis was affected in female FHMs exposed to both TK and CMO. In male FHMs, no processes were affected by TK exposure, while sterol, isoprenoid, steroid and cholesterol biosynthesis and electron transport were up-regulated by CMO exposure. The results presented in this study indicate that short-term exposure to PPMEs affects the expression of reproduction-related genes in the livers of both male and female FHMs, and that secondary treatment of PPMEs may not neutralize all of their metabolic effects in fish. Gene ontology analysis of microarray data may enable identification of biological processes altered by toxicant exposure and thus provide an additional tool for monitoring the impact of PPMEs on fish populations.

  11. Quantum ontologies

    International Nuclear Information System (INIS)

    Stapp, H.P.

    1988-12-01

    Quantum ontologies are conceptions of the constitution of the universe that are compatible with quantum theory. The ontological orientation is contrasted to the pragmatic orientation of science, and reasons are given for considering quantum ontologies both within science, and in broader contexts. The principal quantum ontologies are described and evaluated. Invited paper at conference: Bell's Theorem, Quantum Theory, and Conceptions of the Universe, George Mason University, October 20-21, 1988. 16 refs

  12. DMPD: Signalling pathways mediating type I interferon gene expression. [Dynamic Macrophage Pathway CSML Database

    Lifescience Database Archive (English)

    Full Text Available 17904888 Signalling pathways mediating type I interferon gene expression. Edwards M...hways mediating type I interferon gene expression. PubmedID 17904888 Title Signalling pathways...R, Slater L, Johnston SL. Microbes Infect. 2007 Sep;9(11):1245-51. Epub 2007 Jul 1. (.png) (.svg) (.html) (.csml) Show Signalling pat

  13. Gene Expression Profile Reveals Abnormalities of Multiple Signaling Pathways in Mesenchymal Stem Cell Derived from Patients with Systemic Lupus Erythematosus

    Directory of Open Access Journals (Sweden)

    Yu Tang

    2012-01-01

    Full Text Available We aimed to compare bone-marrow-derived mesenchymal stem cells (BMMSCs between systemic lupus erythematosus (SLE and normal controls by means of cDNA microarray, immunohistochemistry, immunofluorescence, and immunoblotting. Our results showed there were a total of 1, 905 genes which were differentially expressed by BMMSCs derived from SLE patients, of which, 652 genes were upregulated and 1, 253 were downregulated. Gene ontology (GO analysis showed that the majority of these genes were related to cell cycle and protein binding. Pathway analysis exhibited that differentially regulated signal pathways involved actin cytoskeleton, focal adhesion, tight junction, and TGF-β pathway. The high protein level of BMP-5 and low expression of Id-1 indicated that there might be dysregulation in BMP/TGF-β signaling pathway. The expression of Id-1 in SLE BMMSCs was reversely correlated with serum TNF-α levels. The protein level of cyclin E decreased in the cell cycling regulation pathway. Moreover, the MAPK signaling pathway was activated in BMMSCs from SLE patients via phosphorylation of ERK1/2 and SAPK/JNK. The actin distribution pattern of BMMSCs from SLE patients was also found disordered. Our results suggested that there were distinguished differences of BMMSCs between SLE patients and normal controls.

  14. Clinical phenotype-based gene prioritization: an initial study using semantic similarity and the human phenotype ontology.

    Science.gov (United States)

    Masino, Aaron J; Dechene, Elizabeth T; Dulik, Matthew C; Wilkens, Alisha; Spinner, Nancy B; Krantz, Ian D; Pennington, Jeffrey W; Robinson, Peter N; White, Peter S

    2014-07-21

    Exome sequencing is a promising method for diagnosing patients with a complex phenotype. However, variant interpretation relative to patient phenotype can be challenging in some scenarios, particularly clinical assessment of rare complex phenotypes. Each patient's sequence reveals many possibly damaging variants that must be individually assessed to establish clear association with patient phenotype. To assist interpretation, we implemented an algorithm that ranks a given set of genes relative to patient phenotype. The algorithm orders genes by the semantic similarity computed between phenotypic descriptors associated with each gene and those describing the patient. Phenotypic descriptor terms are taken from the Human Phenotype Ontology (HPO) and semantic similarity is derived from each term's information content. Model validation was performed via simulation and with clinical data. We simulated 33 Mendelian diseases with 100 patients per disease. We modeled clinical conditions by adding noise and imprecision, i.e. phenotypic terms unrelated to the disease and terms less specific than the actual disease terms. We ranked the causative gene against all 2488 HPO annotated genes. The median causative gene rank was 1 for the optimal and noise cases, 12 for the imprecision case, and 60 for the imprecision with noise case. Additionally, we examined a clinical cohort of subjects with hearing impairment. The disease gene median rank was 22. However, when also considering the patient's exome data and filtering non-exomic and common variants, the median rank improved to 3. Semantic similarity can rank a causative gene highly within a gene list relative to patient phenotype characteristics, provided that imprecision is mitigated. The clinical case results suggest that phenotype rank combined with variant analysis provides significant improvement over the individual approaches. We expect that this combined prioritization approach may increase accuracy and decrease effort for

  15. Generating Gene Ontology-Disease Inferences to Explore Mechanisms of Human Disease at the Comparative Toxicogenomics Database.

    Directory of Open Access Journals (Sweden)

    Allan Peter Davis

    Full Text Available Strategies for discovering common molecular events among disparate diseases hold promise for improving understanding of disease etiology and expanding treatment options. One technique is to leverage curated datasets found in the public domain. The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/ manually curates chemical-gene, chemical-disease, and gene-disease interactions from the scientific literature. The use of official gene symbols in CTD interactions enables this information to be combined with the Gene Ontology (GO file from NCBI Gene. By integrating these GO-gene annotations with CTD's gene-disease dataset, we produce 753,000 inferences between 15,700 GO terms and 4,200 diseases, providing opportunities to explore presumptive molecular underpinnings of diseases and identify biological similarities. Through a variety of applications, we demonstrate the utility of this novel resource. As a proof-of-concept, we first analyze known repositioned drugs (e.g., raloxifene and sildenafil and see that their target diseases have a greater degree of similarity when comparing GO terms vs. genes. Next, a computational analysis predicts seemingly non-intuitive diseases (e.g., stomach ulcers and atherosclerosis as being similar to bipolar disorder, and these are validated in the literature as reported co-diseases. Additionally, we leverage other CTD content to develop testable hypotheses about thalidomide-gene networks to treat seemingly disparate diseases. Finally, we illustrate how CTD tools can rank a series of drugs as potential candidates for repositioning against B-cell chronic lymphocytic leukemia and predict cisplatin and the small molecule inhibitor JQ1 as lead compounds. The CTD dataset is freely available for users to navigate pathologies within the context of extensive biological processes, molecular functions, and cellular components conferred by GO. This inference set should aid researchers, bioinformaticists, and

  16. Prediction of novel target genes and pathways involved in bevacizumab-resistant colorectal cancer

    Science.gov (United States)

    Makondi, Precious Takondwa; Lee, Chia-Hwa; Huang, Chien-Yu; Chu, Chi-Ming; Chang, Yu-Jia

    2018-01-01

    Bevacizumab combined with cytotoxic chemotherapy is the backbone of metastatic colorectal cancer (mCRC) therapy; however, its treatment efficacy is hampered by therapeutic resistance. Therefore, understanding the mechanisms underlying bevacizumab resistance is crucial to increasing the therapeutic efficacy of bevacizumab. The Gene Expression Omnibus (GEO) database (dataset, GSE86525) was used to identify the key genes and pathways involved in bevacizumab-resistant mCRC. The GEO2R web tool was used to identify differentially expressed genes (DEGs). Functional and pathway enrichment analyses of the DEGs were performed using the Database for Annotation, Visualization, and Integrated Discovery(DAVID). Protein–protein interaction (PPI) networks were established using the Search Tool for the Retrieval of Interacting Genes/Proteins database(STRING) and visualized using Cytoscape software. A total of 124 DEGs were obtained, 57 of which upregulated and 67 were downregulated. PPI network analysis showed that seven upregulated genes and nine downregulated genes exhibited high PPI degrees. In the functional enrichment, the DEGs were mainly enriched in negative regulation of phosphate metabolic process and positive regulation of cell cycle process gene ontologies (GOs); the enriched pathways were the phosphoinositide 3-kinase-serine/threonine kinase signaling pathway, bladder cancer, and microRNAs in cancer. Cyclin-dependent kinase inhibitor 1A(CDKN1A), toll-like receptor 4 (TLR4), CD19 molecule (CD19), breast cancer 1, early onset (BRCA1), platelet-derived growth factor subunit A (PDGFA), and matrix metallopeptidase 1 (MMP1) were the DEGs involved in the pathways and the PPIs. The clinical validation of the DEGs in mCRC (TNM clinical stages 3 and 4) revealed that high PDGFA expression levels were associated with poor overall survival, whereas high BRCA1 and MMP1 expression levels were associated with favorable progress free survival(PFS). The identified genes and pathways

  17. Prediction of novel target genes and pathways involved in bevacizumab-resistant colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Precious Takondwa Makondi

    Full Text Available Bevacizumab combined with cytotoxic chemotherapy is the backbone of metastatic colorectal cancer (mCRC therapy; however, its treatment efficacy is hampered by therapeutic resistance. Therefore, understanding the mechanisms underlying bevacizumab resistance is crucial to increasing the therapeutic efficacy of bevacizumab. The Gene Expression Omnibus (GEO database (dataset, GSE86525 was used to identify the key genes and pathways involved in bevacizumab-resistant mCRC. The GEO2R web tool was used to identify differentially expressed genes (DEGs. Functional and pathway enrichment analyses of the DEGs were performed using the Database for Annotation, Visualization, and Integrated Discovery(DAVID. Protein-protein interaction (PPI networks were established using the Search Tool for the Retrieval of Interacting Genes/Proteins database(STRING and visualized using Cytoscape software. A total of 124 DEGs were obtained, 57 of which upregulated and 67 were downregulated. PPI network analysis showed that seven upregulated genes and nine downregulated genes exhibited high PPI degrees. In the functional enrichment, the DEGs were mainly enriched in negative regulation of phosphate metabolic process and positive regulation of cell cycle process gene ontologies (GOs; the enriched pathways were the phosphoinositide 3-kinase-serine/threonine kinase signaling pathway, bladder cancer, and microRNAs in cancer. Cyclin-dependent kinase inhibitor 1A(CDKN1A, toll-like receptor 4 (TLR4, CD19 molecule (CD19, breast cancer 1, early onset (BRCA1, platelet-derived growth factor subunit A (PDGFA, and matrix metallopeptidase 1 (MMP1 were the DEGs involved in the pathways and the PPIs. The clinical validation of the DEGs in mCRC (TNM clinical stages 3 and 4 revealed that high PDGFA expression levels were associated with poor overall survival, whereas high BRCA1 and MMP1 expression levels were associated with favorable progress free survival(PFS. The identified genes and pathways

  18. Identification of protein features encoded by alternative exons using Exon Ontology.

    Science.gov (United States)

    Tranchevent, Léon-Charles; Aubé, Fabien; Dulaurier, Louis; Benoit-Pilven, Clara; Rey, Amandine; Poret, Arnaud; Chautard, Emilie; Mortada, Hussein; Desmet, François-Olivier; Chakrama, Fatima Zahra; Moreno-Garcia, Maira Alejandra; Goillot, Evelyne; Janczarski, Stéphane; Mortreux, Franck; Bourgeois, Cyril F; Auboeuf, Didier

    2017-06-01

    Transcriptomic genome-wide analyses demonstrate massive variation of alternative splicing in many physiological and pathological situations. One major challenge is now to establish the biological contribution of alternative splicing variation in physiological- or pathological-associated cellular phenotypes. Toward this end, we developed a computational approach, named "Exon Ontology," based on terms corresponding to well-characterized protein features organized in an ontology tree. Exon Ontology is conceptually similar to Gene Ontology-based approaches but focuses on exon-encoded protein features instead of gene level functional annotations. Exon Ontology describes the protein features encoded by a selected list of exons and looks for potential Exon Ontology term enrichment. By applying this strategy to exons that are differentially spliced between epithelial and mesenchymal cells and after extensive experimental validation, we demonstrate that Exon Ontology provides support to discover specific protein features regulated by alternative splicing. We also show that Exon Ontology helps to unravel biological processes that depend on suites of coregulated alternative exons, as we uncovered a role of epithelial cell-enriched splicing factors in the AKT signaling pathway and of mesenchymal cell-enriched splicing factors in driving splicing events impacting on autophagy. Freely available on the web, Exon Ontology is the first computational resource that allows getting a quick insight into the protein features encoded by alternative exons and investigating whether coregulated exons contain the same biological information. © 2017 Tranchevent et al.; Published by Cold Spring Harbor Laboratory Press.

  19. Genes encoding enzymes of the lignin biosynthesis pathway in Eucalyptus

    Directory of Open Access Journals (Sweden)

    Ricardo Harakava

    2005-01-01

    Full Text Available Eucalyptus ESTs libraries were screened for genes involved in lignin biosynthesis. This search was performed under the perspective of recent revisions on the monolignols biosynthetic pathway. Eucalyptus orthologues of all genes of the phenylpropanoid pathway leading to lignin biosynthesis reported in other plant species were identified. A library made with mRNAs extracted from wood was enriched for genes involved in lignin biosynthesis and allowed to infer the isoforms of each gene family that play a major role in wood lignin formation. Analysis of the wood library suggests that, besides the enzymes of the phenylpropanoids pathway, chitinases, laccases, and dirigent proteins are also important for lignification. Colocalization of several enzymes on the endoplasmic reticulum membrane, as predicted by amino acid sequence analysis, supports the existence of metabolic channeling in the phenylpropanoid pathway. This study establishes a framework for future investigations on gene expression level, protein expression and enzymatic assays, sequence polymorphisms, and genetic engineering.

  20. Identification of differentially expressed genes and signaling pathways in ovarian cancer by integrated bioinformatics analysis

    Directory of Open Access Journals (Sweden)

    Yang X

    2018-03-01

    Full Text Available Xiao Yang,1 Shaoming Zhu,2 Li Li,3 Li Zhang,1 Shu Xian,1 Yanqing Wang,1 Yanxiang Cheng1 1Department of Obstetrics and Gynecology, 2Department of Urology, Renmin Hospital of Wuhan University, 3Department of Pharmacology, Wuhan University Health Science Center, Wuhan, Hubei, People’s Republic of China Background: The mortality rate associated with ovarian cancer ranks the highest among gynecological malignancies. However, the cause and underlying molecular events of ovarian cancer are not clear. Here, we applied integrated bioinformatics to identify key pathogenic genes involved in ovarian cancer and reveal potential molecular mechanisms. Results: The expression profiles of GDS3592, GSE54388, and GSE66957 were downloaded from the Gene Expression Omnibus (GEO database, which contained 115 samples, including 85 cases of ovarian cancer samples and 30 cases of normal ovarian samples. The three microarray datasets were integrated to obtain differentially expressed genes (DEGs and were deeply analyzed by bioinformatics methods. The gene ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG pathway enrichments of DEGs were performed by DAVID and KOBAS online analyses, respectively. The protein–protein interaction (PPI networks of the DEGs were constructed from the STRING database. A total of 190 DEGs were identified in the three GEO datasets, of which 99 genes were upregulated and 91 genes were downregulated. GO analysis showed that the biological functions of DEGs focused primarily on regulating cell proliferation, adhesion, and differentiation and intracellular signal cascades. The main cellular components include cell membranes, exosomes, the cytoskeleton, and the extracellular matrix. The molecular functions include growth factor activity, protein kinase regulation, DNA binding, and oxygen transport activity. KEGG pathway analysis showed that these DEGs were mainly involved in the Wnt signaling pathway, amino acid metabolism, and the

  1. DOSE RESPONSE FROM HIGH THROUGHPUT GENE EXPRESSION STUDIES AND THE INFLUENCE OF TIME AND CELL LINE ON INFERRED MODE OF ACTION BY ONTOLOGIC ENRICHMENT (SOT)

    Science.gov (United States)

    Gene expression with ontologic enrichment and connectivity mapping tools is widely used to infer modes of action (MOA) for therapeutic drugs. Despite progress in high-throughput (HT) genomic systems, strategies suitable to identify industrial chemical MOA are needed. The L1000 is...

  2. HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins.

    Science.gov (United States)

    Wan, Shibiao; Mak, Man-Wai; Kung, Sun-Yuan

    2014-01-01

    Protein subcellular localization prediction, as an essential step to elucidate the functions in vivo of proteins and identify drugs targets, has been extensively studied in previous decades. Instead of only determining subcellular localization of single-label proteins, recent studies have focused on predicting both single- and multi-location proteins. Computational methods based on Gene Ontology (GO) have been demonstrated to be superior to methods based on other features. However, existing GO-based methods focus on the occurrences of GO terms and disregard their relationships. This paper proposes a multi-label subcellular-localization predictor, namely HybridGO-Loc, that leverages not only the GO term occurrences but also the inter-term relationships. This is achieved by hybridizing the GO frequencies of occurrences and the semantic similarity between GO terms. Given a protein, a set of GO terms are retrieved by searching against the gene ontology database, using the accession numbers of homologous proteins obtained via BLAST search as the keys. The frequency of GO occurrences and semantic similarity (SS) between GO terms are used to formulate frequency vectors and semantic similarity vectors, respectively, which are subsequently hybridized to construct fusion vectors. An adaptive-decision based multi-label support vector machine (SVM) classifier is proposed to classify the fusion vectors. Experimental results based on recent benchmark datasets and a new dataset containing novel proteins show that the proposed hybrid-feature predictor significantly outperforms predictors based on individual GO features as well as other state-of-the-art predictors. For readers' convenience, the HybridGO-Loc server, which is for predicting virus or plant proteins, is available online at http://bioinfo.eie.polyu.edu.hk/HybridGoServer/.

  3. Argot2: a large scale function prediction tool relying on semantic similarity of weighted Gene Ontology terms.

    Science.gov (United States)

    Falda, Marco; Toppo, Stefano; Pescarolo, Alessandro; Lavezzo, Enrico; Di Camillo, Barbara; Facchinetti, Andrea; Cilia, Elisa; Velasco, Riccardo; Fontana, Paolo

    2012-03-28

    Predicting protein function has become increasingly demanding in the era of next generation sequencing technology. The task to assign a curator-reviewed function to every single sequence is impracticable. Bioinformatics tools, easy to use and able to provide automatic and reliable annotations at a genomic scale, are necessary and urgent. In this scenario, the Gene Ontology has provided the means to standardize the annotation classification with a structured vocabulary which can be easily exploited by computational methods. Argot2 is a web-based function prediction tool able to annotate nucleic or protein sequences from small datasets up to entire genomes. It accepts as input a list of sequences in FASTA format, which are processed using BLAST and HMMER searches vs UniProKB and Pfam databases respectively; these sequences are then annotated with GO terms retrieved from the UniProtKB-GOA database and the terms are weighted using the e-values from BLAST and HMMER. The weighted GO terms are processed according to both their semantic similarity relations described by the Gene Ontology and their associated score. The algorithm is based on the original idea developed in a previous tool called Argot. The entire engine has been completely rewritten to improve both accuracy and computational efficiency, thus allowing for the annotation of complete genomes. The revised algorithm has been already employed and successfully tested during in-house genome projects of grape and apple, and has proven to have a high precision and recall in all our benchmark conditions. It has also been successfully compared with Blast2GO, one of the methods most commonly employed for sequence annotation. The server is freely accessible at http://www.medcomp.medicina.unipd.it/Argot2.

  4. Phylogenetic origin and diversification of RNAi pathway genes in insects

    DEFF Research Database (Denmark)

    Dowling, Daniel; Pauli, Thomas; Donath, Alexander

    2016-01-01

    RNAinterference (RNAi) refers tothe set ofmolecular processes foundin eukaryotic organisms in which smallRNAmolecules mediate the silencing or down-regulation of target genes. In insects, RNAi serves a number of functions, including regulation of endogenous genes, anti-viral defense, and defense...... against transposable elements. Despite being well studied in model organisms, such as Drosophila, the distribution of core RNAi pathway genes and their evolution in insects is not well understood. Here we present the most comprehensive overview of the distribution and diversity of core RNAi pathway genes...... across 100 insect species, encompassing all currently recognized insect orders. We inferred the phylogenetic origin of insect-specific RNAi pathway genes and also identified several hitherto unrecorded gene expansions using whole-body transcriptome data from the international 1KITE (1000 Insect...

  5. On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report.

    Directory of Open Access Journals (Sweden)

    Paul D Thomas

    Full Text Available A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011 has proposed a metric for the "functional similarity" between two genes that uses only the Gene Ontology (GO annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture" (or, more properly, the "ortholog functional conservation hypothesis". First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1 that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption" (absence of an annotation does not imply absence of a function, and 2 that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the

  6. Text mining in cancer gene and pathway prioritization.

    Science.gov (United States)

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes.

  7. Transcriptome and Gene Ontology (GO) Enrichment Analysis Reveals Genes Involved in Biotin Metabolism That Affect L-Lysine Production in Corynebacterium glutamicum.

    Science.gov (United States)

    Kim, Hong-Il; Kim, Jong-Hyeon; Park, Young-Jin

    2016-03-09

    Corynebacterium glutamicum is widely used for amino acid production. In the present study, 543 genes showed a significant change in their mRNA expression levels in L-lysine-producing C. glutamicum ATCC21300 than that in the wild-type C. glutamicum ATCC13032. Among these 543 differentially expressed genes (DEGs), 28 genes were up- or downregulated. In addition, 454 DEGs were functionally enriched and categorized based on BLAST sequence homologies and gene ontology (GO) annotations using the Blast2GO software. Interestingly, NCgl0071 (bioB, encoding biotin synthase) was expressed at levels ~20-fold higher in the L-lysine-producing ATCC21300 strain than that in the wild-type ATCC13032 strain. Five other genes involved in biotin metabolism or transport--NCgl2515 (bioA, encoding adenosylmethionine-8-amino-7-oxononanoate aminotransferase), NCgl2516 (bioD, encoding dithiobiotin synthetase), NCgl1883, NCgl1884, and NCgl1885--were also expressed at significantly higher levels in the L-lysine-producing ATCC21300 strain than that in the wild-type ATCC13032 strain, which we determined using both next-generation RNA sequencing and quantitative real-time PCR analysis. When we disrupted the bioB gene in C. glutamicum ATCC21300, L-lysine production decreased by approximately 76%, and the three genes involved in biotin transport (NCgl1883, NCgl1884, and NCgl1885) were significantly downregulated. These results will be helpful to improve our understanding of C. glutamicum for industrial amino acid production.

  8. Investigating Correlation between Protein Sequence Similarity and Semantic Similarity Using Gene Ontology Annotations.

    Science.gov (United States)

    Ikram, Najmul; Qadir, Muhammad Abdul; Afzal, Muhammad Tanvir

    2018-01-01

    Sequence similarity is a commonly used measure to compare proteins. With the increasing use of ontologies, semantic (function) similarity is getting importance. The correlation between these measures has been applied in the evaluation of new semantic similarity methods, and in protein function prediction. In this research, we investigate the relationship between the two similarity methods. The results suggest absence of a strong correlation between sequence and semantic similarities. There is a large number of proteins with low sequence similarity and high semantic similarity. We observe that Pearson's correlation coefficient is not sufficient to explain the nature of this relationship. Interestingly, the term semantic similarity values above 0 and below 1 do not seem to play a role in improving the correlation. That is, the correlation coefficient depends only on the number of common GO terms in proteins under comparison, and the semantic similarity measurement method does not influence it. Semantic similarity and sequence similarity have a distinct behavior. These findings are of significant effect for future works on protein comparison, and will help understand the semantic similarity between proteins in a better way.

  9. OmniSearch: a semantic search system based on the Ontology for MIcroRNA Target (OMIT) for microRNA-target gene interaction data.

    Science.gov (United States)

    Huang, Jingshan; Gutierrez, Fernando; Strachan, Harrison J; Dou, Dejing; Huang, Weili; Smith, Barry; Blake, Judith A; Eilbeck, Karen; Natale, Darren A; Lin, Yu; Wu, Bin; Silva, Nisansa de; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming; Ruttenberg, Alan

    2016-01-01

    As a special class of non-coding RNAs (ncRNAs), microRNAs (miRNAs) perform important roles in numerous biological and pathological processes. The realization of miRNA functions depends largely on how miRNAs regulate specific target genes. It is therefore critical to identify, analyze, and cross-reference miRNA-target interactions to better explore and delineate miRNA functions. Semantic technologies can help in this regard. We previously developed a miRNA domain-specific application ontology, Ontology for MIcroRNA Target (OMIT), whose goal was to serve as a foundation for semantic annotation, data integration, and semantic search in the miRNA field. In this paper we describe our continuing effort to develop the OMIT, and demonstrate its use within a semantic search system, OmniSearch, designed to facilitate knowledge capture of miRNA-target interaction data. Important changes in the current version OMIT are summarized as: (1) following a modularized ontology design (with 2559 terms imported from the NCRO ontology); (2) encoding all 1884 human miRNAs (vs. 300 in previous versions); and (3) setting up a GitHub project site along with an issue tracker for more effective community collaboration on the ontology development. The OMIT ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/omit.owl. The OmniSearch system is also free and open to all users, accessible at: http://omnisearch.soc.southalabama.edu/index.php/Software.

  10. Linking human diseases to animal models using ontology-based phenotype annotation.

    Directory of Open Access Journals (Sweden)

    Nicole L Washington

    2009-11-01

    Full Text Available Scientists and clinicians who study genetic alterations and disease have traditionally described phenotypes in natural language. The considerable variation in these free-text descriptions has posed a hindrance to the important task of identifying candidate genes and models for human diseases and indicates the need for a computationally tractable method to mine data resources for mutant phenotypes. In this study, we tested the hypothesis that ontological annotation of disease phenotypes will facilitate the discovery of new genotype-phenotype relationships within and across species. To describe phenotypes using ontologies, we used an Entity-Quality (EQ methodology, wherein the affected entity (E and how it is affected (Q are recorded using terms from a variety of ontologies. Using this EQ method, we annotated the phenotypes of 11 gene-linked human diseases described in Online Mendelian Inheritance in Man (OMIM. These human annotations were loaded into our Ontology-Based Database (OBD along with other ontology-based phenotype descriptions of mutants from various model organism databases. Phenotypes recorded with this EQ method can be computationally compared based on the hierarchy of terms in the ontologies and the frequency of annotation. We utilized four similarity metrics to compare phenotypes and developed an ontology of homologous and analogous anatomical structures to compare phenotypes between species. Using these tools, we demonstrate that we can identify, through the similarity of the recorded phenotypes, other alleles of the same gene, other members of a signaling pathway, and orthologous genes and pathway members across species. We conclude that EQ-based annotation of phenotypes, in conjunction with a cross-species ontology, and a variety of similarity metrics can identify biologically meaningful similarities between genes by comparing phenotypes alone. This annotation and search method provides a novel and efficient means to identify

  11. Ontological Planning

    Directory of Open Access Journals (Sweden)

    Ahmet Alkan

    2017-12-01

    • Is it possible to redefine ontology within the hierarchical structure of planning? We are going to seek answers to some of these questions within the limited scope of this paper and we are going to offer the rest for discussion by just asking them. In light of these assessments, drawing attention, based on ontological knowledge relying on the wholeness of universe, to the question, on macro level planning, of whether or not the ontological realities of man, energy and movements of thinking can provide macro data for planning on a universal level as important factors affecting mankind will be one of the limited objectives of the paper.

  12. De novo assembly of Eugenia uniflora L. transcriptome and identification of genes from the terpenoid biosynthesis pathway.

    Science.gov (United States)

    Guzman, Frank; Kulcheski, Franceli Rodrigues; Turchetto-Zolet, Andreia Carina; Margis, Rogerio

    2014-12-01

    Pitanga (Eugenia uniflora L.) is a member of the Myrtaceae family and is of particular interest due to its medicinal properties that are attributed to specialized metabolites with known biological activities. Among these molecules, terpenoids are the most abundant in essential oils that are found in the leaves and represent compounds with potential pharmacological benefits. The terpene diversity observed in Myrtaceae is determined by the activity of different members of the terpene synthase and oxidosqualene cyclase families. Therefore, the aim of this study was to perform a de novo assembly of transcripts from E. uniflora leaves and to annotation to identify the genes potentially involved in the terpenoid biosynthesis pathway and terpene diversity. In total, 72,742 unigenes with a mean length of 1048bp were identified. Of these, 43,631 and 36,289 were annotated with the NCBI non-redundant protein and Swiss-Prot databases, respectively. The gene ontology categorized the sequences into 53 functional groups. A metabolic pathway analysis with KEGG revealed 8,625 unigenes assigned to 141 metabolic pathways and 40 unigenes predicted to be associated with the biosynthesis of terpenoids. Furthermore, we identified four putative full-length terpene synthase genes involved in sesquiterpenes and monoterpenes biosynthesis, and three putative full-length oxidosqualene cyclase genes involved in the triterpenes biosynthesis. The expression of these genes was validated in different E. uniflora tissues. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  13. Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

    Science.gov (United States)

    Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

    2016-01-01

    We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.

  14. Gene prediction validation and functional analysis of redundant pathways

    DEFF Research Database (Denmark)

    Sønderkær, Mads

    2011-01-01

    have employed a large mRNA-seq data set to improve and validate ab initio predicted gene models. This direct experimental evidence also provides reliable determinations of UTR regions and polyadenylation sites, which are not easily predicted in plants. Furthermore, once an annotated genome sequence...... is available, gene expression by mRNA-Seq enables acquisition of a more complete overview of gene isoform usage in complex enzymatic pathways enabling the identification of key genes. Metabolism in potatoes This information is useful e.g. for crop improvement based on manipulation of agronomically important...

  15. Polymorphisms in inflammation pathway genes and endometrial cancer risk

    Science.gov (United States)

    Delahanty, Ryan J.; Xiang, Yong-Bing; Spurdle, Amanda; Beeghly-Fadiel, Alicia; Long, Jirong; Thompson, Deborah; Tomlinson, Ian; Yu, Herbert; Lambrechts, Diether; Dörk, Thilo; Goodman, Marc T.; Zheng, Ying; Salvesen, Helga B.; Bao, Ping-Ping; Amant, Frederic; Beckmann, Matthias W.; Coenegrachts, Lieve; Coosemans, An; Dubrowinskaja, Natalia; Dunning, Alison; Runnebaum, Ingo B.; Easton, Douglas; Ekici, Arif B.; Fasching, Peter A.; Halle, Mari K.; Hein, Alexander; Howarth, Kimberly; Gorman, Maggie; Kaydarova, Dylyara; Krakstad, Camilla; Lose, Felicity; Lu, Lingeng; Lurie, Galina; O’Mara, Tracy; Matsuno, Rayna K.; Pharoah, Paul; Risch, Harvey; Corssen, Madeleine; Trovik, Jone; Turmanov, Nurzhan; Wen, Wanqing; Lu, Wei; Cai, Qiuyin; Zheng, Wei; Shu, Xiao-Ou

    2013-01-01

    Background Experimental and epidemiological evidence have suggested that chronic inflammation may play a critical role in endometrial carcinogenesis. Methods To investigate this hypothesis, a two-stage study was carried out to evaluate single nucleotide polymorphisms (SNPs) in inflammatory pathway genes in association with endometrial cancer risk. In stage 1, 64 candidate pathway genes were identified and 4,542 directly genotyped or imputed SNPs were analyzed among 832 endometrial cancer cases and 2,049 controls, using data from the Shanghai Endometrial Cancer Genetics Study. Linkage disequilibrium of stage 1 SNPs significantly associated with endometrial cancer (PAsian- and European-ancestry samples. Conclusions These findings lend support to the hypothesis that genetic polymorphisms in genes involved in the inflammatory pathway may contribute to genetic susceptibility to endometrial cancer. Impact Statement This study adds to the growing evidence that inflammation plays an important role in endometrial carcinogenesis. PMID:23221126

  16. Transcriptome sequencing and de novo assembly in arecanut, Areca catechu L elucidates the secondary metabolite pathway genes

    Directory of Open Access Journals (Sweden)

    Ramaswamy Manimekalai

    2018-03-01

    Full Text Available Areca catechu L. belongs to the Arecaceae family which comprises many economically important palms. The palm is a source of alkaloids and carotenoids. The lack of ample genetic information in public databases has been a constraint for the genetic improvement of arecanut. To gain molecular insight into the palm, high throughput RNA sequencing and de novo assembly of arecanut leaf transcriptome was undertaken in the present study. A total 56,321,907 paired end reads of 101 bp length consisting of 11.343 Gb nucleotides were generated. De novo assembly resulted in 48,783 good quality transcripts, of which 67% of transcripts could be annotated against NCBI non – redundant database. The Gene Ontology (GO analysis with UniProt database identified 9222 biological process, 11268 molecular function and 7574 cellular components GO terms. Large scale expression profiling through Fragments per Kilobase per Million mapped reads (FPKM showed major genes involved in different metabolic pathways of the plant. Metabolic pathway analysis of the assembled transcripts identified 124 plant related pathways. The transcripts related to carotenoid and alkaloid biosynthetic pathways had more number of reads and FPKM values suggesting higher expression of these genes. The arecanut transcript sequences generated in the study showed high similarity with coconut, oil palm and date palm sequences retrieved from public domains. We also identified 6853 genic SSR regions in the arecanut. The possible primers were designed for SSR detection and this would simplify the future efforts in genetic characterization of arecanut.

  17. Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes.

    Science.gov (United States)

    Biankin, Andrew V; Waddell, Nicola; Kassahn, Karin S; Gingras, Marie-Claude; Muthuswamy, Lakshmi B; Johns, Amber L; Miller, David K; Wilson, Peter J; Patch, Ann-Marie; Wu, Jianmin; Chang, David K; Cowley, Mark J; Gardiner, Brooke B; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J; Gill, Anthony J; Pinho, Andreia V; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R Scott; Humphris, Jeremy L; Kaplan, Warren; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chou, Angela; Chin, Venessa T; Chantrill, Lorraine A; Mawson, Amanda; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Daly, Roger J; Merrett, Neil D; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M; Fisher, William E; Brunicardi, F Charles; Hodges, Sally E; Reid, Jeffrey G; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R; Dinh, Huyen; Buhay, Christian J; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E; Yung, Christina K; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A; Petersen, Gloria M; Gallinger, Steven; Hruban, Ralph H; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Schulick, Richard D; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A; Mann, Karen M; Jenkins, Nancy A; Perez-Mancera, Pedro A; Adams, David J; Largaespada, David A; Wessels, Lodewyk F A; Rust, Alistair G; Stein, Lincoln D; Tuveson, David A; Copeland, Neal G; Musgrove, Elizabeth A; Scarpa, Aldo; Eshleman, James R; Hudson, Thomas J; Sutherland, Robert L; Wheeler, David A; Pearson, John V; McPherson, John D; Gibbs, Richard A; Grimmond, Sean M

    2012-11-15

    Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis.

  18. Lentiviral gene ontology (LeGO) vectors equipped with novel drug-selectable fluorescent proteins: new building blocks for cell marking and multi-gene analysis.

    Science.gov (United States)

    Weber, K; Mock, U; Petrowitz, B; Bartsch, U; Fehse, B

    2010-04-01

    Vector-encoded fluorescent proteins (FPs) facilitate unambiguous identification or sorting of gene-modified cells by fluorescence-activated cell sorting (FACS). Exploiting this feature, we have recently developed lentiviral gene ontology (LeGO) vectors (www.LentiGO-Vectors.de) for multi-gene analysis in different target cells. In this study, we extend the LeGO principle by introducing 10 different drug-selectable FPs created by fusing one of the five selection marker (protecting against blasticidin, hygromycin, neomycin, puromycin and zeocin) and one of the five FP genes (Cerulean, eGFP, Venus, dTomato and mCherry). All tested fusion proteins allowed both fluorescence-mediated detection and drug-mediated selection of LeGO-transduced cells. Newly generated codon-optimized hygromycin- and neomycin-resistance genes showed improved expression as compared with their ancestors. New LeGO constructs were produced at titers >10(6) per ml (for non-concentrated supernatants). We show efficient combinatorial marking and selection of various cells, including mesenchymal stem cells, simultaneously transduced with different LeGO constructs. Inclusion of the cytomegalovirus early enhancer/chicken beta-actin promoter into LeGO vectors facilitated robust transgene expression in and selection of neural stem cells and their differentiated progeny. We suppose that the new drug-selectable markers combining advantages of FACS and drug selection are well suited for numerous applications and vector systems. Their inclusion into LeGO vectors opens new possibilities for (stem) cell tracking and functional multi-gene analysis.

  19. The use of semantic similarity measures for optimally integrating heterogeneous Gene Ontology data from large scale annotation pipelines

    Directory of Open Access Journals (Sweden)

    Gaston K Mazandu

    2014-08-01

    Full Text Available With the advancement of new high throughput sequencing technologies, there has been an increase in the number of genome sequencing projects worldwide, which has yielded complete genome sequences of human, animals and plants. Subsequently, several labs have focused on genome annotation, consisting of assigning functions to gene products, mostly using Gene Ontology (GO terms. As a consequence, there is an increased heterogeneity in annotations across genomes due to different approaches used by different pipelines to infer these annotations and also due to the nature of the GO structure itself. This makes a curator's task difficult, even if they adhere to the established guidelines for assessing these protein annotations. Here we develop a genome-scale approach for integrating GO annotations from different pipelines using semantic similarity measures. We used this approach to identify inconsistencies and similarities in functional annotations between orthologs of human and Drosophila melanogaster, to assess the quality of GO annotations derived from InterPro2GO mappings compared to manually annotated GO annotations for the Drosophila melanogaster proteome from a FlyBase dataset and human, and to filter GO annotation data for these proteomes. Results obtained indicate that an efficient integration of GO annotations eliminates redundancy up to 27.08 and 22.32% in the Drosophila melanogaster and human GO annotation datasets, respectively. Furthermore, we identified lack of and missing annotations for some orthologs, and annotation mismatches between InterPro2GO and manual pipelines in these two proteomes, thus requiring further curation. This simplifies and facilitates tasks of curators in assessing protein annotations, reduces redundancy and eliminates inconsistencies in large annotation datasets for ease of comparative functional genomics.

  20. Gene expression profiling, pathway analysis and subtype classification reveal molecular heterogeneity in hepatocellular carcinoma and suggest subtype specific therapeutic targets.

    Science.gov (United States)

    Agarwal, Rahul; Narayan, Jitendra; Bhattacharyya, Amitava; Saraswat, Mayank; Tomar, Anil Kumar

    2017-10-01

    A very low 5-year survival rate among hepatocellular carcinoma (HCC) patients is mainly due to lack of early stage diagnosis, distant metastasis and high risk of postoperative recurrence. Hence ascertaining novel biomarkers for early diagnosis and patient specific therapeutics is crucial and urgent. Here, we have performed a comprehensive analysis of the expression data of 423 HCC patients (373 tumors and 50 controls) downloaded from The Cancer Genome Atlas (TCGA) followed by pathway enrichment by gene ontology annotations, subtype classification and overall survival analysis. The differential gene expression analysis using non-parametric Wilcoxon test revealed a total of 479 up-regulated and 91 down-regulated genes in HCC compared to controls. The list of top differentially expressed genes mainly consists of tumor/cancer associated genes, such as AFP, THBS4, LCN2, GPC3, NUF2, etc. The genes over-expressed in HCC were mainly associated with cell cycle pathways. In total, 59 kinases associated genes were found over-expressed in HCC, including TTK, MELK, BUB1, NEK2, BUB1B, AURKB, PLK1, CDK1, PKMYT1, PBK, etc. Overall four distinct HCC subtypes were predicted using consensus clustering method. Each subtype was unique in terms of gene expression, pathway enrichment and median survival. Conclusively, this study has exposed a number of interesting genes which can be exploited in future as potential markers of HCC, diagnostic as well as prognostic and subtype classification may guide for improved and specific therapy. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. [Using (1)H-nuclear magnetic resonance metabolomics and gene ontology to establish pathological staging model for esophageal cancer patients].

    Science.gov (United States)

    Chen, X; Wang, K; Chen, W; Jiang, H; Deng, P C; Li, Z J; Peng, J; Zhou, Z Y; Yang, H; Huang, G X; Zeng, J

    2016-07-01

    (ethanol amine, hydroxy-propionic acid, homocysteine and estriol) were eventually selected. gene ontology analysis showed that 54 enzymes and genes regulated the 4 key metabolic markers. The quantitative prediction model of esophageal cancer staging based on esophageal cancer NMR spectrum were established. Cross-validation results showed that the predicted effect was good (root mean square error=5.3, R(2)=0.47, P=0.036). The systems biology approaches based on metabolomics and enzyme-gene regulatory network analysis can be used to quantify the metabolic network disturbance of patients with advanced esophageal cancer, and to predict preoperative clinical staging of esophageal cancer patients by plasma NMR metabolomics.

  2. Development of an Ontology for Periodontitis.

    Science.gov (United States)

    Suzuki, Asami; Takai-Igarashi, Takako; Nakaya, Jun; Tanaka, Hiroshi

    2015-01-01

    In the clinical dentists and periodontal researchers' community, there is an obvious demand for a systems model capable of linking the clinical presentation of periodontitis to underlying molecular knowledge. A computer-readable representation of processes on disease development will give periodontal researchers opportunities to elucidate pathways and mechanisms of periodontitis. An ontology for periodontitis can be a model for integration of large variety of factors relating to a complex disease such as chronic inflammation in different organs accompanied by bone remodeling and immune system disorders, which has recently been referred to as osteoimmunology. Terms characteristic of descriptions related to the onset and progression of periodontitis were manually extracted from 194 review articles and PubMed abstracts by experts in periodontology. We specified all the relations between the extracted terms and constructed them into an ontology for periodontitis. We also investigated matching between classes of our ontology and that of Gene Ontology Biological Process. We developed an ontology for periodontitis called Periodontitis-Ontology (PeriO). The pathological progression of periodontitis is caused by complex, multi-factor interrelationships. PeriO consists of all the required concepts to represent the pathological progression and clinical treatment of periodontitis. The pathological processes were formalized with reference to Basic Formal Ontology and Relation Ontology, which accounts for participants in the processes realized by biological objects such as molecules and cells. We investigated the peculiarity of biological processes observed in pathological progression and medical treatments for the disease in comparison with Gene Ontology Biological Process (GO-BP) annotations. The results indicated that peculiarities of Perio existed in 1) granularity and context dependency of both the conceptualizations, and 2) causality intrinsic to the pathological processes

  3. Integrative analysis of RUNX1 downstream pathways and target genes

    Directory of Open Access Journals (Sweden)

    Liu Marjorie

    2008-07-01

    Full Text Available Abstract Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML. The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1 cell lines with RUNX1 mutations from FPD-AML patients, 2 over-expression of RUNX1 and CBFβ, and 3 Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease

  4. Signal Transduction Pathways that Regulate CAB Gene Expression

    Energy Technology Data Exchange (ETDEWEB)

    Chory, Joanne

    2004-12-31

    The process of chloroplast differentiation, involves the coordinate regulation of many nuclear and chloroplast genes. The cues for the initiation of this developmental program are both extrinsic (e.g., light) and intrinsic (cell-type and plastid signals). During this project period, we utilized a molecular genetic approach to select for Arabidopsis mutants that did not respond properly to environmental light conditions, as well as mutants that were unable to perceive plastid damage. These latter mutants, called gun mutants, define two retrograde signaling pathways that regulate nuclear gene expression in response to chloroplasts. A major finding was to identify a signal from chloroplasts that regulates nuclear gene transcription. This signal is the build-up of Mg-Protoporphyrin IX, a key intermediate of the chlorophyll biosynthetic pathway. The signaling pathways downstream of this signal are currently being studied. Completion of this project has provided an increased understanding of the input signals and retrograde signaling pathways that control nuclear gene expression in response to the functional state of chloroplasts. These studies should ultimately influence our abilities to manipulate plant growth and development, and will aid in the understanding of the developmental control of photosynthesis.

  5. Signal Transduction Pathways that Regulate CAB Gene Expression

    Energy Technology Data Exchange (ETDEWEB)

    Chory, Joanne

    2006-01-16

    The process of chloroplast differentiation, involves the coordinate regulation of many nuclear and chloroplast genes. The cues for the initiation of this developmental program are both extrinsic (e.g., light) and intrinsic (cell-type and plastid signals). During this project period, we utilized a molecular genetic approach to select for Arabidopsis mutants that did not respond properly to environmental light conditions, as well as mutants that were unable to perceive plastid damage. These latter mutants, called gun mutants, define two retrograde signaling pathways that regulate nuclear gene expression in response to chloroplasts. A major finding was to identify a signal from chloroplasts that regulates nuclear gene transcription. This signal is the build-up of Mg-Protoporphyrin IX, a key intermediate of the chlorophyll biosynthetic pathway. The signaling pathways downstream of this signal are currently being studied. Completion of this project has provided an increased understanding of the input signals and retrograde signaling pathways that control nuclear gene expression in response to the functional state of chloroplasts. These studies should ultimately influence our abilities to manipulate plant growth and development, and will aid in the understanding of the developmental control of photosynthesis.

  6. SUGOI: automated ontology interchangeability

    CSIR Research Space (South Africa)

    Khan, ZC

    2015-04-01

    Full Text Available A foundational ontology can solve interoperability issues among the domain ontologies aligned to it. However, several foundational ontologies have been developed, hence such interoperability issues exist among domain ontologies. The novel SUGOI tool...

  7. Inferring ontology graph structures using OWL reasoning

    KAUST Repository

    Rodriguez-Garcia, Miguel Angel

    2018-01-05

    Ontologies are representations of a conceptualization of a domain. Traditionally, ontologies in biology were represented as directed acyclic graphs (DAG) which represent the backbone taxonomy and additional relations between classes. These graphs are widely exploited for data analysis in the form of ontology enrichment or computation of semantic similarity. More recently, ontologies are developed in a formal language such as the Web Ontology Language (OWL) and consist of a set of axioms through which classes are defined or constrained. While the taxonomy of an ontology can be inferred directly from the axioms of an ontology as one of the standard OWL reasoning tasks, creating general graph structures from OWL ontologies that exploit the ontologies\\' semantic content remains a challenge.We developed a method to transform ontologies into graphs using an automated reasoner while taking into account all relations between classes. Searching for (existential) patterns in the deductive closure of ontologies, we can identify relations between classes that are implied but not asserted and generate graph structures that encode for a large part of the ontologies\\' semantic content. We demonstrate the advantages of our method by applying it to inference of protein-protein interactions through semantic similarity over the Gene Ontology and demonstrate that performance is increased when graph structures are inferred using deductive inference according to our method. Our software and experiment results are available at http://github.com/bio-ontology-research-group/Onto2Graph .Onto2Graph is a method to generate graph structures from OWL ontologies using automated reasoning. The resulting graphs can be used for improved ontology visualization and ontology-based data analysis.

  8. Inferring ontology graph structures using OWL reasoning.

    Science.gov (United States)

    Rodríguez-García, Miguel Ángel; Hoehndorf, Robert

    2018-01-05

    Ontologies are representations of a conceptualization of a domain. Traditionally, ontologies in biology were represented as directed acyclic graphs (DAG) which represent the backbone taxonomy and additional relations between classes. These graphs are widely exploited for data analysis in the form of ontology enrichment or computation of semantic similarity. More recently, ontologies are developed in a formal language such as the Web Ontology Language (OWL) and consist of a set of axioms through which classes are defined or constrained. While the taxonomy of an ontology can be inferred directly from the axioms of an ontology as one of the standard OWL reasoning tasks, creating general graph structures from OWL ontologies that exploit the ontologies' semantic content remains a challenge. We developed a method to transform ontologies into graphs using an automated reasoner while taking into account all relations between classes. Searching for (existential) patterns in the deductive closure of ontologies, we can identify relations between classes that are implied but not asserted and generate graph structures that encode for a large part of the ontologies' semantic content. We demonstrate the advantages of our method by applying it to inference of protein-protein interactions through semantic similarity over the Gene Ontology and demonstrate that performance is increased when graph structures are inferred using deductive inference according to our method. Our software and experiment results are available at http://github.com/bio-ontology-research-group/Onto2Graph . Onto2Graph is a method to generate graph structures from OWL ontologies using automated reasoning. The resulting graphs can be used for improved ontology visualization and ontology-based data analysis.

  9. Mutations in THAP1/DYT6 reveal that diverse dystonia genes disrupt similar neuronal pathways and functions.

    Directory of Open Access Journals (Sweden)

    Zuchra Zakirova

    2018-01-01

    Full Text Available Dystonia is characterized by involuntary muscle contractions. Its many forms are genetically, phenotypically and etiologically diverse and it is unknown whether their pathogenesis converges on shared pathways. Mutations in THAP1 [THAP (Thanatos-associated protein domain containing, apoptosis associated protein 1], a ubiquitously expressed transcription factor with DNA binding and protein-interaction domains, cause dystonia, DYT6. There is a unique, neuronal 50-kDa Thap1-like immunoreactive species, and Thap1 levels are auto-regulated on the mRNA level. However, THAP1 downstream targets in neurons, and the mechanism via which it causes dystonia are largely unknown. We used RNA-Seq to assay the in vivo effect of a heterozygote Thap1 C54Y or ΔExon2 allele on the gene transcription signatures in neonatal mouse striatum and cerebellum. Enriched pathways and gene ontology terms include eIF2α Signaling, Mitochondrial Dysfunction, Neuron Projection Development, Axonal Guidance Signaling, and Synaptic LongTerm Depression, which are dysregulated in a genotype and tissue-dependent manner. Electrophysiological and neurite outgrowth assays were consistent with those enrichments, and the plasticity defects were partially corrected by salubrinal. Notably, several of these pathways were recently implicated in other forms of inherited dystonia, including DYT1. We conclude that dysfunction of these pathways may represent a point of convergence in the pathophysiology of several forms of inherited dystonia.

  10. De novo characterization of the spleen transcriptome of the large yellow croaker (Pseudosciaena crocea) and analysis of the immune relevant genes and pathways involved in the antiviral response

    KAUST Repository

    Mu, Yinnan

    2014-05-12

    The large yellow croaker (Pseudosciaena crocea) is an economically important marine fish in China. To understand the molecular basis for antiviral defense in this species, we used Illumia paired-end sequencing to characterize the spleen transcriptome of polyriboinosinic:polyribocytidylic acid [poly(I:C)]-induced large yellow croakers. The library produced 56,355,728 reads and assembled into 108,237 contigs. As a result, 15,192 unigenes were found from this transcriptome. Gene ontology analysis showed that 4,759 genes were involved in three major functional categories: biological process, cellular component, and molecular function. We further ascertained that numerous consensus sequences were homologous to known immune-relevant genes. Kyoto Encyclopedia of Genes and Genomes orthology mapping annotated 5,389 unigenes and identified numerous immune-relevant pathways. These immune-relevant genes and pathways revealed major antiviral immunity effectors, including but not limited to: pattern recognition receptors, adaptors and signal transducers, the interferons and interferon-stimulated genes, inflammatory cytokines and receptors, complement components, and B-cell and T-cell antigen activation molecules. Moreover, the partial genes of Toll-like receptor signaling pathway, RIG-I-like receptors signaling pathway, Janus kinase-Signal Transducer and Activator of Transcription (JAK-STAT) signaling pathway, and T-cell receptor (TCR) signaling pathway were found to be changed after poly(I:C) induction by real-time polymerase chain reaction (PCR) analysis, suggesting that these signaling pathways may be regulated by poly(I:C), a viral mimic. Overall, the antivirus-related genes and signaling pathways that were identified in response to poly(I:C) challenge provide valuable leads for further investigation of the antiviral defense mechanism in the large yellow croaker. © 2014 Mu et al.

  11. De novo characterization of the spleen transcriptome of the large yellow croaker (Pseudosciaena crocea and analysis of the immune relevant genes and pathways involved in the antiviral response.

    Directory of Open Access Journals (Sweden)

    Yinnan Mu

    Full Text Available The large yellow croaker (Pseudosciaena crocea is an economically important marine fish in China. To understand the molecular basis for antiviral defense in this species, we used Illumia paired-end sequencing to characterize the spleen transcriptome of polyriboinosinic:polyribocytidylic acid [poly(I:C]-induced large yellow croakers. The library produced 56,355,728 reads and assembled into 108,237 contigs. As a result, 15,192 unigenes were found from this transcriptome. Gene ontology analysis showed that 4,759 genes were involved in three major functional categories: biological process, cellular component, and molecular function. We further ascertained that numerous consensus sequences were homologous to known immune-relevant genes. Kyoto Encyclopedia of Genes and Genomes orthology mapping annotated 5,389 unigenes and identified numerous immune-relevant pathways. These immune-relevant genes and pathways revealed major antiviral immunity effectors, including but not limited to: pattern recognition receptors, adaptors and signal transducers, the interferons and interferon-stimulated genes, inflammatory cytokines and receptors, complement components, and B-cell and T-cell antigen activation molecules. Moreover, the partial genes of Toll-like receptor signaling pathway, RIG-I-like receptors signaling pathway, Janus kinase-Signal Transducer and Activator of Transcription (JAK-STAT signaling pathway, and T-cell receptor (TCR signaling pathway were found to be changed after poly(I:C induction by real-time polymerase chain reaction (PCR analysis, suggesting that these signaling pathways may be regulated by poly(I:C, a viral mimic. Overall, the antivirus-related genes and signaling pathways that were identified in response to poly(I:C challenge provide valuable leads for further investigation of the antiviral defense mechanism in the large yellow croaker.

  12. De novo characterization of the spleen transcriptome of the large yellow croaker (Pseudosciaena crocea) and analysis of the immune relevant genes and pathways involved in the antiviral response

    KAUST Repository

    Mu, Yinnan; Li, Mingyu; Ding, Feng; Ding, Yang; Ao, Jingqun; Hu, Songnian; Chen, Xinhua

    2014-01-01

    The large yellow croaker (Pseudosciaena crocea) is an economically important marine fish in China. To understand the molecular basis for antiviral defense in this species, we used Illumia paired-end sequencing to characterize the spleen transcriptome of polyriboinosinic:polyribocytidylic acid [poly(I:C)]-induced large yellow croakers. The library produced 56,355,728 reads and assembled into 108,237 contigs. As a result, 15,192 unigenes were found from this transcriptome. Gene ontology analysis showed that 4,759 genes were involved in three major functional categories: biological process, cellular component, and molecular function. We further ascertained that numerous consensus sequences were homologous to known immune-relevant genes. Kyoto Encyclopedia of Genes and Genomes orthology mapping annotated 5,389 unigenes and identified numerous immune-relevant pathways. These immune-relevant genes and pathways revealed major antiviral immunity effectors, including but not limited to: pattern recognition receptors, adaptors and signal transducers, the interferons and interferon-stimulated genes, inflammatory cytokines and receptors, complement components, and B-cell and T-cell antigen activation molecules. Moreover, the partial genes of Toll-like receptor signaling pathway, RIG-I-like receptors signaling pathway, Janus kinase-Signal Transducer and Activator of Transcription (JAK-STAT) signaling pathway, and T-cell receptor (TCR) signaling pathway were found to be changed after poly(I:C) induction by real-time polymerase chain reaction (PCR) analysis, suggesting that these signaling pathways may be regulated by poly(I:C), a viral mimic. Overall, the antivirus-related genes and signaling pathways that were identified in response to poly(I:C) challenge provide valuable leads for further investigation of the antiviral defense mechanism in the large yellow croaker. © 2014 Mu et al.

  13. Pathways-driven sparse regression identifies pathways and genes associated with high-density lipoprotein cholesterol in two Asian cohorts.

    Directory of Open Access Journals (Sweden)

    Matt Silver

    2013-11-01

    Full Text Available Standard approaches to data analysis in genome-wide association studies (GWAS ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK

  14. Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

    Science.gov (United States)

    Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

    2013-01-01

    Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune

  15. Vitamin D metabolic pathway genes and pancreatic cancer risk.

    Directory of Open Access Journals (Sweden)

    Hannah Arem

    Full Text Available Evidence on the association between vitamin D status and pancreatic cancer risk is inconsistent. This inconsistency may be partially attributable to variation in vitamin D regulating genes. We selected 11 vitamin D-related genes (GC, DHCR7, CYP2R1, VDR, CYP27B1, CYP24A1, CYP27A1, RXRA, CRP2, CASR and CUBN totaling 213 single nucleotide polymorphisms (SNPs, and examined associations with pancreatic adenocarcinoma. Our study included 3,583 pancreatic cancer cases and 7,053 controls from the genome-wide association studies of pancreatic cancer PanScans-I-III. We used the Adaptive Joint Test and the Adaptive Rank Truncated Product statistic for pathway and gene analyses, and unconditional logistic regression for SNP analyses, adjusting for age, sex, study and population stratification. We examined effect modification by circulating vitamin D concentration (≤50, >50 nmol/L for the most significant SNPs using a subset of cohort cases (n = 713 and controls (n = 878. The vitamin D metabolic pathway was not associated with pancreatic cancer risk (p = 0.830. Of the individual genes, none were associated with pancreatic cancer risk at a significance level of p<0.05. SNPs near the VDR (rs2239186, LRP2 (rs4668123, CYP24A1 (rs2762932, GC (rs2282679, and CUBN (rs1810205 genes were the top SNPs associated with pancreatic cancer (p-values 0.008-0.037, but none were statistically significant after adjusting for multiple comparisons. Associations between these SNPs and pancreatic cancer were not modified by circulating concentrations of vitamin D. These findings do not support an association between vitamin D-related genes and pancreatic cancer risk. Future research should explore other pathways through which vitamin D status might be associated with pancreatic cancer risk.

  16. Building a developmental toxicity ontology.

    Science.gov (United States)

    Baker, Nancy; Boobis, Alan; Burgoon, Lyle; Carney, Edward; Currie, Richard; Fritsche, Ellen; Knudsen, Thomas; Laffont, Madeleine; Piersma, Aldert H; Poole, Alan; Schneider, Steffen; Daston, George

    2018-04-03

    As more information is generated about modes of action for developmental toxicity and more data are generated using high-throughput and high-content technologies, it is becoming necessary to organize that information. This report discussed the need for a systematic representation of knowledge about developmental toxicity (i.e., an ontology) and proposes a method to build one based on knowledge of developmental biology and mode of action/ adverse outcome pathways in developmental toxicity. This report is the result of a consensus working group developing a plan to create an ontology for developmental toxicity that spans multiple levels of biological organization. This report provide a description of some of the challenges in building a developmental toxicity ontology and outlines a proposed methodology to meet those challenges. As the ontology is built on currently available web-based resources, a review of these resources is provided. Case studies on one of the most well-understood morphogens and developmental toxicants, retinoic acid, are presented as examples of how such an ontology might be developed. This report outlines an approach to construct a developmental toxicity ontology. Such an ontology will facilitate computer-based prediction of substances likely to induce human developmental toxicity. © 2018 Wiley Periodicals, Inc.

  17. Temporal network based analysis of cell specific vein graft transcriptome defines key pathways and hub genes in implantation injury.

    Directory of Open Access Journals (Sweden)

    Manoj Bhasin

    Full Text Available Vein graft failure occurs between 1 and 6 months after implantation due to obstructive intimal hyperplasia, related in part to implantation injury. The cell-specific and temporal response of the transcriptome to vein graft implantation injury was determined by transcriptional profiling of laser capture microdissected endothelial cells (EC and medial smooth muscle cells (SMC from canine vein grafts, 2 hours (H to 30 days (D following surgery. Our results demonstrate a robust genomic response beginning at 2 H, peaking at 12-24 H, declining by 7 D, and resolving by 30 D. Gene ontology and pathway analyses of differentially expressed genes indicated that implantation injury affects inflammatory and immune responses, apoptosis, mitosis, and extracellular matrix reorganization in both cell types. Through backpropagation an integrated network was built, starting with genes differentially expressed at 30 D, followed by adding upstream interactive genes from each prior time-point. This identified significant enrichment of IL-6, IL-8, NF-κB, dendritic cell maturation, glucocorticoid receptor, and Triggering Receptor Expressed on Myeloid Cells (TREM-1 signaling, as well as PPARα activation pathways in graft EC and SMC. Interactive network-based analyses identified IL-6, IL-8, IL-1α, and Insulin Receptor (INSR as focus hub genes within these pathways. Real-time PCR was used for the validation of two of these genes: IL-6 and IL-8, in addition to Collagen 11A1 (COL11A1, a cornerstone of the backpropagation. In conclusion, these results establish causality relationships clarifying the pathogenesis of vein graft implantation injury, and identifying novel targets for its prevention.

  18. An integrated analysis of genes and pathways exhibiting metabolic differences between estrogen receptor positive breast cancer cells

    International Nuclear Information System (INIS)

    Mandal, Soma; Davie, James R

    2007-01-01

    The sex hormone estrogen (E2) is pivotal to normal mammary gland growth and differentiation and in breast carcinogenesis. In this in silico study, we examined metabolic differences between ER(+)ve breast cancer cells during E2 deprivation. Public repositories of SAGE and MA gene expression data generated from E2 deprived ER(+)ve breast cancer cell lines, MCF-7 and ZR75-1 were compared with normal breast tissue. We analyzed gene ontology (GO), enrichment, clustering, chromosome localization, and pathway profiles and performed multiple comparisons with cell lines and tumors with different ER status. In all GO terms, biological process (BP), molecular function (MF), and cellular component (CC), MCF-7 had higher gene utilization than ZR75-1. Various analyses showed a down-regulated immune function, an up-regulated protein (ZR75-1) and glucose metabolism (MCF-7). A greater percentage of 77 common genes localized to the q arm of all chromosomes, but in ZR75-1 chromosomes 11, 16, and 19 harbored more overexpressed genes. Despite differences in gene utilization (electron transport, proteasome, glycolysis/gluconeogenesis) and expression (ribosome) in both cells, there was an overall similarity of ZR75-1 with ER(-)ve cell lines and ER(+)ve/ER(-)ve breast tumors. This study demonstrates integral metabolic differences may exist within the same cell subtype (luminal A) in representative ER(+)ve cell line models. Selectivity of gene and pathway usage for strategies such as energy requirement minimization, sugar utilization by ZR75-1 contrasted with MCF-7 cells, expressing genes whose protein products require ATP utilization. Such characteristics may impart aggressiveness to ZR75-1 and may be prognostic determinants of ER(+)ve breast tumors

  19. Ontology evolution in physics

    OpenAIRE

    Chan, Michael

    2013-01-01

    With the advent of reasoning problems in dynamic environments, there is an increasing need for automated reasoning systems to automatically adapt to unexpected changes in representations. In particular, the automation of the evolution of their ontologies needs to be enhanced without substantially sacrificing expressivity in the underlying representation. Revision of beliefs is not enough, as adding to or removing from beliefs does not change the underlying formal language. Gene...

  20. Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data.

    Science.gov (United States)

    Lee, Hyeonjeong; Shin, Miyoung

    2017-01-01

    The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into

  1. Building ontologies with basic formal ontology

    CERN Document Server

    Arp, Robert; Spear, Andrew D.

    2015-01-01

    In the era of "big data," science is increasingly information driven, and the potential for computers to store, manage, and integrate massive amounts of data has given rise to such new disciplinary fields as biomedical informatics. Applied ontology offers a strategy for the organization of scientific information in computer-tractable form, drawing on concepts not only from computer and information science but also from linguistics, logic, and philosophy. This book provides an introduction to the field of applied ontology that is of particular relevance to biomedicine, covering theoretical components of ontologies, best practices for ontology design, and examples of biomedical ontologies in use. After defining an ontology as a representation of the types of entities in a given domain, the book distinguishes between different kinds of ontologies and taxonomies, and shows how applied ontology draws on more traditional ideas from metaphysics. It presents the core features of the Basic Formal Ontology (BFO), now u...

  2. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Energy Technology Data Exchange (ETDEWEB)

    Shi, CY; Yang, H; Wei, CL; Yu, O; Zhang, ZZ; Sun, J; Wan, XC

    2011-01-01

    Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Using high-throughput Illumina RNA-seq, the transcriptome from poly (A){sup +} RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real

  3. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Directory of Open Access Journals (Sweden)

    Chen Qi

    2011-02-01

    Full Text Available Abstract Background Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Results Using high-throughput Illumina RNA-seq, the transcriptome from poly (A+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs. Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010. Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were

  4. Gene pathways that delay Caenorhabditis elegans reproductive senescence.

    Directory of Open Access Journals (Sweden)

    Meng C Wang

    2014-12-01

    Full Text Available Reproductive senescence is a hallmark of aging. The molecular mechanisms regulating reproductive senescence and its association with the aging of somatic cells remain poorly understood. From a full genome RNA interference (RNAi screen, we identified 32 Caenorhabditis elegans gene inactivations that delay reproductive senescence and extend reproductive lifespan. We found that many of these gene inactivations interact with insulin/IGF-1 and/or TGF-β endocrine signaling pathways to regulate reproductive senescence, except nhx-2 and sgk-1 that modulate sodium reabsorption. Of these 32 gene inactivations, we also found that 19 increase reproductive lifespan through their effects on oocyte activities, 8 of them coordinate oocyte and sperm functions to extend reproductive lifespan, and 5 of them can induce sperm humoral response to promote reproductive longevity. Furthermore, we examined the effects of these reproductive aging regulators on somatic aging. We found that 5 of these gene inactivations prolong organismal lifespan, and 20 of them increase healthy life expectancy of an organism without altering total life span. These studies provide a systemic view on the genetic regulation of reproductive senescence and its intersection with organism longevity. The majority of these newly identified genes are conserved, and may provide new insights into age-associated reproductive senescence during human aging.

  5. GOASVM: a subcellular location predictor by incorporating term-frequency gene ontology into the general form of Chou's pseudo-amino acid composition.

    Science.gov (United States)

    Wan, Shibiao; Mak, Man-Wai; Kung, Sun-Yuan

    2013-04-21

    Prediction of protein subcellular localization is an important yet challenging problem. Recently, several computational methods based on Gene Ontology (GO) have been proposed to tackle this problem and have demonstrated superiority over methods based on other features. Existing GO-based methods, however, do not fully use the GO information. This paper proposes an efficient GO method called GOASVM that exploits the information from the GO term frequencies and distant homologs to represent a protein in the general form of Chou's pseudo-amino acid composition. The method first selects a subset of relevant GO terms to form a GO vector space. Then for each protein, the method uses the accession number (AC) of the protein or the ACs of its homologs to find the number of occurrences of the selected GO terms in the Gene Ontology annotation (GOA) database as a means to construct GO vectors for support vector machines (SVMs) classification. With the advantages of GO term frequencies and a new strategy to incorporate useful homologous information, GOASVM can achieve a prediction accuracy of 72.2% on a new independent test set comprising novel proteins that were added to Swiss-Prot six years later than the creation date of the training set. GOASVM and Supplementary materials are available online at http://bioinfo.eie.polyu.edu.hk/mGoaSvmServer/GOASVM.html. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  7. Ontology authoring with Forza

    CSIR Research Space (South Africa)

    Keet, CM

    2014-11-01

    Full Text Available Generic, reusable ontology elements, such as a foundational ontology's categories and part-whole relations, are essential for good and interoperable knowledge representation. Ontology developers, which include domain experts and novices, face...

  8. Ontological Surprises

    DEFF Research Database (Denmark)

    Leahu, Lucian

    2016-01-01

    a hybrid approach where machine learning algorithms are used to identify objects as well as connections between them; finally, it argues for remaining open to ontological surprises in machine learning as they may enable the crafting of different relations with and through technologies.......This paper investigates how we might rethink design as the technological crafting of human-machine relations in the context of a machine learning technique called neural networks. It analyzes Google’s Inceptionism project, which uses neural networks for image recognition. The surprising output...

  9. De Novo Transcriptomic Analysis of an Oleaginous Microalga: Pathway Description and Gene Discovery for Production of Next-Generation Biofuels

    Science.gov (United States)

    Wan, LingLin; Han, Juan; Sang, Min; Li, AiFen; Wu, Hong; Yin, ShunJi; Zhang, ChengWu

    2012-01-01

    Background Eustigmatos cf. polyphem is a yellow-green unicellular soil microalga belonging to the eustimatophyte with high biomass and considerable production of triacylglycerols (TAGs) for biofuels, which is thus referred to as an oleaginous microalga. The paucity of microalgae genome sequences, however, limits development of gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for a non-model microalgae species, E. cf. polyphem, and identify pathways and genes of importance related to biofuel production. Results We performed the de novo assembly of E. cf. polyphem transcriptome using Illumina paired-end sequencing technology. In a single run, we produced 29,199,432 sequencing reads corresponding to 2.33 Gb total nucleotides. These reads were assembled into 75,632 unigenes with a mean size of 503 bp and an N50 of 663 bp, ranging from 100 bp to >3,000 bp. Assembled unigenes were subjected to BLAST similarity searches and annotated with Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology identifiers. These analyses identified the majority of carbohydrate, fatty acids, TAG and carotenoids biosynthesis and catabolism pathways in E. cf. polyphem. Conclusions Our data provides the construction of metabolic pathways involved in the biosynthesis and catabolism of carbohydrate, fatty acids, TAG and carotenoids in E. cf. polyphem and provides a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock. PMID:22536352

  10. De novo transcriptomic analysis of an oleaginous microalga: pathway description and gene discovery for production of next-generation biofuels.

    Directory of Open Access Journals (Sweden)

    LingLin Wan

    Full Text Available Eustigmatos cf. polyphem is a yellow-green unicellular soil microalga belonging to the eustimatophyte with high biomass and considerable production of triacylglycerols (TAGs for biofuels, which is thus referred to as an oleaginous microalga. The paucity of microalgae genome sequences, however, limits development of gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for a non-model microalgae species, E. cf. polyphem, and identify pathways and genes of importance related to biofuel production.We performed the de novo assembly of E. cf. polyphem transcriptome using Illumina paired-end sequencing technology. In a single run, we produced 29,199,432 sequencing reads corresponding to 2.33 Gb total nucleotides. These reads were assembled into 75,632 unigenes with a mean size of 503 bp and an N50 of 663 bp, ranging from 100 bp to >3,000 bp. Assembled unigenes were subjected to BLAST similarity searches and annotated with Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG orthology identifiers. These analyses identified the majority of carbohydrate, fatty acids, TAG and carotenoids biosynthesis and catabolism pathways in E. cf. polyphem.Our data provides the construction of metabolic pathways involved in the biosynthesis and catabolism of carbohydrate, fatty acids, TAG and carotenoids in E. cf. polyphem and provides a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock.

  11. MicroRNA-gene signaling pathways in pancreatic cancer

    Directory of Open Access Journals (Sweden)

    Alexandra Drakaki

    2013-10-01

    Full Text Available Pancreatic cancer is the fourth most frequent cause of cancer-related deaths and is characterized by early metastasis and pronounced resistance to chemotherapy and radiation therapy. Despite extensive esearch efforts, there is not any substantial progress regarding the identification of novel drugs against pancreatic cancer. Although the introduction of the chemotherapeutic agent gemcitabine improved clinical response, the prognosis of these patients remained extremely poor with a 5-year survival rate of 3-5%. Thus, the identification of the novel molecular pathways involved in pancreatic oncogenesis and the development of new and potent therapeutic options are highly desirable. Here, we describe how microRNAs control signaling pathways that are frequently deregulated during pancreatic oncogenesis. In addition, we provide evidence that microRNAs could be potentially used as novel pancreatic cancer therapeutics through reversal of chemotherapy and radiotherapy resistance or regulation of essential molecular pathways. Further studies should integrate the deregulated genes and microRNAs into molecular networks in order to identify the central regulators of pancreatic oncogenesis. Targeting these central regulators could lead to the development of novel targeted therapeutic approaches for pancreatic cancer patients.

  12. Anatomy Ontology Matching Using Markov Logic Networks

    Directory of Open Access Journals (Sweden)

    Chunhua Li

    2016-01-01

    Full Text Available The anatomy of model species is described in ontologies, which are used to standardize the annotations of experimental data, such as gene expression patterns. To compare such data between species, we need to establish relationships between ontologies describing different species. Ontology matching is a kind of solutions to find semantic correspondences between entities of different ontologies. Markov logic networks which unify probabilistic graphical model and first-order logic provide an excellent framework for ontology matching. We combine several different matching strategies through first-order logic formulas according to the structure of anatomy ontologies. Experiments on the adult mouse anatomy and the human anatomy have demonstrated the effectiveness of proposed approach in terms of the quality of result alignment.

  13. An ontology approach to comparative phenomics in plants

    KAUST Repository

    Oellrich, Anika

    2015-02-25

    Background: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework. Results: We developed a workflow to curate and standardize existing phenotype datasets for six plant species, encompassing both model species and crop plants with established genetic resources. Our effort focused on mutant phenotypes associated with genes of known sequence in Arabidopsis thaliana (L.) Heynh. (Arabidopsis), Zea mays L. subsp. mays (maize), Medicago truncatula Gaertn. (barrel medic or Medicago), Oryza sativa L. (rice), Glycine max (L.) Merr. (soybean), and Solanum lycopersicum L. (tomato). We applied the same ontologies, annotation standards, formats, and best practices across all six species, thereby ensuring that the shared dataset could be used for cross-species querying and semantic similarity analyses. Curated phenotypes were first converted into a common format using taxonomically broad ontologies such as the Plant Ontology, Gene Ontology, and Phenotype and Trait Ontology. We then compared ontology-based phenotypic descriptions with an existing classification system for plant phenotypes and evaluated our semantic similarity dataset for its ability to enhance predictions of gene families, protein functions, and shared metabolic pathways that underlie informative plant phenotypes. Conclusions: The use of ontologies, annotation standards, shared formats, and best practices for cross-taxon phenotype data analyses represents a novel approach to plant phenomics

  14. An ontology approach to comparative phenomics in plants

    KAUST Repository

    Oellrich, Anika; Walls, Ramona L; Cannon, Ethalinda KS; Cannon, Steven B; Cooper, Laurel; Gardiner, Jack; Gkoutos, Georgios V; Harper, Lisa; He, Mingze; Hoehndorf, Robert; Jaiswal, Pankaj; Kalberer, Scott R; Lloyd, John P; Meinke, David; Menda, Naama; Moore, Laura; Nelson, Rex T; Pujar, Anuradha; Lawrence, Carolyn J; Huala, Eva

    2015-01-01

    Background: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework. Results: We developed a workflow to curate and standardize existing phenotype datasets for six plant species, encompassing both model species and crop plants with established genetic resources. Our effort focused on mutant phenotypes associated with genes of known sequence in Arabidopsis thaliana (L.) Heynh. (Arabidopsis), Zea mays L. subsp. mays (maize), Medicago truncatula Gaertn. (barrel medic or Medicago), Oryza sativa L. (rice), Glycine max (L.) Merr. (soybean), and Solanum lycopersicum L. (tomato). We applied the same ontologies, annotation standards, formats, and best practices across all six species, thereby ensuring that the shared dataset could be used for cross-species querying and semantic similarity analyses. Curated phenotypes were first converted into a common format using taxonomically broad ontologies such as the Plant Ontology, Gene Ontology, and Phenotype and Trait Ontology. We then compared ontology-based phenotypic descriptions with an existing classification system for plant phenotypes and evaluated our semantic similarity dataset for its ability to enhance predictions of gene families, protein functions, and shared metabolic pathways that underlie informative plant phenotypes. Conclusions: The use of ontologies, annotation standards, shared formats, and best practices for cross-taxon phenotype data analyses represents a novel approach to plant phenomics

  15. InfAcrOnt: calculating cross-ontology term similarities using information flow by a random walk

    OpenAIRE

    Cheng, Liang; Jiang, Yue; Ju, Hong; Sun, Jie; Peng, Jiajie; Zhou, Meng; Hu, Yang

    2018-01-01

    Background Since the establishment of the first biomedical ontology Gene Ontology (GO), the number of biomedical ontology has increased dramatically. Nowadays over 300 ontologies have been built including extensively used Disease Ontology (DO) and Human Phenotype Ontology (HPO). Because of the advantage of identifying novel relationships between terms, calculating similarity between ontology terms is one of the major tasks in this research area. Though similarities between terms within each o...

  16. Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

    Science.gov (United States)

    Ryan, D.

    2016-02-01

    The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large ( 1 × 1011 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.

  17. Using Gene Ontology to describe the role of the neurexin-neuroligin-SHANK complex in human, mouse and rat and its relevance to autism.

    Science.gov (United States)

    Patel, Sejal; Roncaglia, Paola; Lovering, Ruth C

    2015-06-06

    People with an autistic spectrum disorder (ASD) display a variety of characteristic behavioral traits, including impaired social interaction, communication difficulties and repetitive behavior. This complex neurodevelopment disorder is known to be associated with a combination of genetic and environmental factors. Neurexins and neuroligins play a key role in synaptogenesis and neurexin-neuroligin adhesion is one of several processes that have been implicated in autism spectrum disorders. In this report we describe the manual annotation of a selection of gene products known to be associated with autism and/or the neurexin-neuroligin-SHANK complex and demonstrate how a focused annotation approach leads to the creation of more descriptive Gene Ontology (GO) terms, as well as an increase in both the number of gene product annotations and their granularity, thus improving the data available in the GO database. The manual annotations we describe will impact on the functional analysis of a variety of future autism-relevant datasets. Comprehensive gene annotation is an essential aspect of genomic and proteomic studies, as the quality of gene annotations incorporated into statistical analysis tools affects the effective interpretation of data obtained through genome wide association studies, next generation sequencing, proteomic and transcriptomic datasets.

  18. Didactical Ontologies

    Directory of Open Access Journals (Sweden)

    Steffen Mencke, Reiner Dumke

    2008-03-01

    Full Text Available Ontologies are a fundamental concept of theSemantic Web envisioned by Tim Berners-Lee [1]. Togetherwith explicit representation of the semantics of data formachine-accessibility such domain theories are the basis forintelligent next generation applications for the web andother areas of interest [2]. Their application for specialaspects within the domain of e-learning is often proposed tosupport the increasing complexity ([3], [4], [5], [6]. So theycan provide a better support for course generation orlearning scenario description [7]. By the modeling ofdidactics-related expertise and their provision for thecreators of courses many improvements like reuse, rapiddevelopment and of course increased learning performancebecome possible due to the separation from other aspects ofe-learning platforms as already proposed in [8].

  19. Identification of key genes and pathways associated with neuropathic pain in uninjured dorsal root ganglion by using bioinformatic analysis

    Directory of Open Access Journals (Sweden)

    Chen CJ

    2017-11-01

    Full Text Available Chao-Jin Chen,* De-Zhao Liu,* Wei-Feng Yao, Yu Gu, Fei Huang, Zi-Qing Hei, Xiang Li Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, People’s Republic of China *These authors contributed equally to this work Purpose: Neuropathic pain is a complex chronic condition occurring post-nervous system damage. The transcriptional reprogramming of injured dorsal root ganglia (DRGs drives neuropathic pain. However, few comparative analyses using high-throughput platforms have investigated uninjured DRG in neuropathic pain, and potential interactions among differentially expressed genes (DEGs and pathways were not taken into consideration. The aim of this study was to identify changes in genes and pathways associated with neuropathic pain in uninjured L4 DRG after L5 spinal nerve ligation (SNL by using bioinformatic analysis.Materials and methods: The microarray profile GSE24982 was downloaded from the Gene Expression Omnibus database to identify DEGs between DRGs in SNL and sham rats. The prioritization for these DEGs was performed using the Toppgene database followed by gene ontology and pathway enrichment analyses. The relationships among DEGs from the protein interactive perspective were analyzed using protein–protein interaction (PPI network and module analysis. Real-time polymerase chain reaction (PCR and Western blotting were used to confirm the expression of DEGs in the rodent neuropathic pain model.Results: A total of 206 DEGs that might play a role in neuropathic pain were identified in L4 DRG, of which 75 were upregulated and 131 were downregulated. The upregulated DEGs were enriched in biological processes related to transcription regulation and molecular functions such as DNA binding, cell cycle, and the FoxO signaling pathway. Ctnnb1 protein had the highest connectivity degrees in the PPI network. The in vivo studies also validated that mRNA and protein levels of Ctnnb1 were

  20. Genes of the mitochondrial apoptotic pathway in Mytilus galloprovincialis.

    Directory of Open Access Journals (Sweden)

    Noelia Estévez-Calvar

    Full Text Available Bivalves play vital roles in marine, brackish, freshwater and terrestrial habitats. In recent years, these ecosystems have become affected through anthropogenic activities. The ecological success of marine bivalves is based on the ability to modify their physiological functions in response to environmental changes. One of the most important mechanisms involved in adaptive responses to environmental and biological stresses is apoptosis, which has been scarcely studied in mollusks, although the final consequence of this process, DNA fragmentation, has been frequently used for pollution monitoring. Environmental stressors induce apoptosis in molluscan cells via an intrinsic pathway. Many of the proteins involved in vertebrate apoptosis have been recognized in model invertebrates; however, this process might not be universally conserved. Mytilus galloprovincialis is presented here as a new model to study the linkage between molecular mechanisms that mediate apoptosis and marine bivalve ecological adaptations. Therefore, it is strictly necessary to identify the key elements involved in bivalve apoptosis. In the present study, six mitochondrial apoptotic-related genes were characterized, and their gene expression profiles following UV irradiation were evaluated. This is the first step for the development of potential biomarkers to assess the biological responses of marine organisms to stress. The results confirmed that apoptosis and, more specifically, the expression of the genes involved in this process can be used to assess the biological responses of marine organisms to stress.

  1. Markov Chain Ontology Analysis (MCOA).

    Science.gov (United States)

    Frost, H Robert; McCray, Alexa T

    2012-02-03

    Biomedical ontologies have become an increasingly critical lens through which researchers analyze the genomic, clinical and bibliographic data that fuels scientific research. Of particular relevance are methods, such as enrichment analysis, that quantify the importance of ontology classes relative to a collection of domain data. Current analytical techniques, however, remain limited in their ability to handle many important types of structural complexity encountered in real biological systems including class overlaps, continuously valued data, inter-instance relationships, non-hierarchical relationships between classes, semantic distance and sparse data. In this paper, we describe a methodology called Markov Chain Ontology Analysis (MCOA) and illustrate its use through a MCOA-based enrichment analysis application based on a generative model of gene activation. MCOA models the classes in an ontology, the instances from an associated dataset and all directional inter-class, class-to-instance and inter-instance relationships as a single finite ergodic Markov chain. The adjusted transition probability matrix for this Markov chain enables the calculation of eigenvector values that quantify the importance of each ontology class relative to other classes and the associated data set members. On both controlled Gene Ontology (GO) data sets created with Escherichia coli, Drosophila melanogaster and Homo sapiens annotations and real gene expression data extracted from the Gene Expression Omnibus (GEO), the MCOA enrichment analysis approach provides the best performance of comparable state-of-the-art methods. A methodology based on Markov chain models and network analytic metrics can help detect the relevant signal within large, highly interdependent and noisy data sets and, for applications such as enrichment analysis, has been shown to generate superior performance on both real and simulated data relative to existing state-of-the-art approaches.

  2. A search engine to identify pathway genes from expression data on multiple organisms

    Directory of Open Access Journals (Sweden)

    Zambon Alexander C

    2007-05-01

    Full Text Available Abstract Background The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulatation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results We developed a search engine, the Multiple-Species Gene Recommender (MSGR, which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.

  3. Gene-Gene Interactions in the Folate Metabolic Pathway and the Risk of Conotruncal Heart Defects

    Directory of Open Access Journals (Sweden)

    Philip J. Lupo

    2010-01-01

    Full Text Available Conotruncal and related heart defects (CTRD are common, complex malformations. Although there are few established risk factors, there is evidence that genetic variation in the folate metabolic pathway influences CTRD risk. This study was undertaken to assess the association between inherited (i.e., case and maternal gene-gene interactions in this pathway and the risk of CTRD. Case-parent triads (n=727, ascertained from the Children's Hospital of Philadelphia, were genotyped for ten functional variants of nine folate metabolic genes. Analyses of inherited genotypes were consistent with the previously reported association between MTHFR A1298C and CTRD (adjusted P=.02, but provided no evidence that CTRD was associated with inherited gene-gene interactions. Analyses of the maternal genotypes provided evidence of a MTHFR C677T/CBS 844ins68 interaction and CTRD risk (unadjusted P=.02. This association is consistent with the effects of this genotype combination on folate-homocysteine biochemistry but remains to be confirmed in independent study populations.

  4. MicroRNA-124-3p expression and its prospective functional pathways in hepatocellular carcinoma: A quantitative polymerase chain reaction, gene expression omnibus and bioinformatics study.

    Science.gov (United States)

    He, Rong-Quan; Yang, Xia; Liang, Liang; Chen, Gang; Ma, Jie

    2018-04-01

    The present study aimed to explore the potential clinical significance of microRNA (miR)-124-3p expression in the hepatocarcinogenesis and development of hepatocellular carcinoma (HCC), as well as the potential target genes of functional HCC pathways. Reverse transcription-quantitative polymerase chain reaction was performed to evaluate the expression of miR-124-3p in 101 HCC and adjacent non-cancerous tissue samples. Additionally, the association between miR-124-3p expression and clinical parameters was also analyzed. Differentially expressed genes identified following miR-124-3p transfection, the prospective target genes predicted in silico and the key genes of HCC obtained from Natural Language Processing (NLP) were integrated to obtain potential target genes of miR-124-3p in HCC. Relevant signaling pathways were assessed with protein-protein interaction (PPI) networks, Gene Ontology (GO) enrichment analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) and Protein Annotation Through Evolutionary Relationships (PANTHER) pathway enrichment analysis. miR-124-3p expression was significantly reduced in HCC tissues compared with expression in adjacent non-cancerous liver tissues. In HCC, miR-124-3p was demonstrated to be associated with clinical stage. The mean survival time of the low miR-124-3p expression group was reduced compared with that of the high expression group. A total of 132 genes overlapped from differentially expressed genes, miR-124-3p predicted target genes and NLP identified genes. PPI network construction revealed a total of 109 nodes and 386 edges, and 20 key genes were identified. The major enriched terms of three GO categories included regulation of cell proliferation, positive regulation of cellular biosynthetic processes, cell leading edge, cytosol and cell projection, protein kinase activity, transcription activator activity and enzyme binding. KEGG analysis revealed pancreatic cancer, prostate cancer and non-small cell lung cancer as the

  5. Differential hexosamine biosynthetic pathway gene expression with type 2 diabetes

    Directory of Open Access Journals (Sweden)

    Megan Coomer

    2014-01-01

    Full Text Available The hexosamine biosynthetic pathway (HBP culminates in the attachment of O-linked β-N-acetylglucosamine (O-GlcNAc onto serine/threonine residues of target proteins. The HBP is regulated by several modulators, i.e. O-linked β-N-acetylglucosaminyl transferase (OGT and β-N-acetylglucosaminidase (OGA catalyze the addition and removal of O-GlcNAc moieties, respectively; while flux is controlled by the rate-limiting enzyme glutamine:fructose-6-phosphate amidotransferase (GFPT, transcribed by two genes, GFPT1 and GFPT2. Since increased HBP flux is glucose-responsive and linked to insulin resistance/type 2 diabetes onset, we hypothesized that diabetic individuals exhibit differential expression of HBP regulatory genes. Volunteers (n = 60; n = 20 Mixed Ancestry, n = 40 Caucasian were recruited from Stellenbosch and Paarl (Western Cape, South Africa and classified as control, pre- or diabetic according to fasting plasma glucose and HbA1c levels, respectively. RNA was purified from leukocytes isolated from collected blood samples and OGT, OGA, GFPT1 and GFPT2 expressions determined by quantitative real-time PCR. The data reveal lower OGA expression in diabetic individuals (P < 0.01, while pre- and diabetic subjects displayed attenuated OGT expression vs. controls (P < 0.01 and P < 0.001, respectively. Moreover, GFPT2 expression decreased in pre- and diabetic Caucasians vs. controls (P < 0.05 and P < 0.01, respectively. We also found ethnic differences, i.e. Mixed Ancestry individuals exhibited a 2.4-fold increase in GFPT2 expression vs. Caucasians, despite diagnosis (P < 0.01. Gene expression of HBP regulators differs between diabetic and non-diabetic individuals, together with distinct ethnic-specific gene profiles. Thus differential HBP gene regulation may offer diagnostic utility and provide candidate susceptibility genes for different ethnic groupings.

  6. Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.

    Science.gov (United States)

    Zhou, Hang; Yang, Yang; Shen, Hong-Bin

    2017-03-15

    Protein subcellular localization prediction has been an important research topic in computational biology over the last decade. Various automatic methods have been proposed to predict locations for large scale protein datasets, where statistical machine learning algorithms are widely used for model construction. A key step in these predictors is encoding the amino acid sequences into feature vectors. Many studies have shown that features extracted from biological domains, such as gene ontology and functional domains, can be very useful for improving the prediction accuracy. However, domain knowledge usually results in redundant features and high-dimensional feature spaces, which may degenerate the performance of machine learning models. In this paper, we propose a new amino acid sequence-based human protein subcellular location prediction approach Hum-mPLoc 3.0, which covers 12 human subcellular localizations. The sequences are represented by multi-view complementary features, i.e. context vocabulary annotation-based gene ontology (GO) terms, peptide-based functional domains, and residue-based statistical features. To systematically reflect the structural hierarchy of the domain knowledge bases, we propose a novel feature representation protocol denoted as HCM (Hidden Correlation Modeling), which will create more compact and discriminative feature vectors by modeling the hidden correlations between annotation terms. Experimental results on four benchmark datasets show that HCM improves prediction accuracy by 5-11% and F 1 by 8-19% compared with conventional GO-based methods. A large-scale application of Hum-mPLoc 3.0 on the whole human proteome reveals proteins co-localization preferences in the cell. www.csbio.sjtu.edu.cn/bioinf/Hum-mPLoc3/. hbshen@sjtu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  7. Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

    Energy Technology Data Exchange (ETDEWEB)

    Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)

    2013-10-01

    The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.

  8. Transcriptome sequencing and annotation of the microalgae Dunaliella tertiolecta: Pathway description and gene discovery for production of next-generation biofuels

    Directory of Open Access Journals (Sweden)

    Bibby Kyle

    2011-03-01

    Full Text Available Abstract Background Biodiesel or ethanol derived from lipids or starch produced by microalgae may overcome many of the sustainability challenges previously ascribed to petroleum-based fuels and first generation plant-based biofuels. The paucity of microalgae genome sequences, however, limits gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for the non-model microalgae species, Dunaliella tertiolecta, and identify pathways and genes of importance related to biofuel production. Results Next generation DNA pyrosequencing technology applied to D. tertiolecta transcripts produced 1,363,336 high quality reads with an average length of 400 bases. Following quality and size trimming, ~ 45% of the high quality reads were assembled into 33,307 isotigs with a 31-fold coverage and 376,482 singletons. Assembled sequences and singletons were subjected to BLAST similarity searches and annotated with Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG orthology (KO identifiers. These analyses identified the majority of lipid and starch biosynthesis and catabolism pathways in D. tertiolecta. Conclusions The construction of metabolic pathways involved in the biosynthesis and catabolism of fatty acids, triacylglycrols, and starch in D. tertiolecta as well as the assembled transcriptome provide a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock.

  9. Neurogenic gene regulatory pathways in the sea urchin embryo.

    Science.gov (United States)

    Wei, Zheng; Angerer, Lynne M; Angerer, Robert C

    2016-01-15

    During embryogenesis the sea urchin early pluteus larva differentiates 40-50 neurons marked by expression of the pan-neural marker synaptotagmin B (SynB) that are distributed along the ciliary band, in the apical plate and pharyngeal endoderm, and 4-6 serotonergic neurons that are confined to the apical plate. Development of all neurons has been shown to depend on the function of Six3. Using a combination of molecular screens and tests of gene function by morpholino-mediated knockdown, we identified SoxC and Brn1/2/4, which function sequentially in the neurogenic regulatory pathway and are also required for the differentiation of all neurons. Misexpression of Brn1/2/4 at low dose caused an increase in the number of serotonin-expressing cells and at higher dose converted most of the embryo to a neurogenic epithelial sphere expressing the Hnf6 ciliary band marker. A third factor, Z167, was shown to work downstream of the Six3 and SoxC core factors and to define a branch specific for the differentiation of serotonergic neurons. These results provide a framework for building a gene regulatory network for neurogenesis in the sea urchin embryo. © 2016. Published by The Company of Biologists Ltd.

  10. Interaction between leptin and leptin receptor in gastric carcinoma: Gene ontology analysis Interacción entre la leptina y su receptor en el carcinoma gástrico: análisis de ontología genética

    Directory of Open Access Journals (Sweden)

    V. Wiwanitkit

    2007-04-01

    Full Text Available Gastric carcinoma is a rare but important malignancy. The link between leptin, a cytokine that is elevated in obese individuals, and cancer development has been proposed. It is noted that leptin and its receptor may play a positive role in the progression in gastric cancer. However, the exact mechanism resulting form the interaction between leptin and leptin receptor has never been clarified. Here, the author used a new gene ontology technology to predict the molecular function and biological process due to the interaction between leptin and leptin receptor. Comparing to leptin and leptin receptor, the leptin-leptin receptor poses the same function and biological process as leptin receptor. This can confirm that leptin receptor has a significant suppressive effect on the expression of leptin. Loss of hormone activity and disturbance of normal cell signaling pathway of leptin can be seen. Blocking of receptor might be rational therapeutic strategy.El carcinoma gástrico es un cáncer muy poco frecuente pero importante. Se ha postulado que la leptina, una citocina que aparece elevada en las personas obesas, está relacionada con el cáncer. Se sabe que la leptina y su receptor pueden desempeñar un papel positivo en la progresión del cáncer gástrico. Sin embargo, nunca se ha dilucidado el mecanismo exacto al que daría lugar la interacción entre la leptina y el receptor de leptina. Aquí, el autor empleó una nueva tecnología de ontología genética para predecir la función molecular y el proceso biológico resultantes de la interacción entre la leptina y su receptor. Frente a la leptina y su receptor, el compuesto leptina-receptor realiza la misma función y el mismo proceso biológico que el receptor de leptina. Esto puede confirmar que el receptor de leptina ejerce un importante efecto supresor sobre la expresión de leptina. Pueden observarse una pérdida de actividad hormonal y la alteración de la vía normal de señalización celular

  11. Gene expression profiling reveals different molecular patterns in G-protein coupled receptor signaling pathways between early- and late-onset preeclampsia.

    Science.gov (United States)

    Liang, Mengmeng; Niu, Jianmin; Zhang, Liang; Deng, Hua; Ma, Jian; Zhou, Weiping; Duan, Dongmei; Zhou, Yuheng; Xu, Huikun; Chen, Longding

    2016-04-01

    Early-onset preeclampsia and late-onset preeclampsia have been regarded as two different phenotypes with heterogeneous manifestations; To gain insights into the pathogenesis of the two traits, we analyzed the gene expression profiles in preeclamptic placentas. A whole genome-wide microarray was used to determine the gene expression profiles in placental tissues from patients with early-onset (n = 7; 36 weeks) preeclampsia and their controls who delivered preterm (n = 5; 36 weeks). Genes were termed differentially expressed if they showed a fold-change ≥ 2 and q-value preeclampsia (177 genes were up-regulated and 450 were down-regulated). Gene ontology analysis identified significant alterations in several biological processes; the top two were immune response and cell surface receptor linked signal transduction. Among the cell surface receptor linked signal transduction-related, differentially expressed genes, those involved in the G-protein coupled receptor protein signaling pathway were significantly enriched. G-protein coupled receptor signaling pathway related genes, such as GPR124 and MRGPRF, were both found to be down-regulated in early-onset preeclampsia. The results were consistent with those of western blotting that the abundance of GPR124 was lower in early-onset compared with late-onset preeclampsia. The different gene expression profiles reflect the different levels of transcription regulation between the two conditions and supported the hypothesis that they are separate disease entities. Moreover, the G-protein coupled receptor signaling pathway related genes may contribute to the mechanism underlying early- and late-onset preeclampsia. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors.

    Science.gov (United States)

    Loboda, Andrey; Nebozhyn, Michael; Klinghoffer, Rich; Frazier, Jason; Chastain, Michael; Arthur, William; Roberts, Brian; Zhang, Theresa; Chenard, Melissa; Haines, Brian; Andersen, Jannik; Nagashima, Kumiko; Paweletz, Cloud; Lynch, Bethany; Feldman, Igor; Dai, Hongyue; Huang, Pearl; Watters, James

    2010-06-30

    Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90%) sensitivity but relatively low (50%) specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical utility in lung and breast tumors.

  13. A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors

    Directory of Open Access Journals (Sweden)

    Paweletz Cloud

    2010-06-01

    Full Text Available Abstract Background Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. Methods We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. Results The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90% sensitivity but relatively low (50% specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. Conclusions These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical

  14. GeneAnalytics Pathway Analysis and Genetic Overlap among Autism Spectrum Disorder, Bipolar Disorder and Schizophrenia

    Directory of Open Access Journals (Sweden)

    Naveen S. Khanzada

    2017-02-01

    Full Text Available Bipolar disorder (BPD and schizophrenia (SCH show similar neuropsychiatric behavioral disturbances, including impaired social interaction and communication, seen in autism spectrum disorder (ASD with multiple overlapping genetic and environmental influences implicated in risk and course of illness. GeneAnalytics software was used for pathway analysis and genetic profiling to characterize common susceptibility genes obtained from published lists for ASD (792 genes, BPD (290 genes and SCH (560 genes. Rank scores were derived from the number and nature of overlapping genes, gene-disease association, tissue specificity and gene functions subdivided into categories (e.g., diseases, tissues or functional pathways. Twenty-three genes were common to all three disorders and mapped to nine biological Superpathways including Circadian entrainment (10 genes, score = 37.0, Amphetamine addiction (five genes, score = 24.2, and Sudden infant death syndrome (six genes, score = 24.1. Brain tissues included the medulla oblongata (11 genes, score = 2.1, thalamus (10 genes, score = 2.0 and hypothalamus (nine genes, score = 2.0 with six common genes (BDNF, DRD2, CHRNA7, HTR2A, SLC6A3, and TPH2. Overlapping genes impacted dopamine and serotonin homeostasis and signal transduction pathways, impacting mood, behavior and physical activity level. Converging effects on pathways governing circadian rhythms support a core etiological relationship between neuropsychiatric illnesses and sleep disruption with hypoxia and central brain stem dysfunction.

  15. Comparing Relational and Ontological Triple Stores in Healthcare Domain

    Directory of Open Access Journals (Sweden)

    Ozgu Can

    2017-01-01

    Full Text Available Today’s technological improvements have made ubiquitous healthcare systems that converge into smart healthcare applications in order to solve patients’ problems, to communicate effectively with patients, and to improve healthcare service quality. The first step of building a smart healthcare information system is representing the healthcare data as connected, reachable, and sharable. In order to achieve this representation, ontologies are used to describe the healthcare data. Combining ontological healthcare data with the used and obtained data can be maintained by storing the entire health domain data inside big data stores that support both relational and graph-based ontological data. There are several big data stores and different types of big data sets in the healthcare domain. The goal of this paper is to determine the most applicable ontology data store for storing the big healthcare data. For this purpose, AllegroGraph and Oracle 12c data stores are compared based on their infrastructural capacity, loading time, and query response times. Hence, healthcare ontologies (GENE Ontology, Gene Expression Ontology (GEXO, Regulation of Transcription Ontology (RETO, Regulation of Gene Expression Ontology (REXO are used to measure the ontology loading time. Thereafter, various queries are constructed and executed for GENE ontology in order to measure the capacity and query response times for the performance comparison between AllegroGraph and Oracle 12c triple stores.

  16. Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

    LENUS (Irish Health Repository)

    Fitzpatrick, David A

    2010-05-10

    Abstract Background Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB), an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1) and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae). Saccharomyces cerevisiae is also included as a reference genome. Results CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging\\/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine) and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http:\\/\\/cgob.ucd.ie.

  17. Construction of ontology augmented networks for protein complex prediction.

    Science.gov (United States)

    Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian

    2013-01-01

    Protein complexes are of great importance in understanding the principles of cellular organization and function. The increase in available protein-protein interaction data, gene ontology and other resources make it possible to develop computational methods for protein complex prediction. Most existing methods focus mainly on the topological structure of protein-protein interaction networks, and largely ignore the gene ontology annotation information. In this article, we constructed ontology augmented networks with protein-protein interaction data and gene ontology, which effectively unified the topological structure of protein-protein interaction networks and the similarity of gene ontology annotations into unified distance measures. After constructing ontology augmented networks, a novel method (clustering based on ontology augmented networks) was proposed to predict protein complexes, which was capable of taking into account the topological structure of the protein-protein interaction network, as well as the similarity of gene ontology annotations. Our method was applied to two different yeast protein-protein interaction datasets and predicted many well-known complexes. The experimental results showed that (i) ontology augmented networks and the unified distance measure can effectively combine the structure closeness and gene ontology annotation similarity; (ii) our method is valuable in predicting protein complexes and has higher F1 and accuracy compared to other competing methods.

  18. Integration of multiple networks and pathways identifies cancer driver genes in pan-cancer analysis.

    Science.gov (United States)

    Cava, Claudia; Bertoli, Gloria; Colaprico, Antonio; Olsen, Catharina; Bontempi, Gianluca; Castiglioni, Isabella

    2018-01-06

    Modern high-throughput genomic technologies represent a comprehensive hallmark of molecular changes in pan-cancer studies. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. Pathways and networks are important tools to explain the role of genes in functional genomic studies. However, few methods consider the functional non-equal roles of genes in pathways and the complex gene-gene interactions in a network. We present a novel method in pan-cancer analysis that identifies de-regulated genes with a functional role by integrating pathway and network data. A pan-cancer analysis of 7158 tumour/normal samples from 16 cancer types identified 895 genes with a central role in pathways and de-regulated in cancer. Comparing our approach with 15 current tools that identify cancer driver genes, we found that 35.6% of the 895 genes identified by our method have been found as cancer driver genes with at least 2/15 tools. Finally, we applied a machine learning algorithm on 16 independent GEO cancer datasets to validate the diagnostic role of cancer driver genes for each cancer. We obtained a list of the top-ten cancer driver genes for each cancer considered in this study. Our analysis 1) confirmed that there are several known cancer driver genes in common among different types of cancer, 2) highlighted that cancer driver genes are able to regulate crucial pathways.

  19. WAFs lead molting retardation of naupliar stages with down-regulated expression profiles of chitin metabolic pathway and related genes in the copepod Tigriopus japonicus.

    Science.gov (United States)

    Hwang, Dae-Sik; Lee, Min-Chul; Kyung, Do-Hyun; Kim, Hui-Su; Han, Jeonghoon; Kim, Il-Chan; Puthumana, Jayesh; Lee, Jae-Seong

    2017-03-01

    Oil pollution is considered being disastrous to marine organisms and ecosystems. As molting is critical in the developmental process of arthropods in general and copepods, in particular, the impact will be adverse if the target of spilled oil is on molting. Thus, we investigated the harmful effects of water accommodated fractions (WAFs) of crude oil with an emphasis on inhibition of chitin metabolic pathways related genes and developmental retardation in the copepod Tigriopus japonicus. Also, we analysed the ontology and domain of chitin metabolic pathway genes and mRNA expression patterns of developmental stage-specific genes. Further, the developmental retardation followed by transcriptional modulations in nuclear receptor genes (NR) and chitin metabolic pathway-related genes were observed in the WAFs-exposed T. japonicus. As a result, the developmental time was found significantly (P<0.05) delayed in response to 40% WAFs in comparison with that of control. Moreover, the NR gene, HR3 and chitinases (CHT9 and CHT10) were up-regulated in N4-5 stages, while chitin synthase genes (CHS-1, CHS-2-1, and CHS-2-2) down-regulated in response to WAFs. In brief, a high concentration of WAFs repressed nuclear receptor genes but elicited activation of some of the transcription factors at low concentration of WAFs, resulting in suppression of chitin synthesis. Thus, we suggest that WAF can lead molting retardation of naupliar stages in T. japonicus through down-regulations of chitin metabolism. These findings will provide a better understanding of the mode of action of chitin biosynthesis associated with molting mechanism in WAF-exposed T. japonicus. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. The Pathway From Genes to Gene Therapy in Glaucoma: A Review of Possibilities for Using Genes as Glaucoma Drugs.

    Science.gov (United States)

    Borrás, Teresa

    2017-01-01

    Treatment of diseases with gene therapy is advancing rapidly. The use of gene therapy has expanded from the original concept of re-placing the mutated gene causing the disease to the use of genes to con-trol nonphysiological levels of expression or to modify pathways known to affect the disease. Genes offer numerous advantages over conventional drugs. They have longer duration of action and are more specific. Genes can be delivered to the target site by naked DNA, cells, nonviral, and viral vectors. The enormous progress of the past decade in molecular bi-ology and delivery systems has provided ways for targeting genes to the intended cell/tissue and safe, long-term vectors. The eye is an ideal organ for gene therapy. It is easily accessible and it is an immune-privileged site. Currently, there are clinical trials for diseases affecting practically every tissue of the eye, including those to restore vision in patients with Leber congenital amaurosis. However, the number of eye trials compared with those for systemic diseases is quite low (1.8%). Nevertheless, judg-ing by the vast amount of ongoing preclinical studies, it is expected that such number will increase considerably in the near future. One area of great need for eye gene therapy is glaucoma, where a long-term gene drug would eliminate daily applications and compliance issues. Here, we review the current state of gene therapy for glaucoma and the possibilities for treating the trabecular meshwork to lower intraocular pressure and the retinal ganglion cells to protect them from neurodegeneration. Copyright© 2017 Asia-Pacific Academy of Ophthalmology.

  1. XML, Ontologies, and Their Clinical Applications.

    Science.gov (United States)

    Yu, Chunjiang; Shen, Bairong

    2016-01-01

    The development of information technology has resulted in its penetration into every area of clinical research. Various clinical systems have been developed, which produce increasing volumes of clinical data. However, saving, exchanging, querying, and exploiting these data are challenging issues. The development of Extensible Markup Language (XML) has allowed the generation of flexible information formats to facilitate the electronic sharing of structured data via networks, and it has been used widely for clinical data processing. In particular, XML is very useful in the fields of data standardization, data exchange, and data integration. Moreover, ontologies have been attracting increased attention in various clinical fields in recent years. An ontology is the basic level of a knowledge representation scheme, and various ontology repositories have been developed, such as Gene Ontology and BioPortal. The creation of these standardized repositories greatly facilitates clinical research in related fields. In this chapter, we discuss the basic concepts of XML and ontologies, as well as their clinical applications.

  2. RaMP: A Comprehensive Relational Database of Metabolomics Pathways for Pathway Enrichment Analysis of Genes and Metabolites.

    Science.gov (United States)

    Zhang, Bofei; Hu, Senyang; Baskin, Elizabeth; Patt, Andrew; Siddiqui, Jalal K; Mathé, Ewy A

    2018-02-22

    The value of metabolomics in translational research is undeniable, and metabolomics data are increasingly generated in large cohorts. The functional interpretation of disease-associated metabolites though is difficult, and the biological mechanisms that underlie cell type or disease-specific metabolomics profiles are oftentimes unknown. To help fully exploit metabolomics data and to aid in its interpretation, analysis of metabolomics data with other complementary omics data, including transcriptomics, is helpful. To facilitate such analyses at a pathway level, we have developed RaMP (Relational database of Metabolomics Pathways), which combines biological pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG), Reactome, WikiPathways, and the Human Metabolome DataBase (HMDB). To the best of our knowledge, an off-the-shelf, public database that maps genes and metabolites to biochemical/disease pathways and can readily be integrated into other existing software is currently lacking. For consistent and comprehensive analysis, RaMP enables batch and complex queries (e.g., list all metabolites involved in glycolysis and lung cancer), can readily be integrated into pathway analysis tools, and supports pathway overrepresentation analysis given a list of genes and/or metabolites of interest. For usability, we have developed a RaMP R package (https://github.com/Mathelab/RaMP-DB), including a user-friendly RShiny web application, that supports basic simple and batch queries, pathway overrepresentation analysis given a list of genes or metabolites of interest, and network visualization of gene-metabolite relationships. The package also includes the raw database file (mysql dump), thereby providing a stand-alone downloadable framework for public use and integration with other tools. In addition, the Python code needed to recreate the database on another system is also publicly available (https://github.com/Mathelab/RaMP-BackEnd). Updates for databases in RaMP will be

  3. Semantic similarity between ontologies at different scales

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Qingpeng; Haglin, David J.

    2016-04-01

    In the past decade, existing and new knowledge and datasets has been encoded in different ontologies for semantic web and biomedical research. The size of ontologies is often very large in terms of number of concepts and relationships, which makes the analysis of ontologies and the represented knowledge graph computational and time consuming. As the ontologies of various semantic web and biomedical applications usually show explicit hierarchical structures, it is interesting to explore the trade-offs between ontological scales and preservation/precision of results when we analyze ontologies. This paper presents the first effort of examining the capability of this idea via studying the relationship between scaling biomedical ontologies at different levels and the semantic similarity values. We evaluate the semantic similarity between three Gene Ontology slims (Plant, Yeast, and Candida, among which the latter two belong to the same kingdom—Fungi) using four popular measures commonly applied to biomedical ontologies (Resnik, Lin, Jiang-Conrath, and SimRel). The results of this study demonstrate that with proper selection of scaling levels and similarity measures, we can significantly reduce the size of ontologies without losing substantial detail. In particular, the performance of Jiang-Conrath and Lin are more reliable and stable than that of the other two in this experiment, as proven by (a) consistently showing that Yeast and Candida are more similar (as compared to Plant) at different scales, and (b) small deviations of the similarity values after excluding a majority of nodes from several lower scales. This study provides a deeper understanding of the application of semantic similarity to biomedical ontologies, and shed light on how to choose appropriate semantic similarity measures for biomedical engineering.

  4. Synthetic lethality between gene defects affecting a single non-essential molecular pathway with reversible steps.

    Directory of Open Access Journals (Sweden)

    Andrei Zinovyev

    2013-04-01

    Full Text Available Systematic analysis of synthetic lethality (SL constitutes a critical tool for systems biology to decipher molecular pathways. The most accepted mechanistic explanation of SL is that the two genes function in parallel, mutually compensatory pathways, known as between-pathway SL. However, recent genome-wide analyses in yeast identified a significant number of within-pathway negative genetic interactions. The molecular mechanisms leading to within-pathway SL are not fully understood. Here, we propose a novel mechanism leading to within-pathway SL involving two genes functioning in a single non-essential pathway. This type of SL termed within-reversible-pathway SL involves reversible pathway steps, catalyzed by different enzymes in the forward and backward directions, and kinetic trapping of a potentially toxic intermediate. Experimental data with recombinational DNA repair genes validate the concept. Mathematical modeling recapitulates the possibility of kinetic trapping and revealed the potential contributions of synthetic, dosage-lethal interactions in such a genetic system as well as the possibility of within-pathway positive masking interactions. Analysis of yeast gene interaction and pathway data suggests broad applicability of this novel concept. These observations extend the canonical interpretation of synthetic-lethal or synthetic-sick interactions with direct implications to reconstruct molecular pathways and improve therapeutic approaches to diseases such as cancer.

  5. Identification of sugarcane genes involved in the purine synthesis pathway

    Directory of Open Access Journals (Sweden)

    Mario A. Jancso

    2001-12-01

    Full Text Available Nucleotide synthesis is of central importance to all cells. In most organisms, the purine nucleotides are synthesized de novo from non-nucleotide precursors such as amino acids, ammonia and carbon dioxide. An understanding of the enzymes involved in sugarcane purine synthesis opens the possibility of using these enzymes as targets for chemicals which may be effective in combating phytopathogen. Such an approach has already been applied to several parasites and types of cancer. The strategy described in this paper was applied to identify sugarcane clusters for each step of the de novo purine synthesis pathway. Representative sequences of this pathway were chosen from the National Center for Biotechnology Information (NCBI database and used to search the translated sugarcane expressed sequence tag (SUCEST database using the available basic local alignment search tool (BLAST facility. Retrieved clusters were further tested for the statistical significance of the alignment by an implementation (PRSS3 of the Monte Carlo shuffling algorithm calibrated using known protein sequences of divergent taxa along the phylogenetic tree. The sequences were compared to each other and to the sugarcane clusters selected using BLAST analysis, with the resulting table of p-values indicating the degree of divergence of each enzyme within different taxa and in relation to the sugarcane clusters. The results obtained by this strategy allowed us to identify the sugarcane proteins participating in the purine synthesis pathway.A via de síntese de purino nucleotídeos é considerada uma via de central importância para todas as células. Na maioria dos organismos, os purino nucleotídeos são sintetizados ''de novo'' a partir de precursores não-nucleotídicos como amino ácidos, amônia e dióxido de carbono. O conhecimento das enzimas envolvidas na via de síntese de purinas da cana-de-açúcar vai abrir a possibilidade do uso dessas enzimas como alvos no desenho

  6. Different gene-expression profiles for the poorly differentiated carcinoma and the highly differentiated papillary adenocarcinoma in mammary glands support distinct metabolic pathways

    International Nuclear Information System (INIS)

    Eilon, Tali; Barash, Itamar

    2008-01-01

    Deregulation of Stat5 in the mammary gland of transgenic mice causes tumorigenesis. Poorly differentiated carcinoma and highly differentiated papillary adenocarcinoma tumors evolve. To distinguish the genes and elucidate the cellular processes and metabolic pathways utilized to preserve these phenotypes, gene-expression profiles were analyzed. Mammary tumors were excised from transgenic mice carrying a constitutively active variant of Stat5, or a Stat5 variant lacking s transactivation domain. These tumors displayed either the carcinoma or the papillary adenocarcinoma phenotypes. cRNAs, prepared from each tumor were hybridized to an Affymetrix GeneChip ® Mouse Genome 430A 2.0 array. Gene-ontology analysis, hierarchical clustering and biological-pathway analysis were performed to distinct the two types of tumors. Histopathology and immunofluorescence staining complemented the comparison between the tumor phenotypes. The nucleus-cytoskeleton-plasma membrane axis is a major target for differential gene expression between phenotypes. In the carcinoma, stronger expression of genes coding for specific integrins, cytoskeletal proteins and calcium-binding proteins highlight cell-adhesion and motility features of the tumor cells. This is supported by the higher expression of genes involved in O-glycan synthesis, TGF-β, activin, their receptors and Smad3, as well as the Notch ligands and members of the γ-secretase complex that enable Notch nuclear localization. The Wnt pathway was also a target for differential gene expression. Higher expression of genes encoding the degradation complex of the canonical pathway and limited TCF expression in the papillary adenocarcinoma result in membranal accumulation of β-catenin, in contrast to its nuclear translocation in the carcinoma. Genes involved in cell-cycle arrest at G1 and response to DNA damage were more highly expressed in the papillary adenocarcinomas, as opposed to favored G2/M regulation in the carcinoma tumors. At least

  7. Assessment Applications of Ontologies.

    Science.gov (United States)

    Chung, Gregory K. W. K.; Niemi, David; Bewley, William L.

    This paper discusses the use of ontologies and their applications to assessment. An ontology provides a shared and common understanding of a domain that can be communicated among people and computational systems. The ontology captures one or more experts' conceptual representation of a domain expressed in terms of concepts and the relationships…

  8. Ontology-based representation and analysis of host-Brucella interactions.

    Science.gov (United States)

    Lin, Yu; Xiang, Zuoshuang; He, Yongqun

    2015-01-01

    Biomedical ontologies are representations of classes of entities in the biomedical domain and how these classes are related in computer- and human-interpretable formats. Ontologies support data standardization and exchange and provide a basis for computer-assisted automated reasoning. IDOBRU is an ontology in the domain of Brucella and brucellosis. Brucella is a Gram-negative intracellular bacterium that causes brucellosis, the most common zoonotic disease in the world. In this study, IDOBRU is used as a platform to model and analyze how the hosts, especially host macrophages, interact with virulent Brucella strains or live attenuated Brucella vaccine strains. Such a study allows us to better integrate and understand intricate Brucella pathogenesis and host immunity mechanisms. Different levels of host-Brucella interactions based on different host cell types and Brucella strains were first defined ontologically. Three important processes of virulent Brucella interacting with host macrophages were represented: Brucella entry into macrophage, intracellular trafficking, and intracellular replication. Two Brucella pathogenesis mechanisms were ontologically represented: Brucella Type IV secretion system that supports intracellular trafficking and replication, and Brucella erythritol metabolism that participates in Brucella intracellular survival and pathogenesis. The host cell death pathway is critical to the outcome of host-Brucella interactions. For better survival and replication, virulent Brucella prevents macrophage cell death. However, live attenuated B. abortus vaccine strain RB51 induces caspase-2-mediated proinflammatory cell death. Brucella-associated cell death processes are represented in IDOBRU. The gene and protein information of 432 manually annotated Brucella virulence factors were represented using the Ontology of Genes and Genomes (OGG) and Protein Ontology (PRO), respectively. Seven inference rules were defined to capture the knowledge of host

  9. A Pathway Based Classification Method for Analyzing Gene Expression for Alzheimer's Disease Diagnosis.

    Science.gov (United States)

    Voyle, Nicola; Keohane, Aoife; Newhouse, Stephen; Lunnon, Katie; Johnston, Caroline; Soininen, Hilkka; Kloszewska, Iwona; Mecocci, Patrizia; Tsolaki, Magda; Vellas, Bruno; Lovestone, Simon; Hodges, Angela; Kiddle, Steven; Dobson, Richard Jb

    2016-01-01

    Recent studies indicate that gene expression levels in blood may be able to differentiate subjects with Alzheimer's disease (AD) from normal elderly controls and mild cognitively impaired (MCI) subjects. However, there is limited replicability at the single marker level. A pathway-based interpretation of gene expression may prove more robust. This study aimed to investigate whether a case/control classification model built on pathway level data was more robust than a gene level model and may consequently perform better in test data. The study used two batches of gene expression data from the AddNeuroMed (ANM) and Dementia Case Registry (DCR) cohorts. Our study used Illumina Human HT-12 Expression BeadChips to collect gene expression from blood samples. Random forest modeling with recursive feature elimination was used to predict case/control status. Age and APOE ɛ4 status were used as covariates for all analysis. Gene and pathway level models performed similarly to each other and to a model based on demographic information only. Any potential increase in concordance from the novel pathway level approach used here has not lead to a greater predictive ability in these datasets. However, we have only tested one method for creating pathway level scores. Further, we have been able to benchmark pathways against genes in datasets that had been extensively harmonized. Further work should focus on the use of alternative methods for creating pathway level scores, in particular those that incorporate pathway topology, and the use of an endophenotype based approach.

  10. Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

    International Nuclear Information System (INIS)

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2008-01-01

    Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may

  11. A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

    Directory of Open Access Journals (Sweden)

    Mary Qu Yang

    Full Text Available Clear cell renal cell carcinoma (ccRCC is the most common and most aggressive form of renal cell cancer (RCC. The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways. Keywords: ccRCC, Causative mutation, Pathways, Protein-protein interaction, Gene module, eQTL

  12. Mapping between the OBO and OWL ontology languages.

    Science.gov (United States)

    Tirmizi, Syed Hamid; Aitken, Stuart; Moreira, Dilvan A; Mungall, Chris; Sequeda, Juan; Shah, Nigam H; Miranker, Daniel P

    2011-03-07

    Ontologies are commonly used in biomedicine to organize concepts to describe domains such as anatomies, environments, experiment, taxonomies etc. NCBO BioPortal currently hosts about 180 different biomedical ontologies. These ontologies have been mainly expressed in either the Open Biomedical Ontology (OBO) format or the Web Ontology Language (OWL). OBO emerged from the Gene Ontology, and supports most of the biomedical ontology content. In comparison, OWL is a Semantic Web language, and is supported by the World Wide Web consortium together with integral query languages, rule languages and distributed infrastructure for information interchange. These features are highly desirable for the OBO content as well. A convenient method for leveraging these features for OBO ontologies is by transforming OBO ontologies to OWL. We have developed a methodology for translating OBO ontologies to OWL using the organization of the Semantic Web itself to guide the work. The approach reveals that the constructs of OBO can be grouped together to form a similar layer cake. Thus we were able to decompose the problem into two parts. Most OBO constructs have easy and obvious equivalence to a construct in OWL. A small subset of OBO constructs requires deeper consideration. We have defined transformations for all constructs in an effort to foster a standard common mapping between OBO and OWL. Our mapping produces OWL-DL, a Description Logics based subset of OWL with desirable computational properties for efficiency and correctness. Our Java implementation of the mapping is part of the official Gene Ontology project source. Our transformation system provides a lossless roundtrip mapping for OBO ontologies, i.e. an OBO ontology may be translated to OWL and back without loss of knowledge. In addition, it provides a roadmap for bridging the gap between the two ontology languages in order to enable the use of ontology content in a language independent manner.

  13. TLR-related pathway analysis : novel gene-gene interactions in the development of asthma and atopy

    NARCIS (Netherlands)

    Reijmerink, N. E.; Bottema, R. W. B.; Kerkhof, M.; Gerritsen, J.; Stelma, F. F.; Thijs, C.; van Schayck, C. P.; Smit, H. A.; Brunekreef, B.; Koppelman, G. H.; Postma, D. S.

    P>Background: The toll-like receptor (TLR)-related pathway is important in host defence and may be crucial in the development of asthma and atopy. Numerous studies have shown associations of TLR-related pathway genes with asthma and atopy phenotypes. So far it has not been investigated whether

  14. Ontologies vs. Classification Systems

    DEFF Research Database (Denmark)

    Madsen, Bodil Nistrup; Erdman Thomsen, Hanne

    2009-01-01

    What is an ontology compared to a classification system? Is a taxonomy a kind of classification system or a kind of ontology? These are questions that we meet when working with people from industry and public authorities, who need methods and tools for concept clarification, for developing meta...... data sets or for obtaining advanced search facilities. In this paper we will present an attempt at answering these questions. We will give a presentation of various types of ontologies and briefly introduce terminological ontologies. Furthermore we will argue that classification systems, e.g. product...... classification systems and meta data taxonomies, should be based on ontologies....

  15. Pre-silencing of genes involved in the electron transport chain (ETC) pathway is associated with responsiveness to abatacept in rheumatoid arthritis.

    Science.gov (United States)

    Derambure, C; Dzangue-Tchoupou, G; Berard, C; Vergne, N; Hiron, M; D'Agostino, M A; Musette, P; Vittecoq, O; Lequerré, T

    2017-05-25

    In the current context of personalized medicine, one of the major challenges in the management of rheumatoid arthritis (RA) is to identify biomarkers that predict drug responsiveness. From the European APPRAISE trial, our main objective was to identify a gene expression profile associated with responsiveness to abatacept (ABA) + methotrexate (MTX) and to understand the involvement of this signature in the pathophysiology of RA. Whole human genome microarrays (4 × 44 K) were performed from a first subset of 36 patients with RA. Data validation by quantitative reverse-transcription (qRT)-PCR was performed from a second independent subset of 32 patients with RA. Gene Ontology and WikiPathways database allowed us to highlight the specific biological mechanisms involved in predicting response to ABA/MTX. From the first subset of 36 patients with RA, a combination including 87 transcripts allowed almost perfect separation between responders and non-responders to ABA/MTX. Next, the second subset of patients 32 with RA allowed validation by qRT-PCR of a minimal signature with only four genes. This latter signature categorized 81% of patients with RA with 75% sensitivity, 85% specificity and 85% negative predictive value. This combination showed a significant enrichment of genes involved in electron transport chain (ETC) pathways. Seven transcripts from ETC pathways (NDUFA6, NDUFA4, UQCRQ, ATP5J, COX7A2, COX7B, COX6A1) were significantly downregulated in responders versus non-responders to ABA/MTX. Moreover, dysregulation of these genes was independent of inflammation and was specific to ABA response. Pre-silencing of ETC genes is associated with future response to ABA/MTX and might be a crucial key to susceptibility to ABA response.

  16. Floral pathway integrator gene expression mediates gradual transmission of environmental and endogenous cues to flowering time.

    Science.gov (United States)

    van Dijk, Aalt D J; Molenaar, Jaap

    2017-01-01

    The appropriate timing of flowering is crucial for the reproductive success of plants. Hence, intricate genetic networks integrate various environmental and endogenous cues such as temperature or hormonal statues. These signals integrate into a network of floral pathway integrator genes. At a quantitative level, it is currently unclear how the impact of genetic variation in signaling pathways on flowering time is mediated by floral pathway integrator genes. Here, using datasets available from literature, we connect Arabidopsis thaliana flowering time in genetic backgrounds varying in upstream signalling components with the expression levels of floral pathway integrator genes in these genetic backgrounds. Our modelling results indicate that flowering time depends in a quite linear way on expression levels of floral pathway integrator genes. This gradual, proportional response of flowering time to upstream changes enables a gradual adaptation to changing environmental factors such as temperature and light.

  17. Floral pathway integrator gene expression mediates gradual transmission of environmental and endogenous cues to flowering time

    Directory of Open Access Journals (Sweden)

    Aalt D.J. van Dijk

    2017-04-01

    Full Text Available The appropriate timing of flowering is crucial for the reproductive success of plants. Hence, intricate genetic networks integrate various environmental and endogenous cues such as temperature or hormonal statues. These signals integrate into a network of floral pathway integrator genes. At a quantitative level, it is currently unclear how the impact of genetic variation in signaling pathways on flowering time is mediated by floral pathway integrator genes. Here, using datasets available from literature, we connect Arabidopsis thaliana flowering time in genetic backgrounds varying in upstream signalling components with the expression levels of floral pathway integrator genes in these genetic backgrounds. Our modelling results indicate that flowering time depends in a quite linear way on expression levels of floral pathway integrator genes. This gradual, proportional response of flowering time to upstream changes enables a gradual adaptation to changing environmental factors such as temperature and light.

  18. IntPath--an integrated pathway gene relationship database for model organisms and important pathogens.

    Science.gov (United States)

    Zhou, Hufeng; Jin, Jingjing; Zhang, Haojun; Yi, Bo; Wozniak, Michal; Wong, Limsoon

    2012-01-01

    Pathway data are important for understanding the relationship between genes, proteins and many other molecules in living organisms. Pathway gene relationships are crucial information for guidance, prediction, reference and assessment in biochemistry, computational biology, and medicine. Many well-established databases--e.g., KEGG, WikiPathways, and BioCyc--are dedicated to collecting pathway data for public access. However, the effectiveness of these databases is hindered by issues such as incompatible data formats, inconsistent molecular representations, inconsistent molecular relationship representations, inconsistent referrals to pathway names, and incomprehensive data from different databases. In this paper, we overcome these issues through extraction, normalization and integration of pathway data from several major public databases (KEGG, WikiPathways, BioCyc, etc). We build a database that not only hosts our integrated pathway gene relationship data for public access but also maintains the necessary updates in the long run. This public repository is named IntPath (Integrated Pathway gene relationship database for model organisms and important pathogens). Four organisms--S. cerevisiae, M. tuberculosis H37Rv, H. Sapiens and M. musculus--are included in this version (V2.0) of IntPath. IntPath uses the "full unification" approach to ensure no deletion and no introduced noise in this process. Therefore, IntPath contains much richer pathway-gene and pathway-gene pair relationships and much larger number of non-redundant genes and gene pairs than any of the single-source databases. The gene relationships of each gene (measured by average node degree) per pathway are significantly richer. The gene relationships in each pathway (measured by average number of gene pairs per pathway) are also considerably richer in the integrated pathways. Moderate manual curation are involved to get rid of errors and noises from source data (e.g., the gene ID errors in WikiPathways and

  19. Toxicology ontology perspectives.

    Science.gov (United States)

    Hardy, Barry; Apic, Gordana; Carthew, Philip; Clark, Dominic; Cook, David; Dix, Ian; Escher, Sylvia; Hastings, Janna; Heard, David J; Jeliazkova, Nina; Judson, Philip; Matis-Mitchell, Sherri; Mitic, Dragana; Myatt, Glenn; Shah, Imran; Spjuth, Ola; Tcheremenskaia, Olga; Toldo, Luca; Watson, David; White, Andrew; Yang, Chihae

    2012-01-01

    The field of predictive toxicology requires the development of open, public, computable, standardized toxicology vocabularies and ontologies to support the applications required by in silico, in vitro, and in vivo toxicology methods and related analysis and reporting activities. In this article we review ontology developments based on a set of perspectives showing how ontologies are being used in predictive toxicology initiatives and applications. Perspectives on resources and initiatives reviewed include OpenTox, eTOX, Pistoia Alliance, ToxWiz, Virtual Liver, EU-ADR, BEL, ToxML, and Bioclipse. We also review existing ontology developments in neighboring fields that can contribute to establishing an ontological framework for predictive toxicology. A significant set of resources is already available to provide a foundation for an ontological framework for 21st century mechanistic-based toxicology research. Ontologies such as ToxWiz provide a basis for application to toxicology investigations, whereas other ontologies under development in the biological, chemical, and biomedical communities could be incorporated in an extended future framework. OpenTox has provided a semantic web framework for the implementation of such ontologies into software applications and linked data resources. Bioclipse developers have shown the benefit of interoperability obtained through ontology by being able to link their workbench application with remote OpenTox web services. Although these developments are promising, an increased international coordination of efforts is greatly needed to develop a more unified, standardized, and open toxicology ontology framework.

  20. Transcriptome characterization and gene expression of Epinephelus spp in endoplasmic reticulum stress-related pathway during betanodavirus infection in vitro

    Directory of Open Access Journals (Sweden)

    Lu Ming-Wei

    2012-11-01

    Full Text Available Abstract Background Grouper (Epinephelus spp is an economically important fish species worldwide. However, viral pathogens such as nervous necrosis virus (NNV have been causing severe infections in the fish, resulting in great loss in the grouper aquaculture industry. Yet, the understanding of the molecular mechanisms underlying the pathogenicity of NNV is still inadequate, mainly due to insufficient genomic information of the host. Results De novo assembly of grouper transcriptome in the grouper kidney (GK cells was conducted by using short read sequencing technology of Solexa/Illumina. A sum of 66,582 unigenes with mean length of 603 bp were obtained, and were annotated according to Gene Ontology (GO and Clusters of Orthologous Groups (COG. In addition, the tag-based digital gene expression (DGE system was used to investigate the gene expression and pathways associated with NNV infection in GK cells. The analysis revealed endoplasmic reticulum (ER stress response was prominently affected in NNV-infected GK cells. A further analysis revealed an interaction between the NNV capsid protein and the ER chaperone immunoglobulin heavy-chain binding protein (BiP. Furthermore, exogenous expression of NNV capsid protein was able to induce XBP-1 mRNA splicing in vivo, suggesting a role of the capsid protein in the NNV-induced ER stress. Conclusions Our data presents valuable genetic information for Epinephelus spp., which will benefit future study in this non-model but economically important species. The DGE profile of ER stress response in NNV-infected cells provides information of many important components associated with the protein processing in ER. Specifically, we showed that the viral capsid protein might play an important role in the ER stress response.

  1. AOP-DB Frontend: A user interface for the Adverse Outcome Pathways Database

    Science.gov (United States)

    The EPA Adverse Outcome Pathway Database (AOP-DB) is a database resource that aggregates association relationships between AOPs, genes, chemicals, diseases, pathways, species orthology information, ontologies. The AOP-DB frontend is a simple yet powerful user interface in the for...

  2. AOP-DB Frontend: A user interface for the Adverse Outcome Pathways Database.

    Science.gov (United States)

    The EPA Adverse Outcome Pathway Database (AOP-DB) is a database resource that aggregates association relationships between AOPs, genes, chemicals, diseases, pathways, species orthology information, ontologies. The AOP-DB frontend is a simple yet powerful AOP-DB user interface in...

  3. Integrated GWAS and Pathway profiling for feed efficiency traits in pigs leads to novel genes and their molecular pathways

    DEFF Research Database (Denmark)

    Do, Duy Ngoc; Ostersen, Tage; Strathe, Anders Bjerring

    2013-01-01

    Genome wide association studies (GWAS) are being extensively used in revealing genetic architecture of complex traits. However, GWAS offer limited understanding of the biological role of significant single nucleotide polymorphisms (SNPs) affecting complex traits. Pathway analysis using GWAS results...... is an important step where we firstly detect genes located near GWAS-detected SNPs and subsequently we detect enrichment of these genes in various biological processes and pathways. The objective of this study was to apply these steps to identify relevant pathways involved in residual feed intake (RFI) in pigs....... Residual feed intake is a feed efficiency measure and is highly economically important in animal production. In our study, a total of 596 Yorkshire boars had phenotypic and genotypic records. After quality control, 37,915 SNPs were available for GWAS which was implemented in the DMU software package...

  4. Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

    Science.gov (United States)

    Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

    2018-06-01

    Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.

  5. A gene pathway analysis highlights the role of cellular adhesion molecules in multiple sclerosis susceptibility

    DEFF Research Database (Denmark)

    Damotte, V; Guillot-Noel, L; Patsopoulos, N A

    2014-01-01

    adhesion molecule (CAMs) biological pathway using Cytoscape software. This network is a strong candidate, as it is involved in the crossing of the blood-brain barrier by the T cells, an early event in MS pathophysiology, and is used as an efficient therapeutic target. We drew up a list of 76 genes...... in interaction with other genes as a group. Pathway analysis is an alternative way to highlight such group of genes. Using SNP association P-values from eight multiple sclerosis (MS) GWAS data sets, we performed a candidate pathway analysis for MS susceptibility by considering genes interacting in the cell...... belonging to the CAM network. We highlighted 64 networks enriched with CAM genes with low P-values. Filtering by a percentage of CAM genes up to 50% and rejecting enriched signals mainly driven by transcription factors, we highlighted five networks associated with MS susceptibility. One of them, constituted...

  6. The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer

    Directory of Open Access Journals (Sweden)

    Rosa Aghdam

    2017-12-01

    Full Text Available Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/.

  7. The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer.

    Science.gov (United States)

    Aghdam, Rosa; Baghfalaki, Taban; Khosravi, Pegah; Saberi Ansari, Elnaz

    2017-12-01

    Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM) method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/. Copyright © 2017. Production and hosting by Elsevier B.V.

  8. A cross-study gene set enrichment analysis identifies critical pathways in endometriosis

    Directory of Open Access Journals (Sweden)

    Bai Chunyan

    2009-09-01

    Full Text Available Abstract Background Endometriosis is an enigmatic disease. Gene expression profiling of endometriosis has been used in several studies, but few studies went further to classify subtypes of endometriosis based on expression patterns and to identify possible pathways involved in endometriosis. Some of the observed pathways are more inconsistent between the studies, and these candidate pathways presumably only represent a fraction of the pathways involved in endometriosis. Methods We applied a standardised microarray preprocessing and gene set enrichment analysis to six independent studies, and demonstrated increased concordance between these gene datasets. Results We find 16 up-regulated and 19 down-regulated pathways common in ovarian endometriosis data sets, 22 up-regulated and one down-regulated pathway common in peritoneal endometriosis data sets. Among them, 12 up-regulated and 1 down-regulated were found consistent between ovarian and peritoneal endometriosis. The main canonical pathways identified are related to immunological and inflammatory disease. Early secretory phase has the most over-represented pathways in the three uterine cycle phases. There are no overlapping significant pathways between the dataset from human endometrial endothelial cells and the datasets from ovarian endometriosis which used whole tissues. Conclusion The study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. By standardised microarray preprocessing and GSEA, we have increased the concordance in identifying many biological mechanisms involved in endometriosis. The identified gene pathways will shed light on the understanding of endometriosis and promote the development of novel therapies.

  9. Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

    Science.gov (United States)

    He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

    2017-01-01

    The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF

  10. Gene expression profiling demonstrates WNT/β-catenin pathway genes alteration in Mexican patients with colorectal cancer and diabetes mellitus.

    Science.gov (United States)

    Ivonne Wence-Chavez, Laura; Palomares-Chacon, Ulises; Pablo Flores-Gutierrez, Juan; Felipe Jave-Suarez, Luis; Del Carmen Aguilar-Lemarroy, Adriana; Barros-Nunez, Patricio; Esperanza Flores-Martinez, Silvia; Sanchez-Corona, Jose; Alejandra Rosales-Reynoso, Monica

    2017-01-01

    Several studies have shown a strong association between diabetes mellitus (DM) and increased risk of colorectal cancer (CRC). The fundamental mechanisms that support this association are not entirely understood; however, it is believed that hyperinsulinemia and hyperglycemia may be involved. Some proposed mechanisms include upregulation of mitogenic signaling pathways like MAPK, PI3K, mTOR, and WNT, which are involved in cell proliferation, growth, and cancer cell survival. The purpose of this study was to evaluate the gene expression profile and identify differently expressed genes involved in mitogenic pathways in CRC patients with and without DM. In this study, microarray analysis of gene expression followed by quantitative PCR (qPCR) was performed in cancer tissue from CRC patients with and without DM to identify the gene expression profiles and validate the differently expressed genes. Among the study groups, some differently expressed genes were identified. However, when bioinformatics clustering tools were used, a significant modulation of genes involved in the WNT pathway was evident. Therefore, we focused on genes participating in this pathway, such as WNT3A, LRP6, TCF7L2, and FRA-1. Validation of the expression levels of those genes by qPCR showed that CRC patients without type 2 diabetes mellitus (T2DM) expressed significantly more WNT3Ay LRP6, but less TCF7L2 and FRA-1 compared to controls, while in CRC patients with DM the expression levels of WNT3A, LRP6, TCF7L2, and FRA-1 were significantly higher compared to controls. Our results suggest that WNT/β-catenin pathway is upregulated in patients with CRC and DM, demonstrating its importance and involvement in both pathologies.

  11. Integrated bioinformatics analysis reveals key candidate genes and pathways in breast cancer.

    Science.gov (United States)

    Wang, Yuzhi; Zhang, Yi; Huang, Qian; Li, Chengwen

    2018-04-19

    Breast cancer (BC) is the leading malignancy in women worldwide, yet relatively little is known about the genes and signaling pathways involved in BC tumorigenesis and progression. The present study aimed to elucidate potential key candidate genes and pathways in BC. Five gene expression profile data sets (GSE22035, GSE3744, GSE5764, GSE21422 and GSE26910) were downloaded from the Gene Expression Omnibus (GEO) database, which included data from 113 tumorous and 38 adjacent non‑tumorous tissue samples. Differentially expressed genes (DEGs) were identified using t‑tests in the limma R package. These DEGs were subsequently investigated by pathway enrichment analysis and a protein‑protein interaction (PPI) network was constructed. The most significant module from the PPI network was selected for pathway enrichment analysis. In total, 227 DEGs were identified, of which 82 were upregulated and 145 were downregulated. Pathway enrichment analysis results revealed that the upregulated DEGs were mainly enriched in 'cell division', the 'proteinaceous extracellular matrix (ECM)', 'ECM structural constituents' and 'ECM‑receptor interaction', whereas downregulated genes were mainly enriched in 'response to drugs', 'extracellular space', 'transcriptional activator activity' and the 'peroxisome proliferator‑activated receptor signaling pathway'. The PPI network contained 174 nodes and 1,257 edges. DNA topoisomerase 2‑a, baculoviral inhibitor of apoptosis repeat‑containing protein 5, cyclin‑dependent kinase 1, G2/mitotic‑specific cyclin‑B1 and kinetochore protein NDC80 homolog were identified as the top 5 hub genes. Furthermore, the genes in the most significant module were predominantly involved in 'mitotic nuclear division', 'mid‑body', 'protein binding' and 'cell cycle'. In conclusion, the DEGs, relative pathways and hub genes identified in the present study may aid in understanding of the molecular mechanisms underlying BC progression and provide

  12. Coregulation of terpenoid pathway genes and prediction of isoprene production in Bacillus subtilis using transcriptomics

    Energy Technology Data Exchange (ETDEWEB)

    Hess, Becky M.; Xue, Junfeng; Markillie, Lye Meng; Taylor, Ronald C.; Wiley, H. S.; Ahring, Birgitte K.; Linggi, Bryan E.

    2013-06-19

    The isoprenoid pathway converts pyruvate to isoprene and related isoprenoid compounds in plants and some bacteria. Currently, this pathway is of great interest because of the critical role that isoprenoids play in basic cellular processes as well as the industrial value of metabolites such as isoprene. Although the regulation of several pathway genes has been described, there is a paucity of information regarding the system level regulation and control of the pathway. To address this limitation, we examined Bacillus subtilis grown under multiple conditions and then determined the relationship between altered isoprene production and the pattern of gene expression. We found that terpenoid genes appeared to fall into two distinct subsets with opposing correlations with respect to the amount of isoprene produced. The group whose expression levels positively correlated with isoprene production included dxs, the gene responsible for the commitment step in the pathway, as well as ispD, and two genes that participate in the mevalonate pathway, yhfS and pksG. The subset of terpenoid genes that inversely correlated with isoprene production included ispH, ispF, hepS, uppS, ispE, and dxr. A genome wide partial least squares regression model was created to identify other genes or pathways that contribute to isoprene production. This analysis showed that a subset of 213 regulated genes was sufficient to create a predictive model of isoprene production under different conditions and showed correlations at the transcriptional level. We conclude that gene expression levels alone are sufficiently informative about the metabolic state of a cell that produces increased isoprene and can be used to build a model which accurately predicts production of this secondary metabolite across many simulated environmental conditions.

  13. Integrating phenotype ontologies with PhenomeNET

    KAUST Repository

    Rodriguez-Garcia, Miguel Angel

    2017-12-19

    Background Integration and analysis of phenotype data from humans and model organisms is a key challenge in building our understanding of normal biology and pathophysiology. However, the range of phenotypes and anatomical details being captured in clinical and model organism databases presents complex problems when attempting to match classes across species and across phenotypes as diverse as behaviour and neoplasia. We have previously developed PhenomeNET, a system for disease gene prioritization that includes as one of its components an ontology designed to integrate phenotype ontologies. While not applicable to matching arbitrary ontologies, PhenomeNET can be used to identify related phenotypes in different species, including human, mouse, zebrafish, nematode worm, fruit fly, and yeast. Results Here, we apply the PhenomeNET to identify related classes from two phenotype and two disease ontologies using automated reasoning. We demonstrate that we can identify a large number of mappings, some of which require automated reasoning and cannot easily be identified through lexical approaches alone. Combining automated reasoning with lexical matching further improves results in aligning ontologies. Conclusions PhenomeNET can be used to align and integrate phenotype ontologies. The results can be utilized for biomedical analyses in which phenomena observed in model organisms are used to identify causative genes and mutations underlying human disease.

  14. Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

    Science.gov (United States)

    Obayashi, Takeshi; Kinoshita, Kengo

    2010-05-01

    Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.

  15. Placental gene-expression profiles of intrahepatic cholestasis of pregnancy reveal involvement of multiple molecular pathways in blood vessel formation and inflammation.

    Science.gov (United States)

    Du, QiaoLing; Pan, YouDong; Zhang, YouHua; Zhang, HaiLong; Zheng, YaJuan; Lu, Ling; Wang, JunLei; Duan, Tao; Chen, JianFeng

    2014-07-07

    Intrahepatic cholestasis of pregnancy (ICP) is a pregnancy-associated liver disease with potentially deleterious consequences for the fetus, particularly when maternal serum bile-acid concentration >40 μM. However, the etiology and pathogenesis of ICP remain elusive. To reveal the underlying molecular mechanisms for the association of maternal serum bile-acid level and fetal outcome in ICP patients, DNA microarray was applied to characterize the whole-genome expression profiles of placentas from healthy women and women diagnosed with ICP. Thirty pregnant women recruited in this study were categorized evenly into three groups: healthy group; mild ICP, with serum bile-acid concentration ranging from 10-40 μM; and severe ICP, with bile-acid concentration >40 μM. Gene Ontology analysis in combination with construction of gene-interaction and gene co-expression networks were applied to identify the core regulatory genes associated with ICP pathogenesis, which were further validated by quantitative real-time PCR and histological staining. The core regulatory genes were mainly involved in immune response, VEGF signaling pathway and G-protein-coupled receptor signaling, implying essential roles of immune response, vasculogenesis and angiogenesis in ICP pathogenesis. This implication was supported by the observed aggregated immune-cell infiltration and deficient blood vessel formation in ICP placentas. Our study provides a system-level insight into the placental gene-expression profiles of women with mild or severe ICP, and reveals multiple molecular pathways in immune response and blood vessel formation that might contribute to ICP pathogenesis.

  16. Assembly of inflammation-related genes for pathway-focused genetic analysis.

    Directory of Open Access Journals (Sweden)

    Matthew J Loza

    2007-10-01

    Full Text Available Recent identifications of associations between novel variants in inflammation-related genes and several common diseases emphasize the need for systematic evaluations of these genes in disease susceptibility. Considering that many genes are involved in the complex inflammation responses and many genetic variants in these genes have the potential to alter the functions and expression of these genes, we assembled a list of key inflammation-related genes to facilitate the identification of genetic associations of diseases with an inflammation-related etiology. We first reviewed various phases of inflammation responses, including the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. Assisted by the Ingenuity Pathway Analysis, we then identified 17 functional sub-pathways that are involved in one or multiple phases. This organization would greatly increase the chance of detecting gene-gene interactions by hierarchical clustering of genes with their functional closeness in a pathway. Finally, as an example application, we have developed tagging single nucleotide polymorphism (tSNP arrays for populations of European and African descent to capture all the common variants of these key inflammation-related genes. Assays of these tSNPs have been designed and assembled into two Affymetrix ParAllele customized chips, one each for European (12,011 SNPs and African (21,542 SNPs populations. These tSNPs have greater coverage for these inflammation-related genes compared to the existing genome-wide arrays, particularly in the African population. These tSNP arrays can facilitate systematic evaluation of inflammation pathways in disease susceptibility. For additional applications, other genotyping platforms could also be employed. For existing genome-wide association data, this list of key inflammation-related genes and

  17. MicroRNA expression, target genes, and signaling pathways in infants with a ventricular septal defect.

    Science.gov (United States)

    Chai, Hui; Yan, Zhaoyuan; Huang, Ke; Jiang, Yuanqing; Zhang, Lin

    2018-02-01

    This study aimed to systematically investigate the relationship between miRNA expression and the occurrence of ventricular septal defect (VSD), and characterize the miRNA target genes and pathways that can lead to VSD. The miRNAs that were differentially expressed in blood samples from VSD and normal infants were screened and validated by implementing miRNA microarrays and qRT-PCR. The target genes regulated by differentially expressed miRNAs were predicted using three target gene databases. The functions and signaling pathways of the target genes were enriched using the GO database and KEGG database, respectively. The transcription and protein expression of specific target genes in critical pathways were compared in the VSD and normal control groups using qRT-PCR and western blotting, respectively. Compared with the normal control group, the VSD group had 22 differentially expressed miRNAs; 19 were downregulated and three were upregulated. The 10,677 predicted target genes participated in many biological functions related to cardiac development and morphogenesis. Four target genes (mGLUR, Gq, PLC, and PKC) were involved in the PKC pathway and four (ECM, FAK, PI3 K, and PDK1) were involved in the PI3 K-Akt pathway. The transcription and protein expression of these eight target genes were significantly upregulated in the VSD group. The 22 miRNAs that were dysregulated in the VSD group were mainly downregulated, which may result in the dysregulation of several key genes and biological functions related to cardiac development. These effects could also be exerted via the upregulation of eight specific target genes, the subsequent over-activation of the PKC and PI3 K-Akt pathways, and the eventual abnormal cardiac development and VSD.

  18. Halobenzoquinone-Induced Alteration of Gene Expression Associated with Oxidative Stress Signaling Pathways.

    Science.gov (United States)

    Li, Jinhua; Moe, Birget; Liu, Yanming; Li, Xing-Fang

    2018-06-05

    Halobenzoquinones (HBQs) are emerging disinfection byproducts (DBPs) that effectively induce reactive oxygen species and oxidative damage in vitro. However, the impacts of HBQs on oxidative-stress-related gene expression have not been investigated. In this study, we examined alterations in the expression of 44 genes related to oxidative-stress-induced signaling pathways in human uroepithelial cells (SV-HUC-1) upon exposure to six HBQs. The results show the structure-dependent effects of HBQs on the studied gene expression. After 2 h of exposure, the expression levels of 9 to 28 genes were altered, while after 8 h of exposure, the expression levels of 29 to 31 genes were altered. Four genes ( HMOX1, NQO1, PTGS2, and TXNRD1) were significantly upregulated by all six HBQs at both exposure time points. Ingenuity pathway analysis revealed that the Nrf2 pathway was significantly responsive to HBQ exposure. Other canonical pathways responsive to HBQ exposure included GSH redox reductions, superoxide radical degradation, and xenobiotic metabolism signaling. This study has demonstrated that HBQs significantly alter the gene expression of oxidative-stress-related signaling pathways and contributes to the understanding of HBQ-DBP-associated toxicity.

  19. Gene expression profiling and candidate gene resequencing identifies pathways and mutations important for malignant transformation caused by leukemogenic fusion genes.

    Science.gov (United States)

    Novak, Rachel L; Harper, David P; Caudell, David; Slape, Christopher; Beachy, Sarah H; Aplan, Peter D

    2012-12-01

    NUP98-HOXD13 (NHD13) and CALM-AF10 (CA10) are oncogenic fusion proteins produced by recurrent chromosomal translocations in patients with acute myeloid leukemia (AML). Transgenic mice that express these fusions develop AML with a long latency and incomplete penetrance, suggesting that collaborating genetic events are required for leukemic transformation. We employed genetic techniques to identify both preleukemic abnormalities in healthy transgenic mice as well as collaborating events leading to leukemic transformation. Candidate gene resequencing revealed that 6 of 27 (22%) CA10 AMLs spontaneously acquired a Ras pathway mutation and 8 of 27 (30%) acquired an Flt3 mutation. Two CA10 AMLs acquired an Flt3 internal-tandem duplication, demonstrating that these mutations can be acquired in murine as well as human AML. Gene expression profiles revealed a marked upregulation of Hox genes, particularly Hoxa5, Hoxa9, and Hoxa10 in both NHD13 and CA10 mice. Furthermore, mir196b, which is embedded within the Hoxa locus, was overexpressed in both CA10 and NHD13 samples. In contrast, the Hox cofactors Meis1 and Pbx3 were differentially expressed; Meis1 was increased in CA10 AMLs but not NHD13 AMLs, whereas Pbx3 was consistently increased in NHD13 but not CA10 AMLs. Silencing of Pbx3 in NHD13 cells led to decreased proliferation, increased apoptosis, and decreased colony formation in vitro, suggesting a previously unexpected role for Pbx3 in leukemic transformation. Published by Elsevier Inc.

  20. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to Alternaria brown spot resistance in citrus reveal clusters of resistance genes.

    Directory of Open Access Journals (Sweden)

    José Cuenca

    Full Text Available Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR to map a genome region linked to Alternaria brown spot (ABS resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids.

  1. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to Alternaria brown spot resistance in citrus reveal clusters of resistance genes.

    Science.gov (United States)

    Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis

    2013-01-01

    Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids.

  2. MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.

    Science.gov (United States)

    Zhang, Chengxin; Zheng, Wei; Freddolino, Peter L; Zhang, Yang

    2018-03-10

    Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/. Copyright © 2018. Published by Elsevier Ltd.

  3. Constructing Adverse Outcome Pathways: a Demonstration of ...

    Science.gov (United States)

    Adverse outcome pathway (AOP) provides a conceptual framework to evaluate and integrate chemical toxicity and its effects across the levels of biological organization. As such, it is essential to develop a resource-efficient and effective approach to extend molecular initiating events (MIEs) of chemicals to their downstream phenotypes of a greater regulatory relevance. A number of ongoing public phenomics (high throughput phenotyping) efforts have been generating abundant phenotypic data annotated with ontology terms. These phenotypes can be analyzed semantically and linked to MIEs of interest, all in the context of a knowledge base integrated from a variety of ontologies for various species and knowledge domains. In such analyses, two phenotypic profiles (PPs; anchored by genes or diseases) each characterized by multiple ontology terms are compared for their semantic similarities within a common ontology graph, but across boundaries of species and knowledge domains. Taking advantage of publicly available ontologies and software tool kits, we have implemented an OS-Mapping (Ontology-based Semantics Mapping) approach as a Java application, and constructed a network of 19383 PPs as nodes with edges weighed by their pairwise semantic similarity scores. Individual PPs were assembled from public phenomics data. Out of possible 1.87×108 pairwise connections among these nodes, about 71% of them have similarity scores between 0.2 and the maximum possible of 1.0.

  4. The association of environmental, individual factors, and dopamine pathway gene variation with smoking cessation.

    Science.gov (United States)

    Li, Suyun; Wang, Qiang; Pan, Lulu; Yang, Xiaorong; Li, Huijie; Jiang, Fan; Zhang, Nan; Han, Mingkui; Jia, Chongqi

    2017-09-01

    This study aimed to examine whether dopamine (DA) pathway gene variation were associated with smoking cessation, and compare the relative importance of infulence factors on smoking cessation. Participants were recruited from 17 villages of Shandong Province, China. Twenty-five single nucleotide polymorphisms in 8 DA pathway genes were genotyped. Weighted gene score of each gene was used to analyze the whole gene effect. Logistic regression was used to calculate odds ratios (OR) of the total gene score for smoking cessation. Dominance analysis was employed to compare the relative importance of individual, heaviness of smoking, psychological and genetic factors on smoking cessation. 415 successful spontaneous smoking quitters served as the cases, and 404 unsuccessful quitters served as the controls. A significant negative association of total DA pathway gene score and smoking cessation was observed (p smoking cessation was heaviness of smoking score (42%), following by individual (40%), genetic (10%) and psychological score (8%). In conclusion, although the DA pathway gene variation was significantly associated with successful smoking cessation, heaviness of smoking and individual factors had bigger effect than genetic factors on smoking cessation.

  5. Inferring the functional effect of gene expression changes in signaling pathways

    Science.gov (United States)

    Sebastián-León, Patricia; Carbonell, José; Salavert, Francisco; Sanchez, Rubén; Medina, Ignacio; Dopazo, Joaquín

    2013-01-01

    Signaling pathways constitute a valuable source of information that allows interpreting the way in which alterations in gene activities affect to particular cell functionalities. There are web tools available that allow viewing and editing pathways, as well as representing experimental data on them. However, few methods aimed to identify the signaling circuits, within a pathway, associated to the biological problem studied exist and none of them provide a convenient graphical web interface. We present PATHiWAYS, a web-based signaling pathway visualization system that infers changes in signaling that affect cell functionality from the measurements of gene expression values in typical expression microarray case–control experiments. A simple probabilistic model of the pathway is used to estimate the probabilities for signal transmission from any receptor to any final effector molecule (taking into account the pathway topology) using for this the individual probabilities of gene product presence/absence inferred from gene expression values. Significant changes in these probabilities allow linking different cell functionalities triggered by the pathway to the biological problem studied. PATHiWAYS is available at: http://pathiways.babelomics.org/. PMID:23748960

  6. About miRNAs, miRNA seeds, target genes and target pathways.

    Science.gov (United States)

    Kehl, Tim; Backes, Christina; Kern, Fabian; Fehlmann, Tobias; Ludwig, Nicole; Meese, Eckart; Lenhof, Hans-Peter; Keller, Andreas

    2017-12-05

    miRNAs are typically repressing gene expression by binding to the 3' UTR, leading to degradation of the mRNA. This process is dominated by the eight-base seed region of the miRNA. Further, miRNAs are known not only to target genes but also to target significant parts of pathways. A logical line of thoughts is: miRNAs with similar (seed) sequence target similar sets of genes and thus similar sets of pathways. By calculating similarity scores for all 3.25 million pairs of 2,550 human miRNAs, we found that this pattern frequently holds, while we also observed exceptions. Respective results were obtained for both, predicted target genes as well as experimentally validated targets. We note that miRNAs target gene set similarity follows a bimodal distribution, pointing at a set of 282 miRNAs that seems to target genes with very high specificity. Further, we discuss miRNAs with different (seed) sequences that nonetheless regulate similar gene sets or pathways. Most intriguingly, we found miRNA pairs that regulate different gene sets but similar pathways such as miR-6886-5p and miR-3529-5p. These are jointly targeting different parts of the MAPK signaling cascade. The main goal of this study is to provide a general overview on the results, to highlight a selection of relevant results on miRNAs, miRNA seeds, target genes and target pathways and to raise awareness for artifacts in respective comparisons. The full set of information that allows to infer detailed results on each miRNA has been included in miRPathDB, the miRNA target pathway database (https://mpd.bioinf.uni-sb.de).

  7. Evolutionary Rate Heterogeneity of Primary and Secondary Metabolic Pathway Genes in Arabidopsis thaliana.

    Science.gov (United States)

    Mukherjee, Dola; Mukherjee, Ashutosh; Ghosh, Tapash Chandra

    2015-11-10

    Primary metabolism is essential to plants for growth and development, and secondary metabolism helps plants to interact with the environment. Many plant metabolites are industrially important. These metabolites are produced by plants through complex metabolic pathways. Lack of knowledge about these pathways is hindering the successful breeding practices for these metabolites. For a better knowledge of the metabolism in plants as a whole, evolutionary rate variation of primary and secondary metabolic pathway genes is a prerequisite. In this study, evolutionary rate variation of primary and secondary metabolic pathway genes has been analyzed in the model plant Arabidopsis thaliana. Primary metabolic pathway genes were found to be more conserved than secondary metabolic pathway genes. Several factors such as gene structure, expression level, tissue specificity, multifunctionality, and domain number are the key factors behind this evolutionary rate variation. This study will help to better understand the evolutionary dynamics of plant metabolism. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Silencing of the pentose phosphate pathway genes influences DNA replication in human fibroblasts.

    Science.gov (United States)

    Fornalewicz, Karolina; Wieczorek, Aneta; Węgrzyn, Grzegorz; Łyżeń, Robert

    2017-11-30

    Previous reports and our recently published data indicated that some enzymes of glycolysis and the tricarboxylic acid cycle can affect the genome replication process by changing either the efficiency or timing of DNA synthesis in human normal cells. Both these pathways are connected with the pentose phosphate pathway (PPP pathway). The PPP pathway supports cell growth by generating energy and precursors for nucleotides and amino acids. Therefore, we asked if silencing of genes coding for enzymes involved in the pentose phosphate pathway may also affect the control of DNA replication in human fibroblasts. Particular genes coding for PPP pathway enzymes were partially silenced with specific siRNAs. Such cells remained viable. We found that silencing of the H6PD, PRPS1, RPE genes caused less efficient enterance to the S phase and decrease in efficiency of DNA synthesis. On the other hand, in cells treated with siRNA against G6PD, RBKS and TALDO genes, the fraction of cells entering the S phase was increased. However, only in the case of G6PD and TALDO, the ratio of BrdU incorporation to DNA was significantly changed. The presented results together with our previously published studies illustrate the complexity of the influence of genes coding for central carbon metabolism on the control of DNA replication in human fibroblasts, and indicate which of them are especially important in this process. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. An ensemble method to predict target genes and pathways in uveal melanoma

    Directory of Open Access Journals (Sweden)

    Wei Chao

    2018-04-01

    Full Text Available This work proposes to predict target genes and pathways for uveal melanoma (UM based on an ensemble method and pathway analyses. Methods: The ensemble method integrated a correlation method (Pearson correlation coefficient, PCC, a causal inference method (IDA and a regression method (Lasso utilizing the Borda count election method. Subsequently, to validate the performance of PIL method, comparisons between confirmed database and predicted miRNA targets were performed. Ultimately, pathway enrichment analysis was conducted on target genes in top 1000 miRNA-mRNA interactions to identify target pathways for UM patients. Results: Thirty eight of the predicted interactions were matched with the confirmed interactions, indicating that the ensemble method was a suitable and feasible approach to predict miRNA targets. We obtained 50 seed miRNA-mRNA interactions of UM patients and extracted target genes from these interactions, such as ASPG, BSDC1 and C4BP. The 601 target genes in top 1,000 miRNA-mRNA interactions were enriched in 12 target pathways, of which Phototransduction was the most significant one. Conclusion: The target genes and pathways might provide a new way to reveal the molecular mechanism of UM and give hand for target treatments and preventions of this malignant tumor.

  10. Optimal structural inference of signaling pathways from unordered and overlapping gene sets.

    Science.gov (United States)

    Acharya, Lipi R; Judeh, Thair; Wang, Guangdi; Zhu, Dongxiao

    2012-02-15

    A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a 'search and score' network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and

  11. Constructive Ontology Engineering

    Science.gov (United States)

    Sousan, William L.

    2010-01-01

    The proliferation of the Semantic Web depends on ontologies for knowledge sharing, semantic annotation, data fusion, and descriptions of data for machine interpretation. However, ontologies are difficult to create and maintain. In addition, their structure and content may vary depending on the application and domain. Several methods described in…

  12. A UML profile for the OBO relation ontology

    Science.gov (United States)

    2012-01-01

    Background Ontologies have increasingly been used in the biomedical domain, which has prompted the emergence of different initiatives to facilitate their development and integration. The Open Biological and Biomedical Ontologies (OBO) Foundry consortium provides a repository of life-science ontologies, which are developed according to a set of shared principles. This consortium has developed an ontology called OBO Relation Ontology aiming at standardizing the different types of biological entity classes and associated relationships. Since ontologies are primarily intended to be used by humans, the use of graphical notations for ontology development facilitates the capture, comprehension and communication of knowledge between its users. However, OBO Foundry ontologies are captured and represented basically using text-based notations. The Unified Modeling Language (UML) provides a standard and widely-used graphical notation for modeling computer systems. UML provides a well-defined set of modeling elements, which can be extended using a built-in extension mechanism named Profile. Thus, this work aims at developing a UML profile for the OBO Relation Ontology to provide a domain-specific set of modeling elements that can be used to create standard UML-based ontologies in the biomedical domain. Results We have studied the OBO Relation Ontology, the UML metamodel and the UML profiling mechanism. Based on these studies, we have proposed an extension to the UML metamodel in conformance with the OBO Relation Ontology and we have defined a profile that implements the extended metamodel. Finally, we have applied the proposed UML profile in the development of a number of fragments from different ontologies. Particularly, we have considered the Gene Ontology (GO), the PRotein Ontology (PRO) and the Xenopus Anatomy and Development Ontology (XAO). Conclusions The use of an established and well-known graphical language in the development of biomedical ontologies provides a more

  13. Towards Agile Ontology Maintenance

    Science.gov (United States)

    Luczak-Rösch, Markus

    Ontologies are an appropriate means to represent knowledge on the Web. Research on ontology engineering reached practices for an integrative lifecycle support. However, a broader success of ontologies in Web-based information systems remains unreached while the more lightweight semantic approaches are rather successful. We assume, paired with the emerging trend of services and microservices on the Web, new dynamic scenarios gain momentum in which a shared knowledge base is made available to several dynamically changing services with disparate requirements. Our work envisions a step towards such a dynamic scenario in which an ontology adapts to the requirements of the accessing services and applications as well as the user's needs in an agile way and reduces the experts' involvement in ontology maintenance processes.

  14. Conceptual querying through ontologies

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik

    2009-01-01

    is motivated by an obvious need for users to survey huge volumes of objects in query answers. An ontology formalism and a special notion of-instantiated ontology" are introduced. The latter is a structure reflecting the content in the document collection in that; it is a restriction of a general world......We present here ail approach to conceptual querying where the aim is, given a collection of textual database objects or documents, to target an abstraction of the entire database content in terms of the concepts appearing in documents, rather than the documents in the collection. The approach...... knowledge ontology to the concepts instantiated in the collection. The notion of ontology-based similarity is briefly described, language constructs for direct navigation and retrieval of concepts in the ontology are discussed and approaches to conceptual summarization are presented....

  15. Survey on Ontology Mapping

    Science.gov (United States)

    Zhu, Junwu

    To create a sharable semantic space in which the terms from different domain ontology or knowledge system, Ontology mapping become a hot research point in Semantic Web Community. In this paper, motivated factors of ontology mapping research are given firstly, and then 5 dominating theories and methods, such as information accessing technology, machine learning, linguistics, structure graph and similarity, are illustrated according their technology class. Before we analyses the new requirements and takes a long view, the contributions of these theories and methods are summarized in details. At last, this paper suggest to design a group of semantic connector with the ability of migration learning for OWL-2 extended with constrains and the ontology mapping theory of axiom, so as to provide a new methodology for ontology mapping.

  16. Annotating breast cancer microarray samples using ontologies

    Science.gov (United States)

    Liu, Hongfang; Li, Xin; Yoon, Victoria; Clarke, Robert

    2008-01-01

    As the most common cancer among women, breast cancer results from the accumulation of mutations in essential genes. Recent advance in high-throughput gene expression microarray technology has inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access of data from many experiments have caused inconsistencies which initiated the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard microarray Gene Expression Data (MGED) ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to the breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to indexed breast cancer microarray samples from GEO using BCM-CO and MGED ontology and developed a prototype system with web interface that allows the retrieval of microarray data based on the ontology annotations. PMID:18999108

  17. Horizontal gene transfer of an entire metabolic pathway between a eukaryotic alga and its DNA virus

    Science.gov (United States)

    Monier, Adam; Pagarete, António; de Vargas, Colomban; Allen, Michael J.; Read, Betsy; Claverie, Jean-Michel; Ogata, Hiroyuki

    2009-01-01

    Interactions between viruses and phytoplankton, the main primary producers in the oceans, affect global biogeochemical cycles and climate. Recent studies are increasingly revealing possible cases of gene transfers between cyanobacteria and phages, which might have played significant roles in the evolution of cyanobacteria/phage systems. However, little has been documented about the occurrence of horizontal gene transfer in eukaryotic phytoplankton/virus systems. Here we report phylogenetic evidence for the transfer of seven genes involved in the sphingolipid biosynthesis pathway between the cosmopolitan eukaryotic microalga Emiliania huxleyi and its large DNA virus EhV. PCR assays indicate that these genes are prevalent in E. huxleyi and EhV strains isolated from different geographic locations. Patterns of protein and gene sequence conservation support that these genes are functional in both E. huxleyi and EhV. This is the first clear case of horizontal gene transfer of multiple functionally linked enzymes in a eukaryotic phytoplankton–virus system. We examine arguments for the possible direction of the gene transfer. The virus-to-host direction suggests the existence of ancient viruses that controlled the complex metabolic pathway in order to infect primitive eukaryotic cells. In contrast, the host-to-virus direction suggests that the serial acquisition of genes involved in the same metabolic pathway might have been a strategy for the ancestor of EhVs to stay ahead of their closest relatives in the great evolutionary race for survival. PMID:19451591

  18. Analyzing the genes related to Alzheimer's disease via a network and pathway-based approach.

    Science.gov (United States)

    Hu, Yan-Shi; Xin, Juncai; Hu, Ying; Zhang, Lei; Wang, Ju

    2017-04-27

    Our understanding of the molecular mechanisms underlying Alzheimer's disease (AD) remains incomplete. Previous studies have revealed that genetic factors provide a significant contribution to the pathogenesis and development of AD. In the past years, numerous genes implicated in this disease have been identified via genetic association studies on candidate genes or at the genome-wide level. However, in many cases, the roles of these genes and their interactions in AD are still unclear. A comprehensive and systematic analysis focusing on the biological function and interactions of these genes in the context of AD will therefore provide valuable insights to understand the molecular features of the disease. In this study, we collected genes potentially associated with AD by screening publications on genetic association studies deposited in PubMed. The major biological themes linked with these genes were then revealed by function and biochemical pathway enrichment analysis, and the relation between the pathways was explored by pathway crosstalk analysis. Furthermore, the network features of these AD-related genes were analyzed in the context of human interactome and an AD-specific network was inferred using the Steiner minimal tree algorithm. We compiled 430 human genes reported to be associated with AD from 823 publications. Biological theme analysis indicated that the biological processes and biochemical pathways related to neurodevelopment, metabolism, cell growth and/or survival, and immunology were enriched in these genes. Pathway crosstalk analysis then revealed that the significantly enriched pathways could be grouped into three interlinked modules-neuronal and metabolic module, cell growth/survival and neuroendocrine pathway module, and immune response-related module-indicating an AD-specific immune-endocrine-neuronal regulatory network. Furthermore, an AD-specific protein network was inferred and novel genes potentially associated with AD were identified. By

  19. Quality control for terms and definitions in ontologies and taxonomies

    Directory of Open Access Journals (Sweden)

    Rüegg Alexander

    2006-04-01

    Full Text Available Abstract Background Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the Gene Ontology (GO, the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic types, which flow from a failure to address basic ontological principles. As yet, no methods have been proposed which would allow ontology curators to pinpoint flawed terms or definitions in ontologies in a systematic way. Results We present computational methods that automatically identify terms and definitions which are defined in a circular or unintelligible way. We further demonstrate the potential of these methods by applying them to isolate a subset of 6001 problematic GO terms. By automatically aligning GO with other ontologies and taxonomies we were able to propose alternative synonyms and definitions for some of these problematic terms. This allows us to demonstrate that these other resources do not contain definitions superior to those supplied by GO. Conclusion Our methods provide reliable indications of the quality of terms and definitions in ontologies and taxonomies. Further, they are well suited to assist ontology curators in drawing their attention to those terms that are ill-defined. We have further shown the limitations of ontology mapping and alignment in assisting ontology curators in rectifying problems, thus pointing to the need for manual curation.

  20. Practical ontologies for information professionals

    CERN Document Server

    AUTHOR|(CDS)2071712

    2016-01-01

    Practical Ontologies for Information Professionals provides an introduction to ontologies and their development, an essential tool for fighting back against information overload. The development of robust and widely used ontologies is an increasingly important tool in the fight against information overload. The publishing and sharing of explicit explanations for a wide variety of conceptualizations, in a machine readable format, has the power to both improve information retrieval and identify new knowledge. This new book provides an accessible introduction to the following: * What is an ontology? Defining the concept and why it is increasingly important to the information professional * Ontologies and the semantic web * Existing ontologies, such as SKOS, OWL, FOAF, schema.org, and the DBpedia Ontology * Adopting and building ontologies, showing how to avoid repetition of work and how to build a simple ontology with Protege * Interrogating semantic web ontologies * The future of ontologies and the role of the ...

  1. Genomics of Human Pulmonary Tuberculosis: from Genes to Pathways

    DEFF Research Database (Denmark)

    Stein, Catherine M.; Sausville, Lindsay; Wejse, Christian

    2017-01-01

    Purpose of Review Tuberculosis (TB), caused by Mycobacterium tuberculosis (MTB), remains a major public health threat globally. Several lines of evidence support a role for host genetic factors in resistance/susceptibility to TB disease and MTB infection. However, results across candidate gene...

  2. Cloning and Expression Analysis of MEP Pathway Enzyme-encoding Genes in Osmanthus fragrans

    Directory of Open Access Journals (Sweden)

    Chen Xu

    2016-09-01

    Full Text Available The 2-C-methyl-d-erythritol 4-phosphate (MEP pathway is responsible for the biosynthesis of many crucial secondary metabolites, such as carotenoids, monoterpenes, plastoquinone, and tocopherols. In this study, we isolated and identified 10 MEP pathway genes in the important aromatic plant sweet osmanthus (Osmanthus fragrans. Multiple sequence alignments revealed that 10 MEP pathway genes shared high identities with other reported proteins. The genes showed distinctive expression profiles in various tissues, or at different flower stages and diel time points. The qRT-PCR results demonstrated that these genes were highly expressed in inflorescences, which suggested a tissue-specific transcript pattern. Our results also showed that OfDXS1, OfDXS2, and OfHDR1 had a clear diurnal oscillation pattern. The isolation and expression analysis provides a strong foundation for further research on the MEP pathway involved in gene function and molecular evolution, and improves our understanding of the molecular mechanism underlying this pathway in plants.

  3. Ontological foundations for evolutionary economics: A Darwinian social ontology

    NARCIS (Netherlands)

    Stoelhorst, J.W.

    2008-01-01

    The purpose of this paper is to further the project of generalized Darwinism by developing a social ontology on the basis of a combined commitment to ontological continuity and ontological commonality. Three issues that are central to the development of a social ontology are addressed: (1) the

  4. Application of R to investigate common gene regulatory network pathway among bipolar disorder and associate diseases

    Directory of Open Access Journals (Sweden)

    Nahida Habib

    2016-12-01

    Full Text Available Depression, Major Depression or mental disorder creates severe diseases. Mental illness such as Unipolar Major Depression, Bipolar Disorder, Dysthymia, Schizophrenia, Cardiovascular Diseases (Hypertension, Coronary Heart Disease, Stroke etc., are known as Major Depression. Several studies have revealed the possibilities about the association among Bipolar Disorder, Schizophrenia, Coronary Heart Diseases and Stroke with each other. The current study aimed to investigate the relationships between genetic variants in the above four diseases and to create a common pathway or PPI network. The associated genes of each disease are collected from different gene database with verification using R. After performing some preprocessing, mining and operations using R on collected genes, seven (7 common associated genes are discovered on selected four diseases (SZ, BD, CHD and Stroke. In each of the iteration, the numbers of collected genes are reduced up to 51%, 36%, 10%, 2% and finally less than 1% respectively. Moreover, common pathway on selected diseases has been investigated in this research.

  5. Identification of Key Pathways and Genes in Advanced Coronary Atherosclerosis Using Bioinformatics Analysis

    Directory of Open Access Journals (Sweden)

    Xiaowen Tan

    2017-01-01

    Full Text Available Background. Coronary artery atherosclerosis is a chronic inflammatory disease. This study aimed to identify the key changes of gene expression between early and advanced carotid atherosclerotic plaque in human. Methods. Gene expression dataset GSE28829 was downloaded from Gene Expression Omnibus (GEO, including 16 advanced and 13 early stage atherosclerotic plaque samples from human carotid. Differentially expressed genes (DEGs were analyzed. Results. 42,450 genes were obtained from the dataset. Top 100 up- and downregulated DEGs were listed. Functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG identification were performed. The result of functional and pathway enrichment analysis indicted that the immune system process played a critical role in the progression of carotid atherosclerotic plaque. Protein-protein interaction (PPI networks were performed either. Top 10 hub genes were identified from PPI network and top 6 modules were inferred. These genes were mainly involved in chemokine signaling pathway, cell cycle, B cell receptor signaling pathway, focal adhesion, and regulation of actin cytoskeleton. Conclusion. The present study indicated that analysis of DEGs would make a deeper understanding of the molecular mechanisms of atherosclerosis development and they might be used as molecular targets and diagnostic biomarkers for the treatment of atherosclerosis.

  6. Differences in gene expression profiles and signaling pathways in rhabdomyolysis-induced acute kidney injury.

    Science.gov (United States)

    Geng, Xiaodong; Wang, Yuanda; Hong, Quan; Yang, Jurong; Zheng, Wei; Zhang, Gang; Cai, Guangyan; Chen, Xiangmei; Wu, Di

    2015-01-01

    Rhabdomyolysis is a threatening syndrome because it causes the breakdown of skeletal muscle. Muscle destruction leads to the release of myoglobin, intracellular proteins, and electrolytes into the circulation. The aim of this study was to investigate the differences in gene expression profiles and signaling pathways upon rhabdomyolysis-induced acute kidney injury (AKI). In this study, we used glycerol-induced renal injury as a model of rhabdomyolysis-induced AKI. We analyzed data and relevant information from the Gene Expression Omnibus database (No: GSE44925). The gene expression data for three untreated mice were compared to data for five mice with rhabdomyolysis-induced AKI. The expression profiling of the three untreated mice and the five rhabdomyolysis-induced AKI mice was performed using microarray analysis. We examined the levels of Cyp3a13, Rela, Aldh7a1, Jun, CD14. And Cdkn1a using RT-PCR to determine the accuracy of the microarray results. The microarray analysis showed that there were 1050 downregulated and 659 upregulated genes in the rhabdomyolysis-induced AKI mice compared to the control group. The interactions of all differentially expressed genes in the Signal-Net were analyzed. Cyp3a13 and Rela had the most interactions with other genes. The data showed that Rela and Aldh7a1 were the key nodes and had important positions in the Signal-Net. The genes Jun, CD14, and Cdkn1a were also significantly upregulated. The pathway analysis classified the differentially expressed genes into 71 downregulated and 48 upregulated pathways including the PI3K/Akt, MAPK, and NF-κB signaling pathways. The results of this study indicate that the NF-κB, MAPK, PI3K/Akt, and apoptotic pathways are regulated in rhabdomyolysis-induced AKI.

  7. The Planteome database: an integrated resource for reference ontologies, plant genomics and phenomics

    Science.gov (United States)

    Cooper, Laurel; Meier, Austin; Laporte, Marie-Angélique; Elser, Justin L; Mungall, Chris; Sinn, Brandon T; Cavaliere, Dario; Carbon, Seth; Dunn, Nathan A; Smith, Barry; Qu, Botong; Preece, Justin; Zhang, Eugene; Todorovic, Sinisa; Gkoutos, Georgios; Doonan, John H; Stevenson, Dennis W; Arnaud, Elizabeth

    2018-01-01

    Abstract The Planteome project (http://www.planteome.org) provides a suite of reference and species-specific ontologies for plants and annotations to genes and phenotypes. Ontologies serve as common standards for semantic integration of a large and growing corpus of plant genomics, phenomics and genetics data. The reference ontologies include the Plant Ontology, Plant Trait Ontology and the Plant Experimental Conditions Ontology developed by the Planteome project, along with the Gene Ontology, Chemical Entities of Biological Interest, Phenotype and Attribute Ontology, and others. The project also provides access to species-specific Crop Ontologies developed by various plant breeding and research communities from around the world. We provide integrated data on plant traits, phenotypes, and gene function and expression from 95 plant taxa, annotated with reference ontology terms. The Planteome project is developing a plant gene annotation platform; Planteome Noctua, to facilitate community engagement. All the Planteome ontologies are publicly available and are maintained at the Planteome GitHub site (https://github.com/Planteome) for sharing, tracking revisions and new requests. The annotated data are freely accessible from the ontology browser (http://browser.planteome.org/amigo) and our data repository. PMID:29186578

  8. RNA Interference Screen to Identify Pathways That Enhance or Reduce Nonviral Gene Transfer During Lipofection

    OpenAIRE

    Barker, Gregory A; Diamond, Scott L

    2008-01-01

    Some barriers to DNA lipofection are well characterized; however, there is as yet no method of finding unknown pathways that impact the process. A druggable genome small-interfering RNA (siRNA) screen against 5,520 genes was tested for its effect on lipofection of human aortic endothelial cells (HAECs). We found 130 gene targets which, when silenced by pooled siRNAs (three siRNAs per gene), resulted in enhanced luminescence after lipofection (86 gene targets showed reduced expression). In con...

  9. Integrated pathway-based transcription regulation network mining and visualization based on gene expression profiles.

    Science.gov (United States)

    Kibinge, Nelson; Ono, Naoaki; Horie, Masafumi; Sato, Tetsuo; Sugiura, Tadao; Altaf-Ul-Amin, Md; Saito, Akira; Kanaya, Shigehiko

    2016-06-01

    Conventionally, workflows examining transcription regulation networks from gene expression data involve distinct analytical steps. There is a need for pipelines that unify data mining and inference deduction into a singular framework to enhance interpretation and hypotheses generation. We propose a workflow that merges network construction with gene expression data mining focusing on regulation processes in the context of transcription factor driven gene regulation. The pipeline implements pathway-based modularization of expression profiles into functional units to improve biological interpretation. The integrated workflow was implemented as a web application software (TransReguloNet) with functions that enable pathway visualization and comparison of transcription factor activity between sample conditions defined in the experimental design. The pipeline merges differential expression, network construction, pathway-based abstraction, clustering and visualization. The framework was applied in analysis of actual expression datasets related to lung, breast and prostrate cancer. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. Perspectives on ontology learning

    CERN Document Server

    Lehmann, J

    2014-01-01

    Perspectives on Ontology Learning brings together researchers and practitioners from different communities − natural language processing, machine learning, and the semantic web − in order to give an interdisciplinary overview of recent advances in ontology learning.Starting with a comprehensive introduction to the theoretical foundations of ontology learning methods, the edited volume presents the state-of-the-start in automated knowledge acquisition and maintenance. It outlines future challenges in this area with a special focus on technologies suitable for pushing the boundaries beyond the c

  11. Smoking, genes encoding dopamine pathway and risk for Parkinson's disease.

    Science.gov (United States)

    Gu, Zhuqin; Feng, Xiuli; Dong, Xiumin; Chan, Piu

    2010-09-20

    Smoking has been reported to be inversely associated with Parkinson's disease (PD) in many studies, but a recent study in China found that smoking increased the risk of PD. Variants in genes associated with dopamine metabolism found to increase the risk for PD have also been associated with smoking behavior. To investigate the association between smoking and PD in a Chinese population and determine whether the genetic variants of genes involved in dopamine metabolism influence the relationship between smoking and risk for PD. Chinese PD patients were recruited from Xuanwu Hospital. Controls were sampled from community. Detailed information on life-long smoking behavior was collected by face-to-face interview. Genotypes were determined for SLC6A3 VNTR, COMT Val108/158Met and MAO-B intron13 A/G polymorphisms by PCR-RFLP, DHPLC and sequencing. Chi-square and logistic regression model were used in the analysis. 176 PD cases and 354 controls were enrolled in this study. 23.9% cases are smokers, compared to 48.0% in controls. Ever smoking is inversely associated with PD (odds ratio=0.14, 95% CI 0.08-0.26, adjusted for age and gender). None of the above-mentioned genetic polymorphisms was associated with PD risk or smoking. When each variant was included in the logistic regression model, the inverse association between smoking and PD remained the same, and the interactions between smoking and variants were not significant in the model. Our data support a reduction of PD risk associated with smoking in a Chinese population. These variants of genes associated with DA uptake and metabolism do not affect the inverse association between smoking and PD. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.

  12. Candidate gene approach for parasite resistance in sheep--variation in immune pathway genes and association with fecal egg count.

    Directory of Open Access Journals (Sweden)

    Kathiravan Periasamy

    Full Text Available Sheep chromosome 3 (Oar3 has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05 in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. Further, SNP loci identified in the study were found to have

  13. Data mining for ontology development.

    Energy Technology Data Exchange (ETDEWEB)

    Davidson, George S.; Strasburg, Jana (Pacific Northwest National Laboratory, Richland, WA); Stampf, David (Brookhaven National Laboratory, Upton, NY); Neymotin,Lev (Brookhaven National Laboratory, Upton, NY); Czajkowski, Carl (Brookhaven National Laboratory, Upton, NY); Shine, Eugene (Savannah River National Laboratory, Aiken, SC); Bollinger, James (Savannah River National Laboratory, Aiken, SC); Ghosh, Vinita (Brookhaven National Laboratory, Upton, NY); Sorokine, Alexandre (Oak Ridge National Laboratory, Oak Ridge, TN); Ferrell, Regina (Oak Ridge National Laboratory, Oak Ridge, TN); Ward, Richard (Oak Ridge National Laboratory, Oak Ridge, TN); Schoenwald, David Alan

    2010-06-01

    A multi-laboratory ontology construction effort during the summer and fall of 2009 prototyped an ontology for counterfeit semiconductor manufacturing. This effort included an ontology development team and an ontology validation methods team. Here the third team of the Ontology Project, the Data Analysis (DA) team reports on their approaches, the tools they used, and results for mining literature for terminology pertinent to counterfeit semiconductor manufacturing. A discussion of the value of ontology-based analysis is presented, with insights drawn from other ontology-based methods regularly used in the analysis of genomic experiments. Finally, suggestions for future work are offered.

  14. Signaling pathway-focused gene expression profiling in pressure overloaded hearts

    Directory of Open Access Journals (Sweden)

    Marco Musumeci

    2011-01-01

    Full Text Available The β-blocker propranolol displays antihypertrophic and antifibrotic properties in the heart subjected to pressure overload. Yet the underlying mechanisms responsible for these important effects remain to be completely understood. The purpose of this study was to determine signaling pathway-focused gene expression profile associated with the antihypertrophic action of propranolol in pressure overloaded hearts. To address this question, a focused real-time PCR array was used to screen left ventricular RNA expression of 84 gene transcripts representative of 18 different signaling pathways in C57BL/6 mice subjected to transverse aortic constriction (TAC or sham surgery. On the surgery day, mice received either propranolol (80 mg/kg/day or vehicle for 14 days. TAC caused a 49% increase in the left ventricular weight-to-body weight (LVW/BW ratio without changing gene expression. Propranolol blunted LVW/BW ratio increase by approximately 50% while causing about a 3-fold increase in the expression of two genes, namely Brca1 and Cdkn2a, belonging to the TGF-beta and estrogen pathways, respectively. In conclusion, after 2 weeks of pressure overload, TAC hearts show a gene expression profile superimposable to that of sham hearts. Conversely, propranolol treatment is associated with an increased expression of genes which negatively regulate cell cycle progression. It remains to be established whether a mechanistic link between gene expression changes and the antihypertrophic action of propranolol occurs.

  15. Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2006-01-01

    Full Text Available Background: Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results: We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM with those in the probeset-level model (PSLM. Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many

  16. User centered and ontology based information retrieval system for life sciences.

    Science.gov (United States)

    Sy, Mohameth-François; Ranwez, Sylvie; Montmain, Jacky; Regnault, Armelle; Crampes, Michel; Ranwez, Vincent

    2012-01-25

    Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications and the Medical Subject Headings is the basis of biomedical publications indexation and information retrieval process proposed by PubMed. However current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources and no explanation for their adequacy to the query is provided. Users may thus be confused by the selection and have no idea on how to adapt their queries so that the results match their expectations. This paper describes an information retrieval system that relies on domain ontology to widen the set of relevant documents that is retrieved and that uses a graphical rendering of query results to favor user interactions. Semantic proximities between ontology concepts and aggregating models are used to assess documents adequacy with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of data corpus, by facilitating query concepts weighting and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies one of which aiming at collecting human genes related to transcription factors involved in hemopoiesis pathway. The ontology based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user centred application in which the system enlightens relevant information to provide decision help.

  17. User centered and ontology based information retrieval system for life sciences

    Directory of Open Access Journals (Sweden)

    Sy Mohameth-François

    2012-01-01

    Full Text Available Abstract Background Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications and the Medical Subject Headings is the basis of biomedical publications indexation and information retrieval process proposed by PubMed. However current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources and no explanation for their adequacy to the query is provided. Users may thus be confused by the selection and have no idea on how to adapt their queries so that the results match their expectations. Results This paper describes an information retrieval system that relies on domain ontology to widen the set of relevant documents that is retrieved and that uses a graphical rendering of query results to favor user interactions. Semantic proximities between ontology concepts and aggregating models are used to assess documents adequacy with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of data corpus, by facilitating query concepts weighting and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies one of which aiming at collecting human genes related to transcription factors involved in hemopoiesis pathway. Conclusions The ontology based information retrieval system described in this paper (OBIRS is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user centred application in which the system enlightens

  18. Ontology of fractures

    Science.gov (United States)

    Zhong, Jian; Aydina, Atilla; McGuinness, Deborah L.

    2009-03-01

    Fractures are fundamental structures in the Earth's crust and they can impact many societal and industrial activities including oil and gas exploration and production, aquifer management, CO 2 sequestration, waste isolation, the stabilization of engineering structures, and assessing natural hazards (earthquakes, volcanoes, and landslides). Therefore, an ontology which organizes the concepts of fractures could help facilitate a sound education within, and communication among, the highly diverse professional and academic community interested in the problems cited above. We developed a process-based ontology that makes explicit specifications about fractures, their properties, and the deformation mechanisms which lead to their formation and evolution. Our ontology emphasizes the relationships among concepts such as the factors that influence the mechanism(s) responsible for the formation and evolution of specific fracture types. Our ontology is a valuable resource with a potential to applications in a number of fields utilizing recent advances in Information Technology, specifically for digital data and information in computers, grids, and Web services.

  19. A Method for Evaluating and Standardizing Ontologies

    Science.gov (United States)

    Seyed, Ali Patrice

    2012-01-01

    The Open Biomedical Ontology (OBO) Foundry initiative is a collaborative effort for developing interoperable, science-based ontologies. The Basic Formal Ontology (BFO) serves as the upper ontology for the domain-level ontologies of OBO. BFO is an upper ontology of types as conceived by defenders of realism. Among the ontologies developed for OBO…

  20. Manufacturing ontology through templates

    Directory of Open Access Journals (Sweden)

    Diciuc Vlad

    2017-01-01

    Full Text Available The manufacturing industry contains a high volume of knowhow and of high value, much of it being held by key persons in the company. The passing of this know-how is the basis of manufacturing ontology. Among other methods like advanced filtering and algorithm based decision making, one way of handling the manufacturing ontology is via templates. The current paper tackles this approach and highlights the advantages concluding with some recommendations.

  1. The Electronic Notebook Ontology

    OpenAIRE

    Chalk, Stuart

    2016-01-01

    Science is rapidly being brought into the electronic realm and electronic laboratory notebooks (ELN) are a big part of this activity. The representation of the scientific process in the context of an ELN is an important component to making the data recorded in ELNs semantically integrated. This presentation will outline initial developments of an Electronic Notebook Ontology (ENO) that will help tie together the ExptML ontology, HCLS Community Profile data descriptions, and the VIVO-ISF ontol...

  2. RNA-Seq analysis for indigo biosynthesis pathway genes in Indigofera tinctoria and Polygonum tinctorium

    Directory of Open Access Journals (Sweden)

    Bijaya K. Sarangi

    2015-12-01

    Full Text Available Natural indigo is the most important blue dye for textile dyeing and valuable secondary metabolite biosynthesized in Indigofera tinctoria and Polygonum tinctorium plants. Present investigation is made to generation of gene resource for pathway enrichment and to understand possible gene expression involved in indigo biosynthesis. The data about raw reads and the transcriptome assembly project has been deposited at GenBank under the accessions SRA180766 and SRX692542 for I. tinctoria and P. tinctorium, respectively.

  3. Ontology Update in the Cognitive Model of Ontology Learning

    Directory of Open Access Journals (Sweden)

    Zhang De-Hai

    2016-01-01

    Full Text Available Ontology has been used in many hot-spot fields, but most ontology construction methods are semiautomatic, and the construction process of ontology is still a tedious and painstaking task. In this paper, a kind of cognitive models is presented for ontology learning which can simulate human being’s learning from world. In this model, the cognitive strategies are applied with the constrained axioms. Ontology update is a key step when the new knowledge adds into the existing ontology and conflict with old knowledge in the process of ontology learning. This proposal designs and validates the method of ontology update based on the axiomatic cognitive model, which include the ontology update postulates, axioms and operations of the learning model. It is proved that these operators subject to the established axiom system.

  4. Effect of secretory pathway gene overexpression on secretion of a fluorescent reporter protein in Aspergillus nidulans

    DEFF Research Database (Denmark)

    Schalén, Martin; Anyaogu, Diana Chinyere; Hoof, Jakob Blæsbjerg

    2016-01-01

    roles in the process have been identified through transcriptomics. The assignment of function to these genes has been enabled in combination with gene deletion studies. In this work, 14 genes known to play a role in protein secretion in filamentous fungi were overexpressed in Aspergillus nidulans....... The background strain was a fluorescent reporter secreting mRFP. The overall effect of the overexpressions could thus be easily monitored through fluorescence measurements, while the effects on physiology were determined in batch cultivations and surface growth studies. Results: Fourteen protein secretion...... pathway related genes were overexpressed with a tet-ON promoter in the RFP-secreting reporter strain and macromorphology, physiology and protein secretion were monitored when the secretory genes were induced. Overexpression of several of the chosen genes was shown to cause anomalies on growth, micro...

  5. Differential selection on carotenoid biosynthesis genes as a function of gene position in the metabolic pathway: a study on the carrot and dicots.

    Directory of Open Access Journals (Sweden)

    Jérémy Clotault

    Full Text Available Selection of genes involved in metabolic pathways could target them differently depending on the position of genes in the pathway and on their role in controlling metabolic fluxes. This hypothesis was tested in the carotenoid biosynthesis pathway using population genetics and phylogenetics.Evolutionary rates of seven genes distributed along the carotenoid biosynthesis pathway, IPI, PDS, CRTISO, LCYB, LCYE, CHXE and ZEP, were compared in seven dicot taxa. A survey of deviations from neutrality expectations at these genes was also undertaken in cultivated carrot (Daucus carota subsp. sativus, a species that has been intensely bred for carotenoid pattern diversification in its root during its cultivation history. Parts of sequences of these genes were obtained from 46 individuals representing a wide diversity of cultivated carrots. Downstream genes exhibited higher deviations from neutral expectations than upstream genes. Comparisons of synonymous and nonsynonymous substitution rates between genes among dicots revealed greater constraints on upstream genes than on downstream genes. An excess of intermediate frequency polymorphisms, high nucleotide diversity and/or high differentiation of CRTISO, LCYB1 and LCYE in cultivated carrot suggest that balancing selection may have targeted genes acting centrally in the pathway.Our results are consistent with relaxed constraints on downstream genes and selection targeting the central enzymes of the carotenoid biosynthesis pathway during carrot breeding history.

  6. Maternal vernalization and vernalization-pathway genes influence progeny seed germination.

    Science.gov (United States)

    Auge, Gabriela A; Blair, Logan K; Neville, Hannah; Donohue, Kathleen

    2017-10-01

    Different life stages frequently respond to the same environmental cue to regulate development so that each life stage is matched to its appropriate season. We investigated how independently each life stage can respond to shared environmental cues, focusing on vernalization, in Arabidopsis thaliana plants. We first tested whether effects of rosette vernalization persisted to influence seed germination. To test whether genes in the vernalization flowering pathway also influence germination, we assessed germination of functional and nonfunctional alleles of these genes and measured their level of expression at different life stages in response to rosette vernalization. Rosette vernalization increased seed germination in diverse ecotypes. Genes in the vernalization flowering pathway also influenced seed germination. In the Columbia accession, functional alleles of most of these genes opposed the germination response observed in the ecotypes. Some genes influenced germination in a manner consistent with their known effects on FLOWERING LOCUS C gene regulation during the transition to flowering. Others did not, suggesting functional divergence across life stages. Despite persistent effects of environmental conditions across life stages, and despite pleiotropy of genes that affect both flowering and germination, the function of these genes can differ across life stages, potentially mitigating pleiotropic constraints and enabling independent environmental regulation of different life stages. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  7. Transcriptional profiles of SHH pathway genes in keratocystic odontogenic tumor and ameloblastoma.

    Science.gov (United States)

    Gurgel, Clarissa Araújo Silva; Buim, Marcilei Eliza Cavichiolli; Carvalho, Kátia Cândido; Sales, Caroline Brandi Schlaepfer; Reis, Mitermayer Galvão; de Souza, Renata Oliveira; de Faro Valverde, Ludmila; de Azevedo, Roberto Almeida; Dos Santos, Jean Nunes; Soares, Fernando Augusto; Ramos, Eduardo Antônio Gonçalves

    2014-09-01

    Sonic hedgehog (SHH) pathway activation has been identified as a key factor in the development of many types of tumors, including odontogenic tumors. Our study examined the expression of genes in the SHH pathway to characterize their roles in the pathogenesis of keratocystic odontogenic tumors (KOT) and ameloblastomas (AB). We quantified the expression of SHH, SMO, PTCH1, SUFU, GLI1, CCND1, and BCL2 genes by qPCR in a total of 23 KOT, 11 AB, and three non-neoplastic oral mucosa (NNM). We also measured the expression of proteins related to this pathway (CCND1 and BCL2) by immunohistochemistry. We observed overexpression of SMO, PTCH1, GLI1, and CCND1 genes in both KOT (23/23) and AB (11/11). However, we did not detect expression of the SHH gene in 21/23 KOT and 10/11 AB tumors. Low levels of the SUFU gene were expressed in KOT (P = 0.0199) and AB (P = 0.0127) relative to the NNM. Recurrent KOT exhibited high levels of SMO (P = 0.035), PTCH1 (P = 0.048), CCND1 (P = 0.048), and BCL2 (P = 0.045) transcripts. Using immunolabeling of CCND1, we observed no statistical difference between primary and recurrent KOT (P = 0.8815), sporadic and NBCCS-KOT (P = 0.7688), and unicystic and solid AB (P = 0.7521). Overexpression of upstream (PTCH1 and SMO) and downstream (GLI1, CCND1 and BCL2) genes in the SHH pathway leads to the constitutive activation of this pathway in KOT and AB and may suggest a mechanism for the development of these types of tumors. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  8. Macrophage Gene Expression Associated with Remodeling of the Prepartum Rat Cervix: Microarray and Pathway Analyses

    Science.gov (United States)

    Dobyns, Abigail E.; Goyal, Ravi; Carpenter, Lauren Grisham; Freeman, Tom C.; Longo, Lawrence D.; Yellon, Steven M.

    2015-01-01

    As the critical gatekeeper for birth, prepartum remodeling of the cervix is associated with increased resident macrophages (Mφ), proinflammatory processes, and extracellular matrix degradation. This study tested the hypothesis that expression of genes unique to Mφs characterizes the prepartum from unremodeled nonpregnant cervix. Perfused cervix from prepartum day 21 postbreeding (D21) or nonpregnant (NP) rats, with or without Mφs, had RNA extracted and whole genome microarray analysis performed. By subtractive analyses, expression of 194 and 120 genes related to Mφs in the cervix from D21 rats were increased and decreased, respectively. In both D21 and NP groups, 158 and 57 Mφ genes were also more or less up- or down-regulated, respectively. Mφ gene expression patterns were most strongly correlated within groups and in 5 major clustering patterns. In the cervix from D21 rats, functional categories and canonical pathways of increased expression by Mφ gene related to extracellular matrix, cell proliferation, differentiation, as well as cell signaling. Pathways were characteristic of inflammation and wound healing, e.g., CD163, CD206, and CCR2. Signatures of only inflammation pathways, e.g., CSF1R, EMR1, and MMP12 were common to both D21 and NP groups. Thus, a novel and complex balance of Mφ genes and clusters differentiated the degraded extracellular matrix and cellular genomic activities in the cervix before birth from the unremodeled state. Predicted Mφ activities, pathways, and networks raise the possibility that expression patterns of specific genes characterize and promote prepartum remodeling of the cervix for parturition at term and with preterm labor. PMID:25811906

  9. Challenging a dogma: co-mutations exist in MAPK pathway genes in colorectal cancer.

    Science.gov (United States)

    Grellety, Thomas; Gros, Audrey; Pedeutour, Florence; Merlio, Jean-Philippe; Duranton-Tanneur, Valerie; Italiano, Antoine; Soubeyran, Isabelle

    2016-10-01

    Sequencing of genes encoding mitogen-activated protein kinase (MAPK) pathway proteins in colorectal cancer (CRC) has established as dogma that of the genes in a pathway only a single one is ever mutated. We searched for cases with a mutation in more than one MAPK pathway gene (co-mutations). Tumor tissue samples of all patients presenting with CRC, and referred between 01/01/2008 and 01/06/2015 to three French cancer centers for determination of mutation status of RAS/RAF+/-PIK3CA, were retrospectively screened for co-mutations using Sanger sequencing or next-generation sequencing. We found that of 1791 colorectal patients with mutations in the MAPK pathway, 20 had a co-mutation, 8 of KRAS/NRAS, and some even with a third mutation. More than half of the mutations were in codons 12 and 13. We also found 3 cases with a co-mutation of NRAS/BRAF and 9 with a co-mutation of KRAS/BRAF. In 2 patients with a co-mutation of KRAS/NRAS, the co-mutation existed in the primary as well as in a metastasis, which suggests that co-mutations occur early during carcinogenesis and are maintained when a tumor disseminates. We conclude that co-mutations exist in the MAPK genes but with low frequency and as yet with unknown outcome implications.

  10. Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

    DEFF Research Database (Denmark)

    Thomassen, Mads; Tan, Qihua; Kruse, Torben

    2008-01-01

    ABSTRACT: BACKGROUND: Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent stud...

  11. Characterization of phenylpropanoid pathway genes within European maize (Zea mays L.) inbreds

    DEFF Research Database (Denmark)

    Andersen, Jeppe Reitan; Zein, Imad; Wenzel, Gerhard

    2008-01-01

    genomic fragments of six putative phenylpropanoid pathway genes in a panel of elite European inbred lines of maize (Zea mays L.) contrasting in forage quality traits. Six loci, encoding C4H, 4CL1, 4CL2, C3H, F5H, and CAD, displayed different levels of nucleotide diversity and linkage disequilibrium (LD...

  12. Identification of alleles of carotenoid pathway genes important for zeaxanthin accumulation in potato tubers

    NARCIS (Netherlands)

    Wolters, A.M.A.; Uitdewilligen, J.G.A.M.L.; Kloosterman, B.A.; Hutten, R.C.B.; Visser, R.G.F.; Eck, van H.J.

    2010-01-01

    We have investigated the genetics and molecular biology of orange flesh colour in potato (Solanum tuberosum L.). To this end the natural diversity in three genes of the carotenoid pathway was assessed by SNP analyses. Association analysis was performed between SNP haplotypes and flesh colour

  13. Identifying the Gene Signatures from Gene-Pathway Bipartite Network Guarantees the Robust Model Performance on Predicting the Cancer Prognosis

    Directory of Open Access Journals (Sweden)

    Li He

    2014-01-01

    Full Text Available For the purpose of improving the prediction of cancer prognosis in the clinical researches, various algorithms have been developed to construct the predictive models with the gene signatures detected by DNA microarrays. Due to the heterogeneity of the clinical samples, the list of differentially expressed genes (DEGs generated by the statistical methods or the machine learning algorithms often involves a number of false positive genes, which are not associated with the phenotypic differences between the compared clinical conditions, and subsequently impacts the reliability of the predictive models. In this study, we proposed a strategy, which combined the statistical algorithm with the gene-pathway bipartite networks, to generate the reliable lists of cancer-related DEGs and constructed the models by using support vector machine for predicting the prognosis of three types of cancers, namely, breast cancer, acute myeloma leukemia, and glioblastoma. Our results demonstrated that, combined with the gene-pathway bipartite networks, our proposed strategy can efficiently generate the reliable cancer-related DEG lists for constructing the predictive models. In addition, the model performance in the swap analysis was similar to that in the original analysis, indicating the robustness of the models in predicting the cancer outcomes.

  14. The hedgehog pathway gene shifted functions together with the hmgcr-dependent isoprenoid biosynthetic pathway to orchestrate germ cell migration.

    Directory of Open Access Journals (Sweden)

    Girish Deshpande

    Full Text Available The Drosophila embryonic gonad is assembled from two distinct cell types, the Primordial Germ Cells (PGCs and the Somatic Gonadal Precursor cells (SGPs. The PGCs form at the posterior of blastoderm stage embryos and are subsequently carried inside the embryo during gastrulation. To reach the SGPs, the PGCs must traverse the midgut wall and then migrate through the mesoderm. A combination of local repulsive cues and attractive signals emanating from the SGPs guide migration. We have investigated the role of the hedgehog (hh pathway gene shifted (shf in directing PGC migration. shf encodes a secreted protein that facilitates the long distance transmission of Hh through the proteoglycan matrix after it is released from basolateral membranes of Hh expressing cells in the wing imaginal disc. shf is expressed in the gonadal mesoderm, and loss- and gain-of-function experiments demonstrate that it is required for PGC migration. Previous studies have established that the hmgcr-dependent isoprenoid biosynthetic pathway plays a pivotal role in generating the PGC attractant both by the SGPs and by other tissues when hmgcr is ectopically expressed. We show that production of this PGC attractant depends upon shf as well as a second hh pathway gene gγ1. Further linking the PGC attractant to Hh, we present evidence indicating that ectopic expression of hmgcr in the nervous system promotes the release/transmission of the Hh ligand from these cells into and through the underlying mesodermal cell layer, where Hh can contact migrating PGCs. Finally, potentiation of Hh by hmgcr appears to depend upon cholesterol modification.

  15. Signalling pathways involved in adult heart formation revealed by gene expression profiling in Drosophila.

    Directory of Open Access Journals (Sweden)

    Bruno Zeitouni

    2007-10-01

    Full Text Available Drosophila provides a powerful system for defining the complex genetic programs that drive organogenesis. Under control of the steroid hormone ecdysone, the adult heart in Drosophila forms during metamorphosis by a remodelling of the larval cardiac organ. Here, we evaluated the extent to which transcriptional signatures revealed by genomic approaches can provide new insights into the molecular pathways that underlie heart organogenesis. Whole-genome expression profiling at eight successive time-points covering adult heart formation revealed a highly dynamic temporal map of gene expression through 13 transcript clusters with distinct expression kinetics. A functional atlas of the transcriptome profile strikingly points to the genomic transcriptional response of the ecdysone cascade, and a sharp regulation of key components belonging to a few evolutionarily conserved signalling pathways. A reverse genetic analysis provided evidence that these specific signalling pathways are involved in discrete steps of adult heart formation. In particular, the Wnt signalling pathway is shown to participate in inflow tract and cardiomyocyte differentiation, while activation of the PDGF-VEGF pathway is required for cardiac valve formation. Thus, a detailed temporal map of gene expression can reveal signalling pathways responsible for specific developmental programs and provides here substantial grasp into heart formation.

  16. Vitamin D Pathway Status and the Identification of Target Genes in the Mouse Mammary Gland

    Science.gov (United States)

    2014-11-01

    breast cancer stem cells with oncolytic herpes simplex virus. Cancer Gene Therapy 2012;19(10):707-14. June 21, 2012 – Poster Presentation – Presented...AD_________________ Award Number: W81XWH-11-1-0152 TITLE: Vitamin D Pathway Status and the Identification of Target Genes in the Mouse Mammary... Identification of Target Genes in the 5b. GRANT NUMBER W81XWH-11-1-0152 Mouse Mammary Gland 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT

  17. Dysregulation of gene expression within the peroxisome proliferator activated receptor pathway in morbidly obese patients.

    Science.gov (United States)

    Hindle, A Katharine; Koury, Jadd; McCaffrey, Tim; Fu, Sidney W; Brody, Fred

    2009-06-01

    The causes of obesity are multifactorial but may include dysregulation of a family of related genes, such as the peroxisome proliferator activated receptor gamma (PPARgamma). When activated, the PPARgamma pathway promotes lipid metabolism. This study used microarray technology to evaluate differential gene expression profiles in obese patients undergoing bariatric surgery. The study enrolled six morbidly obese patients with a body mass index (BMI) exceeding 35 and four nonobese individuals. Blood samples were stabilized in PaxGene tubes (PreAnalytiX), and total RNA was extracted. Next, 100 ng of total RNA was amplified and labeled using the Ovation RNA Amplification System V2 with the Ovation whole-blood reagent (NuGen) before it was hybridized to an Affymetrix (Santa Clara, CA) focus array containing more than 8,500 verified genes. The data were analyzed using an analysis of variance (ANOVA) (p < 0.05) in the GeneSpring program, and potential pathways were identified with the Ingenuity program. Real-time quantitative reverse transcriptase-polymerase chain reaction was used to validate the array data. A total of 97 upregulated genes and 125 downregulated genes were identified. More than a 1.5-fold change was identified between the morbidly obese patients and the control subjects for a cluster of dysregulated genes involving pathways regulating cell metabolism and lipid formation. Specifically, the PPARgamma pathway showed a plethora of dysregulated genes including tumor necrosis factor-alpha (TNFalpha). In morbidly obese patients, TNFalpha expression was increased (upregulated) 1.6-fold. These findings were confirmed using quantitative polymerase chain reaction with a 2.8-fold change. Microarrays are a powerful tool for identifying biomarkers indicating morbid obesity by analyzing differential gene expression profiles. This study confirms the association of PPARgamma with morbid obesity. Also, these findings in blood support previous work documented in tissue

  18. Functional pathway analysis of genes associated with response to treatment for chronic hepatitis C.

    Science.gov (United States)

    Birerdinc, A; Afendy, A; Stepanova, M; Younossi, I; Manyam, G; Baranova, A; Younossi, Z M

    2010-10-01

    Chronic hepatitis C (CH-C) is among the most common causes of chronic liver disease. Approximately 50% of patients with CH-C treated with pegylated interferon-α and ribavirin (PEG-IFN-α + RBV) achieve a sustained virological response (SVR). Several factors such as genotype 1, African American (AA) race, obesity and the absence of an early virological response (EVR) are associated with low SVR. This study elucidates molecular pathways deregulated in patients with CH-C with negative predictors of response to antiviral therapy. Sixty-eight patients with CH-C who underwent a full course of treatment with PEG-IFN-α + RBV were included in the study. Pretreatment blood samples were collected in PAXgene™ RNA tubes. EVR, complete EVR (cEVR), and SVR rates were 76%, 57% and 41%, respectively. Total RNA was extracted from pretreatment peripheral blood mononuclear cells, quantified and used for one-step RT-PCR to profile 154 mRNAs. The expression of mRNAs was normalized with six 'housekeeping' genes. Differentially expressed genes were separated into up and downregulated gene lists according to the presence or absence of a risk factor and subjected to KEGG Pathway Painter which allows high-throughput visualization of the pathway-specific changes in expression profiles. The genes were consolidated into the networks associated with known predictors of response. Before treatment, various genes associated with core components of the JAK/STAT pathway were activated in the cohorts least likely to achieve SVR. Genes related to focal adhesion and TGF-β pathways were activated in some patients with negative predictors of response. Pathway-centred analysis of gene expression profiles from treated patients with CH-C points to the Janus kinase-signal transducers and activators of transcription signalling cascade as the major pathogenetic component responsible for not achieving SVR. In addition, focal adhesion and TGF-β pathways are associated with some predictors of response.

  19. Mining rare associations between biological ontologies.

    Science.gov (United States)

    Benites, Fernando; Simon, Svenja; Sapozhnikova, Elena

    2014-01-01

    The constantly increasing volume and complexity of available biological data requires new methods for their management and analysis. An important challenge is the integration of information from different sources in order to discover possible hidden relations between already known data. In this paper we introduce a data mining approach which relates biological ontologies by mining cross and intra-ontology pairwise generalized association rules. Its advantage is sensitivity to rare associations, for these are important for biologists. We propose a new class of interestingness measures designed for hierarchically organized rules. These measures allow one to select the most important rules and to take into account rare cases. They favor rules with an actual interestingness value that exceeds the expected value. The latter is calculated taking into account the parent rule. We demonstrate this approach by applying it to the analysis of data from Gene Ontology and GPCR databases. Our objective is to discover interesting relations between two different ontologies or parts of a single ontology. The association rules that are thus discovered can provide the user with new knowledge about underlying biological processes or help improve annotation consistency. The obtained results show that produced rules represent meaningful and quite reliable associations.

  20. Mining rare associations between biological ontologies.

    Directory of Open Access Journals (Sweden)

    Fernando Benites

    Full Text Available The constantly increasing volume and complexity of available biological data requires new methods for their management and analysis. An important challenge is the integration of information from different sources in order to discover possible hidden relations between already known data. In this paper we introduce a data mining approach which relates biological ontologies by mining cross and intra-ontology pairwise generalized association rules. Its advantage is sensitivity to rare associations, for these are important for biologists. We propose a new class of interestingness measures designed for hierarchically organized rules. These measures allow one to select the most important rules and to take into account rare cases. They favor rules with an actual interestingness value that exceeds the expected value. The latter is calculated taking into account the parent rule. We demonstrate this approach by applying it to the analysis of data from Gene Ontology and GPCR databases. Our objective is to discover interesting relations between two different ontologies or parts of a single ontology. The association rules that are thus discovered can provide the user with new knowledge about underlying biological processes or help improve annotation consistency. The obtained results show that produced rules represent meaningful and quite reliable associations.

  1. Agrobacterium Mediated Transient Gene Silencing (AMTS) in Stevia rebaudiana: Insights into Steviol Glycoside Biosynthesis Pathway

    Science.gov (United States)

    Guleria, Praveen; Yadav, Sudesh Kumar

    2013-01-01

    Background Steviol glycoside biosynthesis pathway has emerged as bifurcation from ent-kaurenoic acid, substrate of methyl erythritol phosphate pathway that also leads to gibberellin biosynthesis. However, the genetic regulation of steviol glycoside biosynthesis has not been studied. So, in present study RNA interference (RNAi) based Agrobacterium mediated transient gene silencing (AMTS) approach was followed. SrKA13H and three SrUGTs (SrUGT85C2, SrUGT74G1 and SrUGT76G1) genes encoding ent-kaurenoic acid-13 hydroxylase and three UDP glycosyltransferases of steviol glycoside biosynthesis pathway were silenced in Stevia rebaudiana to understand its molecular mechanism and association with gibberellins. Methodology/Principal Findings RNAi mediated AMTS of SrKA13H and three SrUGTs has significantly reduced the expression of targeted endogenous genes as well as total steviol glycoside accumulation. While gibberellins (GA3) content was significantly enhanced on AMTS of SrUGT85C2 and SrKA13H. Silencing of SrKA13H and SrUGT85C2 was found to block the metabolite flux of steviol glycoside pathway and shifted it towards GA3 biosynthesis. Further, molecular docking of three SrUGT proteins has documented highest affinity of SrUGT76G1 for the substrates of alternate pathways synthesizing steviol glycosides. This could be a plausible reason for maximum reduction in steviol glycoside content on silencing of SrUGT76G1 than other genes. Conclusions SrKA13H and SrUGT85C2 were identified as regulatory genes influencing carbon flux between steviol glycoside and gibberellin biosynthesis. This study has also documented the existence of alternate steviol glycoside biosynthesis route. PMID:24023961

  2. Agrobacterium mediated transient gene silencing (AMTS in Stevia rebaudiana: insights into steviol glycoside biosynthesis pathway.

    Directory of Open Access Journals (Sweden)

    Praveen Guleria

    Full Text Available Steviol glycoside biosynthesis pathway has emerged as bifurcation from ent-kaurenoic acid, substrate of methyl erythritol phosphate pathway that also leads to gibberellin biosynthesis. However, the genetic regulation of steviol glycoside biosynthesis has not been studied. So, in present study RNA interference (RNAi based Agrobacterium mediated transient gene silencing (AMTS approach was followed. SrKA13H and three SrUGTs (SrUGT85C2, SrUGT74G1 and SrUGT76G1 genes encoding ent-kaurenoic acid-13 hydroxylase and three UDP glycosyltransferases of steviol glycoside biosynthesis pathway were silenced in Stevia rebaudiana to understand its molecular mechanism and association with gibberellins.RNAi mediated AMTS of SrKA13H and three SrUGTs has significantly reduced the expression of targeted endogenous genes as well as total steviol glycoside accumulation. While gibberellins (GA3 content was significantly enhanced on AMTS of SrUGT85C2 and SrKA13H. Silencing of SrKA13H and SrUGT85C2 was found to block the metabolite flux of steviol glycoside pathway and shifted it towards GA3 biosynthesis. Further, molecular docking of three SrUGT proteins has documented highest affinity of SrUGT76G1 for the substrates of alternate pathways synthesizing steviol glycosides. This could be a plausible reason for maximum reduction in steviol glycoside content on silencing of SrUGT76G1 than other genes.SrKA13H and SrUGT85C2 were identified as regulatory genes influencing carbon flux between steviol glycoside and gibberellin biosynthesis. This study has also documented the existence of alternate steviol glycoside biosynthesis route.

  3. Radiation hybrid mapping of genes in the lithium-sensitive wnt signaling pathway.

    Science.gov (United States)

    Rhoads, A R; Karkera, J D; Detera-Wadleigh, S D

    1999-09-01

    Lithium, an effective drug in the treatment of bipolar disorder, has been proposed to disrupt the Wnt signaling pathway. To facilitate analysis of the possible involvement of elements of the Wnt pathway in human bipolar disorder, a high resolution radiation hybrid mapping (RHM) of these genes was performed. A fine physical location has been obtained for Wnt 7A, frizzled 3, 4 and 5, dishevelled 1, 2 and 3, GSK3beta, axin, alpha-catenin, the Armadillo repeat-containing genes (delta-catenin and ARVCF), and a frizzled-like protein (frpHE) using the Stanford Human Genome Center (SHGC) G3 panel. Most of these genes were previously mapped by fluorescence in situ hybridization (FISH). Frizzled 4, axin and frpHE did not have a previous chromosomal assignment and were linked by RHM to chromosome markers, SHGC-35131 at 11q22.1, NIB1488 at 16p13.3 and D7S2919 at 7p15.2, respectively. Interestingly, some of these genes were found to map within potential regions underlying susceptibility to bipolar disorder and schizophrenia as well as disorders of neurodevelopmental origin. This alternative approach of establishing the precise location of selected genetic components of a candidate pathway and determining if they map within previously defined susceptibility loci should help to identify plausible candidate genes that warrant further analysis through association and mutational scanning.

  4. DeMO: An Ontology for Discrete-event Modeling and Simulation

    Science.gov (United States)

    Silver, Gregory A; Miller, John A; Hybinette, Maria; Baramidze, Gregory; York, William S

    2011-01-01

    Several fields have created ontologies for their subdomains. For example, the biological sciences have developed extensive ontologies such as the Gene Ontology, which is considered a great success. Ontologies could provide similar advantages to the Modeling and Simulation community. They provide a way to establish common vocabularies and capture knowledge about a particular domain with community-wide agreement. Ontologies can support significantly improved (semantic) search and browsing, integration of heterogeneous information sources, and improved knowledge discovery capabilities. This paper discusses the design and development of an ontology for Modeling and Simulation called the Discrete-event Modeling Ontology (DeMO), and it presents prototype applications that demonstrate various uses and benefits that such an ontology may provide to the Modeling and Simulation community. PMID:22919114

  5. Association Study between Folate Pathway Gene Single Nucleotide Polymorphisms and Gastric Cancer in Koreans

    Directory of Open Access Journals (Sweden)

    Jae-Young Yoo

    2012-09-01

    Full Text Available Gastric cancer is ranked as the most common cancer in Koreans. A recent molecular biological study about the folate pathway gene revealed the correlation with a couple of cancer types. In the folate pathway, several genes are involved, including methylenetetrahydrofolate reductase (MTHFR, methyltetrahydrofolate-homocysteine methyltransferase reductase (MTRR, and methyltetrahydrofolate-homocysteine methyltransferase (MTR. The MTHFR gene has been reported several times for the correlation with gastric cancer risk. However, the association of the MTRR or MTR gene has not been reported to date. In this study, we investigated the association between the single nucleotide polymorphisms (SNPs of the MTHFR, MTRR, and MTR genes and the risk of gastric cancer in Koreans. To identify the genetic association with gastric cancer, we selected 17 SNPs sites in folate pathway-associated genes of MTHFR, MTR, and MTRR and tested in 1,261 gastric cancer patients and 375 healthy controls. By genotype analysis, estimating odds ratios and 95% confidence intervals (CI, rs1801394 in the MTRR gene showed increased risk for gastric cacner, with statistical significance both in the codominant model (odds ratio [OR], 1.39; 95% CI, 1.04 to 1.85 and dominant model (OR, 1.34; 95% CI, 1.02 to 1.75. Especially, in the obese group (body mass index ≥ 25 kg/m2, the codominant (OR, 9.08; 95% CI, 1.01 to 94.59 and recessive model (OR, 3.72; 95% CI, 0.92 to 16.59 showed dramatically increased risk (p < 0.05. In conclusion, rs1801394 in the MTRR gene is associated with gastric cancer risk, and its functional significance need to be validated.

  6. Prioritising lexical patterns to increase axiomatisation in biomedical ontologies. The role of localisation and modularity.

    Science.gov (United States)

    Quesada-Martínez, M; Fernández-Breis, J T; Stevens, R; Mikroyannidi, E

    2015-01-01

    This article is part of the Focus Theme of METHODS of Information in Medicine on "Managing Interoperability and Complexity in Health Systems". In previous work, we have defined methods for the extraction of lexical patterns from labels as an initial step towards semi-automatic ontology enrichment methods. Our previous findings revealed that many biomedical ontologies could benefit from enrichment methods using lexical patterns as a starting point.Here, we aim to identify which lexical patterns are appropriate for ontology enrichment, driving its analysis by metrics to prioritised the patterns. We propose metrics for suggesting which lexical regularities should be the starting point to enrich complex ontologies. Our method determines the relevance of a lexical pattern by measuring its locality in the ontology, that is, the distance between the classes associated with the pattern, and the distribution of the pattern in a certain module of the ontology. The methods have been applied to four significant biomedical ontologies including the Gene Ontology and SNOMED CT. The metrics provide information about the engineering of the ontologies and the relevance of the patterns. Our method enables the suggestion of links between classes that are not made explicit in the ontology. We propose a prioritisation of the lexical patterns found in the analysed ontologies. The locality and distribution of lexical patterns offer insights into the further engineering of the ontology. Developers can use this information to improve the axiomatisation of their ontologies.

  7. Annotating the human genome with Disease Ontology

    Science.gov (United States)

    Osborne, John D; Flatow, Jared; Holko, Michelle; Lin, Simon M; Kibbe, Warren A; Zhu, Lihua (Julie); Danila, Maria I; Feng, Gang; Chisholm, Rex L

    2009-01-01

    Background The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. PMID:19594883

  8. Identification of Key Pathways and Genes in the Dynamic Progression of HCC Based on WGCNA.

    Science.gov (United States)

    Yin, Li; Cai, Zhihui; Zhu, Baoan; Xu, Cunshuan

    2018-02-14

    Hepatocellular carcinoma (HCC) is a devastating disease worldwide. Though many efforts have been made to elucidate the process of HCC, its molecular mechanisms of development remain elusive due to its complexity. To explore the stepwise carcinogenic process from pre-neoplastic lesions to the end stage of HCC, we employed weighted gene co-expression network analysis (WGCNA) which has been proved to be an effective method in many diseases to detect co-expressed modules and hub genes using eight pathological stages including normal, cirrhosis without HCC, cirrhosis, low-grade dysplastic, high-grade dysplastic, very early and early, advanced HCC and very advanced HCC. Among the eight consecutive pathological stages, five representative modules are selected to perform canonical pathway enrichment and upstream regulator analysis by using ingenuity pathway analysis (IPA) software. We found that cell cycle related biological processes were activated at four neoplastic stages, and the degree of activation of the cell cycle corresponded to the deterioration degree of HCC. The orange and yellow modules enriched in energy metabolism, especially oxidative metabolism, and the expression value of the genes decreased only at four neoplastic stages. The brown module, enriched in protein ubiquitination and ephrin receptor signaling pathways, correlated mainly with the very early stage of HCC. The darkred module, enriched in hepatic fibrosis/hepatic stellate cell activation, correlated with the cirrhotic stage only. The high degree hub genes were identified based on the protein-protein interaction (PPI) network and were verified by Kaplan-Meier survival analysis. The novel five high degree hub genes signature that was identified in our study may shed light on future prognostic and therapeutic approaches. Our study brings a new perspective to the understanding of the key pathways and genes in the dynamic changes of HCC progression. These findings shed light on further investigations.

  9. Identification of target genes of the p16INK4A-pRB-E2F pathway

    DEFF Research Database (Denmark)

    Vernell, Richard; Helin, Kristian; Müller, Heiko

    2003-01-01

    as physiological targets of the pRB pathway, and the further characterization of these genes should provide insights into how this pathway controls proliferation. We show that Gibbs sampling detects enrichment of several sequence motifs, including E2F consensus binding sites, in the upstream regions of these genes...

  10. Comprehensive analysis of schizophrenia-associated loci highlights ion channel pathways and biologically plausible candidate causal genes

    DEFF Research Database (Denmark)

    Pers, Tune H; Timshel, Pascal; Ripke, Stephan

    2016-01-01

    Over 100 associated genetic loci have been robustly associated with schizophrenia. Gene prioritization and pathway analysis have focused on a priori hypotheses and thus may have been unduly influenced by prior assumptions and missed important causal genes and pathways. Using a data-driven approac...

  11. Preterm birth in Caucasians is associated with coagulation and inflammation pathway gene variants.

    Directory of Open Access Journals (Sweden)

    Digna R Velez

    Full Text Available Spontaneous preterm birth (<37 weeks gestation-PTB occurs in approximately 12% of pregnancies in the United States, and is the largest contributor to neonatal morbidity and mortality. PTB is a complex disease, potentially induced by several etiologic factors from multiple pathophysiologic pathways. To dissect the genetic risk factors of PTB a large-scale high-throughput candidate gene association study was performed examining 1536 SNP in 130 candidate genes from hypothesized PTB pathways. Maternal and fetal DNA from 370 US Caucasian birth-events (172 cases and 198 controls was examined. Single locus, haplotype, and multi-locus association analyses were performed separately on maternal and fetal data. For maternal data the strongest associations were found in genes in the complement-coagulation pathway related to decidual hemorrhage in PTB. In this pathway 3 of 6 genes examined had SNPs significantly associated with PTB. These include factor V (FV that was previously associated with PTB, factor VII (FVII, and tissue plasminogen activator (tPA. The single strongest effect was observed in tPA marker rs879293 with a significant allelic (p = 2.30x10(-3 and genotypic association (p = 2.0x10(-6 with PTB. The odds ratio (OR for this SNP was 2.80 [CI 1.77-4.44] for a recessive model. Given that 6 of 8 markers in tPA were statistically significant, sliding window haplotype analyses were performed and revealed an associating 4 marker haplotype in tPA (p = 6.00x10(-3. The single strongest effect in fetal DNA was observed in the inflammatory pathway at rs17121510 in the interleukin-10 receptor antagonist (IL-10RA gene for allele (p = 0.01 and genotype (p = 3.34x10(-4. The OR for the IL-10RA genotypic additive model was 1.92 [CI 1.15-3.19] (p = 2.00x10(-3. Finally, exploratory multi-locus analyses in the complement and coagulation pathway were performed and revealed a potentially significant interaction between a marker in FV (rs2187952 and FVII (rs3211719 (p

  12. OpenDMAP: An open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression

    Directory of Open Access Journals (Sweden)

    Johnson Helen L

    2008-01-01

    Full Text Available Abstract Background Information extraction (IE efforts are widely acknowledged to be important in harnessing the rapid advance of biomedical knowledge, particularly in areas where important factual information is published in a diverse literature. Here we report on the design, implementation and several evaluations of OpenDMAP, an ontology-driven, integrated concept analysis system. It significantly advances the state of the art in information extraction by leveraging knowledge in ontological resources, integrating diverse text processing applications, and using an expanded pattern language that allows the mixing of syntactic and semantic elements and variable ordering. Results OpenDMAP information extraction systems were produced for extracting protein transport assertions (transport, protein-protein interaction assertions (interaction and assertions that a gene is expressed in a cell type (expression. Evaluations were performed on each system, resulting in F-scores ranging from .26 – .72 (precision .39 – .85, recall .16 – .85. Additionally, each of these systems was run over all abstracts in MEDLINE, producing a total of 72,460 transport instances, 265,795 interaction instances and 176,153 expression instances. Conclusion OpenDMAP advances the performance standards for extracting protein-protein interaction predications from the full texts of biomedical research articles. Furthermore, this level of performance appears to generalize to other information extraction tasks, including extracting information about predicates of more than two arguments. The output of the information extraction system is always constructed from elements of an ontology, ensuring that the knowledge representation is grounded with respect to a carefully constructed model of reality. The results of these efforts can be used to increase the efficiency of manual curation efforts and to provide additional features in systems that integrate multiple sources for

  13. Gene expression in the lignin biosynthesis pathway during soybean seed development.

    Science.gov (United States)

    Baldoni, A; Von Pinho, E V R; Fernandes, J S; Abreu, V M; Carvalho, M L M

    2013-02-28

    The study of gene expression in plants is fundamental, and understanding the molecular mechanisms involved in important biological processes, such as biochemical pathways or signaling that are used or manipulated in improvement programs, are key for the production of high-quality soybean seeds. Reports related to gene expression of lignin in seeds are scarce in the literature. We studied the expression of the phenylalanine ammonia-lyase (PAL), cinnamate 4-hydroxylase, 4-hydroxycinnamate 3-hydroxylase, and cinnamyl alcohol dehydrogenase genes involved in lignin biosynthesis during the development of soybean (Glycine max L. Merrill) seeds. As the endogenous control, the eukaryotic elongation factor 1-beta gene was used in two biological replicates performed in triplicate. Relative quantitative expression of these genes during the R4, R5, R6, and R7 development stages was analyzed. Real-time polymerase chain reaction was used for the gene expression study. The analyses were carried out in an ABI PRISM 7500 thermocycler using the comparative Ct method and SYBR Green to detect amplification. The seed samples at the R4 stage were chosen as calibrators. Increased expression of the cinnamate-4-hydroxylase and PAL genes occurred in soybean seeds at the R5 and R6 development stages. The cinnamyl alcohol dehydrogenase gene was expressed during the final development phases of soybean seeds. In low-lignin soybean cultivars, the higher expression of the PAL gene occurs at development stages R6 and R7. Activation of the genes involved in the lignin biosynthesis pathway occurs at the beginning of soybean seed development.

  14. Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process

    International Nuclear Information System (INIS)

    Chandran, Uma R; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A

    2007-01-01

    Prostate cancer is characterized by heterogeneity in the clinical course that often does not correlate with morphologic features of the tumor. Metastasis reflects the most adverse outcome of prostate cancer, and to date there are no reliable morphologic features or serum biomarkers that can reliably predict which patients are at higher risk of developing metastatic disease. Understanding the differences in the biology of metastatic and organ confined primary tumors is essential for developing new prognostic markers and therapeutic targets. Using Affymetrix oligonucleotide arrays, we analyzed gene expression profiles of 24 androgen-ablation resistant metastatic samples obtained from 4 patients and a previously published dataset of 64 primary prostate tumor samples. Differential gene expression was analyzed after removing potentially uninformative stromal genes, addressing the differences in cellular content between primary and metastatic tumors. The metastatic samples are highly heterogenous in expression; however, differential expression analysis shows that 415 genes are upregulated and 364 genes are downregulated at least 2 fold in every patient with metastasis. The expression profile of metastatic samples reveals changes in expression of a unique set of genes representing both the androgen ablation related pathways and other metastasis related gene networks such as cell adhesion, bone remodelling and cell cycle. The differentially expressed genes include metabolic enzymes, transcription factors such as Forkhead Box M1 (FoxM1) and cell adhesion molecules such as Osteopontin (SPP1). We hypothesize that these genes have a role in the biology of metastatic disease and that they represent potential therapeutic targets for prostate cancer

  15. Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

    Science.gov (United States)

    Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

    2018-01-01

    Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia and the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1

  16. Gestation Related Gene Expression of the Endocannabinoid Pathway in Rat Placenta

    Directory of Open Access Journals (Sweden)

    Kanchan Vaswani

    2015-01-01

    Full Text Available Mammalian placentation is a vital facet of the development of a healthy and viable offspring. Throughout gestation the placenta changes to accommodate, provide for, and meet the demands of a growing fetus. Gestational gene expression is a crucial part of placenta development. The endocannabinoid pathway is activated in the placenta and decidual tissues throughout pregnancy and aberrant endocannabinoid signaling during the period of placental development has been associated with pregnancy disorders. In this study, the gene expression of eight endocannabinoid system enzymes was investigated throughout gestation. Rat placentae were obtained at E14.25, E15.25, E17.25, and E20, RNA was extracted, and microarray was performed. Gene expression of enzymes Faah, Mgll, Plcd4, Pld1, Nat1, Daglα, and Ptgs2 was studied (cohort 1, microarray. Biological replication of the results was performed by qPCR (cohort 2. Four genes showed differential expression (Mgll, Plcd4, Ptgs2, and Pld1, from mid to late gestation. Genes positively associated with gestational age were Ptgs2, Mgll, and Pld1, while Plcd4 was downregulated. This is the first comprehensive study that has investigated endocannabinoid pathway gene expression during rat pregnancy. This study provides the framework for future studies that investigate the role of endocannabinoid system during pregnancy.

  17. PANDA: pathway and annotation explorer for visualizing and interpreting gene-centric data.

    Science.gov (United States)

    Hart, Steven N; Moore, Raymond M; Zimmermann, Michael T; Oliver, Gavin R; Egan, Jan B; Bryce, Alan H; Kocher, Jean-Pierre A

    2015-01-01

    Objective. Bringing together genomics, transcriptomics, proteomics, and other -omics technologies is an important step towards developing highly personalized medicine. However, instrumentation has advances far beyond expectations and now we are able to generate data faster than it can be interpreted. Materials and Methods. We have developed PANDA (Pathway AND Annotation) Explorer, a visualization tool that integrates gene-level annotation in the context of biological pathways to help interpret complex data from disparate sources. PANDA is a web-based application that displays data in the context of well-studied pathways like KEGG, BioCarta, and PharmGKB. PANDA represents data/annotations as icons in the graph while maintaining the other data elements (i.e., other columns for the table of annotations). Custom pathways from underrepresented diseases can be imported when existing data sources are inadequate. PANDA also allows sharing annotations among collaborators. Results. In our first use case, we show how easy it is to view supplemental data from a manuscript in the context of a user's own data. Another use-case is provided describing how PANDA was leveraged to design a treatment strategy from the somatic variants found in the tumor of a patient with metastatic sarcomatoid renal cell carcinoma. Conclusion. PANDA facilitates the interpretation of gene-centric annotations by visually integrating this information with context of biological pathways. The application can be downloaded or used directly from our website: http://bioinformaticstools.mayo.edu/research/panda-viewer/.

  18. High-throughput sequencing reveals key genes and immune homeostatic pathways activated in myeloid dendritic cells by Porphyromonas gingivalis 381 and its fimbrial mutants.

    Science.gov (United States)

    Arjunan, P; El-Awady, A; Dannebaum, R O; Kunde-Ramamoorthy, G; Cutler, C W

    2016-02-01

    The human microbiome consists of highly diverse microbial communities that colonize our skin and mucosal surfaces, aiding in maintenance of immune homeostasis. The keystone pathogen Porphyromonas gingivalis induces a dysbiosis and disrupts immune homeostasis through as yet unclear mechanisms. The fimbrial adhesins of P. gingivalis facilitate biofilm formation, invasion of and dissemination by blood dendritic cells; hence, fimbriae may be key factors in disruption of immune homeostasis. In this study we employed RNA-sequencing transcriptome profiling to identify differentially expressed genes (DEGs) in human monocyte-derived dendritic cells (MoDCs) in response to in vitro infection/exposure by Pg381 or its isogenic mutant strains that solely express minor-Mfa1 fimbriae (DPG3), major-FimA fimbriae (MFI) or are deficient in both fimbriae (MFB) relative to uninfected control. Our results yielded a total of 479 DEGs that were at least two-fold upregulated and downregulated in MoDCs significantly (P ≤ 0.05) by all four strains and certain DEGs that were strain-specific. Interestingly, the gene ontology biological and functional analysis shows that the upregulated genes in DPG3-induced MoDCs were more significant than other strains and associated with inflammation, immune response, anti-apoptosis, cell proliferation, and other homeostatic functions. Both transcriptome and quantitative polymerase chain reaction results show that DPG3, which solely expresses Mfa1, increased ZNF366, CD209, LOX1, IDO1, IL-10, CCL2, SOCS3, STAT3 and FOXO1 gene expression. In conclusion, we have identified key DC-mediated immune homeostatic pathways that could contribute to dysbiosis in periodontal infection with P. gingivalis. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  19. Ontology: ambiguity and accuracy

    Directory of Open Access Journals (Sweden)

    Marcelo Schiessl

    2012-08-01

    Full Text Available Ambiguity is a major obstacle to information retrieval. It is source of several researches in Information Science. Ontologies have been studied in order to solve problems related to ambiguities. Paradoxically, “ontology” term is also ambiguous and it is understood according to the use by the community. Philosophy and Computer Science seems to have the most accentuated difference related to the term sense. The former holds undisputed tradition and authority. The latter, in despite of being quite recent, holds an informal sense, but pragmatic. Information Science acts ranging from philosophical to computational approaches so as to get organized collections based on balance between users’ necessities and available information. The semantic web requires informational cycle automation and demands studies related to ontologies. Consequently, revisiting relevant approaches for the study of ontologies plays a relevant role as a way to provide useful ideas to researchers maintaining philosophical rigor, and convenience provided by computers.

  20. Ontological engineering versus metaphysics

    Science.gov (United States)

    Tataj, Emanuel; Tomanek, Roman; Mulawka, Jan

    2011-10-01

    It has been recognized that ontologies are a semantic version of world wide web and can be found in knowledge-based systems. A recent time survey of this field also suggest that practical artificial intelligence systems may be motivated by this research. Especially strong artificial intelligence as well as concept of homo computer can also benefit from their use. The main objective of this contribution is to present and review already created ontologies and identify the main advantages which derive such approach for knowledge management systems. We would like to present what ontological engineering borrows from metaphysics and what a feedback it can provide to natural language processing, simulations and modelling. The potential topics of further development from philosophical point of view is also underlined.

  1. C1-Pathways in Methyloversatilis universalis FAM5: Genome Wide Gene Expression and Mutagenesis Studies

    Directory of Open Access Journals (Sweden)

    Nathan M. Good

    2015-04-01

    Full Text Available Methyloversatilis universalis FAM5 utilizes single carbon compounds such as methanol or methylamine as a sole source of carbon and energy. Expression profiling reveals distinct sets of genes altered during growth on methylamine vs methanol. As expected, all genes for the N-methylglutamate pathway were induced during growth on methylamine. Among other functions responding to the aminated source of C1-carbon, are a heme-containing amine dehydrogenase (Qhp, a distant homologue of formaldehyde activating enzyme (Fae3, molybdenum-containing formate dehydrogenase, ferredoxin reductase, a set of homologues to urea/ammonium transporters and amino-acid permeases. Mutants lacking one of the functional subunits of the amine dehydrogenase (ΔqhpA or Δfae3 showed no growth defect on C1-compounds. M. universalis FAM5 strains with a lesion in the H4-folate pathway were not able to use any C1-compound, methanol or methylamine. Genes essential for C1-assimilation (the serine cycle and glyoxylate shunt and H4MTP-pathway for formaldehyde oxidation showed similar levels of expression on both C1-carbon sources. M. universalis FAM5 possesses three homologs of the formaldehyde activating enzyme, a key enzyme of the H4MTP-pathway. Strains lacking the canonical Fae (fae1 lost the ability to grow on both C1-compounds. However, upon incubation on methylamine the fae1-mutant produced revertants (Δfae1R, which regained the ability to grow on methylamine. Double and triple mutants (Δfae1RΔfae3, or Δfae1RΔfae2 or Δfae1RΔfae2Δfae3 constructed in the revertant strain background showed growth similar to the Δfae1R phenotype. The metabolic pathways for utilization of methanol and methylamine in Methyloversatilis universalis FAM5 are reconstructed based on these gene expression and phenotypic data.

  2. Gene expression profiling and pathway analysis of human bronchial epithelial cells exposed to airborne particulate matter collected from Saudi Arabia

    International Nuclear Information System (INIS)

    Sun, Hong; Shamy, Magdy; Kluz, Thomas; Muñoz, Alexandra B.; Zhong, Mianhua; Laulicht, Freda; Alghamdi, Mansour A.; Khoder, Mamdouh I.; Chen, Lung-Chi; Costa, Max

    2012-01-01

    Epidemiological studies have established a positive correlation between human mortality and increased concentration of airborne particulate matters (PM). However, the mechanisms underlying PM related human diseases, as well as the molecules and pathways mediating the cellular response to PM, are not fully understood. This study aims to investigate the global gene expression changes in human cells exposed to PM 10 and to identify genes and pathways that may contribute to PM related adverse health effects. Human bronchial epithelial cells were exposed to PM 10 collected from Saudi Arabia for 1 or 4 days, and whole transcript expression was profiled using the GeneChip human gene 1.0 ST array. A total of 140 and 230 genes were identified that significantly changed more than 1.5 fold after PM 10 exposure for 1 or 4 days, respectively. Ingenuity Pathway Analysis revealed that different exposure durations triggered distinct pathways. Genes involved in NRF2-mediated response to oxidative stress were up-regulated after 1 day exposure. In contrast, cells exposed for 4 days exhibited significant changes in genes related to cholesterol and lipid synthesis pathways. These observed changes in cellular oxidative stress and lipid synthesis might contribute to PM related respiratory and cardiovascular disease. -- Highlights: ► PM exposure modulated gene expression and associated pathways in BEAS-2B cells. ► One-day exposure to PM induced genes involved in responding to oxidative stress. ► 4-day exposure to PM changed genes associated to cholesterol and lipid synthesis.

  3. A Method for Building Personalized Ontology Summaries

    OpenAIRE

    Queiroz-Sousa, Paulo Orlando; Salgado, Ana Carolina; Pires, Carlos Eduardo

    2013-01-01

    In the context of ontology engineering, the ontology understanding is the basis for its further developmentand reuse. One intuitive eective approach to support ontology understanding is the process of ontology summarizationwhich highlights the most important concepts of an ontology. Ontology summarization identies an excerpt from anontology that contains the most relevant concepts and produces an abridged ontology. In this article, we present amethod for summarizing ontologies that represent ...

  4. Ontology and medical diagnosis.

    Science.gov (United States)

    Bertaud-Gounot, Valérie; Duvauferrier, Régis; Burgun, Anita

    2012-03-01

    Ontology and associated generic tools are appropriate for knowledge modeling and reasoning, but most of the time, disease definitions in existing description logic (DL) ontology are not sufficient to classify patient's characteristics under a particular disease because they do not formalize operational definitions of diseases (association of signs and symptoms=diagnostic criteria). The main objective of this study is to propose an ontological representation which takes into account the diagnostic criteria on which specific patient conditions may be classified under a specific disease. This method needs as a prerequisite a clear list of necessary and sufficient diagnostic criteria as defined for lots of diseases by learned societies. It does not include probability/uncertainty which Web Ontology Language (OWL 2.0) cannot handle. We illustrate it with spondyloarthritis (SpA). Ontology has been designed in Protégé 4.1 OWL-DL2.0. Several kinds of criteria were formalized: (1) mandatory criteria, (2) picking two criteria among several diagnostic criteria, (3) numeric criteria. Thirty real patient cases were successfully classified with the reasoner. This study shows that it is possible to represent operational definitions of diseases with OWL and successfully classify real patient cases. Representing diagnostic criteria as descriptive knowledge (instead of rules in Semantic Web Rule Language or Prolog) allows us to take advantage of tools already available for OWL. While we focused on Assessment of SpondyloArthritis international Society SpA criteria, we believe that many of the representation issues addressed here are relevant to using OWL-DL for operational definition of other diseases in ontology.

  5. Core Semantics for Public Ontologies

    National Research Council Canada - National Science Library

    Suni, Niranjan

    2005-01-01

    ... (schemas or ontologies) with respect to objects. The DARPA Agent Markup Language (DAML) through the use of ontologies provides a very powerful way to describe objects and their relationships to other objects...

  6. Genome-Wide Gene Set Analysis for Identification of Pathways Associated with Alcohol Dependence

    Science.gov (United States)

    Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.

    2013-01-01

    It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047

  7. Oxytocin Pathway Genes: Evolutionary Ancient System Impacting on Human Affiliation, Sociality, and Psychopathology.

    Science.gov (United States)

    Feldman, Ruth; Monakhov, Mikhail; Pratt, Maayan; Ebstein, Richard P

    2016-02-01

    Oxytocin (OT), a nonapeptide signaling molecule originating from an ancestral peptide, appears in different variants across all vertebrate and several invertebrate species. Throughout animal evolution, neuropeptidergic signaling has been adapted by organisms for regulating response to rapidly changing environments. The family of OT-like molecules affects both peripheral tissues implicated in reproduction, homeostasis, and energy balance, as well as neuromodulation of social behavior, stress regulation, and associative learning in species ranging from nematodes to humans. After describing the OT-signaling pathway, we review research on the three genes most extensively studied in humans: the OT receptor (OXTR), the structural gene for OT (OXT/neurophysin-I), and CD38. Consistent with the notion that sociality should be studied from the perspective of social life at the species level, we address human social functions in relation to OT-pathway genes, including parenting, empathy, and using social relationships to manage stress. We then describe associations between OT-pathway genes with psychopathologies involving social dysfunctions such as autism, depression, or schizophrenia. Human research particularly underscored the involvement of two OXTR single nucleotide polymorphisms (rs53576, rs2254298) with fewer studies focusing on other OXTR (rs7632287, rs1042778, rs2268494, rs2268490), OXT (rs2740210, rs4813627, rs4813625), and CD38 (rs3796863, rs6449197) single nucleotide polymorphisms. Overall, studies provide evidence for the involvement of OT-pathway genes in human social functions but also suggest that factors such as gender, culture, and early environment often confound attempts to replicate first findings. We conclude by discussing epigenetics, conceptual implications within an evolutionary perspective, and future directions, especially the need to refine phenotypes, carefully characterize early environments, and integrate observations of social behavior across

  8. Learning expressive ontologies

    CERN Document Server

    Völker, J

    2009-01-01

    This publication advances the state-of-the-art in ontology learning by presenting a set of novel approaches to the semi-automatic acquisition, refinement and evaluation of logically complex axiomatizations. It has been motivated by the fact that the realization of the semantic web envisioned by Tim Berners-Lee is still hampered by the lack of ontological resources, while at the same time more and more applications of semantic technologies emerge from fast-growing areas such as e-business or life sciences. Such knowledge-intensive applications, requiring large scale reasoning over complex domai

  9. ONTOLOGY IN PHARMACY

    Directory of Open Access Journals (Sweden)

    L. Yu. Babintseva

    2015-05-01

    Full Text Available It’s considered ontological models for formalization of knowledge in pharmacy. There is emphasized the view that the possibility of rapid exchange of information in the pharmaceutical industry, it is necessary to create a single information space. This means not only the establishment of uniform standards for the presentation of information on pharmaceutical groups pharmacotherapeutic classifications, but also the creation of a unified and standardized system for the transfer and renewal of knowledge. It is the organization of information in the ontology helps quickly in the future to build expert systems and applications to work with data.

  10. Integrated bioinformatic analysis unveils significant genes and pathways in the pathogenesis of supratentorial primitive neuroectodermal tumor

    Directory of Open Access Journals (Sweden)

    Wang G

    2018-04-01

    Full Text Available Guang-Yu Wang,1,* Ling Li,2,* Bo Liu,1 Xiao Han,1 Chun-Hua Wang,1 Ji-Wen Wang3 1Department of Neurosurgery, 2Department of Pediatrics, Qilu Children’s Hospital of Shandong University, Jinan, Shandong, 3Department of Neurology, Shanghai Children’s Medical Center, Shanghai Jiaotong University School of Medicine, Pudong New District, Shanghai, People’s Republic of China *These authors contributed equally to this work Purpose: This study aimed to explore significant genes and pathways involved in the pathogenesis of supratentorial primitive neuroectodermal tumor (sPNET. Materials and methods: Gene expression profile of GSE14295 was downloaded from publicly available Gene Expression Omnibus (GEO database. Differentially expressed genes (DEGs were screened out in primary sPNET samples compared with normal fetal and adult brain reference samples (sPNET vs fetal brain and sPNET vs adult brain. Pathway enrichment analysis of these DEGs was conducted, followed by protein–protein interaction (PPI network construction and significant module selection. Additionally, transcription factors (TFs regulating the common DEGs in the two comparison groups were identified, and the regulatory network was constructed. Results: In total, 526 DEGs (99 up- and 427 downregulated in sPNET vs fetal brain and 815 DEGs (200 up- and 615 downregulated in sPNET vs adult brain were identified. DEGs in sPNET vs fetal brain and sPNET vs adult brain were associated with calcium signaling pathway, cell cycle, and p53 signaling pathway. CDK1, CDC20, BUB1B, and BUB1 were hub nodes in the PPI networks of DEGs in sPNET vs fetal brain and sPNET vs adult brain. Significant modules were extracted from the PPI networks. In addition, 64 upregulated and 200 downregulated overlapping DEGs were identified in both sPNET vs fetal brain and sPNET vs adult brain. The genes involved in the regulatory network upon overlapping DEGs and the TFs were correlated with calcium signaling pathway

  11. Summarization by domain ontology navigation

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik

    2013-01-01

    of the subject. In between these two extremes, conceptual summaries encompass selected concepts derived using background knowledge. We address in this paper an approach where conceptual summaries are provided through a conceptualization as given by an ontology. The ontology guiding the summarization can...... be a simple taxonomy or a generative domain ontology. A domain ontology can be provided by a preanalysis of a domain corpus and can be used to condense improved summaries that better reflects the conceptualization of a given domain....

  12. InfAcrOnt: calculating cross-ontology term similarities using information flow by a random walk.

    Science.gov (United States)

    Cheng, Liang; Jiang, Yue; Ju, Hong; Sun, Jie; Peng, Jiajie; Zhou, Meng; Hu, Yang

    2018-01-19

    Since the establishment of the first biomedical ontology Gene Ontology (GO), the number of biomedical ontology has increased dramatically. Nowadays over 300 ontologies have been built including extensively used Disease Ontology (DO) and Human Phenotype Ontology (HPO). Because of the advantage of identifying novel relationships between terms, calculating similarity between ontology terms is one of the major tasks in this research area. Though similarities between terms within each ontology have been studied with in silico methods, term similarities across different ontologies were not investigated as deeply. The latest method took advantage of gene functional interaction network (GFIN) to explore such inter-ontology similarities of terms. However, it only used gene interactions and failed to make full use of the connectivity among gene nodes of the network. In addition, all existent methods are particularly designed for GO and their performances on the extended ontology community remain unknown. We proposed a method InfAcrOnt to infer similarities between terms across ontologies utilizing the entire GFIN. InfAcrOnt builds a term-gene-gene network which comprised ontology annotations and GFIN, and acquires similarities between terms across ontologies through modeling the information flow within the network by random walk. In our benchmark experiments on sub-ontologies of GO, InfAcrOnt achieves a high average area under the receiver operating characteristic curve (AUC) (0.9322 and 0.9309) and low standard deviations (1.8746e-6 and 3.0977e-6) in both human and yeast benchmark datasets exhibiting superior performance. Meanwhile, comparisons of InfAcrOnt results and prior knowledge on pair-wise DO-HPO terms and pair-wise DO-GO terms show high correlations. The experiment results show that InfAcrOnt significantly improves the performance of inferring similarities between terms across ontologies in benchmark set.

  13. RNA interference screen to identify pathways that enhance or reduce nonviral gene transfer during lipofection.

    Science.gov (United States)

    Barker, Gregory A; Diamond, Scott L

    2008-09-01

    Some barriers to DNA lipofection are well characterized; however, there is as yet no method of finding unknown pathways that impact the process. A druggable genome small-interfering RNA (siRNA) screen against 5,520 genes was tested for its effect on lipofection of human aortic endothelial cells (HAECs). We found 130 gene targets which, when silenced by pooled siRNAs (three siRNAs per gene), resulted in enhanced luminescence after lipofection (86 gene targets showed reduced expression). In confirmation tests with single siRNAs, 18 of the 130 hits showed enhanced lipofection with two or more individual siRNAs in the absence of cytotoxicity. Of these confirmed gene targets, we identified five leading candidates, two of which are isoforms of the regulatory subunit of protein phosphatase 2A (PP2A). The best candidate siRNA targeted the PPP2R2C gene and produced a 65% increase in luminescence from lipofection, with a quantitative PCR-validated knockdown of approximately 76%. Flow cytometric analysis confirmed that the silencing of the PPP2R2C gene resulted in an improvement of 10% in transfection efficiency, thereby demonstrating an increase in the number of transfected cells. These results show that an RNA interference (RNAi) high-throughput screen (HTS) can be applied to nonviral gene transfer. We have also demonstrated that siRNAs can be co-delivered with lipofected DNA to increase the transfection efficiency in vitro.

  14. Biomedical ontologies: toward scientific debate.

    Science.gov (United States)

    Maojo, V; Crespo, J; García-Remesal, M; de la Iglesia, D; Perez-Rey, D; Kulikowski, C

    2011-01-01

    Biomedical ontologies have been very successful in structuring knowledge for many different applications, receiving widespread praise for their utility and potential. Yet, the role of computational ontologies in scientific research, as opposed to knowledge management applications, has not been extensively discussed. We aim to stimulate further discussion on the advantages and challenges presented by biomedical ontologies from a scientific perspective. We review various aspects of biomedical ontologies going beyond their practical successes, and focus on some key scientific questions in two ways. First, we analyze and discuss current approaches to improve biomedical ontologies that are based largely on classical, Aristotelian ontological models of reality. Second, we raise various open questions about biomedical ontologies that require further research, analyzing in more detail those related to visual reasoning and spatial ontologies. We outline significant scientific issues that biomedical ontologies should consider, beyond current efforts of building practical consensus between them. For spatial ontologies, we suggest an approach for building "morphospatial" taxonomies, as an example that could stimulate research on fundamental open issues for biomedical ontologies. Analysis of a large number of problems with biomedical ontologies suggests that the field is very much open to alternative interpretations of current work, and in need of scientific debate and discussion that can lead to new ideas and research directions.

  15. Using a Foundational Ontology for Reengineering a Software Enterprise Ontology

    Science.gov (United States)

    Perini Barcellos, Monalessa; de Almeida Falbo, Ricardo

    The knowledge about software organizations is considerably relevant to software engineers. The use of a common vocabulary for representing the useful knowledge about software organizations involved in software projects is important for several reasons, such as to support knowledge reuse and to allow communication and interoperability between tools. Domain ontologies can be used to define a common vocabulary for sharing and reuse of knowledge about some domain. Foundational ontologies can be used for evaluating and re-designing domain ontologies, giving to these real-world semantics. This paper presents an evaluating of a Software Enterprise Ontology that was reengineered using the Unified Foundation Ontology (UFO) as basis.

  16. Sequence homology and expression profile of genes associated with dna repair pathways in Mycobacterium leprae

    Directory of Open Access Journals (Sweden)

    Mukul Sharma

    2017-01-01

    direct repair pathway. Conclusion: This study provided preliminary information on the potential DNA repair pathways that are extant in M. leprae and the associated genes.

  17. Sequence homology and expression profile of genes associated with DNA repair pathways in Mycobacterium leprae.

    Science.gov (United States)

    Sharma, Mukul; Vedithi, Sundeep Chaitanya; Das, Madhusmita; Roy, Anindya; Ebenezer, Mannam

    2017-01-01

    Survival of Mycobacterium leprae, the causative bacteria for leprosy, in the human host is dependent to an extent on the ways in which its genome integrity is retained. DNA repair mechanisms protect bacterial DNA from damage induced by various stress factors. The current study is aimed at understanding the sequence and functional annotation of DNA repair genes in M. leprae. T he genome of M. leprae was annotated using sequence alignment tools to identify DNA repair genes that have homologs in Mycobacterium tuberculosis and Escherichia coli. A set of 96 genes known to be involved in DNA repair mechanisms in E. coli and Mycobacteriaceae were chosen as a reference. Among these, 61 were identified in M. leprae based on sequence similarity and domain architecture. The 61 were classified into 36 characterized gene products (59%), 11 hypothetical proteins (18%), and 14 pseudogenes (23%). All these genes have homologs in M. tuberculosis and 49 (80.32%) in E. coli. A set of 12 genes which are absent in E. coli were present in M. leprae and in Mycobacteriaceae. These 61 genes were further investigated for their expression profiles in the whole transcriptome microarray data of M. leprae which was obtained from the signal intensities of 60bp probes, tiling the entire genome with 10bp overlaps. It was noted that transcripts corresponding to all the 61 genes were identified in the transcriptome data with varying expression levels ranging from 0.18 to 2.47 fold (normalized with 16SrRNA). The mRNA expression levels of a representative set of seven genes ( four annotated and three hypothetical protein coding genes) were analyzed using quantitative Polymerase Chain Reaction (qPCR) assays with RNA extracted from skin biopsies of 10 newly diagnosed, untreated leprosy cases. It was noted that RNA expression levels were higher for genes involved in homologous recombination whereas the genes with a low level of expression are involved in the direct repair pathway. This study provided

  18. Piper betle Induced Cytoprotective Genes and Proteins via the Nrf2/ARE Pathway in Aging Mice.

    Science.gov (United States)

    Aliahmat, Nor Syahida; Abdul Sani, Nur Fathiah; Wan Hasan, Wan Nuraini; Makpol, Suzana; Wan Ngah, Wan Zurinah; Mohd Yusof, Yasmin Anum

    2016-01-01

    The objective of this study was to elucidate the underlying antioxidant mechanism of aqueous extract of Piper betle (PB) in aging rats. The nuclear factor erythroid 2-related factor 2 (Nrf2)/ARE pathway involving phase II detoxifying and antioxidant enzymes plays an important role in the antioxidant system by reducing electrophiles and reactive oxygen species through induction of phase II enzymes and proteins. Genes and proteins of phase II detoxifying antioxidant enzymes were analyzed by QuantiGenePlex 2.0 Assay and Western blot analysis. PB significantly induced genes and proteins of phase II and antioxidant enzymes, NAD(P)H quinone oxidoreductase 1, and catalase in aging mice (p < 0.05). The expression of these enzymes were stimulated via translocation of Nrf2 into the nucleus, indicating the involvement of ARE, a cis-acting motif located in the promoter region of nearly all phase II genes. PB was testified for the first time to induce cytoprotective genes through the Nrf2/ARE signaling pathway, thus unraveling the antioxidant mechanism of PB during the aging process. © 2016 S. Karger AG, Basel.

  19. Newborn serum retinoic acid level is associated with variants of genes in the retinol metabolism pathway.

    Science.gov (United States)

    Manolescu, Daniel C; El-Kares, Reyhan; Lakhal-Chaieb, Lajmi; Montpetit, Alexandre; Bhat, Pangala V; Goodyer, Paul

    2010-06-01

    Retinoic acid (RA) is a critical regulator of gene expression during embryonic development. In rodents, moderate maternal vitamin A deficiency leads to subtle morphogenetic defects and inactivation of RA pathway genes causes major disturbances of embryogenesis. In this study, we quantified RA in umbilical cord blood of 145 healthy full-term Caucasian infants from Montreal. Sixty seven percent of values were ROL). However, we found that the (A) allele of the rs12591551 single nucleotide polymorphism (SNP) in the ALDH1A2 gene (ALDH1A2rs12591551(A)), occurring in 19% of newborns, was associated with 2.5-fold higher serum RA levels. ALDH1A2 encodes retinaldehyde dehydrogenase (RALDH) 2, which synthesizes RA in fetal tissues. We also found that homozygosity for the (A) allele of the rs12724719 SNP in the CRABP2 gene (CRABP2rs12724719(A/A)) was associated with 4.4-fold increase in umbilical cord serum RA. CRABP2 facilitates RA binding to its cognate receptor complex and transfer to the nucleus. We hypothesize that individual variation in RA pathway genes may account for subtle variations in RA-dependent human embryogenesis.

  20. Chromosomal Aberrations in Canine Gliomas Define Candidate Genes and Common Pathways in Dogs and Humans

    Science.gov (United States)

    York, Dan; Higgins, Robert J.; LeCouteur, Richard A.; Joshi, Nikhil; Bannasch, Danika

    2016-01-01

    Spontaneous gliomas in dogs occur at a frequency similar to that in humans and may provide a translational model for therapeutic development and comparative biological investigations. Copy number alterations in 38 canine gliomas, including diffuse astrocytomas, glioblastomas, oligodendrogliomas, and mixed oligoastrocytomas, were defined using an Illumina 170K single nucleotide polymorphism array. Highly recurrent alterations were seen in up to 85% of some tumor types, most notably involving chromosomes 13, 22, and 38, and gliomas clustered into 2 major groups consisting of high-grade IV astrocytomas, or oligodendrogliomas and other tumors. Tumor types were characterized by specific broad and focal chromosomal events including focal loss of the INK4A/B locus in glioblastoma and loss of the RB1 gene and amplification of the PDGFRA gene in oligodendrogliomas. Genes associated with the 3 critical pathways in human high-grade gliomas (TP53, RB1, and RTK/RAS/PI3K) were frequently associated with canine aberrations. Analysis of oligodendrogliomas revealed regions of chromosomal losses syntenic to human 1p involving tumor suppressor genes, such as CDKN2C, as well as genes associated with apoptosis, autophagy, and response to chemotherapy and radiation. Analysis of high frequency chromosomal aberrations with respect to human orthologues may provide insight into both novel and common pathways in gliomagenesis and response to therapy. PMID:27251041

  1. PathwaySplice: An R package for unbiased pathway analysis of alternative splicing in RNA-Seq data.

    Science.gov (United States)

    Yan, Aimin; Ban, Yuguang; Gao, Zhen; Chen, Xi; Wang, Lily

    2018-04-24

    Pathway analysis of alternative splicing would be biased without accounting for the different number of exons or junctions associated with each gene, because genes with higher number of exons or junctions are more likely to be included in the "significant" gene list in alternative splicing. We present PathwaySplice, an R package that (1) Performs pathway analysis that explicitly adjusts for the number of exons or junctions associated with each gene; (2) Visualizes selection bias due to different number of exons or junctions for each gene and formally tests for presence of bias using logistic regression; (3) Supports gene sets based on the Gene Ontology terms, as well as more broadly defined gene sets (e.g. MSigDB) or user defined gene sets; (4) Identifies the significant genes driving pathway significance and (5) Organizes significant pathways with an enrichment map, where pathways with large number of overlapping genes are grouped together in a network graph. https://bioconductor.org/packages/release/bioc/html/PathwaySplice.html. lily.wangg@gmail.com, xi.steven.chen@gmail.com.

  2. The design ontology

    DEFF Research Database (Denmark)

    Storga, Mario; Andreasen, Mogens Myrup; Marjanovic, Dorian

    2010-01-01

    The article presents the research of the nature, building and practical role of a Design Ontology as a potential framework for the more efficient product development (PD) data-, information- and knowledge- description, -explanation, -understanding and -reusing. In the methodology for development ...

  3. Dahlbeck and Pure Ontology

    Science.gov (United States)

    Mackenzie, Jim

    2016-01-01

    This article responds to Johan Dahlbeck's "Towards a pure ontology: Children's bodies and morality" ["Educational Philosophy and Theory," vol. 46 (1), 2014, pp. 8-23 (EJ1026561)]. His arguments from Nietzsche and Spinoza do not carry the weight he supposes, and the conclusions he draws from them about pedagogy would be…

  4. Audit Validation Using Ontologies

    Directory of Open Access Journals (Sweden)

    Ion IVAN

    2015-01-01

    Full Text Available Requirements to increase quality audit processes in enterprises are defined. It substantiates the need for assessment and management audit processes using ontologies. Sets of rules, ways to assess the consistency of rules and behavior within the organization are defined. Using ontologies are obtained qualifications that assess the organization's audit. Elaboration of the audit reports is a perfect algorithm-based activity characterized by generality, determinism, reproducibility, accuracy and a well-established. The auditors obtain effective levels. Through ontologies obtain the audit calculated level. Because the audit report is qualitative structure of information and knowledge it is very hard to analyze and interpret by different groups of users (shareholders, managers or stakeholders. Developing ontology for audit reports validation will be a useful instrument for both auditors and report users. In this paper we propose an instrument for validation of audit reports contain a lot of keywords that calculates indicators, a lot of indicators for each key word there is an indicator, qualitative levels; interpreter who builds a table of indicators, levels of actual and calculated levels.

  5. Biomedicine: an ontological dissection.

    Science.gov (United States)

    Baronov, David

    2008-01-01

    Though ubiquitous across the medical social sciences literature, the term "biomedicine" as an analytical concept remains remarkably slippery. It is argued here that this imprecision is due in part to the fact that biomedicine is comprised of three interrelated ontological spheres, each of which frames biomedicine as a distinct subject of investigation. This suggests that, depending upon one's ontological commitment, the meaning of biomedicine will shift. From an empirical perspective, biomedicine takes on the appearance of a scientific enterprise and is defined as a derivative category of Western science more generally. From an interpretive perspective, biomedicine represents a symbolic-cultural expression whose adherence to the principles of scientific objectivity conceals an ideological agenda. From a conceptual perspective, biomedicine represents an expression of social power that reflects structures of power and privilege within capitalist society. No one perspective exists in isolation and so the image of biomedicine from any one presents an incomplete understanding. It is the mutually-conditioning interrelations between these ontological spheres that account for biomedicine's ongoing development. Thus, the ontological dissection of biomedicine that follows, with particular emphasis on the period of its formal crystallization in the latter nineteenth and early twentieth century, is intended to deepen our understanding of biomedicine as an analytical concept across the medical social sciences literature.

  6. Gene networks and toxicity pathways induced by acute cadmium exposure in adult largemouth bass (Micropterus salmoides).

    Science.gov (United States)

    Mehinto, Alvine C; Prucha, Melinda S; Colli-Dula, Reyna C; Kroll, Kevin J; Lavelle, Candice M; Barber, David S; Vulpe, Christopher D; Denslow, Nancy D

    2014-07-01

    Cadmium is a heavy metal that can accumulate to toxic levels in the environment leading to detrimental effects in animals and humans including kidney, liver and lung injuries. Using a transcriptomics approach, genes and cellular pathways affected by a low dose of cadmium were investigated. Adult largemouth bass were intraperitoneally injected with 20μg/kg of cadmium chloride (mean exposure level - 2.6μg of cadmium per fish) and microarray analyses were conducted in the liver and testis 48h after injection. Transcriptomic profiles identified in response to cadmium exposure were tissue-specific with the most differential expression changes found in the liver tissues, which also contained much higher levels of cadmium than the testis. Acute exposure to a low dose of cadmium induced oxidative stress response and oxidative damage pathways in the liver. The mRNA levels of antioxidants such as catalase increased and numerous transcripts related to DNA damage and DNA repair were significantly altered. Hepatic mRNA levels of metallothionein, a molecular marker of metal exposure, did not increase significantly after 48h exposure. Carbohydrate metabolic pathways were also disrupted with hepatic transcripts such as UDP-glucose, pyrophosphorylase 2, and sorbitol dehydrogenase highly induced. Both tissues exhibited a disruption of steroid signaling pathways. In the testis, estrogen receptor beta and transcripts linked to cholesterol metabolism were suppressed. On the contrary, genes involved in cholesterol metabolism were highly increased in the liver including genes encoding for the rate limiting steroidogenic acute regulatory protein and the catalytic enzyme 7-dehydrocholesterol reductase. Integration of the transcriptomic data using functional enrichment analyses revealed a number of enriched gene networks associated with previously reported adverse outcomes of cadmium exposure such as liver toxicity and impaired reproduction. Copyright © 2014 Elsevier B.V. All rights

  7. Annotating gene sets by mining large literature collections with protein networks.

    Science.gov (United States)

    Wang, Sheng; Ma, Jianzhu; Yu, Michael Ku; Zheng, Fan; Huang, Edward W; Han, Jiawei; Peng, Jian; Ideker, Trey

    2018-01-01

    Analysis of patient genomes and transcriptomes routinely recognizes new gene sets associated with human disease. Here we present an integrative natural language processing system which infers common functions for a gene set through automatic mining of the scientific literature with biological networks. This system links genes with associated literature phrases and combines these links with protein interactions in a single heterogeneous network. Multiscale functional annotations are inferred based on network distances between phrases and genes and then visualized as an ontology of biological concepts. To evaluate this system, we predict functions for gene sets representing known pathways and find that our approach achieves substantial improvement over the conventional text-mining baseline method. Moreover, our system discovers novel annotations for gene sets or pathways without previously known functions. Two case studies demonstrate how the system is used in discovery of new cancer-related pathways with ontological annotations.

  8. Association of Polymorphisms in BDNF, MTHFR, and Genes Involved in the Dopaminergic Pathway with Memory in a Healthy Chinese Population

    Science.gov (United States)

    Yeh, Ting-Kuang; Hu, Chung-Yi; Yeh, Ting-Chi; Lin, Pei-Jung; Wu, Chung-Hsin; Lee, Po-Lei; Chang, Chun-Yen

    2012-01-01

    The contribution of genetic factors to the memory is widely acknowledged. Research suggests that these factors include genes involved in the dopaminergic pathway, as well as the genes for brain-derived neurotrophic factor (BDNF) and methylenetetrahydrofolate reductase (MTHFR). The activity of the products of these genes is affected by single…

  9. Characterization, expression, and mutation of the Lactococcus lactis galPMKTE genes, involved in galactose utilization via the Leloir pathway

    NARCIS (Netherlands)

    Groossiord, B.P.; Luesink, E.J.; Vaughan, E.E.; Arnaud, A.; Vos, de W.M.

    2003-01-01

    A cluster containing five similarly oriented genes involved in the metabolism of galactose via the Leloir pathway in Lactococcus lactis subsp. cremoris MG1363 was cloned and characterized. The order of the genes is galPMKTE, and these genes encode a galactose permease (GalP), an aldose I-epimerase

  10. Distinct Calcium Signaling Pathways Regulate Calmodulin Gene Expression in Tobacco1

    Science.gov (United States)

    van der Luit, Arnold H.; Olivari, Claudio; Haley, Ann; Knight, Marc R.; Trewavas, Anthony J.

    1999-01-01

    Cold shock and wind stimuli initiate Ca2+ transients in transgenic tobacco (Nicotiana plumbaginifolia) seedlings (named MAQ 2.4) containing cytoplasmic aequorin. To investigate whether these stimuli initiate Ca2+ pathways that are spatially distinct, stress-induced nuclear and cytoplasmic Ca2+ transients and the expression of a stress-induced calmodulin gene were compared. Tobacco seedlings were transformed with a construct that encodes a fusion protein between nucleoplasmin (a major oocyte nuclear protein) and aequorin. Immunocytochemical evidence indicated targeting of the fusion protein to the nucleus in these plants, which were named MAQ 7.11. Comparison between MAQ 7.11 and MAQ 2.4 seedlings confirmed that wind stimuli and cold shock invoke separate Ca2+ signaling pathways. Partial cDNAs encoding two tobacco calmodulin genes, NpCaM-1 and NpCaM-2, were identified and shown to have distinct nucleotide sequences that encode identical polypeptides. Expression of NpCaM-1, but not NpCaM-2, responded to wind and cold shock stimulation. Comparison of the Ca2+ dynamics with NpCaM-1 expression after stimulation suggested that wind-induced NpCaM-1 expression is regulated by a Ca2+ signaling pathway operational predominantly in the nucleus. In contrast, expression of NpCaM-1 in response to cold shock is regulated by a pathway operational predominantly in the cytoplasm. PMID:10557218

  11. Scaling the Drosophila Wing: TOR-Dependent Target Gene Access by the Hippo Pathway Transducer Yorkie.

    Science.gov (United States)

    Parker, Joseph; Struhl, Gary

    2015-10-01

    Organ growth is controlled by patterning signals that operate locally (e.g., Wingless/Ints [Wnts], Bone Morphogenetic Proteins [BMPs], and Hedgehogs [Hhs]) and scaled by nutrient-dependent signals that act systemically (e.g., Insulin-like peptides [ILPs] transduced by the Target of Rapamycin [TOR] pathway). How cells integrate these distinct inputs to generate organs of the appropriate size and shape is largely unknown. The transcriptional coactivator Yorkie (Yki, a YES-Associated Protein, or YAP) acts downstream of patterning morphogens and other tissue-intrinsic signals to promote organ growth. Yki activity is regulated primarily by the Warts/Hippo (Wts/Hpo) tumour suppressor pathway, which impedes nuclear access of Yki by a cytoplasmic tethering mechanism. Here, we show that the TOR pathway regulates Yki by a separate and novel mechanism in the Drosophila wing. Instead of controlling Yki nuclear access, TOR signaling governs Yki action after it reaches the nucleus by allowing it to gain access to its target genes. When TOR activity is inhibited, Yki accumulates in the nucleus but is sequestered from its normal growth-promoting target genes--a phenomenon we term "nuclear seclusion." Hence, we posit that in addition to its well-known role in stimulating cellular metabolism in response to nutrients, TOR also promotes wing growth by liberating Yki from nuclear seclusion, a parallel pathway that we propose contributes to the scaling of wing size with nutrient availability.

  12. Decoding the contribution of dopaminergic genes and pathways to autism spectrum disorder (ASD).

    Science.gov (United States)

    Nguyen, Michael; Roth, Andrew; Kyzar, Evan J; Poudel, Manoj K; Wong, Keith; Stewart, Adam Michael; Kalueff, Allan V

    2014-01-01

    Autism spectrum disorder (ASD) is a debilitating brain illness causing social deficits, delayed development and repetitive behaviors. ASD is a heritable neurodevelopmental disorder with poorly understood and complex etiology. The central dopaminergic system is strongly implicated in ASD pathogenesis. Genes encoding various elements of this system (including dopamine receptors, the dopamine transporter or enzymes of synthesis and catabolism) have been linked to ASD. Here, we comprehensively evaluate known molecular interactors of dopaminergic genes, and identify their potential molecular partners within up/down-steam signaling pathways associated with dopamine. These in silico analyses allowed us to construct a map of molecular pathways, regulated by dopamine and involved in ASD. Clustering these pathways reveals groups of genes associated with dopamine metabolism, encoding proteins that control dopamine neurotransmission, cytoskeletal processes, synaptic release, Ca(2+) signaling, as well as the adenosine, glutamatergic and gamma-aminobutyric systems. Overall, our analyses emphasize the important role of the dopaminergic system in ASD, and implicate several cellular signaling processes in its pathogenesis. Copyright © 2014 Elsevier Ltd. All rights reserved.

  13. Deciphering a unique biotin scavenging pathway with redundant genes in the probiotic bacterium Lactococcus lactis.

    Science.gov (United States)

    Zhang, Huimin; Wang, Qingjing; Fisher, Derek J; Cai, Mingzhu; Chakravartty, Vandana; Ye, Huiyan; Li, Ping; Solbiati, Jose O; Feng, Youjun

    2016-05-10

    Biotin protein ligase (BPL) is widespread in the three domains of the life. The paradigm BPL is the Escherichia coli BirA protein, which also functions as a repressor for the biotin biosynthesis pathway. Here we report that Lactococcus lactis possesses two different orthologues of birA (birA1_LL and birA2_LL). Unlike the scenario in E. coli, L. lactis appears to be auxotrophic for biotin in that it lacks a full biotin biosynthesis pathway. In contrast, it retains two biotin transporter-encoding genes (bioY1_LL and bioY2_LL), suggesting the use of a scavenging strategy to obtain biotin from the environment. The in vivo function of the two L. lactis birA genes was judged by their abilities to complement the conditional lethal E. coli birA mutant. Thin-layer chromatography and mass spectroscopy assays demonstrated that these two recombinant BirA proteins catalyze the biotinylation reaction of the acceptor biotin carboxyl carrier protein (BCCP), through the expected biotinoyl-AMP intermediate. Gel shift assays were used to characterize bioY1_LL and BirA1_LL. We also determined the ability to uptake (3)H-biotin by L. lactis. Taken together, our results deciphered a unique biotin scavenging pathway with redundant genes present in the probiotic bacterium L. lactis.

  14. Elucidation of primary metabolic pathways in Aspergillus species: orphaned research in characterizing orphan genes.

    Science.gov (United States)

    Andersen, Mikael Rørdam

    2014-11-01

    Primary metabolism affects all phenotypical traits of filamentous fungi. Particular examples include reacting to extracellular stimuli, producing precursor molecules required for cell division and morphological changes as well as providing monomer building blocks for production of secondary metabolites and extracellular enzymes. In this review, all annotated genes from four Aspergillus species have been examined. In this process, it becomes evident that 80-96% of the genes (depending on the species) are still without verified function. A significant proportion of the genes with verified metabolic functions are assigned to secondary or extracellular metabolism, leaving only 2-4% of the annotated genes within primary metabolism. It is clear that primary metabolism has not received the same attention in the post-genomic area as many other research areas--despite its role at the very centre of cellular function. However, several methods can be employed to use the metabolic networks in tandem with comparative genomics to accelerate functional assignment of genes in primary metabolism. In particular, gaps in metabolic pathways can be used to assign functions to orphan genes. In this review, applications of this from the Aspergillus genes will be examined, and it is proposed that, where feasible, this should be a standard part of functional annotation of fungal genomes. © The Author 2014. Published by Oxford University Press.

  15. Epistemology and ontology in core ontologies: FOLaw and LRI-Core, two core ontologies for law

    NARCIS (Netherlands)

    Breukers, J.A.P.J.; Hoekstra, R.J.

    2004-01-01

    For more than a decade constructing ontologies for legal domains, we, at the Leibniz Center for Law, felt really the need to develop a core ontology for law that would enable us to re-use the common denominator of the various legal domains. In this paper we present two core ontologies for law. The

  16. Altered Levels of Aroma and Volatiles by Metabolic Engineering of Shikimate Pathway Genes in Tomato Fruits

    Directory of Open Access Journals (Sweden)

    Vered Tzin

    2015-06-01

    Full Text Available The tomato (Solanum lycopersicum fruit is an excellent source of antioxidants, dietary fibers, minerals and vitamins and therefore has been referred to as a “functional food”. Ripe tomato fruits produce a large number of specialized metabolites including volatile organic compounds. These volatiles serve as key components of the tomato fruit flavor, participate in plant pathogen and herbivore defense, and are used to attract seed dispersers. A major class of specialized metabolites is derived from the shikimate pathway followed by aromatic amino acid biosynthesis of phenylalanine, tyrosine and tryptophan. We attempted to modify tomato fruit flavor by overexpressing key regulatory genes in the shikimate pathway. Bacterial genes encoding feedback-insensitive variants of 3-Deoxy-D-Arabino-Heptulosonate 7-Phosphate Synthase (DAHPS; AroG209-9 and bi-functional Chorismate Mutase/Prephenate Dehydratase (CM/PDT; PheA12 were expressed under the control of a fruit-specific promoter. We crossed these transgenes to generate tomato plants expressing both the AroG209 and PheA12 genes. Overexpression of the AroG209-9 gene had a dramatic effect on the overall metabolic profile of the fruit, including enhanced levels of multiple volatile and non-volatile metabolites. In contrast, the PheA12 overexpression line exhibited minor metabolic effects compared to the wild type fruit. Co-expression of both the AroG209-9 and PheA12 genes in tomato resulted overall in a similar metabolic effect to that of expressing only the AroG209-9 gene. However, the aroma ranking attributes of the tomato fruits from PheA12//AroG209-9 were unique and different from those of the lines expressing a single gene, suggesting a contribution of the PheA12 gene to the overall metabolic profile. We suggest that expression of bacterial genes encoding feedback-insensitive enzymes of the shikimate pathway in tomato fruits provides a useful metabolic engineering tool for the modification of

  17. The expression of petunia strigolactone pathway genes is altered as part of the endogenous developmental program

    Directory of Open Access Journals (Sweden)

    Revel S M Drummond

    2012-01-01

    Full Text Available Analysis of mutants with increased branching has revealed the strigolactone synthesis/perception pathway which regulates branching in plants. However, whether variation in this well conserved developmental signalling system contributes to the unique plant architectures of different species is yet to be determined. We examined petunia orthologues of the Arabidopsis MAX1 and MAX2 genes to characterise their role in petunia architecture. A single orthologue of MAX1, PhMAX1 which encodes a cytochrome P450, was identified and was able to complement the max1 mutant of Arabidopsis. Petunia has two copies of the MAX2 gene, PhMAX2A and PhMAX2B which encode F-Box proteins. Differences in the transcript levels of these two MAX2-like genes suggest diverging functions. Unlike PhMAX2B, PhMAX2A mRNA levels increase as leaves age. Nonetheless, this gene functionally complements the Arabidopsis max2 mutant indicating that the biochemical activity of the PhMAX2A protein is not significantly different from MAX2. The expression of the petunia strigolactone pathway genes (PhCCD7, PhCCD8, PhMAX1, PhMAX2A, and PhMAX2B was then further investigated throughout the development of wild-type petunia plants. Three of these genes showed changes in mRNA levels over the development series. Alterations to the expression of these genes over time, or in different regions of the plant, may influence the branching growth habit of the plant. Alterations to strigolactone production and/or sensitivity could allow both subtle and dramatic changes to branching within and between species.

  18. Benchmarking ontologies: bigger or better?

    Directory of Open Access Journals (Sweden)

    Lixia Yao

    2011-01-01

    Full Text Available A scientific ontology is a formal representation of knowledge within a domain, typically including central concepts, their properties, and relations. With the rise of computers and high-throughput data collection, ontologies have become essential to data mining and sharing across communities in the biomedical sciences. Powerful approaches exist for testing the internal consistency of an ontology, but not for assessing the fidelity of its domain representation. We introduce a family of metrics that describe the breadth and depth with which an ontology represents its knowledge domain. We then test these metrics using (1 four of the most common medical ontologies with respect to a corpus of medical documents and (2 seven of the most popular English thesauri with respect to three corpora that sample language from medicine, news, and novels. Here we show that our approach captures the quality of ontological representation and guides efforts to narrow the breach between ontology and collective discourse within a domain. Our results also demonstrate key features of medical ontologies, English thesauri, and discourse from different domains. Medical ontologies have a small intersection, as do English thesauri. Moreover, dialects characteristic of distinct domains vary strikingly as many of the same words are used quite differently in medicine, news, and novels. As ontologies are intended to mirror the state of knowledge, our methods to tighten the fit between ontology and domain will increase their relevance for new areas of biomedical science and improve the accuracy and power of inferences computed across them.

  19. Ontology-based Information Retrieval

    DEFF Research Database (Denmark)

    Styltsvig, Henrik Bulskov

    In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information...... retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use......, as well as how to fuse together the ideas of ontological similarity and ontological indexing into a realistic information retrieval scenario. To achieve the recognition of semantic knowledge in a text, shallow natural language processing is used during indexing that reveals knowledge to the level of noun...

  20. Use of an activated beta-catenin to identify Wnt pathway target genes in caenorhabditis elegans, including a subset of collagen genes expressed in late larval development.

    Science.gov (United States)

    Jackson, Belinda M; Abete-Luzi, Patricia; Krause, Michael W; Eisenmann, David M

    2014-04-16

    The Wnt signaling pathway plays a fundamental role during metazoan development, where it regulates diverse processes, including cell fate specification, cell migration, and stem cell renewal. Activation of the beta-catenin-dependent/canonical Wnt pathway up-regulates expression of Wnt target genes to mediate a cellular response. In the nematode Caenorhabditis elegans, a canonical Wnt signaling pathway regulates several processes during larval development; however, few target genes of this pathway have been identified. To address this deficit, we used a novel approach of conditionally activated Wnt signaling during a defined stage of larval life by overexpressing an activated beta-catenin protein, then used microarray analysis to identify genes showing altered expression compared with control animals. We identified 166 differentially expressed genes, of which 104 were up-regulated. A subset of the up-regulated genes was shown to have altered expression in mutants with decreased or increased Wnt signaling; we consider these genes to be bona fide C. elegans Wnt pathway targets. Among these was a group of six genes, including the cuticular collagen genes, bli-1 col-38, col-49, and col-71. These genes show a peak of expression in the mid L4 stage during normal development, suggesting a role in adult cuticle formation. Consistent with this finding, reduction of function for several of the genes causes phenotypes suggestive of defects in cuticle function or integrity. Therefore, this work has identified a large number of putative Wnt pathway target genes during larval life, including a small subset of Wnt-regulated collagen genes that may function in synthesis of the adult cuticle.

  1. Identification and Analysis of Jasmonate Pathway Genes in Coffea canephora (Robusta Coffee) by In Silico Approach.

    Science.gov (United States)

    Bharathi, Kosaraju; Sreenath, H L

    2017-07-01

    Coffea canephora is the commonly cultivated coffee species in the world along with Coffea arabica . Different pests and pathogens affect the production and quality of the coffee. Jasmonic acid (JA) is a plant hormone which plays an important role in plants growth, development, and defense mechanisms, particularly against insect pests. The key enzymes involved in the production of JA are lipoxygenase, allene oxide synthase, allene oxide cyclase, and 12-oxo-phytodienoic reductase. There is no report on the genes involved in JA pathway in coffee plants. We made an attempt to identify and analyze the genes coding for these enzymes in C. canephora . First, protein sequences of jasmonate pathway genes from model plant Arabidopsis thaliana were identified in the National Center for Biotechnology Information (NCBI) database. These protein sequences were used to search the web-based database Coffee Genome Hub to identify homologous protein sequences in C. canephora genome using Basic Local Alignment Search Tool (BLAST). Homologous protein sequences for key genes were identified in the C. canephora genome database. Protein sequences of the top matches were in turn used to search in NCBI database using BLAST tool to confirm the identity of the selected proteins and to identify closely related genes in species. The protein sequences from C. canephora database and the top matches in NCBI were aligned, and phylogenetic trees were constructed using MEGA6 software and identified the genetic distance of the respective genes. The study identified the four key genes of JA pathway in C. canephora , confirming the conserved nature of the pathway in coffee. The study expected to be useful to further explore the defense mechanisms of coffee plants. JA is a plant hormone that plays an important role in plant defense against insect pests. Genes coding for the 4 key enzymes involved in the production of JA viz., LOX, AOS, AOC, and OPR are identified in C. canephora (robusta coffee) by

  2. Tetra primer ARMS-PCR relates folate/homocysteine pathway genes and ACE gene polymorphism with coronary artery disease.

    Science.gov (United States)

    Masud, Rizwan; Qureshi, Irfan Zia

    2011-09-01

    Cardiovascular disorders and coronary artery disease (CAD) are significant contributors to morbidity and mortality in heart patients. As genes of the folate/homocysteine pathway have been linked with the vascular disease, we investigated association of these gene polymorphisms with CAD/myocardial infarction (MI) using the novel approach of tetraprimer ARMS-PCR. A total of 230 participants (129 MI cases, 101 normal subjects) were recruited. We genotyped rs1801133 and rs1801131 SNPs in 5'10' methylenetetrahydrofolate reductase (MTHFR), rs1805087 SNP in 5' methyltetrahydrofolate homocysteine methyltransferase (MTR), rs662 SNP in paroxanse1 (PON1), and rs5742905 polymorphism in cystathionine beta synthase (CBS). Angiotensin converting enzyme (ACE) insertion/deletion polymorphism was detected through conventional PCR. Covariates included blood pressure, fasting blood sugar, serum cholesterol, and creatinine concentrations. Our results showed allele frequencies at rs1801133, rs1801131, rs1805087 and the ACE insertion/deletion (I/D) polymorphism varied between cases and controls. Logistic regression, after adjusting for covariates, demonstrated significant associations of rs1801133 and rs1805087 with CAD in the additive, dominant, and genotype model. In contrast, ACE I/D polymorphism was significantly related with CAD where recessive model was applied. Gene-gene interaction against the disease status revealed two polymorphism groups: rs1801133, rs662, and rs1805087; and rs1801131, rs662, and ACE I/D. Only the latter interaction maintained significance after adjusted for covariates. Our study concludes that folate pathway variants exert contributory influence on susceptibility to CAD. We further suggest that tetraprimer ARMS-PCR successfully resolves the genotypes in selected samples and might prove to be a superior technique compared to the conventional approach.

  3. Ontology Design Patterns for Combining Pathology and Anatomy: Application to Study Aging and Longevity in Inbred Mouse Strains

    KAUST Repository

    Alghamdi, Sarah M.

    2018-05-13

    In biomedical research, ontologies are widely used to represent knowledge as well as to annotate datasets. Many of the existing ontologies cover a single type of phenomena, such as a process, cell type, gene, pathological entity or anatomical structure. Consequently, there is a requirement to use multiple ontologies to fully characterize the observations in the datasets. Although this allows precise annotation of different aspects of a given dataset, it limits our ability to use the ontologies in data analysis, as the ontologies are usually disconnected and their combinations cannot be exploited. Motivated by this, here we present novel ontology design methods for combining pathology and anatomy concepts. To this end, we use a dataset of mouse models which has been characterized through two ontologies: one of them is the mouse pathology ontology (MPATH) covering pathological lesions while the other is the mouse anatomy ontology (MA) covering the anatomical site of the lesions. We propose four novel ontology design patterns for combining these ontologies, and use these patterns to generate four ontologies in a data-driven way. To evaluate the generated ontologies, we utilize these in ontology-based data analysis, including ontology enrichment analysis and computation of semantic similarity. We demonstrate that there are significant differences between the four ontologies in different analysis approaches. In addition, when using semantic similarity to confirm the hypothesis that genetically identical mice should develop more similar diseases, the generated combined ontologies lead to significantly better analysis results compared to using each ontology individually. Our results reveal that using ontology design patterns to combine different facets characterizing a dataset can improve established analysis methods.

  4. Expression of carotenoid biosynthetic pathway genes and changes in carotenoids during ripening in tomato (Lycopersicon esculentum).

    Science.gov (United States)

    Namitha, Kanakapura Krishnamurthy; Archana, Surya Narayana; Negi, Pradeep Singh

    2011-04-01

    To study the expression pattern of carotenoid biosynthetic pathway genes, changes in their expression at different stages of maturity in tomato fruit (cv. Arka Ahuti) were investigated. The genes regulating carotenoid production were quantified by a dot blot method using a DIG (dioxigenin) labelling and detection kit. The results revealed that there was an increase in the levels of upstream genes of the carotenoid biosynthetic pathway such as 1-deoxy-d-xylulose-5-phosphate reductoisomerase (DXR), 4-hydroxy-3-methyl-but-2-enyl diphosphate reductase (Lyt B), phytoene synthase (PSY), phytoene desaturase (PDS) and ζ-carotene desaturase (ZDS) by 2-4 fold at the breaker stage as compared to leaf. The lycopene and β-carotene content was analyzed by HPLC at different stages of maturity. The lycopene (15.33 ± 0.24 mg per 100 g) and β-carotene (10.37 ± 0.46 mg per 100 g) content were found to be highest at 5 days post-breaker and 10 days post-breaker stage, respectively. The lycopene accumulation pattern also coincided with the color values at different stages of maturity. These studies may provide insight into devising gene-based strategies for enhancing carotenoid accumulation in tomato fruits.

  5. Study of gene expression alteration in male androgenetic alopecia: evidence of predominant molecular signalling pathways.

    Science.gov (United States)

    Michel, L; Reygagne, P; Benech, P; Jean-Louis, F; Scalvino, S; Ly Ka So, S; Hamidou, Z; Bianovici, S; Pouch, J; Ducos, B; Bonnet, M; Bensussan, A; Patatian, A; Lati, E; Wdzieczak-Bakala, J; Choulot, J-C; Loing, E; Hocquaux, M

    2017-11-01

    Male androgenetic alopecia (AGA) is the most common form of hair loss in men. It is characterized by a distinct pattern of progressive hair loss starting from the frontal area and the vertex of the scalp. Although several genetic risk loci have been identified, relevant genes for AGA remain to be defined. To identify biomarkers associated with AGA. Molecular biomarkers associated with premature AGA were identified through gene expression analysis using cDNA generated from scalp vertex biopsies of hairless or bald men with premature AGA, and healthy volunteers. This monocentric study reveals that genes encoding mast cell granule enzymes, inflammatory mediators and immunoglobulin-associated immune mediators were significantly overexpressed in AGA. In contrast, underexpressed genes appear to be associated with the Wnt/β-catenin and bone morphogenic protein/transforming growth factor-β signalling pathways. Although involvement of these pathways in hair follicle regeneration is well described, functional interpretation of the transcriptomic data highlights different events that account for their inhibition. In particular, one of these events depends on the dysregulated expression of proopiomelanocortin, as confirmed by polymerase chain reaction and immunohistochemistry. In addition, lower expression of CYP27B1 in patients with AGA supports the notion that changes in vitamin D metabolism contributes to hair loss. This study provides compelling evidence for distinct molecular events contributing to alopecia that may pave the way for new therapeutic approaches. © 2017 British Association of Dermatologists.

  6. Investigating a multigene prognostic assay based on significant pathways for Luminal A breast cancer through gene expression profile analysis.

    Science.gov (United States)

    Gao, Haiyan; Yang, Mei; Zhang, Xiaolan

    2018-04-01

    The present study aimed to investigate potential recurrence-risk biomarkers based on significant pathways for Luminal A breast cancer through gene expression profile analysis. Initially, the gene expression profiles of Luminal A breast cancer patients were downloaded from The Cancer Genome Atlas database. The differentially expressed genes (DEGs) were identified using a Limma package and the hierarchical clustering analysis was conducted for the DEGs. In addition, the functional pathways were screened using Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses and rank ratio calculation. The multigene prognostic assay was exploited based on the statistically significant pathways and its prognostic function was tested using train set and verified using the gene expression data and survival data of Luminal A breast cancer patients downloaded from the Gene Expression Omnibus. A total of 300 DEGs were identified between good and poor outcome groups, including 176 upregulated genes and 124 downregulated genes. The DEGs may be used to effectively distinguish Luminal A samples with different prognoses verified by hierarchical clustering analysis. There were 9 pathways screened as significant pathways and a total of 18 DEGs involved in these 9 pathways were identified as prognostic biomarkers. According to the survival analysis and receiver operating characteristic curve, the obtained 18-gene prognostic assay exhibited good prognostic function with high sensitivity and specificity to both the train and test samples. In conclusion the 18-gene prognostic assay including the key genes, transcription factor 7-like 2, anterior parietal cortex and lymphocyte enhancer factor-1 may provide a new method for predicting outcomes and may be conducive to the promotion of precision medicine for Luminal A breast cancer.

  7. Comparative gene expression analysis of Dtg, a novel target gene of Dpp signaling pathway in the early Drosophila melanogaster embryo.

    Science.gov (United States)

    Hodar, Christian; Zuñiga, Alejandro; Pulgar, Rodrigo; Travisany, Dante; Chacon, Carlos; Pino, Michael; Maass, Alejandro; Cambiazo, Verónica

    2014-02-10

    In the early Drosophila melanogaster embryo, Dpp, a secreted molecule that belongs to the TGF-β superfamily of growth factors, activates a set of downstream genes to subdivide the dorsal region into amnioserosa and dorsal epidermis. Here, we examined the expression pattern and transcriptional regulation of Dtg, a new target gene of Dpp signaling pathway that is required for proper amnioserosa differentiation. We showed that the expression of Dtg was controlled by Dpp and characterized a 524-bp enhancer that mediated expression in the dorsal midline, as well as, in the differentiated amnioserosa in transgenic reporter embryos. This enhancer contained a highly conserved region of 48-bp in which bioinformatic predictions and in vitro assays identified three Mad binding motifs. Mutational analysis revealed that these three motifs were necessary for proper expression of a reporter gene in transgenic embryos, suggesting that short and highly conserved genomic sequences may be indicative of functional regulatory regions in D. melanogaster genes. Dtg orthologs were not detected in basal lineages of Dipterans, which unlike D. melanogaster develop two extra-embryonic membranes, amnion and serosa, nevertheless Dtg orthologs were identified in the transcriptome of Musca domestica, in which dorsal ectoderm patterning leads to the formation of a single extra-embryonic membrane. These results suggest that Dtg was recruited as a new component of the network that controls dorsal ectoderm patterning in the lineage leading to higher Cyclorrhaphan flies, such as D. melanogaster and M. domestica. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Pathway-based factor analysis of gene expression data produces highly heritable phenotypes that associate with age.

    Science.gov (United States)

    Anand Brown, Andrew; Ding, Zhihao; Viñuela, Ana; Glass, Dan; Parts, Leopold; Spector, Tim; Winn, John; Durbin, Richard

    2015-03-09

    Statistical factor analysis methods have previously been used to remove noise components from high-dimensional data prior to genetic association mapping and, in a guided fashion, to summarize biologically relevant sources of variation. Here, we show how the derived factors summarizing pathway expression can be used to analyze the relationships between expression, heritability, and aging. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarize patterns of gene expression to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" that summarized patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold ([Formula: see text]). These phenotypes are more heritable ([Formula: see text]) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolizing sugars and fatty acids; others relate to insulin signaling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors. Copyright © 2015 Brown et al.

  9. The Cremeomycin Biosynthetic Gene Cluster Encodes a Pathway for Diazo Formation.

    Science.gov (United States)

    Waldman, Abraham J; Pechersky, Yakov; Wang, Peng; Wang, Jennifer X; Balskus, Emily P

    2015-10-12

    Diazo groups are found in a range of natural products that possess potent biological activities. Despite longstanding interest in these metabolites, diazo group biosynthesis is not well understood, in part because of difficulties in identifying specific genes linked to diazo formation. Here we describe the discovery of the gene cluster that produces the o-diazoquinone natural product cremeomycin and its heterologous expression in Streptomyces lividans. We used stable isotope feeding experiments and in vitro characterization of biosynthetic enzymes to decipher the order of events in this pathway and establish that diazo construction involves late-stage N-N bond formation. This work represents the first successful production of a diazo-containing metabolite in a heterologous host, experimentally linking a set of genes with diazo formation. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Subpathway-GM: identification of metabolic subpathways via joint power of interesting genes and metabolites and their topologies within pathways.

    Science.gov (United States)

    Li, Chunquan; Han, Junwei; Yao, Qianlan; Zou, Chendan; Xu, Yanjun; Zhang, Chunlong; Shang, Desi; Zhou, Lingyun; Zou, Chaoxia; Sun, Zeguo; Li, Jing; Zhang, Yunpeng; Yang, Haixiu; Gao, Xu; Li, Xia

    2013-05-01

    Various 'omics' technologies, including microarrays and gas chromatography mass spectrometry, can be used to identify hundreds of interesting genes, proteins and metabolites, such as differential genes, proteins and metabolites associated with diseases. Identifying metabolic pathways has become an invaluable aid to understanding the genes and metabolites associated with studying conditions. However, the classical methods used to identify pathways fail to accurately consider joint power of interesting gene/metabolite and the key regions impacted by them within metabolic pathways. In this study, we propose a powerful analytical method referred to as Subpathway-GM for the identification of metabolic subpathways. This provides a more accurate level of pathway analysis by integrating information from genes and metabolites, and their positions and cascade regions within the given pathway. We analyzed two colorectal cancer and one metastatic prostate cancer data sets and demonstrated that Subpathway-GM was able to identify disease-relevant subpathways whose corresponding entire pathways might be ignored using classical entire pathway identification methods. Further analysis indicated that the power of a joint genes/metabolites and subpathway strategy based on their topologies may play a key role in reliably recalling disease-relevant subpathways and finding novel subpathways.

  11. Signature pathways identified from gene expression profiles in the human uterine cervix before and after spontaneous term parturition

    Science.gov (United States)

    HASSAN, Sonia S.; ROMERO, Roberto; TARCA, Adi L.; DRAGHICI, Sorin; PINELES, Beth; BUGRIM, Andrej; KHALEK, Nahla; CAMACHO, Natalia; MITTAL, Pooja; YOON, Bo Hyun; ESPINOZA, Jimmy; KIM, Chong Jai; SOROKIN, Yoram; MALONE, John

    2008-01-01

    Objective This study aimed to discover ‘signature pathways’ characterizing biological processes based on genes differentially expressed in the uterine cervix before and after spontaneous labor. Study Design The cervical transcriptome was previously characterized from biopsies taken before and after term labor. Pathway analysis was used to study the differentially expressed genes based on two gene-to-pathway annotation databases (KEGG and Metacore™). Over-represented and highly impacted pathways and connectivity nodes were identified. Results Fifty-two pathways in the Metacore™ database were significantly enriched in differentially expressed genes. Three of the top 5 pathways were known to be involved in cervical remodeling.Two novel pathways were: plasmin signaling and plasminogen activator urokinase (PLAU) signaling. The same analysis in the KEGG database identified 4 significant pathways, of which impact analysis confirmed. Multiple nodes providing connectivity within the plasmin and PLAU signaling pathways were identified.. Conclusions Three strategies for pathway analysis were consistent in their identification of novel, unexpected as well as expected networks, suggesting that this approach is both valid and effective for the elucidation of biological mechanisms involved in cervical dilation and remodeling. PMID:17826407

  12. A pathway-based network analysis of hypertension-related genes

    Science.gov (United States)

    Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

    2016-02-01

    Complex network approach has become an effective way to describe interrelationships among large amounts of biological data, which is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease caused by many reasons including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. By the threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.

  13. Serine Proteolytic Pathway Activation Reveals an Expanded Ensemble of Wound Response Genes in Drosophila

    Science.gov (United States)

    Patterson, Rachel A.; Juarez, Michelle T.; Hermann, Anita; Sasik, Roman; Hardiman, Gary; McGinnis, William

    2013-01-01

    After injury to the animal epidermis, a variety of genes are transcriptionally activated in nearby cells to regenerate the missing cells and facilitate barrier repair. The range and types of diffusible wound signals that are produced by damaged epidermis and function to activate repair genes during epidermal regeneration remains a subject of very active study in many animals. In Drosophila embryos, we have discovered that serine protease function is locally activated around wound sites, and is also required for localized activation of epidermal repair genes. The serine protease trypsin is sufficient to induce a striking global epidermal wound response without inflicting cell death or compromising the integrity of the epithelial barrier. We developed a trypsin wounding treatment as an amplification tool to more fully understand the changes in the Drosophila transcriptome that occur after epidermal injury. By comparing our array results with similar results on mammalian skin wounding we can see which evolutionarily conserved pathways are activated after epidermal wounding in very diverse animals. Our innovative serine protease-mediated wounding protocol allowed us to identify 8 additional genes that are activated in epidermal cells in the immediate vicinity of puncture wounds, and the functions of many of these genes suggest novel genetic pathways that may control epidermal wound repair. Additionally, our data augments the evidence that clean puncture wounding can mount a powerful innate immune transcriptional response, with different innate immune genes being activated in an interesting variety of ways. These include puncture-induced activation only in epidermal cells in the immediate vicinity of wounds, or in all epidermal cells, or specifically in the fat body, or in multiple tissues. PMID:23637905

  14. Making memories of stressful events: a journey along epigenetic, gene transcription and signaling pathways

    Directory of Open Access Journals (Sweden)

    Johannes M.H.M. eReul

    2014-01-01

    Full Text Available Strong psychologically stressful events are known to have a long-lasting impact on behavior. The consolidation of such, largely adaptive, behavioral responses to stressful events involves changes in gene expression in limbic brain regions such as the hippocampus and amygdala. The underlying molecular mechanisms however were until recently unresolved. More than a decade ago we started to investigate the role of these hormones in signaling and epigenetic mechanisms participating in the effects of stress on gene transcription in hippocampal neurons. We discovered a novel, rapid non-genomic mechanism in which glucocorticoids via glucocorticoid receptors (GRs facilitate signaling of the ERK MAPK signaling pathway to the downstream nuclear kinases MSK1 and Elk-1 in dentate gyrus (DG granule neurons. Activation of this signaling pathway results in serine10 (S10 phosphorylation and lysine14 (K14 acetylation at histone H3 (H3S10p-K14ac, leading to the induction of the immediate early genes c-Fos and Egr-1. In addition, we found a role of the DNA methylation status of gene promoters. A series of studies showed that these molecular mechanisms play a critical role in the long-lasting consolidation of behavioral responses in the forced swim test and Morris water maze. Furthermore, an important role of GABA was found in controlling the epigenetic and gene transcriptional responses to psychological stress. Thus, psychologically stressful events evoke a long-term impact on behavior through changes in hippocampal function brought about by distinct glutamatergic and glucocorticoid-driven changes in epigenetic regulation of gene transcription which are modulated by (local GABAergic interneurons and limbic afferent inputs. These epigenetic processes may play an important role in the etiology of stress-related mental disorders such as major depressive and anxiety disorders like PTSD.

  15. Epistasis Analysis for Estrogen Metabolic and Signaling Pathway Genes on Young Ischemic Stroke Patients

    Science.gov (United States)

    Hsieh, Yi-Chen; Jeng, Jiann-Shing; Lin, Huey-Juan; Hu, Chaur-Jong; Yu, Chia-Chen; Lien, Li-Ming; Peng, Giia-Sheun; Chen, Chin-I; Tang, Sung-Chun; Chi, Nai-Fang; Tseng, Hung-Pin; Chern, Chang-Ming; Hsieh, Fang-I; Bai, Chyi-Huey; Chen, Yi-Rhu; Chiou, Hung-Yi; Jeng, Jiann-Shing; Tang, Sung-Chun; Yeh, Shin-Joe; Tsai, Li-Kai; Kong, Shin; Lien, Li-Ming; Chiu, Hou-Chang; Chen, Wei-Hung; Bai, Chyi-Huey; Huang, Tzu-Hsuan; Chi-Ieong, Lau; Wu, Ya-Ying; Yuan, Rey-Yue; Hu, Chaur-Jong; Sheu, Jau- Jiuan; Yu, Jia-Ming; Ho, Chun-Sum; Chen, Chin-I; Sung, Jia-Ying; Weng, Hsing-Yu; Han, Yu-Hsuan; Huang, Chun-Ping; Chung, Wen-Ting; Ke, Der-Shin; Lin, Huey-Juan; Chang, Chia-Yu; Yeh, Poh-Shiow; Lin, Kao-Chang; Cheng, Tain-Junn; Chou, Chih-Ho; Yang, Chun-Ming; Peng, Giia-Sheun; Lin, Jiann-Chyun; Hsu, Yaw-Don; Denq, Jong-Chyou; Lee, Jiunn-Tay; Hsu, Chang-Hung; Lin, Chun-Chieh; Yen, Che-Hung; Cheng, Chun-An; Sung, Yueh-Feng; Chen, Yuan-Liang; Lien, Ming-Tung; Chou, Chung-Hsing; Liu, Chia-Chen; Yang, Fu-Chi; Wu, Yi-Chung; Tso, An-Chen; Lai, Yu- Hua; Chiang, Chun-I; Tsai, Chia-Kuang; Liu, Meng-Ta; Lin, Ying-Che; Hsu, Yu-Chuan; Chen, Chih-Hung; Sung, Pi-Shan; Chern, Chang-Ming; Hu, Han-Hwa; Wong, Wen-Jang; Luk, Yun-On; Hsu, Li-Chi; Chung, Chih-Ping; Tseng, Hung-Pin; Liu, Chin-Hsiung; Lin, Chun-Liang; Lin, Hung-Chih; Hu, Chaur-Jong

    2012-01-01

    Background Endogenous estrogens play an important role in the overall cardiocirculatory system. However, there are no studies exploring the hormone metabolism and signaling pathway genes together on ischemic stroke, including sulfotransferase family 1E (SULT1E1), catechol-O-methyl-transferase (COMT), and estrogen receptor α (ESR1). Methods A case-control study was conducted on 305 young ischemic stroke subjects aged ≦ 50 years and 309 age-matched healthy controls. SULT1E1 -64G/A, COMT Val158Met, ESR1 c.454−397 T/C and c.454−351 A/G genes were genotyped and compared between cases and controls to identify single nucleotide polymorphisms associated with ischemic stroke susceptibility. Gene-gene interaction effects were analyzed using entropy-based multifactor dimensionality reduction (MDR), classification and regression tree (CART), and traditional multiple regression models. Results COMT Val158Met polymorphism showed a significant association with susceptibility of young ischemic stroke among females. There was a two-way interaction between SULT1E1 -64G/A and COMT Val158Met in both MDR and CART analysis. The logistic regression model also showed there was a significant interaction effect between SULT1E1 -64G/A and COMT Val158Met on ischemic stroke of the young (P for interaction = 0.0171). We further found that lower estradiol level could increase the risk of young ischemic stroke for those who carry either SULT1E1 or COMT risk genotypes, showing a significant interaction effect (P for interaction = 0.0174). Conclusions Our findings support that a significant epistasis effect exists among estrogen metabolic and signaling pathway genes and gene-environment interactions on young ischemic stroke subjects. PMID:23112845

  16. Mechanical stress activates Smad pathway through PKCδ to enhance interleukin-11 gene transcription in osteoblasts.

    Directory of Open Access Journals (Sweden)

    Shinsuke Kido

    Full Text Available BACKGROUND: Mechanical stress rapidly induces ΔFosB expression in osteoblasts, which binds to interleukin (IL-11 gene promoter to enhance IL-11 expression, and IL-11 enhances osteoblast differentiation. Because bone morphogenetic proteins (BMPs also stimulate IL-11 expression in osteoblasts, there is a possibility that BMP-Smad signaling is involved in the enhancement of osteoblast differentiation by mechanical stress. The present study was undertaken to clarify whether mechanical stress affects BMP-Smad signaling, and if so, to elucidate the role of Smad signaling in mechanical stress-induced enhancement of IL-11 gene transcription. METHODOLOGY/PRINCIPAL FINDINGS: Mechanical loading by fluid shear stress (FSS induced phosphorylation of BMP-specific receptor-regulated Smads (BR-Smads, Smad1/5, in murine primary osteoblasts (mPOBs. FSS rapidly phosphorylated Y311 of protein kinase C (PKCδ, and phosphorylated PKCδ interacted with BR-Smads to phosphorylate BR-Smads. Transfection of PKCδ siRNA or Y311F mutant PKCδ abrogated BR-Smads phosphorylation and suppressed IL-11 gene transcription enhanced by FSS. Activated BR-Smads bound to the Smad-binding element (SBE of IL-11 gene promoter and formed complex with ΔFosB/JunD heterodimer via binding to the C-terminal region of JunD. Site-directed mutagenesis in the SBE and the AP-1 site revealed that both SBE and AP-1 sites were required for full activation of IL-11 gene promoter by FSS. CONCLUSIONS/SIGNIFICANCE: These results demonstrate that PKCδ-BR-Smads pathway plays an important role in the intracellular signaling in response to mechanical stress, and that a cross-talk between PKCδ-BR-Smads and ΔFosB/JunD pathways synergistically stimulates IL-11 gene transcription in response to mechanical stress.

  17. Semantics in support of biodiversity knowledge discovery: an introduction to the biological collections ontology and related ontologies.

    Science.gov (United States)

    Walls, Ramona L; Deck, John; Guralnick, Robert; Baskauf, Steve; Beaman, Reed; Blum, Stanley; Bowers, Shawn; Buttigieg, Pier Luigi; Davies, Neil; Endresen, Dag; Gandolfo, Maria Alejandra; Hanner, Robert; Janning, Alyssa; Krishtalka, Leonard; Matsunaga, Andréa; Midford, Peter; Morrison, Norman; Ó Tuama, Éamonn; Schildhauer, Mark; Smith, Barry; Stucky, Brian J; Thomer, Andrea; Wieczorek, John; Whitacre, Jamie; Wooley, John

    2014-01-01

    The study of biodiversity spans many disciplines and includes data pertaining to species distributions and abundances, genetic sequences, trait measurements, and ecological niches, complemented by information on collection and measurement protocols. A review of the current landscape of metadata standards and ontologies in biodiversity science suggests that existing standards such as the Darwin Core terminology are inadequate for describing biodiversity data in a semantically meaningful and computationally useful way. Existing ontologies, such as the Gene Ontology and others in the Open Biological and Biomedical Ontologies (OBO) Foundry library, provide a semantic structure but lack many of the necessary terms to describe biodiversity data in all its dimensions. In this paper, we describe the motivation for and ongoing development of a new Biological Collections Ontology, the Environment Ontology, and the Population and Community Ontology. These ontologies share the aim of improving data aggregation and integration across the biodiversity domain and can be used to describe physical samples and sampling processes (for example, collection, extraction, and preservation techniques), as well as biodiversity observations that involve no physical sampling. Together they encompass studies of: 1) individual organisms, including voucher specimens from ecological studies and museum specimens, 2) bulk or environmental samples (e.g., gut contents, soil, water) that include DNA, other molecules, and potentially many organisms, especially microbes, and 3) survey-based ecological observations. We discuss how these ontologies can be applied to biodiversity use cases that span genetic, organismal, and ecosystem levels of organization. We argue that if adopted as a standard and rigorously applied and enriched by the biodiversity community, these ontologies would significantly reduce barriers to data discovery, integration, and exchange among biodiversity resources and researchers.

  18. Semantics in Support of Biodiversity Knowledge Discovery: An Introduction to the Biological Collections Ontology and Related Ontologies

    Science.gov (United States)

    Baskauf, Steve; Blum, Stanley; Bowers, Shawn; Davies, Neil; Endresen, Dag; Gandolfo, Maria Alejandra; Hanner, Robert; Janning, Alyssa; Krishtalka, Leonard; Matsunaga, Andréa; Midford, Peter; Tuama, Éamonn Ó.; Schildhauer, Mark; Smith, Barry; Stucky, Brian J.; Thomer, Andrea; Wieczorek, John; Whitacre, Jamie; Wooley, John

    2014-01-01

    The study of biodiversity spans many disciplines and includes data pertaining to species distributions and abundances, genetic sequences, trait measurements, and ecological niches, complemented by information on collection and measurement protocols. A review of the current landscape of metadata standards and ontologies in biodiversity science suggests that existing standards such as the Darwin Core terminology are inadequate for describing biodiversity data in a semantically meaningful and computationally useful way. Existing ontologies, such as the Gene Ontology and others in the Open Biological and Biomedical Ontologies (OBO) Foundry library, provide a semantic structure but lack many of the necessary terms to describe biodiversity data in all its dimensions. In this paper, we describe the motivation for and ongoing development of a new Biological Collections Ontology, the Environment Ontology, and the Population and Community Ontology. These ontologies share the aim of improving data aggregation and integration across the biodiversity domain and can be used to describe physical samples and sampling processes (for example, collection, extraction, and preservation techniques), as well as biodiversity observations that involve no physical sampling. Together they encompass studies of: 1) individual organisms, including voucher specimens from ecological studies and museum specimens, 2) bulk or environmental samples (e.g., gut contents, soil, water) that include DNA, other molecules, and potentially many organisms, especially microbes, and 3) survey-based ecological observations. We discuss how these ontologies can be applied to biodiversity use cases that span genetic, organismal, and ecosystem levels of organization. We argue that if adopted as a standard and rigorously applied and enriched by the biodiversity community, these ontologies would significantly reduce barriers to data discovery, integration, and exchange among biodiversity resources and researchers

  19. Owlready: Ontology-oriented programming in Python with automatic classification and high level constructs for biomedical ontologies.

    Science.gov (United States)

    Lamy, Jean-Baptiste

    2017-07-01

    Ontologies are widely used in the biomedical domain. While many tools exist for the edition, alignment or evaluation of ontologies, few solutions have been proposed for ontology programming interface, i.e. for accessing and modifying an ontology within a programming language. Existing query languages (such as SPARQL) and APIs (such as OWLAPI) are not as easy-to-use as object programming languages are. Moreover, they provide few solutions to difficulties encountered with biomedical ontologies. Our objective was to design a tool for accessing easily the entities of an OWL ontology, with high-level constructs helping with biomedical ontologies. From our experience on medical ontologies, we identified two difficulties: (1) many entities are represented by classes (rather than individuals), but the existing tools do not permit manipulating classes as easily as individuals, (2) ontologies rely on the open-world assumption, whereas the medical reasoning must consider only evidence-based medical knowledge as true. We designed a Python module for ontology-oriented programming. It allows access to the entities of an OWL ontology as if they were objects in the programming language. We propose a simple high-level syntax for managing classes and the associated "role-filler" constraints. We also propose an algorithm for performing local closed world reasoning in simple situations. We developed Owlready, a Python module for a high-level access to OWL ontologies. The paper describes the architecture and the syntax of the module version 2. It details how we integrated the OWL ontology model with the Python object model. The paper provides examples based on Gene Ontology (GO). We also demonstrate the interest of Owlready in a use case focused on the automatic comparison of the contraindications of several drugs. This use case illustrates the use of the specific syntax proposed for manipulating classes and for performing local closed world reasoning. Owlready has been successfully

  20. Multiple interactions between maternally-activated signalling pathways control Xenopus nodal-related genes.

    Science.gov (United States)

    Rex, Maria; Hilton, Emma; Old, Robert

    2002-03-01

    We have investigated the induction of the six Xenopus nodal-related genes, Xnr1-Xnr6, by maternal determinants. The beta-catenin pathway was modelled by stimulation using Xwnt8, activin-like signalling was modelled by activin, and VegT action was studied by overexpression in animal cap explants. Combinations of factors were examined, and previously unrecognised interactions were revealed in animal caps and whole embryos. For the induction of Xnr5 and Xnr6 in whole embryos, using a beta-catenin antisense morpholino oligonucleotide or a dominant negative XTcf3, we have demonstrated an absolute permissive requirement for the beta-catenin/Tcf pathway, in addition to the requirement for VegT action. In animal caps Xnr5 and Xnr6 are induced in response to VegT overexpression, and this induction is dependent upon the concomitant activation of the beta-catenin pathway that VegT initiates in animal caps. For the induction of Xnr3, VegT interacts negatively so as to inhibit the induction otherwise observed with wnt-signalling alone. The negative effect of VegT is not the result of a general inhibition of wnt-signalling, and does not result from an inhibition of wnt-induced siamois expression. A 294 bp proximal promoter fragment of the Xnr3 gene is sufficient to mediate the negative effect of VegT. Further experiments, employing cycloheximide to examine the dependence of Xnr gene expression upon proteins translated after the mid-blastula stage, demonstrated that Xnrs 4, 5 and 6 are 'primary' Xnr genes whose expression in the late blastula is solely dependent upon factors present before the mid-blastula stage.

  1. Completeness, supervenience and ontology

    International Nuclear Information System (INIS)

    Maudlin, Tim W E

    2007-01-01

    In 1935, Einstein, Podolsky and Rosen raised the issue of the completeness of the quantum description of a physical system. What they had in mind is whether or not the quantum description is informationally complete, in that all physical features of a system can be recovered from it. In a collapse theory such as the theory of Ghirardi, Rimini and Weber, the quantum wavefunction is informationally complete, and this has often been taken to suggest that according to that theory the wavefunction is all there is. If we distinguish the ontological completeness of a description from its informational completeness, we can see that the best interpretations of the GRW theory must postulate more physical ontology than just the wavefunction

  2. Completeness, supervenience and ontology

    Energy Technology Data Exchange (ETDEWEB)

    Maudlin, Tim W E [Department of Philosophy, Rutgers University, 26 Nichol Avenue, New Brunswick, NJ 08901-1411 (United States)

    2007-03-23

    In 1935, Einstein, Podolsky and Rosen raised the issue of the completeness of the quantum description of a physical system. What they had in mind is whether or not the quantum description is informationally complete, in that all physical features of a system can be recovered from it. In a collapse theory such as the theory of Ghirardi, Rimini and Weber, the quantum wavefunction is informationally complete, and this has often been taken to suggest that according to that theory the wavefunction is all there is. If we distinguish the ontological completeness of a description from its informational completeness, we can see that the best interpretations of the GRW theory must postulate more physical ontology than just the wavefunction.

  3. Transcriptomic profiling of bovine IVF embryos revealed candidate genes and pathways involved in early embryonic development

    Directory of Open Access Journals (Sweden)

    Yandell Brian S

    2010-01-01

    Full Text Available Abstract Background Early embryonic loss is a large contributor to infertility in cattle. Although genetic factors are known to affect early embryonic development, the discovery of such factors has been a serious challenge. The objective of this study was to identify genes differentially expressed between blastocysts and degenerative embryos at early stages of development. Results Using microarrays, genome-wide RNA expression was profiled and compared for in vitro fertilization (IVF - derived blastocysts and embryos undergoing degenerative development up to the same time point. Surprisingly similar transcriptomic profiles were found in degenerative embryos and blastocysts. Nonetheless, we identified 67 transcripts that significantly differed between these two groups of embryos at a 15% false discovery rate, including 33 transcripts showing at least a two-fold difference. Several signaling and metabolic pathways were found to be associated with the developmental status of embryos, among which were previously known important steroid biosynthesis and cell communication pathways in early embryonic development. Conclusions This study presents the first direct and comprehensive comparison of transcriptomes between IVF blastocysts and degenerative embryos, providing important information for potential genes and pathways associated with early embryonic development.

  4. How Did Arthropod Sesquiterpenoids and Ecdysteroids Arise? Comparison of Hormonal Pathway Genes in Noninsect Arthropod Genomes.

    Science.gov (United States)

    Qu, Zhe; Kenny, Nathan James; Lam, Hon Ming; Chan, Ting Fung; Chu, Ka Hou; Bendena, William G; Tobe, Stephen S; Hui, Jerome Ho Lam

    2015-06-25

    The phylum Arthropoda contains the largest number of described living animal species, with insects and crustaceans dominating the terrestrial and aquatic environments, respectively. Their successful radiations have long been linked to their rigid exoskeleton in conjunction with their specialized endocrine systems. In order to understand how hormones can contribute to the evolution of these animals, here, we have categorized the sesquiterpenoid and ecdysteroid pathway genes in the noninsect arthropod genomes, which are known to play important roles in the regulation of molting and metamorphosis in insects. In our analyses, the majority of gene homologs involved in the biosynthetic, degradative, and signaling pathways of sesquiterpenoids and ecdysteroids can be identified, implying these two hormonal systems were present in the last common ancestor of arthropods. Moreover, we found that the "Broad-Complex" was specifically gained in the Pancrustacea, and the innovation of juvenile hormone (JH) in the insect linage correlates with the gain of the JH epoxidase (CYP15A1/C1) and the key residue changes in the binding domain of JH receptor ("Methoprene-tolerant"). Furthermore, the gain of "Phantom" differentiates chelicerates from the other arthropods in using ponasterone A rather than 20-hydroxyecdysone as molting hormone. This study establishes a comprehensive framework for interpreting the evolution of these vital hormonal pathways in these most successful animals, the arthropods, for the first time. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. How Did Arthropod Sesquiterpenoids and Ecdysteroids Arise? Comparison of Hormonal Pathway Genes in Noninsect Arthropod Genomes

    Science.gov (United States)

    Qu, Zhe; Kenny, Nathan James; Lam, Hon Ming; Chan, Ting Fung; Chu, Ka Hou; Bendena, William G.; Tobe, Stephen S.; Hui, Jerome Ho Lam

    2015-01-01

    The phylum Arthropoda contains the largest number of described living animal species, with insects and crustaceans dominating the terrestrial and aquatic environments, respectively. Their successful radiations have long been linked to their rigid exoskeleton in conjunction with their specialized endocrine systems. In order to understand how hormones can contribute to the evolution of these animals, here, we have categorized the sesquiterpenoid and ecdysteroid pathway genes in the noninsect arthropod genomes, which are known to play important roles in the regulation of molting and metamorphosis in insects. In our analyses, the majority of gene homologs involved in the biosynthetic, degradative, and signaling pathways of sesquiterpenoids and ecdysteroids can be identified, implying these two hormonal systems were present in the last common ancestor of arthropods. Moreover, we found that the “Broad-Complex” was specifically gained in the Pancrustacea, and the innovation of juvenile hormone (JH) in the insect linage correlates with the gain of the JH epoxidase (CYP15A1/C1) and the key residue changes in the binding domain of JH receptor (“Methoprene-tolerant”). Furthermore, the gain of “Phantom” differentiates chelicerates from the other arthropods in using ponasterone A rather than 20-hydroxyecdysone as molting hormone. This study establishes a comprehensive framework for interpreting the evolution of these vital hormonal pathways in these most successful animals, the arthropods, for the first time. PMID:26112967

  6. Human protein secretory pathway genes are expressed in a tissue-specific pattern to match processing demands of the secretome

    DEFF Research Database (Denmark)

    Feizi, Amir; Gatto, Francesco; Uhlén, Mathias

    2017-01-01

    Protein secretory pathway in eukaryal cells is responsible for delivering functional secretory proteins. The dysfunction of this pathway causes a range of important human diseases from congenital disorders to cancer. Despite the piled-up knowledge on the molecular biology and biochemistry level...... in specific gene families of the secretory pathway. We also inspected the potential functional link between detected extreme genes and the corresponding tissues enriched secretome. As a result, the detected extreme genes showed correlation with the enrichment of the nature and number of specific post......-translational modifications in each tissue's secretome. Our findings conciliate both the housekeeping and tissue-specific nature of the protein secretory pathway, which we attribute to a fine-tuned regulation of defined gene families to support the diversity of secreted proteins and their modifications....

  7. LOGISTICS OPTIMIZATION USING ONTOLOGIES

    OpenAIRE

    Hendi , Hayder; Ahmad , Adeel; Bouneffa , Mourad; Fonlupt , Cyril

    2014-01-01

    International audience; Logistics processes involve complex physical flows and integration of different elements. It is widely observed that the uncontrolled processes can decline the state of logistics. The optimization of logistic processes can support the desired growth and consistent continuity of logistics. In this paper, we present a software framework for logistic processes optimization. It primarily defines logistic ontologies and then optimize them. It intends to assist the design of...

  8. Gene networks and toxicity pathways induced by acute cadmium exposure in adult largemouth bass (Micropterus salmoides)

    International Nuclear Information System (INIS)

    Mehinto, Alvine C.; Prucha, Melinda S.; Colli-Dula, Reyna C.; Kroll, Kevin J.; Lavelle, Candice M.; Barber, David S.; Vulpe, Christopher D.; Denslow, Nancy D.

    2014-01-01

    Highlights: • Low-level acute cadmium exposure elicited tissue-specific gene expression changes. • Molecular initiating events included oxidative stress and disruption of DNA repair. • Metallothionein, a marker of metal exposure, was not significantly affected. • We report effects of cadmium on cholesterol metabolism and steroid synthesis. • Diabetic complications and impaired reproduction are potential adverse outcomes. - Abstract: Cadmium is a heavy metal that can accumulate to toxic levels in the environment leading to detrimental effects in animals and humans including kidney, liver and lung injuries. Using a transcriptomics approach, genes and cellular pathways affected by a low dose of cadmium were investigated. Adult largemouth bass were intraperitoneally injected with 20 μg/kg of cadmium chloride (mean exposure level – 2.6 μg of cadmium per fish) and microarray analyses were conducted in the liver and testis 48 h after injection. Transcriptomic profiles identified in response to cadmium exposure were tissue-specific with the most differential expression changes found in the liver tissues, which also contained much higher levels of cadmium than the testis. Acute exposure to a low dose of cadmium induced oxidative stress response and oxidative damage pathways in the liver. The mRNA levels of antioxidants such as catalase increased and numerous transcripts related to DNA damage and DNA repair were significantly altered. Hepatic mRNA levels of metallothionein, a molecular marker of metal exposure, did not increase significantly after 48 h exposure. Carbohydrate metabolic pathways were also disrupted with hepatic transcripts such as UDP-glucose, pyrophosphorylase 2, and sorbitol dehydrogenase highly induced. Both tissues exhibited a disruption of steroid signaling pathways. In the testis, estrogen receptor beta and transcripts linked to cholesterol metabolism were suppressed. On the contrary, genes involved in cholesterol metabolism were highly

  9. Gene networks and toxicity pathways induced by acute cadmium exposure in adult largemouth bass (Micropterus salmoides)

    Energy Technology Data Exchange (ETDEWEB)

    Mehinto, Alvine C., E-mail: alvinam@sccwrp.org [Southern California Coastal Water Research Project, Costa Mesa, CA 92626 (United States); Department of Physiological Sciences and Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL 32611 (United States); Prucha, Melinda S. [Department of Human Genetics, Yerkes National Primate Research Center, Emory University, Atlanta, GA 30322 (United States); Department of Physiological Sciences and Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL 32611 (United States); Colli-Dula, Reyna C.; Kroll, Kevin J.; Lavelle, Candice M.; Barber, David S. [Department of Physiological Sciences and Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL 32611 (United States); Vulpe, Christopher D. [Department of Nutritional Sciences and Toxicology, University of California, Berkeley, CA 94720 (United States); Denslow, Nancy D. [Department of Physiological Sciences and Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL 32611 (United States)

    2014-07-01

    Highlights: • Low-level acute cadmium exposure elicited tissue-specific gene expression changes. • Molecular initiating events included oxidative stress and disruption of DNA repair. • Metallothionein, a marker of metal exposure, was not significantly affected. • We report effects of cadmium on cholesterol metabolism and steroid synthesis. • Diabetic complications and impaired reproduction are potential adverse outcomes. - Abstract: Cadmium is a heavy metal that can accumulate to toxic levels in the environment leading to detrimental effects in animals and humans including kidney, liver and lung injuries. Using a transcriptomics approach, genes and cellular pathways affected by a low dose of cadmium were investigated. Adult largemouth bass were intraperitoneally injected with 20 μg/kg of cadmium chloride (mean exposure level – 2.6 μg of cadmium per fish) and microarray analyses were conducted in the liver and testis 48 h after injection. Transcriptomic profiles identified in response to cadmium exposure were tissue-specific with the most differential expression changes found in the liver tissues, which also contained much higher levels of cadmium than the testis. Acute exposure to a low dose of cadmium induced oxidative stress response and oxidative damage pathways in the liver. The mRNA levels of antioxidants such as catalase increased and numerous transcripts related to DNA damage and DNA repair were significantly altered. Hepatic mRNA levels of metallothionein, a molecular marker of metal exposure, did not increase significantly after 48 h exposure. Carbohydrate metabolic pathways were also disrupted with hepatic transcripts such as UDP-glucose, pyrophosphorylase 2, and sorbitol dehydrogenase highly induced. Both tissues exhibited a disruption of steroid signaling pathways. In the testis, estrogen receptor beta and transcripts linked to cholesterol metabolism were suppressed. On the contrary, genes involved in cholesterol metabolism were highly

  10. A unified anatomy ontology of the vertebrate skeletal system.

    Directory of Open Access Journals (Sweden)

    Wasila M Dahdul

    Full Text Available The skeleton is of fundamental importance in research in comparative vertebrate morphology, paleontology, biomechanics, developmental biology, and systematics. Motivated by research questions that require computational access to and comparative reasoning across the diverse skeletal phenotypes of vertebrates, we developed a module of anatomical concepts for the skeletal system, the Vertebrate Skeletal Anatomy Ontology (VSAO, to accommodate and unify the existing skeletal terminologies for the species-specific (mouse, the frog Xenopus, zebrafish and multispecies (teleost, amphibian vertebrate anatomy ontologies. Previous differences between these terminologies prevented even simple queries across databases pertaining to vertebrate morphology. This module of upper-level and specific skeletal terms currently includes 223 defined terms and 179 synonyms that integrate skeletal cells, tissues, biological processes, organs (skeletal elements such as bones and cartilages, and subdivisions of the skeletal system. The VSAO is designed to integrate with other ontologies, including the Common Anatomy Reference Ontology (CARO, Gene Ontology (GO, Uberon, and Cell Ontology (CL, and it is freely available to the community to be updated with additional terms required for research. Its structure accommodates anatomical variation among vertebrate species in development, structure, and composition. Annotation of diverse vertebrate phenotypes with this ontology will enable novel inquiries across the full spectrum of phenotypic diversity.

  11. A unified anatomy ontology of the vertebrate skeletal system.

    Science.gov (United States)

    Dahdul, Wasila M; Balhoff, James P; Blackburn, David C; Diehl, Alexander D; Haendel, Melissa A; Hall, Brian K; Lapp, Hilmar; Lundberg, John G; Mungall, Christopher J; Ringwald, Martin; Segerdell, Erik; Van Slyke, Ceri E; Vickaryous, Matthew K; Westerfield, Monte; Mabee, Paula M

    2012-01-01

    The skeleton is of fundamental importance in research in comparative vertebrate morphology, paleontology, biomechanics, developmental biology, and systematics. Motivated by research questions that require computational access to and comparative reasoning across the diverse skeletal phenotypes of vertebrates, we developed a module of anatomical concepts for the skeletal system, the Vertebrate Skeletal Anatomy Ontology (VSAO), to accommodate and unify the existing skeletal terminologies for the species-specific (mouse, the frog Xenopus, zebrafish) and multispecies (teleost, amphibian) vertebrate anatomy ontologies. Previous differences between these terminologies prevented even simple queries across databases pertaining to vertebrate morphology. This module of upper-level and specific skeletal terms currently includes 223 defined terms and 179 synonyms that integrate skeletal cells, tissues, biological processes, organs (skeletal elements such as bones and cartilages), and subdivisions of the skeletal system. The VSAO is designed to integrate with other ontologies, including the Common Anatomy Reference Ontology (CARO), Gene Ontology (GO), Uberon, and Cell Ontology (CL), and it is freely available to the community to be updated with additional terms required for research. Its structure accommodates anatomical variation among vertebrate species in development, structure, and composition. Annotation of diverse vertebrate phenotypes with this ontology will enable novel inquiries across the full spectrum of phenotypic diversity.

  12. A Unified Anatomy Ontology of the Vertebrate Skeletal System

    Science.gov (United States)

    Dahdul, Wasila M.; Balhoff, James P.; Blackburn, David C.; Diehl, Alexander D.; Haendel, Melissa A.; Hall, Brian K.; Lapp, Hilmar; Lundberg, John G.; Mungall, Christopher J.; Ringwald, Martin; Segerdell, Erik; Van Slyke, Ceri E.; Vickaryous, Matthew K.; Westerfield, Monte; Mabee, Paula M.

    2012-01-01

    The skeleton is of fundamental importance in research in comparative vertebrate morphology, paleontology, biomechanics, developmental biology, and systematics. Motivated by research questions that require computational access to and comparative reasoning across the diverse skeletal phenotypes of vertebrates, we developed a module of anatomical concepts for the skeletal system, the Vertebrate Skeletal Anatomy Ontology (VSAO), to accommodate and unify the existing skeletal terminologies for the species-specific (mouse, the frog Xenopus, zebrafish) and multispecies (teleost, amphibian) vertebrate anatomy ontologies. Previous differences between these terminologies prevented even simple queries across databases pertaining to vertebrate morphology. This module of upper-level and specific skeletal terms currently includes 223 defined terms and 179 synonyms that integrate skeletal cells, tissues, biological processes, organs (skeletal elements such as bones and cartilages), and subdivisions of the skeletal system. The VSAO is designed to integrate with other ontologies, including the Common Anatomy Reference Ontology (CARO), Gene Ontology (GO), Uberon, and Cell Ontology (CL), and it is freely available to the community to be updated with additional terms required for research. Its structure accommodates anatomical variation among vertebrate species in development, structure, and composition. Annotation of diverse vertebrate phenotypes with this ontology will enable novel inquiries across the full spectrum of phenotypic diversity. PMID:23251424

  13. Aberrant DNA methylation of WNT pathway genes in the development and progression of CIMP-negative colorectal cancer.

    Science.gov (United States)

    Galamb, Orsolya; Kalmár, Alexandra; Péterfia, Bálint; Csabai, István; Bodor, András; Ribli, Dezső; Krenács, Tibor; Patai, Árpád V; Wichmann, Barnabás; Barták, Barbara Kinga; Tóth, Kinga; Valcz, Gábor; Spisák, Sándor; Tulassay, Zsolt; Molnár, Béla

    2016-08-02

    The WNT signaling pathway has an essential role in colorectal carcinogenesis and progression, which involves a cascade of genetic and epigenetic changes. We aimed to analyze DNA methylation affecting the WNT pathway genes in colorectal carcinogenesis in promoter and gene body regions using whole methylome analysis in 9 colorectal cancer, 15 adenoma, and 6 normal tumor adjacent tissue (NAT) samples by methyl capture sequencing. Functional methylation was confirmed on 5-aza-2'-deoxycytidine-treated colorectal cancer cell line datasets. In parallel with the DNA methylation analysis, mutations of WNT pathway genes (APC, β-catenin/CTNNB1) were analyzed by 454 sequencing on GS Junior platform. Most differentially methylated CpG sites were localized in gene body regions (95% of WNT pathway genes). In the promoter regions, 33 of the 160 analyzed WNT pathway genes were differentially methylated in colorectal cancer vs. normal, including hypermethylated AXIN2, CHP1, PRICKLE1, SFRP1, SFRP2, SOX17, and hypomethylated CACYBP, CTNNB1, MYC; 44 genes in adenoma vs. NAT; and 41 genes in colorectal cancer vs. adenoma comparisons. Hypermethylation of AXIN2, DKK1, VANGL1, and WNT5A gene promoters was higher, while those of SOX17, PRICKLE1, DAAM2, and MYC was lower in colon carcinoma compared to adenoma. Inverse correlation between expression and methylation was confirmed in 23 genes, including APC, CHP1, PRICKLE1, PSEN1, and SFRP1. Differential methylation affected both canonical and noncanonical WNT pathway genes in colorectal normal-adenoma-carcinoma sequence. Aberrant DNA methylation appears already in adenomas as an early event of colorectal carcinogenesis.

  14. Molecular Pathways: Targeting the Stimulator of Interferon Genes (STING) in the Immunotherapy of Cancer.

    Science.gov (United States)

    Corrales, Leticia; Gajewski, Thomas F

    2015-11-01

    Novel immunotherapy approaches are transforming the treatment of cancer, yet many patients remain refractory to these agents. One hypothesis is that immunotherapy fails because of a tumor microenvironment that fails to support recruitment of immune cells, including CD8(+) T cells. Therefore, new approaches designed to initiate a de novo antitumor immune response from within the tumor microenvironment are being pursued. Recent evidence has indicated that spontaneous activation of the Stimulator of Interferon Genes (STING) pathway within tumor-resident dendritic cells leads to type I IFN production and adaptive immune responses against tumors. This pathway is activated in the presence of cytosolic DNA that is detected by the sensor cyclic GMP-AMP synthase (cGAS) and generates cyclic GMP-AMP (cGAMP), which binds and activates STING. As a therapeutic approach, intratumoral injection of STING agonists has demonstrated profound therapeutic effects in multiple mouse tumor models, including melanoma, colon, breast, prostate, and fibrosarcoma. Better characterization of the STING pathway in human tumor recognition, and the development of new pharmacologic approaches to engage this pathway within the tumor microenvironment in patients, are important areas for clinical translation. ©2015 American Association for Cancer Research.

  15. Analysis of SNPs and haplotypes in vitamin D pathway genes and renal cancer risk.

    Directory of Open Access Journals (Sweden)

    Sara Karami

    2009-09-01

    Full Text Available In the kidney vitamin D is converted to its active form. Since vitamin D exerts its activity through binding to the nuclear vitamin D receptor (VDR, most genetic studies have primarily focused on variation within this gene. Therefore, analysis of genetic variation in VDR and other vitamin D pathway genes may provide insight into the role of vitamin D in renal cell carcinoma (RCC etiology. RCC cases (N = 777 and controls (N = 1,035 were genotyped to investigate the relationship between RCC risk and variation in eight target genes. Minimum-p-value permutation (Min-P tests were used to identify genes associated with risk. A three single nucleotide polymorphism (SNP sliding window was used to identify chromosomal regions with a False Discovery Rate of <10%, where subsequently, haplotype relative risks were computed in Haplostats. Min-P values showed that VDR (p-value = 0.02 and retinoid-X-receptor-alpha (RXRA (p-value = 0.10 were associated with RCC risk. Within VDR, three haplotypes across two chromosomal regions of interest were identified. The first region, located within intron 2, contained two haplotypes that increased RCC risk by approximately 25%. The second region included a haplotype (rs2239179, rs12717991 across intron 4 that increased risk among participants with the TC (OR = 1.31, 95% CI = 1.09-1.57 haplotype compared to participants with the common haplotype, TT. Across RXRA, one haplotype located 3' of the coding sequence (rs748964, rs3118523, increased RCC risk 35% among individuals with the variant haplotype compared to those with the most common haplotype. This study comprehensively evaluated genetic variation across eight vitamin D pathway genes in relation to RCC risk. We found increased risk associated with VDR and RXRA. Replication studies are warranted to confirm these findings.

  16. Rare Copy Number Variations in Adults with Tetralogy of Fallot Implicate Novel Risk Gene Pathways

    Science.gov (United States)

    Costain, Gregory; Merico, Daniele; Migita, Ohsuke; Liu, Ben; Yuen, Tracy; Rickaby, Jessica; Thiruvahindrapuram, Bhooma; Marshall, Christian R.; Scherer, Stephen W.; Bassett, Anne S.

    2012-01-01

    Structural genetic changes, especially copy number variants (CNVs), represent a major source of genetic variation contributing to human disease. Tetralogy of Fallot (TOF) is the most common form of cyanotic congenital heart disease, but to date little is known about the role of CNVs in the etiology of TOF. Using high-resolution genome-wide microarrays and stringent calling methods, we investigated rare CNVs in a prospectively recruited cohort of 433 unrelated adults with TOF and/or pulmonary atresia at a single centre. We excluded those with recognized syndromes, including 22q11.2 deletion syndrome. We identified candidate genes for TOF based on converging evidence between rare CNVs that overlapped the same gene in unrelated individuals and from pathway analyses comparing rare CNVs in TOF cases to those in epidemiologic controls. Even after excluding the 53 (10.7%) subjects with 22q11.2 deletions, we found that adults with TOF had a greater burden of large rare genic CNVs compared to controls (8.82% vs. 4.33%, p = 0.0117). Six loci showed evidence for recurrence in TOF or related congenital heart disease, including typical 1q21.1 duplications in four (1.18%) of 340 Caucasian probands. The rare CNVs implicated novel candidate genes of interest for TOF, including PLXNA2, a gene involved in semaphorin signaling. Independent pathway analyses highlighted developmental processes as potential contributors to the pathogenesis of TOF. These results indicate that individually rare CNVs are collectively significant contributors to the genetic burden of TOF. Further, the data provide new evidence for dosage sensitive genes in PLXNA2-semaphorin signaling and related developmental processes in human cardiovascular development, consistent with previous animal models. PMID:22912587

  17. Multiple Gene-Environment Interactions on the Angiogenesis Gene-Pathway Impact Rectal Cancer Risk and Survival

    Directory of Open Access Journals (Sweden)

    Noha Sharafeldin

    2017-09-01

    Full Text Available Characterization of gene-environment interactions (GEIs in cancer is limited. We aimed at identifying GEIs in rectal cancer focusing on a relevant biologic process involving the angiogenesis pathway and relevant environmental exposures: cigarette smoking, alcohol consumption, and animal protein intake. We analyzed data from 747 rectal cancer cases and 956 controls from the Diet, Activity and Lifestyle as a Risk Factor for Rectal Cancer study. We applied a 3-step analysis approach: first, we searched for interactions among single nucleotide polymorphisms on the pathway genes; second, we searched for interactions among the genes, both steps using Logic regression; third, we examined the GEIs significant at the 5% level using logistic regression for cancer risk and Cox proportional hazards models for survival. Permutation-based test was used for multiple testing adjustment. We identified 8 significant GEIs associated with risk among 6 genes adjusting for multiple testing: TNF (OR = 1.85, 95% CI: 1.10, 3.11, TLR4 (OR = 2.34, 95% CI: 1.38, 3.98, and EGR2 (OR = 2.23, 95% CI: 1.04, 4.78 with smoking; IGF1R (OR = 1.69, 95% CI: 1.04, 2.72, TLR4 (OR = 2.10, 95% CI: 1.22, 3.60 and EGR2 (OR = 2.12, 95% CI: 1.01, 4.46 with alcohol; and PDGFB (OR = 1.75, 95% CI: 1.04, 2.92 and MMP1 (OR = 2.44, 95% CI: 1.24, 4.81 with protein. Five GEIs were associated with survival at the 5% significance level but not after multiple testing adjustment: CXCR1 (HR = 2.06, 95% CI: 1.13, 3.75 with smoking; and KDR (HR = 4.36, 95% CI: 1.62, 11.73, TLR2 (HR = 9.06, 95% CI: 1.14, 72.11, EGR2 (HR = 2.45, 95% CI: 1.42, 4.22, and EGFR (HR = 6.33, 95% CI: 1.95, 20.54 with protein. GEIs between angiogenesis genes and smoking, alcohol, and animal protein impact rectal cancer risk. Our results support the importance of considering the biologic hypothesis to characterize GEIs associated with cancer outcomes.

  18. Keap1/Nrf2 pathway in kidney cancer : frequent methylation of KEAP1 gene promoter in clear renal cell carcinoma

    NARCIS (Netherlands)

    Fabrizio, Federico Pio; Costantini, Manuela; Copetti, Massimiliano; la Torre, Annamaria; Sparaneo, Angelo; Fontana, Andrea; Poeta, Luana; Gallucci, Michele; Sentinelli, Steno; Graziano, Paolo; Parente, Paola; Pompeo, Vincenzo; De Salvo, Laura; Simone, Giuseppe; Papalia, Rocco; Picardo, Francesco; Balsamo, Teresa; Flammia, Gerardo Paolo; Trombetta, Domenico; Pantalone, Angela; Kok, Klaas; Paranita, Ferronika; Muscarella, Lucia Anna; Fazio, Vito Michele

    2017-01-01

    The Keap1/Nrf2 pathway is a master regulator of the cellular redox state through the induction of several antioxidant defence genes implicated in chemotherapeutic drugs resistance of tumor cells. An increasing body of evidence supports a key role for Keap1/Nrf2 pathway in kidney diseases and renal

  19. Consortium analysis of gene and gene–folate interactions in purine and pyrimidine metabolism pathways with ovarian carcinoma risk

    DEFF Research Database (Denmark)

    Kelemen, Linda E; Terry, Kathryn L; Goodman, Marc T

    2014-01-01</