WorldWideScience

Sample records for expressed gene sets

  1. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  2. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  3. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  4. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  5. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  6. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  7. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  8. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  9. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  10. Tracking difference in gene expression in a time-course experiment using gene set enrichment analysis.

    Directory of Open Access Journals (Sweden)

    Pui Shan Wong

    Full Text Available Fistulifera sp. strain JPCC DA0580 is a newly sequenced pennate diatom that is capable of simultaneously growing and accumulating lipids. This is a unique trait, not found in other related microalgae so far. It is able to accumulate between 40 to 60% of its cell weight in lipids, making it a strong candidate for the production of biofuel. To investigate this characteristic, we used RNA-Seq data gathered at four different times while Fistulifera sp. strain JPCC DA0580 was grown in oil accumulating and non-oil accumulating conditions. We then adapted gene set enrichment analysis (GSEA to investigate the relationship between the difference in gene expression of 7,822 genes and metabolic functions in our data. We utilized information in the KEGG pathway database to create the gene sets and changed GSEA to use re-sampling so that data from the different time points could be included in the analysis. Our GSEA method identified photosynthesis, lipid synthesis and amino acid synthesis related pathways as processes that play a significant role in oil production and growth in Fistulifera sp. strain JPCC DA0580. In addition to GSEA, we visualized the results by creating a network of compounds and reactions, and plotted the expression data on top of the network. This made existing graph algorithms available to us which we then used to calculate a path that metabolizes glucose into triacylglycerol (TAG in the smallest number of steps. By visualizing the data this way, we observed a separate up-regulation of genes at different times instead of a concerted response. We also identified two metabolic paths that used less reactions than the one shown in KEGG and showed that the reactions were up-regulated during the experiment. The combination of analysis and visualization methods successfully analyzed time-course data, identified important metabolic pathways and provided new hypotheses for further research.

  11. Genome-wide survey and developmental expression mapping of zebrafish SET domain-containing genes.

    Directory of Open Access Journals (Sweden)

    Xiao-Jian Sun

    Full Text Available SET domain-containing proteins represent an evolutionarily conserved family of epigenetic regulators, which are responsible for most histone lysine methylation. Since some of these genes have been revealed to be essential for embryonic development, we propose that the zebrafish, a vertebrate model organism possessing many advantages for developmental studies, can be utilized to study the biological functions of these genes and the related epigenetic mechanisms during early development. To this end, we have performed a genome-wide survey of zebrafish SET domain genes. 58 genes total have been identified. Although gene duplication events give rise to several lineage-specific paralogs, clear reciprocal orthologous relationship reveals high conservation between zebrafish and human SET domain genes. These data were further subject to an evolutionary analysis ranging from yeast to human, leading to the identification of putative clusters of orthologous groups (COGs of this gene family. By means of whole-mount mRNA in situ hybridization strategy, we have also carried out a developmental expression mapping of these genes. A group of maternal SET domain genes, which are implicated in the programming of histone modification states in early development, have been identified and predicted to be responsible for all known sites of SET domain-mediated histone methylation. Furthermore, some genes show specific expression patterns in certain tissues at certain stages, suggesting the involvement of epigenetic mechanisms in the development of these systems. These results provide a global view of zebrafish SET domain histone methyltransferases in evolutionary and developmental dimensions and pave the way for using zebrafish to systematically study the roles of these genes during development.

  12. Expression map of a complete set of gustatory receptor genes in chemosensory organs of Bombyx mori.

    Science.gov (United States)

    Guo, Huizhen; Cheng, Tingcai; Chen, Zhiwei; Jiang, Liang; Guo, Youbing; Liu, Jianqiu; Li, Shenglong; Taniai, Kiyoko; Asaoka, Kiyoshi; Kadono-Okuda, Keiko; Arunkumar, Kallare P; Wu, Jiaqi; Kishino, Hirohisa; Zhang, Huijie; Seth, Rakesh K; Gopinathan, Karumathil P; Montagné, Nicolas; Jacquin-Joly, Emmanuelle; Goldsmith, Marian R; Xia, Qingyou; Mita, Kazuei

    2017-03-01

    Most lepidopteran species are herbivores, and interaction with host plants affects their gene expression and behavior as well as their genome evolution. Gustatory receptors (Grs) are expected to mediate host plant selection, feeding, oviposition and courtship behavior. However, due to their high diversity, sequence divergence and extremely low level of expression it has been difficult to identify precisely a complete set of Grs in Lepidoptera. By manual annotation and BAC sequencing, we improved annotation of 43 gene sequences compared with previously reported Grs in the most studied lepidopteran model, the silkworm, Bombyx mori, and identified 7 new tandem copies of BmGr30 on chromosome 7, bringing the total number of BmGrs to 76. Among these, we mapped 68 genes to chromosomes in a newly constructed chromosome distribution map and 8 genes to scaffolds; we also found new evidence for large clusters of BmGrs, especially from the bitter receptor family. RNA-seq analysis of diverse BmGr expression patterns in chemosensory organs of larvae and adults enabled us to draw a precise organ specific map of BmGr expression. Interestingly, most of the clustered genes were expressed in the same tissues and more than half of the genes were expressed in larval maxillae, larval thoracic legs and adult legs. For example, BmGr63 showed high expression levels in all organs in both larval and adult stages. By contrast, some genes showed expression limited to specific developmental stages or organs and tissues. BmGr19 was highly expressed in larval chemosensory organs (especially antennae and thoracic legs), the single exon genes BmGr53 and BmGr67 were expressed exclusively in larval tissues, the BmGr27-BmGr31 gene cluster on chr7 displayed a high expression level limited to adult legs and the candidate CO 2 receptor BmGr2 was highly expressed in adult antennae, where few other Grs were expressed. Transcriptional analysis of the Grs in B. mori provides a valuable new reference for

  13. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    International Nuclear Information System (INIS)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-01-01

    Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC

  14. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

    Energy Technology Data Exchange (ETDEWEB)

    Pandi, Narayanan Sathiya, E-mail: sathiyapandi@gmail.com; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.

  15. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Full Text Available Abstract Background The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.

  16. Accurate Gene Expression-Based Biodosimetry Using a Minimal Set of Human Gene Transcripts

    Energy Technology Data Exchange (ETDEWEB)

    Tucker, James D., E-mail: jtucker@biology.biosci.wayne.edu [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Joiner, Michael C. [Department of Radiation Oncology, Wayne State University, Detroit, Michigan (United States); Thomas, Robert A.; Grever, William E.; Bakhmutsky, Marina V. [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Chinkhota, Chantelle N.; Smolinski, Joseph M. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States); Divine, George W. [Department of Public Health Sciences, Henry Ford Hospital, Detroit, Michigan (United States); Auner, Gregory W. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States)

    2014-03-15

    Purpose: Rapid and reliable methods for conducting biological dosimetry are a necessity in the event of a large-scale nuclear event. Conventional biodosimetry methods lack the speed, portability, ease of use, and low cost required for triaging numerous victims. Here we address this need by showing that polymerase chain reaction (PCR) on a small number of gene transcripts can provide accurate and rapid dosimetry. The low cost and relative ease of PCR compared with existing dosimetry methods suggest that this approach may be useful in mass-casualty triage situations. Methods and Materials: Human peripheral blood from 60 adult donors was acutely exposed to cobalt-60 gamma rays at doses of 0 (control) to 10 Gy. mRNA expression levels of 121 selected genes were obtained 0.5, 1, and 2 days after exposure by reverse-transcriptase real-time PCR. Optimal dosimetry at each time point was obtained by stepwise regression of dose received against individual gene transcript expression levels. Results: Only 3 to 4 different gene transcripts, ASTN2, CDKN1A, GDF15, and ATM, are needed to explain ≥0.87 of the variance (R{sup 2}). Receiver-operator characteristics, a measure of sensitivity and specificity, of 0.98 for these statistical models were achieved at each time point. Conclusions: The actual and predicted radiation doses agree very closely up to 6 Gy. Dosimetry at 8 and 10 Gy shows some effect of saturation, thereby slightly diminishing the ability to quantify higher exposures. Analyses of these gene transcripts may be advantageous for use in a field-portable device designed to assess exposures in mass casualty situations or in clinical radiation emergencies.

  17. Selection and validation of a set of reliable reference genes for quantitative sod gene expression analysis in C. elegans

    Directory of Open Access Journals (Sweden)

    Vandesompele Jo

    2008-01-01

    Full Text Available Abstract Background In the nematode Caenorhabditis elegans the conserved Ins/IGF-1 signaling pathway regulates many biological processes including life span, stress response, dauer diapause and metabolism. Detection of differentially expressed genes may contribute to a better understanding of the mechanism by which the Ins/IGF-1 signaling pathway regulates these processes. Appropriate normalization is an essential prerequisite for obtaining accurate and reproducible quantification of gene expression levels. The aim of this study was to establish a reliable set of reference genes for gene expression analysis in C. elegans. Results Real-time quantitative PCR was used to evaluate the expression stability of 12 candidate reference genes (act-1, ama-1, cdc-42, csq-1, eif-3.C, mdh-1, gpd-2, pmp-3, tba-1, Y45F10D.4, rgs-6 and unc-16 in wild-type, three Ins/IGF-1 pathway mutants, dauers and L3 stage larvae. After geNorm analysis, cdc-42, pmp-3 and Y45F10D.4 showed the most stable expression pattern and were used to normalize 5 sod expression levels. Significant differences in mRNA levels were observed for sod-1 and sod-3 in daf-2 relative to wild-type animals, whereas in dauers sod-1, sod-3, sod-4 and sod-5 are differentially expressed relative to third stage larvae. Conclusion Our findings emphasize the importance of accurate normalization using stably expressed reference genes. The methodology used in this study is generally applicable to reliably quantify gene expression levels in the nematode C. elegans using quantitative PCR.

  18. In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer.

    Science.gov (United States)

    Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

    2013-10-04

    Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

    Directory of Open Access Journals (Sweden)

    Karacali Bilge

    2007-10-01

    Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine

  20. Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

    Directory of Open Access Journals (Sweden)

    Clive H Glover

    2006-11-01

    Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42 showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.

  1. Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2015-12-01

    Full Text Available Background: The presence of diverse types of nanomaterials (NMs in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2, carbon black (CB or carbon nanotubes (CNTs to determine the disease significance of these data-driven gene sets.Results: Biclusters representing inflammation (chemokine activity, DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032. The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles.Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

  2. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.; Medvedeva, Yulia; Baeck, Pia M.; Hegde, Shubhada R.; Mande, Shekhar C.; Makeev, Vsevolod J.

    2013-01-01

    interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set

  3. Can survival prediction be improved by merging gene expression data sets?

    Directory of Open Access Journals (Sweden)

    Haleh Yasrebi

    Full Text Available BACKGROUND: High-throughput gene expression profiling technologies generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC and hazard ratio as performance measures, we see in overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a The diversity of microarray platforms used. (b The heterogeneity of patients cohorts. (c The heterogeneity of breast cancer disease. (d Substantial variation of time to death or relapse. (e The reduced number of genes in the merged data

  4. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  5. Meta-analysis of Drosophila circadian microarray studies identifies a novel set of rhythmically expressed genes.

    Directory of Open Access Journals (Sweden)

    Kevin P Keegan

    2007-11-01

    Full Text Available Five independent groups have reported microarray studies that identify dozens of rhythmically expressed genes in the fruit fly Drosophila melanogaster. Limited overlap among the lists of discovered genes makes it difficult to determine which, if any, exhibit truly rhythmic patterns of expression. We reanalyzed data from all five reports and found two sources for the observed discrepancies, the use of different expression pattern detection algorithms and underlying variation among the datasets. To improve upon the methods originally employed, we developed a new analysis that involves compilation of all existing data, application of identical transformation and standardization procedures followed by ANOVA-based statistical prescreening, and three separate classes of post hoc analysis: cross-correlation to various cycling waveforms, autocorrelation, and a previously described fast Fourier transform-based technique. Permutation-based statistical tests were used to derive significance measures for all post hoc tests. We find application of our method, most significantly the ANOVA prescreening procedure, significantly reduces the false discovery rate relative to that observed among the results of the original five reports while maintaining desirable statistical power. We identify a set of 81 cycling transcripts previously found in one or more of the original reports as well as a novel set of 133 transcripts not found in any of the original studies. We introduce a novel analysis method that compensates for variability observed among the original five Drosophila circadian array reports. Based on the statistical fidelity of our meta-analysis results, and the results of our initial validation experiments (quantitative RT-PCR, we predict many of our newly found genes to be bona fide cyclers, and suggest that they may lead to new insights into the pathways through which clock mechanisms regulate behavioral rhythms.

  6. Gene expression

    International Nuclear Information System (INIS)

    Hildebrand, C.E.; Crawford, B.D.; Walters, R.A.; Enger, M.D.

    1983-01-01

    We prepared probes for isolating functional pieces of the metallothionein locus. The probes enabled a variety of experiments, eventually revealing two mechanisms for metallothionein gene expression, the order of the DNA coding units at the locus, and the location of the gene site in its chromosome. Once the switch regulating metallothionein synthesis was located, it could be joined by recombinant DNA methods to other, unrelated genes, then reintroduced into cells by gene-transfer techniques. The expression of these recombinant genes could then be induced by exposing the cells to Zn 2+ or Cd 2+ . We would thus take advantage of the clearly defined switching properties of the metallothionein gene to manipulate the expression of other, perhaps normally constitutive, genes. Already, despite an incomplete understanding of how the regulatory switch of the metallothionein locus operates, such experiments have been performed successfully

  7. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  8. Development of a set of SNP markers present in expressed genes of the apple.

    Science.gov (United States)

    Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

    2008-11-01

    Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.

  9. Genome-Wide Temporal Expression Profiling in Caenorhabditis elegans Identifies a Core Gene Set Related to Long-Term Memory.

    Science.gov (United States)

    Freytag, Virginie; Probst, Sabine; Hadziselimovic, Nils; Boglari, Csaba; Hauser, Yannick; Peter, Fabian; Gabor Fenyves, Bank; Milnik, Annette; Demougin, Philippe; Vukojevic, Vanja; de Quervain, Dominique J-F; Papassotiropoulos, Andreas; Stetak, Attila

    2017-07-12

    The identification of genes related to encoding, storage, and retrieval of memories is a major interest in neuroscience. In the current study, we analyzed the temporal gene expression changes in a neuronal mRNA pool during an olfactory long-term associative memory (LTAM) in Caenorhabditis elegans hermaphrodites. Here, we identified a core set of 712 (538 upregulated and 174 downregulated) genes that follows three distinct temporal peaks demonstrating multiple gene regulation waves in LTAM. Compared with the previously published positive LTAM gene set (Lakhina et al., 2015), 50% of the identified upregulated genes here overlap with the previous dataset, possibly representing stimulus-independent memory-related genes. On the other hand, the remaining genes were not previously identified in positive associative memory and may specifically regulate aversive LTAM. Our results suggest a multistep gene activation process during the formation and retrieval of long-term memory and define general memory-implicated genes as well as conditioning-type-dependent gene sets. SIGNIFICANCE STATEMENT The identification of genes regulating different steps of memory is of major interest in neuroscience. Identification of common memory genes across different learning paradigms and the temporal activation of the genes are poorly studied. Here, we investigated the temporal aspects of Caenorhabditis elegans gene expression changes using aversive olfactory associative long-term memory (LTAM) and identified three major gene activation waves. Like in previous studies, aversive LTAM is also CREB dependent, and CREB activity is necessary immediately after training. Finally, we define a list of memory paradigm-independent core gene sets as well as conditioning-dependent genes. Copyright © 2017 the authors 0270-6474/17/376661-12$15.00/0.

  10. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Science.gov (United States)

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the

  11. Comprehensive Analysis of MILE Gene Expression Data Set Advances Discovery of Leukaemia Type and Subtype Biomarkers.

    Science.gov (United States)

    Labaj, Wojciech; Papiez, Anna; Polanski, Andrzej; Polanska, Joanna

    2017-03-01

    Large collections of data in studies on cancer such as leukaemia provoke the necessity of applying tailored analysis algorithms to ensure supreme information extraction. In this work, a custom-fit pipeline is demonstrated for thorough investigation of the voluminous MILE gene expression data set. Three analyses are accomplished, each for gaining a deeper understanding of the processes underlying leukaemia types and subtypes. First, the main disease groups are tested for differential expression against the healthy control as in a standard case-control study. Here, the basic knowledge on molecular mechanisms is confirmed quantitatively and by literature references. Second, pairwise comparison testing is performed for juxtaposing the main leukaemia types among each other. In this case by means of the Dice coefficient similarity measure the general relations are pointed out. Moreover, lists of candidate main leukaemia group biomarkers are proposed. Finally, with this approach being successful, the third analysis provides insight into all of the studied subtypes, followed by the emergence of four leukaemia subtype biomarkers. In addition, the class enhanced DEG signature obtained on the basis of novel pipeline processing leads to significantly better classification power of multi-class data classifiers. The developed methodology consisting of batch effect adjustment, adaptive noise and feature filtration coupled with adequate statistical testing and biomarker definition proves to be an effective approach towards knowledge discovery in high-throughput molecular biology experiments.

  12. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

    Science.gov (United States)

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-03-01

    comprehensive gene data set of sex pheromone biosynthesis and degradation enzyme related genes in DBM created by genome- and transcriptome-wide identification, characterization and expression profiling. Our findings provide a basis to better understand the function of genes with tissue enriched expression. The results also provide information on the genes involved in sex pheromone biosynthesis and degradation, and may be useful to identify potential gene targets for pest control strategies by disrupting the insect-insect communication using pheromone-based behavioral antagonists.

  13. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    Directory of Open Access Journals (Sweden)

    Hettne Kristina M

    2013-01-01

    Full Text Available Abstract Background Availability of chemical response-specific lists of genes (gene sets for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM, and that these can be used with gene set analysis (GSA methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human and 588 (mouse gene sets from the Comparative Toxicogenomics Database (CTD. We tested for significant differential expression (SDE (false discovery rate -corrected p-values Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect.

  14. Transcriptome-wide selection of a reliable set of reference genes for gene expression studies in potato cyst nematodes (Globodera spp.).

    Science.gov (United States)

    Sabeh, Michael; Duceppe, Marc-Olivier; St-Arnaud, Marc; Mimee, Benjamin

    2018-01-01

    Relative gene expression analyses by qRT-PCR (quantitative reverse transcription PCR) require an internal control to normalize the expression data of genes of interest and eliminate the unwanted variation introduced by sample preparation. A perfect reference gene should have a constant expression level under all the experimental conditions. However, the same few housekeeping genes selected from the literature or successfully used in previous unrelated experiments are often routinely used in new conditions without proper validation of their stability across treatments. The advent of RNA-Seq and the availability of public datasets for numerous organisms are opening the way to finding better reference genes for expression studies. Globodera rostochiensis is a plant-parasitic nematode that is particularly yield-limiting for potato. The aim of our study was to identify a reliable set of reference genes to study G. rostochiensis gene expression. Gene expression levels from an RNA-Seq database were used to identify putative reference genes and were validated with qRT-PCR analysis. Three genes, GR, PMP-3, and aaRS, were found to be very stable within the experimental conditions of this study and are proposed as reference genes for future work.

  15. Identification of a set of endogenous reference genes for miRNA expression studies in Parkinson's disease blood samples.

    Science.gov (United States)

    Serafin, Alice; Foco, Luisa; Blankenburg, Hagen; Picard, Anne; Zanigni, Stefano; Zanon, Alessandra; Pramstaller, Peter P; Hicks, Andrew A; Schwienbacher, Christine

    2014-10-10

    Research on microRNAs (miRNAs) is becoming an increasingly attractive field, as these small RNA molecules are involved in several physiological functions and diseases. To date, only few studies have assessed the expression of blood miRNAs related to Parkinson's disease (PD) using microarray and quantitative real-time PCR (qRT-PCR). Measuring miRNA expression involves normalization of qRT-PCR data using endogenous reference genes for calibration, but their choice remains a delicate problem with serious impact on the resulting expression levels. The aim of the present study was to evaluate the suitability of a set of commonly used small RNAs as normalizers and to identify which of these miRNAs might be considered reliable reference genes in qRT-PCR expression analyses on PD blood samples. Commonly used reference genes snoRNA RNU24, snRNA RNU6B, snoRNA Z30 and miR-103a-3p were selected from the literature. We then analyzed the effect of using these genes as reference, alone or in any possible combination, on the measured expression levels of the target genes miR-30b-5p and miR-29a-3p, which have been previously reported to be deregulated in PD blood samples. We identified RNU24 and Z30 as a reliable and stable pair of reference genes in PD blood samples.

  16. Integrating genome-wide association study and expression quantitative trait loci data identifies multiple genes and gene set associated with neuroticism.

    Science.gov (United States)

    Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng

    2017-08-01

    Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.

  17. Identification of self-consistent modulons from bacterial microarray expression data with the help of structured regulon gene sets

    KAUST Repository

    Permina, Elizaveta A.

    2013-01-01

    Identification of bacterial modulons from series of gene expression measurements on microarrays is a principal problem, especially relevant for inadequately studied but practically important species. Usage of a priori information on regulatory interactions helps to evaluate parameters for regulatory subnetwork inference. We suggest a procedure for modulon construction where a seed regulon is iteratively updated with genes having expression patterns similar to those for regulon member genes. A set of genes essential for a regulon is used to control modulon updating. Essential genes for a regulon were selected as a subset of regulon genes highly related by different measures to each other. Using Escherichia coli as a model, we studied how modulon identification depends on the data, including the microarray experiments set, the adopted relevance measure and the regulon itself. We have found that results of modulon identification are highly dependent on all parameters studied and thus the resulting modulon varies substantially depending on the identification procedure. Yet, modulons that were identified correctly displayed higher stability during iterations, which allows developing a procedure for reliable modulon identification in the case of less studied species where the known regulatory interactions are sparse. Copyright © 2013 Taylor & Francis.

  18. The Schizophrenia-Associated BRD1 Gene Regulates Behavior, Neurotransmission, and Expression of Schizophrenia Risk Enriched Gene Sets in Mice.

    Science.gov (United States)

    Qvist, Per; Christensen, Jane Hvarregaard; Vardya, Irina; Rajkumar, Anto Praveen; Mørk, Arne; Paternoster, Veerle; Füchtbauer, Ernst-Martin; Pallesen, Jonatan; Fryland, Tue; Dyrvig, Mads; Hauberg, Mads Engel; Lundsberg, Birgitte; Fejgin, Kim; Nyegaard, Mette; Jensen, Kimmo; Nyengaard, Jens Randel; Mors, Ole; Didriksen, Michael; Børglum, Anders Dupont

    2017-07-01

    The schizophrenia-associated BRD1 gene encodes a transcriptional regulator whose comprehensive chromatin interactome is enriched with schizophrenia risk genes. However, the biology underlying the disease association of BRD1 remains speculative. This study assessed the transcriptional drive of a schizophrenia-associated BRD1 risk variant in vitro. Accordingly, to examine the effects of reduced Brd1 expression, we generated a genetically modified Brd1 +/- mouse and subjected it to behavioral, electrophysiological, molecular, and integrative genomic analyses with focus on schizophrenia-relevant parameters. Brd1 +/- mice displayed cerebral histone H3K14 hypoacetylation and a broad range of behavioral changes with translational relevance to schizophrenia. These behaviors were accompanied by striatal dopamine/serotonin abnormalities and cortical excitation-inhibition imbalances involving loss of parvalbumin immunoreactive interneurons. RNA-sequencing analyses of cortical and striatal micropunches from Brd1 +/- and wild-type mice revealed differential expression of genes enriched for schizophrenia risk, including several schizophrenia genome-wide association study risk genes (e.g., calcium channel subunits [Cacna1c and Cacnb2], cholinergic muscarinic receptor 4 [Chrm4)], dopamine receptor D 2 [Drd2], and transcription factor 4 [Tcf4]). Integrative analyses further found differentially expressed genes to cluster in functional networks and canonical pathways associated with mental illness and molecular signaling processes (e.g., glutamatergic, monoaminergic, calcium, cyclic adenosine monophosphate [cAMP], dopamine- and cAMP-regulated neuronal phosphoprotein 32 kDa [DARPP-32], and cAMP responsive element binding protein signaling [CREB]). Our study bridges the gap between genetic association and pathogenic effects and yields novel insights into the unfolding molecular changes in the brain of a new schizophrenia model that incorporates genetic risk at three levels: allelic

  19. Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression

    Directory of Open Access Journals (Sweden)

    Calvo-Dmgz D.

    2012-12-01

    Full Text Available DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. This large amount of gene expression data has been used by researchers seeking diagnosis of diseases like cancer using machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the last decade. This work integrates explicit biological knowledge, provided as gene sets, into the classication process by means of Variable Precision Rough Set Theory (VPRS. The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification which is able to incorporate prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then we apply rough set theory to select the most promising supergenes and to derive a set of easy interpretable classification rules. The proposed model is evaluated over three breast cancer microarrays datasets obtaining successful results compared to classical classification techniques. The experimental results shows that there are not significat differences between our model and classical techniques but it is able to provide a biological-interpretable explanation of how it classifies new samples.

  20. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Pugalendhi Ganesh Kumar

    Full Text Available This study describes a novel approach to reducing the challenges of highly nonlinear multiclass gene expression values for cancer diagnosis. To build a fruitful system for cancer diagnosis, in this study, we introduced two levels of gene selection such as filtering and embedding for selection of potential genes and the most relevant genes associated with cancer, respectively. The filter procedure was implemented by developing a fuzzy rough set (FR-based method for redefining the criterion function of f-information (FI to identify the potential genes without discretizing the continuous gene expression values. The embedded procedure is implemented by means of a water swirl algorithm (WSA, which attempts to optimize the rule set and membership function required to classify samples using a fuzzy-rule-based multiclassification system (FRBMS. Two novel update equations are proposed in WSA, which have better exploration and exploitation abilities while designing a self-learning FRBMS. The efficiency of our new approach was evaluated on 13 multicategory and 9 binary datasets of cancer gene expression. Additionally, the performance of the proposed FRFI-WSA method in designing an FRBMS was compared with existing methods for gene selection and optimization such as genetic algorithm (GA, particle swarm optimization (PSO, and artificial bee colony algorithm (ABC on all the datasets. In the global cancer map with repeated measurements (GCM_RM dataset, the FRFI-WSA showed the smallest number of 16 most relevant genes associated with cancer using a minimal number of 26 compact rules with the highest classification accuracy (96.45%. In addition, the statistical validation used in this study revealed that the biological relevance of the most relevant genes associated with cancer and their linguistics detected by the proposed FRFI-WSA approach are better than those in the other methods. The simple interpretable rules with most relevant genes and effectively

  1. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses

    OpenAIRE

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-01-01

    Background Female moths synthesize species-specific sex pheromone components and release them to attract male moths, which depend on precise sex pheromone chemosensory system to locate females. Two types of genes involved in the sex pheromone biosynthesis and degradation pathways play essential roles in this important moth behavior. To understand the function of genes in the sex pheromone pathway, this study investigated the genome-wide and digital gene expression of sex pheromone biosynthesi...

  2. Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods

    Science.gov (United States)

    Väremo, Leif; Nielsen, Jens; Nookaew, Intawat

    2013-01-01

    Gene set analysis (GSA) is used to elucidate genome-wide data, in particular transcriptome data. A multitude of methods have been proposed for this step of the analysis, and many of them have been compared and evaluated. Unfortunately, there is no consolidated opinion regarding what methods should be preferred, and the variety of available GSA software and implementations pose a difficulty for the end-user who wants to try out different methods. To address this, we have developed the R package Piano that collects a range of GSA methods into the same system, for the benefit of the end-user. Further on we refine the GSA workflow by using modifications of the gene-level statistics. This enables us to divide the resulting gene set P-values into three classes, describing different aspects of gene expression directionality at gene set level. We use our fully implemented workflow to investigate the impact of the individual components of GSA by using microarray and RNA-seq data. The results show that the evaluated methods are globally similar and the major separation correlates well with our defined directionality classes. As a consequence of this, we suggest to use a consensus scoring approach, based on multiple GSA runs. In combination with the directionality classes, this constitutes a more thorough basis for an enriched biological interpretation. PMID:23444143

  3. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, D.A. van; Goeman, J.J.; Jong, E. de; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    BACKGROUND: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  4. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    Hettne, K.M.; Boorsma, A.; Dartel, van D.A.M.; Goeman, J.J.; Jong, de E.; Piersma, A.H.; Stierum, R.H.; Kleinjans, J.C.; Kors, J.A.

    2013-01-01

    Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set

  5. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

     Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...

  6. Data set for diet specific differential gene expression analysis in three Spodoptera moths

    Directory of Open Access Journals (Sweden)

    A. Roy

    2016-09-01

    Full Text Available Examination of closely related species pairs is suggested for evolutionary comparisons of different degrees of polyphagy, which we did here with three taxa of lepidopteran herbivores, Spodoptera spp (S. littoralis, S. frugiperda maize (C and rice (R strains for a RNAseq analysis of the midguts from the 3rd instar insect larvae for differential metabolic responses after feeding on pinto bean based artificial diet vs maize leaves. Paired-end (2×100 bp Illumina HiSeq2500 sequencing resulted in a total of 24, 23, 24, and 21 million reads for the SF-C-Maize, SF-C-Pinto, SF-R-Maize, SF-R Pinto, and a total of 35 and 36 million reads for the SL-Maize and SL-Pinto samples, respectively. After quality control measures, a total of 62.2 million reads from SL and 71.7 million reads from SF were used for transcriptome assembly (TA. The resulting final de novo reference TA (backbone for the SF taxa contained 37,985 contigs with a N50 contig size of 1030 bp and a maximum contig length of 17,093 bp, while for SL, 28,329 contigs were generated with a N50 contig size of 1980 bp and a maximum contig length of 18,267 bp. The data presented herein contains supporting information related to our research article Roy et al. (2016 http://dx.doi.org/10.1016/j.ibmb.2016.02.006 [1]. Keywords: Differential expression analysis (DGE, Transcriptomics, Spodoptera, Adaptation, Generalist, Specialist, RPKM (reads per kilo base of transcript per million mapped reads, RNA seq

  7. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    NARCIS (Netherlands)

    K.M. Hettne (Kristina); J. Boorsma (Jeffrey); D.A.M. van Dartel (Dorien A M); J.J. Goeman (Jelle); E.C. de Jong (Esther); A.H. Piersma (Aldert); R.H. Stierum (Rob); J. Kleinjans (Jos); J.A. Kors (Jan)

    2013-01-01

    textabstractBackground: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with

  8. AnovArray: a set of SAS macros for the analysis of variance of gene expression data

    Directory of Open Access Journals (Sweden)

    Renard Jean-Paul

    2005-06-01

    Full Text Available Abstract Background Analysis of variance is a powerful approach to identify differentially expressed genes in a complex experimental design for microarray and macroarray data. The advantage of the anova model is the possibility to evaluate multiple sources of variation in an experiment. Results AnovArray is a package implementing ANOVA for gene expression data using SAS® statistical software. The originality of the package is 1 to quantify the different sources of variation on all genes together, 2 to provide a quality control of the model, 3 to propose two models for a gene's variance estimation and to perform a correction for multiple comparisons. Conclusion AnovArray is freely available at http://www-mig.jouy.inra.fr/stat/AnovArray and requires only SAS® statistical software.

  9. Sexual and asexual oogenesis require the expression of unique and shared sets of genes in the insect Acyrthosiphon pisum

    Directory of Open Access Journals (Sweden)

    Gallot Aurore

    2012-02-01

    Full Text Available Abstract Background Although sexual reproduction is dominant within eukaryotes, asexual reproduction is widespread and has evolved independently as a derived trait in almost all major taxa. How asexuality evolved in sexual organisms is unclear. Aphids, such as Acyrthosiphon pisum, alternate between asexual and sexual reproductive means, as the production of parthenogenetic viviparous females or sexual oviparous females and males varies in response to seasonal photoperiodism. Consequently, sexual and asexual development in aphids can be analyzed simultaneously in genetically identical individuals. Results We compared the transcriptomes of aphid embryos in the stages of development during which the trajectory of oogenesis is determined for producing sexual or asexual gametes. This study design aimed at identifying genes involved in the onset of the divergent mechanisms that result in the sexual or asexual phenotype. We detected 33 genes that were differentially transcribed in sexual and asexual embryos. Functional annotation by gene ontology (GO showed a biological signature of oogenesis, cell cycle regulation, epigenetic regulation and RNA maturation. In situ hybridizations demonstrated that 16 of the differentially-transcribed genes were specifically expressed in germ cells and/or oocytes of asexual and/or sexual ovaries, and therefore may contribute to aphid oogenesis. We categorized these 16 genes by their transcription patterns in the two types of ovaries; they were: i expressed during sexual and asexual oogenesis; ii expressed during sexual and asexual oogenesis but with different localizations; or iii expressed only during sexual or asexual oogenesis. Conclusions Our results show that asexual and sexual oogenesis in aphids share common genetic programs but diverge by adapting specificities in their respective gene expression profiles in germ cells and oocytes.

  10. Gene set analysis using variance component tests.

    Science.gov (United States)

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  11. Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification

    Science.gov (United States)

    2018-01-01

    One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets were published, these are usually in very poor agreement with each other, and very few of the genes proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in the cancer genome atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes. PMID:29470520

  12. Differential gene expression in granulosa cells from polycystic ovary syndrome patients with and without insulin resistance: identification of susceptibility gene sets through network analysis.

    Science.gov (United States)

    Kaur, Surleen; Archer, Kellie J; Devi, M Gouri; Kriplani, Alka; Strauss, Jerome F; Singh, Rita

    2012-10-01

    Polycystic ovary syndrome (PCOS) is a heterogeneous, genetically complex, endocrine disorder of uncertain etiology in women. Our aim was to compare the gene expression profiles in stimulated granulosa cells of PCOS women with and without insulin resistance vs. matched controls. This study included 12 normal ovulatory women (controls), 12 women with PCOS without evidence for insulin resistance (PCOS non-IR), and 16 women with insulin resistance (PCOS-IR) undergoing in vitro fertilization. Granulosa cell gene expression profiling was accomplished using Affymetrix Human Genome-U133 arrays. Differentially expressed genes were classified according to gene ontology using ingenuity pathway analysis tools. Microarray results for selected genes were confirmed by real-time quantitative PCR. A total of 211 genes were differentially expressed in PCOS non-IR and PCOS-IR granulosa cells (fold change≥1.5; P≤0.001) vs. matched controls. Diabetes mellitus and inflammation genes were significantly increased in PCOS-IR patients. Real-time quantitative PCR confirmed higher expression of NCF2 (2.13-fold), TCF7L2 (1.92-fold), and SERPINA1 (5.35-fold). Increased expression of inflammation genes ITGAX (3.68-fold) and TAB2 (1.86-fold) was confirmed in PCOS non-IR. Different cardiometabolic disease genes were differentially expressed in the two groups. Decreased expression of CAV1 (-3.58-fold) in PCOS non-IR and SPARC (-1.88-fold) in PCOS-IR was confirmed. Differential expression of genes involved in TGF-β signaling (IGF2R, increased; and HAS2, decreased), and oxidative stress (TXNIP, increased) was confirmed in both groups. Microarray analysis demonstrated differential expression of genes linked to diabetes mellitus, inflammation, cardiovascular diseases, and infertility in the granulosa cells of PCOS women with and without insulin resistance. Because these dysregulated genes are also involved in oxidative stress, lipid metabolism, and insulin signaling, we hypothesize that these

  13. Heat Stress and Lipopolysaccharide Stimulation of Chicken Macrophage-Like Cell Line Activates Expression of Distinct Sets of Genes.

    Directory of Open Access Journals (Sweden)

    Anna Slawinska

    Full Text Available Acute heat stress requires immediate adjustment of the stressed individual to sudden changes of ambient temperatures. Chickens are particularly sensitive to heat stress due to development of insufficient physiological mechanisms to mitigate its effects. One of the symptoms of heat stress is endotoxemia that results from release of the lipopolysaccharide (LPS from the guts. Heat-related cytotoxicity is mitigated by the innate immune system, which is comprised mostly of phagocytic cells such as monocytes and macrophages. The objective of this study was to analyze the molecular responses of the chicken macrophage-like HD11 cell line to combined heat stress and lipopolysaccharide treatment in vitro. The cells were heat-stressed and then allowed a temperature-recovery period, during which the gene expression was investigated. LPS was added to the cells to mimic the heat-stress-related endotoxemia. Semi high-throughput gene expression analysis was used to study a gene panel comprised of heat shock proteins, stress-related genes, signaling molecules and immune response genes. HD11 cell line responded to heat stress with increased mRNA abundance of the HSP25, HSPA2 and HSPH1 chaperones as well as DNAJA4 and DNAJB6 co-chaperones. The anti-apoptotic gene BAG3 was also highly up-regulated, providing evidence that the cells expressed pro-survival processes. The immune response of the HD11 cell line to LPS in the heat stress environment (up-regulation of CCL4, CCL5, IL1B, IL8 and iNOS was higher than in thermoneutral conditions. However, the peak in the transcriptional regulation of the immune genes was after two hours of temperature-recovery. Therefore, we propose the potential influence of the extracellular heat shock proteins not only in mitigating effects of abiotic stress but also in triggering the higher level of the immune responses. Finally, use of correlation networks for the data analysis aided in discovering subtle differences in the gene

  14. Poster: Observing change in crowded data sets in 3D space - Visualizing gene expression in human tissues

    KAUST Repository

    Rogowski, Marcin

    2013-03-01

    We have been confronted with a real-world problem of visualizing and observing change of gene expression between different human tissues. In this paper, we are presenting a universal representation space based on two-dimensional gel electrophoresis as opposed to force-directed layouts encountered most often in similar problems. We are discussing the methods we devised to make observing change more convenient in a 3D virtual reality environment. © 2013 IEEE.

  15. Evaluation of endogenous control genes for gene expression studies across multiple tissues and in the specific sets of fat- and muscle-type samples of the pig.

    Science.gov (United States)

    Gu, Y R; Li, M Z; Zhang, K; Chen, L; Jiang, A A; Wang, J Y; Li, X W

    2011-08-01

    To normalize a set of quantitative real-time PCR (q-PCR) data, it is essential to determine an optimal number/set of housekeeping genes, as the abundance of housekeeping genes can vary across tissues or cells during different developmental stages, or even under certain environmental conditions. In this study, of the 20 commonly used endogenous control genes, 13, 18 and 17 genes exhibited credible stability in 56 different tissues, 10 types of adipose tissue and five types of muscle tissue, respectively. Our analysis clearly showed that three optimal housekeeping genes are adequate for an accurate normalization, which correlated well with the theoretical optimal number (r ≥ 0.94). In terms of economical and experimental feasibility, we recommend the use of the three most stable housekeeping genes for calculating the normalization factor. Based on our results, the three most stable housekeeping genes in all analysed samples (TOP2B, HSPCB and YWHAZ) are recommended for accurate normalization of q-PCR data. We also suggest that two different sets of housekeeping genes are appropriate for 10 types of adipose tissue (the HSPCB, ALDOA and GAPDH genes) and five types of muscle tissue (the TOP2B, HSPCB and YWHAZ genes), respectively. Our report will serve as a valuable reference for other studies aimed at measuring tissue-specific mRNA abundance in porcine samples. © 2011 Blackwell Verlag GmbH.

  16. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  17. Bridging cancer biology with the clinic: relative expression of a GRHL2-mediated gene-set pair predicts breast cancer metastasis.

    Directory of Open Access Journals (Sweden)

    Xinan Yang

    Full Text Available Identification and characterization of crucial gene target(s that will allow focused therapeutics development remains a challenge. We have interrogated the putative therapeutic targets associated with the transcription factor Grainy head-like 2 (GRHL2, a critical epithelial regulatory factor. We demonstrate the possibility to define the molecular functions of critical genes in terms of their personalized expression profiles, allowing appropriate functional conclusions to be derived. A novel methodology, relative expression analysis with gene-set pairs (RXA-GSP, is designed to explore the potential clinical utility of cancer-biology discovery. Observing that Grhl2-overexpression leads to increased metastatic potential in vitro, we established a model assuming Grhl2-induced or -inhibited genes confer poor or favorable prognosis respectively for cancer metastasis. Training on public gene expression profiles of 995 breast cancer patients, this method prioritized one gene-set pair (GRHL2, CDH2, FN1, CITED2, MKI67 versus CTNNB1 and CTNNA3 from all 2717 possible gene-set pairs (GSPs. The identified GSP significantly dichotomized 295 independent patients for metastasis-free survival (log-rank tested p = 0.002; severe empirical p = 0.035. It also showed evidence of clinical prognostication in another independent 388 patients collected from three studies (log-rank tested p = 3.3e-6. This GSP is independent of most traditional prognostic indicators, and is only significantly associated with the histological grade of breast cancer (p = 0.0017, a GRHL2-associated clinical character (p = 6.8e-6, Spearman correlation, suggesting that this GSP is reflective of GRHL2-mediated events. Furthermore, a literature review indicates the therapeutic potential of the identified genes. This research demonstrates a novel strategy to integrate both biological experiments and clinical gene expression profiles for extracting and elucidating the genomic

  18. Gene Expression Omnibus (GEO)

    Data.gov (United States)

    U.S. Department of Health & Human Services — Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided...

  19. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...

  20. Thy1.2 driven expression of transgenic His₆-SUMO2 in the brain of mice alters a restricted set of genes.

    Science.gov (United States)

    Rossner, Moritz J; Tirard, Marilyn

    2014-08-05

    Protein SUMOylation is a post-translational protein modification with a key regulatory role in nerve cell development and function, but its function in mammals in vivo has only been studied cursorily. We generated two new transgenic mouse lines that express His6-tagged SUMO1 and SUMO2 driven by the Thy1.2 promoter. The brains of mice of the two lines express transgenic His6-SUMO peptides and conjugate them to substrates in vivo but cytoarchitecture and synaptic organization of adult transgenic mouse brains are indistinguishable from the wild-type situation. We investigated the impact of transgenic SUMO expression on gene transcription in the hippocampus by performing genome wide analyses using microarrays. Surprisingly, no changes were observed in Thy1.2::His6-SUMO1 transgenic mice and only a restricted set of genes were upregulated in Thy1.2::His6-SUMO2 mice. Among these, Penk1 (Preproenkephalin 1), which encodes Met-enkephalin neuropeptides, showed the highest degree of alteration. Accordingly, a significant increase in Met-enkephalin peptide levels in the hippocampus of Thy1.2::His6-SUMO2 was detected, but the expression levels and cellular localization of Met-enkephalin receptors were not changed. Thus, transgenic neuronal expression of His6-SUMO1 or His6-SUMO2 only induces very minor phenotypical changes in mice. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Synthetic promoter libraries- tuning of gene expression

    DEFF Research Database (Denmark)

    Hammer, Karin; Mijakovic, Ivan; Jensen, Peter Ruhdal

    2006-01-01

    knockout and strong overexpression. However, applications such as metabolic optimization and control analysis necessitate a continuous set of expression levels with only slight increments in strength to cover a specific window around the wildtype expression level of the studied gene; this requirement can......The study of gene function often requires changing the expression of a gene and evaluating the consequences. In principle, the expression of any given gene can be modulated in a quasi-continuum of discrete expression levels but the traditional approaches are usually limited to two extremes: gene...

  2. HU participates in expression of a specific set of genes required for growth and survival at acidic pH in Escherichia coli.

    Science.gov (United States)

    Bi, Hongkai; Sun, Lianle; Fukamachi, Toshihiko; Saito, Hiromi; Kobayashi, Hiroshi

    2009-05-01

    The major histone-like Escherichia coli protein, HU, is composed of alpha and beta subunits respectively encoded by hupA and hupB in Escherichia coli. A mutant deficient in both hupA and hupB grew at a slightly slower rate than the wild type at pH 7.5. Growth of the mutant diminished with a decrease in pH, and no growth was observed at pH 4.6. Mutants of either hupA or hupB grew at all pH levels tested. The arginine-dependent survival at pH 2.5 was diminished approximately 60-fold by the deletion of both hupA and hupB, whereas the survival was slightly affected by the deletion of either hupA or hupB. The mRNA levels of adiA and adiC, which respectively encode arginine decarboxylase and arginine/agmatine antiporter, were low in the mutant deficient in both hupA and hupB. The deletion of both hupA and hupB had little effect on survival at pH 2.5 in the presence of glutamate or lysine, and expression of the genes for glutamate and lysine decarboxylases was not impaired by the deletion of the HU genes. These results suggest that HU regulates expression of the specific set of genes required for growth and survival in acidic environments.

  3. Gene expression and gene therapy imaging

    International Nuclear Information System (INIS)

    Rome, Claire; Couillaud, Franck; Moonen, Chrit T.W.

    2007-01-01

    The fast growing field of molecular imaging has achieved major advances in imaging gene expression, an important element of gene therapy. Gene expression imaging is based on specific probes or contrast agents that allow either direct or indirect spatio-temporal evaluation of gene expression. Direct evaluation is possible with, for example, contrast agents that bind directly to a specific target (e.g., receptor). Indirect evaluation may be achieved by using specific substrate probes for a target enzyme. The use of marker genes, also called reporter genes, is an essential element of MI approaches for gene expression in gene therapy. The marker gene may not have a therapeutic role itself, but by coupling the marker gene to a therapeutic gene, expression of the marker gene reports on the expression of the therapeutic gene. Nuclear medicine and optical approaches are highly sensitive (detection of probes in the picomolar range), whereas MRI and ultrasound imaging are less sensitive and require amplification techniques and/or accumulation of contrast agents in enlarged contrast particles. Recently developed MI techniques are particularly relevant for gene therapy. Amongst these are the possibility to track gene therapy vectors such as stem cells, and the techniques that allow spatiotemporal control of gene expression by non-invasive heating (with MRI guided focused ultrasound) and the use of temperature sensitive promoters. (orig.)

  4. A Bayesian variable selection procedure for ranking overlapping gene sets

    DEFF Research Database (Denmark)

    Skarman, Axel; Mahdi Shariati, Mohammad; Janss, Luc

    2012-01-01

    Background Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been de...

  5. The PR/SET Domain Zinc Finger Protein Prdm4 Regulates Gene Expression in Embryonic Stem Cells but Plays a Nonessential Role in the Developing Mouse Embryo

    Science.gov (United States)

    Bogani, Debora; Morgan, Marc A. J.; Nelson, Andrew C.; Costello, Ita; McGouran, Joanna F.; Kessler, Benedikt M.

    2013-01-01

    Prdm4 is a highly conserved member of the Prdm family of PR/SET domain zinc finger proteins. Many well-studied Prdm family members play critical roles in development and display striking loss-of-function phenotypes. Prdm4 functional contributions have yet to be characterized. Here, we describe its widespread expression in the early embryo and adult tissues. We demonstrate that DNA binding is exclusively mediated by the Prdm4 zinc finger domain, and we characterize its tripartite consensus sequence via SELEX (systematic evolution of ligands by exponential enrichment) and ChIP-seq (chromatin immunoprecipitation-sequencing) experiments. In embryonic stem cells (ESCs), Prdm4 regulates key pluripotency and differentiation pathways. Two independent strategies, namely, targeted deletion of the zinc finger domain and generation of a EUCOMM LacZ reporter allele, resulted in functional null alleles. However, homozygous mutant embryos develop normally and adults are healthy and fertile. Collectively, these results strongly suggest that Prdm4 functions redundantly with other transcriptional partners to cooperatively regulate gene expression in the embryo and adult animal. PMID:23918801

  6. RBiomirGS: an all-in-one miRNA gene set analysis solution featuring target mRNA mapping and expression profile integration

    Directory of Open Access Journals (Sweden)

    Jing Zhang

    2018-01-01

    Full Text Available Background With the continuous discovery of microRNA’s (miRNA association with a wide range of biological and cellular processes, expression profile-based functional characterization of such post-transcriptional regulation is crucial for revealing its significance behind particular phenotypes. Profound advancement in bioinformatics has been made to enable in depth investigation of miRNA’s role in regulating cellular and molecular events, resulting in a huge quantity of software packages covering different aspects of miRNA functional analysis. Therefore, an all-in-one software solution is in demand for a comprehensive yet highly efficient workflow. Here we present RBiomirGS, an R package for a miRNA gene set (GS analysis. Methods The package utilizes multiple databases for target mRNA mapping, estimates miRNA effect on the target mRNAs through miRNA expression profile and conducts a logistic regression-based GS enrichment. Additionally, human ortholog Entrez ID conversion functionality is included for target mRNAs. Results By incorporating all the core steps into one package, RBiomirGS eliminates the need for switching between different software packages. The modular structure of RBiomirGS enables various access points to the analysis, with which users can choose the most relevant functionalities for their workflow. Conclusions With RBiomirGS, users are able to assess the functional significance of the miRNA expression profile under the corresponding experimental condition by minimal input and intervention. Accordingly, RBiomirGS encompasses an all-in-one solution for miRNA GS analysis. RBiomirGS is available on GitHub (http://github.com/jzhangc/RBiomirGS. More information including instruction and examples can be found on website (http://kenstoreylab.com/?page_id=2865.

  7. Imaging gene expression in gene therapy

    International Nuclear Information System (INIS)

    Wiebe, Leonard I.

    1997-01-01

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on 'suicide gene therapy' of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k + ) has been use for 'suicide' in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k + gene expression where the H S V-1 t k + gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([ 18 F]F H P G; [ 18 F]-A C V), and pyrimidine- ([ 123 / 131 I]I V R F U; [ 124 / 131I ]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [ 123 / 131I ]I V R F U imaging with the H S V-1 t k + reporter gene will be presented

  8. Imaging gene expression in gene therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, Leonard I. [Alberta Univ., Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-12-31

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on `suicide gene therapy` of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k{sup +}) has been use for `suicide` in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k{sup +} gene expression where the H S V-1 t k{sup +} gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([{sup 18} F]F H P G; [{sup 18} F]-A C V), and pyrimidine- ([{sup 123}/{sup 131} I]I V R F U; [{sup 124}/{sup 131I}]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [{sup 123}/{sup 131I}]I V R F U imaging with the H S V-1 t k{sup +} reporter gene will be presented

  9. Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

    Science.gov (United States)

    Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

    2011-11-01

    Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.

  10. Regulation of eucaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Brent, R.; Ptashne, M.S

    1989-05-23

    This patent describes a method of regulating the expression of a gene in a eucaryotic cell. The method consists of: providing in the eucaryotic cell, a peptide, derived from or substantially similar to a peptide of a procaryotic cell able to bind to DNA upstream from or within the gene, the amount of the peptide being sufficient to bind to the gene and thereby control expression of the gene.

  11. Differential Gene Expression and Aging

    Directory of Open Access Journals (Sweden)

    Laurent Seroude

    2002-01-01

    Full Text Available It has been established that an intricate program of gene expression controls progression through the different stages in development. The equally complex biological phenomenon known as aging is genetically determined and environmentally modulated. This review focuses on the genetic component of aging, with a special emphasis on differential gene expression. At least two genetic pathways regulating organism longevity act by modifying gene expression. Many genes are also subjected to age-dependent transcriptional regulation. Some age-related gene expression changes are prevented by caloric restriction, the most robust intervention that slows down the aging process. Manipulating the expression of some age-regulated genes can extend an organism's life span. Remarkably, the activity of many transcription regulatory elements is linked to physiological age as opposed to chronological age, indicating that orderly and tightly controlled regulatory pathways are active during aging.

  12. Expression patterns of porcine Toll-like receptors family set of genes (TLR1-10) in gut-associated lymphoid tissues alter with age.

    Science.gov (United States)

    Uddin, Muhammad Jasim; Kaewmala, Kanokwan; Tesfaye, Dawit; Tholen, Ernst; Looft, Christian; Hoelker, Michael; Schellander, Karl; Cinar, Mehmet Ulas

    2013-08-01

    The aim was to study the expression pattern of the porcine TLR family (TLR1-10) genes in gut-associated lymphoid tissues (GALT) of varying ages. A total of nine clinically healthy pigs of three ages group (1 day, 2 months and 5 months old) were selected for this experiment (three pigs in each group). Tissues from intestinal mucosa in stomach, duodenum, jejunum and ileum and mesenteric lymph node (MLN) were used. mRNA expression of TLRs (1-10) was detectable in all tissues and TLR3 showed the highest mRNA abundance among TLRs. TLR3 expression in stomach, and TLR1 and TLR6 expression in MLN were higher in adult than newborn pigs. The western blot results of TLR2, 3 and 9 in some cases, did not coincide with the mRNA expression results. The protein localization of TLR2, 3 and 9 showed that TLR expressing cells were abundant in the lamina propria, Peyer's patches in intestine, and around and within the lymphoid follicles in the MLN. This expressions study sheds the first light on the expression patterns of all TLR genes in GALT at different ages of pigs. Copyright © 2013 Elsevier Ltd. All rights reserved.

  13. A common multiple cloning site in a set of vectors for expression of eukaryotic genes in mammalian, insect and bacterial cells

    DEFF Research Database (Denmark)

    Pallisgaard, N; Pedersen, FS; Birkelund, Svend

    1994-01-01

    a start Met codon was included in the same reading frame as in lambda gt11Sfi-Not to support expression of partial cDNA clones. Thus a cDNA insert of lambda gt11Sfi-Not could be shuttled among the new vectors for expression. The other set of vectors without a start codon were suitable for expression of c......DNA carrying their own start Met codon. By Western blot analysis and by transactivation of a reporter plasmid in co-transfections we show that cDNA is very efficiently expressed in NIH 3T3 cells under control of the elongation factor 1 alpha promoter....

  14. Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.

    Science.gov (United States)

    Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M

    2012-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.

  15. Setting the pace: host rhythmic behaviour and gene expression patterns in the facultatively symbiotic cnidarian Aiptasia are determined largely by Symbiodinium.

    Science.gov (United States)

    Sorek, Michal; Schnytzer, Yisrael; Ben-Asher, Hiba Waldman; Caspi, Vered Chalifa; Chen, Chii-Shiarng; Miller, David J; Levy, Oren

    2018-05-09

    All organisms employ biological clocks to anticipate physical changes in the environment; however, the integration of biological clocks in symbiotic systems has received limited attention. In corals, the interpretation of rhythmic behaviours is complicated by the daily oscillations in tissue oxygen tension resulting from the photosynthetic and respiratory activities of the associated algal endosymbiont Symbiodinium. In order to better understand the integration of biological clocks in cnidarian hosts of Symbiodinium, daily rhythms of behaviour and gene expression were studied in symbiotic and aposymbiotic morphs of the sea-anemone Aiptasia diaphana. The results showed that whereas circatidal (approx. 12-h) cycles of activity and gene expression predominated in aposymbiotic morphs, circadian (approx. 24-h) patterns were the more common in symbiotic morphs, where the expression of a significant number of genes shifted from a 12- to 24-h rhythm. The behavioural experiments on symbiotic A. diaphana displayed diel (24-h) rhythmicity in body and tentacle contraction under the light/dark cycles, whereas aposymbiotic morphs showed approximately 12-h (circatidal) rhythmicity. Reinfection experiments represent an important step in understanding the hierarchy of endogenous clocks in symbiotic associations, where the aposymbiotic Aiptasia morphs returned to a 24-h behavioural rhythm after repopulation with algae. Whilst some modification of host metabolism is to be expected, the extent to which the presence of the algae modified host endogenous behavioural and transcriptional rhythms implies that it is the symbionts that influence the pace. Our results clearly demonstrate the importance of the endosymbiotic algae in determining the timing and the duration of the extension and contraction of the body and tentacles and temporal gene expression.

  16. IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering.

    Science.gov (United States)

    Wu, Lingxiang; Chen, Xiujie; Zhang, Denan; Zhang, Wubing; Liu, Lei; Ma, Hongzhe; Yang, Jingbo; Xie, Hongbo; Liu, Bo; Jin, Qing

    2016-01-01

    Analysis of gene sets has been widely applied in various high-throughput biological studies. One weakness in the traditional methods is that they neglect the heterogeneity of genes expressions in samples which may lead to the omission of some specific and important gene sets. It is also difficult for them to reflect the severities of disease and provide expression profiles of gene sets for individuals. We developed an application software called IGSA that leverages a powerful analytical capacity in gene sets enrichment and samples clustering. IGSA calculates gene sets expression scores for each sample and takes an accumulating clustering strategy to let the samples gather into the set according to the progress of disease from mild to severe. We focus on gastric, pancreatic and ovarian cancer data sets for the performance of IGSA. We also compared the results of IGSA in KEGG pathways enrichment with David, GSEA, SPIA, ssGSEA and analyzed the results of IGSA clustering and different similarity measurement methods. Notably, IGSA is proved to be more sensitive and specific in finding significant pathways, and can indicate related changes in pathways with the severity of disease. In addition, IGSA provides with significant gene sets profile for each sample.

  17. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.

  18. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782

  19. Gene expression analysis by cDNA-AFLP highlights a set of new signaling networks and translational control during seed dormancy breaking in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Bove, Jérôme; Lucas, Philippe; Godin, Béatrice; Ogé, Laurent; Jullien, Marc; Grappin, Philippe

    2005-03-01

    Seed dormancy in Nicotiana plumbaginifolia is characterized by an abscisic acid accumulation linked to a pronounced germination delay. Dormancy can be released by 1 year after-ripening treatment. Using a cDNA-amplified fragment length polymorphism (cDNA-AFLP) approach we compared the gene expression patterns of dormant and after-ripened seeds, air-dry or during one day imbibition and analyzed 15,000 cDNA fragments. Among them 1020 were found to be differentially regulated by dormancy. Of 412 sequenced cDNA fragments, 83 were assigned to a known function by search similarities to public databases. The functional categories of the identified dormancy maintenance and breaking responsive genes, give evidence that after-ripening turns in the air-dry seed to a new developmental program that modulates, at the RNA level, components of translational control, signaling networks, transcriptional control and regulated proteolysis.

  20. Gene expression in colorectal cancer

    DEFF Research Database (Denmark)

    Birkenkamp-Demtroder, Karin; Christensen, Lise Lotte; Olesen, Sanne Harder

    2002-01-01

    Understanding molecular alterations in colorectal cancer (CRC) is needed to define new biomarkers and treatment targets. We used oligonucleotide microarrays to monitor gene expression of about 6,800 known genes and 35,000 expressed sequence tags (ESTs) on five pools (four to six samples in each...... pool) of total RNA from left-sided sporadic colorectal carcinomas. We compared normal tissue to carcinoma tissue from Dukes' stages A-D (noninvasive to distant metastasis) and identified 908 known genes and 4,155 ESTs that changed remarkably from normal to tumor tissue. Based on intensive filtering 226...

  1. Correction of gene expression data

    DEFF Research Database (Denmark)

    Darbani Shirvanehdeh, Behrooz; Stewart, C. Neal, Jr.; Noeparvar, Shahin

    2014-01-01

    This report investigates for the first time the potential inter-treatment bias source of cell number for gene expression studies. Cell-number bias can affect gene expression analysis when comparing samples with unequal total cellular RNA content or with different RNA extraction efficiencies....... For maximal reliability of analysis, therefore, comparisons should be performed at the cellular level. This could be accomplished using an appropriate correction method that can detect and remove the inter-treatment bias for cell-number. Based on inter-treatment variations of reference genes, we introduce...

  2. Principles for the organization of gene-sets.

    Science.gov (United States)

    Li, Wentian; Freudenberg, Jan; Oswald, Michaela

    2015-12-01

    A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Mining gene expression data of multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Pi Guo

    Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.

  4. Reprogramming the body weight set point by a reciprocal interaction of hypothalamic leptin sensitivity and Pomc gene expression reverts extreme obesity

    Directory of Open Access Journals (Sweden)

    Kavaljit H. Chhabra

    2016-10-01

    Conclusions: Pomc reactivation in previously obese, calorie-restricted ArcPomc−/− mice normalized energy homeostasis, suggesting that their body weight set point was restored to control levels. In contrast, massively obese and hyperleptinemic ArcPomc−/− mice or those weight-matched and treated with PASylated leptin to maintain extreme hyperleptinemia prior to Pomc reactivation converged to an intermediate set point relative to lean control and obese ArcPomc−/− mice. We conclude that restoration of hypothalamic leptin sensitivity and Pomc expression is necessary for obese ArcPomc−/− mice to achieve and sustain normal metabolic homeostasis; whereas deficits in either parameter set a maladaptive allostatic balance that defends increased adiposity and body weight.

  5. Reference Gene Screening for Analyzing Gene Expression Across Goat Tissue

    Directory of Open Access Journals (Sweden)

    Yu Zhang

    2013-12-01

    Full Text Available Real-time quantitative PCR (qRT-PCR is one of the important methods for investigating the changes in mRNA expression levels in cells and tissues. Selection of the proper reference genes is very important when calibrating the results of real-time quantitative PCR. Studies on the selection of reference genes in goat tissues are limited, despite the economic importance of their meat and dairy products. We used real-time quantitative PCR to detect the expression levels of eight reference gene candidates (18S, TBP, HMBS, YWHAZ, ACTB, HPRT1, GAPDH and EEF1A2 in ten tissues types sourced from Boer goats. The optimal reference gene combination was selected according to the results determined by geNorm, NormFinder and Bestkeeper software packages. The analyses showed that tissue is an important variability factor in genes expression stability. When all tissues were considered, 18S, TBP and HMBS is the optimal reference combination for calibrating quantitative PCR analysis of gene expression from goat tissues. Dividing data set by tissues, ACTB was the most stable in stomach, small intestine and ovary, 18S in heart and spleen, HMBS in uterus and lung, TBP in liver, HPRT1 in kidney and GAPDH in muscle. Overall, this study provided valuable information about the goat reference genes that can be used in order to perform a proper normalisation when relative quantification by qRT-PCR studies is undertaken.

  6. Using RNA-Seq data to select refence genes for normalizing gene expression in apple roots

    Science.gov (United States)

    Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for t...

  7. Codominant expression of genes coding for different sets of inducible salivary polypeptides associated with parotid hypertrophy in two inbred mouse strains.

    Science.gov (United States)

    López-Solís, Remigio O; Kemmerling, Ulrike

    2005-05-01

    Experimental mouse parotid hypertrophy has been associated with the expression of a number of isoproterenol-induced salivary proline-rich polypeptides (IISPs). Mouse salivary proline-rich proteins (PRPs) have been mapped both to chromosomes 6 and 8. Recently, mice of two inbred strains (A/Snell and A. Swiss) have been found to differ drastically in the IISPs. In this study, mice of both strains were used for cross-breeding experiments addressed to define the pattern of inheritance of the IISP phenotype and to establish whether the IISPs are coded on a single or on several chromosomes. The IISP phenotype of individual mice was assessed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) of whole saliva collected after three daily stimulations by isoproterenol. Parental A/Snell and A. Swiss mice were homogeneous for distinctive strain-associated IISP-patterns. First filial generation (F1) mice obtained from the cross of A/Snell with A. Swiss mice expressed with no exception both the A/Snell and A. Swiss IISPs (coexpression). In the second filial generation (F2) both parental IISP phenotypes reappeared together with a majority of mice expressing the F1-hybrid phenotype (1:2:1 ratio). Backcrosses of F1 x A/Snell and F1 x A. Swiss produced offsprings displaying the F1 and the corresponding parental phenotypes with a 1:1 ratio. No recombinants were observed among F2 mice or among mice resulting from backcrosses. Thus, genes coding for the IISPs that are expressed differentially in both mouse strains are located on the same chromosome, probably at the same locus (alleles) or at quite closely linked loci (nonalleles). 2005 Wiley-Liss, Inc

  8. Transgenic Arabidopsis Gene Expression System

    Science.gov (United States)

    Ferl, Robert; Paul, Anna-Lisa

    2009-01-01

    The Transgenic Arabidopsis Gene Expression System (TAGES) investigation is one in a pair of investigations that use the Advanced Biological Research System (ABRS) facility. TAGES uses Arabidopsis thaliana, thale cress, with sensor promoter-reporter gene constructs that render the plants as biomonitors (an organism used to determine the quality of the surrounding environment) of their environment using real-time nondestructive Green Fluorescent Protein (GFP) imagery and traditional postflight analyses.

  9. Using logistic regression to improve the prognostic value of microarray gene expression data sets: application to early-stage squamous cell carcinoma of the lung and triple negative breast carcinoma.

    Science.gov (United States)

    Mount, David W; Putnam, Charles W; Centouri, Sara M; Manziello, Ann M; Pandey, Ritu; Garland, Linda L; Martinez, Jesse D

    2014-06-10

    Numerous microarray-based prognostic gene expression signatures of primary neoplasms have been published but often with little concurrence between studies, thus limiting their clinical utility. We describe a methodology using logistic regression, which circumvents limitations of conventional Kaplan Meier analysis. We applied this approach to a thrice-analyzed and published squamous cell carcinoma (SQCC) of the lung data set, with the objective of identifying gene expressions predictive of early death versus long survival in early-stage disease. A similar analysis was applied to a data set of triple negative breast carcinoma cases, which present similar clinical challenges. Important to our approach is the selection of homogenous patient groups for comparison. In the lung study, we selected two groups (including only stages I and II), equal in size, of earliest deaths and longest survivors. Genes varying at least four-fold were tested by logistic regression for accuracy of prediction (area under a ROC plot). The gene list was refined by applying two sliding-window analyses and by validations using a leave-one-out approach and model building with validation subsets. In the breast study, a similar logistic regression analysis was used after selecting appropriate cases for comparison. A total of 8594 variable genes were tested for accuracy in predicting earliest deaths versus longest survivors in SQCC. After applying the two sliding window and the leave-one-out analyses, 24 prognostic genes were identified; most of them were B-cell related. When the same data set of stage I and II cases was analyzed using a conventional Kaplan Meier (KM) approach, we identified fewer immune-related genes among the most statistically significant hits; when stage III cases were included, most of the prognostic genes were missed. Interestingly, logistic regression analysis of the breast cancer data set identified many immune-related genes predictive of clinical outcome. Stratification of

  10. Human papillomavirus gene expression

    International Nuclear Information System (INIS)

    Chow, L.T.; Hirochika, H.; Nasseri, M.; Stoler, M.H.; Wolinsky, S.M.; Chin, M.T.; Hirochika, R.; Arvan, D.S.; Broker, T.R.

    1987-01-01

    To determine the role of tissue differentiation on expression of each of the papillomavirus mRNA species identified by electron microscopy, the authors prepared exon-specific RNA probes that could distinguish the alternatively spliced mRNA species. Radioactively labeled single-stranded RNA probes were generated from a dual promoter vector system and individually hybridized to adjacent serial sections of formalin-fixed, paraffin-embedded biopsies of condylomata. Autoradiography showed that each of the message species had a characteristic tissue distribution and relative abundance. The authors have characterized a portion of the regulatory network of the HPVs by showing that the E2 ORF encodes a trans-acting enhancer-stimulating protein, as it does in BPV-1 (Spalholz et al. 1985). The HPV-11 enhancer was mapped to a 150-bp tract near the 3' end of the URR. Portions of this region are duplicated in some aggressive strains of HPV-6 (Boshart and zur Hausen 1986; Rando et al. 1986). To test the possible biological relevance of these duplications, they cloned tandem arrays of the enhancer and demonstrated, using a chloramphenicol acetyltransferase (CAT) assay, that they led to dramatically increased transcription proportional to copy number. Using the CAT assays, the authors found that the E2 proteins of several papillomavirus types can cross-stimulate the enhancers of most other types. This suggests that prior infection of a tissue with one papillomavirus type may provide a helper effect for superinfection and might account fo the HPV-6/HPV-16 coinfections in condylomata that they have observed

  11. Homeobox gene expression in Brachiopoda

    DEFF Research Database (Denmark)

    Altenburger, Andreas; Martinez, Pedro; Wanninger, Andreas

    2011-01-01

    (ectoderm) specification with co-opted functions in notochord formation in chordates and left/right determination in ambulacrarians and vertebrates. The caudal ortholog, TtrCdx, is first expressed in the ectoderm of the gastrulating embryo in the posterior region of the blastopore. Its expression stays......The molecular control that underlies brachiopod ontogeny is largely unknown. In order to contribute to this issue we analyzed the expression pattern of two homeobox containing genes, Not and Cdx, during development of the rhynchonelliform (i.e., articulate) brachiopod Terebratalia transversa...... completion of larval development, which is marked by a three-lobed body with larval setae. Expression starts at gastrulation in two areas lateral to the blastopore and subsequently extends over the animal pole of the gastrula. With elongation of the gastrula, expression at the animal pole narrows to a small...

  12. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  13. Vascular Gene Expression: A Hypothesis

    Directory of Open Access Journals (Sweden)

    Angélica Concepción eMartínez-Navarro

    2013-07-01

    Full Text Available The phloem is the conduit through which photoassimilates are distributed from autotrophic to heterotrophic tissues and is involved in the distribution of signaling molecules that coordinate plant growth and responses to the environment. Phloem function depends on the coordinate expression of a large array of genes. We have previously identified conserved motifs in upstream regions of the Arabidopsis genes, encoding the homologs of pumpkin phloem sap mRNAs, displaying expression in vascular tissues. This tissue-specific expression in Arabidopsis is predicted by the overrepresentation of GA/CT-rich motifs in gene promoters. In this work we have searched for common motifs in upstream regions of the homologous genes from plants considered to possess a primitive vascular tissue (a lycophyte, as well as from others that lack a true vascular tissue (a bryophyte, and finally from chlorophytes. Both lycophyte and bryophyte display motifs similar to those found in Arabidopsis with a significantly low E-value, while the chlorophytes showed either a different conserved motif or no conserved motif at all. These results suggest that these same genes are expressed coordinately in non- vascular plants; this coordinate expression may have been one of the prerequisites for the development of conducting tissues in plants. We have also analyzed the phylogeny of conserved proteins that may be involved in phloem function and development. The presence of CmPP16, APL, FT and YDA in chlorophytes suggests the recruitment of ancient regulatory networks for the development of the vascular tissue during evolution while OPS is a novel protein specific to vascular plants.

  14. Neighboring Genes Show Correlated Evolution in Gene Expression

    Science.gov (United States)

    Ghanbarian, Avazeh T.; Hurst, Laurence D.

    2015-01-01

    When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543

  15. Gene expression profile of pulpitis.

    Science.gov (United States)

    Galicia, J C; Henson, B R; Parker, J S; Khan, A A

    2016-06-01

    The cost, prevalence and pain associated with endodontic disease necessitate an understanding of the fundamental molecular aspects of its pathogenesis. This study was aimed to identify the genetic contributors to pulpal pain and inflammation. Inflamed pulps were collected from patients diagnosed with irreversible pulpitis (n=20). Normal pulps from teeth extracted for various reasons served as controls (n=20). Pain level was assessed using a visual analog scale (VAS). Genome-wide microarray analysis was performed using Affymetrix GeneTitan Multichannel Instrument. The difference in gene expression levels were determined by the significance analysis of microarray program using a false discovery rate (q-value) of 5%. Genes involved in immune response, cytokine-cytokine receptor interaction and signaling, integrin cell surface interactions, and others were expressed at relatively higher levels in the pulpitis group. Moreover, several genes known to modulate pain and inflammation showed differential expression in asymptomatic and mild pain patients (⩾30 mm on VAS) compared with those with moderate to severe pain. This exploratory study provides a molecular basis for the clinical diagnosis of pulpitis. With an enhanced understanding of pulpal inflammation, future studies on treatment and management of pulpitis and on pain associated with it can have a biological reference to bridge treatment strategies with pulpal biology.

  16. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  17. Improved gene expression signature of testicular carcinoma in situ

    DEFF Research Database (Denmark)

    Almstrup, Kristian; Leffers, Henrik; Lothe, Ragnhild A

    2007-01-01

    on global gene expression in testicular CIS have been previously published. We have merged the two data sets on CIS samples (n = 6) and identified the shared gene expression signature in relation to expression in normal testis. Among the top-20 highest expressed genes, one-third was transcription factors...... development' were significantly altered and could collectively affect cellular pathways like the WNT signalling cascade, which thus may be disrupted in testicular CIS. The merged CIS data from two different microarray platforms, to our knowledge, provide the most precise CIS gene expression signature to date....

  18. Using gene expression noise to understand gene regulation

    NARCIS (Netherlands)

    Munsky, B.; Neuert, G.; van Oudenaarden, A.

    2012-01-01

    Phenotypic variation is ubiquitous in biology and is often traceable to underlying genetic and environmental variation. However, even genetically identical cells in identical environments display variable phenotypes. Stochastic gene expression, or gene expression "noise," has been suggested as a

  19. Genetic Variants Contribute to Gene Expression Variability in Humans

    Science.gov (United States)

    Hulse, Amanda M.; Cai, James J.

    2013-01-01

    Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies focused on the mean of gene expression level, but not the variance of gene expression level (i.e., gene expression variability). In the present study, we systematically explore genome-wide association between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously fit the means and the variances of gene expression among the three possible genotypes of a biallelic SNP. The genomic loci showing significant association between the variances of gene expression and the genotypes are termed expression variability QTL (evQTL). Using a data set of gene expression in lymphoblastoid cell lines (LCLs) derived from 210 HapMap individuals, we identify cis-acting evQTL involving 218 distinct genes, among which 8 genes, ADCY1, CTNNA2, DAAM2, FERMT2, IL6, PLOD2, SNX7, and TNFRSF11B, are cross-validated using an extra expression data set of the same LCLs. We also identify ∼300 trans-acting evQTL between >13,000 common SNPs and 500 randomly selected representative genes. We employ two distinct scenarios, emphasizing single-SNP and multiple-SNP effects on expression variability, to explain the formation of evQTL. We argue that detecting evQTL may represent a novel method for effectively screening for genetic interactions, especially when the multiple-SNP influence on expression variability is implied. The implication of our results for revealing genetic mechanisms of gene expression variability is discussed. PMID:23150607

  20. Gene set analysis for interpreting genetic studies

    DEFF Research Database (Denmark)

    Pers, Tune H

    2016-01-01

    Interpretation of genome-wide association study (GWAS) results is lacking behind the discovery of new genetic associations. Consequently, there is an urgent need for data-driven methods for interpreting genetic association studies. Gene set analysis (GSA) can identify aetiologic pathways...

  1. A constructive approach to gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2004-01-01

    Recently, experiments on mRNA abundance (gene expression) have revealed that gene expression shows a stationary organization described by a scale-free distribution. Here we propose a constructive approach to gene expression dynamics which restores the scale-free exponent and describes the intermediate state dynamics. This approach requires only one assumption: Markov property

  2. Expression profiling identifies genes involved in emphysema severity

    Directory of Open Access Journals (Sweden)

    Bowman Rayleen V

    2009-09-01

    Full Text Available Abstract Chronic obstructive pulmonary disease (COPD is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p p Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.

  3. Blood Gene Expression Predicts Bronchiolitis Obliterans Syndrome

    Directory of Open Access Journals (Sweden)

    Richard Danger

    2018-01-01

    Full Text Available Bronchiolitis obliterans syndrome (BOS, the main manifestation of chronic lung allograft dysfunction, leads to poor long-term survival after lung transplantation. Identifying predictors of BOS is essential to prevent the progression of dysfunction before irreversible damage occurs. By using a large set of 107 samples from lung recipients, we performed microarray gene expression profiling of whole blood to identify early biomarkers of BOS, including samples from 49 patients with stable function for at least 3 years, 32 samples collected at least 6 months before BOS diagnosis (prediction group, and 26 samples at or after BOS diagnosis (diagnosis group. An independent set from 25 lung recipients was used for validation by quantitative PCR (13 stables, 11 in the prediction group, and 8 in the diagnosis group. We identified 50 transcripts differentially expressed between stable and BOS recipients. Three genes, namely POU class 2 associating factor 1 (POU2AF1, T-cell leukemia/lymphoma protein 1A (TCL1A, and B cell lymphocyte kinase, were validated as predictive biomarkers of BOS more than 6 months before diagnosis, with areas under the curve of 0.83, 0.77, and 0.78 respectively. These genes allow stratification based on BOS risk (log-rank test p < 0.01 and are not associated with time posttransplantation. This is the first published large-scale gene expression analysis of blood after lung transplantation. The three-gene blood signature could provide clinicians with new tools to improve follow-up and adapt treatment of patients likely to develop BOS.

  4. Modulation of gene expression made easy

    DEFF Research Database (Denmark)

    Solem, Christian; Jensen, Peter Ruhdal

    2002-01-01

    A new approach for modulating gene expression, based on randomization of promoter (spacer) sequences, was developed. The method was applied to chromosomal genes in Lactococcus lactis and shown to generate libraries of clones with broad ranges of expression levels of target genes. In one example...... that the method can be applied to modulating the expression of native genes on the chromosome. We constructed a series of strains in which the expression of the las operon, containing the genes pfk, pyk, and ldh, was modulated by integrating a truncated copy of the pfk gene. Importantly, the modulation affected...

  5. DNA methylation polymorphism in a set of elite rice cultivars and its possible contribution to inter-cultivar differential gene expression.

    Science.gov (United States)

    Wang, Yongming; Lin, Xiuyun; Dong, Bo; Wang, Yingdian; Liu, Bao

    2004-01-01

    RAPD (randomly amplified polymorphic DNA) and ISSR (inter-simple sequence repeat) fingerprinting on HpaII/MspI-digested genomic DNA of nine elite japonica rice cultivars implies inter-cultivar DNA methylation polymorphism. Using both DNA fragments isolated from RAPD or ISSR gels and selected low-copy sequences as probes, methylation-sensitive Southern blot analysis confirms the existence of extensive DNA methylation polymorphism in both genes and DNA repeats among the rice cultivars. The cultivar-specific methylation patterns are stably maintained, and can be used as reliable molecular markers. Transcriptional analysis of four selected sequences (RdRP, AC9, HSP90 and MMR) on leaves and roots from normal and 5-azacytidine-treated seedlings of three representative cultivars shows an association between the transcriptional activity of one of the genes, the mismatch repair (MMR) gene, and its CG methylation patterns.

  6. Adaptive Evolution of Gene Expression in Drosophila

    Directory of Open Access Journals (Sweden)

    Armita Nourmohammad

    2017-08-01

    Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.

  7. Gene Expression and the Diversity of Identified Neurons

    OpenAIRE

    Buck, L.; Stein, R.; Palazzolo, M.; Anderson, D. J.; Axel, R.

    1983-01-01

    Nervous systems consist of diverse populations of neurons that are anatomically and functionally distinct. The diversity of neurons and the precision with which they are interconnected suggest that specific genes or sets of genes are activated in some neurons but not expressed in others. Experimentally, this problem may be considered at two levels. First, what is the total number of genes expressed in the brain, and how are they distributed among the different populations of neurons? Second, ...

  8. Characterization of differentially expressed genes using high-dimensional co-expression networks

    DEFF Research Database (Denmark)

    Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

    2010-01-01

    We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...

  9. Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data

    OpenAIRE

    Ezer, Daphne; Moignard, Victoria; G?ttgens, Berthold; Adryan, Boris

    2016-01-01

    Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete ...

  10. The evolution of gene expression in primates

    OpenAIRE

    Tashakkori Ghanbarian, Avazeh

    2015-01-01

    The evolution of a gene’s expression profile is commonly assumed to be independent of its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between expression of neighboring genes in extant taxa. Indeed, in all eukaryotic genomes, genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their e...

  11. GOBO: gene expression-based outcome for breast cancer online.

    Directory of Open Access Journals (Sweden)

    Markus Ringnér

    Full Text Available Microarray-based gene expression analysis holds promise of improving prognostication and treatment decisions for breast cancer patients. However, the heterogeneity of breast cancer emphasizes the need for validation of prognostic gene signatures in larger sample sets stratified into relevant subgroups. Here, we describe a multifunctional user-friendly online tool, GOBO (http://co.bmc.lu.se/gobo, allowing a range of different analyses to be performed in an 1881-sample breast tumor data set, and a 51-sample breast cancer cell line set, both generated on Affymetrix U133A microarrays. GOBO supports a wide range of applications including: 1 rapid assessment of gene expression levels in subgroups of breast tumors and cell lines, 2 identification of co-expressed genes for creation of potential metagenes, 3 association with outcome for gene expression levels of single genes, sets of genes, or gene signatures in multiple subgroups of the 1881-sample breast cancer data set. The design and implementation of GOBO facilitate easy incorporation of additional query functions and applications, as well as additional data sets irrespective of tumor type and array platform.

  12. Automated discovery of functional generality of human gene expression programs.

    Directory of Open Access Journals (Sweden)

    Georg K Gerber

    2007-08-01

    Full Text Available An important research problem in computational biology is the identification of expression programs, sets of co-expressed genes orchestrating normal or pathological processes, and the characterization of the functional breadth of these programs. The use of human expression data compendia for discovery of such programs presents several challenges including cellular inhomogeneity within samples, genetic and environmental variation across samples, uncertainty in the numbers of programs and sample populations, and temporal behavior. We developed GeneProgram, a new unsupervised computational framework based on Hierarchical Dirichlet Processes that addresses each of the above challenges. GeneProgram uses expression data to simultaneously organize tissues into groups and genes into overlapping programs with consistent temporal behavior, to produce maps of expression programs, which are sorted by generality scores that exploit the automatically learned groupings. Using synthetic and real gene expression data, we showed that GeneProgram outperformed several popular expression analysis methods. We applied GeneProgram to a compendium of 62 short time-series gene expression datasets exploring the responses of human cells to infectious agents and immune-modulating molecules. GeneProgram produced a map of 104 expression programs, a substantial number of which were significantly enriched for genes involved in key signaling pathways and/or bound by NF-kappaB transcription factors in genome-wide experiments. Further, GeneProgram discovered expression programs that appear to implicate surprising signaling pathways or receptor types in the response to infection, including Wnt signaling and neurotransmitter receptors. We believe the discovered map of expression programs involved in the response to infection will be useful for guiding future biological experiments; genes from programs with low generality scores might serve as new drug targets that exhibit minimal

  13. Integrative analysis of survival-associated gene sets in breast cancer.

    Science.gov (United States)

    Varn, Frederick S; Ung, Matthew H; Lou, Shao Ke; Cheng, Chao

    2015-03-12

    Patient gene expression information has recently become a clinical feature used to evaluate breast cancer prognosis. The emergence of prognostic gene sets that take advantage of these data has led to a rich library of information that can be used to characterize the molecular nature of a patient's cancer. Identifying robust gene sets that are consistently predictive of a patient's clinical outcome has become one of the main challenges in the field. We inputted our previously established BASE algorithm with patient gene expression data and gene sets from MSigDB to develop the gene set activity score (GSAS), a metric that quantitatively assesses a gene set's activity level in a given patient. We utilized this metric, along with patient time-to-event data, to perform survival analyses to identify the gene sets that were significantly correlated with patient survival. We then performed cross-dataset analyses to identify robust prognostic gene sets and to classify patients by metastasis status. Additionally, we created a gene set network based on component gene overlap to explore the relationship between gene sets derived from MSigDB. We developed a novel gene set based on this network's topology and applied the GSAS metric to characterize its role in patient survival. Using the GSAS metric, we identified 120 gene sets that were significantly associated with patient survival in all datasets tested. The gene overlap network analysis yielded a novel gene set enriched in genes shared by the robustly predictive gene sets. This gene set was highly correlated to patient survival when used alone. Most interestingly, removal of the genes in this gene set from the gene pool on MSigDB resulted in a large reduction in the number of predictive gene sets, suggesting a prominent role for these genes in breast cancer progression. The GSAS metric provided a useful medium by which we systematically investigated how gene sets from MSigDB relate to breast cancer patient survival. We used

  14. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  15. Identification and validation of suitable endogenous reference genes for gene expression studies in human peripheral blood

    Directory of Open Access Journals (Sweden)

    Turner Renee J

    2009-08-01

    Full Text Available Abstract Background Gene expression studies require appropriate normalization methods. One such method uses stably expressed reference genes. Since suitable reference genes appear to be unique for each tissue, we have identified an optimal set of the most stably expressed genes in human blood that can be used for normalization. Methods Whole-genome Affymetrix Human 2.0 Plus arrays were examined from 526 samples of males and females ages 2 to 78, including control subjects and patients with Tourette syndrome, stroke, migraine, muscular dystrophy, and autism. The top 100 most stably expressed genes with a broad range of expression levels were identified. To validate the best candidate genes, we performed quantitative RT-PCR on a subset of 10 genes (TRAP1, DECR1, FPGS, FARP1, MAPRE2, PEX16, GINS2, CRY2, CSNK1G2 and A4GALT, 4 commonly employed reference genes (GAPDH, ACTB, B2M and HMBS and PPIB, previously reported to be stably expressed in blood. Expression stability and ranking analysis were performed using GeNorm and NormFinder algorithms. Results Reference genes were ranked based on their expression stability and the minimum number of genes needed for nomalization as calculated using GeNorm showed that the fewest, most stably expressed genes needed for acurate normalization in RNA expression studies of human whole blood is a combination of TRAP1, FPGS, DECR1 and PPIB. We confirmed the ranking of the best candidate control genes by using an alternative algorithm (NormFinder. Conclusion The reference genes identified in this study are stably expressed in whole blood of humans of both genders with multiple disease conditions and ages 2 to 78. Importantly, they also have different functions within cells and thus should be expressed independently of each other. These genes should be useful as normalization genes for microarray and RT-PCR whole blood studies of human physiology, metabolism and disease.

  16. Simple Comparative Analyses of Differentially Expressed Gene Lists May Overestimate Gene Overlap.

    Science.gov (United States)

    Lawhorn, Chelsea M; Schomaker, Rachel; Rowell, Jonathan T; Rueppell, Olav

    2018-04-16

    Comparing the overlap between sets of differentially expressed genes (DEGs) within or between transcriptome studies is regularly used to infer similarities between biological processes. Significant overlap between two sets of DEGs is usually determined by a simple test. The number of potentially overlapping genes is compared to the number of genes that actually occur in both lists, treating every gene as equal. However, gene expression is controlled by transcription factors that bind to a variable number of transcription factor binding sites, leading to variation among genes in general variability of their expression. Neglecting this variability could therefore lead to inflated estimates of significant overlap between DEG lists. With computer simulations, we demonstrate that such biases arise from variation in the control of gene expression. Significant overlap commonly arises between two lists of DEGs that are randomly generated, assuming that the control of gene expression is variable among genes but consistent between corresponding experiments. More overlap is observed when transcription factors are specific to their binding sites and when the number of genes is considerably higher than the number of different transcription factors. In contrast, overlap between two DEG lists is always lower than expected when the genetic architecture of expression is independent between the two experiments. Thus, the current methods for determining significant overlap between DEGs are potentially confounding biologically meaningful overlap with overlap that arises due to variability in control of expression among genes, and more sophisticated approaches are needed.

  17. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-01-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  18. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-05-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  19. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

    2012-05-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  20. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy; Bachkirova, Elena; Rey, Michael

    2013-10-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  1. Clinicopathologic and gene expression parameters predict liver cancer prognosis

    International Nuclear Information System (INIS)

    Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

    2011-01-01

    The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome

  2. Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

    Directory of Open Access Journals (Sweden)

    Benjamin Mayne

    2016-10-01

    Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.

  3. GeneTopics - interpretation of gene sets via literature-driven topic models

    Science.gov (United States)

    2013-01-01

    Background Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly

  4. Delimiting Coalescence Genes (C-Genes) in Phylogenomic Data Sets.

    Science.gov (United States)

    Springer, Mark S; Gatesy, John

    2018-02-26

    coalescence methods have emerged as a popular alternative for inferring species trees with large genomic datasets, because these methods explicitly account for incomplete lineage sorting. However, statistical consistency of summary coalescence methods is not guaranteed unless several model assumptions are true, including the critical assumption that recombination occurs freely among but not within coalescence genes (c-genes), which are the fundamental units of analysis for these methods. Each c-gene has a single branching history, and large sets of these independent gene histories should be the input for genome-scale coalescence estimates of phylogeny. By contrast, numerous studies have reported the results of coalescence analyses in which complete protein-coding sequences are treated as c-genes even though exons for these loci can span more than a megabase of DNA. Empirical estimates of recombination breakpoints suggest that c-genes may be much shorter, especially when large clades with many species are the focus of analysis. Although this idea has been challenged recently in the literature, the inverse relationship between c-gene size and increased taxon sampling in a dataset-the 'recombination ratchet'-is a fundamental property of c-genes. For taxonomic groups characterized by genes with long intron sequences, complete protein-coding sequences are likely not valid c-genes and are inappropriate units of analysis for summary coalescence methods unless they occur in recombination deserts that are devoid of incomplete lineage sorting (ILS). Finally, it has been argued that coalescence methods are robust when the no-recombination within loci assumption is violated, but recombination must matter at some scale because ILS, a by-product of recombination, is the raison d'etre for coalescence methods. That is, extensive recombination is required to yield the large number of independently segregating c-genes used to infer a species tree. If coalescent methods are powerful

  5. Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    International Nuclear Information System (INIS)

    Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

    2007-01-01

    Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients

  6. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  7. Gene expression inference with deep learning.

    Science.gov (United States)

    Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui

    2016-06-15

    Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX CONTACT: xhx@ics.uci.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Determinants of human adipose tissue gene expression

    DEFF Research Database (Denmark)

    Viguerie, Nathalie; Montastier, Emilie; Maoret, Jean-José

    2012-01-01

    weight maintenance diets. For 175 genes, opposite regulation was observed during calorie restriction and weight maintenance phases, independently of variations in body weight. Metabolism and immunity genes showed inverse profiles. During the dietary intervention, network-based analyses revealed strong...... interconnection between expression of genes involved in de novo lipogenesis and components of the metabolic syndrome. Sex had a marked influence on AT expression of 88 transcripts, which persisted during the entire dietary intervention and after control for fat mass. In women, the influence of body mass index...... on expression of a subset of genes persisted during the dietary intervention. Twenty-two genes revealed a metabolic syndrome signature common to men and women. Genetic control of AT gene expression by cis signals was observed for 46 genes. Dietary intervention, sex, and cis genetic variants independently...

  9. Deriving Trading Rules Using Gene Expression Programming

    Directory of Open Access Journals (Sweden)

    Adrian VISOIU

    2011-01-01

    Full Text Available This paper presents how buy and sell trading rules are generated using gene expression programming with special setup. Market concepts are presented and market analysis is discussed with emphasis on technical analysis and quantitative methods. The use of genetic algorithms in deriving trading rules is presented. Gene expression programming is applied in a form where multiple types of operators and operands are used. This gives birth to multiple gene contexts and references between genes in order to keep the linear structure of the gene expression programming chromosome. The setup of multiple gene contexts is presented. The case study shows how to use the proposed gene setup to derive trading rules encoded by Boolean expressions, using a dataset with the reference exchange rates between the Euro and the Romanian leu. The conclusions highlight the positive results obtained in deriving useful trading rules.

  10. Detecting microRNA activity from gene expression data

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-05-18

    Abstract Background MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. Results Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. Conclusions We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  11. Detecting microRNA activity from gene expression data.

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-01-01

    BACKGROUND: MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. RESULTS: Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. CONCLUSIONS: We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  12. Polycistronic gene expression in Aspergillus niger.

    Science.gov (United States)

    Schuetze, Tabea; Meyer, Vera

    2017-09-25

    Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at

  13. Profiling Gene Expression in Germinating Brassica Roots.

    Science.gov (United States)

    Park, Myoung Ryoul; Wang, Yi-Hong; Hasenstein, Karl H

    2014-01-01

    Based on previously developed solid-phase gene extraction (SPGE) we examined the mRNA profile in primary roots of Brassica rapa seedlings for highly expressed genes like ACT7 (actin7), TUB (tubulin1), UBQ (ubiquitin), and low expressed GLK (glucokinase) during the first day post-germination. The assessment was based on the mRNA load of the SPGE probe of about 2.1 ng. The number of copies of the investigated genes changed spatially along the length of primary roots. The expression level of all genes differed significantly at each sample position. Among the examined genes ACT7 expression was most even along the root. UBQ was highest at the tip and root-shoot junction (RS). TUB and GLK showed a basipetal gradient. The temporal expression of UBQ was highest in the MZ 9 h after primary root emergence and higher than at any other sample position. Expressions of GLK in EZ and RS increased gradually over time. SPGE extraction is the result of oligo-dT and oligo-dA hybridization and the results illustrate that SPGE can be used for gene expression profiling at high spatial and temporal resolution. SPGE needles can be used within two weeks when stored at 4 °C. Our data indicate that gene expression studies that are based on the entire root miss important differences in gene expression that SPGE is able to resolve for example growth adjustments during gravitropism.

  14. Ranking metrics in gene set enrichment analysis: do they matter?

    Science.gov (United States)

    Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

    2017-05-12

    There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner

  15. Gene Expression Analysis of Four Radiation-resistant Bacteria

    OpenAIRE

    Gao, Na; Ma, Bin-Guang; Zhang, Yu-Sheng; Song, Qin; Chen, Ling-Ling; Zhang, Hong-Yu

    2009-01-01

    To investigate the general radiation-resistant mechanisms of bacteria, bioinformatic method was employed to predict highly expressed genes for four radiation-resistant bacteria, i.e. Deinococcus geothermalis (D. geo), Deinococcus radiodurans (D. rad), Kineococcus radiotolerans (K. rad) and Rubrobacter xylanophilus (R. xyl). It is revealed that most of the three reference gene sets, i.e. ribosomal proteins, transcription factors and major chaperones, are generally highly expressed in the four ...

  16. Chromatin loops, gene positioning, and gene expression

    NARCIS (Netherlands)

    Holwerda, S.; de Laat, W.

    2012-01-01

    Technological developments and intense research over the last years have led to a better understanding of the 3D structure of the genome and its influence on genome function inside the cell nucleus. We will summarize topological studies performed on four model gene loci: the alpha- and beta-globin

  17. Gene expression profiles in stages II and III colon cancers

    DEFF Research Database (Denmark)

    Thorsteinsson, Morten; Kirkeby, Lene T; Hansen, Raino

    2012-01-01

    PURPOSE: A 128-gene signature has been proposed to predict outcome in patients with stages II and III colorectal cancers. In the present study, we aimed to reproduce and validate the 128-gene signature in external and independent material. METHODS: Gene expression data from the original material...... were retrieved from the Gene Expression Omnibus (GEO) (n¿=¿111) in addition to a Danish data set (n¿=¿37). All patients had stages II and III colon cancers. A Prediction Analysis of Microarray classifier, based on the 128-gene signature and the original training set of stage I (n¿=¿65) and stage IV (n...... correctly predicted as stage IV-like, and the remaining patients were predicted as stage I-like and unclassifiable, respectively. Stage II patients could not be stratified. CONCLUSIONS: The 128-gene signature showed reproducibility in stage III colon cancer, but could not predict recurrence in stage II...

  18. Serial analysis of gene expression (SAGE)

    NARCIS (Netherlands)

    van Ruissen, Fred; Baas, Frank

    2007-01-01

    In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE

  19. Predicting cellular growth from gene expression signatures.

    Directory of Open Access Journals (Sweden)

    Edoardo M Airoldi

    2009-01-01

    Full Text Available Maintaining balanced growth in a changing environment is a fundamental systems-level challenge for cellular physiology, particularly in microorganisms. While the complete set of regulatory and functional pathways supporting growth and cellular proliferation are not yet known, portions of them are well understood. In particular, cellular proliferation is governed by mechanisms that are highly conserved from unicellular to multicellular organisms, and the disruption of these processes in metazoans is a major factor in the development of cancer. In this paper, we develop statistical methodology to identify quantitative aspects of the regulatory mechanisms underlying cellular proliferation in Saccharomyces cerevisiae. We find that the expression levels of a small set of genes can be exploited to predict the instantaneous growth rate of any cellular culture with high accuracy. The predictions obtained in this fashion are robust to changing biological conditions, experimental methods, and technological platforms. The proposed model is also effective in predicting growth rates for the related yeast Saccharomyces bayanus and the highly diverged yeast Schizosaccharomyces pombe, suggesting that the underlying regulatory signature is conserved across a wide range of unicellular evolution. We investigate the biological significance of the gene expression signature that the predictions are based upon from multiple perspectives: by perturbing the regulatory network through the Ras/PKA pathway, observing strong upregulation of growth rate even in the absence of appropriate nutrients, and discovering putative transcription factor binding sites, observing enrichment in growth-correlated genes. More broadly, the proposed methodology enables biological insights about growth at an instantaneous time scale, inaccessible by direct experimental methods. Data and tools enabling others to apply our methods are available at http://function.princeton.edu/growthrate.

  20. Biasogram: visualization of confounding technical bias in gene expression data

    DEFF Research Database (Denmark)

    Krzystanek, Marcin; Szallasi, Zoltan Imre; Eklund, Aron Charles

    2013-01-01

    Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors...... such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined...... by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may...

  1. Expression of Sox genes in tooth development.

    Science.gov (United States)

    Kawasaki, Katsushige; Kawasaki, Maiko; Watanabe, Momoko; Idrus, Erik; Nagai, Takahiro; Oommen, Shelly; Maeda, Takeyasu; Hagiwara, Nobuko; Que, Jianwen; Sharpe, Paul T; Ohazama, Atsushi

    2015-01-01

    Members of the Sox gene family play roles in many biological processes including organogenesis. We carried out comparative in situ hybridization analysis of seventeen sox genes (Sox1-14, 17, 18, 21) during murine odontogenesis from the epithelial thickening to the cytodifferentiation stages. Localized expression of five Sox genes (Sox6, 9, 13, 14 and 21) was observed in tooth bud epithelium. Sox13 showed restricted expression in the primary enamel knots. At the early bell stage, three Sox genes (Sox8, 11, 17 and 21) were expressed in pre-ameloblasts, whereas two others (Sox5 and 18) showed expression in odontoblasts. Sox genes thus showed a dynamic spatio-temporal expression during tooth development.

  2. Dynamic association rules for gene expression data analysis.

    Science.gov (United States)

    Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

    2015-10-14

    The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed

  3. Positron emission tomography imaging of gene expression

    International Nuclear Information System (INIS)

    Tang Ganghua

    2001-01-01

    The merging of molecular biology and nuclear medicine is developed into molecular nuclear medicine. Positron emission tomography (PET) of gene expression in molecular nuclear medicine has become an attractive area. Positron emission tomography imaging gene expression includes the antisense PET imaging and the reporter gene PET imaging. It is likely that the antisense PET imaging will lag behind the reporter gene PET imaging because of the numerous issues that have not yet to be resolved with this approach. The reporter gene PET imaging has wide application into animal experimental research and human applications of this approach will likely be reported soon

  4. MAGMA: generalized gene-set analysis of GWAS data.

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  5. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  6. Evaluation of suitable reference genes for gene expression studies in bovine muscular tissue

    Directory of Open Access Journals (Sweden)

    Dunner Susana

    2008-09-01

    Full Text Available Abstract Background Real-time reverse transcriptase quantitative polymerase chain reaction (real-time RTqPCR is a technique used to measure mRNA species copy number as a way to determine key genes involved in different biological processes. However, the expression level of these key genes may vary among tissues or cells not only as a consequence of differential expression but also due to different factors, including choice of reference genes to normalize the expression levels of the target genes; thus the selection of reference genes is critical for expression studies. For this purpose, ten candidate reference genes were investigated in bovine muscular tissue. Results The value of stability of ten candidate reference genes included in three groups was estimated: the so called 'classical housekeeping' genes (18S, GAPDH and ACTB, a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS and a third set of novel genes (SF3A1, EEF1A2 and CASC3. Three different statistical algorithms were used to rank the genes by their stability measures as produced by geNorm, NormFinder and Bestkeeper. The three methods tend to agree on the most stably expressed genes and the least in muscular tissue. EEF1A2 and HMBS followed by SF3A1, ACTB, and CASC3 can be considered as stable reference genes, and B2M, RPII, UBC and GAPDH would not be appropriate. Although the rRNA-18S stability measure seems to be within the range of acceptance, its use is not recommended because its synthesis regulation is not representative of mRNA levels. Conclusion Based on geNorm algorithm, we propose the use of three genes SF3A1, EEF1A2 and HMBS as references for normalization of real-time RTqPCR in muscle expression studies.

  7. Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

    Science.gov (United States)

    Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

    2017-09-01

    The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative

  8. Gene expression profiling of cutaneous wound healing

    Directory of Open Access Journals (Sweden)

    Wang Ena

    2007-02-01

    Full Text Available Abstract Background Although the sequence of events leading to wound repair has been described at the cellular and, to a limited extent, at the protein level this process has yet to be fully elucidated. Genome wide transcriptional analysis tools promise to further define the global picture of this complex progression of events. Study Design This study was part of a placebo-controlled double-blind clinical trial in which basal cell carcinomas were treated topically with an immunomodifier – toll-like receptor 7 agonist: imiquimod. The fourteen patients with basal cell carcinoma in the placebo arm of the trial received placebo treatment consisting solely of vehicle cream. A skin punch biopsy was obtained immediately before treatment and at the end of the placebo treatment (after 2, 4 or 8 days. 17.5K cDNA microarrays were utilized to profile the biopsy material. Results Four gene signatures whose expression changed relative to baseline (before wound induction by the pre-treatment biopsy were identified. The largest group was comprised predominantly of inflammatory genes whose expression was increased throughout the study. Two additional signatures were observed which included preferentially pro-inflammatory genes in the early post-treatment biopsies (2 days after pre-treatment biopsies and repair and angiogenesis genes in the later (4 to 8 days biopsies. The fourth and smallest set of genes was down-regulated throughout the study. Early in wound healing the expression of markers of both M1 and M2 macrophages were increased, but later M2 markers predominated. Conclusion The initial response to a cutaneous wound induces powerful transcriptional activation of pro-inflammatory stimuli which may alert the host defense. Subsequently and in the absence of infection, inflammation subsides and it is replaced by angiogenesis and remodeling. Understanding this transition which may be driven by a change from a mixed macrophage population to predominately M2

  9. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  10. GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

    Science.gov (United States)

    Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

    2016-12-05

    Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three

  11. A comparative gene expression database for invertebrates

    Directory of Open Access Journals (Sweden)

    Ormestad Mattias

    2011-08-01

    Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN projects.

  12. Adaptive Evolution of Gene Expression in Drosophila.

    Science.gov (United States)

    Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

    2017-08-08

    Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  14. Differential gene expression during Trypanosoma cruzi metacyclogenesis

    Directory of Open Access Journals (Sweden)

    Marco Aurelio Krieger

    1999-09-01

    Full Text Available The transformation of epimastigotes into metacyclic trypomastigotes involves changes in the pattern of expressed genes, resulting in important morphological and functional differences between these developmental forms of Trypanosoma cruzi. In order to identify and characterize genes involved in triggering the metacyclogenesis process and in conferring to metacyclic trypomastigotes their stage specific biological properties, we have developed a method allowing the isolation of genes specifically expressed when comparing two close related cell populations (representation of differential expression or RDE. The method is based on the PCR amplification of gene sequences selected by hybridizing and subtracting the populations in such a way that after some cycles of hybridization-amplification genes specific to a given population are highly enriched. The use of this method in the analysis of differential gene expression during T. cruzi metacyclogenesis (6 hr and 24 hr of differentiation and metacyclic trypomastigotes resulted in the isolation of several clones from each time point. Northern blot analysis showed that some genes are transiently expressed (6 hr and 24 hr differentiating cells, while others are present in differentiating cells and in metacyclic trypomastigotes. Nucleotide sequencing of six clones characterized so far showed that they do not display any homology to gene sequences available in the GeneBank.

  15. A longitudinal study of gene expression in healthy individuals

    Directory of Open Access Journals (Sweden)

    Tessier Michel

    2009-06-01

    Full Text Available Abstract Background The use of gene expression in venous blood either as a pharmacodynamic marker in clinical trials of drugs or as a diagnostic test requires knowledge of the variability in expression over time in healthy volunteers. Here we defined a normal range of gene expression over 6 months in the blood of four cohorts of healthy men and women who were stratified by age (22–55 years and > 55 years and gender. Methods Eleven immunomodulatory genes likely to play important roles in inflammatory conditions such as rheumatoid arthritis and infection in addition to four genes typically used as reference genes were examined by quantitative reverse transcription-polymerase chain reaction (qRT-PCR, as well as the full genome as represented by Affymetrix HG U133 Plus 2.0 microarrays. Results Gene expression levels as assessed by qRT-PCR and microarray were relatively stable over time with ~2% of genes as measured by microarray showing intra-subject differences over time periods longer than one month. Fifteen genes varied by gender. The eleven genes examined by qRT-PCR remained within a limited dynamic range for all individuals. Specifically, for the seven most stably expressed genes (CXCL1, HMOX1, IL1RN, IL1B, IL6R, PTGS2, and TNF, 95% of all samples profiled fell within 1.5–2.5 Ct, the equivalent of a 4- to 6-fold dynamic range. Two subjects who experienced severe adverse events of cancer and anemia, had microarray gene expression profiles that were distinct from normal while subjects who experienced an infection had only slightly elevated levels of inflammatory markers. Conclusion This study defines the range and variability of gene expression in healthy men and women over a six-month period. These parameters can be used to estimate the number of subjects needed to observe significant differences from normal gene expression in clinical studies. A set of genes that varied by gender was also identified as were a set of genes with elevated

  16. Screening for interaction effects in gene expression data.

    Directory of Open Access Journals (Sweden)

    Peter J Castaldi

    Full Text Available Expression quantitative trait (eQTL studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.

  17. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  18. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  19. Gene Expression Signature in Endemic Osteoarthritis by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Xi Wang

    2015-05-01

    Full Text Available Kashin-Beck Disease (KBD is an endemic osteochondropathy with an unknown pathogenesis. Diagnosis of KBD is effective only in advanced cases, which eliminates the possibility of early treatment and leads to an inevitable exacerbation of symptoms. Therefore, we aim to identify an accurate blood-based gene signature for the detection of KBD. Previously published gene expression profile data on cartilage and peripheral blood mononuclear cells (PBMCs from adults with KBD were compared to select potential target genes. Microarray analysis was conducted to evaluate the expression of the target genes in a cohort of 100 KBD patients and 100 healthy controls. A gene expression signature was identified using a training set, which was subsequently validated using an independent test set with a minimum redundancy maximum relevance (mRMR algorithm and support vector machine (SVM algorithm. Fifty unique genes were differentially expressed between KBD patients and healthy controls. A 20-gene signature was identified that distinguished between KBD patients and controls with 90% accuracy, 85% sensitivity, and 95% specificity. This study identified a 20-gene signature that accurately distinguishes between patients with KBD and controls using peripheral blood samples. These results promote the further development of blood-based genetic biomarkers for detection of KBD.

  20. Stochastic gene expression in Arabidopsis thaliana.

    Science.gov (United States)

    Araújo, Ilka Schultheiß; Pietsch, Jessica Magdalena; Keizer, Emma Mathilde; Greese, Bettina; Balkunde, Rachappa; Fleck, Christian; Hülskamp, Martin

    2017-12-14

    Although plant development is highly reproducible, some stochasticity exists. This developmental stochasticity may be caused by noisy gene expression. Here we analyze the fluctuation of protein expression in Arabidopsis thaliana. Using the photoconvertible KikGR marker, we show that the protein expressions of individual cells fluctuate over time. A dual reporter system was used to study extrinsic and intrinsic noise of marker gene expression. We report that extrinsic noise is higher than intrinsic noise and that extrinsic noise in stomata is clearly lower in comparison to several other tissues/cell types. Finally, we show that cells are coupled with respect to stochastic protein expression in young leaves, hypocotyls and roots but not in mature leaves. Our data indicate that stochasticity of gene expression can vary between tissues/cell types and that it can be coupled in a non-cell-autonomous manner.

  1. Gene expression variability in human hepatic drug metabolizing enzymes and transporters.

    Directory of Open Access Journals (Sweden)

    Lun Yang

    Full Text Available Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineates the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.

  2. Molecular subsets in the gene expression signatures of scleroderma skin.

    Directory of Open Access Journals (Sweden)

    Ausra Milano

    2008-07-01

    Full Text Available Scleroderma is a clinically heterogeneous disease with a complex phenotype. The disease is characterized by vascular dysfunction, tissue fibrosis, internal organ dysfunction, and immune dysfunction resulting in autoantibody production.We analyzed the genome-wide patterns of gene expression with DNA microarrays in skin biopsies from distinct scleroderma subsets including 17 patients with systemic sclerosis (SSc with diffuse scleroderma (dSSc, 7 patients with SSc with limited scleroderma (lSSc, 3 patients with morphea, and 6 healthy controls. 61 skin biopsies were analyzed in a total of 75 microarray hybridizations. Analysis by hierarchical clustering demonstrates nearly identical patterns of gene expression in 17 out of 22 of the forearm and back skin pairs of SSc patients. Using this property of the gene expression, we selected a set of 'intrinsic' genes and analyzed the inherent data-driven groupings. Distinct patterns of gene expression separate patients with dSSc from those with lSSc and both are easily distinguished from normal controls. Our data show three distinct patient groups among the patients with dSSc and two groups among patients with lSSc. Each group can be distinguished by unique gene expression signatures indicative of proliferating cells, immune infiltrates and a fibrotic program. The intrinsic groups are statistically significant (p<0.001 and each has been mapped to clinical covariates of modified Rodnan skin score, interstitial lung disease, gastrointestinal involvement, digital ulcers, Raynaud's phenomenon and disease duration. We report a 177-gene signature that is associated with severity of skin disease in dSSc.Genome-wide gene expression profiling of skin biopsies demonstrates that the heterogeneity in scleroderma can be measured quantitatively with DNA microarrays. The diversity in gene expression demonstrates multiple distinct gene expression programs in the skin of patients with scleroderma.

  3. Gene expression in periodontal tissues following treatment

    Directory of Open Access Journals (Sweden)

    Eisenacher Martin

    2008-07-01

    Full Text Available Abstract Background In periodontitis, treatment aimed at controlling the periodontal biofilm infection results in a resolution of the clinical and histological signs of inflammation. Although the cell types found in periodontal tissues following treatment have been well described, information on gene expression is limited to few candidate genes. Therefore, the aim of the study was to determine the expression profiles of immune and inflammatory genes in periodontal tissues from sites with severe chronic periodontitis following periodontal therapy in order to identify genes involved in tissue homeostasis. Gingival biopsies from 12 patients with severe chronic periodontitis were taken six to eight weeks following non-surgical periodontal therapy, and from 11 healthy controls. As internal standard, RNA of an immortalized human keratinocyte line (HaCaT was used. Total RNA was subjected to gene expression profiling using a commercially available microarray system focusing on inflammation-related genes. Post-hoc confirmation of selected genes was done by Realtime-PCR. Results Out of the 136 genes analyzed, the 5% most strongly expressed genes compared to healthy controls were Interleukin-12A (IL-12A, Versican (CSPG-2, Matrixmetalloproteinase-1 (MMP-1, Down syndrome critical region protein-1 (DSCR-1, Macrophage inflammatory protein-2β (Cxcl-3, Inhibitor of apoptosis protein-1 (BIRC-1, Cluster of differentiation antigen 38 (CD38, Regulator of G-protein signalling-1 (RGS-1, and Finkel-Biskis-Jinkins murine osteosarcoma virus oncogene (C-FOS; the 5% least strongly expressed genes were Receptor-interacting Serine/Threonine Kinase-2 (RIP-2, Complement component 3 (C3, Prostaglandin-endoperoxide synthase-2 (COX-2, Interleukin-8 (IL-8, Endothelin-1 (EDN-1, Plasminogen activator inhibitor type-2 (PAI-2, Matrix-metalloproteinase-14 (MMP-14, and Interferon regulating factor-7 (IRF-7. Conclusion Gene expression profiles found in periodontal tissues following

  4. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  5. Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

    Directory of Open Access Journals (Sweden)

    Qiusheng Kong

    Full Text Available Gene expression analysis in watermelon (Citrullus lanatus fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC, β-actin (ClACT, and alpha tubulin 5 (ClTUA5 as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1, a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.

  6. Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

    Science.gov (United States)

    Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Liqiang; Cheng, Fei; Huang, Yuan; Bie, Zhilong

    2015-01-01

    Gene expression analysis in watermelon (Citrullus lanatus) fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR) is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC), β-actin (ClACT), and alpha tubulin 5 (ClTUA5) as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND) was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1), a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.

  7. SIGNATURE: A workbench for gene expression signature analysis

    Directory of Open Access Journals (Sweden)

    Chang Jeffrey T

    2011-11-01

    Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.

  8. Multiple Suboptimal Solutions for Prediction Rules in Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Osamu Komori

    2013-01-01

    Full Text Available This paper discusses mathematical and statistical aspects in analysis methods applied to microarray gene expressions. We focus on pattern recognition to extract informative features embedded in the data for prediction of phenotypes. It has been pointed out that there are severely difficult problems due to the unbalance in the number of observed genes compared with the number of observed subjects. We make a reanalysis of microarray gene expression published data to detect many other gene sets with almost the same performance. We conclude in the current stage that it is not possible to extract only informative genes with high performance in the all observed genes. We investigate the reason why this difficulty still exists even though there are actively proposed analysis methods and learning algorithms in statistical machine learning approaches. We focus on the mutual coherence or the absolute value of the Pearson correlations between two genes and describe the distributions of the correlation for the selected set of genes and the total set. We show that the problem of finding informative genes in high dimensional data is ill-posed and that the difficulty is closely related with the mutual coherence.

  9. Changes in gene expression during male meiosis in Petunia hybrida.

    Science.gov (United States)

    Cnudde, Filip; Hedatale, Veena; de Jong, Hans; Pierson, Elisabeth S; Rainey, Daphne Y; Zabeau, Marc; Weterings, Koen; Gerats, Tom; Peters, Janny L

    2006-01-01

    We analyzed changes in gene expression during male meiosis in Petunia by combining the meiotic staging of pollen mother cells from a single anther with cDNA-AFLP transcript profiling of mRNA from the synchronously developing sister anthers. The transcript profiling experiments focused on the identification of genes with a modulated expression profile during meiosis, while premeiotic archesporial cells and postmeiotic microspores served as a reference. About 8000 transcript tags, estimated at 30% of the total transcriptome, were generated, of which around 6% exhibited a modulated gene expression pattern at meiosis. Cluster analysis revealed a transcriptional cascade that coincides with the initiation and progression through all stages of the two meiotic divisions. Fragments that exhibited high expression specifically during meiosis I were characterized further by sequencing; 90 out of the 293 sequenced fragments showed homology with known genes, belonging to a wide range of gene classes, including previously characterized meiotic genes. In-situ hybridization experiments were performed to determine the spatial expression pattern for five selected transcript tags. Its concurrence with cDNA-AFLP transcript profiles indicates that this is an excellent approach to study genes involved in specialized processes such as meiosis. Our data set provides the potential to unravel unique meiotic genes that are as yet elusive to reverse genetics approaches.

  10. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2015-01-01

    Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  11. Widespread ectopic expression of olfactory receptor genes

    Directory of Open Access Journals (Sweden)

    Yanai Itai

    2006-05-01

    Full Text Available Abstract Background Olfactory receptors (ORs are the largest gene family in the human genome. Although they are expected to be expressed specifically in olfactory tissues, some ectopic expression has been reported, with special emphasis on sperm and testis. The present study systematically explores the expression patterns of OR genes in a large number of tissues and assesses the potential functional implication of such ectopic expression. Results We analyzed the expression of hundreds of human and mouse OR transcripts, via EST and microarray data, in several dozens of human and mouse tissues. Different tissues had specific, relatively small OR gene subsets which had particularly high expression levels. In testis, average expression was not particularly high, and very few highly expressed genes were found, none corresponding to ORs previously implicated in sperm chemotaxis. Higher expression levels were more common for genes with a non-OR genomic neighbor. Importantly, no correlation in expression levels was detected for human-mouse orthologous pairs. Also, no significant difference in expression levels was seen between intact and pseudogenized ORs, except for the pseudogenes of subfamily 7E which has undergone a human-specific expansion. Conclusion The OR superfamily as a whole, show widespread, locus-dependent and heterogeneous expression, in agreement with a neutral or near neutral evolutionary model for transcription control. These results cannot reject the possibility that small OR subsets might play functional roles in different tissues, however considerable care should be exerted when offering a functional interpretation for ectopic OR expression based only on transcription information.

  12. GeneBins: a database for classifying gene expression data, with application to plant genome arrays

    Directory of Open Access Journals (Sweden)

    Weiller Georg

    2007-03-01

    Full Text Available Abstract Background To interpret microarray experiments, several ontological analysis tools have been developed. However, current tools are limited to specific organisms. Results We developed a bioinformatics system to assign the probe set sequences of any organism to a hierarchical functional classification modelled on KEGG ontology. The GeneBins database currently supports the functional classification of expression data from four Affymetrix arrays; Arabidopsis thaliana, Oryza sativa, Glycine max and Medicago truncatula. An online analysis tool to identify relevant functions is also provided. Conclusion GeneBins provides resources to interpret gene expression results from microarray experiments. It is available at http://bioinfoserver.rsbs.anu.edu.au/utils/GeneBins/

  13. Domestication rewired gene expression and nucleotide diversity patterns in tomato.

    Science.gov (United States)

    Sauvage, Christopher; Rau, Andrea; Aichholz, Charlotte; Chadoeuf, Joël; Sarah, Gautier; Ruiz, Manuel; Santoni, Sylvain; Causse, Mathilde; David, Jacques; Glémin, Sylvain

    2017-08-01

    Plant domestication has led to considerable phenotypic modifications from wild species to modern varieties. However, although changes in key traits have been well documented, less is known about the underlying molecular mechanisms, such as the reduction of molecular diversity or global gene co-expression patterns. In this study, we used a combination of gene expression and population genetics in wild and crop tomato to decipher the footprints of domestication. We found a set of 1729 differentially expressed genes (DEG) between the two genetic groups, belonging to 17 clusters of co-expressed DEG, suggesting that domestication affected not only individual genes but also regulatory networks. Five co-expression clusters were enriched in functional terms involving carbohydrate metabolism or epigenetic regulation of gene expression. We detected differences in nucleotide diversity between the crop and wild groups specific to DEG. Our study provides an extensive profiling of the rewiring of gene co-expression induced by the domestication syndrome in one of the main crop species. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  14. Regulation of Gene Expression in Protozoa Parasites

    Directory of Open Access Journals (Sweden)

    Consuelo Gomez

    2010-01-01

    Full Text Available Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  15. Regulation of gene expression in protozoa parasites.

    Science.gov (United States)

    Gomez, Consuelo; Esther Ramirez, M; Calixto-Galvez, Mercedes; Medel, Olivia; Rodríguez, Mario A

    2010-01-01

    Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  16. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.; Mallick, B. K.

    2013-01-01

    graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which

  17. Gene Expression and Microarray Investigation of Dendrobium ...

    African Journals Online (AJOL)

    blood glucose > 16.7 mmol/L were used as the model group and treated with Dendrobium mixture. (DEN ... Keywords: Diabetes, Gene expression, Dendrobium mixture, Microarray testing ..... homeostasis in airway smooth muscle. Am J.

  18. Identification of genes showing differential expression profile ...

    Indian Academy of Sciences (India)

    3Department of Natural Sciences, International Christian University, Mitaka, Tokyo 181-8585, Japan ... the changes of expression predicted from gene function suggested association ... ate School of Science and Technology, Niigata University.

  19. Drosophila melanogaster gene expression changes after spaceflight.

    Data.gov (United States)

    National Aeronautics and Space Administration — Gene expression levels were determined in 3rd instar and adult Drosophila melanogaster reared during spaceflight to elucidate the genetic and molecular mechanisms...

  20. Exertional Heat Illness and Human Gene Expression

    National Research Council Canada - National Science Library

    Sonna, L.A; Sawka, M. N; Lilly, C. M

    2007-01-01

    Microarray analysis of gene expression at the level of RNA has generated new insights into the relationship between cellular responses to acute heat shock in vitro, exercise, and exertional heat illness...

  1. Expression Profiling of Tyrosine Kinase Genes

    National Research Council Canada - National Science Library

    Weier, Heinz

    2000-01-01

    ... of these genes parallels the progression of tumors to a more malignant phenotype. We developed a DNA micro-array based screening system to monitor the level of expression of tyrosine kinase (tk...

  2. Regulation of meiotic gene expression in plants

    Directory of Open Access Journals (Sweden)

    Adele eZhou

    2014-08-01

    Full Text Available With the recent advances in genomics and sequencing technologies, databases of transcriptomes representing many cellular processes have been built. Meiotic transcriptomes in plants have been studied in Arabidopsis thaliana, rice (Oryza sativa, wheat (Triticum aestivum, petunia (Petunia hybrida, sunflower (Helianthus annuus, and maize (Zea mays. Studies in all organisms, but particularly in plants, indicate that a very large number of genes are expressed during meiosis, though relatively few of them seem to be required for the completion of meiosis. In this review, we focus on gene expression at the RNA level and analyze the meiotic transcriptome datasets and explore expression patterns of known meiotic genes to elucidate how gene expression could be regulated during meiosis. We also discuss mechanisms, such as chromatin organization and non-coding RNAs, that might be involved in the regulation of meiotic transcription patterns.

  3. Identification of genes preferentially expressed during

    African Journals Online (AJOL)

    雨林木风

    2012-08-16

    Aug 16, 2012 ... The suppression subtractive hybridization (SSH) method conducted to generate ... which showed the lack of genomic information currently available for lily. ..... characterization of genes expressed during somatic embryo.

  4. Alteration of gene expression by alcohol exposure at early neurulation.

    Science.gov (United States)

    Zhou, Feng C; Zhao, Qianqian; Liu, Yunlong; Goodlett, Charles R; Liang, Tiebing; McClintick, Jeanette N; Edenberg, Howard J; Li, Lang

    2011-02-21

    We have previously demonstrated that alcohol exposure at early neurulation induces growth retardation, neural tube abnormalities, and alteration of DNA methylation. To explore the global gene expression changes which may underline these developmental defects, microarray analyses were performed in a whole embryo mouse culture model that allows control over alcohol and embryonic variables. Alcohol caused teratogenesis in brain, heart, forelimb, and optic vesicle; a subset of the embryos also showed cranial neural tube defects. In microarray analysis (accession number GSM9545), adopting hypothesis-driven Gene Set Enrichment Analysis (GSEA) informatics and intersection analysis of two independent experiments, we found that there was a collective reduction in expression of neural specification genes (neurogenin, Sox5, Bhlhe22), neural growth factor genes [Igf1, Efemp1, Klf10 (Tieg), and Edil3], and alteration of genes involved in cell growth, apoptosis, histone variants, eye and heart development. There was also a reduction of retinol binding protein 1 (Rbp1), and de novo expression of aldehyde dehydrogenase 1B1 (Aldh1B1). Remarkably, four key hematopoiesis genes (glycophorin A, adducin 2, beta-2 microglobulin, and ceruloplasmin) were absent after alcohol treatment, and histone variant genes were reduced. The down-regulation of the neurospecification and the neurotrophic genes were further confirmed by quantitative RT-PCR. Furthermore, the gene expression profile demonstrated distinct subgroups which corresponded with two distinct alcohol-related neural tube phenotypes: an open (ALC-NTO) and a closed neural tube (ALC-NTC). Further, the epidermal growth factor signaling pathway and histone variants were specifically altered in ALC-NTO, and a greater number of neurotrophic/growth factor genes were down-regulated in the ALC-NTO than in the ALC-NTC embryos. This study revealed a set of genes vulnerable to alcohol exposure and genes that were associated with neural tube

  5. Blood Gene Expression Profiling of Breast Cancer Survivors Experiencing Fibrosis

    International Nuclear Information System (INIS)

    Landmark-Hoyvik, Hege; Dumeaux, Vanessa; Reinertsen, Kristin V.; Edvardsen, Hege; Fossa, Sophie D.; Borresen-Dale, Anne-Lise

    2011-01-01

    Purpose: To extend knowledge on the mechanisms and pathways involved in maintenance of radiation-induced fibrosis (RIF) by performing gene expression profiling of whole blood from breast cancer (BC) survivors with and without fibrosis 3-7 years after end of radiotherapy treatment. Methods and Materials: Gene expression profiles from blood were obtained for 254 BC survivors derived from a cohort of survivors, treated with adjuvant radiotherapy for breast cancer 3-7 years earlier. Analyses of transcriptional differences in blood gene expression between BC survivors with fibrosis (n = 31) and BC survivors without fibrosis (n = 223) were performed using R version 2.8.0 and tools from the Bioconductor project. Gene sets extracted through a literature search on fibrosis and breast cancer were subsequently used in gene set enrichment analysis. Results: Substantial differences in blood gene expression between BC survivors with and without fibrosis were observed, and 87 differentially expressed genes were identified through linear analysis. Transforming growth factor-β1 signaling was identified as the most significant gene set, showing a down-regulation of most of the core genes, together with up-regulation of a transcriptional activator of the inhibitor of fibrinolysis, Plasminogen activator inhibitor 1 in the BC survivors with fibrosis. Conclusion: Transforming growth factor-β1 signaling was found down-regulated during the maintenance phase of fibrosis as opposed to the up-regulation reported during the early, initiating phase of fibrosis. Hence, once the fibrotic tissue has developed, the maintenance phase might rather involve a deregulation of fibrinolysis and altered degradation of extracellular matrix components.

  6. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

    Science.gov (United States)

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928

  7. Evaluation of suitable reference genes for gene expression studies ...

    Indian Academy of Sciences (India)

    2011-12-14

    Dec 14, 2011 ... MADS family of TFs control floral organ identity within each whorl of the flower by activating downstream genes. Measuring gene expression in different tissue types and developmental stages is of fundamental importance in TFs functional research. In last few years, quantitative real-time. PCR (qRT-PCR) ...

  8. PRAME gene expression profile in medulloblastoma

    Directory of Open Access Journals (Sweden)

    Tânia Maria Vulcani-Freitas

    2011-02-01

    Full Text Available Medulloblastoma is the most common malignant tumors of central nervous system in the childhood. The treatment is severe, harmful and, thus, has a dismal prognosis. As PRAME is present in various cancers, including meduloblastoma, and has limited expression in normal tissues, this antigen can be an ideal vaccine target for tumor immunotherapy. In order to find a potential molecular target, we investigated PRAME expression in medulloblastoma fragments and we compare the results with the clinical features of each patient. Analysis of gene expression was performed by real-time quantitative PCR from 37 tumor samples. The Mann-Whitney test was used to analysis the relationship between gene expression and clinical characteristics. Kaplan-Meier curves were used to evaluate survival. PRAME was overexpressed in 84% samples. But no statistical association was found between clinical features and PRAME overexpression. Despite that PRAME gene could be a strong candidate for immunotherapy since it is highly expressed in medulloblastomas.

  9. Comparative gene expression between two yeast species

    Directory of Open Access Journals (Sweden)

    Guan Yuanfang

    2013-01-01

    Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.

  10. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.

    2013-07-18

    The modeling of gene networks from transcriptional expression data is an important tool in biomedical research to reveal signaling pathways and to identify treatment targets. Current gene network modeling is primarily based on the use of Gaussian graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which generate counts of mRNAtranscripts in cell samples.We propose a generalized linear model to fit the discrete gene expression data and assume that the log ratios of the mean expression levels follow a Gaussian distribution.We restrict the gene network structures to decomposable graphs and derive the graphs by selecting the covariance matrix of the Gaussian distribution with the hyper-inverse Wishart priors. Furthermore, we incorporate prior network models based on gene ontology information, which avails existing biological information on the genes of interest. We conduct simulation studies to examine the performance of our discrete graphical model and apply the method to two real datasets for gene network inference. © The Author 2013. Published by Oxford University Press. All rights reserved.

  11. Gene expression profiling in autoimmune diseases

    DEFF Research Database (Denmark)

    Bovin, Lone Frier; Brynskov, Jørn; Hegedüs, Laszlo

    2007-01-01

    A central issue in autoimmune disease is whether the underlying inflammation is a repeated stereotypical process or whether disease specific gene expression is involved. To shed light on this, we analysed whether genes previously found to be differentially regulated in rheumatoid arthritis (RA...

  12. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    Directory of Open Access Journals (Sweden)

    Walchli John

    2009-04-01

    Full Text Available Abstract Background With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α, viral polymerase (HCV NS5B, and bacterial structural protein (FtsZ were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.

  13. Selection of reference genes for gene expression studies in heart failure for left and right ventricles.

    Science.gov (United States)

    Li, Mengmeng; Rao, Man; Chen, Kai; Zhou, Jianye; Song, Jiangping

    2017-07-15

    Real-time quantitative reverse transcriptase-PCR (qRT-PCR) is a feasible tool for determining gene expression profiles, but the accuracy and reliability of the results depends on the stable expression of selected housekeeping genes in different samples. By far, researches on stable housekeeping genes in human heart failure samples are rare. Moreover the effect of heart failure on the expression of housekeeping genes in right and left ventricles is yet to be studied. Therefore we aim to provide stable housekeeping genes for both ventricles in heart failure and normal heart samples. In this study, we selected seven commonly used housekeeping genes as candidates. By using the qRT-PCR, the expression levels of ACTB, RAB7A, GAPDH, REEP5, RPL5, PSMB4 and VCP in eight heart failure and four normal heart samples were assessed. The stability of candidate housekeeping genes was evaluated by geNorm and Normfinder softwares. GAPDH showed the least variation in all heart samples. Results also indicated the difference of gene expression existed in heart failure left and right ventricles. GAPDH had the highest expression stability in both heart failure and normal heart samples. We also propose using different sets of housekeeping genes for left and right ventricles respectively. The combination of RPL5, GAPDH and PSMB4 is suitable for the right ventricle and the combination of GAPDH, REEP5 and RAB7A is suitable for the left ventricle. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Bayesian assignment of gene ontology terms to gene expression experiments.

    Science.gov (United States)

    Sykacek, P

    2012-09-15

    Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Source code under GPL license is available from the author. peter.sykacek@boku.ac.at.

  15. Bayesian assignment of gene ontology terms to gene expression experiments

    Science.gov (United States)

    Sykacek, P.

    2012-01-01

    Motivation: Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. Results: This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Availability: Source code under GPL license is available from the author. Contact: peter.sykacek@boku.ac.at PMID:22962488

  16. Expression of SET Protein in the Ovaries of Patients with Polycystic Ovary Syndrome

    OpenAIRE

    Xu Boqun; Dai Xiaonan; Cui YuGui; Gao Lingling; Dai Xue; Chao Gao; Diao Feiyang; Liu Jiayin; Li Gao; Mei Li; Yuan Zhang; Xiang Ma

    2013-01-01

    Background. We previously found that expression of SET gene was up-regulated in polycystic ovaries by using microarray. It suggested that SET may be an attractive candidate regulator involved in the pathophysiology of polycystic ovary syndrome (PCOS). In this study, expression and cellular localization of SET protein were investigated in human polycystic and normal ovaries. Method. Ovarian tissues, six normal ovaries and six polycystic ovaries, were collected during transsexual operation and ...

  17. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In

  18. Multiscale Embedded Gene Co-expression Network Analysis.

    Directory of Open Access Journals (Sweden)

    Won-Min Song

    2015-11-01

    Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  19. Multiscale Embedded Gene Co-expression Network Analysis.

    Science.gov (United States)

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  20. Uniform approximation is more appropriate for Wilcoxon Rank-Sum Test in gene set analysis.

    Directory of Open Access Journals (Sweden)

    Zhide Fang

    Full Text Available Gene set analysis is widely used to facilitate biological interpretations in the analyses of differential expression from high throughput profiling data. Wilcoxon Rank-Sum (WRS test is one of the commonly used methods in gene set enrichment analysis. It compares the ranks of genes in a gene set against those of genes outside the gene set. This method is easy to implement and it eliminates the dichotomization of genes into significant and non-significant in a competitive hypothesis testing. Due to the large number of genes being examined, it is impractical to calculate the exact null distribution for the WRS test. Therefore, the normal distribution is commonly used as an approximation. However, as we demonstrate in this paper, the normal approximation is problematic when a gene set with relative small number of genes is tested against the large number of genes in the complementary set. In this situation, a uniform approximation is substantially more powerful, more accurate, and less intensive in computation. We demonstrate the advantage of the uniform approximations in Gene Ontology (GO term analysis using simulations and real data sets.

  1. Equivalent Gene Expression Profiles between Glatopa™ and Copaxone®.

    Directory of Open Access Journals (Sweden)

    Josephine S D'Alessandro

    Full Text Available Glatopa™ is a generic glatiramer acetate recently approved for the treatment of patients with relapsing forms of multiple sclerosis. Gene expression profiling was performed as a means to evaluate equivalence of Glatopa and Copaxone®. Microarray analysis containing 39,429 unique probes across the entire genome was performed in murine glatiramer acetate--responsive Th2-polarized T cells, a test system highly relevant to the biology of glatiramer acetate. A closely related but nonequivalent glatiramoid molecule was used as a control to establish assay sensitivity. Multiple probe-level (Student's t-test and sample-level (principal component analysis, multidimensional scaling, and hierarchical clustering statistical analyses were utilized to look for differences in gene expression induced by the test articles. The analyses were conducted across all genes measured, as well as across a subset of genes that were shown to be modulated by Copaxone. The following observations were made across multiple statistical analyses: the expression of numerous genes was significantly changed by treatment with Copaxone when compared against media-only control; gene expression profiles induced by Copaxone and Glatopa were not significantly different; and gene expression profiles induced by Copaxone and the nonequivalent glatiramoid were significantly different, underscoring the sensitivity of the test system and the multiple analysis methods. Comparative analysis was also performed on sets of transcripts relevant to T-cell biology and antigen presentation, among others that are known to be modulated by glatiramer acetate. No statistically significant differences were observed between Copaxone and Glatopa in the expression levels (magnitude and direction of these glatiramer acetate-regulated genes. In conclusion, multiple methods consistently supported equivalent gene expression profiles between Copaxone and Glatopa.

  2. Radiation-modulated gene expression in C. elegans

    International Nuclear Information System (INIS)

    Nelson, G.A.; Bayeta, E.; Perez, C.; Lloyd, E.; Jones, T.; Smith, A.; Tian, J.

    2003-01-01

    Full text: We use the nematode C. elegans to characterize the genotoxic and cytotoxic effects of ionizing radiation with emphasis effects of charged particle radiation and have described the fluence vs. response relationships for mutation, chromosome aberration and certain developmental errors. These endpoints quantify the biological after repair and compensation pathways have completed their work. In order to address the control of these reactions we have turned to gene expression profiling to identify genes that uniquely respond to high LET species or respond differentially as a function of radiation properties. We have employed whole genome microarray methods to map gene expression following exposure to gamma rays, protons and accelerated iron ions. We found that 599 of 17871 genes analyzed showed differential expression 3 hrs after exposure to 3 Gy of at least one radiation types. 193 were up-regulated, 406 were down-regulated, and 90% were affected by only one species of radiation. Genes whose transcription levels responded significantly mapped to definite statistical clusters that were unique for each radiation type. We are now trying to establish the functional relationships of the genes their relevance to mitigation of radiation-induced damage. Three approaches are being used. First, bioinformatics tools are being used to determine the roles of genes in co-regulated gene sets. Second, we are applying the technique of RNA interference to determine whether our radiation-induced genes affect cell survival (measured in terms of embryo survival) and chromosome aberration (intestinal anaphase bridges). Finally we are focussing on the response of the most strongly-regulated gene in our data set. This is the autosomal gene, F36D3.9, whose predicted structure is that of a cysteine protease resembling cathepsin B. An enzymological approach is being used to characterize this gene at the protein level. This work was supported by NASA Cooperative Agreement NCC9-149

  3. A compendium of canine normal tissue gene expression.

    Directory of Open Access Journals (Sweden)

    Joseph Briggs

    Full Text Available BACKGROUND: Our understanding of disease is increasingly informed by changes in gene expression between normal and abnormal tissues. The release of the canine genome sequence in 2005 provided an opportunity to better understand human health and disease using the dog as clinically relevant model. Accordingly, we now present the first genome-wide, canine normal tissue gene expression compendium with corresponding human cross-species analysis. METHODOLOGY/PRINCIPAL FINDINGS: The Affymetrix platform was utilized to catalogue gene expression signatures of 10 normal canine tissues including: liver, kidney, heart, lung, cerebrum, lymph node, spleen, jejunum, pancreas and skeletal muscle. The quality of the database was assessed in several ways. Organ defining gene sets were identified for each tissue and functional enrichment analysis revealed themes consistent with known physio-anatomic functions for each organ. In addition, a comparison of orthologous gene expression between matched canine and human normal tissues uncovered remarkable similarity. To demonstrate the utility of this dataset, novel canine gene annotations were established based on comparative analysis of dog and human tissue selective gene expression and manual curation of canine probeset mapping. Public access, using infrastructure identical to that currently in use for human normal tissues, has been established and allows for additional comparisons across species. CONCLUSIONS/SIGNIFICANCE: These data advance our understanding of the canine genome through a comprehensive analysis of gene expression in a diverse set of tissues, contributing to improved functional annotation that has been lacking. Importantly, it will be used to inform future studies of disease in the dog as a model for human translational research and provides a novel resource to the community at large.

  4. Noise minimization in eukaryotic gene expression.

    Directory of Open Access Journals (Sweden)

    Hunter B Fraser

    2004-06-01

    Full Text Available All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or "noise." Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  5. Noise minimization in eukaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-15

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  6. Noise minimization in eukaryotic gene expression

    International Nuclear Information System (INIS)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-01

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection

  7. Design parameters to control synthetic gene expression in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Mark Welch

    Full Text Available BACKGROUND: Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles. PRINCIPAL FINDINGS: To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well. CONCLUSION: The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.

  8. Reproducibility of gene expression across generations of Affymetrix microarrays

    Directory of Open Access Journals (Sweden)

    Haslett Judith N

    2003-06-01

    Full Text Available Abstract Background The development of large-scale gene expression profiling technologies is rapidly changing the norms of biological investigation. But the rapid pace of change itself presents challenges. Commercial microarrays are regularly modified to incorporate new genes and improved target sequences. Although the ability to compare datasets across generations is crucial for any long-term research project, to date no means to allow such comparisons have been developed. In this study the reproducibility of gene expression levels across two generations of Affymetrix GeneChips® (HuGeneFL and HG-U95A was measured. Results Correlation coefficients were computed for gene expression values across chip generations based on different measures of similarity. Comparing the absolute calls assigned to the individual probe sets across the generations found them to be largely unchanged. Conclusion We show that experimental replicates are highly reproducible, but that reproducibility across generations depends on the degree of similarity of the probe sets and the expression level of the corresponding transcript.

  9. Serial Expression Analysis: a web tool for the analysis of serial gene expression data

    Science.gov (United States)

    Nueda, Maria José; Carbonell, José; Medina, Ignacio; Dopazo, Joaquín; Conesa, Ana

    2010-01-01

    Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es. PMID:20525784

  10. Mining gene expression data by interpreting principal components

    Directory of Open Access Journals (Sweden)

    Mortazavi Ali

    2006-04-01

    Full Text Available Abstract Background There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. Results We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset. We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.. Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. Conclusion We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It

  11. Comparative modular analysis of gene expression in vertebrate organs

    Directory of Open Access Journals (Sweden)

    Piasecka Barbara

    2012-03-01

    Full Text Available Abstract Background The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Results Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Conclusions Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.

  12. Comparative modular analysis of gene expression in vertebrate organs.

    Science.gov (United States)

    Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc

    2012-03-29

    The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.

  13. Expression Study of Banana Pathogenic Resistance Genes

    Directory of Open Access Journals (Sweden)

    Fenny M. Dwivany

    2016-10-01

    Full Text Available Banana is one of the world's most important trade commodities. However, infection of banana pathogenic fungi (Fusarium oxysporum race 4 is one of the major causes of decreasing production in Indonesia. Genetic engineering has become an alternative way to control this problem by isolating genes that involved in plant defense mechanism against pathogens. Two of the important genes are API5 and ChiI1, each gene encodes apoptosis inhibitory protein and chitinase enzymes. The purpose of this study was to study the expression of API5 and ChiI1 genes as candidate pathogenic resistance genes. The amplified fragments were then cloned, sequenced, and confirmed with in silico studies. Based on sequence analysis, it is showed that partial API5 gene has putative transactivation domain and ChiI1 has 9 chitinase family GH19 protein motifs. Data obtained from this study will contribute in banana genetic improvement.

  14. Dlx homeobox gene family expression in osteoclasts.

    Science.gov (United States)

    Lézot, F; Thomas, B L; Blin-Wakkach, C; Castaneda, B; Bolanos, A; Hotton, D; Sharpe, P T; Heymann, D; Carles, G F; Grigoriadis, A E; Berdal, A

    2010-06-01

    Skeletal growth and homeostasis require the finely orchestrated secretion of mineralized tissue matrices by highly specialized cells, balanced with their degradation by osteoclasts. Time- and site-specific expression of Dlx and Msx homeobox genes in the cells secreting these matrices have been identified as important elements in the regulation of skeletal morphology. Such specific expression patterns have also been reported in osteoclasts for Msx genes. The aim of the present study was to establish the expression patterns of Dlx genes in osteoclasts and identify their function in regulating skeletal morphology. The expression patterns of all Dlx genes were examined during the whole osteoclastogenesis using different in vitro models. The results revealed that Dlx1 and Dlx2 are the only Dlx family members with a possible function in osteoclastogenesis as well as in mature osteoclasts. Dlx5 and Dlx6 were detected in the cultures but appear to be markers of monocytes and their derivatives. In vivo, Dlx2 expression in osteoclasts was examined using a Dlx2/LacZ transgenic mouse. Dlx2 is expressed in a subpopulation of osteoclasts in association with tooth, brain, nerve, and bone marrow volumetric growths. Altogether the present data suggest a role for Dlx2 in regulation of skeletal morphogenesis via functions within osteoclasts. (c) 2010 Wiley-Liss, Inc.

  15. Hepatocyte specific expression of human cloned genes

    Energy Technology Data Exchange (ETDEWEB)

    Cortese, R

    1986-01-01

    A large number of proteins are specifically synthesized in the hepatocyte. Only the adult liver expresses the complete repertoire of functions which are required at various stages during development. There is therefore a complex series of regulatory mechanisms responsible for the maintenance of the differentiated state and for the developmental and physiological variations in the pattern of gene expression. Human hepatoma cell lines HepG2 and Hep3B display a pattern of gene expression similar to adult and fetal liver, respectively; in contrast, cultured fibroblasts or HeLa cells do not express most of the liver specific genes. They have used these cell lines for transfection experiments with cloned human liver specific genes. DNA segments coding for alpha1-antitrypsin and retinol binding protein (two proteins synthesized both in fetal and adult liver) are expressed in the hepatoma cell lines HepG2 and Hep3B, but not in HeLa cells or fibroblasts. A DNA segment coding for haptoglobin (a protein synthesized only after birth) is only expressed in the hepatoma cell line HepG2 but not in Hep3B nor in non hepatic cell lines. The information for tissue specific expression is located in the 5' flanking region of all three genes. In vivo competition experiments show that these DNA segments bind to a common, apparently limiting, transacting factor. Conventional techniques (Bal deletions, site directed mutagenesis, etc.) have been used to precisely identify the DNA sequences responsible for these effects. The emerging picture is complex: they have identified multiple, separate transcriptional signals, essential for maximal promoter activation and tissue specific expression. Some of these signals show a negative effect on transcription in fibroblast cell lines.

  16. Oxygen and tissue culture affect placental gene expression.

    Science.gov (United States)

    Brew, O; Sullivan, M H F

    2017-07-01

    Placental explant culture is an important model for studying placental development and functions. We investigated the differences in placental gene expression in response to tissue culture, atmospheric and physiologic oxygen concentrations. Placental explants were collected from normal term (38-39 weeks of gestation) placentae with no previous uterine contractile activity. Placental transcriptomic expressions were evaluated with GeneChip ® Human Genome U133 Plus 2.0 arrays (Affymetrix). We uncovered sub-sets of genes that regulate response to stress, induction of apoptosis programmed cell death, mis-regulation of cell growth, proliferation, cell morphogenesis, tissue viability, and protection from apoptosis in cultured placental explants. We also identified a sub-set of genes with highly unstable pattern of expression after exposure to tissue culture. Tissue culture irrespective of oxygen concentration induced dichotomous increase in significant gene expression and increased enrichment of significant pathways and transcription factor targets (TFTs) including HIF1A. The effect was exacerbated by culture at atmospheric oxygen concentration, where further up-regulation of TFTs including PPARA, CEBPD, HOXA9 and down-regulated TFTs such as JUND/FOS suggest intrinsic heightened key biological and metabolic mechanisms such as glucose use, lipid biosynthesis, protein metabolism; apoptosis, inflammatory responses; and diminished trophoblast proliferation, differentiation, invasion, regeneration, and viability. These findings demonstrate that gene expression patterns differ between pre-culture and cultured explants, and the gene expression of explants cultured at atmospheric oxygen concentration favours stressed, pro-inflammatory and increased apoptotic transcriptomic response. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Gene expression profiles in skeletal muscle after gene electrotransfer

    DEFF Research Database (Denmark)

    Hojman, Pernille; Zibert, John R; Gissel, Hanne

    2007-01-01

    BACKGROUND: Gene transfer by electroporation (DNA electrotransfer) to muscle results in high level long term transgenic expression, showing great promise for treatment of e.g. protein deficiency syndromes. However little is known about the effects of DNA electrotransfer on muscle fibres. We have...... caused down-regulation of structural proteins e.g. sarcospan and catalytic enzymes. Injection of DNA induced down-regulation of intracellular transport proteins e.g. sentrin. The effects on muscle fibres were transient as the expression profiles 3 weeks after treatment were closely related......) followed by a long low voltage pulse (LV, 100 V/cm, 400 ms); a pulse combination optimised for efficient and safe gene transfer. Muscles were transfected with green fluorescent protein (GFP) and excised at 4 hours, 48 hours or 3 weeks after treatment. RESULTS: Differentially expressed genes were...

  18. Gene expression analysis of flax seed development

    Science.gov (United States)

    2011-01-01

    Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise

  19. Gene expression analysis of flax seed development

    Directory of Open Access Journals (Sweden)

    Sharpe Andrew

    2011-04-01

    Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid

  20. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  1. The Medicago truncatula gene expression atlas web server

    Directory of Open Access Journals (Sweden)

    Tang Yuhong

    2009-12-01

    Full Text Available Abstract Background Legumes (Leguminosae or Fabaceae play a major role in agriculture. Transcriptomics studies in the model legume species, Medicago truncatula, are instrumental in helping to formulate hypotheses about the role of legume genes. With the rapid growth of publically available Affymetrix GeneChip Medicago Genome Array GeneChip data from a great range of tissues, cell types, growth conditions, and stress treatments, the legume research community desires an effective bioinformatics system to aid efforts to interpret the Medicago genome through functional genomics. We developed the Medicago truncatula Gene Expression Atlas (MtGEA web server for this purpose. Description The Medicago truncatula Gene Expression Atlas (MtGEA web server is a centralized platform for analyzing the Medicago transcriptome. Currently, the web server hosts gene expression data from 156 Affymetrix GeneChip® Medicago genome arrays in 64 different experiments, covering a broad range of developmental and environmental conditions. The server enables flexible, multifaceted analyses of transcript data and provides a range of additional information about genes, including different types of annotation and links to the genome sequence, which help users formulate hypotheses about gene function. Transcript data can be accessed using Affymetrix probe identification number, DNA sequence, gene name, functional description in natural language, GO and KEGG annotation terms, and InterPro domain number. Transcripts can also be discovered through co-expression or differential expression analysis. Flexible tools to select a subset of experiments and to visualize and compare expression profiles of multiple genes have been implemented. Data can be downloaded, in part or full, in a tabular form compatible with common analytical and visualization software. The web server will be updated on a regular basis to incorporate new gene expression data and genome annotation, and is accessible

  2. Lithium ions induce prestalk-associated gene expression and inhibit prespore gene expression in Dictyostelium discoideum

    NARCIS (Netherlands)

    Peters, Dorien J.M.; Lookeren Campagne, Michiel M. van; Haastert, Peter J.M. van; Spek, Wouter; Schaap, Pauline

    1989-01-01

    We investigated the effect of Li+ on two types of cyclic AMP-regulated gene expression and on basal and cyclic AMP-stimulated inositol 1,4,5-trisphosphate (Ins(1,4,5)P3) levels. Li+ effectively inhibits cyclic AMP-induced prespore gene expression, half-maximal inhibition occurring at about 2mM-LiCl.

  3. Scaling of gene expression data allowing the comparison of different gene expression platforms

    NARCIS (Netherlands)

    van Ruissen, Fred; Schaaf, Gerben J.; Kool, Marcel; Baas, Frank; Ruijter, Jan M.

    2008-01-01

    Serial analysis of gene expression (SAGE) and microarrays have found a widespread application, but much ambiguity exists regarding the amalgamation of the data resulting from these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce

  4. Biclustering methods: biological relevance and application in gene expression analysis.

    Directory of Open Access Journals (Sweden)

    Ali Oghabian

    Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering methods where genes (or respectively samples are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes. An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1 we examine how well the considered (biclustering methods differentiate various sample types; (2 we evaluate how well the groups of genes discovered by the (biclustering methods are annotated with similar Gene Ontology categories; (3 we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study and (4 we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.

  5. Developmental gene expression profiles of the human pathogen Schistosoma japonicum

    Directory of Open Access Journals (Sweden)

    McManus Donald P

    2009-03-01

    Full Text Available Abstract Background The schistosome blood flukes are complex trematodes and cause a chronic parasitic disease of significant public health importance worldwide, schistosomiasis. Their life cycle is characterised by distinct parasitic and free-living phases involving mammalian and snail hosts and freshwater. Microarray analysis was used to profile developmental gene expression in the Asian species, Schistosoma japonicum. Total RNAs were isolated from the three distinct environmental phases of the lifecycle – aquatic/snail (eggs, miracidia, sporocysts, cercariae, juvenile (lung schistosomula and paired but pre-egg laying adults and adult (paired, mature males and egg-producing females, both examined separately. Advanced analyses including ANOVA, principal component analysis, and hierarchal clustering provided a global synopsis of gene expression relationships among the different developmental stages of the schistosome parasite. Results Gene expression profiles were linked to the major environmental settings through which the developmental stages of the fluke have to adapt during the course of its life cycle. Gene ontologies of the differentially expressed genes revealed a wide range of functions and processes. In addition, stage-specific, differentially expressed genes were identified that were involved in numerous biological pathways and functions including calcium signalling, sphingolipid metabolism and parasite defence. Conclusion The findings provide a comprehensive database of gene expression in an important human pathogen, including transcriptional changes in genes involved in evasion of the host immune response, nutrient acquisition, energy production, calcium signalling, sphingolipid metabolism, egg production and tegumental function during development. This resource should help facilitate the identification and prioritization of new anti-schistosome drug and vaccine targets for the control of schistosomiasis.

  6. Development of Gene Expression Signatures for Practical Radiation Biodosimetry

    International Nuclear Information System (INIS)

    Paul, Sunirmal; Amundson, Sally A.

    2008-01-01

    Purpose: In a large-scale radiologic emergency, estimates of exposure doses and radiation injury would be required for individuals without physical dosimeters. Current methods are inadequate for the task, so we are developing gene expression profiles for radiation biodosimetry. This approach could provide both an estimate of physical radiation dose and an indication of the extent of individual injury or future risk. Methods and Materials: We used whole genome microarray expression profiling as a discovery platform to identify genes with the potential to predict radiation dose across an exposure range relevant for medical decision making in a radiologic emergency. Human peripheral blood from 10 healthy donors was irradiated ex vivo, and global gene expression was measured both 6 and 24 h after exposure. Results: A 74-gene signature was identified that distinguishes between four radiation doses (0.5, 2, 5, and 8 Gy) and controls. More than one third of these genes are regulated by TP53. A nearest centroid classifier using these same 74 genes correctly predicted 98% of samples taken either 6 h or 24 h after treatment as unexposed, exposed to 0.5, 2, or ≥5 Gy. Expression patterns of five genes (CDKN1A, FDXR, SESN1, BBC3, and PHPT1) from this signature were also confirmed by real-time polymerase chain reaction. Conclusion: The ability of a single gene set to predict radiation dose throughout a window of time without need for individual pre-exposure controls represents an important advance in the development of gene expression for biodosimetry

  7. Genetics of sputum gene expression in chronic obstructive pulmonary disease.

    Directory of Open Access Journals (Sweden)

    Weiliang Qiu

    Full Text Available Previous expression quantitative trait loci (eQTL studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs. The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5, the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.

  8. Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease

    Science.gov (United States)

    Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.

    2011-01-01

    Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713

  9. Functional clustering of time series gene expression data by Granger causality

    Science.gov (United States)

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  10. Sequential Logic Model Deciphers Dynamic Transcriptional Control of Gene Expressions

    Science.gov (United States)

    Yeo, Zhen Xuan; Wong, Sum Thai; Arjunan, Satya Nanda Vel; Piras, Vincent; Tomita, Masaru; Selvarajoo, Kumar; Giuliani, Alessandro; Tsuchiya, Masa

    2007-01-01

    Background Cellular signaling involves a sequence of events from ligand binding to membrane receptors through transcription factors activation and the induction of mRNA expression. The transcriptional-regulatory system plays a pivotal role in the control of gene expression. A novel computational approach to the study of gene regulation circuits is presented here. Methodology Based on the concept of finite state machine, which provides a discrete view of gene regulation, a novel sequential logic model (SLM) is developed to decipher control mechanisms of dynamic transcriptional regulation of gene expressions. The SLM technique is also used to systematically analyze the dynamic function of transcriptional inputs, the dependency and cooperativity, such as synergy effect, among the binding sites with respect to when, how much and how fast the gene of interest is expressed. Principal Findings SLM is verified by a set of well studied expression data on endo16 of Strongylocentrotus purpuratus (sea urchin) during the embryonic midgut development. A dynamic regulatory mechanism for endo16 expression controlled by three binding sites, UI, R and Otx is identified and demonstrated to be consistent with experimental findings. Furthermore, we show that during transition from specification to differentiation in wild type endo16 expression profile, SLM reveals three binary activities are not sufficient to explain the transcriptional regulation of endo16 expression and additional activities of binding sites are required. Further analyses suggest detailed mechanism of R switch activity where indirect dependency occurs in between UI activity and R switch during specification to differentiation stage. Conclusions/Significance The sequential logic formalism allows for a simplification of regulation network dynamics going from a continuous to a discrete representation of gene activation in time. In effect our SLM is non-parametric and model-independent, yet providing rich biological

  11. Sequential logic model deciphers dynamic transcriptional control of gene expressions.

    Directory of Open Access Journals (Sweden)

    Zhen Xuan Yeo

    Full Text Available BACKGROUND: Cellular signaling involves a sequence of events from ligand binding to membrane receptors through transcription factors activation and the induction of mRNA expression. The transcriptional-regulatory system plays a pivotal role in the control of gene expression. A novel computational approach to the study of gene regulation circuits is presented here. METHODOLOGY: Based on the concept of finite state machine, which provides a discrete view of gene regulation, a novel sequential logic model (SLM is developed to decipher control mechanisms of dynamic transcriptional regulation of gene expressions. The SLM technique is also used to systematically analyze the dynamic function of transcriptional inputs, the dependency and cooperativity, such as synergy effect, among the binding sites with respect to when, how much and how fast the gene of interest is expressed. PRINCIPAL FINDINGS: SLM is verified by a set of well studied expression data on endo16 of Strongylocentrotus purpuratus (sea urchin during the embryonic midgut development. A dynamic regulatory mechanism for endo16 expression controlled by three binding sites, UI, R and Otx is identified and demonstrated to be consistent with experimental findings. Furthermore, we show that during transition from specification to differentiation in wild type endo16 expression profile, SLM reveals three binary activities are not sufficient to explain the transcriptional regulation of endo16 expression and additional activities of binding sites are required. Further analyses suggest detailed mechanism of R switch activity where indirect dependency occurs in between UI activity and R switch during specification to differentiation stage. CONCLUSIONS/SIGNIFICANCE: The sequential logic formalism allows for a simplification of regulation network dynamics going from a continuous to a discrete representation of gene activation in time. In effect our SLM is non-parametric and model-independent, yet

  12. Confidence in Phase Definition for Periodicity in Genes Expression Time Series.

    Science.gov (United States)

    El Anbari, Mohammed; Fadda, Abeer; Ptitsyn, Andrey

    2015-01-01

    Circadian oscillation in baseline gene expression plays an important role in the regulation of multiple cellular processes. Most of the knowledge of circadian gene expression is based on studies measuring gene expression over time. Our ability to dissect molecular events in time is determined by the sampling frequency of such experiments. However, the real peaks of gene activity can be at any time on or between the time points at which samples are collected. Thus, some genes with a peak activity near the observation point have their phase of oscillation detected with better precision then those which peak between observation time points. Separating genes for which we can confidently identify peak activity from ambiguous genes can improve the analysis of time series gene expression. In this study we propose a new statistical method to quantify the phase confidence of circadian genes. The numerical performance of the proposed method has been tested using three real gene expression data sets.

  13. Sex hormones and gene expression signatures in peripheral blood from postmenopausal women - the NOWAC postgenome study

    Directory of Open Access Journals (Sweden)

    Rylander Charlotta

    2011-03-01

    Full Text Available Abstract Background Postmenopausal hormone therapy (HT influences endogenous hormone concentrations and increases the risk of breast cancer. Gene expression profiling may reveal the mechanisms behind this relationship. Our objective was to explore potential associations between sex hormones and gene expression in whole blood from a population-based, random sample of postmenopausal women Methods Gene expression, as measured by the Applied Biosystems microarray platform, was compared between hormone therapy (HT users and non-users and between high and low hormone plasma concentrations using both gene-wise analysis and gene set analysis. Gene sets found to be associated with HT use were further analysed for enrichment in functional clusters and network predictions. The gene expression matrix included 285 samples and 16185 probes and was adjusted for significant technical variables. Results Gene-wise analysis revealed several genes significantly associated with different types of HT use. The functional cluster analyses provided limited information on these genes. Gene set analysis revealed 22 gene sets that were enriched between high and low estradiol concentration (HT-users excluded. Among these were seven oestrogen related gene sets, including our gene list associated with systemic estradiol use, which thereby represents a novel oestrogen signature. Seven gene sets were related to immune response. Among the 15 gene sets enriched for progesterone, 11 overlapped with estradiol. No significant gene expression patterns were found for testosterone, follicle stimulating hormone (FSH or sex hormone binding globulin (SHBG. Conclusions Distinct gene expression patterns associated with sex hormones are detectable in a random group of postmenopausal women, as demonstrated by the finding of a novel oestrogen signature.

  14. Cloning and selection of reference genes for gene expression ...

    African Journals Online (AJOL)

    Full length mRNA sequences of Ac-β-actin and Ac-gapdh, and partial mRNA sequences of Ac-18SrRNA and Ac-ubiquitin were cloned from pineapple in this study. The four genes were tested as housekeeping genes in three experimental sets. GeNorm and NormFinder analysis revealed that β-actin was the most ...

  15. Gene expression analysis identifies global gene dosage sensitivity in cancer

    DEFF Research Database (Denmark)

    Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata

    2015-01-01

    Many cancer-associated somatic copy number alterations (SCNAs) are known. Currently, one of the challenges is to identify the molecular downstream effects of these variants. Although several SCNAs are known to change gene expression levels, it is not clear whether each individual SCNA affects gen...

  16. Stochastic biological response to radiation. Comprehensive analysis of gene expression

    International Nuclear Information System (INIS)

    Inoue, Tohru; Hirabayashi, Yoko

    2012-01-01

    Authors explain that the radiation effect on biological system is stochastic along the law of physics, differing from chemical effect, using instances of Cs-137 gamma-ray (GR) and benzene (BZ) exposures to mice and of resultant comprehensive analyses of gene expression. Single GR irradiation is done with Gamma Cell 40 (CSR) to C57BL/6 or C3H/He mouse at 0, 0.6 and 3 Gy. BE is given orally at 150 mg/kg/day for 5 days x 2 weeks. Bone marrow cells are sampled 1 month after the exposure. Comprehensive gene expression is analyzed by Gene Chip Mouse Genome 430 2.0 Array (Affymetrix) and data are processed by programs like case normalization, statistics, network generation, functional analysis etc. GR irradiation brings about changes of gene expression, which are classifiable in common genes variable commonly on the dose change and stochastic genes variable stochastically within each dose: e.g., with Welch-t-test, significant differences are between 0/3 Gy (dose-specific difference, 455 pbs (probe set), in stochastic 2113 pbs), 0/0.6 Gy (267 in 1284 pbs) and 0.6/3 Gy (532 pbs); and with one-way analysis of variation (ANOVA) and hierarchial/dendrographic analyses, 520 pbs are shown to involve the dose-dependent 226 and dose-specific 294 pbs. It is also shown that at 3 Gy, expression of common genes are rather suppressed, including those related to the proliferation/apoptosis of B/T cells, and of stochastic genes, related to cell division/signaling. Ven diagram of the common genes of above 520 pbs, stochastic 2113 pbs at 3 Gy and 1284 pbs at 0.6 Gy shows the overlapping genes 29, 2 and 4, respectively, indicating only 35 pbs are overlapping in total. Network analysis of changes by GR shows the rather high expression of genes around hub of cAMP response element binding protein (CREB) at 0.6 Gy, and rather variable expression around CREB hub/suppressed expression of kinesin hub at 3 Gy; in the network by BZ exposure, unchanged or low expression around p53 hub and suppression

  17. Metallothionein gene expression in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Deeksha Pal

    2014-01-01

    Full Text Available Introduction: Metallothioneins (MTs are a group of low-molecular weight, cysteine-rich proteins. In general, MT is known to modulate three fundamental processes: (1 the release of gaseous mediators such as hydroxyl radical or nitric oxide, (2 apoptosis and (3 the binding and exchange of heavy metals such as zinc, cadmium or copper. Previous studies have shown a positive correlation between the expression of MT with invasion, metastasis and poor prognosis in various cancers. Most of the previous studies primarily used immunohistochemistry to analyze localization of MT in renal cell carcinoma (RCC. No information is available on the gene expression of MT2A isoform in different types and grades of RCC. Materials and Methods: In the present study, total RNA was isolated from 38 histopathologically confirmed cases of RCC of different types and grades. Corresponding adjacent normal renal parenchyma was taken as control. Real-time polymerase chain reaction (RT PCR analysis was done for the MT2A gene expression using b-actin as an internal control. All statistical calculations were performed using SPSS software. Results: The MT2A gene expression was found to be significantly increased (P < 0.01 in clear cell RCC in comparison with the adjacent normal renal parenchyma. The expression of MT2A was two to three-fold higher in sarcomatoid RCC, whereas there was no change in papillary and collecting duct RCC. MT2A gene expression was significantly higher in lower grade (grades I and II, P < 0.05, while no change was observed in high-grade tumor (grade III and IV in comparison to adjacent normal renal tissue. Conclusion: The first report of the expression of MT2A in different types and grades of RCC and also these data further support the role of MT2A in tumorigenesis.

  18. Analysis of baseline gene expression levels from ...

    Science.gov (United States)

    The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv

  19. Gene expression of the endolymphatic sac

    DEFF Research Database (Denmark)

    Friis, Morten; Martin-Bertelsen, Tomas; Friis-Hansen, Lennart

    2011-01-01

    that the endolymphatic sac has multiple and diverse functions in the inner ear. Objectives:The objective of this study was to provide a comprehensive review of the genes expressed in the endolymphatic sac in the rat and perform a functional characterization based on measured mRNA abundance. Methods:Microarray technology...

  20. Gene expression in early stage cervical cancer

    NARCIS (Netherlands)

    Biewenga, Petra; Buist, Marrije R.; Moerland, Perry D.; van Thernaat, Emiel Ver Loren; van Kampen, Antoine H. C.; ten Kate, Fiebo J. W.; Baas, Frank

    2008-01-01

    Objective. Pelvic lymph node metastases are the main prognostic factor for survival in early stage cervical cancer, yet accurate detection methods before surgery are lacking. In this study, we examined whether gene expression profiling can predict the presence of lymph node metastasis in early stage

  1. Shrinkage Approach for Gene Expression Data Analysis

    Czech Academy of Sciences Publication Activity Database

    Haman, Jiří; Valenta, Zdeněk; Kalina, Jan

    2013-01-01

    Roč. 1, č. 1 (2013), s. 65-65 ISSN 1805-8698. [EFMI 2013 Special Topic Conference. 17.04.2013-19.04.2013, Prague] Institutional support: RVO:67985807 Keywords : shrinkage estimation * covariance matrix * high dimensional data * gene expression Subject RIV: IN - Informatics, Computer Science

  2. Identification of reference genes and validation for gene expression studies in diverse axolotl (Ambystoma mexicanum) tissues.

    Science.gov (United States)

    Guelke, Eileen; Bucan, Vesna; Liebsch, Christina; Lazaridis, Andrea; Radtke, Christine; Vogt, Peter M; Reimers, Kerstin

    2015-04-10

    For the precise quantitative RT-PCR normalization a set of valid reference genes is obligatory. Moreover have to be taken into concern the experimental conditions as they bias the regulation of reference genes. Up till now, no reference targets have been described for the axolotl (Ambystoma mexicanum). In a search in the public database SalSite for genetic information of the axolotl we identified fourteen presumptive reference genes, eleven of which were further tested for their gene expression stability. This study characterizes the expressional patterns of 11 putative endogenous control genes during axolotl limb regeneration and in an axolotl tissue panel. All 11 reference genes showed variable expression. Strikingly, ACTB was to be found most stable expressed in all comparative tissue groups, so we reason it to be suitable for all different kinds of axolotl tissue-type investigations. Moreover do we suggest GAPDH and RPLP0 as suitable for certain axolotl tissue analysis. When it comes to axolotl limb regeneration, a validated pair of reference genes is ODC and RPLP0. With these findings, new insights into axolotl gene expression profiling might be gained. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Regulation of methane genes and genome expression

    Energy Technology Data Exchange (ETDEWEB)

    John N. Reeve

    2009-09-09

    At the start of this project, it was known that methanogens were Archaeabacteria (now Archaea) and were therefore predicted to have gene expression and regulatory systems different from Bacteria, but few of the molecular biology details were established. The goals were then to establish the structures and organizations of genes in methanogens, and to develop the genetic technologies needed to investigate and dissect methanogen gene expression and regulation in vivo. By cloning and sequencing, we established the gene and operon structures of all of the “methane” genes that encode the enzymes that catalyze methane biosynthesis from carbon dioxide and hydrogen. This work identified unique sequences in the methane gene that we designated mcrA, that encodes the largest subunit of methyl-coenzyme M reductase, that could be used to identify methanogen DNA and establish methanogen phylogenetic relationships. McrA sequences are now the accepted standard and used extensively as hybridization probes to identify and quantify methanogens in environmental research. With the methane genes in hand, we used northern blot and then later whole-genome microarray hybridization analyses to establish how growth phase and substrate availability regulated methane gene expression in Methanobacterium thermautotrophicus ΔH (now Methanothermobacter thermautotrophicus). Isoenzymes or pairs of functionally equivalent enzymes catalyze several steps in the hydrogen-dependent reduction of carbon dioxide to methane. We established that hydrogen availability determine which of these pairs of methane genes is expressed and therefore which of the alternative enzymes is employed to catalyze methane biosynthesis under different environmental conditions. As were unable to establish a reliable genetic system for M. thermautotrophicus, we developed in vitro transcription as an alternative system to investigate methanogen gene expression and regulation. This led to the discovery that an archaeal protein

  4. Identification of a conserved set of upregulated genes in mouse skeletal muscle hypertrophy and regrowth.

    Science.gov (United States)

    Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J

    2015-01-01

    The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.

  5. Fluid Mechanics, Arterial Disease, and Gene Expression.

    Science.gov (United States)

    Tarbell, John M; Shi, Zhong-Dong; Dunn, Jessilyn; Jo, Hanjoong

    2014-01-01

    This review places modern research developments in vascular mechanobiology in the context of hemodynamic phenomena in the cardiovascular system and the discrete localization of vascular disease. The modern origins of this field are traced, beginning in the 1960s when associations between flow characteristics, particularly blood flow-induced wall shear stress, and the localization of atherosclerotic plaques were uncovered, and continuing to fluid shear stress effects on the vascular lining endothelial) cells (ECs), including their effects on EC morphology, biochemical production, and gene expression. The earliest single-gene studies and genome-wide analyses are considered. The final section moves from the ECs lining the vessel wall to the smooth muscle cells and fibroblasts within the wall that are fluid me chanically activated by interstitial flow that imposes shear stresses on their surfaces comparable with those of flowing blood on EC surfaces. Interstitial flow stimulates biochemical production and gene expression, much like blood flow on ECs.

  6. Gene Expression Commons: an open platform for absolute gene expression profiling.

    Directory of Open Access Journals (Sweden)

    Jun Seita

    Full Text Available Gene expression profiling using microarrays has been limited to comparisons of gene expression between small numbers of samples within individual experiments. However, the unknown and variable sensitivities of each probeset have rendered the absolute expression of any given gene nearly impossible to estimate. We have overcome this limitation by using a very large number (>10,000 of varied microarray data as a common reference, so that statistical attributes of each probeset, such as the dynamic range and threshold between low and high expression, can be reliably discovered through meta-analysis. This strategy is implemented in a web-based platform named "Gene Expression Commons" (https://gexc.stanford.edu/ which contains data of 39 distinct highly purified mouse hematopoietic stem/progenitor/differentiated cell populations covering almost the entire hematopoietic system. Since the Gene Expression Commons is designed as an open platform, investigators can explore the expression level of any gene, search by expression patterns of interest, submit their own microarray data, and design their own working models representing biological relationship among samples.

  7. Comparative gene expression of intestinal metabolizing enzymes.

    Science.gov (United States)

    Shin, Ho-Chul; Kim, Hye-Ryoung; Cho, Hee-Jung; Yi, Hee; Cho, Soo-Min; Lee, Dong-Goo; Abd El-Aty, A M; Kim, Jin-Suk; Sun, Duxin; Amidon, Gordon L

    2009-11-01

    The purpose of this study was to compare the expression profiles of drug-metabolizing enzymes in the intestine of mouse, rat and human. Total RNA was isolated from the duodenum and the mRNA expression was measured using Affymetrix GeneChip oligonucleotide arrays. Detected genes from the intestine of mouse, rat and human were ca. 60% of 22690 sequences, 40% of 8739 and 47% of 12559, respectively. Total genes of metabolizing enzymes subjected in this study were 95, 33 and 68 genes in mouse, rat and human, respectively. Of phase I enzymes, the mouse exhibited abundant gene expressions for Cyp3a25, Cyp4v3, Cyp2d26, followed by Cyp2b20, Cyp2c65 and Cyp4f14, whereas, the rat showed higher expression profiles of Cyp3a9, Cyp2b19, Cyp4f1, Cyp17a1, Cyp2d18, Cyp27a1 and Cyp4f6. However, the highly expressed P450 enzymes were CYP3A4, CYP3A5, CYP4F3, CYP2C18, CYP2C9, CYP2D6, CYP3A7, CYP11B1 and CYP2B6 in the human. For phase II enzymes, glucuronosyltransferase Ugt1a6, glutathione S-transferases Gstp1, Gstm3 and Gsta2, sulfotransferase Sult1b1 and acyltransferase Dgat1 were highly expressed in the mouse. The rat revealed predominant expression of glucuronosyltransferases Ugt1a1 and Ugt1a7, sulfotransferase Sult1b1, acetyltransferase Dlat and acyltransferase Dgat1. On the other hand, in human, glucuronosyltransferases UGT2B15 and UGT2B17, glutathione S-transferases MGST3, GSTP1, GSTA2 and GSTM4, sulfotransferases ST1A3 and SULT1A2, acetyltransferases SAT1 and CRAT, and acyltransferase AGPAT2 were dominantly detected. Therefore, current data indicated substantial interspecies differences in the pattern of intestinal gene expression both for P450 enzymes and phase II drug-metabolizing enzymes. This genomic database is expected to improve our understanding of interspecies variations in estimating intestinal prehepatic clearance of oral drugs.

  8. Automatic Control of Gene Expression in Mammalian Cells.

    Science.gov (United States)

    Fracassi, Chiara; Postiglione, Lorena; Fiore, Gianfranco; di Bernardo, Diego

    2016-04-15

    Automatic control of gene expression in living cells is paramount importance to characterize both endogenous gene regulatory networks and synthetic circuits. In addition, such a technology can be used to maintain the expression of synthetic circuit components in an optimal range in order to ensure reliable performance. Here we present a microfluidics-based method to automatically control gene expression from the tetracycline-inducible promoter in mammalian cells in real time. Our approach is based on the negative-feedback control engineering paradigm. We validated our method in a monoclonal population of cells constitutively expressing a fluorescent reporter protein (d2EYFP) downstream of a minimal CMV promoter with seven tet-responsive operator motifs (CMV-TET). These cells also constitutively express the tetracycline transactivator protein (tTA). In cells grown in standard growth medium, tTA is able to bind the CMV-TET promoter, causing d2EYFP to be maximally expressed. Upon addition of tetracycline to the culture medium, tTA detaches from the CMV-TET promoter, thus preventing d2EYFP expression. We tested two different model-independent control algorithms (relay and proportional-integral (PI)) to force a monoclonal population of cells to express an intermediate level of d2EYFP equal to 50% of its maximum expression level for up to 3500 min. The control input is either tetracycline-rich or standard growth medium. We demonstrated that both the relay and PI controllers can regulate gene expression at the desired level, despite oscillations (dampened in the case of the PI controller) around the chosen set point.

  9. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  10. Modeling gene expression measurement error: a quasi-likelihood approach

    Directory of Open Access Journals (Sweden)

    Strimmer Korbinian

    2003-03-01

    Full Text Available Abstract Background Using suitable error models for gene expression measurements is essential in the statistical analysis of microarray data. However, the true probabilistic model underlying gene expression intensity readings is generally not known. Instead, in currently used approaches some simple parametric model is assumed (usually a transformed normal distribution or the empirical distribution is estimated. However, both these strategies may not be optimal for gene expression data, as the non-parametric approach ignores known structural information whereas the fully parametric models run the risk of misspecification. A further related problem is the choice of a suitable scale for the model (e.g. observed vs. log-scale. Results Here a simple semi-parametric model for gene expression measurement error is presented. In this approach inference is based an approximate likelihood function (the extended quasi-likelihood. Only partial knowledge about the unknown true distribution is required to construct this function. In case of gene expression this information is available in the form of the postulated (e.g. quadratic variance structure of the data. As the quasi-likelihood behaves (almost like a proper likelihood, it allows for the estimation of calibration and variance parameters, and it is also straightforward to obtain corresponding approximate confidence intervals. Unlike most other frameworks, it also allows analysis on any preferred scale, i.e. both on the original linear scale as well as on a transformed scale. It can also be employed in regression approaches to model systematic (e.g. array or dye effects. Conclusions The quasi-likelihood framework provides a simple and versatile approach to analyze gene expression data that does not make any strong distributional assumptions about the underlying error model. For several simulated as well as real data sets it provides a better fit to the data than competing models. In an example it also

  11. Construction and use of gene expression covariation matrix

    Directory of Open Access Journals (Sweden)

    Bellis Michel

    2009-07-01

    Full Text Available Abstract Background One essential step in the massive analysis of transcriptomic profiles is the calculation of the correlation coefficient, a value used to select pairs of genes with similar or inverse transcriptional profiles across a large fraction of the biological conditions examined. Until now, the choice between the two available methods for calculating the coefficient has been dictated mainly by technological considerations. Specifically, in analyses based on double-channel techniques, researchers have been required to use covariation correlation, i.e. the correlation between gene expression changes measured between several pairs of biological conditions, expressed for example as fold-change. In contrast, in analyses of single-channel techniques scientists have been restricted to the use of coexpression correlation, i.e. correlation between gene expression levels. To our knowledge, nobody has ever examined the possible benefits of using covariation instead of coexpression in massive analyses of single channel microarray results. Results We describe here how single-channel techniques can be treated like double-channel techniques and used to generate both gene expression changes and covariation measures. We also present a new method that allows the calculation of both positive and negative correlation coefficients between genes. First, we perform systematic comparisons between two given biological conditions and classify, for each comparison, genes as increased (I, decreased (D, or not changed (N. As a result, the original series of n gene expression level measures assigned to each gene is replaced by an ordered string of n(n-1/2 symbols, e.g. IDDNNIDID....DNNNNNNID, with the length of the string corresponding to the number of comparisons. In a second step, positive and negative covariation matrices (CVM are constructed by calculating statistically significant positive or negative correlation scores for any pair of genes by comparing their

  12. Structure and expression of thyroglobulin gene

    Energy Technology Data Exchange (ETDEWEB)

    Vassart, G; Brocas, H; Christophe, D; de Martynoff, G; Leriche, A; Mercken, L; Pohl, V; van Heuverswyn, B [Institut de Recherche Interdisciplinaire en Biologie Humaine et Nucleaire (IRIBHN), Faculte de Medecine, Universite libre de Bruxelles, Campus Hopital Erasme, Brussels (Belgium)

    1982-01-01

    Thyroglobulin is composed of two 300000 dalton polypeptide chains, translated from an 8000 base mRNA. Preparation of a full length cDNA and its cloning in E. coli have lead to the demonstration that the polypeptides of thyroglobulin protomers were identical. Used as molecular probes, the cloned cDNA allowed the isolation of a fragment of thyroglobulin gene. Electron microscopic studies have demonstrated that this gene contains more than 90 % intronic material separating small size exons (<200 bp). Sequencing of bovine thyroglobulin structural gene is in progress. Preliminary results show evidence for the existence of repetitive segments. Availability of cloned DNA complementary to bovine and human thyroglobulin mRNA allows the study of genetic defects of thyroglobulin gene expression in the human and in various animal models.

  13. Cerebrovascular gene expression in spontaneously hypertensive rats

    DEFF Research Database (Denmark)

    Grell, Anne-Sofie; Frederiksen, Simona Denise; Edvinsson, Lars

    2017-01-01

    Hypertension is a hemodynamic disorder and one of the most important and well-established risk factors for vascular diseases such as stroke. Blood vessels exposed to chronic shear stress develop structural changes and remodeling of the vascular wall through many complex mechanisms. However......, the molecular mechanisms involved are not fully understood. Hypertension-susceptible genes may provide a novel insight into potential molecular mechanisms of hypertension and secondary complications associated with hypertension. The aim of this exploratory study was to identify gene expression differences......, the identified genes in the middle cerebral arteries from spontaneously hypertensive rats could be possible mediators of the vascular changes and secondary complications associated with hypertension. This study supports the selection of key genes to investigate in the future research of hypertension-induced end...

  14. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  15. The Molecular Signatures Database (MSigDB) hallmark gene set collection.

    Science.gov (United States)

    Liberzon, Arthur; Birger, Chet; Thorvaldsdóttir, Helga; Ghandi, Mahmoud; Mesirov, Jill P; Tamayo, Pablo

    2015-12-23

    The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.

  16. Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

    Science.gov (United States)

    Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

    2015-01-27

    Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.

  17. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato; Kuwahara, Hiroyuki; Yu, Ge; Guo, Lili; Gao, Xin

    2016-01-01

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  18. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato

    2016-08-25

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  19. Mechanism-based biomarker gene sets for glutathione depletion-related hepatotoxicity in rats

    International Nuclear Information System (INIS)

    Gao Weihua; Mizukawa, Yumiko; Nakatsu, Noriyuki; Minowa, Yosuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro

    2010-01-01

    Chemical-induced glutathione depletion is thought to be caused by two types of toxicological mechanisms: PHO-type glutathione depletion [glutathione conjugated with chemicals such as phorone (PHO) or diethyl maleate (DEM)], and BSO-type glutathione depletion [i.e., glutathione synthesis inhibited by chemicals such as L-buthionine-sulfoximine (BSO)]. In order to identify mechanism-based biomarker gene sets for glutathione depletion in rat liver, male SD rats were treated with various chemicals including PHO (40, 120 and 400 mg/kg), DEM (80, 240 and 800 mg/kg), BSO (150, 450 and 1500 mg/kg), and bromobenzene (BBZ, 10, 100 and 300 mg/kg). Liver samples were taken 3, 6, 9 and 24 h after administration and examined for hepatic glutathione content, physiological and pathological changes, and gene expression changes using Affymetrix GeneChip Arrays. To identify differentially expressed probe sets in response to glutathione depletion, we focused on the following two courses of events for the two types of mechanisms of glutathione depletion: a) gene expression changes occurring simultaneously in response to glutathione depletion, and b) gene expression changes after glutathione was depleted. The gene expression profiles of the identified probe sets for the two types of glutathione depletion differed markedly at times during and after glutathione depletion, whereas Srxn1 was markedly increased for both types as glutathione was depleted, suggesting that Srxn1 is a key molecule in oxidative stress related to glutathione. The extracted probe sets were refined and verified using various compounds including 13 additional positive or negative compounds, and they established two useful marker sets. One contained three probe sets (Akr7a3, Trib3 and Gstp1) that could detect conjugation-type glutathione depletors any time within 24 h after dosing, and the other contained 14 probe sets that could detect glutathione depletors by any mechanism. These two sets, with appropriate scoring

  20. Global gene expression in Escherichia coli biofilms

    DEFF Research Database (Denmark)

    Schembri, Mark; Kjærgaard, K.; Klemm, Per

    2003-01-01

    It is now apparent that microorganisms undergo significant changes during the transition from planktonic to biofilm growth. These changes result in phenotypic adaptations that allow the formation of highly organized and structured sessile communities, which possess enhanced resistance to antimicr......It is now apparent that microorganisms undergo significant changes during the transition from planktonic to biofilm growth. These changes result in phenotypic adaptations that allow the formation of highly organized and structured sessile communities, which possess enhanced resistance...... the transition to biofilm growth, and these included genes expressed under oxygen-limiting conditions, genes encoding (putative) transport proteins, putative oxidoreductases and genes associated with enhanced heavy metal resistance. Of particular interest was the observation that many of the genes altered...... in expression have no current defined function. These genes, as well as those induced by stresses relevant to biofilm growth such as oxygen and nutrient limitation, may be important factors that trigger enhanced resistance mechanisms of sessile communities to antibiotics and hydrodynamic shear forces....

  1. Aberrant Gene Expression in Acute Myeloid Leukaemia

    DEFF Research Database (Denmark)

    Bagger, Frederik Otzen

    model to investigate the role of telomerase in AML, we were able to translate the observed effect into human AML patients and identify specific genes involved, which also predict survival patterns in AML patients. During these studies we have applied methods for investigating differentially expressed......-based gene-lookup webservices, called HemaExplorer and BloodSpot. These web-services support the aim of making data and analysis of haematopoietic cells from mouse and human accessible for researchers without bioinformatics expertise. Finally, in order to aid the analysis of the very limited number...

  2. Integrating mean and variance heterogeneities to identify differentially expressed genes.

    Science.gov (United States)

    Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

    2016-12-06

    In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment

  3. Gene expression in Pseudomonas aeruginosa swarming motility

    Directory of Open Access Journals (Sweden)

    Déziel Eric

    2010-10-01

    Full Text Available Abstract Background The bacterium Pseudomonas aeruginosa is capable of three types of motilities: swimming, twitching and swarming. The latter is characterized by a fast and coordinated group movement over a semi-solid surface resulting from intercellular interactions and morphological differentiation. A striking feature of swarming motility is the complex fractal-like patterns displayed by migrating bacteria while they move away from their inoculation point. This type of group behaviour is still poorly understood and its characterization provides important information on bacterial structured communities such as biofilms. Using GeneChip® Affymetrix microarrays, we obtained the transcriptomic profiles of both bacterial populations located at the tip of migrating tendrils and swarm center of swarming colonies and compared these profiles to that of a bacterial control population grown on the same media but solidified to not allow swarming motility. Results Microarray raw data were corrected for background noise with the RMA algorithm and quantile normalized. Differentially expressed genes between the three conditions were selected using a threshold of 1.5 log2-fold, which gave a total of 378 selected genes (6.3% of the predicted open reading frames of strain PA14. Major shifts in gene expression patterns are observed in each growth conditions, highlighting the presence of distinct bacterial subpopulations within a swarming colony (tendril tips vs. swarm center. Unexpectedly, microarrays expression data reveal that a minority of genes are up-regulated in tendril tip populations. Among them, we found energy metabolism, ribosomal protein and transport of small molecules related genes. On the other hand, many well-known virulence factors genes were globally repressed in tendril tip cells. Swarm center cells are distinct and appear to be under oxidative and copper stress responses. Conclusions Results reported in this study show that, as opposed to

  4. Decomposition of gene expression state space trajectories.

    Directory of Open Access Journals (Sweden)

    Jessica C Mar

    2009-12-01

    Full Text Available Representing and analyzing complex networks remains a roadblock to creating dynamic network models of biological processes and pathways. The study of cell fate transitions can reveal much about the transcriptional regulatory programs that underlie these phenotypic changes and give rise to the coordinated patterns in expression changes that we observe. The application of gene expression state space trajectories to capture cell fate transitions at the genome-wide level is one approach currently used in the literature. In this paper, we analyze the gene expression dataset of Huang et al. (2005 which follows the differentiation of promyelocytes into neutrophil-like cells in the presence of inducers dimethyl sulfoxide and all-trans retinoic acid. Huang et al. (2005 build on the work of Kauffman (2004 who raised the attractor hypothesis, stating that cells exist in an expression landscape and their expression trajectories converge towards attractive sites in this landscape. We propose an alternative interpretation that explains this convergent behavior by recognizing that there are two types of processes participating in these cell fate transitions-core processes that include the specific differentiation pathways of promyelocytes to neutrophils, and transient processes that capture those pathways and responses specific to the inducer. Using functional enrichment analyses, specific biological examples and an analysis of the trajectories and their core and transient components we provide a validation of our hypothesis using the Huang et al. (2005 dataset.

  5. Gene set-based module discovery in the breast cancer transcriptome

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2009-02-01

    Full Text Available Abstract Background Although microarray-based studies have revealed global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which is defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods are not applied cancer transcriptome data. Results In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtypes of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2 is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells. Conclusion These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

  6. Combining gene expression data from different generations of oligonucleotide arrays

    Directory of Open Access Journals (Sweden)

    Kong Sek

    2004-10-01

    Full Text Available Abstract Background One of the important challenges in microarray analysis is to take full advantage of previously accumulated data, both from one's own laboratory and from public repositories. Through a comparative analysis on a variety of datasets, a more comprehensive view of the underlying mechanism or structure can be obtained. However, as we discover in this work, continual changes in genomic sequence annotations and probe design criteria make it difficult to compare gene expression data even from different generations of the same microarray platform. Results We first describe the extent of discordance between the results derived from two generations of Affymetrix oligonucleotide arrays, as revealed in cluster analysis and in identification of differentially expressed genes. We then propose a method for increasing comparability. The dataset we use consists of a set of 14 human muscle biopsy samples from patients with inflammatory myopathies that were hybridized on both HG-U95Av2 and HG-U133A human arrays. We find that the use of the probe set matching table for comparative analysis provided by Affymetrix produces better results than matching by UniGene or LocusLink identifiers but still remains inadequate. Rescaling of expression values for each gene across samples and data filtering by expression values enhance comparability but only for few specific analyses. As a generic method for improving comparability, we select a subset of probes with overlapping sequence segments in the two array types and recalculate expression values based only on the selected probes. We show that this filtering of probes significantly improves the comparability while retaining a sufficient number of probe sets for further analysis. Conclusions Compatibility between high-density oligonucleotide arrays is significantly affected by probe-level sequence information. With a careful filtering of the probes based on their sequence overlaps, data from different

  7. Gene-expression signatures of Atlantic salmon's plastic life cycle.

    Science.gov (United States)

    Aubin-Horth, Nadia; Letcher, Benjamin H; Hofmann, Hans A

    2009-09-15

    How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated-at similar magnitudes, yet in opposite direction-in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators.

  8. Gene-expression signatures of Atlantic salmon's plastic life cycle

    Science.gov (United States)

    Aubin-Horth, N.; Letcher, B.H.; Hofmann, H.A.

    2009-01-01

    How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated-at similar magnitudes, yet in opposite direction-in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators. ?? 2009 Elsevier Inc. All rights reserved.

  9. Spatial reconstruction of single-cell gene expression data.

    Science.gov (United States)

    Satija, Rahul; Farrell, Jeffrey A; Gennert, David; Schier, Alexander F; Regev, Aviv

    2015-05-01

    Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.

  10. Discovery of cancer common and specific driver gene sets

    Science.gov (United States)

    2017-01-01

    Abstract Cancer is known as a disease mainly caused by gene alterations. Discovery of mutated driver pathways or gene sets is becoming an important step to understand molecular mechanisms of carcinogenesis. However, systematically investigating commonalities and specificities of driver gene sets among multiple cancer types is still a great challenge, but this investigation will undoubtedly benefit deciphering cancers and will be helpful for personalized therapy and precision medicine in cancer treatment. In this study, we propose two optimization models to de novo discover common driver gene sets among multiple cancer types (ComMDP) and specific driver gene sets of one certain or multiple cancer types to other cancers (SpeMDP), respectively. We first apply ComMDP and SpeMDP to simulated data to validate their efficiency. Then, we further apply these methods to 12 cancer types from The Cancer Genome Atlas (TCGA) and obtain several biologically meaningful driver pathways. As examples, we construct a common cancer pathway model for BRCA and OV, infer a complex driver pathway model for BRCA carcinogenesis based on common driver gene sets of BRCA with eight cancer types, and investigate specific driver pathways of the liquid cancer lymphoblastic acute myeloid leukemia (LAML) versus other solid cancer types. In these processes more candidate cancer genes are also found. PMID:28168295

  11. Gene expression profile associated with radioresistance and malignancy in melanoma

    International Nuclear Information System (INIS)

    Ibañez, I.L.; Molinari, B.; Notcovich, C.; García, F.M.; Bracalente, C.; Zuccato, C.F.; Durán, H.

    2015-01-01

    The incidence of melanoma has substantially increased over the last decades. Melanomas respond poorly to treatments and no effective therapy exists to inhibit its metastatic spread. The aim of this study was to evaluate the association between radioresistance of melanoma cells and malignancy. A melanoma model developed in our laboratory from A375 human amelanotic melanoma cells was used. It consists in two catalase-overexpressing cell lines with the same genetic background, but with different phenotypes: A375-A7, melanotic and non-invasive and A375-G10, amelanotic and metastatic; and A375-PCDNA3 (transfected with empty plasmid) as control. Radiosensitivity was determined by clonogenic assay after irradiating these cells with a “1”3”7 Cs gamma source. Survival curves were fitted to the linear-quadratic model and surviving fraction at 2 Gy (SF2) was calculated. Results showed that A375-G10 cells were significantly more radioresistant than both A375-A7 and control cells, demonstrated by SF2 and α parameter of survival curves: SF2=0.32±0.03, 0.43±0.16 and 0.89±0.05 and α=0.45±0.05, 0.20±0.05 and 0 for A375-PCDNA3, A375-A7 and A375-G10 respectively. Bioinformatic analysis of whole genome expression microarrays data (Affymetrix) from these cells was performed. A priori defined gene sets associated with cell cycle, apoptosis and MAPK signaling pathway were collected from KEGG (Kyoto Encyclopedia of Genes and Genomes) to evaluate significant differences in gene set expression between cells by GSEA (Gene Set Enrichment Analysis). A375-G10 showed significant decrease in the expression of genes related to DNA damage response (ATM, TP53BP1 and MRE11A) compared to A375-A7 and controls. Moreover, A375-G10 exhibited down-regulated gene sets that are involved in DNA repair, checkpoint and negative regulation of cell cycle and apoptosis. In conclusion, A375-G10 gene expression profile could be involved in radioresistance mechanisms of these cells. Thus, this expression

  12. Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko; Harushima, Yoshiaki; Fujisawa, Hironori; Mochizuki, Takako; Fujita, Masahiro; Ohyanagi, Hajime; Kurata, Nori

    2015-01-01

    Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue

  13. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

    Directory of Open Access Journals (Sweden)

    Sahra Uygun

    2016-12-01

    Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.

  14. CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

    Science.gov (United States)

    Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

    2018-03-01

    In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and

  15. VE-Cadherin–Mediated Epigenetic Regulation of Endothelial Gene Expression

    Science.gov (United States)

    Morini, Marco F.; Giampietro, Costanza; Corada, Monica; Pisati, Federica; Lavarone, Elisa; Cunha, Sara I.; Conze, Lei L.; O’Reilly, Nicola; Joshi, Dhira; Kjaer, Svend; George, Roger; Nye, Emma; Ma, Anqi; Jin, Jian; Mitter, Richard; Lupia, Michela; Cavallaro, Ugo; Pasini, Diego; Calado, Dinis P.

    2018-01-01

    Rationale: The mechanistic foundation of vascular maturation is still largely unknown. Several human pathologies are characterized by deregulated angiogenesis and unstable blood vessels. Solid tumors, for instance, get their nourishment from newly formed structurally abnormal vessels which present wide and irregular interendothelial junctions. Expression and clustering of the main endothelial-specific adherens junction protein, VEC (vascular endothelial cadherin), upregulate genes with key roles in endothelial differentiation and stability. Objective: We aim at understanding the molecular mechanisms through which VEC triggers the expression of a set of genes involved in endothelial differentiation and vascular stabilization. Methods and Results: We compared a VEC-null cell line with the same line reconstituted with VEC wild-type cDNA. VEC expression and clustering upregulated endothelial-specific genes with key roles in vascular stabilization including claudin-5, vascular endothelial-protein tyrosine phosphatase (VE-PTP), and von Willebrand factor (vWf). Mechanistically, VEC exerts this effect by inhibiting polycomb protein activity on the specific gene promoters. This is achieved by preventing nuclear translocation of FoxO1 (Forkhead box protein O1) and β-catenin, which contribute to PRC2 (polycomb repressive complex-2) binding to promoter regions of claudin-5, VE-PTP, and vWf. VEC/β-catenin complex also sequesters a core subunit of PRC2 (Ezh2 [enhancer of zeste homolog 2]) at the cell membrane, preventing its nuclear translocation. Inhibition of Ezh2/VEC association increases Ezh2 recruitment to claudin-5, VE-PTP, and vWf promoters, causing gene downregulation. RNA sequencing comparison of VEC-null and VEC-positive cells suggested a more general role of VEC in activating endothelial genes and triggering a vascular stability-related gene expression program. In pathological angiogenesis of human ovarian carcinomas, reduced VEC expression paralleled decreased

  16. VE-Cadherin-Mediated Epigenetic Regulation of Endothelial Gene Expression.

    Science.gov (United States)

    Morini, Marco F; Giampietro, Costanza; Corada, Monica; Pisati, Federica; Lavarone, Elisa; Cunha, Sara I; Conze, Lei L; O'Reilly, Nicola; Joshi, Dhira; Kjaer, Svend; George, Roger; Nye, Emma; Ma, Anqi; Jin, Jian; Mitter, Richard; Lupia, Michela; Cavallaro, Ugo; Pasini, Diego; Calado, Dinis P; Dejana, Elisabetta; Taddei, Andrea

    2018-01-19

    The mechanistic foundation of vascular maturation is still largely unknown. Several human pathologies are characterized by deregulated angiogenesis and unstable blood vessels. Solid tumors, for instance, get their nourishment from newly formed structurally abnormal vessels which present wide and irregular interendothelial junctions. Expression and clustering of the main endothelial-specific adherens junction protein, VEC (vascular endothelial cadherin), upregulate genes with key roles in endothelial differentiation and stability. We aim at understanding the molecular mechanisms through which VEC triggers the expression of a set of genes involved in endothelial differentiation and vascular stabilization. We compared a VEC-null cell line with the same line reconstituted with VEC wild-type cDNA. VEC expression and clustering upregulated endothelial-specific genes with key roles in vascular stabilization including claudin-5 , vascular endothelial-protein tyrosine phosphatase ( VE-PTP ), and von Willebrand factor ( vWf ). Mechanistically, VEC exerts this effect by inhibiting polycomb protein activity on the specific gene promoters. This is achieved by preventing nuclear translocation of FoxO1 (Forkhead box protein O1) and β-catenin, which contribute to PRC2 (polycomb repressive complex-2) binding to promoter regions of claudin-5 , VE-PTP , and vWf . VEC/β-catenin complex also sequesters a core subunit of PRC2 (Ezh2 [enhancer of zeste homolog 2]) at the cell membrane, preventing its nuclear translocation. Inhibition of Ezh2/VEC association increases Ezh2 recruitment to claudin-5 , VE-PTP , and vWf promoters, causing gene downregulation. RNA sequencing comparison of VEC-null and VEC-positive cells suggested a more general role of VEC in activating endothelial genes and triggering a vascular stability-related gene expression program. In pathological angiogenesis of human ovarian carcinomas, reduced VEC expression paralleled decreased levels of claudin-5 and VE-PTP. These

  17. Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.

    Directory of Open Access Journals (Sweden)

    Aldrin Kay-Yuen Yim

    Full Text Available Differential gene expression profiles often provide important clues for gene functions. While reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR is an important tool, the validity of the results depends heavily on the choice of proper reference genes. In this study, we employed new and published RNA-sequencing (RNA-Seq datasets (26 sequencing libraries in total to evaluate reference genes reported in previous soybean studies. In silico PCR showed that 13 out of 37 previously reported primer sets have multiple targets, and 4 of them have amplicons with different sizes. Using a probabilistic approach, we identified new and improved candidate reference genes. We further performed 2 validation tests (with 26 RNA samples on 8 commonly used reference genes and 7 newly identified candidates, using RT-qPCR. In general, the new candidate reference genes exhibited more stable expression levels under the tested experimental conditions. The three newly identified candidate reference genes Bic-C2, F-box protein2, and VPS-like gave the best overall performance, together with the commonly used ELF1b. It is expected that the proposed probabilistic model could serve as an important tool to identify stable reference genes when more soybean RNA-Seq data from different growth stages and treatments are used.

  18. Mechanisms of gap gene expression canalization in the Drosophila blastoderm

    Directory of Open Access Journals (Sweden)

    Samsonova Maria G

    2011-07-01

    Full Text Available Abstract Background Extensive variation in early gap gene expression in the Drosophila blastoderm is reduced over time because of gap gene cross regulation. This phenomenon is a manifestation of canalization, the ability of an organism to produce a consistent phenotype despite variations in genotype or environment. The canalization of gap gene expression can be understood as arising from the actions of attractors in the gap gene dynamical system. Results In order to better understand the processes of developmental robustness and canalization in the early Drosophila embryo, we investigated the dynamical effects of varying spatial profiles of Bicoid protein concentration on the formation of the expression border of the gap gene hunchback. At several positions on the anterior-posterior axis of the embryo, we analyzed attractors and their basins of attraction in a dynamical model describing expression of four gap genes with the Bicoid concentration profile accounted as a given input in the model equations. This model was tested against a family of Bicoid gradients obtained from individual embryos. These gradients were normalized by two independent methods, which are based on distinct biological hypotheses and provide different magnitudes for Bicoid spatial variability. We showed how the border formation is dictated by the biological initial conditions (the concentration gradient of maternal Hunchback protein being attracted to specific attracting sets in a local vicinity of the border. Different types of these attracting sets (point attractors or one dimensional attracting manifolds define several possible mechanisms of border formation. The hunchback border formation is associated with intersection of the spatial gradient of the maternal Hunchback protein and a boundary between the attraction basins of two different point attractors. We demonstrated how the positional variability for hunchback is related to the corresponding variability of the

  19. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    International Nuclear Information System (INIS)

    Salem, Tamer Z.; Zhang, Fengrui; Thiem, Suzanne M.

    2013-01-01

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  20. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Salem, Tamer Z. [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbial Molecular Biology, AGERI, Agricultural Research Center, Giza 12619 (Egypt); Division of Biomedical Sciences, Zewail University, Zewail City of Science and Technology, Giza 12588 (Egypt); Zhang, Fengrui [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Thiem, Suzanne M., E-mail: smthiem@msu.edu [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824 (United States)

    2013-01-20

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  1. Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

    Science.gov (United States)

    McMillan, Mary; Pereg, Lily

    2014-01-01

    Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data.

  2. Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

    Directory of Open Access Journals (Sweden)

    Mary McMillan

    Full Text Available Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA is sufficient for effective normalisation of qRT-PCR data.

  3. Identification of suitable reference genes for gene expression studies of shoulder instability.

    Directory of Open Access Journals (Sweden)

    Mariana Ferreira Leal

    Full Text Available Shoulder instability is a common shoulder injury, and patients present with plastic deformation of the glenohumeral capsule. Gene expression analysis may be a useful tool for increasing the general understanding of capsule deformation, and reverse-transcription quantitative polymerase chain reaction (RT-qPCR has become an effective method for such studies. Although RT-qPCR is highly sensitive and specific, it requires the use of suitable reference genes for data normalization to guarantee meaningful and reproducible results. In the present study, we evaluated the suitability of a set of reference genes using samples from the glenohumeral capsules of individuals with and without shoulder instability. We analyzed the expression of six commonly used reference genes (ACTB, B2M, GAPDH, HPRT1, TBP and TFRC in the antero-inferior, antero-superior and posterior portions of the glenohumeral capsules of cases and controls. The stability of the candidate reference gene expression was determined using four software packages: NormFinder, geNorm, BestKeeper and DataAssist. Overall, HPRT1 was the best single reference gene, and HPRT1 and B2M composed the best pair of reference genes from different analysis groups, including simultaneous analysis of all tissue samples. GenEx software was used to identify the optimal number of reference genes to be used for normalization and demonstrated that the accumulated standard deviation resulting from the use of 2 reference genes was similar to that resulting from the use of 3 or more reference genes. To identify the optimal combination of reference genes, we evaluated the expression of COL1A1. Although the use of different reference gene combinations yielded variable normalized quantities, the relative quantities within sample groups were similar and confirmed that no obvious differences were observed when using 2, 3 or 4 reference genes. Consequently, the use of 2 stable reference genes for normalization, especially

  4. Impact of methoxyacetic acid on mouse Leydig cell gene expression

    Directory of Open Access Journals (Sweden)

    Waxman David J

    2010-06-01

    Full Text Available Abstract Background Methoxyacetic acid (MAA is the active metabolite of the widely used industrial chemical ethylene glycol monomethyl ether, which is associated with various developmental and reproductive toxicities, including neural toxicity, blood and immune disorders, limb degeneration and testicular toxicity. Testicular toxicity is caused by degeneration of germ cells in association with changes in gene expression in both germ cells and Sertoli cells of the testis. This study investigates the impact of MAA on gene expression in testicular Leydig cells, which play a critical role in germ cell survival and male reproductive function. Methods Cultured mouse TM3 Leydig cells were treated with MAA for 3, 8, and 24 h and changes in gene expression were monitored by genome-wide transcriptional profiling. Results A total of 3,912 MAA-responsive genes were identified. Ingenuity Pathway analysis identified reproductive system disease, inflammatory disease and connective tissue disorder as the top biological functions affected by MAA. The MAA-responsive genes were classified into 1,366 early responders, 1,387 mid-responders, and 1,138 late responders, based on the time required for MAA to elicit a response. Analysis of enriched functional clusters for each subgroup identified 106 MAA early response genes involved in transcription regulation, including 32 genes associated with developmental processes. 60 DNA-binding proteins responded to MAA rapidly but transiently, and may contribute to the downstream effects of MAA seen for many mid and late response genes. Genes within the phosphatidylinositol/phospholipase C/calcium signaling pathway, whose activity is required for potentiation of nuclear receptor signaling by MAA, were also enriched in the set of early MAA response genes. In contrast, many of the genes responding to MAA at later time points encode membrane proteins that contribute to cell adhesion and membrane signaling. Conclusions These findings

  5. Sex Differences in Drosophila Somatic Gene Expression: Variation and Regulation by doublesex

    Directory of Open Access Journals (Sweden)

    Michelle N. Arbeitman

    2016-07-01

    Full Text Available Sex differences in gene expression have been widely studied in Drosophila melanogaster. Sex differences vary across strains, but many molecular studies focus on only a single strain, or on genes that show sexually dimorphic expression in many strains. How extensive variability is and whether this variability occurs among genes regulated by sex determination hierarchy terminal transcription factors is unknown. To address these questions, we examine differences in sexually dimorphic gene expression between two strains in Drosophila adult head tissues. We also examine gene expression in doublesex (dsx mutant strains to determine which sex-differentially expressed genes are regulated by DSX, and the mode by which DSX regulates expression. We find substantial variation in sex-differential expression. The sets of genes with sexually dimorphic expression in each strain show little overlap. The prevalence of different DSX regulatory modes also varies between the two strains. Neither the patterns of DSX DNA occupancy, nor mode of DSX regulation explain why some genes show consistent sex-differential expression across strains. We find that the genes identified as regulated by DSX in this study are enriched with known sites of DSX DNA occupancy. Finally, we find that sex-differentially expressed genes and genes regulated by DSX are highly enriched on the fourth chromosome. These results provide insights into a more complete pool of potential DSX targets, as well as revealing the molecular flexibility of DSX regulation.

  6. A gene expression signature associated with survival in metastatic melanoma

    Science.gov (United States)

    Mandruzzato, Susanna; Callegaro, Andrea; Turcatel, Gianluca; Francescato, Samuela; Montesco, Maria C; Chiarion-Sileni, Vanna; Mocellin, Simone; Rossi, Carlo R; Bicciato, Silvio; Wang, Ena; Marincola, Francesco M; Zanovello, Paola

    2006-01-01

    Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM) to identify genes associated with patient survival, and supervised principal components (SPC) to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells. PMID:17129373

  7. A gene expression signature associated with survival in metastatic melanoma

    Directory of Open Access Journals (Sweden)

    Rossi Carlo R

    2006-11-01

    Full Text Available Abstract Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM to identify genes associated with patient survival, and supervised principal components (SPC to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells.

  8. Symbiont modulates expression of specific gene categories in Angomonas deanei

    Directory of Open Access Journals (Sweden)

    Luciana Loureiro Penha

    Full Text Available Trypanosomatids are parasites that cause disease in humans, animals, and plants. Most are non-pathogenic and some harbor a symbiotic bacterium. Endosymbiosis is part of the evolutionary process of vital cell functions such as respiration and photosynthesis. Angomonas deanei is an example of a symbiont-containing trypanosomatid. In this paper, we sought to investigate how symbionts influence host cells by characterising and comparing the transcriptomes of the symbiont-containing A. deanei (wild type and the symbiont-free aposymbiotic strains. The comparison revealed that the presence of the symbiont modulates several differentially expressed genes. Empirical analysis of differential gene expression showed that 216 of the 7625 modulated genes were significantly changed. Finally, gene set enrichment analysis revealed that the largest categories of genes that downregulated in the absence of the symbiont were those involved in oxidation-reduction process, ATP hydrolysis coupled proton transport and glycolysis. In contrast, among the upregulated gene categories were those involved in proteolysis, microtubule-based movement, and cellular metabolic process. Our results provide valuable information for dissecting the mechanism of endosymbiosis in A. deanei.

  9. Development of a radiation-responsive gene expression system

    International Nuclear Information System (INIS)

    Ogawa, Ryohei; Morii, Akihiro; Watanabe, Akihiko

    2013-01-01

    We have obtained a promoter enhancing expression of a gene of our interest connected downstream after activation in response to radiation stimulation and it could be used in radiogenetic therapy, a combination between radiotherapy and gene therapy. The promoter has been chosen out of a library of DNA fragments constructed by connecting the TATA box to randomly combined binding sequences of transcription factors that are activated in response to radiation. Although it was shown that the promoter activation was cell type specific, it turned out that radiation responsive promoters could be obtained for a different type of cells by using another set of transcription factor binding sequences, suggesting that the method would be feasible to obtain promoters functioning in any type of cells. Radiation reactivity of obtained promoters could be improved by techniques such as random introduction of point mutations. The improved promoters significantly enhanced expression of the luciferase gene connected downstream in response to radiation even in vivo, in addition, a gene cassette composed of one such promoter and the fcy::fur gene was confirmed useful for suicide gene therapy as shown in vitro simulation experiment, suggesting possible clinical application. (author)

  10. Gene Expression Profiling of Xeroderma Pigmentosum

    Directory of Open Access Journals (Sweden)

    Bowden Nikola A

    2006-05-01

    Full Text Available Abstract Xeroderma pigmentosum (XP is a rare recessive disorder that is characterized by extreme sensitivity to UV light. UV light exposure results in the formation of DNA damage such as cyclobutane dimers and (6-4 photoproducts. Nucleotide excision repair (NER orchestrates the removal of cyclobutane dimers and (6-4 photoproducts as well as some forms of bulky chemical DNA adducts. The disease XP is comprised of 7 complementation groups (XP-A to XP-G, which represent functional deficiencies in seven different genes, all of which are believed to be involved in NER. The main clinical feature of XP is various forms of skin cancers; however, neurological degeneration is present in XPA, XPB, XPD and XPG complementation groups. The relationship between NER and other types of DNA repair processes is now becoming evident but the exact relationships between the different complementation groups remains to be precisely determined. Using gene expression analysis we have identified similarities and differences after UV light exposure between the complementation groups XP-A, XP-C, XP-D, XP-E, XP-F, XP-G and an unaffected control. The results reveal that there is a graded change in gene expression patterns between the mildest, most similar to the control response (XP-E and the severest form (XP-A of the disease, with the exception of XP-D. Distinct differences between the complementation groups with neurological symptoms (XP-A, XP-D and XP-G and without (XP-C, XP-E and XP-F were also identified. Therefore, this analysis has revealed distinct gene expression profiles for the XP complementation groups and the first step towards understanding the neurological symptoms of XP.

  11. A comparative analysis of biclustering algorithms for gene expression data

    Science.gov (United States)

    Eren, Kemal; Deveci, Mehmet; Küçüktunç, Onur; Çatalyürek, Ümit V.

    2013-01-01

    The need to analyze high-dimension biological data is driving the development of new data mining methods. Biclustering algorithms have been successfully applied to gene expression data to discover local patterns, in which a subset of genes exhibit similar expression levels over a subset of conditions. However, it is not clear which algorithms are best suited for this task. Many algorithms have been published in the past decade, most of which have been compared only to a small number of algorithms. Surveys and comparisons exist in the literature, but because of the large number and variety of biclustering algorithms, they are quickly outdated. In this article we partially address this problem of evaluating the strengths and weaknesses of existing biclustering methods. We used the BiBench package to compare 12 algorithms, many of which were recently published or have not been extensively studied. The algorithms were tested on a suite of synthetic data sets to measure their performance on data with varying conditions, such as different bicluster models, varying noise, varying numbers of biclusters and overlapping biclusters. The algorithms were also tested on eight large gene expression data sets obtained from the Gene Expression Omnibus. Gene Ontology enrichment analysis was performed on the resulting biclusters, and the best enrichment terms are reported. Our analyses show that the biclustering method and its parameters should be selected based on the desired model, whether that model allows overlapping biclusters, and its robustness to noise. In addition, we observe that the biclustering algorithms capable of finding more than one model are more successful at capturing biologically relevant clusters. PMID:22772837

  12. Genomewide analysis of gene expression associated with Tcof1 in mouse neuroblastoma

    International Nuclear Information System (INIS)

    Mogass, Michael; York, Timothy P.; Li, Lin; Rujirabanjerd, Sinitdhorn; Shiang, Rita

    2004-01-01

    Mutations in the Treacher Collins syndrome gene, TCOF1, cause a disorder of craniofacial development. We manipulated the levels of Tcof1 and its protein treacle in a murine neuroblastoma cell line to identify downstream changes in gene expression using a microarray platform. We identified a set of genes that have similar expression with Tcof1 as well as a set of genes that are negatively correlated with Tcof1 expression. We also showed that the level of Tcof1 and treacle expression is downregulated during differentiation of neuroblastoma cells into neuronal cells. Inhibition of Tcof1 expression by siRNA induced morphological changes in neuroblastoma cells that mimic differentiation. Thus, expression of Tcof1 and treacle synthesis play an important role in the proliferation of neuroblastoma cells and we have identified genes that may be important in this pathway

  13. Changes in gene expression following androgen receptor blockade ...

    Indian Academy of Sciences (India)

    Madhu urs

    of gene expression in the ventral prostate, it is not clear whether all the gene expression ... These include clusterin, methionine adenosyl transferase IIα, and prostate-specific ..... MAGEE1 melanoma antigen and no similarity was found with the ...

  14. Rubisco activity and gene expression of tropical tree species under ...

    African Journals Online (AJOL)

    Young

    2013-05-15

    May 15, 2013 ... Proteomics analysis associated with gene expression of plants reveal .... Consequently, Rubisco enzyme plays a role in assi- milating into ... technique for examining gene expression encoded at the. mRNA level .... Ammonia.

  15. Gene structure, phylogeny and expression profile of the sucrose ...

    Indian Academy of Sciences (India)

    Gene structure, phylogeny and expression profile of the sucrose synthase gene family in .... 24, 701–713. Bate N. and Twell D. 1998 Functional architecture of a late pollen .... Manzara T. and Gruissem W. 1988 Organization and expression.

  16. Cholinergic regulation of VIP gene expression in human neuroblastoma cells

    DEFF Research Database (Denmark)

    Kristensen, Bo; Georg, Birgitte; Fahrenkrug, Jan

    1997-01-01

    Vasoactive intestinal polypeptide, muscarinic receptor, neuroblastoma cell, mRNA, gene expression, peptide processing......Vasoactive intestinal polypeptide, muscarinic receptor, neuroblastoma cell, mRNA, gene expression, peptide processing...

  17. Enhanced gene ranking approaches using modified trace ratio algorithm for gene expression data

    Directory of Open Access Journals (Sweden)

    Shruti Mishra

    Full Text Available Microarray technology enables the understanding and investigation of gene expression levels by analyzing high dimensional datasets that contain few samples. Over time, microarray expression data have been collected for studying the underlying biological mechanisms of disease. One such application for understanding the mechanism is by constructing a gene regulatory network (GRN. One of the foremost key criteria for GRN discovery is gene selection. Choosing a generous set of genes for the structure of the network is highly desirable. For this role, two suitable methods were proposed for selection of appropriate genes. The first approach comprises a gene selection method called Information gain, where the dataset is reformed and fused with another distinct algorithm called Trace Ratio (TR. Our second method is the implementation of our projected modified TR algorithm, where the scoring base for finding weight matrices has been re-designed. Both the methods' efficiency was shown with different classifiers that include variants of the Artificial Neural Network classifier, such as Resilient Propagation, Quick Propagation, Back Propagation, Manhattan Propagation and Radial Basis Function Neural Network and also the Support Vector Machine (SVM classifier. In the study, it was confirmed that both of the proposed methods worked well and offered high accuracy with a lesser number of iterations as compared to the original Trace Ratio algorithm. Keywords: Gene regulatory network, Gene selection, Information gain, Trace ratio, Canonical correlation analysis, Classification

  18. Nuclear AXIN2 represses MYC gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Rennoll, Sherri A.; Konsavage, Wesley M.; Yochum, Gregory S., E-mail: gsy3@psu.edu

    2014-01-03

    Highlights: •AXIN2 localizes to cytoplasmic and nuclear compartments in colorectal cancer cells. •Nuclear AXIN2 represses the activity of Wnt-responsive luciferase reporters. •β-Catenin bridges AXIN2 to TCF transcription factors. •AXIN2 binds the MYC promoter and represses MYC gene expression. -- Abstract: The β-catenin transcriptional coactivator is the key mediator of the canonical Wnt signaling pathway. In the absence of Wnt, β-catenin associates with a cytosolic and multi-protein destruction complex where it is phosphorylated and targeted for proteasomal degradation. In the presence of Wnt, the destruction complex is inactivated and β-catenin translocates into the nucleus. In the nucleus, β-catenin binds T-cell factor (TCF) transcription factors to activate expression of c-MYC (MYC) and Axis inhibition protein 2 (AXIN2). AXIN2 is a member of the destruction complex and, thus, serves in a negative feedback loop to control Wnt/β-catenin signaling. AXIN2 is also present in the nucleus, but its function within this compartment is unknown. Here, we demonstrate that AXIN2 localizes to the nuclei of epithelial cells within normal and colonic tumor tissues as well as colorectal cancer cell lines. In the nucleus, AXIN2 represses expression of Wnt/β-catenin-responsive luciferase reporters and forms a complex with β-catenin and TCF. We demonstrate that AXIN2 co-occupies β-catenin/TCF complexes at the MYC promoter region. When constitutively localized to the nucleus, AXIN2 alters the chromatin structure at the MYC promoter and directly represses MYC gene expression. These findings suggest that nuclear AXIN2 functions as a rheostat to control MYC expression in response to Wnt/β-catenin signaling.

  19. Nuclear AXIN2 represses MYC gene expression

    International Nuclear Information System (INIS)

    Rennoll, Sherri A.; Konsavage, Wesley M.; Yochum, Gregory S.

    2014-01-01

    Highlights: •AXIN2 localizes to cytoplasmic and nuclear compartments in colorectal cancer cells. •Nuclear AXIN2 represses the activity of Wnt-responsive luciferase reporters. •β-Catenin bridges AXIN2 to TCF transcription factors. •AXIN2 binds the MYC promoter and represses MYC gene expression. -- Abstract: The β-catenin transcriptional coactivator is the key mediator of the canonical Wnt signaling pathway. In the absence of Wnt, β-catenin associates with a cytosolic and multi-protein destruction complex where it is phosphorylated and targeted for proteasomal degradation. In the presence of Wnt, the destruction complex is inactivated and β-catenin translocates into the nucleus. In the nucleus, β-catenin binds T-cell factor (TCF) transcription factors to activate expression of c-MYC (MYC) and Axis inhibition protein 2 (AXIN2). AXIN2 is a member of the destruction complex and, thus, serves in a negative feedback loop to control Wnt/β-catenin signaling. AXIN2 is also present in the nucleus, but its function within this compartment is unknown. Here, we demonstrate that AXIN2 localizes to the nuclei of epithelial cells within normal and colonic tumor tissues as well as colorectal cancer cell lines. In the nucleus, AXIN2 represses expression of Wnt/β-catenin-responsive luciferase reporters and forms a complex with β-catenin and TCF. We demonstrate that AXIN2 co-occupies β-catenin/TCF complexes at the MYC promoter region. When constitutively localized to the nucleus, AXIN2 alters the chromatin structure at the MYC promoter and directly represses MYC gene expression. These findings suggest that nuclear AXIN2 functions as a rheostat to control MYC expression in response to Wnt/β-catenin signaling

  20. Molecular mechanisms of curcumin action: gene expression.

    Science.gov (United States)

    Shishodia, Shishir

    2013-01-01

    Curcumin derived from the tropical plant Curcuma longa has a long history of use as a dietary agent, food preservative, and in traditional Asian medicine. It has been used for centuries to treat biliary disorders, anorexia, cough, diabetic wounds, hepatic disorders, rheumatism, and sinusitis. The preventive and therapeutic properties of curcumin are associated with its antioxidant, anti-inflammatory, and anticancer properties. Extensive research over several decades has attempted to identify the molecular mechanisms of curcumin action. Curcumin modulates numerous molecular targets by altering their gene expression, signaling pathways, or through direct interaction. Curcumin regulates the expression of inflammatory cytokines (e.g., TNF, IL-1), growth factors (e.g., VEGF, EGF, FGF), growth factor receptors (e.g., EGFR, HER-2, AR), enzymes (e.g., COX-2, LOX, MMP9, MAPK, mTOR, Akt), adhesion molecules (e.g., ELAM-1, ICAM-1, VCAM-1), apoptosis related proteins (e.g., Bcl-2, caspases, DR, Fas), and cell cycle proteins (e.g., cyclin D1). Curcumin modulates the activity of several transcription factors (e.g., NF-κB, AP-1, STAT) and their signaling pathways. Based on its ability to affect multiple targets, curcumin has the potential for the prevention and treatment of various diseases including cancers, arthritis, allergies, atherosclerosis, aging, neurodegenerative disease, hepatic disorders, obesity, diabetes, psoriasis, and autoimmune diseases. This review summarizes the molecular mechanisms of modulation of gene expression by curcumin. Copyright © 2012 International Union of Biochemistry and Molecular Biology, Inc.

  1. A polynomial time biclustering algorithm for finding approximate expression patterns in gene expression time series

    Directory of Open Access Journals (Sweden)

    Madeira Sara C

    2009-06-01

    the art methods that require exact matching of gene expression time series. Discussion The identification of co-regulated genes, involved in specific biological processes, remains one of the main avenues open to researchers studying gene regulatory networks. The ability of the proposed methodology to efficiently identify sets of genes with similar expression patterns is shown to be instrumental in the discovery of relevant biological phenomena, leading to more convincing evidence of specific regulatory mechanisms. Availability A prototype implementation of the algorithm coded in Java together with the dataset and examples used in the paper is available in http://kdbio.inesc-id.pt/software/e-ccc-biclustering.

  2. Expression of the histone chaperone SET/TAF-Iβ during the strobilation process of Mesocestoides corti (Platyhelminthes, Cestoda).

    Science.gov (United States)

    Costa, Caroline B; Monteiro, Karina M; Teichmann, Aline; da Silva, Edileuza D; Lorenzatto, Karina R; Cancela, Martín; Paes, Jéssica A; Benitz, André de N D; Castillo, Estela; Margis, Rogério; Zaha, Arnaldo; Ferreira, Henrique B

    2015-08-01

    The histone chaperone SET/TAF-Iβ is implicated in processes of chromatin remodelling and gene expression regulation. It has been associated with the control of developmental processes, but little is known about its function in helminth parasites. In Mesocestoides corti, a partial cDNA sequence related to SET/TAF-Iβ was isolated in a screening for genes differentially expressed in larvae (tetrathyridia) and adult worms. Here, the full-length coding sequence of the M. corti SET/TAF-Iβ gene was analysed and the encoded protein (McSET/TAF) was compared with orthologous sequences, showing that McSET/TAF can be regarded as a SET/TAF-Iβ family member, with a typical nucleosome-assembly protein (NAP) domain and an acidic tail. The expression patterns of the McSET/TAF gene and protein were investigated during the strobilation process by RT-qPCR, using a set of five reference genes, and by immunoblot and immunofluorescence, using monospecific polyclonal antibodies. A gradual increase in McSET/TAF transcripts and McSET/TAF protein was observed upon development induction by trypsin, demonstrating McSET/TAF differential expression during strobilation. These results provided the first evidence for the involvement of a protein from the NAP family of epigenetic effectors in the regulation of cestode development.

  3. Expression of SET Protein in the Ovaries of Patients with Polycystic Ovary Syndrome.

    Science.gov (United States)

    Boqun, Xu; Xiaonan, Dai; Yugui, Cui; Lingling, Gao; Xue, Dai; Gao, Chao; Feiyang, Diao; Jiayin, Liu; Gao, Li; Li, Mei; Zhang, Yuan; Ma, Xiang

    2013-01-01

    Background. We previously found that expression of SET gene was up-regulated in polycystic ovaries by using microarray. It suggested that SET may be an attractive candidate regulator involved in the pathophysiology of polycystic ovary syndrome (PCOS). In this study, expression and cellular localization of SET protein were investigated in human polycystic and normal ovaries. Method. Ovarian tissues, six normal ovaries and six polycystic ovaries, were collected during transsexual operation and surgical treatment with the signed consent form. The cellular localization of SET protein was observed by immunohistochemistry. The expression levels of SET protein were analyzed by Western Blot. Result. SET protein was expressed predominantly in the theca cells and oocytes of human ovarian follicles in both PCOS ovarian tissues and normal ovarian tissues. The level of SET protein expression in polycystic ovaries was triple higher than that in normal ovaries (P polycystic ovaries more than that in normal ovaries. Combined with its localization in theca cells, SET may participate in regulating ovarian androgen biosynthesis and the pathophysiology of hyperandrogenism in PCOS.

  4. Identification of heterogeneity among soft tissue sarcomas by gene expression profiles from different tumors

    Directory of Open Access Journals (Sweden)

    Skubitz Amy PN

    2008-05-01

    Full Text Available Abstract The heterogeneity that soft tissue sarcomas (STS exhibit in their clinical behavior, even within histological subtypes, complicates patient care. Histological appearance is determined by gene expression. Morphologic features are generally good predictors of biologic behavior, however, metastatic propensity, tumor growth, and response to chemotherapy may be determined by gene expression patterns that do not correlate well with morphology. One approach to identify heterogeneity is to search for genetic markers that correlate with differences in tumor behavior. Alternatively, subsets may be identified based on gene expression patterns alone, independent of knowledge of clinical outcome. We have reported gene expression patterns that distinguish two subgroups of clear cell renal carcinoma (ccRCC, and other gene expression patterns that distinguish heterogeneity of serous ovarian carcinoma (OVCA and aggressive fibromatosis (AF. In this study, gene expression in 53 samples of STS and AF [including 16 malignant fibrous histiocytoma (MFH, 9 leiomyosarcoma, 12 liposarcoma, 4 synovial sarcoma, and 12 samples of AF] was determined at Gene Logic Inc. (Gaithersburg, MD using Affymetrix GeneChip® U_133 arrays containing approximately 40,000 genes/ESTs. Gene expression analysis was performed with the Gene Logic Genesis Enterprise System® Software and Expressionist software. Hierarchical clustering of the STS using our three previously reported gene sets, each generated subgroups within the STS that for some subtypes correlated with histology, and also suggested the existence of subsets of MFH. All three gene sets also recognized the same two subsets of the fibromatosis samples that we had found in our earlier study of AF. These results suggest that these subgroups may have biological significance, and that these gene sets may be useful for sub-classification of STS. In addition, several genes that are targets of some anti-tumor drugs were found to

  5. Using interpolation to estimate system uncertainty in gene expression experiments.

    Directory of Open Access Journals (Sweden)

    Lee J Falin

    Full Text Available The widespread use of high-throughput experimental assays designed to measure the entire complement of a cell's genes or gene products has led to vast stores of data that are extremely plentiful in terms of the number of items they can measure in a single sample, yet often sparse in the number of samples per experiment due to their high cost. This often leads to datasets where the number of treatment levels or time points sampled is limited, or where there are very small numbers of technical and/or biological replicates. Here we introduce a novel algorithm to quantify the uncertainty in the unmeasured intervals between biological measurements taken across a set of quantitative treatments. The algorithm provides a probabilistic distribution of possible gene expression values within unmeasured intervals, based on a plausible biological constraint. We show how quantification of this uncertainty can be used to guide researchers in further data collection by identifying which samples would likely add the most information to the system under study. Although the context for developing the algorithm was gene expression measurements taken over a time series, the approach can be readily applied to any set of quantitative systems biology measurements taken following quantitative (i.e. non-categorical treatments. In principle, the method could also be applied to combinations of treatments, in which case it could greatly simplify the task of exploring the large combinatorial space of future possible measurements.

  6. The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

    Directory of Open Access Journals (Sweden)

    Jean-François Gout

    2010-05-01

    Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.

  7. Retrotransposons as regulators of gene expression.

    Science.gov (United States)

    Elbarbary, Reyad A; Lucas, Bronwyn A; Maquat, Lynne E

    2016-02-12

    Transposable elements (TEs) are both a boon and a bane to eukaryotic organisms, depending on where they integrate into the genome and how their sequences function once integrated. We focus on two types of TEs: long interspersed elements (LINEs) and short interspersed elements (SINEs). LINEs and SINEs are retrotransposons; that is, they transpose via an RNA intermediate. We discuss how LINEs and SINEs have expanded in eukaryotic genomes and contribute to genome evolution. An emerging body of evidence indicates that LINEs and SINEs function to regulate gene expression by affecting chromatin structure, gene transcription, pre-mRNA processing, or aspects of mRNA metabolism. We also describe how adenosine-to-inosine editing influences SINE function and how ongoing retrotransposition is countered by the body's defense mechanisms. Copyright © 2016, American Association for the Advancement of Science.

  8. A Classification Framework Applied to Cancer Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Hussein Hijazi

    2013-01-01

    Full Text Available Classification of cancer based on gene expression has provided insight into possible treatment strategies. Thus, developing machine learning methods that can successfully distinguish among cancer subtypes or normal versus cancer samples is important. This work discusses supervised learning techniques that have been employed to classify cancers. Furthermore, a two-step feature selection method based on an attribute estimation method (e.g., ReliefF and a genetic algorithm was employed to find a set of genes that can best differentiate between cancer subtypes or normal versus cancer samples. The application of different classification methods (e.g., decision tree, k-nearest neighbor, support vector machine (SVM, bagging, and random forest on 5 cancer datasets shows that no classification method universally outperforms all the others. However, k-nearest neighbor and linear SVM generally improve the classification performance over other classifiers. Finally, incorporating diverse types of genomic data (e.g., protein-protein interaction data and gene expression increase the prediction accuracy as compared to using gene expression alone.

  9. Differential expression of cell adhesion genes

    DEFF Research Database (Denmark)

    Stein, Wilfred D; Litman, Thomas; Fojo, Tito

    2005-01-01

    that compare cells grown in suspension to similar cells grown attached to one another as aggregates have suggested that it is adhesion to the extracellular matrix of the basal membrane that confers resistance to apoptosis and, hence, resistance to cytotoxins. The genes whose expression correlates with poor...... in cell adhesion and the cytoskeleton. If the proteins involved in tethering cells to the extracellular matrix are important in conferring drug resistance, it may be possible to improve chemotherapy by designing drugs that target these proteins....

  10. Network Completion for Static Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Natsu Nakajima

    2014-01-01

    Full Text Available We tackle the problem of completing and inferring genetic networks under stationary conditions from static data, where network completion is to make the minimum amount of modifications to an initial network so that the completed network is most consistent with the expression data in which addition of edges and deletion of edges are basic modification operations. For this problem, we present a new method for network completion using dynamic programming and least-squares fitting. This method can find an optimal solution in polynomial time if the maximum indegree of the network is bounded by a constant. We evaluate the effectiveness of our method through computational experiments using synthetic data. Furthermore, we demonstrate that our proposed method can distinguish the differences between two types of genetic networks under stationary conditions from lung cancer and normal gene expression data.

  11. Inferring gene expression dynamics via functional regression analysis

    Directory of Open Access Journals (Sweden)

    Leng Xiaoyan

    2008-01-01

    Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.

  12. Can subtle changes in gene expression be consistently detected with different microarray platforms?

    Directory of Open Access Journals (Sweden)

    Kuiper Rowan

    2008-03-01

    Full Text Available Abstract Background The comparability of gene expression data generated with different microarray platforms is still a matter of concern. Here we address the performance and the overlap in the detection of differentially expressed genes for five different microarray platforms in a challenging biological context where differences in gene expression are few and subtle. Results Gene expression profiles in the hippocampus of five wild-type and five transgenic δC-doublecortin-like kinase mice were evaluated with five microarray platforms: Applied Biosystems, Affymetrix, Agilent, Illumina, LGTC home-spotted arrays. Using a fixed false discovery rate of 10% we detected surprising differences between the number of differentially expressed genes per platform. Four genes were selected by ABI, 130 by Affymetrix, 3,051 by Agilent, 54 by Illumina, and 13 by LGTC. Two genes were found significantly differentially expressed by all platforms and the four genes identified by the ABI platform were found by at least three other platforms. Quantitative RT-PCR analysis confirmed 20 out of 28 of the genes detected by two or more platforms and 8 out of 15 of the genes detected by Agilent only. We observed improved correlations between platforms when ranking the genes based on the significance level than with a fixed statistical cut-off. We demonstrate significant overlap in the affected gene sets identified by the different platforms, although biological processes were represented by only partially overlapping sets of genes. Aberrances in GABA-ergic signalling in the transgenic mice were consistently found by all platforms. Conclusion The different microarray platforms give partially complementary views on biological processes affected. Our data indicate that when analyzing samples with only subtle differences in gene expression the use of two different platforms might be more attractive than increasing the number of replicates. Commercial two-color platforms seem to

  13. Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

    Science.gov (United States)

    Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

    2017-10-01

    During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.

  14. [Regulation of heat shock gene expression in response to stress].

    Science.gov (United States)

    Garbuz, D G

    2017-01-01

    Heat shock (HS) genes, or stress genes, code for a number of proteins that collectively form the most ancient and universal stress defense system. The system determines the cell capability of adaptation to various adverse factors and performs a variety of auxiliary functions in normal physiological conditions. Common stress factors, such as higher temperatures, hypoxia, heavy metals, and others, suppress transcription and translation for the majority of genes, while HS genes are upregulated. Transcription of HS genes is controlled by transcription factors of the HS factor (HSF) family. Certain HSFs are activated on exposure to higher temperatures or other adverse factors to ensure stress-induced HS gene expression, while other HSFs are specifically activated at particular developmental stages. The regulation of the main mammalian stress-inducible factor HSF1 and Drosophila melanogaster HSF includes many components, such as a variety of early warning signals indicative of abnormal cell activity (e.g., increases in intracellular ceramide, cytosolic calcium ions, or partly denatured proteins); protein kinases, which phosphorylate HSFs at various Ser residues; acetyltransferases; and regulatory proteins, such as SUMO and HSBP1. Transcription factors other than HSFs are also involved in activating HS gene transcription; the set includes D. melanogaster GAF, mammalian Sp1 and NF-Y, and other factors. Transcription of several stress genes coding for molecular chaperones of the glucose-regulated protein (GRP) family is predominantly regulated by another stress-detecting system, which is known as the unfolded protein response (UPR) system and is activated in response to massive protein misfolding in the endoplasmic reticulum and mitochondrial matrix. A translational fine tuning of HS protein expression occurs via changing the phosphorylation status of several proteins involved in translation initiation. In addition, specific signal sequences in the 5'-UTRs of some HS

  15. APPRIS 2017: principal isoforms for multiple gene sets

    Science.gov (United States)

    Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

    2018-01-01

    Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475

  16. Inferring time-varying network topologies from gene expression data.

    Science.gov (United States)

    Rao, Arvind; Hero, Alfred O; States, David J; Engel, James Douglas

    2007-01-01

    Most current methods for gene regulatory network identification lead to the inference of steady-state networks, that is, networks prevalent over all times, a hypothesis which has been challenged. There has been a need to infer and represent networks in a dynamic, that is, time-varying fashion, in order to account for different cellular states affecting the interactions amongst genes. In this work, we present an approach, regime-SSM, to understand gene regulatory networks within such a dynamic setting. The approach uses a clustering method based on these underlying dynamics, followed by system identification using a state-space model for each learnt cluster--to infer a network adjacency matrix. We finally indicate our results on the mouse embryonic kidney dataset as well as the T-cell activation-based expression dataset and demonstrate conformity with reported experimental evidence.

  17. Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko

    2015-12-23

    Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis-eQTLs. Expression

  18. Mining tissue specificity, gene connectivity and disease association to reveal a set of genes that modify the action of disease causing genes

    Directory of Open Access Journals (Sweden)

    Reverter Antonio

    2008-09-01

    Full Text Available Abstract Background The tissue specificity of gene expression has been linked to a number of significant outcomes including level of expression, and differential rates of polymorphism, evolution and disease association. Recent studies have also shown the importance of exploring differential gene connectivity and sequence conservation in the identification of disease-associated genes. However, no study relates gene interactions with tissue specificity and disease association. Methods We adopted an a priori approach making as few assumptions as possible to analyse the interplay among gene-gene interactions with tissue specificity and its subsequent likelihood of association with disease. We mined three large datasets comprising expression data drawn from massively parallel signature sequencing across 32 tissues, describing a set of 55,606 true positive interactions for 7,197 genes, and microarray expression results generated during the profiling of systemic inflammation, from which 126,543 interactions among 7,090 genes were reported. Results Amongst the myriad of complex relationships identified between expression, disease, connectivity and tissue specificity, some interesting patterns emerged. These include elevated rates of expression and network connectivity in housekeeping and disease-associated tissue-specific genes. We found that disease-associated genes are more likely to show tissue specific expression and most frequently interact with other disease genes. Using the thresholds defined in these observations, we develop a guilt-by-association algorithm and discover a group of 112 non-disease annotated genes that predominantly interact with disease-associated genes, impacting on disease outcomes. Conclusion We conclude that parameters such as tissue specificity and network connectivity can be used in combination to identify a group of genes, not previously confirmed as disease causing, that are involved in interactions with disease causing

  19. Interactive visualization of gene regulatory networks with associated gene expression time series data

    NARCIS (Netherlands)

    Westenberg, M.A.; Hijum, van S.A.F.T.; Lulko, A.T.; Kuipers, O.P.; Roerdink, J.B.T.M.; Linsen, L.; Hagen, H.; Hamann, B.

    2008-01-01

    We present GENeVis, an application to visualize gene expression time series data in a gene regulatory network context. This is a network of regulator proteins that regulate the expression of their respective target genes. The networks are represented as graphs, in which the nodes represent genes,

  20. Early gene expression divergence between allopatric populations of the house mouse (Mus musculus domesticus).

    Science.gov (United States)

    Bryk, Jarosław; Somel, Mehmet; Lorenc, Anna; Teschke, Meike

    2013-03-01

    Divergence of gene expression is known to contribute to the differentiation and separation of populations and species, although the dynamics of this process in early stages of population divergence remains unclear. We analyzed gene expression differences in three organs (brain, liver, and testis) between two natural populations of Mus musculus domesticus that have been separated for at most 3000 years. We used two different microarray platforms to corroborate the results at a large scale and identified hundreds of genes with significant expression differences between the populations. We find that although the three tissues have similar number of differentially expressed genes, brain and liver have more tissue-specific genes than testis. Most genes show changes in a single tissue only, even when expressed in all tissues, supporting the notion that tissue-specific enhancers act as separable targets of evolution. In terms of functional categories, in brain and to a smaller extent in liver, we find transcription factors and their targets to be particularly variable between populations, similar to previous findings in primates. Testis, however, has a different set of differently expressed genes, both with respect to functional categories and overall correlation with the other tissues, the latter indicating that gene expression divergence of potential importance might be present in other datasets where no differences in fraction of differentially expressed genes were reported. Our results show that a significant amount of gene expression divergence quickly accumulates between allopatric populations.

  1. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of

  2. Positive selection on gene expression in the human brain

    DEFF Research Database (Denmark)

    Khaitovich, Philipp; Tang, Kun; Franz, Henriette

    2006-01-01

    Recent work has shown that the expression levels of genes transcribed in the brains of humans and chimpanzees have changed less than those of genes transcribed in other tissues [1] . However, when gene expression changes are mapped onto the evolutionary lineage in which they occurred, the brain...... shows more changes than other tissues in the human lineage compared to the chimpanzee lineage [1] , [2] and [3] . There are two possible explanations for this: either positive selection drove more gene expression changes to fixation in the human brain than in the chimpanzee brain, or genes expressed...... in the brain experienced less purifying selection in humans than in chimpanzees, i.e. gene expression in the human brain is functionally less constrained. The first scenario would be supported if genes that changed their expression in the brain in the human lineage showed more selective sweeps than other genes...

  3. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  4. SPECT imaging of cardiac reporter gene expression in living rabbits

    International Nuclear Information System (INIS)

    Liu Ying; Lan Xiaoli; Zhang Liang; Wu Tao; Jiang Rifeng; Zhang Yongxue

    2009-01-01

    This work is to demonstrate feasibility of imaging the expression of herpes simplex virus 1-thymidine kinase (HSVI-tk) reporter gene in rabbits myocardium by using the reporter probe 131 I-2'-fluoro-2'-deoxy-l-β-D-arabinofuranosyl-5-iodouracil ( 131 I-FIAU) and SPECT. Rabbits of the study group received intramyocardial injection of Ad5-tk and control group received aseptic saline injection. Two sets of experiments were performed on the study group. Rabbits of the 1st set were injected with 131 I-FIAU 600 μCi at Day 2 after intramyocardial transfection of Ad5-tk in 1xl0 9 , 5x10 8 , 1x10 8 , 5x10 7 and 1x10 7 pfu, and heart SPECT imaging was done at different hours. Rabbits of the 2nd were transferred various titers of Ad5-tk (1x10 9 , 5x10 8 , 1x10 8 , 5x10 7 , 1x10 7 pfu) to determine the threshold and optimal viral titer needed for detection of gene expression. Two days later, 131 I-FIAU was injected and heart SPECT imaging was performed at 6, 24 and 48 h, before killing them for gamma counting of the hearts. Reverse transcription-polymerase chain reaction (RT-PCR) was used to verify the transferred HSVI-tk gene expression. Semi-quantitative analysis derived of region of interest (ROI) of SPECT images and RT-PCR images was performed and the relationship of SPECT images with ex vivo gamma counting and mRNA level were evaluated. SPECT images conformed 131 I-FIAU accumulation in rabbits injected with Ad5-tk in the anterolateral wall. The optimal images quality was obtained at 24-48 h for different viral titers. The highest radioactivity in the focal myocardium was seen at 6 h, and then declined with time. The threshold was 5x10 7 pfu of virus titer. The result could be set better in 1-5x10 8 pfu by SPECT analysis and gamma counting. ROI-derived semi-quantitative study on SPECT images correlated well with ex vivo gamma counting and mRNA levels from RT-PCR analysis. The HSVI-tk/ 131 I-FIAU reporter gene/reporter probe system is feasible for cardiac SPECT reporter

  5. Zfp206 regulates ES cell gene expression and differentiation.

    Science.gov (United States)

    Zhang, Wen; Walker, Emily; Tamplin, Owen J; Rossant, Janet; Stanford, William L; Hughes, Timothy R

    2006-01-01

    Understanding transcriptional regulation in early developmental stages is fundamental to understanding mammalian development and embryonic stem (ES) cell properties. Expression surveys suggest that the putative SCAN-Zinc finger transcription factor Zfp206 is expressed specifically in ES cells [Zhang,W., Morris,Q.D., Chang,R., Shai,O., Bakowski,M.A., Mitsakakis,N., Mohammad,N., Robinson,M.D., Zirngibl,R., Somogyi,E. et al., (2004) J. Biol., 3, 21; Brandenberger,R., Wei,H., Zhang,S., Lei,S., Murage,J., Fisk,G.J., Li,Y., Xu,C., Fang,R., Guegler,K. et al., (2004) Nat. Biotechnol., 22, 707-716]. Here, we confirm this observation, and we show that ZFP206 expression decreases rapidly upon differentiation of cultured mouse ES cells, and during development of mouse embryos. We find that there are at least six isoforms of the ZFP206 transcript, the longest being predominant. Overexpression and depletion experiments show that Zfp206 promotes formation of undifferentiated ES cell clones, and positively regulates abundance of a very small set of transcripts whose expression is also specific to ES cells and the two- to four-cell stages of preimplantation embryos. This set includes members of the Zscan4, Thoc4, Tcstv1 and eIF-1A gene families, none of which have been functionally characterized in vivo but whose members include apparent transcription factors, RNA-binding proteins and translation factors. Together, these data indicate that Zfp206 is a regulator of ES cell differentiation that controls a set of genes expressed very early in development, most of which themselves appear to be regulators.

  6. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  7. Sex-based differences in gene expression in hippocampus following postnatal lead exposure

    International Nuclear Information System (INIS)

    Schneider, J.S.; Anderson, D.W.; Sonnenahalli, H.; Vadigepalli, R.

    2011-01-01

    The influence of sex as an effect modifier of childhood lead poisoning has received little systematic attention. Considering the paucity of information available concerning the interactive effects of lead and sex on the brain, the current study examined the interactive effects of lead and sex on gene expression patterns in the hippocampus, a structure involved in learning and memory. Male or female rats were fed either 1500 ppm lead-containing chow or control chow for 30 days beginning at weaning.Blood lead levels were 26.7 ± 2.1 μg/dl and 27.1 ± 1.7 μg/dl for females and males, respectively. The expression of 175 unique genes was differentially regulated between control male and female rats. A total of 167 unique genes were differentially expressed in response to lead in either males or females. Lead exposure had a significant effect without a significant difference between male and female responses in 77 of these genes. In another set of 71 genes, there were significant differences in male vs. female response. A third set of 30 genes was differentially expressed in opposite directions in males vs. females, with the majority of genes expressed at a lower level in females than in males. Highly differentially expressed genes in males and females following lead exposure were associated with diverse biological pathways and functions. These results show that a brief exposure to lead produced significant changes in expression of a variety of genes in the hippocampus and that the response of the brain to a given lead exposure may vary depending on sex. - Highlights: → Postnatal lead exposure has a significant effect on hippocampal gene expression patterns. → At least one set of genes was affected in opposite directions in males and females. → Differentially expressed genes were associated with diverse biological pathways.

  8. In-silico gene co-expression network analysis in Paracoccidioides brasiliensis with reference to haloacid dehalogenase superfamily hydrolase gene

    Directory of Open Access Journals (Sweden)

    Raghunath Satpathy

    2015-01-01

    Full Text Available Context: Paracoccidioides brasiliensis, a dimorphic fungus is the causative agent of paracoccidioidomycosis, a disease globally affecting millions of people. The haloacid dehalogenase (HAD superfamily hydrolases enzyme in the fungi, in particular, is known to be responsible in the pathogenesis by adhering to the tissue. Hence, identification of novel drug targets is essential. Aims: In-silico based identification of co-expressed genes along with HAD superfamily hydrolase in P. brasiliensis during the morphogenesis from mycelium to yeast to identify possible genes as drug targets. Materials and Methods: In total, four datasets were retrieved from the NCBI-gene expression omnibus (GEO database, each containing 4340 genes, followed by gene filtration expression of the data set. Further co-expression (CE study was performed individually and then a combination these genes were visualized in the Cytoscape 2. 8.3. Statistical Analysis Used: Mean and standard deviation value of the HAD superfamily hydrolase gene was obtained from the expression data and this value was subsequently used for the CE calculation purpose by selecting specific correlation power and filtering threshold. Results: The 23 genes that were thus obtained are common with respect to the HAD superfamily hydrolase gene. A significant network was selected from the Cytoscape network visualization that contains total 7 genes out of which 5 genes, which do not have significant protein hits, obtained from gene annotation of the expressed sequence tags by BLAST X. For all the protein PSI-BLAST was performed against human genome to find the homology. Conclusions: The gene co-expression network was obtained with respect to HAD superfamily dehalogenase gene in P. Brasiliensis.

  9. Analysis of multiplex gene expression maps obtained by voxelation.

    Science.gov (United States)

    An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

    2009-04-29

    Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental

  10. Analysis of multiplex gene expression maps obtained by voxelation

    Directory of Open Access Journals (Sweden)

    Smith Desmond J

    2009-04-01

    Full Text Available Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. Results To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in

  11. Gene-expression profiling after exposure to C-ion beams

    International Nuclear Information System (INIS)

    Saegusa, Kumiko; Furuno, Aki; Ishikawa, Kenichi; Ishikawa, Atsuko; Ohtsuka, Yoshimi; Kawai, Seiko; Imai, Takashi; Nojima, Kumie

    2005-01-01

    It is recognized that carbon-ion beam kills cancer cells more efficiently than X-ray. In this study we have compared cellular gene expression response after carbon-ion beam exposure with that after X-ray exposure. Gene expression profiles of cultured neonatal human dermal fibroblasts (NHDF) at 0, 1, 3, 6, 12, 18, and 24 hr after exposure to 0.1, 2 and 5 Gy of X-ray or carbon-ion beam were obtained using 22K oligonucleotide microarray. N-way ANOVA analysis of whole gene expression data sets selected 960 genes for carbon-ion beam and 977 genes for X-ray, respectively. Interestingly, majority of these genes (91% for carbon-ion beam and 88% for X-ray, respectively) were down regulated. The selected genes were further classified by their dose-dependence or time-dependence of gene expression change (fold change>1.5). It was revealed that genes involved in cell proliferation had tendency to show time-dependent up regulation by carbon-ion beam. Another N-way ANOVA analysis was performed to select 510 genes, and further selection was made to find 70 genes that showed radiation species-dependent gene expression change (fold change>1.25). These genes were then categorized by the K-Mean clustering method into 4 clusters. Each cluster showed tendency to contain genes involved in cell cycle regulation, cell death, responses to stress and metabolisms, respectively. (author)

  12. Finding gene regulatory network candidates using the gene expression knowledge base.

    Science.gov (United States)

    Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

    2014-12-10

    Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.

  13. Comparative gene expression profiles between heterotic and non-heterotic hybrids of tetraploid Medicago sativa

    Directory of Open Access Journals (Sweden)

    Nettleton Dan

    2009-08-01

    Full Text Available Abstract Background Heterosis, the superior performance of hybrids relative to parents, has clear agricultural value, but its genetic control is unknown. Our objective was to test the hypotheses that hybrids expressing heterosis for biomass yield would show more gene expression levels that were different from midparental values and outside the range of parental values than hybrids that do not exhibit heterosis. Results We tested these hypotheses in three Medicago sativa (alfalfa genotypes and their three hybrids, two of which expressed heterosis for biomass yield and a third that did not, using Affymetrix M. truncatula GeneChip arrays. Alfalfa hybridized to approximately 47% of the M. truncatula probe sets. Probe set signal intensities were analyzed using MicroArray Suite v.5.0 (MAS and robust multi-array average (RMA algorithms. Based on MAS analysis, the two heterotic hybrids performed similarly, with about 27% of genes showing differential expression among the parents and their hybrid compared to 12.5% for the non-heterotic hybrid. At a false discovery rate of 0.15, 4.7% of differentially expressed genes in hybrids (~300 genes showed nonadditive expression compared to only 0.5% (16 genes in the non-heterotic hybrid. Of the nonadditively expressed genes, approximately 50% showed expression levels that fell outside the parental range in heterotic hybrids, but only one of 16 showed a similar profile in the non-heterotic hybrid. Genes whose expression differed in the parents were three times more likely to show nonadditive expression than genes whose parental transcript levels were equal. Conclusion The higher proportions of probe sets with expression level that differed from the parental midparent value and that were more extreme than either parental value in the heterotic hybrids compared to a non-heterotic hybrid were also found using RMA. We conclude that nonadditive expression of transcript levels may contribute to heterosis for biomass

  14. A Fisheye Viewer for microarray-based gene expression data.

    Science.gov (United States)

    Wu, Min; Thao, Cheng; Mu, Xiangming; Munson, Ethan V

    2006-10-13

    Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface--an electronic table (E-table) that uses fisheye distortion technology. The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  15. A fisheye viewer for microarray-based gene expression data

    Directory of Open Access Journals (Sweden)

    Munson Ethan V

    2006-10-01

    Full Text Available Abstract Background Microarray has been widely used to measure the relative amounts of every mRNA transcript from the genome in a single scan. Biologists have been accustomed to reading their experimental data directly from tables. However, microarray data are quite large and are stored in a series of files in a machine-readable format, so direct reading of the full data set is not feasible. The challenge is to design a user interface that allows biologists to usefully view large tables of raw microarray-based gene expression data. This paper presents one such interface – an electronic table (E-table that uses fisheye distortion technology. Results The Fisheye Viewer for microarray-based gene expression data has been successfully developed to view MIAME data stored in the MAGE-ML format. The viewer can be downloaded from the project web site http://polaris.imt.uwm.edu:7777/fisheye/. The fisheye viewer was implemented in Java so that it could run on multiple platforms. We implemented the E-table by adapting JTable, a default table implementation in the Java Swing user interface library. Fisheye views use variable magnification to balance magnification for easy viewing and compression for maximizing the amount of data on the screen. Conclusion This Fisheye Viewer is a lightweight but useful tool for biologists to quickly overview the raw microarray-based gene expression data in an E-table.

  16. Gene Expression Profiling in Fish Toxicology: A Review.

    Science.gov (United States)

    Kumar, Girish; Denslow, Nancy D

    In this review, we present an overview of transcriptomic responses to chemical exposures in a variety of fish species. We have discussed the use of several molecular approaches such as northern blotting, differential display reverse transcription-polymerase chain reaction (DDRT-PCR), suppression subtractive hybridization (SSH), real time quantitative PCR (RT-qPCR), microarrays, and next-generation sequencing (NGS) for measuring gene expression. These techniques have been mainly used to measure the toxic effects of single compounds or simple mixtures in laboratory conditions. In addition, only few studies have been conducted to examine the biological significance of differentially expressed gene sets following chemical exposure. Therefore, future studies should focus more under field conditions using a multidisciplinary approach (genomics, proteomics and metabolomics) to understand the synergetic effects of multiple environmental stressors and to determine the functional significance of differentially expressed genes. Nevertheless, recent developments in NGS technologies and decreasing costs of sequencing holds the promise to uncover the complexity of anthropogenic impacts and biological effects in wild fish populations.

  17. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  18. Codon usage and amino acid usage influence genes expression level.

    Science.gov (United States)

    Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

    2018-02-01

    Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.

  19. Gene expression of circulating tumour cells in breast cancer patients

    Directory of Open Access Journals (Sweden)

    Bölke E

    2009-09-01

    Full Text Available Abstract Background The diagnostic tools to predict the prognosis in patients suffering from breast cancer (BC need further improvements. New technological achievements like the gene profiling of circulating tumour cells (CTC could help identify new prognostic markers in the clinical setting. Furthermore, gene expression patterns of CTC might provide important informations on the mechanisms of tumour cell metastasation. Materials and methods We performed realtime-PCR and multiplex-PCR analyses following immunomagnetic separation of CTC. Peripheral blood (PB samples of 63 patients with breast cancer of various stages were analyzed and compared to a control group of 14 healthy individuals. After reverse-transcription, we performed multiplex PCR using primers for the genes ga733.3, muc-1 and c-erbB2. Mammaglobin1, spdef and c-erbB2 were analyzed applying realtime-PCR. Results ga733.2 overexpression was found in 12.7% of breast cancer cases, muc-1 in 15.9%, mgb1 in 9.1% and spdef in 12.1%. In this study, c-erbB2 did not show any significant correlation to BC, possibly due to a highly ambient expression. Besides single gene analyses, gene profiles were additionally evaluated. Highly significant correlations to BC were found in single gene analyses of ga733.2 and muc-1 and in gene profile analyses of ga733.3*muc-1 and GA7 ga733.3*muc-1*mgb1*spdef. Conclusion Our study reveals that the single genes ga733.3, muc-1 and the gene profiles ga733.3*muc-1 and ga733.3*3muc-1*mgb1*spdef can serve as markers for the detection of CTC in BC. The multigene analyses found highly positive levels in BC patients. Our study indicates that not single gene analyses but subtle patterns of multiple genes lead to rising accuracy and low loss of specificity in detection of breast cancer cases.

  20. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  1. Early pregnancy peripheral blood gene expression and risk of preterm delivery: a nested case control study

    Directory of Open Access Journals (Sweden)

    Muhie Seid Y

    2009-12-01

    Full Text Available Abstract Background Preterm delivery (PTD is a significant public health problem associated with greater risk of mortality and morbidity in infants and mothers. Pathophysiologic processes that may lead to PTD start early in pregnancy. We investigated early pregnancy peripheral blood global gene expression and PTD risk. Methods As part of a prospective study, ribonucleic acid was extracted from blood samples (collected at 16 weeks gestational age from 14 women who had PTD (cases and 16 women who delivered at term (controls. Gene expressions were measured using the GeneChip® Human Genome U133 Plus 2.0 Array. Student's T-test and fold change analysis were used to identify differentially expressed genes. We used hierarchical clustering and principle components analysis to characterize signature gene expression patterns among cases and controls. Pathway and promoter sequence analyses were used to investigate functions and functional relationships as well as regulatory regions of differentially expressed genes. Results A total of 209 genes, including potential candidate genes (e.g. PTGDS, prostaglandin D2 synthase 21 kDa, were differentially expressed. A set of these genes achieved accurate pre-diagnostic separation of cases and controls. These genes participate in functions related to immune system and inflammation, organ development, metabolism (lipid, carbohydrate and amino acid and cell signaling. Binding sites of putative transcription factors such as EGR1 (early growth response 1, TFAP2A (transcription factor AP2A, Sp1 (specificity protein 1 and Sp3 (specificity protein 3 were over represented in promoter regions of differentially expressed genes. Real-time PCR confirmed microarray expression measurements of selected genes. Conclusions PTD is associated with maternal early pregnancy peripheral blood gene expression changes. Maternal early pregnancy peripheral blood gene expression patterns may be useful for better understanding of PTD

  2. Understanding gene expression in coronary artery disease through ...

    Indian Academy of Sciences (India)

    Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. Prathima ... Table 2. Differentially expressed genes in CAD compared to age and gender matched controls. .... Regulation of nuclear pre-mRNA domain containing 1A.

  3. Peak flood estimation using gene expression programming

    Science.gov (United States)

    Zorn, Conrad R.; Shamseldin, Asaad Y.

    2015-12-01

    As a case study for the Auckland Region of New Zealand, this paper investigates the potential use of gene-expression programming (GEP) in predicting specific return period events in comparison to the established and widely used Regional Flood Estimation (RFE) method. Initially calibrated to 14 gauged sites, the GEP derived model was further validated to 10 and 100 year flood events with a relative errors of 29% and 18%, respectively. This is compared to the RFE method providing 48% and 44% errors for the same flood events. While the effectiveness of GEP in predicting specific return period events is made apparent, it is argued that the derived equations should be used in conjunction with those existing methodologies rather than as a replacement.

  4. Some statistical properties of gene expression clustering for array data

    DEFF Research Database (Denmark)

    Abreu, G C G; Pinheiro, A; Drummond, R D

    2010-01-01

    DNA array data without a corresponding statistical error measure. We propose an easy-to-implement and simple-to-use technique that uses bootstrap re-sampling to evaluate the statistical error of the nodes provided by SOM-based clustering. Comparisons between SOM and parametric clustering are presented...... for simulated as well as for two real data sets. We also implement a bootstrap-based pre-processing procedure for SOM, that improves the false discovery ratio of differentially expressed genes. Code in Matlab is freely available, as well as some supplementary material, at the following address: https...

  5. Expression regulation of design process gene in product design

    DEFF Research Database (Denmark)

    Li, Bo; Fang, Lusheng; Li, Bo

    2011-01-01

    To improve the design process efficiency, this paper proposes the principle and methodology that design process gene controls the characteristics of design process under the framework of design process reuse and optimization based on design process gene. First, the concept of design process gene...... is proposed and analyzed, as well as its three categories i.e., the operator gene, the structural gene and the regulator gene. Second, the trigger mechanism that design objectives and constraints trigger the operator gene is constructed. Third, the expression principle of structural gene is analyzed...... with the example of design management gene. Last, the regulation mode that the regulator gene regulates the expression of the structural gene is established and it is illustrated by taking the design process management gene as an example. © (2011) Trans Tech Publications....

  6. Dissecting specific and global transcriptional regulation of bacterial gene expression

    NARCIS (Netherlands)

    Gerosa, Luca; Kochanowski, Karl; Heinemann, Matthias; Sauer, Uwe

    Gene expression is regulated by specific transcriptional circuits but also by the global expression machinery as a function of growth. Simultaneous specific and global regulation thus constitutes an additional-but often neglected-layer of complexity in gene expression. Here, we develop an

  7. An extended Kalman filtering approach to modeling nonlinear dynamic gene regulatory networks via short gene expression time series.

    Science.gov (United States)

    Wang, Zidong; Liu, Xiaohui; Liu, Yurong; Liang, Jinling; Vinciotti, Veronica

    2009-01-01

    In this paper, the extended Kalman filter (EKF) algorithm is applied to model the gene regulatory network from gene time series data. The gene regulatory network is considered as a nonlinear dynamic stochastic model that consists of the gene measurement equation and the gene regulation equation. After specifying the model structure, we apply the EKF algorithm for identifying both the model parameters and the actual value of gene expression levels. It is shown that the EKF algorithm is an online estimation algorithm that can identify a large number of parameters (including parameters of nonlinear functions) through iterative procedure by using a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from the viewpoint of bioinformatics.

  8. Gene expression of the mismatch repair gene MSH2 in primary colorectal cancer

    DEFF Research Database (Denmark)

    Jensen, Lars Henrik; Kuramochi, Hidekazu; Crüger, Dorthe Gylling

    2011-01-01

    promoter was only detected in 14 samples and only at a low level with no correlation to gene expression. MSH2 gene expression was not a prognostic factor for overall survival in univariate or multivariate analysis. The gene expression of MSH2 is a potential quantitative marker ready for further clinical...

  9. Integrative analysis of genome-wide gene copy number changes and gene expression in non-small cell lung cancer.

    Directory of Open Access Journals (Sweden)

    Verena Jabs

    Full Text Available Non-small cell lung cancer (NSCLC represents a genomically unstable cancer type with extensive copy number aberrations. The relationship of gene copy number alterations and subsequent mRNA levels has only fragmentarily been described. The aim of this study was to conduct a genome-wide analysis of gene copy number gains and corresponding gene expression levels in a clinically well annotated NSCLC patient cohort (n = 190 and their association with survival. While more than half of all analyzed gene copy number-gene expression pairs showed statistically significant correlations (10,296 of 18,756 genes, high correlations, with a correlation coefficient >0.7, were obtained only in a subset of 301 genes (1.6%, including KRAS, EGFR and MDM2. Higher correlation coefficients were associated with higher copy number and expression levels. Strong correlations were frequently based on few tumors with high copy number gains and correspondingly increased mRNA expression. Among the highly correlating genes, GO groups associated with posttranslational protein modifications were particularly frequent, including ubiquitination and neddylation. In a meta-analysis including 1,779 patients we found that survival associated genes were overrepresented among highly correlating genes (61 of the 301 highly correlating genes, FDR adjusted p<0.05. Among them are the chaperone CCT2, the core complex protein NUP107 and the ubiquitination and neddylation associated protein CAND1. In conclusion, in a comprehensive analysis we described a distinct set of highly correlating genes. These genes were found to be overrepresented among survival-associated genes based on gene expression in a large collection of publicly available datasets.

  10. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  11. Empirical validation of the S-Score algorithm in the analysis of gene expression data

    Directory of Open Access Journals (Sweden)

    Archer Kellie J

    2006-03-01

    Full Text Available Abstract Background Current methods of analyzing Affymetrix GeneChip® microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0 for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5. Results The S-score showed excellent sensitivity and specificity in detecting low-level gene expression changes. Rank ordering of S-Score values more accurately reflected known fold-change values compared to other algorithms. Conclusion The S-score method, utilizing probe level data directly, offers significant advantages over comparisons using only probe set expression summaries.

  12. Identification of valid reference genes for the normalization of RT qPCR gene expression data in human brain tissue

    Directory of Open Access Journals (Sweden)

    Ravid Rivka

    2008-05-01

    Full Text Available Abstract Background Studies of gene expression in post mortem human brain can contribute to understanding of the pathophysiology of neurodegenerative diseases, including Alzheimer's disease (AD, Parkinson's disease (PD and dementia with Lewy bodies (DLB. Quantitative real-time PCR (RT qPCR is often used to analyse gene expression. The validity of results obtained using RT qPCR is reliant on accurate data normalization. Reference genes are generally used to normalize RT qPCR data. Given that expression of some commonly used reference genes is altered in certain conditions, this study aimed to establish which reference genes were stably expressed in post mortem brain tissue from individuals with AD, PD or DLB. Results The present study investigated the expression stability of 8 candidate reference genes, (ubiquitin C [UBC], tyrosine-3-monooxygenase [YWHAZ], RNA polymerase II polypeptide [RP II], hydroxymethylbilane synthase [HMBS], TATA box binding protein [TBP], β-2-microglobulin [B2M], glyceraldehyde-3-phosphate dehydrogenase [GAPDH], and succinate dehydrogenase complex-subunit A, [SDHA] in cerebellum and medial temporal gyrus of 6 AD, 6 PD, 6 DLB subjects, along with 5 matched controls using RT qPCR (TaqMan® Gene Expression Assays. Gene expression stability was analysed using geNorm to rank the candidate genes in order of decreasing stability in each disease group. The optimal number of genes recommended for accurate data normalization in each disease state was determined by pairwise variation analysis. Conclusion This study identified validated sets of mRNAs which would be appropriate for the normalization of RT qPCR data when studying gene expression in brain tissue of AD, PD, DLB and control subjects.

  13. CDX2 gene expression in acute lymphoblastic leukemia

    International Nuclear Information System (INIS)

    Arnaoaut, H.H.; Mokhtar, D.A.; Samy, R.M.; Omar, Sh.A.; Khames, S.A.

    2014-01-01

    CDX genes are classically known as regulators of axial elongation during early embryogenesis. An unsuspected role for CDX genes has been revealed during hematopoietic development. The CDX gene family member CDX2 belongs to the most frequent aberrantly expressed proto-oncogenes in human acute leukemias and is highly leukemogenic in experimental models. We used reversed transcriptase polymerase chain reaction (RT-PCR) to determine the expression level of CDX2 gene in 30 pediatric patients with acute lymphoblastic leukemia (ALL) at diagnosis and 30 healthy volunteers. ALL patients were followed up to detect minimal residual disease (MRD) on days 15 and 42 of induction. We found that CDX2 gene was expressed in 50% of patients and not expressed in controls. Associations between gene expression and different clinical and laboratory data of patients revealed no impact on different findings. With follow up, we could not confirm that CDX2 expression had a prognostic significance.

  14. A method to identify differential expression profiles of time-course gene data with Fourier transformation.

    Science.gov (United States)

    Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

    2013-10-18

    Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization.The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be

  15. MALDI-TOF mass spectrometry for quantitative gene expression analysis of acid responses in Staphylococcus aureus.

    Science.gov (United States)

    Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild

    2009-07-01

    Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.

  16. Developmentally regulated expression of reporter gene in adult ...

    Indian Academy of Sciences (India)

    pression of reporter gene in adult brain specific GAL4 enhancer traps of. Drosophila ... genes based on their expression pattern, thus enabling us to overcome the ... order association and storage centres of olfactory learning and memory, and ...

  17. GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

    Science.gov (United States)

    Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

    2018-01-01

    When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.

  18. Identification of a core set of rhizobial infection genes using data from single cell-types

    Directory of Open Access Journals (Sweden)

    Da-Song eChen

    2015-07-01

    Full Text Available Genome-wide expression studies on nodulation have varied in their scale from entire root systems to dissected nodules or root sections containing nodule primordia. More recently efforts have focused on developing methods for isolation of root hairs from infected plants and the application of laser-capture microdissection technology to nodules. Here we analyze two published data sets to identify a core set of infection genes that are expressed in the nodule and in root hairs during infection. Among the genes identified were those encoding phenylpropanoid biosynthesis enzymes including Chalcone-O-Methyltransferase which is required for the production of the potent Nod gene inducer 4’,4-dihydroxy-2-methoxychalcone. A promoter-GUS analysis in transgenic hairy roots for two genes encoding Chalcone-O-Methyltransferase isoforms revealed their expression in rhizobially infected root hairs and the nodule infection zone but not in the nitrogen fixation zone. We also describe a group of Rhizobially Induced Peroxidases whose expression overlaps with the production of superoxide in rhizobially infected root hairs and in nodules and roots. Finally, we identify a cohort of co-regulated transcription factors as candidate regulators of these processes.

  19. EXP-PAC: providing comparative analysis and storage of next generation gene expression data.

    Science.gov (United States)

    Church, Philip C; Goscinski, Andrzej; Lefèvre, Christophe

    2012-07-01

    Microarrays and more recently RNA sequencing has led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). Copyright © 2012 Elsevier Inc. All rights reserved.

  20. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  1. Analysis of multiplex gene expression maps obtained by voxelation

    OpenAIRE

    An, L; Xie, H; Chin, MH; Obradovic, Z; Smith, DJ; Megalooikonomou, V

    2009-01-01

    Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we presen...

  2. Host Gene Expression Analysis in Sri Lankan Melioidosis Patients

    Science.gov (United States)

    2017-06-19

    CCL5 Chemokine (C-C motif) ligand 5 /RANTES. IFNγ Interferon gamma TNFα Tumor necrosis factor alpha HMGB1 High mobility group box 1 protein /high...aim of this study was to analyze gene expression levels of human host factors in melioidosis patients and establish useful correlation with disease...PBMC’s) of study subjects. Gene expression profiles of 25 gene targets including 19 immune response genes and 6 epigenetic factors were analyzed by

  3. Mining Gene Regulatory Networks by Neural Modeling of Expression Time-Series.

    Science.gov (United States)

    Rubiolo, Mariano; Milone, Diego H; Stegmayer, Georgina

    2015-01-01

    Discovering gene regulatory networks from data is one of the most studied topics in recent years. Neural networks can be successfully used to infer an underlying gene network by modeling expression profiles as times series. This work proposes a novel method based on a pool of neural networks for obtaining a gene regulatory network from a gene expression dataset. They are used for modeling each possible interaction between pairs of genes in the dataset, and a set of mining rules is applied to accurately detect the subjacent relations among genes. The results obtained on artificial and real datasets confirm the method effectiveness for discovering regulatory networks from a proper modeling of the temporal dynamics of gene expression profiles.

  4. Spatial reconstruction of single-cell gene expression

    Science.gov (United States)

    Satija, Rahul; Farrell, Jeffrey A.; Gennert, David; Schier, Alexander F.; Regev, Aviv

    2015-01-01

    Spatial localization is a key determinant of cellular fate and behavior, but spatial RNA assays traditionally rely on staining for a limited number of RNA species. In contrast, single-cell RNA-seq allows for deep profiling of cellular gene expression, but established methods separate cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos, inferring a transcriptome-wide map of spatial patterning. We confirmed Seurat’s accuracy using several experimental approaches, and used it to identify a set of archetypal expression patterns and spatial markers. Additionally, Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems. PMID:25867923

  5. Predictive modelling of gene expression from transcriptional regulatory elements.

    Science.gov (United States)

    Budden, David M; Hurley, Daniel G; Crampin, Edmund J

    2015-07-01

    Predictive modelling of gene expression provides a powerful framework for exploring the regulatory logic underpinning transcriptional regulation. Recent studies have demonstrated the utility of such models in identifying dysregulation of gene and miRNA expression associated with abnormal patterns of transcription factor (TF) binding or nucleosomal histone modifications (HMs). Despite the growing popularity of such approaches, a comparative review of the various modelling algorithms and feature extraction methods is lacking. We define and compare three methods of quantifying pairwise gene-TF/HM interactions and discuss their suitability for integrating the heterogeneous chromatin immunoprecipitation (ChIP)-seq binding patterns exhibited by TFs and HMs. We then construct log-linear and ϵ-support vector regression models from various mouse embryonic stem cell (mESC) and human lymphoblastoid (GM12878) data sets, considering both ChIP-seq- and position weight matrix- (PWM)-derived in silico TF-binding. The two algorithms are evaluated both in terms of their modelling prediction accuracy and ability to identify the established regulatory roles of individual TFs and HMs. Our results demonstrate that TF-binding and HMs are highly predictive of gene expression as measured by mRNA transcript abundance, irrespective of algorithm or cell type selection and considering both ChIP-seq and PWM-derived TF-binding. As we encourage other researchers to explore and develop these results, our framework is implemented using open-source software and made available as a preconfigured bootable virtual environment. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  6. Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

    Science.gov (United States)

    Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

    2009-01-01

    Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary

  7. Heterologous gene expression in filamentous fungi.

    Science.gov (United States)

    Su, Xiaoyun; Schmitz, George; Zhang, Meiling; Mackie, Roderick I; Cann, Isaac K O

    2012-01-01

    Filamentous fungi are critical to production of many commercial enzymes and organic compounds. Fungal-based systems have several advantages over bacterial-based systems for protein production because high-level secretion of enzymes is a common trait of their decomposer lifestyle. Furthermore, in the large-scale production of recombinant proteins of eukaryotic origin, the filamentous fungi become the vehicle of choice due to critical processes shared in gene expression with other eukaryotic organisms. The complexity and relative dearth of understanding of the physiology of filamentous fungi, compared to bacteria, have hindered rapid development of these organisms as highly efficient factories for the production of heterologous proteins. In this review, we highlight several of the known benefits and challenges in using filamentous fungi (particularly Aspergillus spp., Trichoderma reesei, and Neurospora crassa) for the production of proteins, especially heterologous, nonfungal enzymes. We review various techniques commonly employed in recombinant protein production in the filamentous fungi, including transformation methods, selection of gene regulatory elements such as promoters, protein secretion factors such as the signal peptide, and optimization of coding sequence. We provide insights into current models of host genomic defenses such as repeat-induced point mutation and quelling. Furthermore, we examine the regulatory effects of transcript sequences, including introns and untranslated regions, pre-mRNA (messenger RNA) processing, transcript transport, and mRNA stability. We anticipate that this review will become a resource for researchers who aim at advancing the use of these fascinating organisms as protein production factories, for both academic and industrial purposes, and also for scientists with general interest in the biology of the filamentous fungi. Copyright © 2012 Elsevier Inc. All rights reserved.

  8. Rhythmic diel pattern of gene expression in juvenile maize leaf.

    Directory of Open Access Journals (Sweden)

    Maciej Jończyk

    Full Text Available BACKGROUND: Numerous biochemical and physiological parameters of living organisms follow a circadian rhythm. Although such rhythmic behavior is particularly pronounced in plants, which are strictly dependent on the daily photoperiod, data on the molecular aspects of the diurnal cycle in plants is scarce and mostly concerns the model species Arabidopsis thaliana. Here we studied the leaf transcriptome in seedlings of maize, an important C4 crop only distantly related to A. thaliana, throughout a cycle of 10 h darkness and 14 h light to look for rhythmic patterns of gene expression. RESULTS: Using DNA microarrays comprising ca. 43,000 maize-specific probes we found that ca. 12% of all genes showed clear-cut diel rhythms of expression. Cluster analysis identified 35 groups containing from four to ca. 1,000 genes, each comprising genes of similar expression patterns. Perhaps unexpectedly, the most pronounced and most common (concerning the highest number of genes expression maxima were observed towards and during the dark phase. Using Gene Ontology classification several meaningful functional associations were found among genes showing similar diel expression patterns, including massive induction of expression of genes related to gene expression, translation, protein modification and folding at dusk and night. Additionally, we found a clear-cut tendency among genes belonging to individual clusters to share defined transcription factor-binding sequences. CONCLUSIONS: Co-expressed genes belonging to individual clusters are likely to be regulated by common mechanisms. The nocturnal phase of the diurnal cycle involves gross induction of fundamental biochemical processes and should be studied more thoroughly than was appreciated in most earlier physiological studies. Although some general mechanisms responsible for the diel regulation of gene expression might be shared among plants, details of the diurnal regulation of gene expression seem to differ

  9. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

    DEFF Research Database (Denmark)

    Manijak, Mieszko P.; Nielsen, Henrik Bjørn

    2011-01-01

    circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....

  10. Differentially expressed genes associated with dormancy or germination of Arabidopsis thaliana seeds

    NARCIS (Netherlands)

    Toorop, P.E.; Barroco, R.M.; Engler, G.; Groot, S.P.C.; Hilhorst, H.W.M.

    2005-01-01

    Differential display analysis using dormant and non-dormant Arabidopsis thaliana (L.) Heynh seeds resulted in a set of genes that were associated with either dormancy or germination. Expression of the germination-associated genes AtRPL36B and AtRPL27B, encoding two ribosomal proteins, was

  11. PRAME Gene Expression in Acute Leukemia and Its Clinical Significance

    International Nuclear Information System (INIS)

    Ding, Kai; Wang, Xiao-ming; Fu, Rong; Ruan, Er-bao; Liu, Hui; Shao, Zong-hong

    2012-01-01

    To investigate the expression of the preferentially expressed antigen of melanoma (PRAME) gene in acute leukemia and its clinical significance. The level of expressed PRAME mRNA in bone marrow mononuclear cells from 34 patients with acute leukemia (AL) and in 12 bone marrow samples from healthy volunteers was measured via RT-PCR. Correlation analyses between PRAME gene expression and the clinical characteristics (gender, age, white blood count, immunophenotype of leukemia, percentage of blast cells, and karyotype) of the patients were performed. The PRAME gene was expressed in 38.2% of all 34 patients, in 40.7% of the patients with acute myelogenous leukemia (AML, n=27), and in 28.6% of the patients with acute lymphoblastic leukemia (ALL, n=7), but was not expressed in the healthy volunteers. The difference in the expression levels between AML and ALL patients was statistically significant. The rate of gene expression was 80% in M 3 , 33.3% in M 2 , and 28.6% in M 5 . Gene expression was also found to be correlated with CD15 and CD33 expression and abnormal karyotype, but not with age, gender, white blood count or percentage of blast cells. The PRAME gene is highly expressed in acute leukemia and could be a useful marker to monitor minimal residual disease. This gene is also a candidate target for the immunotherapy of acute leukemia

  12. Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process

    International Nuclear Information System (INIS)

    Chandran, Uma R; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A

    2007-01-01

    Prostate cancer is characterized by heterogeneity in the clinical course that often does not correlate with morphologic features of the tumor. Metastasis reflects the most adverse outcome of prostate cancer, and to date there are no reliable morphologic features or serum biomarkers that can reliably predict which patients are at higher risk of developing metastatic disease. Understanding the differences in the biology of metastatic and organ confined primary tumors is essential for developing new prognostic markers and therapeutic targets. Using Affymetrix oligonucleotide arrays, we analyzed gene expression profiles of 24 androgen-ablation resistant metastatic samples obtained from 4 patients and a previously published dataset of 64 primary prostate tumor samples. Differential gene expression was analyzed after removing potentially uninformative stromal genes, addressing the differences in cellular content between primary and metastatic tumors. The metastatic samples are highly heterogenous in expression; however, differential expression analysis shows that 415 genes are upregulated and 364 genes are downregulated at least 2 fold in every patient with metastasis. The expression profile of metastatic samples reveals changes in expression of a unique set of genes representing both the androgen ablation related pathways and other metastasis related gene networks such as cell adhesion, bone remodelling and cell cycle. The differentially expressed genes include metabolic enzymes, transcription factors such as Forkhead Box M1 (FoxM1) and cell adhesion molecules such as Osteopontin (SPP1). We hypothesize that these genes have a role in the biology of metastatic disease and that they represent potential therapeutic targets for prostate cancer

  13. Gene expression patterns during the larval development of European sea bass (dicentrarchus labrax) by microarray analysis.

    Science.gov (United States)

    Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D

    2008-01-01

    During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.

  14. Complete gene expression profiling of Saccharopolyspora erythraea using GeneChip DNA microarrays

    Directory of Open Access Journals (Sweden)

    Bordoni Roberta

    2007-11-01

    Full Text Available Abstract Background The Saccharopolyspora erythraea genome sequence, recently published, presents considerable divergence from those of streptomycetes in gene organization and function, confirming the remarkable potential of S. erythraea for producing many other secondary metabolites in addition to erythromycin. In order to investigate, at whole transcriptome level, how S. erythraea genes are modulated, a DNA microarray was specifically designed and constructed on the S. erythraea strain NRRL 2338 genome sequence, and the expression profiles of 6494 ORFs were monitored during growth in complex liquid medium. Results The transcriptional analysis identified a set of 404 genes, whose transcriptional signals vary during growth and characterize three distinct phases: a rapid growth until 32 h (Phase A; a growth slowdown until 52 h (Phase B; and another rapid growth phase from 56 h to 72 h (Phase C before the cells enter the stationary phase. A non-parametric statistical method, that identifies chromosomal regions with transcriptional imbalances, determined regional organization of transcription along the chromosome, highlighting differences between core and non-core regions, and strand specific patterns of expression. Microarray data were used to characterize the temporal behaviour of major functional classes and of all the gene clusters for secondary metabolism. The results confirmed that the ery cluster is up-regulated during Phase A and identified six additional clusters (for terpenes and non-ribosomal peptides that are clearly regulated in later phases. Conclusion The use of a S. erythraea DNA microarray improved specificity and sensitivity of gene expression analysis, allowing a global and at the same time detailed picture of how S. erythraea genes are modulated. This work underlines the importance of using DNA microarrays, coupled with an exhaustive statistical and bioinformatic analysis of the results, to understand the transcriptional

  15. A pipeline to determine RT-QPCR control genes for evolutionary studies: application to primate gene expression across multiple tissues.

    Directory of Open Access Journals (Sweden)

    Olivier Fedrigo

    Full Text Available Because many species-specific phenotypic differences are assumed to be caused by differential regulation of gene expression, many recent investigations have focused on measuring transcript abundance. Despite the availability of high-throughput platforms, quantitative real-time polymerase chain reaction (RT-QPCR is often the method of choice because of its low cost and wider dynamic range. However, the accuracy of this technique heavily relies on the use of multiple valid control genes for normalization. We created a pipeline for choosing genes potentially useful as RT-QPCR control genes for measuring expression between human and chimpanzee samples across multiple tissues, using published microarrays and a measure of tissue-specificity. We identified 13 genes from the pipeline and from commonly used control genes: ACTB, USP49, ARGHGEF2, GSK3A, TBP, SDHA, EIF2B2, GPDH, YWHAZ, HPTR1, RPL13A, HMBS, and EEF2. We then tested these candidate genes and validated their expression stability across species. We established the rank order of the most preferable set of genes for single and combined tissues. Our results suggest that for at least three tissues (cerebral cortex, liver, and skeletal muscle, EIF2B2, EEF2, HMBS, and SDHA are useful genes for normalizing human and chimpanzee expression using RT-QPCR. Interestingly, other commonly used control genes, including TBP, GAPDH, and, especially ACTB do not perform as well. This pipeline could be easily adapted to other species for which expression data exist, providing taxonomically appropriate control genes for comparisons of gene expression among species.

  16. Gene structure and expression characteristic of a novel odorant receptor gene cluster in the parasitoid wasp Microplitis mediator (Hymenoptera: Braconidae).

    Science.gov (United States)

    Wang, S-N; Shan, S; Zheng, Y; Peng, Y; Lu, Z-Y; Yang, Y-Q; Li, R-J; Zhang, Y-J; Guo, Y-Y

    2017-08-01

    Odorant receptors (ORs) expressed in the antennae of parasitoid wasps are responsible for detection of various lipophilic airborne molecules. In the present study, 107 novel OR genes were identified from Microplitis mediator antennal transcriptome data. Phylogenetic analysis of the set of OR genes from M. mediator and Microplitis demolitor revealed that M. mediator OR (MmedOR) genes can be classified into different subfamilies, and the majority of MmedORs in each subfamily shared high sequence identities and clear orthologous relationships to M. demolitor ORs. Within a subfamily, six MmedOR genes, MmedOR98, 124, 125, 126, 131 and 155, shared a similar gene structure and were tightly linked in the genome. To evaluate whether the clustered MmedOR genes share common regulatory features, the transcription profile and expression characteristics of the six closely related OR genes were investigated in M. mediator. Rapid amplification of cDNA ends-PCR experiments revealed that the OR genes within the cluster were transcribed as single mRNAs, and a bicistronic mRNA for two adjacent genes (MmedOR124 and MmedOR98) was also detected in female antennae by reverse transcription PCR. In situ hybridization experiments indicated that each OR gene within the cluster was expressed in a different number of cells. Moreover, there was no co-expression of the two highly related OR genes, MmedOR124 and MmedOR98, which appeared to be individually expressed in a distinct population of neurons. Overall, there were distinct expression profiles of closely related MmedOR genes from the same cluster in M. mediator. These data provide a basic understanding of the olfactory coding in parasitoid wasps. © 2017 The Royal Entomological Society.

  17. Global gene expression analysis for evaluation and design of biomaterials

    Directory of Open Access Journals (Sweden)

    Nobutaka Hanagata, Taro Takemura and Takashi Minowa

    2010-01-01

    Full Text Available Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data.

  18. Global gene expression analysis for evaluation and design of biomaterials

    International Nuclear Information System (INIS)

    Hanagata, Nobutaka; Takemura, Taro; Minowa, Takashi

    2010-01-01

    Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data. (topical review)

  19. Nur77 coordinately regulates expression of genes linked to glucose metabolism in skeletal muscle.

    Science.gov (United States)

    Chao, Lily C; Zhang, Zidong; Pei, Liming; Saito, Tsugumichi; Tontonoz, Peter; Pilch, Paul F

    2007-09-01

    Innervation is important for normal metabolism in skeletal muscle, including insulin-sensitive glucose uptake. However, the transcription factors that transduce signals from the neuromuscular junction to the nucleus and affect changes in metabolic gene expression are not well defined. We demonstrate here that the orphan nuclear receptor Nur77 is a regulator of gene expression linked to glucose utilization in muscle. In vivo, Nur77 is preferentially expressed in glycolytic compared with oxidative muscle and is responsive to beta-adrenergic stimulation. Denervation of rat muscle compromises expression of Nur77 in parallel with that of numerous genes linked to glucose metabolism, including glucose transporter 4 and genes involved in glycolysis, glycogenolysis, and the glycerophosphate shuttle. Ectopic expression of Nur77, either in rat muscle or in C2C12 muscle cells, induces expression of a highly overlapping set of genes, including glucose transporter 4, muscle phosphofructokinase, and glycogen phosphorylase. Furthermore, selective knockdown of Nur77 in rat muscle by small hairpin RNA or genetic deletion of Nur77 in mice reduces the expression of a battery of genes involved in skeletal muscle glucose utilization in vivo. Finally, we show that Nur77 binds the promoter regions of multiple genes involved in glucose metabolism in muscle. These results identify Nur77 as a potential mediator of neuromuscular signaling in the control of metabolic gene expression.

  20. Robust assignment of cancer subtypes from expression data using a uni-variate gene expression average as classifier

    International Nuclear Information System (INIS)

    Lauss, Martin; Frigyesi, Attila; Ryden, Tobias; Höglund, Mattias

    2010-01-01

    Genome wide gene expression data is a rich source for the identification of gene signatures suitable for clinical purposes and a number of statistical algorithms have been described for both identification and evaluation of such signatures. Some employed algorithms are fairly complex and hence sensitive to over-fitting whereas others are more simple and straight forward. Here we present a new type of simple algorithm based on ROC analysis and the use of metagenes that we believe will be a good complement to existing algorithms. The basis for the proposed approach is the use of metagenes, instead of collections of individual genes, and a feature selection using AUC values obtained by ROC analysis. Each gene in a data set is assigned an AUC value relative to the tumor class under investigation and the genes are ranked according to these values. Metagenes are then formed by calculating the mean expression level for an increasing number of ranked genes, and the metagene expression value that optimally discriminates tumor classes in the training set is used for classification of new samples. The performance of the metagene is then evaluated using LOOCV and balanced accuracies. We show that the simple uni-variate gene expression average algorithm performs as well as several alternative algorithms such as discriminant analysis and the more complex approaches such as SVM and neural networks. The R package rocc is freely available at http://cran.r-project.org/web/packages/rocc/index.html

  1. Microarray analysis identifies a common set of cellular genes modulated by different HCV replicon clones

    Directory of Open Access Journals (Sweden)

    Gerosolimo Germano

    2008-06-01

    Full Text Available Abstract Background Hepatitis C virus (HCV RNA synthesis and protein expression affect cell homeostasis by modulation of gene expression. The impact of HCV replication on global cell transcription has not been fully evaluated. Thus, we analysed the expression profiles of different clones of human hepatoma-derived Huh-7 cells carrying a self-replicating HCV RNA which express all viral proteins (HCV replicon system. Results First, we compared the expression profile of HCV replicon clone 21-5 with both the Huh-7 parental cells and the 21-5 cured (21-5c cells. In these latter, the HCV RNA has been eliminated by IFN-α treatment. To confirm data, we also analyzed microarray results from both the 21-5 and two other HCV replicon clones, 22-6 and 21-7, compared to the Huh-7 cells. The study was carried out by using the Applied Biosystems (AB Human Genome Survey Microarray v1.0 which provides 31,700 probes that correspond to 27,868 human genes. Microarray analysis revealed a specific transcriptional program induced by HCV in replicon cells respect to both IFN-α-cured and Huh-7 cells. From the original datasets of differentially expressed genes, we selected by Venn diagrams a final list of 38 genes modulated by HCV in all clones. Most of the 38 genes have never been described before and showed high fold-change associated with significant p-value, strongly supporting data reliability. Classification of the 38 genes by Panther System identified functional categories that were significantly enriched in this gene set, such as histones and ribosomal proteins as well as extracellular matrix and intracellular protein traffic. The dataset also included new genes involved in lipid metabolism, extracellular matrix and cytoskeletal network, which may be critical for HCV replication and pathogenesis. Conclusion Our data provide a comprehensive analysis of alterations in gene expression induced by HCV replication and reveal modulation of new genes potentially useful

  2. Redox regulation of photosynthetic gene expression.

    Science.gov (United States)

    Queval, Guillaume; Foyer, Christine H

    2012-12-19

    Redox chemistry and redox regulation are central to the operation of photosynthesis and respiration. However, the roles of different oxidants and antioxidants in the regulation of photosynthetic or respiratory gene expression remain poorly understood. Leaf transcriptome profiles of a range of Arabidopsis thaliana genotypes that are deficient in either hydrogen peroxide processing enzymes or in low molecular weight antioxidant were therefore compared to determine how different antioxidant systems that process hydrogen peroxide influence transcripts encoding proteins targeted to the chloroplasts or mitochondria. Less than 10 per cent overlap was observed in the transcriptome patterns of leaves that are deficient in either photorespiratory (catalase (cat)2) or chloroplastic (thylakoid ascorbate peroxidase (tapx)) hydrogen peroxide processing. Transcripts encoding photosystem II (PSII) repair cycle components were lower in glutathione-deficient leaves, as were the thylakoid NAD(P)H (nicotinamide adenine dinucleotide (phosphate)) dehydrogenases (NDH) mRNAs. Some thylakoid NDH mRNAs were also less abundant in tAPX-deficient and ascorbate-deficient leaves. Transcripts encoding the external and internal respiratory NDHs were increased by low glutathione and low ascorbate. Regulation of transcripts encoding specific components of the photosynthetic and respiratory electron transport chains by hydrogen peroxide, ascorbate and glutathione may serve to balance non-cyclic and cyclic electron flow pathways in relation to oxidant production and reductant availability.

  3. Cell cycle gene expression under clinorotation

    Science.gov (United States)

    Artemenko, Olga

    2016-07-01

    Cyclins and cyclin-dependent kinase (CDK) are main regulators of the cell cycle of eukaryotes. It's assumes a significant change of their level in cells under microgravity conditions and by other physical factors actions. The clinorotation use enables to determine the influence of gravity on simulated events in the cell during the cell cycle - exit from the state of quiet stage and promotion presynthetic phase (G1) and DNA synthesis phase (S) of the cell cycle. For the clinorotation effect study on cell proliferation activity is the necessary studies of molecular mechanisms of cell cycle regulation and development of plants under altered gravity condition. The activity of cyclin D, which is responsible for the events of the cell cycle in presynthetic phase can be controlled by the action of endogenous as well as exogenous factors, but clinorotation is one of the factors that influence on genes expression that regulate the cell cycle.These data can be used as a model for further research of cyclin - CDK complex for study of molecular mechanisms regulation of growth and proliferation. In this investigation we tried to summarize and analyze known literature and own data we obtained relatively the main regulators of the cell cycle in altered gravity condition.

  4. Social Regulation of Gene Expression in Threespine Sticklebacks.

    Directory of Open Access Journals (Sweden)

    Anna K Greenwood

    Full Text Available Identifying genes that are differentially expressed in response to social interactions is informative for understanding the molecular basis of social behavior. To address this question, we described changes in gene expression as a result of differences in the extent of social interactions. We housed threespine stickleback (Gasterosteus aculeatus females in either group conditions or individually for one week, then measured levels of gene expression in three brain regions using RNA-sequencing. We found that numerous genes in the hindbrain/cerebellum had altered expression in response to group or individual housing. However, relatively few genes were differentially expressed in either the diencephalon or telencephalon. The list of genes upregulated in fish from social groups included many genes related to neural development and cell adhesion as well as genes with functions in sensory signaling, stress, and social and reproductive behavior. The list of genes expressed at higher levels in individually-housed fish included several genes previously identified as regulated by social interactions in other animals. The identified genes are interesting targets for future research on the molecular mechanisms of normal social interactions.

  5. TiGER: a database for tissue-specific gene expression and regulation.

    Science.gov (United States)

    Liu, Xiong; Yu, Xueping; Zack, Donald J; Zhu, Heng; Qian, Jiang

    2008-06-09

    Understanding how genes are expressed and regulated in different tissues is a fundamental and challenging question. However, most of currently available biological databases do not focus on tissue-specific gene regulation. The recent development of computational methods for tissue-specific combinational gene regulation, based on transcription factor binding sites, enables us to perform a large-scale analysis of tissue-specific gene regulation in human tissues. The results are stored in a web database called TiGER (Tissue-specific Gene Expression and Regulation). The database contains three types of data including tissue-specific gene expression profiles, combinatorial gene regulations, and cis-regulatory module (CRM) detections. At present the database contains expression profiles for 19,526 UniGene genes, combinatorial regulations for 7,341 transcription factor pairs and 6,232 putative CRMs for 2,130 RefSeq genes. We have developed and made publicly available a database, TiGER, which summarizes and provides large scale data sets for tissue-specific gene expression and regulation in a variety of human tissues. This resource is available at 1.

  6. TiGER: A database for tissue-specific gene expression and regulation

    Directory of Open Access Journals (Sweden)

    Zack Donald J

    2008-06-01

    Full Text Available Abstract Background Understanding how genes are expressed and regulated in different tissues is a fundamental and challenging question. However, most of currently available biological databases do not focus on tissue-specific gene regulation. Results The recent development of computational methods for tissue-specific combinational gene regulation, based on transcription factor binding sites, enables us to perform a large-scale analysis of tissue-specific gene regulation in human tissues. The results are stored in a web database called TiGER (Tissue-specific Gene Expression and Regulation. The database contains three types of data including tissue-specific gene expression profiles, combinatorial gene regulations, and cis-regulatory module (CRM detections. At present the database contains expression profiles for 19,526 UniGene genes, combinatorial regulations for 7,341 transcription factor pairs and 6,232 putative CRMs for 2,130 RefSeq genes. Conclusion We have developed and made publicly available a database, TiGER, which summarizes and provides large scale data sets for tissue-specific gene expression and regulation in a variety of human tissues. This resource is available at 1.

  7. Evaluation of candidate reference genes for gene expression normalization in Brassica juncea using real time quantitative RT-PCR.

    Directory of Open Access Journals (Sweden)

    Ruby Chandna

    Full Text Available The real time quantitative reverse transcription PCR (qRT-PCR is becoming increasingly important to gain insight into function of genes. Given the increased sensitivity, ease and reproducibility of qRT-PCR, the requirement of suitable reference genes for normalization has become important and stringent. It is now known that the expression of internal control genes in living organism vary considerably during developmental stages and under different experimental conditions. For economically important Brassica crops, only a couple of reference genes are reported till date. In this study, expression stability of 12 candidate reference genes including ACT2, ELFA, GAPDH, TUA, UBQ9 (traditional housekeeping genes, ACP, CAC, SNF, TIPS-41, TMD, TSB and ZNF (new candidate reference genes, in a diverse set of 49 tissue samples representing different developmental stages, stress and hormone treated conditions and cultivars of Brassica juncea has been validated. For the normalization of vegetative stages the ELFA, ACT2, CAC and TIPS-41 combination would be appropriate whereas TIPS-41 along with CAC would be suitable for normalization of reproductive stages. A combination of GAPDH, TUA, TIPS-41 and CAC were identified as the most suitable reference genes for total developmental stages. In various stress and hormone treated samples, UBQ9 and TIPS-41 had the most stable expression. Across five cultivars of B. juncea, the expression of CAC and TIPS-41 did not vary significantly and were identified as the most stably expressed reference genes. This study provides comprehensive information that the new reference genes selected herein performed better than the traditional housekeeping genes. The selection of most suitable reference genes depends on the experimental conditions, and is tissue and cultivar-specific. Further, to attain accuracy in the results more than one reference genes are necessary for normalization.

  8. The effects of microgravity on gene expression of Arabidopsis

    Science.gov (United States)

    Correll, Melanie; Stimpson, Alexander; Pereira, Rhea; Kiss, John Z.

    TROPI (for TROPIsms) consisted of a series of experiments on the International Space Station to study the interaction between phototropism and gravitropism. As part of TROPI, we received frozen Arabidopsis seedlings from the ISS on three shuttle missions (STS-116, STS-117 and STS-120). These seedlings are being used for gene expression studies. Unfortunately, the quality of RNA returned from the first return mission was poor while that from the second and third missions were of high quality. This indicates that some environmental parameters were not maintained during first return mission since all of these samples were stored in the same location at -80° C on the ISS. Therefore, due to the loss during the first sample return, we had to develop new protocols to maximize RNA yields and optimize labeling techniques for microarray analysis. Using these new protocols, RNA was extracted from several sets of seedlings grown in various light treatments and µg levels and microarray analyses performed. Hundreds of genes were shown to be regulated in response to microgravity and include transcription factors (WRKY, MYB, ZF families) and those involved in plant hormone signaling (auxin, ethylene, and ABA responsive genes). The characterization of the regulated pathways and genes specific to gravity and light treatments is underway. (This project is Supported By: NASA NCC2-1200).

  9. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  10. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-05-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  11. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-01-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  12. Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

    KAUST Repository

    Fujii, Chisato

    2015-04-16

    Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.

  13. A stochastic approach to multi-gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2005-01-01

    In the last years, tens of thousands gene expression profiles for cells of several organisms have been monitored. Gene expression is a complex transcriptional process where mRNA molecules are translated into proteins, which control most of the cell functions. In this process, the correlation among genes is crucial to determine the specific functions of genes. Here, we propose a novel multi-dimensional stochastic approach to deal with the gene correlation phenomena. Interestingly, our stochastic framework suggests that the study of the gene correlation requires only one theoretical assumption-Markov property-and the experimental transition probability, which characterizes the gene correlation system. Finally, a gene expression experiment is proposed for future applications of the model

  14. Evidence for intron length conservation in a set of mammalian genes associated with embryonic development

    LENUS (Irish Health Repository)

    2011-10-05

    Abstract Background We carried out an analysis of intron length conservation across a diverse group of nineteen mammalian species. Motivated by recent research suggesting a role for time delays associated with intron transcription in gene expression oscillations required for early embryonic patterning, we searched for examples of genes that showed the most extreme conservation of total intron content in mammals. Results Gene sets annotated as being involved in pattern specification in the early embryo or containing the homeobox DNA-binding domain, were significantly enriched among genes with highly conserved intron content. We used ancestral sequences reconstructed with probabilistic models that account for insertion and deletion mutations to distinguish insertion and deletion events on lineages leading to human and mouse from their last common ancestor. Using a randomization procedure, we show that genes containing the homeobox domain show less change in intron content than expected, given the number of insertion and deletion events within their introns. Conclusions Our results suggest selection for gene expression precision or the existence of additional development-associated genes for which transcriptional delay is functionally significant.

  15. Association between gene expression profile of the primary tumor and chemotherapy response of metastatic breast cancer

    NARCIS (Netherlands)

    Savci-Heijink, Cemile Dilara; Halfwerk, Hans; Koster, Jan; van de Vijver, Marc Joan

    2017-01-01

    Background: To better predict the likelihood of response to chemotherapy, we have conducted a study comparing the gene expression patterns of primary tumours with their corresponding response to systemic chemotherapy in the metastatic setting. Methods: mRNA expression profiles of breast carcinomas

  16. Arabidopsis ATRX Modulates H3.3 Occupancy and Fine-Tunes Gene Expression

    KAUST Repository

    Duc, Cé line; Benoit, Matthias; Dé tourné , Gwé naë lle; Simon, Lauriane; Poulet, Axel; Jung, Matthieu; Veluchamy, Alaguraj; Latrasse, David; Le Goff, Samuel; Cotterell, Sylviane; Tatout, Christophe; Benhamed, Moussa; Probst, Aline V.

    2017-01-01

    , including the 45S ribosomal DNA (45S rDNA) loci, where loss of ATRX results in altered expression of specific 45S rDNA sequence variants. At the genome-wide scale, our data indicate that ATRX modifies gene expression concomitantly to H3.3 deposition at a set

  17. Minimising Immunohistochemical False Negative ER Classification Using a Complementary 23 Gene Expression Signature of ER Status

    DEFF Research Database (Denmark)

    Li, Qiyuan; Eklund, Aron Charles; Birkbak, Nicolai Juul

    2010-01-01

    with clinical outcome. METHODOLOGY/PRINCIPAL FINDINGS: Firstly, ER status was discriminated by fitting the bimodal expression of ESR1 to a mixed Gaussian model. The discriminative power of ESR1 suggested bimodal expression as an efficient way to stratify breast cancer; therefore we identified a set of genes...

  18. Classification of Breast Cancer Subtypes by combining Gene Expression and DNA Methylation Data

    DEFF Research Database (Denmark)

    List, Markus; Hauschild, Anne-Christin; Tan, Qihua

    2014-01-01

    expression data for hundreds of patients, the challenge is to extract a minimal optimal set of genes with good prognostic properties from a large bulk of genes making a moderate contribution to classification. Several studies have successfully applied machine learning algorithms to solve this so-called gene...... on the transcriptomic, but also on an epigenetic level. We compared so-called random forest derived classification models based on gene expression and methylation data alone, to a model based on the combined features and to a model based on the gold standard PAM50. We obtained bootstrap errors of 10...

  19. Bayesian models and meta analysis for multiple tissue gene expression data following corticosteroid administration

    Directory of Open Access Journals (Sweden)

    Kelemen Arpad

    2008-08-01

    Full Text Available Abstract Background This paper addresses key biological problems and statistical issues in the analysis of large gene expression data sets that describe systemic temporal response cascades to therapeutic doses in multiple tissues such as liver, skeletal muscle, and kidney from the same animals. Affymetrix time course gene expression data U34A are obtained from three different tissues including kidney, liver and muscle. Our goal is not only to find the concordance of gene in different tissues, identify the common differentially expressed genes over time and also examine the reproducibility of the findings by integrating the results through meta analysis from multiple tissues in order to gain a significant increase in the power of detecting differentially expressed genes over time and to find the differential differences of three tissues responding to the drug. Results and conclusion Bayesian categorical model for estimating the proportion of the 'call' are used for pre-screening genes. Hierarchical Bayesian Mixture Model is further developed for the identifications of differentially expressed genes across time and dynamic clusters. Deviance information criterion is applied to determine the number of components for model comparisons and selections. Bayesian mixture model produces the gene-specific posterior probability of differential/non-differential expression and the 95% credible interval, which is the basis for our further Bayesian meta-inference. Meta-analysis is performed in order to identify commonly expressed genes from multiple tissues that may serve as ideal targets for novel treatment strategies and to integrate the results across separate studies. We have found the common expressed genes in the three tissues. However, the up/down/no regulations of these common genes are different at different time points. Moreover, the most differentially expressed genes were found in the liver, then in kidney, and then in muscle.

  20. Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

    Directory of Open Access Journals (Sweden)

    Lucie Kosinová

    Full Text Available The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3 in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information

  1. Venus Express set for launch to the cryptic planet

    Science.gov (United States)

    2005-10-01

    On Wednesday, 26 October 2005, the sky over the Baikonur Cosmodrome, Kazakhstan, will be illuminated by the blast from a Soyuz-Fregat rocket carrying this precious spacecraft aloft. The celestial motion of the planets in our Solar System has given Venus Express the window to travel to Venus on the best route. In fact, every nineteen months Venus reaches the point where a voyage from Earth is the most fuel-efficient. To take advantage of this opportunity, ESA has opted to launch Venus Express within the next ‘launch window’, opening on 26 October this year and closing about one month later, on 24 November. Again, due to the relative motion of Earth and Venus, plus Earth’s daily rotation, there is only one short period per day when it is possible to launch, lasting only a few seconds. The first launch opportunity is on 26 October at 06:43 Central European Summer Time (CEST) (10:43 in Baikonur). Venus Express will take only 163 days, a little more than five months, to reach Venus. Then, in April 2006, the adventure of exploration will begin with Venus finally welcoming a spacecraft, a fully European one, more than ten years after humankind paid the last visit. The journey starts at launch One of the most reliable launchers in the world, the Soyuz-Fregat rocket, will set Venus Express on course for its target. Soyuz, procured by the European/Russian Starsem company, consists of three main stages with an additional upper stage, Fregat, atop. Venus Express is attached to this upper stage. The injection of Venus Express into the interplanetary trajectory which will bring it to Venus consists of three phases. In the first nine minutes after launch, Soyuz will perform the first phase, that is an almost vertical ascent trajectory, in which it is boosted to about 190 kilometres altitude by its three stages, separating in sequence. In the second phase, the Fregat-Venus Express ‘block’, now free from the Soyuz, is injected into a circular parking orbit around Earth

  2. BRAIN NETWORKS. Correlated gene expression supports synchronous activity in brain networks.

    Science.gov (United States)

    Richiardi, Jonas; Altmann, Andre; Milazzo, Anna-Clare; Chang, Catie; Chakravarty, M Mallar; Banaschewski, Tobias; Barker, Gareth J; Bokde, Arun L W; Bromberg, Uli; Büchel, Christian; Conrod, Patricia; Fauth-Bühler, Mira; Flor, Herta; Frouin, Vincent; Gallinat, Jürgen; Garavan, Hugh; Gowland, Penny; Heinz, Andreas; Lemaître, Hervé; Mann, Karl F; Martinot, Jean-Luc; Nees, Frauke; Paus, Tomáš; Pausova, Zdenka; Rietschel, Marcella; Robbins, Trevor W; Smolka, Michael N; Spanagel, Rainer; Ströhle, Andreas; Schumann, Gunter; Hawrylycz, Mike; Poline, Jean-Baptiste; Greicius, Michael D

    2015-06-12

    During rest, brain activity is synchronized between different regions widely distributed throughout the brain, forming functional networks. However, the molecular mechanisms supporting functional connectivity remain undefined. We show that functional brain networks defined with resting-state functional magnetic resonance imaging can be recapitulated by using measures of correlated gene expression in a post mortem brain tissue data set. The set of 136 genes we identify is significantly enriched for ion channels. Polymorphisms in this set of genes significantly affect resting-state functional connectivity in a large sample of healthy adolescents. Expression levels of these genes are also significantly associated with axonal connectivity in the mouse. The results provide convergent, multimodal evidence that resting-state functional networks correlate with the orchestrated activity of dozens of genes linked to ion channel activity and synaptic function. Copyright © 2015, American Association for the Advancement of Science.

  3. Microarray meta-analysis to explore abiotic stress-specific gene expression patterns in Arabidopsis.

    Science.gov (United States)

    Shen, Po-Chih; Hour, Ai-Ling; Liu, Li-Yu Daisy

    2017-12-01

    Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data. By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search. Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.

  4. Measurement of Gene Expression in Archival Paraffin-Embedded Tissues

    Science.gov (United States)

    Cronin, Maureen; Pho, Mylan; Dutta, Debjani; Stephans, James C.; Shak, Steven; Kiefer, Michael C.; Esteban, Jose M.; Baker, Joffre B.

    2004-01-01

    Throughout the last decade many laboratories have shown that mRNA levels in formalin-fixed and paraffin-embedded (FPE) tissue specimens can be quantified by reverse transcriptase-polymerase chain reaction (RT-PCR) techniques despite the extensive RNA fragmentation that occurs in tissues so preserved. We have developed RT-PCR methods that are sensitive, precise, and that have multianalyte capability for potential wide use in clinical research and diagnostic assays. Here it is shown that the extent of fragmentation of extracted FPE tissue RNA significantly increases with archive storage time. Probe and primer sets for RT-PCR assays based on amplicons that are both short and homogeneous in length enable effective reference gene-based data normalization for cross comparison of specimens that differ substantially in age. A 48-gene assay used to compare gene expression profiles from the same breast cancer tissue that had been either frozen or FPE showed very similar profiles after reference gene-based normalization. A 92-gene assay, using RNA extracted from three 10-μm FPE sections of archival breast cancer specimens (dating from 1985 to 2001) yielded analyzable data for these genes in all 62 tested specimens. The results were substantially concordant when estrogen receptor, progesterone receptor, and HER2 receptor status determined by RT-PCR was compared with immunohistochemistry assays for these receptors. Furthermore, the results highlight the advantages of RT-PCR over immunohistochemistry with respect to quantitation and dynamic range. These findings support the development of RT-PCR analysis of FPE tissue RNA as a platform for multianalyte clinical diagnostic tests. PMID:14695316

  5. Gene Expression Measurement Module (GEMM) - a fully automated, miniaturized instrument for measuring gene expression in space

    Science.gov (United States)

    Karouia, Fathi; Ricco, Antonio; Pohorille, Andrew; Peyvan, Kianoosh

    2012-07-01

    The capability to measure gene expression on board spacecrafts opens the doors to a large number of experiments on the influence of space environment on biological systems that will profoundly impact our ability to conduct safe and effective space travel, and might also shed light on terrestrial physiology or biological function and human disease and aging processes. Measurements of gene expression will help us to understand adaptation of terrestrial life to conditions beyond the planet of origin, identify deleterious effects of the space environment on a wide range of organisms from microbes to humans, develop effective countermeasures against these effects, determine metabolic basis of microbial pathogenicity and drug resistance, test our ability to sustain and grow in space organisms that can be used for life support and in situ resource utilization during long-duration space exploration, and monitor both the spacecraft environment and crew health. These and other applications hold significant potential for discoveries in space biology, biotechnology and medicine. Accordingly, supported by funding from the NASA Astrobiology Science and Technology Instrument Development Program, we are developing a fully automated, miniaturized, integrated fluidic system for small spacecraft capable of in-situ measuring microbial expression of thousands of genes from multiple samples. The instrument will be capable of (1) lysing bacterial cell walls, (2) extracting and purifying RNA released from cells, (3) hybridizing it on a microarray and (4) providing electrochemical readout, all in a microfluidics cartridge. The prototype under development is suitable for deployment on nanosatellite platforms developed by the NASA Small Spacecraft Office. The first target application is to cultivate and measure gene expression of the photosynthetic bacterium Synechococcus elongatus, i.e. a cyanobacterium known to exhibit remarkable metabolic diversity and resilience to adverse conditions

  6. Differentially expressed genes in iron-induced prion protein conversion

    International Nuclear Information System (INIS)

    Kim, Minsun; Kim, Eun-hee; Choi, Bo-Ran; Woo, Hee-Jong

    2016-01-01

    The conversion of the cellular prion protein (PrP C ) to the protease-resistant isoform is the key event in chronic neurodegenerative diseases, including transmissible spongiform encephalopathies (TSEs). Increased iron in prion-related disease has been observed due to the prion protein-ferritin complex. Additionally, the accumulation and conversion of recombinant PrP (rPrP) is specifically derived from Fe(III) but not Fe(II). Fe(III)-mediated PK-resistant PrP (PrP res ) conversion occurs within a complex cellular environment rather than via direct contact between rPrP and Fe(III). In this study, differentially expressed genes correlated with prion degeneration by Fe(III) were identified using Affymetrix microarrays. Following Fe(III) treatment, 97 genes were differentially expressed, including 85 upregulated genes and 12 downregulated genes (≥1.5-fold change in expression). However, Fe(II) treatment produced moderate alterations in gene expression without inducing dramatic alterations in gene expression profiles. Moreover, functional grouping of identified genes indicated that the differentially regulated genes were highly associated with cell growth, cell maintenance, and intra- and extracellular transport. These findings showed that Fe(III) may influence the expression of genes involved in PrP folding by redox mechanisms. The identification of genes with altered expression patterns in neural cells may provide insights into PrP conversion mechanisms during the development and progression of prion-related diseases. - Highlights: • Differential genes correlated with prion degeneration by Fe(III) were identified. • Genes were identified in cell proliferation and intra- and extracellular transport. • In PrP degeneration, redox related genes were suggested. • Cbr2, Rsad2, Slc40a1, Amph and Mvd were expressed significantly.

  7. Gene expression profile data for mouse facial development

    Directory of Open Access Journals (Sweden)

    Sonia M. Leach

    2017-08-01

    Full Text Available This article contains data related to the research articles "Spatial and Temporal Analysis of Gene Expression during Growth and Fusion of the Mouse Facial Prominences" (Feng et al., 2009 [1] and “Systems Biology of facial development: contributions of ectoderm and mesenchyme” (Hooper et al., 2017 In press [2]. Embryonic mammalian craniofacial development is a complex process involving the growth, morphogenesis, and fusion of distinct facial prominences into a functional whole. Aberrant gene regulation during this process can lead to severe craniofacial birth defects, including orofacial clefting. As a means to understand the genes involved in facial development, we had previously dissected the embryonic mouse face into distinct prominences: the mandibular, maxillary or nasal between E10.5 and E12.5. The prominences were then processed intact, or separated into ectoderm and mesenchyme layers, prior analysis of RNA expression using microarrays (Feng et al., 2009, Hooper et al., 2017 in press [1,2]. Here, individual gene expression profiles have been built from these datasets that illustrate the timing of gene expression in whole prominences or in the separated tissue layers. The data profiles are presented as an indexed and clickable list of the genes each linked to a graphical image of that gene׳s expression profile in the ectoderm, mesenchyme, or intact prominence. These data files will enable investigators to obtain a rapid assessment of the relative expression level of any gene on the array with respect to time, tissue, prominence, and expression trajectory.

  8. Stably Expressed Genes Involved in Basic Cellular Functions.

    Directory of Open Access Journals (Sweden)

    Kejian Wang

    Full Text Available Stably Expressed Genes (SEGs whose expression varies within a narrow range may be involved in core cellular processes necessary for basic functions. To identify such genes, we re-analyzed existing RNA-Seq gene expression profiles across 11 organs at 4 developmental stages (from immature to old age in both sexes of F344 rats (n = 4/group; 320 samples. Expression changes (calculated as the maximum expression / minimum expression for each gene of >19000 genes across organs, ages, and sexes ranged from 2.35 to >109-fold, with a median of 165-fold. The expression of 278 SEGs was found to vary ≤4-fold and these genes were significantly involved in protein catabolism (proteasome and ubiquitination, RNA transport, protein processing, and the spliceosome. Such stability of expression was further validated in human samples where the expression variability of the homologous human SEGs was significantly lower than that of other genes in the human genome. It was also found that the homologous human SEGs were generally less subject to non-synonymous mutation than other genes, as would be expected of stably expressed genes. We also found that knockout of SEG homologs in mouse models was more likely to cause complete preweaning lethality than non-SEG homologs, corroborating the fundamental roles played by SEGs in biological development. Such stably expressed genes and pathways across life-stages suggest that tight control of these processes is important in basic cellular functions and that perturbation by endogenous (e.g., genetics or exogenous agents (e.g., drugs, environmental factors may cause serious adverse effects.

  9. Evolution and Expression Patterns of CYC/TB1 Genes in Anacyclus: Phylogenetic Insights for Floral Symmetry Genes in Asteraceae

    Science.gov (United States)

    Bello, María A.; Cubas, Pilar; Álvarez, Inés; Sanjuanbenito, Guillermo; Fuertes-Aguilar, Javier

    2017-01-01

    Homologs of the CYC/TB1 gene family have been independently recruited many times across the eudicots to control aspects of floral symmetry The family Asteraceae exhibits the largest known diversification in this gene paralog family accompanied by a parallel morphological floral richness in its specialized head-like inflorescence. In Asteraceae, whether or not CYC/TB1 gene floral symmetry function is preserved along organismic and gene lineages is unknown. In this study, we used phylogenetic, structural and expression analyses focused on the highly derived genus Anacyclus (tribe Anthemidae) to address this question. Phylogenetic reconstruction recovered eight main gene lineages present in Asteraceae: two from CYC1, four from CYC2 and two from CYC3-like genes. The species phylogeny was recovered in most of the gene lineages, allowing the delimitation of orthologous sets of CYC/TB1 genes in Asteraceae. Quantitative real-time PCR analysis indicated that in Anacyclus three of the four isolated CYC2 genes are more highly expressed in ray flowers. The expression of the four AcCYC2 genes overlaps in several organs including the ligule of ray flowers, as well as in anthers and ovules throughout development. PMID:28487706

  10. ANALYSES ON DIFFERENTIALLY EXPRESSED GENES ASSOCIATED WITH HUMAN BREAST CANCER

    Institute of Scientific and Technical Information of China (English)

    MENG Xu-li; DING Xiao-wen; XU Xiao-hong

    2006-01-01

    Objective: To investigate the molecular etiology of breast cancer by way of studying the differential expression and initial function of the related genes in the occurrence and development of breast cancer. Methods: Two hundred and eighty-eight human tumor related genes were chosen for preparation of the oligochips probe. mRNA was extracted from 16 breast cancer tissues and the corresponding normal breast tissues, and cDNA probe was prepared through reverse-transcription and hybridized with the gene chip. A laser focused fluorescent scanner was used to scan the chip. The different gene expressions were thereafter automatically compared and analyzed between the two sample groups. Cy3/Cy5>3.5 meant significant up-regulation. Cy3/Cy5<0.25 meant significant down-regulation. Results: The comparison between the breast cancer tissues and their corresponding normal tissues showed that 84 genes had differential expression in the Chip. Among the differently expressed genes, there were 4 genes with significant down-regulation and 6 with significant up-regulation. Compared with normal breast tissues, differentially expressed genes did partially exist in the breast cancer tissues. Conclusion: Changes in multi-gene expression regulations take place during the occurrence and development of breast cancer; and the research on related genes can help understanding the mechanism of tumor occurrence.

  11. Regulation of mitochondrial gene expression, the epigenetic enigma

    NARCIS (Netherlands)

    Mposhi, Archibold; van der Wijst, Monique G. P.; Faber, Klaas Nico; Rots, Marianne G.

    2017-01-01

    Epigenetics provides an important layer of information on top of the DNA sequence and is essential for establishing gene expression profiles. Extensive studies have shown that nuclear DNA methylation and histone modifications influence nuclear gene expression. However, it remains unclear whether

  12. Expression of KLK2 gene in prostate cancer

    Directory of Open Access Journals (Sweden)

    Sajad Shafai

    2018-01-01

    Conclusion: The expression of KLK2 gene in people with prostate cancer is the higher than the healthy person; finally, according to the results, it could be mentioned that the KLK2 gene considered as a useful factor in prostate cancer, whose expression is associated with progression and development of the prostate cancer.

  13. Comparative genomics of the relationship between gene structure and expression

    NARCIS (Netherlands)

    Ren, X.

    2006-01-01

    The relationship between the structure of genes and their expression is a relatively new aspect of genome organization and regulation. With more genome sequences and expression data becoming available, bioinformatics approaches can help the further elucidation of the relationships between gene

  14. The gene expressions of DNA methylation/demethylation enzymes ...

    African Journals Online (AJOL)

    user

    2011-01-31

    Jan 31, 2011 ... A decrease in mRNA levels for cytochrome c oxidase (COX) subunits was observed in skeletal muscle of hypothyroid rats. However, the precise expression mechanisms of the related genes in hypothyroid state still remain unclear. This study investigated gene expressions of DNA methyltransferases.

  15. Genome polymorphism markers and stress genes expression for ...

    African Journals Online (AJOL)

    SAM

    2014-06-11

    Jun 11, 2014 ... RNA extraction and purification for SOD and PAL gene expression. Fresh leaf tissues (100 mg), from ... Data analysis. Gelquant program for quantification of protein, DNA and RNA gel. (version 1.8.2) was used for .... by reprogramming the expression of endogenous genes. Higher level of these antioxidant ...

  16. Genome organization and expression of the rat ACBP gene family

    DEFF Research Database (Denmark)

    Mandrup, S; Andreasen, P H; Knudsen, J

    1993-01-01

    pool former. We have molecularly cloned and characterized the rat ACBP gene family which comprises one expressed and four processed pseudogenes. One of these was shown to exist in two allelic forms. A comprehensive computer-aided analysis of the promoter region of the expressed ACBP gene revealed...

  17. Effects of heat stress on gene expression in eggplant ( Solanum ...

    African Journals Online (AJOL)

    In order to identify differentially expressed genes involved in heat shock response, cDNA amplified fragment length polymorphism (cDNA-AFLP) and quantitative real-time polymerase chain reaction (QPCR) were used to study gene expression of eggplant seedlings subjected to 0, 6 and 12 h at 43°C. A total of 53 of over ...

  18. RNA preparation and characterization for gene expression studies

    DEFF Research Database (Denmark)

    Stangegaard, Michael

    2009-01-01

    Much information can be obtained from knowledge of the relative expression level of each gene in the transcriptome. With the current advances in technology as little as a single cell is required as starting material for gene expression experiments. The mRNA from a single cell may be linearly...

  19. The gene expressions of DNA methylation/demethylation enzymes ...

    African Journals Online (AJOL)

    A decrease in mRNA levels for cytochrome c oxidase (COX) subunits was observed in skeletal muscle of hypothyroid rats. However, the precise expression mechanisms of the related genes in hypothyroid state still remain unclear. This study investigated gene expressions of DNA methyltransferases (Dnmts), DNA ...

  20. Microarray analysis of the gene expression profile in triethylene ...

    African Journals Online (AJOL)

    Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  1. A search engine to identify pathway genes from expression data on multiple organisms

    Directory of Open Access Journals (Sweden)

    Zambon Alexander C

    2007-05-01

    Full Text Available Abstract Background The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulatation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results We developed a search engine, the Multiple-Species Gene Recommender (MSGR, which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.

  2. DNMT1 is associated with cell cycle and DNA replication gene sets in diffuse large B-cell lymphoma.

    Science.gov (United States)

    Loo, Suet Kee; Ab Hamid, Suzina Sheikh; Musa, Mustaffa; Wong, Kah Keng

    2018-01-01

    Dysregulation of DNA (cytosine-5)-methyltransferase 1 (DNMT1) is associated with the pathogenesis of various types of cancer. It has been previously shown that DNMT1 is frequently expressed in diffuse large B-cell lymphoma (DLBCL), however its functions remain to be elucidated in the disease. In this study, we gene expression profiled (GEP) shRNA targeting DNMT1(shDNMT1)-treated germinal center B-cell-like DLBCL (GCB-DLBCL)-derived cell line (i.e. HT) compared with non-silencing shRNA (control shRNA)-treated HT cells. Independent gene set enrichment analysis (GSEA) performed using GEPs of shRNA-treated HT cells and primary GCB-DLBCL cases derived from two publicly-available datasets (i.e. GSE10846 and GSE31312) produced three separate lists of enriched gene sets for each gene sets collection from Molecular Signatures Database (MSigDB). Subsequent Venn analysis identified 268, 145 and six consensus gene sets from analyzing gene sets in C2 collection (curated gene sets), C5 sub-collection [gene sets from gene ontology (GO) biological process ontology] and Hallmark collection, respectively to be enriched in positive correlation with DNMT1 expression profiles in shRNA-treated HT cells, GSE10846 and GSE31312 datasets [false discovery rate (FDR) 0.8) with DNMT1 expression and significantly downregulated (log fold-change <-1.35; p<0.05) following DNMT1 silencing in HT cells. These results suggest the involvement of DNMT1 in the activation of cell cycle and DNA replication in DLBCL cells. Copyright © 2017 Elsevier GmbH. All rights reserved.

  3. Integration of steady-state and temporal gene expression data for the inference of gene regulatory networks.

    Science.gov (United States)

    Wang, Yi Kan; Hurley, Daniel G; Schnell, Santiago; Print, Cristin G; Crampin, Edmund J

    2013-01-01

    We develop a new regression algorithm, cMIKANA, for inference of gene regulatory networks from combinations of steady-state and time-series gene expression data. Using simulated gene expression datasets to assess the accuracy of reconstructing gene regulatory networks, we show that steady-state and time-series data sets can successfully be combined to identify gene regulatory interactions using the new algorithm. Inferring gene networks from combined data sets was found to be advantageous when using noisy measurements collected with either lower sampling rates or a limited number of experimental replicates. We illustrate our method by applying it to a microarray gene expression dataset from human umbilical vein endothelial cells (HUVECs) which combines time series data from treatment with growth factor TNF and steady state data from siRNA knockdown treatments. Our results suggest that the combination of steady-state and time-series datasets may provide better prediction of RNA-to-RNA interactions, and may also reveal biological features that cannot be identified from dynamic or steady state information alone. Finally, we consider the experimental design of genomics experiments for gene regulatory network inference and show that network inference can be improved by incorporating steady-state measurements with time-series data.

  4. Fungal and plant gene expression in arbuscular mycorrhizal symbiosis.

    Science.gov (United States)

    Balestrini, Raffaella; Lanfranco, Luisa

    2006-11-01

    Arbuscular mycorrhizas (AMs) are a unique example of symbiosis between two eukaryotes, soil fungi and plants. This association induces important physiological changes in each partner that lead to reciprocal benefits, mainly in nutrient supply. The symbiosis results from modifications in plant and fungal cell organization caused by specific changes in gene expression. Recently, much effort has gone into studying these gene expression patterns to identify a wider spectrum of genes involved. We aim in this review to describe AM symbiosis in terms of current knowledge on plant and fungal gene expression profiles.

  5. Expression and clinical significance of Pax6 gene in retinoblastoma

    Directory of Open Access Journals (Sweden)

    Hai-Dong Huang

    2013-07-01

    Full Text Available AIM: To discuss the expression and clinical significance of Pax6 gene in retinoblastoma(Rb. METHODS: Totally 15 cases of fresh Rb organizations were selected as observation group and 15 normal retinal organizations as control group. Western-Blot and reverse transcriptase polymerase chain reaction(RT-PCRmethods were used to detect Pax6 protein and Pax6 mRNA expressions of the normal retina organizations and Rb organizations. At the same time, Western Blot method was used to detect the Pax6 gene downstream MATH5 and BRN3b differentiation gene protein level expression. After the comparison between two groups, the expression and clinical significance of Pax6 gene in Rb were discussed. RESULTS: In the observation group, average value of mRNA expression of Pax6 gene was 0.99±0.03; average value of Pax6 gene protein expression was 2.07±0.15; average value of BRN3b protein expression was 0.195±0.016; average value of MATH5 protein expression was 0.190±0.031. They were significantly higher than the control group, and the differences were statistically significant(PCONCLUSION: Abnormal expression of Pax6 gene is likely to accelerate the occurrence of Rb.

  6. Gene expression in cerebral ischemia: a new approach for neuroprotection.

    Science.gov (United States)

    Millán, Mónica; Arenillas, Juan

    2006-01-01

    Cerebral ischemia is one of the strongest stimuli for gene induction in the brain. Hundreds of genes have been found to be induced by brain ischemia. Many genes are involved in neurodestructive functions such as excitotoxicity, inflammatory response and neuronal apoptosis. However, cerebral ischemia is also a powerful reformatting and reprogramming stimulus for the brain through neuroprotective gene expression. Several genes may participate in both cellular responses. Thus, isolation of candidate genes for neuroprotection strategies and interpretation of expression changes have been proven difficult. Nevertheless, many studies are being carried out to improve the knowledge of the gene activation and protein expression following ischemic stroke, as well as in the development of new therapies that modify biochemical, molecular and genetic changes underlying cerebral ischemia. Owing to the complexity of the process involving numerous critical genes expressed differentially in time, space and concentration, ongoing therapeutic efforts should be based on multiple interventions at different levels. By modification of the acute gene expression induced by ischemia or the apoptotic gene program, gene therapy is a promising treatment but is still in a very experimental phase. Some hurdles will have to be overcome before these therapies can be introduced into human clinical stroke trials. Copyright 2006 S. Karger AG, Basel.

  7. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.