WorldWideScience

Sample records for gene expression prediction

  1. Blood Gene Expression Predicts Bronchiolitis Obliterans Syndrome

    Directory of Open Access Journals (Sweden)

    Richard Danger

    2018-01-01

    Full Text Available Bronchiolitis obliterans syndrome (BOS, the main manifestation of chronic lung allograft dysfunction, leads to poor long-term survival after lung transplantation. Identifying predictors of BOS is essential to prevent the progression of dysfunction before irreversible damage occurs. By using a large set of 107 samples from lung recipients, we performed microarray gene expression profiling of whole blood to identify early biomarkers of BOS, including samples from 49 patients with stable function for at least 3 years, 32 samples collected at least 6 months before BOS diagnosis (prediction group, and 26 samples at or after BOS diagnosis (diagnosis group. An independent set from 25 lung recipients was used for validation by quantitative PCR (13 stables, 11 in the prediction group, and 8 in the diagnosis group. We identified 50 transcripts differentially expressed between stable and BOS recipients. Three genes, namely POU class 2 associating factor 1 (POU2AF1, T-cell leukemia/lymphoma protein 1A (TCL1A, and B cell lymphocyte kinase, were validated as predictive biomarkers of BOS more than 6 months before diagnosis, with areas under the curve of 0.83, 0.77, and 0.78 respectively. These genes allow stratification based on BOS risk (log-rank test p < 0.01 and are not associated with time posttransplantation. This is the first published large-scale gene expression analysis of blood after lung transplantation. The three-gene blood signature could provide clinicians with new tools to improve follow-up and adapt treatment of patients likely to develop BOS.

  2. Predicting cellular growth from gene expression signatures.

    Directory of Open Access Journals (Sweden)

    Edoardo M Airoldi

    2009-01-01

    Full Text Available Maintaining balanced growth in a changing environment is a fundamental systems-level challenge for cellular physiology, particularly in microorganisms. While the complete set of regulatory and functional pathways supporting growth and cellular proliferation are not yet known, portions of them are well understood. In particular, cellular proliferation is governed by mechanisms that are highly conserved from unicellular to multicellular organisms, and the disruption of these processes in metazoans is a major factor in the development of cancer. In this paper, we develop statistical methodology to identify quantitative aspects of the regulatory mechanisms underlying cellular proliferation in Saccharomyces cerevisiae. We find that the expression levels of a small set of genes can be exploited to predict the instantaneous growth rate of any cellular culture with high accuracy. The predictions obtained in this fashion are robust to changing biological conditions, experimental methods, and technological platforms. The proposed model is also effective in predicting growth rates for the related yeast Saccharomyces bayanus and the highly diverged yeast Schizosaccharomyces pombe, suggesting that the underlying regulatory signature is conserved across a wide range of unicellular evolution. We investigate the biological significance of the gene expression signature that the predictions are based upon from multiple perspectives: by perturbing the regulatory network through the Ras/PKA pathway, observing strong upregulation of growth rate even in the absence of appropriate nutrients, and discovering putative transcription factor binding sites, observing enrichment in growth-correlated genes. More broadly, the proposed methodology enables biological insights about growth at an instantaneous time scale, inaccessible by direct experimental methods. Data and tools enabling others to apply our methods are available at http://function.princeton.edu/growthrate.

  3. Clinicopathologic and gene expression parameters predict liver cancer prognosis

    International Nuclear Information System (INIS)

    Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

    2011-01-01

    The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome

  4. A deep auto-encoder model for gene expression prediction.

    Science.gov (United States)

    Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

    2017-11-17

    Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.

  5. Embryo quality predictive models based on cumulus cells gene expression

    Directory of Open Access Journals (Sweden)

    Devjak R

    2016-06-01

    Full Text Available Since the introduction of in vitro fertilization (IVF in clinical practice of infertility treatment, the indicators for high quality embryos were investigated. Cumulus cells (CC have a specific gene expression profile according to the developmental potential of the oocyte they are surrounding, and therefore, specific gene expression could be used as a biomarker. The aim of our study was to combine more than one biomarker to observe improvement in prediction value of embryo development. In this study, 58 CC samples from 17 IVF patients were analyzed. This study was approved by the Republic of Slovenia National Medical Ethics Committee. Gene expression analysis [quantitative real time polymerase chain reaction (qPCR] for five genes, analyzed according to embryo quality level, was performed. Two prediction models were tested for embryo quality prediction: a binary logistic and a decision tree model. As the main outcome, gene expression levels for five genes were taken and the area under the curve (AUC for two prediction models were calculated. Among tested genes, AMHR2 and LIF showed significant expression difference between high quality and low quality embryos. These two genes were used for the construction of two prediction models: the binary logistic model yielded an AUC of 0.72 ± 0.08 and the decision tree model yielded an AUC of 0.73 ± 0.03. Two different prediction models yielded similar predictive power to differentiate high and low quality embryos. In terms of eventual clinical decision making, the decision tree model resulted in easy-to-interpret rules that are highly applicable in clinical practice.

  6. Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Liying Yang

    2016-01-01

    Full Text Available Background. Precisely predicting cancer is crucial for cancer treatment. Gene expression profiles make it possible to analyze patterns between genes and cancers on the genome-wide scale. Gene expression data analysis, however, is confronted with enormous challenges for its characteristics, such as high dimensionality, small sample size, and low Signal-to-Noise Ratio. Results. This paper proposes a method, termed RS_SVM, to predict gene expression profiles via aggregating SVM trained on random subspaces. After choosing gene features through statistical analysis, RS_SVM randomly selects feature subsets to yield random subspaces and training SVM classifiers accordingly and then aggregates SVM classifiers to capture the advantage of ensemble learning. Experiments on eight real gene expression datasets are performed to validate the RS_SVM method. Experimental results show that RS_SVM achieved better classification accuracy and generalization performance in contrast with single SVM, K-nearest neighbor, decision tree, Bagging, AdaBoost, and the state-of-the-art methods. Experiments also explored the effect of subspace size on prediction performance. Conclusions. The proposed RS_SVM method yielded superior performance in analyzing gene expression profiles, which demonstrates that RS_SVM provides a good channel for such biological data.

  7. Multiple Suboptimal Solutions for Prediction Rules in Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Osamu Komori

    2013-01-01

    Full Text Available This paper discusses mathematical and statistical aspects in analysis methods applied to microarray gene expressions. We focus on pattern recognition to extract informative features embedded in the data for prediction of phenotypes. It has been pointed out that there are severely difficult problems due to the unbalance in the number of observed genes compared with the number of observed subjects. We make a reanalysis of microarray gene expression published data to detect many other gene sets with almost the same performance. We conclude in the current stage that it is not possible to extract only informative genes with high performance in the all observed genes. We investigate the reason why this difficulty still exists even though there are actively proposed analysis methods and learning algorithms in statistical machine learning approaches. We focus on the mutual coherence or the absolute value of the Pearson correlations between two genes and describe the distributions of the correlation for the selected set of genes and the total set. We show that the problem of finding informative genes in high dimensional data is ill-posed and that the difficulty is closely related with the mutual coherence.

  8. Predictive modelling of gene expression from transcriptional regulatory elements.

    Science.gov (United States)

    Budden, David M; Hurley, Daniel G; Crampin, Edmund J

    2015-07-01

    Predictive modelling of gene expression provides a powerful framework for exploring the regulatory logic underpinning transcriptional regulation. Recent studies have demonstrated the utility of such models in identifying dysregulation of gene and miRNA expression associated with abnormal patterns of transcription factor (TF) binding or nucleosomal histone modifications (HMs). Despite the growing popularity of such approaches, a comparative review of the various modelling algorithms and feature extraction methods is lacking. We define and compare three methods of quantifying pairwise gene-TF/HM interactions and discuss their suitability for integrating the heterogeneous chromatin immunoprecipitation (ChIP)-seq binding patterns exhibited by TFs and HMs. We then construct log-linear and ϵ-support vector regression models from various mouse embryonic stem cell (mESC) and human lymphoblastoid (GM12878) data sets, considering both ChIP-seq- and position weight matrix- (PWM)-derived in silico TF-binding. The two algorithms are evaluated both in terms of their modelling prediction accuracy and ability to identify the established regulatory roles of individual TFs and HMs. Our results demonstrate that TF-binding and HMs are highly predictive of gene expression as measured by mRNA transcript abundance, irrespective of algorithm or cell type selection and considering both ChIP-seq and PWM-derived TF-binding. As we encourage other researchers to explore and develop these results, our framework is implemented using open-source software and made available as a preconfigured bootable virtual environment. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  9. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  10. Prediction of highly expressed genes in microbes based on chromatin accessibility

    DEFF Research Database (Denmark)

    Willenbrock, Hanni; Ussery, David

    2007-01-01

    BACKGROUND: It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed...

  11. Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.

    Directory of Open Access Journals (Sweden)

    Mika Gustafsson

    Full Text Available BACKGROUND: To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem, however, is how to assess and discriminate among the various algorithms which have been developed for this purpose. Therefore, the DREAM project invited the year 2008 to a challenge for predicting gene expression values, and here we present the algorithm with best performance. METHODOLOGY/PRINCIPAL FINDINGS: We develop an algorithm by exploring various regression schemes with different model selection procedures. It turns out that the most effective scheme is based on least squares, with a penalty term of a recently developed form called the "elastic net". Key components in the algorithm are the integration of expression data from other experimental conditions than those presented for the challenge and the utilization of transcription factor binding data for guiding the inference process towards known interactions. Of importance is also a cross-validation procedure where each form of external data is used only to the extent it increases the expected performance. CONCLUSIONS/SIGNIFICANCE: Our algorithm proves both the possibility to extract information from large-scale expression data concerning prediction of gene levels, as well as the benefits of integrating different data sources for improving the inference. We believe the former is an important message to those still hesitating on the possibilities for computational approaches, while the latter is part of an important way forward for the future development of the field of computational systems biology.

  12. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  13. Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans.

    Science.gov (United States)

    Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B

    2017-11-24

    Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression-on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.

  14. A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP Inhibitors

    Science.gov (United States)

    2017-02-01

    affecting the function of Fanconi Anemia (FA) genes ( FANCA /B/C/D2/E/F/G/I/J/L/M, PALB2) or DNA damage response genes involved in HR 5 (ATM, ATR...Award Number: W81XWH-10-1-0585 TITLE: A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP Inhibitors...To) 15 July 2010 – 2 Nov.2016 4. TITLE AND SUBTITLE A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP

  15. Predictive value of MSH2 gene expression in colorectal cancer treated with capecitabine

    DEFF Research Database (Denmark)

    Jensen, Lars H; Danenberg, Kathleen D; Danenberg, Peter V

    2007-01-01

    was associated with a hazard ratio of 0.5 (95% confidence interval, 0.23-1.11; P = 0.083) in survival analysis. CONCLUSION: The higher gene expression of MSH2 in responders and the trend for predicting overall survival indicates a predictive value of this marker in the treatment of advanced CRC with capecitabine.......PURPOSE: The objective of the present study was to evaluate the gene expression of the DNA mismatch repair gene MSH2 as a predictive marker in advanced colorectal cancer (CRC) treated with first-line capecitabine. PATIENTS AND METHODS: Microdissection of paraffin-embedded tumor tissue, RNA...

  16. Prediction of highly expressed genes in microbes based on chromatin accessibility

    Directory of Open Access Journals (Sweden)

    Ussery David W

    2007-02-01

    Full Text Available Abstract Background It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed genes in microbial genomes. We compare these predictions with those based on codon adaptation index (CAI values, and also with experimental data for 6 different microbial genomes, with a particular interest in experimental data from Escherichia coli. Moreover, position preference is examined further in 328 sequenced microbial genomes. Results We find that absolute gene expression levels are correlated with the position preference in many microbial genomes. It is postulated that in these regions, the DNA may be more accessible to the transcriptional machinery. Moreover, ribosomal proteins and ribosomal RNA are encoded by DNA having significantly lower position preference values than other genes in fast-replicating microbes. Conclusion This insight into DNA structure-dependent gene expression in microbes may be exploited for predicting the expression of non-translated genes such as non-coding RNAs that may not be predicted by any of the conventional codon usage bias approaches.

  17. Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans

    Directory of Open Access Journals (Sweden)

    Assaf Gottlieb

    2017-11-01

    Full Text Available Abstract Background Genome-wide association studies are useful for discovering genotype–phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into “gene level” effects. Methods Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression—on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. Results We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Conclusions Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort

  18. Testing the predictive value of peripheral gene expression for nonremission following citalopram treatment for major depression.

    Science.gov (United States)

    Guilloux, Jean-Philippe; Bassi, Sabrina; Ding, Ying; Walsh, Chris; Turecki, Gustavo; Tseng, George; Cyranowski, Jill M; Sibille, Etienne

    2015-02-01

    Major depressive disorder (MDD) in general, and anxious-depression in particular, are characterized by poor rates of remission with first-line treatments, contributing to the chronic illness burden suffered by many patients. Prospective research is needed to identify the biomarkers predicting nonremission prior to treatment initiation. We collected blood samples from a discovery cohort of 34 adult MDD patients with co-occurring anxiety and 33 matched, nondepressed controls at baseline and after 12 weeks (of citalopram plus psychotherapy treatment for the depressed cohort). Samples were processed on gene arrays and group differences in gene expression were investigated. Exploratory analyses suggest that at pretreatment baseline, nonremitting patients differ from controls with gene function and transcription factor analyses potentially related to elevated inflammation and immune activation. In a second phase, we applied an unbiased machine learning prediction model and corrected for model-selection bias. Results show that baseline gene expression predicted nonremission with 79.4% corrected accuracy with a 13-gene model. The same gene-only model predicted nonremission after 8 weeks of citalopram treatment with 76% corrected accuracy in an independent validation cohort of 63 MDD patients treated with citalopram at another institution. Together, these results demonstrate the potential, but also the limitations, of baseline peripheral blood-based gene expression to predict nonremission after citalopram treatment. These results not only support their use in future prediction tools but also suggest that increased accuracy may be obtained with the inclusion of additional predictors (eg, genetics and clinical scales).

  19. Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

    Directory of Open Access Journals (Sweden)

    Karacali Bilge

    2007-10-01

    Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine

  20. Adipose gene expression prior to weight loss can differentiate and weakly predict dietary responders.

    Directory of Open Access Journals (Sweden)

    David M Mutch

    Full Text Available BACKGROUND: The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. METHODOLOGY/PRINCIPAL FINDINGS: The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8-12 kgs weight loss could always be differentiated from non-responders (<4 kgs weight loss. We also assessed whether this differentiation was sufficient for prediction. Using a bottom-up (i.e. black-box approach, standard class prediction algorithms were able to predict dietary responders with up to 61.1%+/-8.1% accuracy. Using a top-down approach (i.e. using differentially expressed genes to build a classifier improved prediction accuracy to 80.9%+/-2.2%. CONCLUSION: Adipose gene expression profiling prior to the consumption of a low-fat diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition.

  1. EvoCor: a platform for predicting functionally related genes using phylogenetic and expression profiles.

    Science.gov (United States)

    Dittmar, W James; McIver, Lauren; Michalak, Pawel; Garner, Harold R; Valdez, Gregorio

    2014-07-01

    The wealth of publicly available gene expression and genomic data provides unique opportunities for computational inference to discover groups of genes that function to control specific cellular processes. Such genes are likely to have co-evolved and be expressed in the same tissues and cells. Unfortunately, the expertise and computational resources required to compare tens of genomes and gene expression data sets make this type of analysis difficult for the average end-user. Here, we describe the implementation of a web server that predicts genes involved in affecting specific cellular processes together with a gene of interest. We termed the server 'EvoCor', to denote that it detects functional relationships among genes through evolutionary analysis and gene expression correlation. This web server integrates profiles of sequence divergence derived by a Hidden Markov Model (HMM) and tissue-wide gene expression patterns to determine putative functional linkages between pairs of genes. This server is easy to use and freely available at http://pilot-hmm.vbi.vt.edu/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Gene expression

    International Nuclear Information System (INIS)

    Hildebrand, C.E.; Crawford, B.D.; Walters, R.A.; Enger, M.D.

    1983-01-01

    We prepared probes for isolating functional pieces of the metallothionein locus. The probes enabled a variety of experiments, eventually revealing two mechanisms for metallothionein gene expression, the order of the DNA coding units at the locus, and the location of the gene site in its chromosome. Once the switch regulating metallothionein synthesis was located, it could be joined by recombinant DNA methods to other, unrelated genes, then reintroduced into cells by gene-transfer techniques. The expression of these recombinant genes could then be induced by exposing the cells to Zn 2+ or Cd 2+ . We would thus take advantage of the clearly defined switching properties of the metallothionein gene to manipulate the expression of other, perhaps normally constitutive, genes. Already, despite an incomplete understanding of how the regulatory switch of the metallothionein locus operates, such experiments have been performed successfully

  3. Gene expression variation to predict 10-year survival in lymph-node-negative breast cancer

    International Nuclear Information System (INIS)

    Karlsson, Elin; Delle, Ulla; Danielsson, Anna; Olsson, Björn; Abel, Frida; Karlsson, Per; Helou, Khalil

    2008-01-01

    It is of great significance to find better markers to correctly distinguish between high-risk and low-risk breast cancer patients since the majority of breast cancer cases are at present being overtreated. 46 tumours from node-negative breast cancer patients were studied with gene expression microarrays. A t-test was carried out in order to find a set of genes where the expression might predict clinical outcome. Two classifiers were used for evaluation of the gene lists, a correlation-based classifier and a Voting Features Interval (VFI) classifier. We then evaluated the predictive accuracy of this expression signature on tumour sets from two similar studies on lymph-node negative patients. They had both developed gene expression signatures superior to current methods in classifying node-negative breast tumours. These two signatures were also tested on our material. A list of 51 genes whose expression profiles could predict clinical outcome with high accuracy in our material (96% or 89% accuracy in cross-validation, depending on type of classifier) was developed. When tested on two independent data sets, the expression signature based on the 51 identified genes had good predictive qualities in one of the data sets (74% accuracy), whereas their predictive value on the other data set were poor, presumably due to the fact that only 23 of the 51 genes were found in that material. We also found that previously developed expression signatures could predict clinical outcome well to moderately well in our material (72% and 61%, respectively). The list of 51 genes derived in this study might have potential for clinical utility as a prognostic gene set, and may include candidate genes of potential relevance for clinical outcome in breast cancer. According to the predictions by this expression signature, 30 of the 46 patients may have benefited from different adjuvant treatment than they recieved. The research on these tumours was approved by the Medical Faculty Research

  4. Clustering gene expression data based on predicted differential effects of GV interaction.

    Science.gov (United States)

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  5. Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

    Science.gov (United States)

    Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

    2009-01-01

    Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary

  6. Muscle myeloid type I interferon gene expression may predict therapeutic responses to rituximab in myositis patients.

    Science.gov (United States)

    Nagaraju, Kanneboyina; Ghimbovschi, Svetlana; Rayavarapu, Sree; Phadke, Aditi; Rider, Lisa G; Hoffman, Eric P; Miller, Frederick W

    2016-09-01

    To identify muscle gene expression patterns that predict rituximab responses and assess the effects of rituximab on muscle gene expression in PM and DM. In an attempt to understand the molecular mechanism of response and non-response to rituximab therapy, we performed Affymetrix gene expression array analyses on muscle biopsy specimens taken before and after rituximab therapy from eight PM and two DM patients in the Rituximab in Myositis study. We also analysed selected muscle-infiltrating cell phenotypes in these biopsies by immunohistochemical staining. Partek and Ingenuity pathway analyses assessed the gene pathways and networks. Myeloid type I IFN signature genes were expressed at higher levels at baseline in the skeletal muscle of rituximab responders than in non-responders, whereas classic non-myeloid IFN signature genes were expressed at higher levels in non-responders at baseline. Also, rituximab responders have a greater reduction of the myeloid and non-myeloid type I IFN signatures than non-responders. The decrease in the type I IFN signature following administration of rituximab may be associated with the decreases in muscle-infiltrating CD19(+) B cells and CD68(+) macrophages in responders. Our findings suggest that high levels of myeloid type I IFN gene expression in skeletal muscle predict responses to rituximab in PM/DM and that rituximab responders also have a greater decrease in the expression of these genes. These data add further evidence to recent studies defining the type I IFN signature as both a predictor of therapeutic responses and a biomarker of myositis disease activity. Published by Oxford University Press on behalf British Society for Rheumatology 2016. This work is written by US Government employees and is in the public domain in the US.

  7. Exploring gene expression signatures for predicting disease free survival after resection of colorectal cancer liver metastases.

    Directory of Open Access Journals (Sweden)

    Nikol Snoeren

    Full Text Available BACKGROUND AND OBJECTIVES: This study was designed to identify and validate gene signatures that can predict disease free survival (DFS in patients undergoing a radical resection for their colorectal liver metastases (CRLM. METHODS: Tumor gene expression profiles were collected from 119 patients undergoing surgery for their CRLM in the Paul Brousse Hospital (France and the University Medical Center Utrecht (The Netherlands. Patients were divided into high and low risk groups. A randomly selected training set was used to find predictive gene signatures. The ability of these gene signatures to predict DFS was tested in an independent validation set comprising the remaining patients. Furthermore, 5 known clinical risk scores were tested in our complete patient cohort. RESULT: No gene signature was found that significantly predicted DFS in the validation set. In contrast, three out of five clinical risk scores were able to predict DFS in our patient cohort. CONCLUSIONS: No gene signature was found that could predict DFS in patients undergoing CRLM resection. Three out of five clinical risk scores were able to predict DFS in our patient cohort. These results emphasize the need for validating risk scores in independent patient groups and suggest improved designs for future studies.

  8. Intra- and interspecies gene expression models for predicting drug response in canine osteosarcoma.

    Science.gov (United States)

    Fowles, Jared S; Brown, Kristen C; Hess, Ann M; Duval, Dawn L; Gustafson, Daniel L

    2016-02-19

    Genomics-based predictors of drug response have the potential to improve outcomes associated with cancer therapy. Osteosarcoma (OS), the most common primary bone cancer in dogs, is commonly treated with adjuvant doxorubicin or carboplatin following amputation of the affected limb. We evaluated the use of gene-expression based models built in an intra- or interspecies manner to predict chemosensitivity and treatment outcome in canine OS. Models were built and evaluated using microarray gene expression and drug sensitivity data from human and canine cancer cell lines, and canine OS tumor datasets. The "COXEN" method was utilized to filter gene signatures between human and dog datasets based on strong co-expression patterns. Models were built using linear discriminant analysis via the misclassification penalized posterior algorithm. The best doxorubicin model involved genes identified in human lines that were co-expressed and trained on canine OS tumor data, which accurately predicted clinical outcome in 73 % of dogs (p = 0.0262, binomial). The best carboplatin model utilized canine lines for gene identification and model training, with canine OS tumor data for co-expression. Dogs whose treatment matched our predictions had significantly better clinical outcomes than those that didn't (p = 0.0006, Log Rank), and this predictor significantly associated with longer disease free intervals in a Cox multivariate analysis (hazard ratio = 0.3102, p = 0.0124). Our data show that intra- and interspecies gene expression models can successfully predict response in canine OS, which may improve outcome in dogs and serve as pre-clinical validation for similar methods in human cancer research.

  9. Prediction of metastasis from low-malignant breast cancer by gene expression profiling

    DEFF Research Database (Denmark)

    Thomassen, Mads; Tan, Qihua; Eiriksdottir, Freyja

    2007-01-01

    examined in these studies is the low-risk patients for whom outcome is very difficult to predict with currently used methods. These patients do not receive adjuvant treatment according to the guidelines of the Danish Breast Cancer Cooperative Group (DBCG). In this study, 26 tumors from low-risk patients...... with different characteristics and risk, expression-based classification specifically developed in low-risk patients have higher predictive power in this group.......Promising results for prediction of outcome in breast cancer have been obtained by genome wide gene expression profiling. Some studies have suggested that an extensive overtreatment of breast cancer patients might be reduced by risk assessment with gene expression profiling. A patient group hardly...

  10. Effectiveness of gene expression profiling for response prediction of rectal cancer to preoperative radiotherapy

    International Nuclear Information System (INIS)

    Ojima, Eiki; Inoue, Yasuhiro; Miki, Chikao; Kusunoki, Masato; Mori, Masaki

    2007-01-01

    Our aim was to determine whether the expression levels of specific genes could predict clinical radiosensitivity in human colorectal cancer. Radioresistant colorectal cancer cell lines were established by repeated X-ray exposure (total, 100 Gy), and the gene expressions of the parent and radioresistant cell lines were compared in a microarray analysis. To verify the microarray data, we carried out a reverse transcriptase-polymerase chain reaction analysis of identified genes in clinical samples from 30 irradiated rectal cancer patients. A comparison of the intensity data for the parent and three radioresistant cell lines revealed 17 upregulated and 142 downregulated genes in all radioresistant cell lines. Next, we focused on two upregulated genes, PTMA (prothymosin α) and EIF5a2 (eukaryotic translation initiation factor 5A), in the radioresistant cell lines. In clinical samples, the expression of PTMA was significantly higher in the minor effect group than in the major effect group (P=0.004), but there were no significant differences in EIF5a2 expression between the two groups. We identified radiation-related genes in colorectal cancer and demonstrated that PTMA may play an important role in radiosensitivity. Our findings suggest that PTMA may be a novel marker for predicting the effectiveness of radiotherapy in clinical cases. (author)

  11. Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2006-01-01

    biological pathways. In particular, we observed that by integrating information from the insulin signalling pathway into our prediction model, we achieved better prediction of prostate cancer. Conclusions: Our data integration methodology provides an efficient way to identify biologically sound and statistically significant pathways from gene expression data. The significant gene expression phenotypes identified in our study have the potential to characterize complex genetic alterations in prostate cancer.

  12. Using gene co-expression network analysis to predict biomarkers for chronic lymphocytic leukemia

    Directory of Open Access Journals (Sweden)

    Borlawsky Tara B

    2010-10-01

    Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH mutational status was found to be associated with patient survival outcome, and biomarkers linked to the IgVH status has been a focus in the CLL prognosis research field. However, biomarkers highly correlated with IgVH mutational status which can accurately predict the survival outcome are yet to be discovered. Results In this paper, we investigate the use of gene co-expression network analysis to identify potential biomarkers for CLL. Specifically we focused on the co-expression network involving ZAP70, a well characterized biomarker for CLL. We selected 23 microarray datasets corresponding to multiple types of cancer from the Gene Expression Omnibus (GEO and used the frequent network mining algorithm CODENSE to identify highly connected gene co-expression networks spanning the entire genome, then evaluated the genes in the co-expression network in which ZAP70 is involved. We then applied a set of feature selection methods to further select genes which are capable of predicting IgVH mutation status from the ZAP70 co-expression network. Conclusions We have identified a set of genes that are potential CLL prognostic biomarkers IL2RB, CD8A, CD247, LAG3 and KLRK1, which can predict CLL patient IgVH mutational status with high accuracies. Their prognostic capabilities were cross-validated by applying these biomarker candidates to classify patients into different outcome groups using a CLL microarray datasets with clinical information.

  13. Adipose Gene Expression Prior to Weight Loss Can Differentiate and Weakly Predict Dietary Responders

    Science.gov (United States)

    Mutch, David M.; Temanni, M. Ramzi; Henegar, Corneliu; Combes, Florence; Pelloux, Véronique; Holst, Claus; Sørensen, Thorkild I. A.; Astrup, Arne; Martinez, J. Alfredo; Saris, Wim H. M.; Viguerie, Nathalie; Langin, Dominique; Zucker, Jean-Daniel; Clément, Karine

    2007-01-01

    Background The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. Methodology/Principal Findings The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB) trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8–12 kgs weight loss) could always be differentiated from non-responders (diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition. PMID:18094752

  14. Cell-specific prediction and application of drug-induced gene expression profiles.

    Science.gov (United States)

    Hodos, Rachel; Zhang, Ping; Lee, Hao-Chih; Duan, Qiaonan; Wang, Zichen; Clark, Neil R; Ma'ayan, Avi; Wang, Fei; Kidd, Brian; Hu, Jianying; Sontag, David; Dudley, Joel

    2018-01-01

    Gene expression profiling of in vitro drug perturbations is useful for many biomedical discovery applications including drug repurposing and elucidation of drug mechanisms. However, limited data availability across cell types has hindered our capacity to leverage or explore the cell-specificity of these perturbations. While recent efforts have generated a large number of drug perturbation profiles across a variety of human cell types, many gaps remain in this combinatorial drug-cell space. Hence, we asked whether it is possible to fill these gaps by predicting cell-specific drug perturbation profiles using available expression data from related conditions--i.e. from other drugs and cell types. We developed a computational framework that first arranges existing profiles into a three-dimensional array (or tensor) indexed by drugs, genes, and cell types, and then uses either local (nearest-neighbors) or global (tensor completion) information to predict unmeasured profiles. We evaluate prediction accuracy using a variety of metrics, and find that the two methods have complementary performance, each superior in different regions in the drug-cell space. Predictions achieve correlations of 0.68 with true values, and maintain accurate differentially expressed genes (AUC 0.81). Finally, we demonstrate that the predicted profiles add value for making downstream associations with drug targets and therapeutic classes.

  15. Prediction of Associations between microRNAs and Gene Expression in Glioma Biology.

    Directory of Open Access Journals (Sweden)

    Stefan Wuchty

    Full Text Available Despite progress in the determination of miR interactions, their regulatory role in cancer is only beginning to be unraveled. Utilizing gene expression data from 27 glioblastoma samples we found that the mere knowledge of physical interactions between specific mRNAs and miRs can be used to determine associated regulatory interactions, allowing us to identify 626 associated interactions, involving 128 miRs that putatively modulate the expression of 246 mRNAs. Experimentally determining the expression of miRs, we found an over-representation of over(under-expressed miRs with various predicted mRNA target sequences. Such significantly associated miRs that putatively bind over-expressed genes strongly tend to have binding sites nearby the 3'UTR of the corresponding mRNAs, suggesting that the presence of the miRs near the translation stop site may be a factor in their regulatory ability. Our analysis predicted a significant association between miR-128 and the protein kinase WEE1, which we subsequently validated experimentally by showing that the over-expression of the naturally under-expressed miR-128 in glioma cells resulted in the inhibition of WEE1 in glioblastoma cells.

  16. Expression of estrogen-related gene markers in breast cancer tissue predicts aromatase inhibitor responsiveness.

    Directory of Open Access Journals (Sweden)

    Irene Moy

    Full Text Available Aromatase inhibitors (AIs are the most effective class of drugs in the endocrine treatment of breast cancer, with an approximate 50% treatment response rate. Our objective was to determine whether intratumoral expression levels of estrogen-related genes are predictive of AI responsiveness in postmenopausal women with breast cancer. Primary breast carcinomas were obtained from 112 women who received AI therapy after failing adjuvant tamoxifen therapy and developing recurrent breast cancer. Tumor ERα and PR protein expression were analyzed by immunohistochemistry (IHC. Messenger RNA (mRNA levels of 5 estrogen-related genes-AKR1C3, aromatase, ERα, and 2 estradiol/ERα target genes, BRCA1 and PR-were measured by real-time PCR. Tumor protein and mRNA levels were compared with breast cancer progression rates to determine predictive accuracy. Responsiveness to AI therapy-defined as the combined complete response, partial response, and stable disease rates for at least 6 months-was 51%; rates were 56% in ERα-IHC-positive and 14% in ERα-IHC-negative tumors. Levels of ERα, PR, or BRCA1 mRNA were independently predictive for responsiveness to AI. In cross-validated analyses, a combined measurement of tumor ERα and PR mRNA levels yielded a more superior specificity (36% and identical sensitivity (96% to the current clinical practice (ERα/PR-IHC. In patients with ERα/PR-IHC-negative tumors, analysis of mRNA expression revealed either non-significant trends or statistically significant positive predictive values for AI responsiveness. In conclusion, expression levels of estrogen-related mRNAs are predictive for AI responsiveness in postmenopausal women with breast cancer, and mRNA expression analysis may improve patient selection.

  17. An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

    Science.gov (United States)

    Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

    2017-12-01

    Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Pancreatic cancer circulating tumour cells express a cell motility gene signature that predicts survival after surgery

    International Nuclear Information System (INIS)

    Sergeant, Gregory; Eijsden, Rudy van; Roskams, Tania; Van Duppen, Victor; Topal, Baki

    2012-01-01

    (95% CI) = 1.366 (1.004 – 1.861)). Pancreatic CTC isolated from blood samples using FACS-based negative depletion, express a cell motility gene signature. Expression of this newly defined cell motility gene signature in the primary tumour can predict survival of patients undergoing surgical resection for pancreatic cancer. Clinical trials.gov NCT00495924

  19. A hemocyte gene expression signature correlated with predictive capacity of oysters to survive Vibrio infections

    Directory of Open Access Journals (Sweden)

    Rosa Rafael

    2012-06-01

    Full Text Available Abstract Background The complex balance between environmental and host factors is an important determinant of susceptibility to infection. Disturbances of this equilibrium may result in multifactorial diseases as illustrated by the summer mortality syndrome, a worldwide and complex phenomenon that affects the oysters, Crassostrea gigas. The summer mortality syndrome reveals a physiological intolerance making this oyster species susceptible to diseases. Exploration of genetic basis governing the oyster resistance or susceptibility to infections is thus a major goal for understanding field mortality events. In this context, we used high-throughput genomic approaches to identify genetic traits that may characterize inherent survival capacities in C. gigas. Results Using digital gene expression (DGE, we analyzed the transcriptomes of hemocytes (immunocompetent cells of oysters able or not able to survive infections by Vibrio species shown to be involved in summer mortalities. Hemocytes were nonlethally collected from oysters before Vibrio experimental infection, and two DGE libraries were generated from individuals that survived or did not survive. Exploration of DGE data and microfluidic qPCR analyses at individual level showed an extraordinary polymorphism in gene expressions, but also a set of hemocyte-expressed genes whose basal mRNA levels discriminate oyster capacity to survive infections by the pathogenic V. splendidus LGP32. Finally, we identified a signature of 14 genes that predicted oyster survival capacity. Their expressions are likely driven by distinct transcriptional regulation processes associated or not associated to gene copy number variation (CNV. Conclusions We provide here for the first time in oyster a gene expression survival signature that represents a useful tool for understanding mortality events and for assessing genetic traits of interest for disease resistance selection programs.

  20. Computational Prediction of MicroRNAs from Toxoplasma gondii Potentially Regulating the Hosts’ Gene Expression

    Directory of Open Access Journals (Sweden)

    Müşerref Duygu Saçar

    2014-10-01

    Full Text Available MicroRNAs (miRNAs were discovered two decades ago, yet there is still a great need for further studies elucidating their genesis and targeting in different phyla. Since experimental discovery and validation of miRNAs is difficult, computational predictions are indispensable and today most computational approaches employ machine learning. Toxoplasma gondii, a parasite residing within the cells of its hosts like human, uses miRNAs for its post-transcriptional gene regulation. It may also regulate its hosts’ gene expression, which has been shown in brain cancer. Since previous studies have shown that overexpressed miRNAs within the host are causal for disease onset, we hypothesized that T. gondii could export miRNAs into its host cell. We computationally predicted all hairpins from the genome of T. gondii and used mouse and human models to filter possible candidates. These were then further compared to known miRNAs in human and rodents and their expression was examined for T. gondii grown in mouse and human hosts, respectively. We found that among the millions of potential hairpins in T. gondii, only a few thousand pass filtering using a human or mouse model and that even fewer of those are expressed. Since they are expressed and differentially expressed in rodents and human, we suggest that there is a chance that T. gondii may export miRNAs into its hosts for direct regulation.

  1. Integrating circadian activity and gene expression profiles to predict chronotoxicity of Drosophila suzukii response to insecticides.

    Science.gov (United States)

    Hamby, Kelly A; Kwok, Rosanna S; Zalom, Frank G; Chiu, Joanna C

    2013-01-01

    Native to Southeast Asia, Drosophila suzukii (Matsumura) is a recent invader that infests intact ripe and ripening fruit, leading to significant crop losses in the U.S., Canada, and Europe. Since current D. suzukii management strategies rely heavily on insecticide usage and insecticide detoxification gene expression is under circadian regulation in the closely related Drosophila melanogaster, we set out to determine if integrative analysis of daily activity patterns and detoxification gene expression can predict chronotoxicity of D. suzukii to insecticides. Locomotor assays were performed under conditions that approximate a typical summer or winter day in Watsonville, California, where D. suzukii was first detected in North America. As expected, daily activity patterns of D. suzukii appeared quite different between 'summer' and 'winter' conditions due to differences in photoperiod and temperature. In the 'summer', D. suzukii assumed a more bimodal activity pattern, with maximum activity occurring at dawn and dusk. In the 'winter', activity was unimodal and restricted to the warmest part of the circadian cycle. Expression analysis of six detoxification genes and acute contact bioassays were performed at multiple circadian times, but only in conditions approximating Watsonville summer, the cropping season, when most insecticide applications occur. Five of the genes tested exhibited rhythmic expression, with the majority showing peak expression at dawn (ZT0, 6am). We observed significant differences in the chronotoxicity of D. suzukii towards malathion, with highest susceptibility at ZT0 (6am), corresponding to peak expression of cytochrome P450s that may be involved in bioactivation of malathion. High activity levels were not found to correlate with high insecticide susceptibility as initially hypothesized. Chronobiology and chronotoxicity of D. suzukii provide valuable insights for monitoring and control efforts, because insect activity as well as insecticide timing

  2. Radiation-induced gene expression in human subcutaneous fibroblasts is predictive of radiation-induced fibrosis

    DEFF Research Database (Denmark)

    Rødningen, Olaug Kristin; Børresen-Dale, Anne-Lise; Alsner, Jan

    2008-01-01

    BACKGROUND AND PURPOSE: Breast cancer patients show a large variation in normal tissue reactions after ionizing radiation (IR) therapy. One of the most common long-term adverse effects of ionizing radiotherapy is radiation-induced fibrosis (RIF), and several attempts have been made over the last...... years to develop predictive assays for RIF. Our aim was to identify basal and radiation-induced transcriptional profiles in fibroblasts from breast cancer patients that might be related to the individual risk of RIF in these patients. MATERIALS AND METHODS: Fibroblast cell lines from 31 individuals......-treated fibroblasts. Transcriptional differences in basal and radiation-induced gene expression profiles were investigated using 15K cDNA microarrays, and results analyzed by both SAM and PAM. RESULTS: Sixty differentially expressed genes were identified by applying SAM on 10 patients with the highest risk of RIF...

  3. Establishment of a 12-gene expression signature to predict colon cancer prognosis

    Directory of Open Access Journals (Sweden)

    Dalong Sun

    2018-06-01

    Full Text Available A robust and accurate gene expression signature is essential to assist oncologists to determine which subset of patients at similar Tumor-Lymph Node-Metastasis (TNM stage has high recurrence risk and could benefit from adjuvant therapies. Here we applied a two-step supervised machine-learning method and established a 12-gene expression signature to precisely predict colon adenocarcinoma (COAD prognosis by using COAD RNA-seq transcriptome data from The Cancer Genome Atlas (TCGA. The predictive performance of the 12-gene signature was validated with two independent gene expression microarray datasets: GSE39582 includes 566 COAD cases for the development of six molecular subtypes with distinct clinical, molecular and survival characteristics; GSE17538 is a dataset containing 232 colon cancer patients for the generation of a metastasis gene expression profile to predict recurrence and death in COAD patients. The signature could effectively separate the poor prognosis patients from good prognosis group (disease specific survival (DSS: Kaplan Meier (KM Log Rank p = 0.0034; overall survival (OS: KM Log Rank p = 0.0336 in GSE17538. For patients with proficient mismatch repair system (pMMR in GSE39582, the signature could also effectively distinguish high risk group from low risk group (OS: KM Log Rank p = 0.005; Relapse free survival (RFS: KM Log Rank p = 0.022. Interestingly, advanced stage patients were significantly enriched in high 12-gene score group (Fisher’s exact test p = 0.0003. After stage stratification, the signature could still distinguish poor prognosis patients in GSE17538 from good prognosis within stage II (Log Rank p = 0.01 and stage II & III (Log Rank p = 0.017 in the outcome of DFS. Within stage III or II/III pMMR patients treated with Adjuvant Chemotherapies (ACT and patients with higher 12-gene score showed poorer prognosis (III, OS: KM Log Rank p = 0.046; III & II, OS: KM Log Rank p = 0.041. Among stage II/III pMMR patients

  4. A Computational Gene Expression Score for Predicting Immune Injury in Renal Allografts.

    Directory of Open Access Journals (Sweden)

    Tara K Sigdel

    Full Text Available Whole genome microarray meta-analyses of 1030 kidney, heart, lung and liver allograft biopsies identified a common immune response module (CRM of 11 genes that define acute rejection (AR across different engrafted tissues. We evaluated if the CRM genes can provide a molecular microscope to quantify graft injury in acute rejection (AR and predict risk of progressive interstitial fibrosis and tubular atrophy (IFTA in histologically normal kidney biopsies.Computational modeling was done on tissue qPCR based gene expression measurements for the 11 CRM genes in 146 independent renal allografts from 122 unique patients with AR (n = 54 and no-AR (n = 92. 24 demographically matched patients with no-AR had 6 and 24 month paired protocol biopsies; all had histologically normal 6 month biopsies, and 12 had evidence of progressive IFTA (pIFTA on their 24 month biopsies. Results were correlated with demographic, clinical and pathology variables.The 11 gene qPCR based tissue CRM score (tCRM was significantly increased in AR (5.68 ± 0.91 when compared to STA (1.29 ± 0.28; p < 0.001 and pIFTA (7.94 ± 2.278 versus 2.28 ± 0.66; p = 0.04, with greatest significance for CXCL9 and CXCL10 in AR (p <0.001 and CD6 (p<0.01, CXCL9 (p<0.05, and LCK (p<0.01 in pIFTA. tCRM was a significant independent correlate of biopsy confirmed AR (p < 0.001; AUC of 0.900; 95% CI = 0.705-903. Gene expression modeling of 6 month biopsies across 7/11 genes (CD6, INPP5D, ISG20, NKG7, PSMB9, RUNX3, and TAP1 significantly (p = 0.037 predicted the development of pIFTA at 24 months.Genome-wide tissue gene expression data mining has supported the development of a tCRM-qPCR based assay for evaluating graft immune inflammation. The tCRM score quantifies injury in AR and stratifies patients at increased risk of future pIFTA prior to any perturbation of graft function or histology.

  5. Gene expression signatures that predict radiation exposure in mice and humans.

    Directory of Open Access Journals (Sweden)

    Holly K Dressman

    2007-04-01

    Full Text Available The capacity to assess environmental inputs to biological phenotypes is limited by methods that can accurately and quantitatively measure these contributions. One such example can be seen in the context of exposure to ionizing radiation.We have made use of gene expression analysis of peripheral blood (PB mononuclear cells to develop expression profiles that accurately reflect prior radiation exposure. We demonstrate that expression profiles can be developed that not only predict radiation exposure in mice but also distinguish the level of radiation exposure, ranging from 50 cGy to 1,000 cGy. Likewise, a molecular signature of radiation response developed solely from irradiated human patient samples can predict and distinguish irradiated human PB samples from nonirradiated samples with an accuracy of 90%, sensitivity of 85%, and specificity of 94%. We further demonstrate that a radiation profile developed in the mouse can correctly distinguish PB samples from irradiated and nonirradiated human patients with an accuracy of 77%, sensitivity of 82%, and specificity of 75%. Taken together, these data demonstrate that molecular profiles can be generated that are highly predictive of different levels of radiation exposure in mice and humans.We suggest that this approach, with additional refinement, could provide a method to assess the effects of various environmental inputs into biological phenotypes as well as providing a more practical application of a rapid molecular screening test for the diagnosis of radiation exposure.

  6. A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress

    Directory of Open Access Journals (Sweden)

    Ching-Hsue Cheng

    2018-01-01

    Full Text Available The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i the proposed model is different from the previous models lacking the concept of time series; (ii the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.

  7. A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress

    Science.gov (United States)

    2018-01-01

    The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies. PMID:29765399

  8. A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress.

    Science.gov (United States)

    Cheng, Ching-Hsue; Chan, Chia-Pang; Yang, Jun-He

    2018-01-01

    The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.

  9. Comparative analysis of codon usage patterns and identification of predicted highly expressed genes in five Salmonella genomes

    Directory of Open Access Journals (Sweden)

    Mondal U

    2008-01-01

    Full Text Available Purpose: To anlyse codon usage patterns of five complete genomes of Salmonella , predict highly expressed genes, examine horizontally transferred pathogenicity-related genes to detect their presence in the strains, and scrutinize the nature of highly expressed genes to infer upon their lifestyle. Methods: Protein coding genes, ribosomal protein genes, and pathogenicity-related genes were analysed with Codon W and CAI (codon adaptation index Calculator. Results: Translational efficiency plays a role in codon usage variation in Salmonella genes. Low bias was noticed in most of the genes. GC3 (guanine cytosine at third position composition does not influence codon usage variation in the genes of these Salmonella strains. Among the cluster of orthologous groups (COGs, translation, ribosomal structure biogenesis [J], and energy production and conversion [C] contained the highest number of potentially highly expressed (PHX genes. Correspondence analysis reveals the conserved nature of the genes. Highly expressed genes were detected. Conclusions: Selection for translational efficiency is the major source of variation of codon usage in the genes of Salmonella . Evolution of pathogenicity-related genes as a unit suggests their ability to infect and exist as a pathogen. Presence of a lot of PHX genes in the information and storage-processing category of COGs indicated their lifestyle and revealed that they were not subjected to genome reduction.

  10. Response-predictive gene expression profiling of glioma progenitor cells in vitro.

    Directory of Open Access Journals (Sweden)

    Sylvia Moeckel

    Full Text Available High-grade gliomas are amongst the most deadly human tumors. Treatment results are disappointing. Still, in several trials around 20% of patients respond to therapy. To date, diagnostic strategies to identify patients that will profit from a specific therapy do not exist.In this study, we used serum-free short-term treated in vitro cell cultures to predict treatment response in vitro. This approach allowed us (a to enrich specimens for brain tumor initiating cells and (b to confront cells with a therapeutic agent before expression profiling.As a proof of principle we analyzed gene expression in 18 short-term serum-free cultures of high-grade gliomas enhanced for brain tumor initiating cells (BTIC before and after in vitro treatment with the tyrosine kinase inhibitor Sunitinib. Profiles from treated progenitor cells allowed to predict therapy-induced impairment of proliferation in vitro.For the tyrosine kinase inhibitor Sunitinib used in this dataset, the approach revealed additional predictive information in comparison to the evaluation of classical signaling analysis.

  11. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  12. Predicting response to primary chemotherapy: gene expression profiling of paraffin-embedded core biopsy tissue.

    Science.gov (United States)

    Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D

    2007-06-01

    Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/M2 q2 wks x 3 and docetaxel 40 mg/M2 weekly x 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsies samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from noninflammatory breast cancer.

  13. Prediction of metabolic flux distribution from gene expression data based on the flux minimization principle.

    Directory of Open Access Journals (Sweden)

    Hyun-Seob Song

    Full Text Available Prediction of possible flux distributions in a metabolic network provides detailed phenotypic information that links metabolism to cellular physiology. To estimate metabolic steady-state fluxes, the most common approach is to solve a set of macroscopic mass balance equations subjected to stoichiometric constraints while attempting to optimize an assumed optimal objective function. This assumption is justifiable in specific cases but may be invalid when tested across different conditions, cell populations, or other organisms. With an aim to providing a more consistent and reliable prediction of flux distributions over a wide range of conditions, in this article we propose a framework that uses the flux minimization principle to predict active metabolic pathways from mRNA expression data. The proposed algorithm minimizes a weighted sum of flux magnitudes, while biomass production can be bounded to fit an ample range from very low to very high values according to the analyzed context. We have formulated the flux weights as a function of the corresponding enzyme reaction's gene expression value, enabling the creation of context-specific fluxes based on a generic metabolic network. In case studies of wild-type Saccharomyces cerevisiae, and wild-type and mutant Escherichia coli strains, our method achieved high prediction accuracy, as gauged by correlation coefficients and sums of squared error, with respect to the experimentally measured values. In contrast to other approaches, our method was able to provide quantitative predictions for both model organisms under a variety of conditions. Our approach requires no prior knowledge or assumption of a context-specific metabolic functionality and does not require trial-and-error parameter adjustments. Thus, our framework is of general applicability for modeling the transcription-dependent metabolism of bacteria and yeasts.

  14. Prediction of lymphatic metastasis based on gene expression profile analysis after brachytherapy for early-stage oral tongue carcinoma

    International Nuclear Information System (INIS)

    Watanabe, Hiroshi; Mogushi, Kaoru; Miura, Masahiko; Yoshimura, Ryo-ichi; Kurabayashi, Tohru; Shibuya, Hitoshi; Tanaka, Hiroshi; Noda, Shuhei; Iwakawa, Mayumi; Imai, Takashi

    2008-01-01

    Background and purpose: The management of lymphatic metastasis of early-stage oral tongue carcinoma patients is crucial for its prognosis. The purpose of this study was to evaluate the predictive ability of lymphatic metastasis after brachytherapy (BRT) for early-stage tongue carcinoma based on gene expression profiling. Patients and methods: Pre-therapeutic biopsies from 39 patients with T1 or T2 tongue cancer were analyzed for gene expression signatures using Codelink Uniset Human 20K Bioarray. All patients were treated with low dose-rate BRT for their primary lesions and underwent strict follow-up under a wait-and-see policy for cervical lymphatic metastasis. Candidate genes were selected for predicting lymph-node status in the reference group by the permutation test. Predictive accuracy was further evaluated by the prediction strength (PS) scoring system using an independent validation group. Results: We selected a set of 19 genes whose expression differed significantly between classes with or without lymphatic metastasis in the reference group. The lymph-node status in the validation group was predicted by the PS scoring system with an accuracy of 76%. Conclusions: Gene expression profiling using 19 genes in primary tumor tissues may allow prediction of lymphatic metastasis after BRT for early-stage oral tongue carcinoma

  15. Predicting spatial and temporal gene expression using an integrative model of transcription factor occupancy and chromatin state.

    Directory of Open Access Journals (Sweden)

    Bartek Wilczynski

    Full Text Available Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhancer elements converge to regulate a gene's expression remains elusive. This is in large part due to the number of different biological processes involved in mediating regulation as well as limited availability of experimental measurements for many of them. Here, we used a Bayesian approach to model diverse experimental regulatory data, leading to accurate predictions of both spatial and temporal aspects of gene expression. We integrated whole-embryo information on transcription factor recruitment to multiple cis-regulatory modules, insulator binding and histone modification status in the vicinity of individual gene loci, at a genome-wide scale during Drosophila development. The model uses Bayesian networks to represent the relation between transcription factor occupancy and enhancer activity in specific tissues and stages. All parameters are optimized in an Expectation Maximization procedure providing a model capable of predicting tissue- and stage-specific activity of new, previously unassayed genes. Performing the optimization with subsets of input data demonstrated that neither enhancer occupancy nor chromatin state alone can explain all gene expression patterns, but taken together allow for accurate predictions of spatio-temporal activity. Model predictions were validated using the expression patterns of more than 600 genes recently made available by the BDGP consortium, demonstrating an average 15-fold enrichment of genes expressed in the predicted tissue over a naïve model. We further validated the model by experimentally testing the expression of 20 predicted target genes of unknown expression, resulting in an accuracy of 95% for temporal

  16. Can survival prediction be improved by merging gene expression data sets?

    Directory of Open Access Journals (Sweden)

    Haleh Yasrebi

    Full Text Available BACKGROUND: High-throughput gene expression profiling technologies generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC and hazard ratio as performance measures, we see in overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a The diversity of microarray platforms used. (b The heterogeneity of patients cohorts. (c The heterogeneity of breast cancer disease. (d Substantial variation of time to death or relapse. (e The reduced number of genes in the merged data

  17. Minimal gene selection for classification and diagnosis prediction based on gene expression profile

    Directory of Open Access Journals (Sweden)

    Alireza Mehridehnavi

    2013-01-01

    Conclusion: We have shown that the use of two most significant genes based on their S/N ratios and selection of suitable training samples can lead to classify DLBCL patients with a rather good result. Actually with the aid of mentioned methods we could compensate lack of enough number of patients, improve accuracy of classifying and reduce complication of computations and so running time.

  18. PREDICTION OF THE COURSE OF OSTEOARTHROSIS FROM mTOR (MAMMALIAN TARGET OF RAPAMYCIN GENE EXPRESSION

    Directory of Open Access Journals (Sweden)

    E V Chetina

    2012-01-01

    Results. Analysis of gene expression in the outpatients with OA identified two subgroups: in one subgroup (n = 13 mTOR expression was considerably much less than that in the control group; the expression of ATG1 and p21 did not differ greatly from the control and that of caspase 3 and TNF-α was significantly higher. The other outpatients (n = 20 and all the examined patients needing endoprosthetic replacement were ascertained to have a higher gene expression of mTOR, ATG1, p21, caspase 3, and TNF-α than in the control group. Before endoprosthetic replacement, severe joint destruction in patients with OA was associated with enhanced gene expression of mTOR, ATG1, p21, and caspase 3. Conclusion. In early-stage disease, increased mTOR gene expression may serve as a prognostic marker of the severity of the disease and articular cartilage destruction.

  19. Gene expression programming for prediction of scour depth downstream of sills

    Science.gov (United States)

    Azamathulla, H. Md.

    2012-08-01

    SummaryLocal scour is crucial in the degradation of river bed and the stability of grade control structures, stilling basins, aprons, ski-jump bucket spillways, bed sills, weirs, check dams, etc. This short communication presents gene-expression programming (GEP), which is an extension to genetic programming (GP), as an alternative approach to predict scour depth downstream of sills. Published data were compiled from the literature for the scour depth downstream of sills. The proposed GEP approach gives satisfactory results (R2 = 0.967 and RMSE = 0.088) compared to the existing predictors (Chinnarasri and Kositgittiwong, 2008) with R2 = 0.87 and RMSE = 2.452 for relative scour depth.

  20. A Machine Learned Classifier That Uses Gene Expression Data to Accurately Predict Estrogen Receptor Status

    Science.gov (United States)

    Bastani, Meysam; Vos, Larissa; Asgarian, Nasimeh; Deschenes, Jean; Graham, Kathryn; Mackey, John; Greiner, Russell

    2013-01-01

    Background Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER) status of the tumor. However, the standard for determining this status, immunohistochemical analysis of formalin-fixed paraffin embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results. Methods To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin fixed tumor. Results This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status. Conclusions Our efficient and parsimonious classifier lends itself to high throughput, highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions. PMID:24312637

  1. A machine learned classifier that uses gene expression data to accurately predict estrogen receptor status.

    Directory of Open Access Journals (Sweden)

    Meysam Bastani

    Full Text Available BACKGROUND: Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER status of the tumor. However, the standard for determining this status, immunohistochemical analysis of formalin-fixed paraffin embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results. METHODS: To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin fixed tumor. RESULTS: This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status. CONCLUSIONS: Our efficient and parsimonious classifier lends itself to high throughput, highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions.

  2. Prediction of essential proteins based on subcellular localization and gene expression correlation.

    Science.gov (United States)

    Fan, Yetian; Tang, Xiwei; Hu, Xiaohua; Wu, Wei; Ping, Qing

    2017-12-01

    Essential proteins are indispensable to the survival and development process of living organisms. To understand the functional mechanisms of essential proteins, which can be applied to the analysis of disease and design of drugs, it is important to identify essential proteins from a set of proteins first. As traditional experimental methods designed to test out essential proteins are usually expensive and laborious, computational methods, which utilize biological and topological features of proteins, have attracted more attention in recent years. Protein-protein interaction networks, together with other biological data, have been explored to improve the performance of essential protein prediction. The proposed method SCP is evaluated on Saccharomyces cerevisiae datasets and compared with five other methods. The results show that our method SCP outperforms the other five methods in terms of accuracy of essential protein prediction. In this paper, we propose a novel algorithm named SCP, which combines the ranking by a modified PageRank algorithm based on subcellular compartments information, with the ranking by Pearson correlation coefficient (PCC) calculated from gene expression data. Experiments show that subcellular localization information is promising in boosting essential protein prediction.

  3. Expression Pattern Similarities Support the Prediction of Orthologs Retaining Common Functions after Gene Duplication Events1[OPEN

    Science.gov (United States)

    Haberer, Georg; Panda, Arup; Das Laha, Shayani; Ghosh, Tapas Chandra; Schäffner, Anton R.

    2016-01-01

    The identification of functionally equivalent, orthologous genes (functional orthologs) across genomes is necessary for accurate transfer of experimental knowledge from well-characterized organisms to others. This frequently relies on automated, coding sequence-based approaches such as OrthoMCL, Inparanoid, and KOG, which usually work well for one-to-one homologous states. However, this strategy does not reliably work for plants due to the occurrence of extensive gene/genome duplication. Frequently, for one query gene, multiple orthologous genes are predicted in the other genome, and it is not clear a priori from sequence comparison and similarity which one preserves the ancestral function. We have studied 11 organ-dependent and stress-induced gene expression patterns of 286 Arabidopsis lyrata duplicated gene groups and compared them with the respective Arabidopsis (Arabidopsis thaliana) genes to predict putative expressologs and nonexpressologs based on gene expression similarity. Promoter sequence divergence as an additional tool to substantiate functional orthology only partially overlapped with expressolog classification. By cloning eight A. lyrata homologs and complementing them in the respective four Arabidopsis loss-of-function mutants, we experimentally proved that predicted expressologs are indeed functional orthologs, while nonexpressologs or nonfunctionalized orthologs are not. Our study demonstrates that even a small set of gene expression data in addition to sequence homologies are instrumental in the assignment of functional orthologs in the presence of multiple orthologs. PMID:27303025

  4. Gene expression patterns in formalin-fixed, paraffin-embedded core biopsies predict docetaxel chemosensitivity in breast cancer patients.

    Science.gov (United States)

    Chang, Jenny C; Makris, Andreas; Gutierrez, M Carolina; Hilsenbeck, Susan G; Hackett, James R; Jeong, Jennie; Liu, Mei-Lan; Baker, Joffre; Clark-Langone, Kim; Baehner, Frederick L; Sexton, Krsytal; Mohsin, Syed; Gray, Tara; Alvarez, Laura; Chamness, Gary C; Osborne, C Kent; Shak, Steven

    2008-03-01

    Previously, we had identified gene expression patterns that predicted response to neoadjuvant docetaxel. Other studies have validated that a high Recurrence Score (RS) by the 21-gene RT-PCR assay is predictive of worse prognosis but better response to chemotherapy. We investigated whether tumor expression of these 21 genes and other candidate genes can predict response to docetaxel. Core biopsies from 97 patients were obtained before treatment with neoadjuvant docetaxel (4 cycles, 100 mg/m2 q3 weeks). Three 10-microm FFPE sections were submitted for quantitative RT-PCR assays of 192 genes that were selected from our previous work and the literature. Of the 97 patients, 81 (84%) had sufficient invasive cancer, 80 (82%) had sufficient RNA for QRTPCR assay, and 72 (74%) had clinical response data. Mean age was 48.5 years, and the median tumor size was 6 cm. Clinical complete responses (CR) were observed in 12 (17%), partial responses in 41 (57%), stable disease in 17 (24%), and progressive disease in 2 patients (3%). A significant relationship (P<0.05) between gene expression and CR was observed for 14 genes, including CYBA. CR was associated with lower expression of the ER gene group and higher expression of the proliferation gene group from the 21 gene assay. Of note, CR was more likely with a high RS (P=0.008). We have established molecular profiles of sensitivity to docetaxel. RT-PCR technology provides a potential platform for a predictive test of docetaxel chemosensitivity using small amounts of routinely processed material.

  5. A statistical method for predicting splice variants between two groups of samples using GeneChip® expression array data

    Directory of Open Access Journals (Sweden)

    Olson James M

    2006-04-01

    Full Text Available Abstract Background Alternative splicing of pre-messenger RNA results in RNA variants with combinations of selected exons. It is one of the essential biological functions and regulatory components in higher eukaryotic cells. Some of these variants are detectable with the Affymetrix GeneChip® that uses multiple oligonucleotide probes (i.e. probe set, since the target sequences for the multiple probes are adjacent within each gene. Hybridization intensity from a probe correlates with abundance of the corresponding transcript. Although the multiple-probe feature in the current GeneChip® was designed to assess expression values of individual genes, it also measures transcriptional abundance for a sub-region of a gene sequence. This additional capacity motivated us to develop a method to predict alternative splicing, taking advance of extensive repositories of GeneChip® gene expression array data. Results We developed a two-step approach to predict alternative splicing from GeneChip® data. First, we clustered the probes from a probe set into pseudo-exons based on similarity of probe intensities and physical adjacency. A pseudo-exon is defined as a sequence in the gene within which multiple probes have comparable probe intensity values. Second, for each pseudo-exon, we assessed the statistical significance of the difference in probe intensity between two groups of samples. Differentially expressed pseudo-exons are predicted to be alternatively spliced. We applied our method to empirical data generated from GeneChip® Hu6800 arrays, which include 7129 probe sets and twenty probes per probe set. The dataset consists of sixty-nine medulloblastoma (27 metastatic and 42 non-metastatic samples and four cerebellum samples as normal controls. We predicted that 577 genes would be alternatively spliced when we compared normal cerebellum samples to medulloblastomas, and predicted that thirteen genes would be alternatively spliced when we compared metastatic

  6. Transcription factor binding site enrichment analysis predicts drivers of altered gene expression in nonalcoholic steatohepatitis

    Czech Academy of Sciences Publication Activity Database

    Lake, A.D.; Chaput, A.L.; Novák, Petr; Cherrington, N.J.; Smith, C.L.

    2016-01-01

    Roč. 122, December 15 (2016), s. 62-71 ISSN 0006-2952 Institutional support: RVO:60077344 Keywords : Transcription factor * Liver * Gene expression * Bioinformatics Subject RIV: CE - Biochemistry Impact factor: 4.581, year: 2016

  7. An Individual-Based Diploid Model Predicts Limited Conditions Under Which Stochastic Gene Expression Becomes Advantageous

    KAUST Repository

    Matsumoto, Tomotaka; Mineta, Katsuhiko; Osada, Naoki; Araki, Hitoshi

    2015-01-01

    Recent studies suggest the existence of a stochasticity in gene expression (SGE) in many organisms, and its non-negligible effect on their phenotype and fitness. To date, however, how SGE affects the key parameters of population genetics

  8. Gene expression markers in circulating tumor cells may predict bone metastasis and response to hormonal treatment in breast cancer.

    Science.gov (United States)

    Wang, Haiying; Molina, Julian; Jiang, John; Ferber, Matthew; Pruthi, Sandhya; Jatkoe, Timothy; Derecho, Carlo; Rajpurohit, Yashoda; Zheng, Jian; Wang, Yixin

    2013-11-01

    Circulating tumor cells (CTCs) have recently attracted attention due to their potential as prognostic and predictive markers for the clinical management of metastatic breast cancer patients. The isolation of CTCs from patients may enable the molecular characterization of these cells, which may help establish a minimally invasive assay for the prediction of metastasis and further optimization of treatment. Molecular markers of proven clinical value may therefore be useful in predicting disease aggressiveness and response to treatment. In our earlier study, we identified a gene signature in breast cancer that appears to be significantly associated with bone metastasis. Among the genes that constitute this signature, trefoil factor 1 (TFF1) was identified as the most differentially expressed gene associated with bone metastasis. In this study, we investigated 25 candidate gene markers in the CTCs of metastatic breast cancer patients with different metastatic sites. The panel of the 25 markers was investigated in 80 baseline samples (first blood draw of CTCs) and 30 follow-up samples. In addition, 40 healthy blood donors (HBDs) were analyzed as controls. The assay was performed using quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) with RNA extracted from CTCs captured by the CellSearch system. Our study indicated that 12 of the genes were uniquely expressed in CTCs and 10 were highly expressed in the CTCs obtained from patients compared to those obtained from HBDs. Among these genes, the expression of keratin 19 was highly correlated with the CTC count. The TFF1 expression in CTCs was a strong predictor of bone metastasis and the patients with a high expression of estrogen receptor β in CTCs exhibited a better response to hormonal treatment. Molecular characterization of these genes in CTCs may provide a better understanding of the mechanism underlying tumor metastasis and identify gene markers in CTCs for predicting disease progression and

  9. Predicting survival in patients with metastatic kidney cancer by gene-expression profiling in the primary tumor.

    Science.gov (United States)

    Vasselli, James R; Shih, Joanna H; Iyengar, Shuba R; Maranchie, Jodi; Riss, Joseph; Worrell, Robert; Torres-Cabala, Carlos; Tabios, Ray; Mariotti, Andra; Stearman, Robert; Merino, Maria; Walther, McClellan M; Simon, Richard; Klausner, Richard D; Linehan, W Marston

    2003-06-10

    To identify potential molecular determinants of tumor biology and possible clinical outcomes, global gene-expression patterns were analyzed in the primary tumors of patients with metastatic renal cell cancer by using cDNA microarrays. We used grossly dissected tumor masses that included tumor, blood vessels, connective tissue, and infiltrating immune cells to obtain a gene-expression "profile" from each primary tumor. Two patterns of gene expression were found within this uniformly staged patient population, which correlated with a significant difference in overall survival between the two patient groups. Subsets of genes most significantly associated with survival were defined, and vascular cell adhesion molecule-1 (VCAM-1) was the gene most predictive for survival. Therefore, despite the complex biological nature of metastatic cancer, basic clinical behavior as defined by survival may be determined by the gene-expression patterns expressed within the compilation of primary gross tumor cells. We conclude that survival in patients with metastatic renal cell cancer can be correlated with the expression of various genes based solely on the expression profile in the primary kidney tumor.

  10. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  11. Hierarchy in gene expression is predictive of risk, progression, and outcome in adult acute myeloid leukemia

    Science.gov (United States)

    Tripathi, Shubham; Deem, Michael W.

    2015-02-01

    Cancer progresses with a change in the structure of the gene network in normal cells. We define a measure of organizational hierarchy in gene networks of affected cells in adult acute myeloid leukemia (AML) patients. With a retrospective cohort analysis based on the gene expression profiles of 116 AML patients, we find that the likelihood of future cancer relapse and the level of clinical risk are directly correlated with the level of organization in the cancer related gene network. We also explore the variation of the level of organization in the gene network with cancer progression. We find that this variation is non-monotonic, which implies the fitness landscape in the evolution of AML cancer cells is non-trivial. We further find that the hierarchy in gene expression at the time of diagnosis may be a useful biomarker in AML prognosis.

  12. Hierarchy in gene expression is predictive of risk, progression, and outcome in adult acute myeloid leukemia

    International Nuclear Information System (INIS)

    Tripathi, Shubham; Deem, Michael W

    2015-01-01

    Cancer progresses with a change in the structure of the gene network in normal cells. We define a measure of organizational hierarchy in gene networks of affected cells in adult acute myeloid leukemia (AML) patients. With a retrospective cohort analysis based on the gene expression profiles of 116 AML patients, we find that the likelihood of future cancer relapse and the level of clinical risk are directly correlated with the level of organization in the cancer related gene network. We also explore the variation of the level of organization in the gene network with cancer progression. We find that this variation is non-monotonic, which implies the fitness landscape in the evolution of AML cancer cells is non-trivial. We further find that the hierarchy in gene expression at the time of diagnosis may be a useful biomarker in AML prognosis. (paper)

  13. In Silico Analysis of Microarray-Based Gene Expression Profiles Predicts Tumor Cell Response to Withanolides

    Directory of Open Access Journals (Sweden)

    Thomas Efferth

    2012-05-01

    Full Text Available Withania somnifera (L. Dunal (Indian ginseng, winter cherry, Solanaceae is widely used in traditional medicine. Roots are either chewed or used to prepare beverages (aqueous decocts. The major secondary metabolites of Withania somnifera are the withanolides, which are C-28-steroidal lactone triterpenoids. Withania somnifera extracts exert chemopreventive and anticancer activities in vitro and in vivo. The aims of the present in silico study were, firstly, to investigate whether tumor cells develop cross-resistance between standard anticancer drugs and withanolides and, secondly, to elucidate the molecular determinants of sensitivity and resistance of tumor cells towards withanolides. Using IC50 concentrations of eight different withanolides (withaferin A, withaferin A diacetate, 3-azerininylwithaferin A, withafastuosin D diacetate, 4-B-hydroxy-withanolide E, isowithanololide E, withafastuosin E, and withaperuvin and 19 established anticancer drugs, we analyzed the cross-resistance profile of 60 tumor cell lines. The cell lines revealed cross-resistance between the eight withanolides. Consistent cross-resistance between withanolides and nitrosoureas (carmustin, lomustin, and semimustin was also observed. Then, we performed transcriptomic microarray-based COMPARE and hierarchical cluster analyses of mRNA expression to identify mRNA expression profiles predicting sensitivity or resistance towards withanolides. Genes from diverse functional groups were significantly associated with response of tumor cells to withaferin A diacetate, e.g. genes functioning in DNA damage and repair, stress response, cell growth regulation, extracellular matrix components, cell adhesion and cell migration, constituents of the ribosome, cytoskeletal organization and regulation, signal transduction, transcription factors, and others.

  14. FocusHeuristics - expression-data-driven network optimization and disease gene prediction.

    Science.gov (United States)

    Ernst, Mathias; Du, Yang; Warsow, Gregor; Hamed, Mohamed; Endlich, Nicole; Endlich, Karlhans; Murua Escobar, Hugo; Sklarz, Lisa-Madeleine; Sender, Sina; Junghanß, Christian; Möller, Steffen; Fuellen, Georg; Struckmann, Stephan

    2017-02-16

    To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes.

  15. High-Throughput Gene Expression Profiles to Define Drug Similarity and Predict Compound Activity.

    Science.gov (United States)

    De Wolf, Hans; Cougnaud, Laure; Van Hoorde, Kirsten; De Bondt, An; Wegner, Joerg K; Ceulemans, Hugo; Göhlmann, Hinrich

    2018-04-01

    By adding biological information, beyond the chemical properties and desired effect of a compound, uncharted compound areas and connections can be explored. In this study, we add transcriptional information for 31K compounds of Janssen's primary screening deck, using the HT L1000 platform and assess (a) the transcriptional connection score for generating compound similarities, (b) machine learning algorithms for generating target activity predictions, and (c) the scaffold hopping potential of the resulting hits. We demonstrate that the transcriptional connection score is best computed from the significant genes only and should be interpreted within its confidence interval for which we provide the stats. These guidelines help to reduce noise, increase reproducibility, and enable the separation of specific and promiscuous compounds. The added value of machine learning is demonstrated for the NR3C1 and HSP90 targets. Support Vector Machine models yielded balanced accuracy values ≥80% when the expression values from DDIT4 & SERPINE1 and TMEM97 & SPR were used to predict the NR3C1 and HSP90 activity, respectively. Combining both models resulted in 22 new and confirmed HSP90-independent NR3C1 inhibitors, providing two scaffolds (i.e., pyrimidine and pyrazolo-pyrimidine), which could potentially be of interest in the treatment of depression (i.e., inhibiting the glucocorticoid receptor (i.e., NR3C1), while leaving its chaperone, HSP90, unaffected). As such, the initial hit rate increased by a factor 300, as less, but more specific chemistry could be screened, based on the upfront computed activity predictions.

  16. No specific gene expression signature in human granulosa and cumulus cells for prediction of oocyte fertilisation and embryo implantation.

    Directory of Open Access Journals (Sweden)

    Tanja Burnik Papler

    Full Text Available In human IVF procedures objective and reliable biomarkers of oocyte and embryo quality are needed in order to increase the use of single embryo transfer (SET and thus prevent multiple pregnancies. During folliculogenesis there is an intense bi-directional communication between oocyte and follicular cells. For this reason gene expression profile of follicular cells could be an important indicator and biomarker of oocyte and embryo quality. The objective of this study was to identify gene expression signature(s in human granulosa (GC and cumulus (CC cells predictive of successful embryo implantation and oocyte fertilization. Forty-one patients were included in the study and individual GC and CC samples were collected; oocytes were cultivated separately, allowing a correlation with IVF outcome and elective SET was performed. Gene expression analysis was performed using microarrays, followed by a quantitative real-time PCR validation. After statistical analysis of microarray data, there were no significantly differentially expressed genes (FDR<0,05 between non-fertilized and fertilized oocytes and non-implanted and implanted embryos in either of the cell type. Furthermore, the results of quantitative real-time PCR were in consent with microarray data as there were no significant differences in gene expression of genes selected for validation. In conclusion, we did not find biomarkers for prediction of oocyte fertilization and embryo implantation in IVF procedures in the present study.

  17. Prediction of the contact sensitizing potential of chemicals using analysis of gene expression changes in human THP-1 monocytes.

    Science.gov (United States)

    Arkusz, Joanna; Stępnik, Maciej; Sobala, Wojciech; Dastych, Jarosław

    2010-11-10

    The aim of this study was to find differentially regulated genes in THP-1 monocytic cells exposed to sensitizers and nonsensitizers and to investigate if such genes could be reliable markers for an in vitro predictive method for the identification of skin sensitizing chemicals. Changes in expression of 35 genes in the THP-1 cell line following treatment with chemicals of different sensitizing potential (from nonsensitizers to extreme sensitizers) were assessed using real-time PCR. Verification of 13 candidate genes by testing a large number of chemicals (an additional 22 sensitizers and 8 nonsensitizers) revealed that prediction of contact sensitization potential was possible based on evaluation of changes in three genes: IL8, HMOX1 and PAIMP1. In total, changes in expression of these genes allowed correct detection of sensitization potential of 21 out of 27 (78%) test sensitizers. The gene expression levels inside potency groups varied and did not allow estimation of sensitization potency of test chemicals. Results of this study indicate that evaluation of changes in expression of proposed biomarkers in THP-1 cells could be a valuable model for preliminary screening of chemicals to discriminate an appreciable majority of sensitizers from nonsensitizers. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  18. Identifying Growth Conditions for Nicotiana benthimiana Resulting in Predictable Gene Expression of Promoter-Gus Fusion

    Science.gov (United States)

    Sandoval, V.; Barton, K.; Longhurst, A.

    2012-12-01

    Revoluta (Rev) is a transcription factor that establishes leaf polarity inArabidopsis thaliana. Through previous work in Dr. Barton's Lab, it is known that Revoluta binds to the ZPR3 promoter, thus activating the ZPR3 gene product inArabidopsis thaliana. Using this knowledge, two separate DNA constructs were made, one carrying revgene and in the other, the ZPR3 promoter fussed with the GUS gene. When inoculated in Nicotiana benthimiana (tobacco), the pMDC32 plasmid produces the Rev protein. Rev binds to the ZPR3 promoter thereby activating the transcription of the GUS gene, which can only be expressed in the presence of Rev. When GUS protein comes in contact with X-Gluc it produce the blue stain seen (See Figure 1). In the past, variability has been seen of GUS expression on tobacco therefore we hypothesized that changing the growing conditions and leaf age might improve how well it's expressed.

  19. Prediction of the prognosis of breast cancer in routine histologic specimens using a simplified, low-cost gene expression signature

    DEFF Research Database (Denmark)

    Marcell, S.A.; Balazs, A.; Emese, A.

    2013-01-01

    Prediction of the prognosis of breast cancer in routine histologic specimens using a simplified, low-cost gene expression signature Background: Grade 2 breast carcinomas do not form a uniform prognostic group. Aim: To extend the number of patients and the investigated genes of a previously...... grade 2 breast carcinomas into prognostic groups. Gene expression was investigated by polymerase chain reaction in 249 formalin-fixed, paraffin-embedded breast tumors. The results were correlated with relapse-free survival. Results: Histologically grade 2 carcinomas were split into good and a poor...... identified prognostic signature described by the authors that reflect chromosomal instability in order to refine characterization of grade 2 breast cancers and identify driver genes. Methods: Using publicly available databases, the authors selected 9 target and 3 housekeeping genes that are capable to divide...

  20. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

    Science.gov (United States)

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.

  1. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence

    OpenAIRE

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    Background: There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. Methods: All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinform...

  2. Gene expression signature in organized and growth arrested mammaryacini predicts good outcome in breast cancer

    Energy Technology Data Exchange (ETDEWEB)

    Fournier, Marcia V.; Martin, Katherine J.; Kenny, Paraic A.; Xhaja, Kris; Bosch, Irene; Yaswen, Paul; Bissell, Mina J.

    2006-02-08

    To understand how non-malignant human mammary epithelial cells (HMEC) transit from a disorganized proliferating to an organized growth arrested state, and to relate this process to the changes that occur in breast cancer, we studied gene expression changes in non-malignant HMEC grown in three-dimensional cultures, and in a previously published panel of microarray data for 295 breast cancer samples. We hypothesized that the gene expression pattern of organized and growth arrested mammary acini would share similarities with breast tumors with good prognoses. Using Affymetrix HG-U133A microarrays, we analyzed the expression of 22,283 gene transcripts in two HMEC cell lines, 184 (finite life span) and HMT3522 S1 (immortal non-malignant), on successive days post-seeding in a laminin-rich extracellular matrix assay. Both HMECs underwent growth arrest in G0/G1 and differentiated into polarized acini between days 5 and 7. We identified gene expression changes with the same temporal pattern in both lines. We show that genes that are significantly lower in the organized, growth arrested HMEC than in their proliferating counterparts can be used to classify breast cancer patients into poor and good prognosis groups with high accuracy. This study represents a novel unsupervised approach to identifying breast cancer markers that may be of use clinically.

  3. Prediction of graft-versus-host disease in humans by donor gene-expression profiling.

    Directory of Open Access Journals (Sweden)

    Chantal Baron

    2007-01-01

    Full Text Available BACKGROUND: Graft-versus-host disease (GVHD results from recognition of host antigens by donor T cells following allogeneic hematopoietic cell transplantation (AHCT. Notably, histoincompatibility between donor and recipient is necessary but not sufficient to elicit GVHD. Therefore, we tested the hypothesis that some donors may be "stronger alloresponders" than others, and consequently more likely to elicit GVHD. METHODS AND FINDINGS: To this end, we measured the gene-expression profiles of CD4(+ and CD8(+ T cells from 50 AHCT donors with microarrays. We report that pre-AHCT gene-expression profiling segregates donors whose recipient suffered from GVHD or not. Using quantitative PCR, established statistical tests, and analysis of multiple independent training-test datasets, we found that for chronic GVHD the "dangerous donor" trait (occurrence of GVHD in the recipient is under polygenic control and is shaped by the activity of genes that regulate transforming growth factor-beta signaling and cell proliferation. CONCLUSIONS: These findings strongly suggest that the donor gene-expression profile has a dominant influence on the occurrence of GVHD in the recipient. The ability to discriminate strong and weak alloresponders using gene-expression profiling could pave the way to personalized transplantation medicine.

  4. Cancer-Predicting Gene Expression Changes in Colonic Mucosa of Western Diet Fed Mlh1 +/- Mice

    Science.gov (United States)

    Dermadi Bebek, Denis; Valo, Satu; Reyhani, Nima; Ollila, Saara; Päivärinta, Essi; Peltomäki, Päivi; Mutanen, Marja; Nyström, Minna

    2013-01-01

    Colorectal cancer (CRC) is the second most common cause of cancer-related deaths in the Western world and interactions between genetic and environmental factors, including diet, are suggested to play a critical role in its etiology. We conducted a long-term feeding experiment in the mouse to address gene expression and methylation changes arising in histologically normal colonic mucosa as putative cancer-predisposing events available for early detection. The expression of 94 growth-regulatory genes previously linked to human CRC was studied at two time points (5 weeks and 12 months of age) in the heterozygote Mlh1 +/- mice, an animal model for human Lynch syndrome (LS), and wild type Mlh1 +/+ littermates, fed by either Western-style (WD) or AIN-93G control diet. In mice fed with WD, proximal colon mucosa, the predominant site of cancer formation in LS, exhibited a significant expression decrease in tumor suppressor genes, Dkk1, Hoxd1, Slc5a8, and Socs1, the latter two only in the Mlh1 +/- mice. Reduced mRNA expression was accompanied by increased promoter methylation of the respective genes. The strongest expression decrease (7.3 fold) together with a significant increase in its promoter methylation was seen in Dkk1, an antagonist of the canonical Wnt signaling pathway. Furthermore, the inactivation of Dkk1 seems to predispose to neoplasias in the proximal colon. This and the fact that Mlh1 which showed only modest methylation was still expressed in both Mlh1 +/- and Mlh1 +/+ mice indicate that the expression decreases and the inactivation of Dkk1 in particular is a prominent early marker for colon oncogenesis. PMID:24204690

  5. Cancer-predicting gene expression changes in colonic mucosa of Western diet fed Mlh1+/- mice.

    Directory of Open Access Journals (Sweden)

    Marjaana Pussila

    Full Text Available Colorectal cancer (CRC is the second most common cause of cancer-related deaths in the Western world and interactions between genetic and environmental factors, including diet, are suggested to play a critical role in its etiology. We conducted a long-term feeding experiment in the mouse to address gene expression and methylation changes arising in histologically normal colonic mucosa as putative cancer-predisposing events available for early detection. The expression of 94 growth-regulatory genes previously linked to human CRC was studied at two time points (5 weeks and 12 months of age in the heterozygote Mlh1(+/- mice, an animal model for human Lynch syndrome (LS, and wild type Mlh1(+/+ littermates, fed by either Western-style (WD or AIN-93G control diet. In mice fed with WD, proximal colon mucosa, the predominant site of cancer formation in LS, exhibited a significant expression decrease in tumor suppressor genes, Dkk1, Hoxd1, Slc5a8, and Socs1, the latter two only in the Mlh1(+/- mice. Reduced mRNA expression was accompanied by increased promoter methylation of the respective genes. The strongest expression decrease (7.3 fold together with a significant increase in its promoter methylation was seen in Dkk1, an antagonist of the canonical Wnt signaling pathway. Furthermore, the inactivation of Dkk1 seems to predispose to neoplasias in the proximal colon. This and the fact that Mlh1 which showed only modest methylation was still expressed in both Mlh1(+/- and Mlh1(+/+ mice indicate that the expression decreases and the inactivation of Dkk1 in particular is a prominent early marker for colon oncogenesis.

  6. Gene Expression Differences Predict Treatment Outcome of Merkel Cell Carcinoma Patients

    Directory of Open Access Journals (Sweden)

    Loren Masterson

    2014-01-01

    Full Text Available Due to the rarity of Merkel cell carcinoma (MCC, prospective clinical trials have not been practical. This study aimed to identify biomarkers with prognostic significance. While sixty-two patients were identified who were treated for MCC at our institution, only seventeen patients had adequate formalin-fixed paraffin-embedded archival tissue and followup to be included in the study. Patients were stratified into good, moderate, or poor prognosis. Laser capture microdissection was used to isolate tumor cells for subsequent RNA isolation and gene expression analysis with Affymetrix GeneChip Human Exon 1.0 ST arrays. Among the 191 genes demonstrating significant differential expression between prognostic groups, keratin 20 and neurofilament protein have previously been identified in studies of MCC and were significantly upregulated in tumors from patients with a poor prognosis. Immunohistochemistry further established that keratin 20 was overexpressed in the poor prognosis tumors. In addition, novel genes of interest such as phospholipase A2 group X, kinesin family member 3A, tumor protein D52, mucin 1, and KIT were upregulated in specimens from patients with poor prognosis. Our pilot study identified several gene expression differences which could be used in the future as prognostic biomarkers in MCC patients.

  7. Gene expression analysis predicts insect venom anaphylaxis in indolent systemic mastocytosis

    NARCIS (Netherlands)

    Niedoszytko, M.; Bruinenberg, M.; van Doormaal, J. J.; de Monchy, J. G. R.; Nedoszytko, B.; Koppelman, G. H.; Nawijn, M. C.; Wijmenga, C.; Jassem, E.; Oude Elberink, J. N. G.

    P>Background: Anaphylaxis to insect venom (Hymenoptera) is most severe in patients with mastocytosis and may even lead to death. However, not all patients with mastocytosis suffer from anaphylaxis. The aim of the study was to analyze differences in gene expression between patients with indolent

  8. Conservation of transcription factor binding events predicts gene expression across species

    Science.gov (United States)

    Hemberg, Martin; Kreiman, Gabriel

    2011-01-01

    Recent technological advances have made it possible to determine the genome-wide binding sites of transcription factors (TFs). Comparisons across species have suggested a relatively low degree of evolutionary conservation of experimentally defined TF binding events (TFBEs). Using binding data for six different TFs in hepatocytes and embryonic stem cells from human and mouse, we demonstrate that evolutionary conservation of TFBEs within orthologous proximal promoters is closely linked to function, defined as expression of the target genes. We show that (i) there is a significantly higher degree of conservation of TFBEs when the target gene is expressed in both species; (ii) there is increased conservation of binding events for groups of TFs compared to individual TFs; and (iii) conserved TFBEs have a greater impact on the expression of their target genes than non-conserved ones. These results link conservation of structural elements (TFBEs) to conservation of function (gene expression) and suggest a higher degree of functional conservation than implied by previous studies. PMID:21622661

  9. Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

    International Nuclear Information System (INIS)

    Shibayama, Masaki; Maak, Matthias; Nitsche, Ulrich; Gotoh, Kengo; Rosenberg, Robert; Janssen, Klaus-Peter

    2011-01-01

    Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer

  10. Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

    Energy Technology Data Exchange (ETDEWEB)

    Shibayama, Masaki [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Maak, Matthias; Nitsche, Ulrich [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany); Gotoh, Kengo [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Rosenberg, Robert; Janssen, Klaus-Peter, E-mail: klaus-peter.janssen@lrz.tum.de [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany)

    2011-07-07

    Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer.

  11. Tumour gene expression predicts response to cetuximab in patients with KRAS wild-type metastatic colorectal cancer.

    Science.gov (United States)

    Baker, J B; Dutta, D; Watson, D; Maddala, T; Munneke, B M; Shak, S; Rowinsky, E K; Xu, L-A; Harbison, C T; Clark, E A; Mauro, D J; Khambata-Ford, S

    2011-02-01

    Although it is accepted that metastatic colorectal cancers (mCRCs) that carry activating mutations in KRAS are unresponsive to anti-epidermal growth factor receptor (EGFR) monoclonal antibodies, a significant fraction of KRAS wild-type (wt) mCRCs are also unresponsive to anti-EGFR therapy. Genes encoding EGFR ligands amphiregulin (AREG) and epiregulin (EREG) are promising gene expression-based markers but have not been incorporated into a test to dichotomise KRAS wt mCRC patients with respect to sensitivity to anti-EGFR treatment. We used RT-PCR to test 110 candidate gene expression markers in primary tumours from 144 KRAS wt mCRC patients who received monotherapy with the anti-EGFR antibody cetuximab. Results were correlated with multiple clinical endpoints: disease control, objective response, and progression-free survival (PFS). Expression of many of the tested candidate genes, including EREG and AREG, strongly associate with all clinical endpoints. Using multivariate analysis with two-layer five-fold cross-validation, we constructed a four-gene predictive classifier. Strikingly, patients below the classifier cutpoint had PFS and disease control rates similar to those of patients with KRAS mutant mCRC. Gene expression appears to identify KRAS wt mCRC patients who receive little benefit from cetuximab. It will be important to test this model in an independent validation study.

  12. CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer.

    Science.gov (United States)

    Nguyen, Minh Nam; Choi, Tae Gyu; Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo

    2015-10-13

    Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies.

  13. Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: Prediction and validation

    Directory of Open Access Journals (Sweden)

    Lahiri Ansuman

    2011-09-01

    Full Text Available Abstract Background HIP1 Protein Interactor (HIPPI is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS, present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. Results We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Conclusions Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a

  14. The accuracy of survival time prediction for patients with glioma is improved by measuring mitotic spindle checkpoint gene expression.

    Directory of Open Access Journals (Sweden)

    Li Bie

    Full Text Available Identification of gene expression changes that improve prediction of survival time across all glioma grades would be clinically useful. Four Affymetrix GeneChip datasets from the literature, containing data from 771 glioma samples representing all WHO grades and eight normal brain samples, were used in an ANOVA model to screen for transcript changes that correlated with grade. Observations were confirmed and extended using qPCR assays on RNA derived from 38 additional glioma samples and eight normal samples for which survival data were available. RNA levels of eight major mitotic spindle assembly checkpoint (SAC genes (BUB1, BUB1B, BUB3, CENPE, MAD1L1, MAD2L1, CDC20, TTK significantly correlated with glioma grade and six also significantly correlated with survival time. In particular, the level of BUB1B expression was highly correlated with survival time (p<0.0001, and significantly outperformed all other measured parameters, including two standards; WHO grade and MIB-1 (Ki-67 labeling index. Measurement of the expression levels of a small set of SAC genes may complement histological grade and other clinical parameters for predicting survival time.

  15. Gene expression signature of normal cell-of-origin predicts ovarian tumor outcomes.

    Directory of Open Access Journals (Sweden)

    Melissa A Merritt

    Full Text Available The potential role of the cell-of-origin in determining the tumor phenotype has been raised, but not adequately examined. We hypothesized that distinct cells-of-origin may play a role in determining ovarian tumor phenotype and outcome. Here we describe a new cell culture medium for in vitro culture of paired normal human ovarian (OV and fallopian tube (FT epithelial cells from donors without cancer. While these cells have been cultured individually for short periods of time, to our knowledge this is the first long-term culture of both cell types from the same donors. Through analysis of the gene expression profiles of the cultured OV/FT cells we identified a normal cell-of-origin gene signature that classified primary ovarian cancers into OV-like and FT-like subgroups; this classification correlated with significant differences in clinical outcomes. The identification of a prognostically significant gene expression signature derived solely from normal untransformed cells is consistent with the hypothesis that the normal cell-of-origin may be a source of ovarian tumor heterogeneity and the associated differences in tumor outcome.

  16. Predicting acute cardiac rejection from donor heart and pre-transplant recipient blood gene expression.

    Science.gov (United States)

    Hollander, Zsuzsanna; Chen, Virginia; Sidhu, Keerat; Lin, David; Ng, Raymond T; Balshaw, Robert; Cohen-Freue, Gabriela V; Ignaszewski, Andrew; Imai, Carol; Kaan, Annemarie; Tebbutt, Scott J; Wilson-McManus, Janet E; McMaster, Robert W; Keown, Paul A; McManus, Bruce M

    2013-02-01

    Acute rejection in cardiac transplant patients remains a contributory factor to limited survival of implanted hearts. Currently, there are no biomarkers in clinical use that can predict, at the time of transplantation, the likelihood of post-transplant acute cellular rejection. Such a development would be of great value in personalizing immunosuppressive treatment. Recipient age, donor age, cold ischemic time, warm ischemic time, panel-reactive antibody, gender mismatch, blood type mismatch and human leukocyte antigens (HLA-A, -B and -DR) mismatch between recipients and donors were tested in 53 heart transplant patients for their power to predict post-transplant acute cellular rejection. Donor transplant biopsy and recipient pre-transplant blood were also examined for the presence of genomic biomarkers in 7 rejection and 11 non-rejection patients, using non-targeted data mining techniques. The biomarker based on the 8 clinical variables had an area under the receiver operating characteristic curve (AUC) of 0.53. The pre-transplant recipient blood gene-based panel did not yield better performance, but the donor heart tissue gene-based panel had an AUC = 0.78. A combination of 25 probe sets from the transplant donor biopsy and 18 probe sets from the pre-transplant recipient whole blood had an AUC = 0.90. Biologic pathways implicated include VEGF- and EGFR-signaling, and MAPK. Based on this study, the best predictive biomarker panel contains genes from recipient whole blood and donor myocardial tissue. This panel provides clinically relevant prediction power and, if validated, may personalize immunosuppressive treatment and rejection monitoring. Copyright © 2013 International Society for Heart and Lung Transplantation. Published by Elsevier Inc. All rights reserved.

  17. A 7 gene expression score predicts for radiation response in cancer cervix

    International Nuclear Information System (INIS)

    Rajkumar, Thangarajan; Vijayalakshmi, Neelakantan; Sabitha, Kesavan; Shirley, Sundersingh; Selvaluxmy, Ganesharaja; Bose, Mayil Vahanan; Nambaru, Lavanya

    2009-01-01

    Cervical cancer is the most common cancer among Indian women. The current recommendations are to treat the stage IIB, IIIA, IIIB and IVA with radical radiotherapy and weekly cisplatin based chemotherapy. However, Radiotherapy alone can help cure more than 60% of stage IIB and up to 40% of stage IIIB patients. Archival RNA samples from 15 patients who had achieved complete remission and stayed disease free for more than 36 months (No Evidence of Disease or NED group) and 10 patients who had failed radical radiotherapy (Failed group) were included in the study. The RNA were amplified, labelled and hybridized to Stanford microarray chips and analyzed using BRB Array Tools software and Significance Analysis of Microarray (SAM) analysis. 20 genes were selected for further validation using Relative Quantitation (RQ) Taqman assay in a Taqman Low-Density Array (TLDA) format. The RQ value was calculated, using each of the NED sample once as a calibrator. A scoring system was developed based on the RQ value for the genes. Using a seven gene based scoring system, it was possible to distinguish between the tumours which were likely to respond to the radiotherapy and those likely to fail. The mean score ± 2 SE (standard error of mean) was used and at a cut-off score of greater than 5.60, the sensitivity, specificity, Positive predictive value (PPV) and Negative predictive value (NPV) were 0.64, 1.0, 1.0, 0.67, respectively, for the low risk group. We have identified a 7 gene signature which could help identify patients with cervical cancer who can be treated with radiotherapy alone. However, this needs to be validated in a larger patient population

  18. An Individual-Based Diploid Model Predicts Limited Conditions Under Which Stochastic Gene Expression Becomes Advantageous

    KAUST Repository

    Matsumoto, Tomotaka

    2015-11-24

    Recent studies suggest the existence of a stochasticity in gene expression (SGE) in many organisms, and its non-negligible effect on their phenotype and fitness. To date, however, how SGE affects the key parameters of population genetics are not well understood. SGE can increase the phenotypic variation and act as a load for individuals, if they are at the adaptive optimum in a stable environment. On the other hand, part of the phenotypic variation caused by SGE might become advantageous if individuals at the adaptive optimum become genetically less-adaptive, for example due to an environmental change. Furthermore, SGE of unimportant genes might have little or no fitness consequences. Thus, SGE can be advantageous, disadvantageous, or selectively neutral depending on its context. In addition, there might be a genetic basis that regulates magnitude of SGE, which is often referred to as “modifier genes,” but little is known about the conditions under which such an SGE-modifier gene evolves. In the present study, we conducted individual-based computer simulations to examine these conditions in a diploid model. In the simulations, we considered a single locus that determines organismal fitness for simplicity, and that SGE on the locus creates fitness variation in a stochastic manner. We also considered another locus that modifies the magnitude of SGE. Our results suggested that SGE was always deleterious in stable environments and increased the fixation probability of deleterious mutations in this model. Even under frequently changing environmental conditions, only very strong natural selection made SGE adaptive. These results suggest that the evolution of SGE-modifier genes requires strict balance among the strength of natural selection, magnitude of SGE, and frequency of environmental changes. However, the degree of dominance affected the condition under which SGE becomes advantageous, indicating a better opportunity for the evolution of SGE in different genetic

  19. Relative codon adaptation: a generic codon bias index for prediction of gene expression.

    Science.gov (United States)

    Fox, Jesse M; Erill, Ivan

    2010-06-01

    The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.

  20. EMX2 gene expression predicts liver metastasis and survival in colorectal cancer.

    Science.gov (United States)

    Aykut, Berk; Ochs, Markus; Radhakrishnan, Praveen; Brill, Adrian; Höcker, Hermine; Schwarz, Sandra; Weissinger, Daniel; Kehm, Roland; Kulu, Yakup; Ulrich, Alexis; Schneider, Martin

    2017-08-22

    The Empty Spiracles Homeobox (EMX-) 2 gene has been associated with regulation of growth and differentiation in neuronal development. While recent studies provide evidence that EMX2 regulates tumorigenesis of various solid tumors, its role in colorectal cancer remains unknown. We aimed to assess the prognostic significance of EMX2 expression in stage III colorectal adenocarcinoma. Expression levels of EMX2 in human colorectal cancer and adjacent mucosa were assessed by qRT-PCR technology, and results were correlated with clinical and survival data. siRNA-mediated knockdown and adenoviral delivery-mediated overexpression of EMX2 were performed in order to investigate its effects on the migration of colorectal cancer cells in vitro. Compared to corresponding healthy mucosa, colorectal tumor samples had decreased EMX2 expression levels. Furthermore, EMX2 down-regulation in colorectal cancer tissue was associated with distant metastasis (M1) and impaired overall patient survival. In vitro knockdown of EMX2 resulted in increased tumor cell migration. Conversely, overexpression of EMX2 led to an inhibition of tumor cell migration. EMX2 is frequently down-regulated in human colorectal cancer, and down-regulation of EMX2 is a prognostic marker for disease-free and overall survival. EMX2 might thus represent a promising therapeutic target in colorectal cancer.

  1. Predicting Recurrence and Progression of Noninvasive Papillary Bladder Cancer at Initial Presentation Based on Quantitative Gene Expression Profiles

    DEFF Research Database (Denmark)

    Birkhahn, M.; Mitra, A.P.; Williams, Johan

    2010-01-01

    % specificity. Since this is a small retrospective study using medium-throughput profiling, larger confirmatory studies are needed. Conclusions: Gene expression profiling across relevant cancer pathways appears to be a promising approach for Ta bladder tumor outcome prediction at initial diagnosis......Background: Currently, tumor grade is the best predictor of outcome at first presentation of noninvasive papillary (Ta) bladder cancer. However, reliable predictors of Ta tumor recurrence and progression for individual patients, which could optimize treatment and follow-up schedules based...... on specific tumor biology, are yet to be identified. Objective: To identify genes predictive for recurrence and progression in Ta bladder cancer at first presentation using a quantitative, pathway-specific approach. Design, setting, and participants: Retrospective study of patients with Ta G2/3 bladder tumors...

  2. Multiclass Prediction with Partial Least Square Regression for Gene Expression Data: Applications in Breast Cancer Intrinsic Taxonomy

    Directory of Open Access Journals (Sweden)

    Chi-Cheng Huang

    2013-01-01

    Full Text Available Multiclass prediction remains an obstacle for high-throughput data analysis such as microarray gene expression profiles. Despite recent advancements in machine learning and bioinformatics, most classification tools were limited to the applications of binary responses. Our aim was to apply partial least square (PLS regression for breast cancer intrinsic taxonomy, of which five distinct molecular subtypes were identified. The PAM50 signature genes were used as predictive variables in PLS analysis, and the latent gene component scores were used in binary logistic regression for each molecular subtype. The 139 prototypical arrays for PAM50 development were used as training dataset, and three independent microarray studies with Han Chinese origin were used for independent validation (n=535. The agreement between PAM50 centroid-based single sample prediction (SSP and PLS-regression was excellent (weighted Kappa: 0.988 within the training samples, but deteriorated substantially in independent samples, which could attribute to much more unclassified samples by PLS-regression. If these unclassified samples were removed, the agreement between PAM50 SSP and PLS-regression improved enormously (weighted Kappa: 0.829 as opposed to 0.541 when unclassified samples were analyzed. Our study ascertained the feasibility of PLS-regression in multi-class prediction, and distinct clinical presentations and prognostic discrepancies were observed across breast cancer molecular subtypes.

  3. Predicting Recurrence and Progression of Noninvasive Papillary Bladder Cancer at Initial Presentation Based on Quantitative Gene Expression Profiles

    DEFF Research Database (Denmark)

    Birkhahn, M.; Mitra, A.P.; Williams, Johan

    2010-01-01

    Background: Currently, tumor grade is the best predictor of outcome at first presentation of noninvasive papillary (Ta) bladder cancer. However, reliable predictors of Ta tumor recurrence and progression for individual patients, which could optimize treatment and follow-up schedules based...... on specific tumor biology, are yet to be identified. Objective: To identify genes predictive for recurrence and progression in Ta bladder cancer at first presentation using a quantitative, pathway-specific approach. Design, setting, and participants: Retrospective study of patients with Ta G2/3 bladder tumors...... at initial presentation with three distinct clinical outcomes: absence of recurrence (n = 16), recurrence without progression (n = 16), and progression to carcinoma in situ or invasive disease (n = 16). Measurements: Expressions of 24 genes that feature in relevant pathways that are deregulated in bladder...

  4. Gene expression profiles in paraffin-embedded core biopsy tissue predict response to chemotherapy in women with locally advanced breast cancer.

    Science.gov (United States)

    Gianni, Luca; Zambetti, Milvia; Clark, Kim; Baker, Joffre; Cronin, Maureen; Wu, Jenny; Mariani, Gabriella; Rodriguez, Jaime; Carcangiu, Marialuisa; Watson, Drew; Valagussa, Pinuccia; Rouzier, Roman; Symmans, W Fraser; Ross, Jeffrey S; Hortobagyi, Gabriel N; Pusztai, Lajos; Shak, Steven

    2005-10-10

    We sought to identify gene expression markers that predict the likelihood of chemotherapy response. We also tested whether chemotherapy response is correlated with the 21-gene Recurrence Score assay that quantifies recurrence risk. Patients with locally advanced breast cancer received neoadjuvant paclitaxel and doxorubicin. RNA was extracted from the pretreatment formalin-fixed paraffin-embedded core biopsies. The expression of 384 genes was quantified using reverse transcriptase polymerase chain reaction and correlated with pathologic complete response (pCR). The performance of genes predicting for pCR was tested in patients from an independent neoadjuvant study where gene expression was obtained using DNA microarrays. Of 89 assessable patients (mean age, 49.9 years; mean tumor size, 6.4 cm), 11 (12%) had a pCR. Eighty-six genes correlated with pCR (unadjusted P < .05); pCR was more likely with higher expression of proliferation-related genes and immune-related genes, and with lower expression of estrogen receptor (ER) -related genes. In 82 independent patients treated with neoadjuvant paclitaxel and doxorubicin, DNA microarray data were available for 79 of the 86 genes. In univariate analysis, 24 genes correlated with pCR with P < .05 (false discovery, four genes) and 32 genes showed correlation with P < .1 (false discovery, eight genes). The Recurrence Score was positively associated with the likelihood of pCR (P = .005), suggesting that the patients who are at greatest recurrence risk are more likely to have chemotherapy benefit. Quantitative expression of ER-related genes, proliferation genes, and immune-related genes are strong predictors of pCR in women with locally advanced breast cancer receiving neoadjuvant anthracyclines and paclitaxel.

  5. Predicting multi-level drug response with gene expression profile in multiple myeloma using hierarchical ordinal regression.

    Science.gov (United States)

    Zhang, Xinyan; Li, Bingzong; Han, Huiying; Song, Sha; Xu, Hongxia; Hong, Yating; Yi, Nengjun; Zhuang, Wenzhuo

    2018-05-10

    Multiple myeloma (MM), like other cancers, is caused by the accumulation of genetic abnormalities. Heterogeneity exists in the patients' response to treatments, for example, bortezomib. This urges efforts to identify biomarkers from numerous molecular features and build predictive models for identifying patients that can benefit from a certain treatment scheme. However, previous studies treated the multi-level ordinal drug response as a binary response where only responsive and non-responsive groups are considered. It is desirable to directly analyze the multi-level drug response, rather than combining the response to two groups. In this study, we present a novel method to identify significantly associated biomarkers and then develop ordinal genomic classifier using the hierarchical ordinal logistic model. The proposed hierarchical ordinal logistic model employs the heavy-tailed Cauchy prior on the coefficients and is fitted by an efficient quasi-Newton algorithm. We apply our hierarchical ordinal regression approach to analyze two publicly available datasets for MM with five-level drug response and numerous gene expression measures. Our results show that our method is able to identify genes associated with the multi-level drug response and to generate powerful predictive models for predicting the multi-level response. The proposed method allows us to jointly fit numerous correlated predictors and thus build efficient models for predicting the multi-level drug response. The predictive model for the multi-level drug response can be more informative than the previous approaches. Thus, the proposed approach provides a powerful tool for predicting multi-level drug response and has important impact on cancer studies.

  6. Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2011-05-01

    Full Text Available Abstract Background Gene expression is regulated at both the DNA sequence level and through modification of chromatin. However, the effect of chromatin on tissue/cell-type specific gene regulation (TCSR is largely unknown. In this paper, we present a method to elucidate the relationship between histone modification/variation (HMV and TCSR. Results A classifier for differentiating CD4+ T cell-specific genes from housekeeping genes using HMV data was built. We found HMV in both promoter and gene body regions to be predictive of genes which are targets of TCSR. For example, the histone modification types H3K4me3 and H3K27ac were identified as the most predictive for CpG-related promoters, whereas H3K4me3 and H3K79me3 were the most predictive for nonCpG-related promoters. However, genes targeted by TCSR can be predicted using other type of HMVs as well. Such redundancy implies that multiple type of underlying regulatory elements, such as enhancers or intragenic alternative promoters, which can regulate gene expression in a tissue/cell-type specific fashion, may be marked by the HMVs. Finally, we show that the predictive power of HMV for TCSR is not limited to protein-coding genes in CD4+ T cells, as we successfully predicted TCSR targeted genes in muscle cells, as well as microRNA genes with expression specific to CD4+ T cells, by the same classifier which was trained on HMV data of protein-coding genes in CD4+ T cells. Conclusion We have begun to understand the HMV patterns that guide gene expression in both tissue/cell-type specific and ubiquitous manner.

  7. Gene Expression Signature TOPFOX Reflecting Chromosomal Instability Refines Prediction of Prognosis in Grade 2 Breast Cancer

    DEFF Research Database (Denmark)

    Szasz, A.; Li, Qiyuan; Sztupinszki, Z.

    2011-01-01

    Purpose: To assess the ability of genes selected from those reflecting chromosomal instability to identify good and poor prognostic subsets of Grade 2 breast carcinomas. Methods: We selected genes for splitting grade 2 tumours into low and high grade type groups by using public databases. Patient...

  8. Neighboring Genes Show Correlated Evolution in Gene Expression

    Science.gov (United States)

    Ghanbarian, Avazeh T.; Hurst, Laurence D.

    2015-01-01

    When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543

  9. Gene Expression Omnibus (GEO)

    Data.gov (United States)

    U.S. Department of Health & Human Services — Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided...

  10. A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data.

    Science.gov (United States)

    Kang, Tianyu; Ding, Wei; Zhang, Luoyan; Ziemek, Daniel; Zarringhalam, Kourosh

    2017-12-19

    Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.

  11. Gene Expression Profiles for Predicting Metastasis in Breast Cancer: A Cross-Study Comparison of Classification Methods

    Directory of Open Access Journals (Sweden)

    Mark Burton

    2012-01-01

    Full Text Available Machine learning has increasingly been used with microarray gene expression data and for the development of classifiers using a variety of methods. However, method comparisons in cross-study datasets are very scarce. This study compares the performance of seven classification methods and the effect of voting for predicting metastasis outcome in breast cancer patients, in three situations: within the same dataset or across datasets on similar or dissimilar microarray platforms. Combining classification results from seven classifiers into one voting decision performed significantly better during internal validation as well as external validation in similar microarray platforms than the underlying classification methods. When validating between different microarray platforms, random forest, another voting-based method, proved to be the best performing method. We conclude that voting based classifiers provided an advantage with respect to classifying metastasis outcome in breast cancer patients.

  12. Classification and Diagnostic Output Prediction of Cancer Using Gene Expression Profiling and Supervised Machine Learning Algorithms

    DEFF Research Database (Denmark)

    Yoo, C.; Gernaey, Krist

    2008-01-01

    importance in the projection (VIP) information of the DPLS method. The power of the gene selection method and the proposed supervised hierarchical clustering method is illustrated on a three microarray data sets of leukemia, breast, and colon cancer. Supervised machine learning algorithms thus enable...

  13. A network-based predictive gene-expression signature for adjuvant chemotherapy benefit in stage II colorectal cancer.

    Science.gov (United States)

    Cao, Bangrong; Luo, Liping; Feng, Lin; Ma, Shiqi; Chen, Tingqing; Ren, Yuan; Zha, Xiao; Cheng, Shujun; Zhang, Kaitai; Chen, Changmin

    2017-12-13

    The clinical benefit of adjuvant chemotherapy for stage II colorectal cancer (CRC) is controversial. This study aimed to explore novel gene signature to predict outcome benefit of postoperative 5-Fu-based therapy in stage II CRC. Gene-expression profiles of stage II CRCs from two datasets with 5-Fu-based adjuvant chemotherapy (training dataset, n = 212; validation dataset, n = 85) were analyzed to identify the indicator. A systemic approach by integrating gene-expression and protein-protein interaction (PPI) network was implemented to develop the predictive signature. Kaplan-Meier curves and Cox proportional hazards model were used to determine the survival benefit of adjuvant chemotherapy. Experiments with shRNA knock-down were carried out to confirm the signature identified in this study. In the training dataset, we identified 44 PPI sub-modules, by which we separate patients into two clusters (1 and 2) having different chemotherapeutic benefit. A predictor of 11 PPI sub-modules (11-PPI-Mod) was established to discriminate the two sub-groups, with an overall accuracy of 90.1%. This signature was independently validated in an external validation dataset. Kaplan-Meier curves showed an improved outcome for patients who received adjuvant chemotherapy in Cluster 1 sub-group, but even worse survival for those in Cluster 2 sub-group. Similar results were found in both the training and the validation dataset. Multivariate Cox regression revealed an interaction effect between 11-PPI-Mod signature and adjuvant therapy treatment in the training dataset (RFS, p = 0.007; OS, p = 0.006) and the validation dataset (RFS, p = 0.002). From the signature, we found that PTGES gene was up-regulated in CRC cells which were more resistant to 5-Fu. Knock-down of PTGES indicated a growth inhibition and up-regulation of apoptotic markers induced by 5-Fu in CRC cells. Only a small proportion of stage II CRC patients could benefit from adjuvant therapy. The 11-PPI-Mod as

  14. Baseline gene expression in conjunction with GSTM1 status predicts ozone exposure response

    Science.gov (United States)

    Air pollution exposure causes increased cardiopulmonary morbidity and mortality and has been linked to the deaths of 7 million people every year by the World Health Organization. Approximately 40% of the population lack expression of the antioxidant enzyme glutathione S-transfer...

  15. Gene expression analysis in predicting the effectiveness of insect venom immunotherapy

    NARCIS (Netherlands)

    Niedoszytko, M.; Bruinenberg, M.; de Monchy, J.; Wijmenga, C.; Platteel, M.; Jassem, E.; Oude Elberink, Joanna N.G.

    Background: Venom immunotherapy (VIT) enables longtime prevention of insect venom allergy in the majority of patients. However, in some, the risk of a resystemic reaction increases after completion of treatment. No reliable factors predicting individual lack of efficacy of VIT are currently

  16. Target genes prediction and functional analysis of microRNAs differentially expressed in gastric cancer stem cells MKN-45

    Directory of Open Access Journals (Sweden)

    Zohreh Salehi

    2017-01-01

    Conclusions: Bioinformatics analysis such as DAVID database, GO biological process, GO molecular function, Kyoto encyclopedia of genes and genomes pathways, BioCarta pathway, Panther pathway, and Reactome pathway revealed that target genes of differentially expressed miRNAs in gastric CSCs were connected to pivotal biological pathways that involved in cell cycle regulation, stemness properties, and differentiation.

  17. Bayesian mixture models for assessment of gene differential behaviour and prediction of pCR through the integration of copy number and gene expression data.

    Directory of Open Access Journals (Sweden)

    Filippo Trentini

    Full Text Available We consider modeling jointly microarray RNA expression and DNA copy number data. We propose Bayesian mixture models that define latent Gaussian probit scores for the DNA and RNA, and integrate between the two platforms via a regression of the RNA probit scores on the DNA probit scores. Such a regression conveniently allows us to include additional sample specific covariates such as biological conditions and clinical outcomes. The two developed methods are aimed respectively to make inference on differential behaviour of genes in patients showing different subtypes of breast cancer and to predict the pathological complete response (pCR of patients borrowing strength across the genomic platforms. Posterior inference is carried out via MCMC simulations. We demonstrate the proposed methodology using a published data set consisting of 121 breast cancer patients.

  18. Reduction in WT1 gene expression during early treatment predicts the outcome in patients with acute myeloid leukemia.

    Science.gov (United States)

    Andersson, Charlotta; Li, Xingru; Lorenz, Fryderyk; Golovleva, Irina; Wahlin, Anders; Li, Aihong

    2012-12-01

    Wilms tumor gene 1 (WT1) expression has been suggested as an applicable minimal residual disease marker in acute myeloid leukemia (AML). We evaluated the use of this marker in 43 adult AML patients. Quantitative assessment of WT1 gene transcripts was performed using real-time quantitative-polymerase chain reaction assay. Samples from both the peripheral blood and the bone marrow were analyzed at diagnosis and during follow-up. A strong correlation was observed between WT1 normalized with 2 different control genes (β-actin and ABL1, P0.05). A≥1-log reduction in WT1 expression in bone marrow samples taken freedom from relapse (P=0.010) when β-actin was used as control gene. Furthermore, a reduction in WT1 expression by ≥2 logs in peripheral blood samples taken at a later time point significantly correlated with a better outcome for overall survival (P=0.004) and freedom from relapse (P=0.012). This result was achieved when normalizing against both β-actin and ABL1. These results therefore suggest that WT1 gene expression can provide useful information for minimal residual disease detection in adult AML patients and that combined use of control genes can give more informative results.

  19. Prediction of drug efficacy for cancer treatment based on comparative analysis of chemosensitivity and gene expression data

    DEFF Research Database (Denmark)

    Wan, Peng; Li, Qiyuan; Larsen, Jens Erik Pontoppidan

    2012-01-01

    The NCI60 database is the largest available collection of compounds with measured anti-cancer activity. The strengths and limitations for using the NCI60 database as a source of new anti-cancer agents are explored and discussed in relation to previous studies. We selected a sub-set of 2333...... and in a data set of expression profiles of 1901 genes for the corresponding tumor cell lines. Five clusters were identified based on the gene expression data using self-organizing maps (SOM), comprising leukemia, melanoma, ovarian and prostate, basal breast, and luminal breast cancer cells, respectively....... The strong difference in gene expression between basal and luminal breast cancer cells was reflected clearly in the chemosensitivity data. Although most compounds in the data set were of low potency, high efficacy compounds that showed specificity with respect to tissue of origin could be found. Furthermore...

  20. Gene Expression Profiling to Predict Clinical Outcome of Breast Cancer: reproducing, analyzing and extending the Nature publication by vhVeer et al

    NARCIS (Netherlands)

    Li R.; Visser, H.M.

    2010-01-01

    Chemotherapy and hormonal therapy as adjuvant systemic therapies to inhibit breast cancer recurrence are not necessary for each patient. In Veer's paper "Gene expression profiling predicts clinical outcome of breast cancer" (Nature 2002, PMID: 11823860), they introduced a method based on DNA

  1. Gene expression and gene therapy imaging

    International Nuclear Information System (INIS)

    Rome, Claire; Couillaud, Franck; Moonen, Chrit T.W.

    2007-01-01

    The fast growing field of molecular imaging has achieved major advances in imaging gene expression, an important element of gene therapy. Gene expression imaging is based on specific probes or contrast agents that allow either direct or indirect spatio-temporal evaluation of gene expression. Direct evaluation is possible with, for example, contrast agents that bind directly to a specific target (e.g., receptor). Indirect evaluation may be achieved by using specific substrate probes for a target enzyme. The use of marker genes, also called reporter genes, is an essential element of MI approaches for gene expression in gene therapy. The marker gene may not have a therapeutic role itself, but by coupling the marker gene to a therapeutic gene, expression of the marker gene reports on the expression of the therapeutic gene. Nuclear medicine and optical approaches are highly sensitive (detection of probes in the picomolar range), whereas MRI and ultrasound imaging are less sensitive and require amplification techniques and/or accumulation of contrast agents in enlarged contrast particles. Recently developed MI techniques are particularly relevant for gene therapy. Amongst these are the possibility to track gene therapy vectors such as stem cells, and the techniques that allow spatiotemporal control of gene expression by non-invasive heating (with MRI guided focused ultrasound) and the use of temperature sensitive promoters. (orig.)

  2. Do aberrant crypt foci have predictive value for the occurrence of colorectal tumours? Potential of gene expression profiling in tumours

    NARCIS (Netherlands)

    Wijnands, M.V.W.; Erk, van M.J.; Doornbos, R.P.; Krul, C.A.M.; Woutersen, R.A.

    2004-01-01

    The effects of different dietary compounds on the formation of aberrant crypt foci (ACF) and colorectal tumours and on the expression of a selection of genes were studied in rats. Azoxymethane-treated male F344 rats were fed either a control diet or a diet containing 10% wheat bran (WB), 0.2%

  3. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks | Center for Cancer Research

    Science.gov (United States)

    The purpose of this study was to develop a method of classifying cancers to specific diagnostic categories based on their gene expression signatures using artificial neural networks (ANNs). We trained the ANNs using the small, round blue-cell tumors (SRBCTs) as a model. These cancers belong to four distinct diagnostic categories and often present diagnostic dilemmas in

  4. HPV and high-risk gene expression profiles predict response to chemoradiotherapy in head and neck cancer, independent of clinical factors

    International Nuclear Information System (INIS)

    Jong, Monique C. de; Pramana, Jimmy; Knegjens, Joost L.; Balm, Alfons J.M.; Brekel, Michiel W.M. van den; Hauptmann, Michael; Begg, Adrian C.; Rasch, Coen R.N.

    2010-01-01

    Purpose: The purpose of this study was to combine gene expression profiles and clinical factors to provide a better prediction model of local control after chemoradiotherapy for advanced head and neck cancer. Material and methods: Gene expression data were available for a series of 92 advanced stage head and neck cancer patients treated with primary chemoradiotherapy. The effect of the Chung high-risk and Slebos HPV expression profiles on local control was analyzed in a model with age at diagnosis, gender, tumor site, tumor volume, T-stage and N-stage and HPV profile status. Results: Among 75 patients included in the study, the only factors significantly predicting local control were tumor site (oral cavity vs. Pharynx, hazard ratio 4.2 [95% CI 1.4-12.5]), Chung gene expression status (high vs. Low risk profile, hazard ratio 4.4 [95% CI 1.5-13.3]) and HPV profile (negative vs. Positive profile, hazard ratio 6.2 [95% CI 1.7-22.5]). Conclusions: Chung high-risk expression profile and a negative HPV expression profile were significantly associated with increased risk of local recurrence after chemoradiotherapy in advanced pharynx and oral cavity tumors, independent of clinical factors.

  5. Vascular Gene Expression: A Hypothesis

    Directory of Open Access Journals (Sweden)

    Angélica Concepción eMartínez-Navarro

    2013-07-01

    Full Text Available The phloem is the conduit through which photoassimilates are distributed from autotrophic to heterotrophic tissues and is involved in the distribution of signaling molecules that coordinate plant growth and responses to the environment. Phloem function depends on the coordinate expression of a large array of genes. We have previously identified conserved motifs in upstream regions of the Arabidopsis genes, encoding the homologs of pumpkin phloem sap mRNAs, displaying expression in vascular tissues. This tissue-specific expression in Arabidopsis is predicted by the overrepresentation of GA/CT-rich motifs in gene promoters. In this work we have searched for common motifs in upstream regions of the homologous genes from plants considered to possess a primitive vascular tissue (a lycophyte, as well as from others that lack a true vascular tissue (a bryophyte, and finally from chlorophytes. Both lycophyte and bryophyte display motifs similar to those found in Arabidopsis with a significantly low E-value, while the chlorophytes showed either a different conserved motif or no conserved motif at all. These results suggest that these same genes are expressed coordinately in non- vascular plants; this coordinate expression may have been one of the prerequisites for the development of conducting tissues in plants. We have also analyzed the phylogeny of conserved proteins that may be involved in phloem function and development. The presence of CmPP16, APL, FT and YDA in chlorophytes suggests the recruitment of ancient regulatory networks for the development of the vascular tissue during evolution while OPS is a novel protein specific to vascular plants.

  6. Identification of genes showing differential expression profile ...

    Indian Academy of Sciences (India)

    3Department of Natural Sciences, International Christian University, Mitaka, Tokyo 181-8585, Japan ... the changes of expression predicted from gene function suggested association ... ate School of Science and Technology, Niigata University.

  7. The GENOTEND chip: a new tool to analyse gene expression in muscles of beef cattle for beef quality prediction.

    Science.gov (United States)

    Hocquette, Jean-Francois; Bernard-Capel, Carine; Vidal, Veronique; Jesson, Beline; Levéziel, Hubert; Renand, Gilles; Cassar-Malek, Isabelle

    2012-08-15

    Previous research programmes have described muscle biochemical traits and gene expression levels associated with beef tenderness. One of our results concerning the DNAJA1 gene (an Hsp40) was patented. This study aims to confirm the relationships previously identified between two gene families (heat shock proteins and energy metabolism) and beef quality. We developed an Agilent chip with specific probes for bovine muscular genes. More than 3000 genes involved in muscle biology or meat quality were selected from genetic, proteomic or transcriptomic studies, or from scientific publications. As far as possible, several probes were used for each gene (e.g. 17 probes for DNAJA1). RNA from Longissimus thoracis muscle samples was hybridised on the chips. Muscles samples were from four groups of Charolais cattle: two groups of young bulls and two groups of steers slaughtered in two different years. Principal component analysis, simple correlation of gene expression levels with tenderness scores, and then multiple regression analysis provided the means to detect the genes within two families (heat shock proteins and energy metabolism) which were the most associated with beef tenderness. For the 25 Charolais young bulls slaughtered in year 1, expression levels of DNAJA1 and other genes of the HSP family were related to the initial or overall beef tenderness. Similarly, expression levels of genes involved in fat or energy metabolism were related with the initial or overall beef tenderness but in the year 1 and year 2 groups of young bulls only. Generally, the genes individually correlated with tenderness are not consistent across genders and years indicating the strong influence of rearing conditions on muscle characteristics related to beef quality. However, a group of HSP genes, which explained about 40% of the variability in tenderness in the group of 25 young bulls slaughtered in year 1 (considered as the reference group), was validated in the groups of 30 Charolais young

  8. The GENOTEND chip: a new tool to analyse gene expression in muscles of beef cattle for beef quality prediction

    Directory of Open Access Journals (Sweden)

    Hocquette Jean-Francois

    2012-08-01

    Full Text Available Abstract Background Previous research programmes have described muscle biochemical traits and gene expression levels associated with beef tenderness. One of our results concerning the DNAJA1 gene (an Hsp40 was patented. This study aims to confirm the relationships previously identified between two gene families (heat shock proteins and energy metabolism and beef quality. Results We developed an Agilent chip with specific probes for bovine muscular genes. More than 3000 genes involved in muscle biology or meat quality were selected from genetic, proteomic or transcriptomic studies, or from scientific publications. As far as possible, several probes were used for each gene (e.g. 17 probes for DNAJA1. RNA from Longissimus thoracis muscle samples was hybridised on the chips. Muscles samples were from four groups of Charolais cattle: two groups of young bulls and two groups of steers slaughtered in two different years. Principal component analysis, simple correlation of gene expression levels with tenderness scores, and then multiple regression analysis provided the means to detect the genes within two families (heat shock proteins and energy metabolism which were the most associated with beef tenderness. For the 25 Charolais young bulls slaughtered in year 1, expression levels of DNAJA1 and other genes of the HSP family were related to the initial or overall beef tenderness. Similarly, expression levels of genes involved in fat or energy metabolism were related with the initial or overall beef tenderness but in the year 1 and year 2 groups of young bulls only. Generally, the genes individually correlated with tenderness are not consistent across genders and years indicating the strong influence of rearing conditions on muscle characteristics related to beef quality. However, a group of HSP genes, which explained about 40% of the variability in tenderness in the group of 25 young bulls slaughtered in year 1 (considered as the reference group, was

  9. A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors.

    Science.gov (United States)

    Loboda, Andrey; Nebozhyn, Michael; Klinghoffer, Rich; Frazier, Jason; Chastain, Michael; Arthur, William; Roberts, Brian; Zhang, Theresa; Chenard, Melissa; Haines, Brian; Andersen, Jannik; Nagashima, Kumiko; Paweletz, Cloud; Lynch, Bethany; Feldman, Igor; Dai, Hongyue; Huang, Pearl; Watters, James

    2010-06-30

    Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90%) sensitivity but relatively low (50%) specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical utility in lung and breast tumors.

  10. A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors

    Directory of Open Access Journals (Sweden)

    Paweletz Cloud

    2010-06-01

    Full Text Available Abstract Background Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. Methods We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. Results The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90% sensitivity but relatively low (50% specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. Conclusions These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical

  11. Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clustering

    Directory of Open Access Journals (Sweden)

    Sharma Animesh

    2007-01-01

    Full Text Available Abstract Background The four heterogeneous childhood cancers, neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma, and Ewing sarcoma present a similar histology of small round blue cell tumor (SRBCT and thus often leads to misdiagnosis. Identification of biomarkers for distinguishing these cancers is a well studied problem. Existing methods typically evaluate each gene separately and do not take into account the nonlinear interaction between genes and the tools that are used to design the diagnostic prediction system. Consequently, more genes are usually identified as necessary for prediction. We propose a general scheme for finding a small set of biomarkers to design a diagnostic system for accurate classification of the cancer subgroups. We use multilayer networks with online gene selection ability and relational fuzzy clustering to identify a small set of biomarkers for accurate classification of the training and blind test cases of a well studied data set. Results Our method discerned just seven biomarkers that precisely categorized the four subgroups of cancer both in training and blind samples. For the same problem, others suggested 19–94 genes. These seven biomarkers include three novel genes (NAB2, LSP1 and EHD1 – not identified by others with distinct class-specific signatures and important role in cancer biology, including cellular proliferation, transendothelial migration and trafficking of MHC class antigens. Interestingly, NAB2 is downregulated in other tumors including Non-Hodgkin lymphoma and Neuroblastoma but we observed moderate to high upregulation in a few cases of Ewing sarcoma and Rabhdomyosarcoma, suggesting that NAB2 might be mutated in these tumors. These genes can discover the subgroups correctly with unsupervised learning, can differentiate non-SRBCT samples and they perform equally well with other machine learning tools including support vector machines. These biomarkers lead to four simple human interpretable

  12. Integrative miRNA-Gene Expression Analysis Enables Refinement of Associated Biology and Prediction of Response to Cetuximab in Head and Neck Squamous Cell Cancer

    Directory of Open Access Journals (Sweden)

    Loris De Cecco

    2017-01-01

    Full Text Available This paper documents the process by which we, through gene and miRNA expression profiling of the same samples of head and neck squamous cell carcinomas (HNSCC and an integrative miRNA-mRNA expression analysis, were able to identify candidate biomarkers of progression-free survival (PFS in patients treated with cetuximab-based approaches. Through sparse partial least square–discriminant analysis (sPLS-DA and supervised analysis, 36 miRNAs were identified in two components that clearly separated long- and short-PFS patients. Gene set enrichment analysis identified a significant correlation between the miRNA first-component and EGFR signaling, keratinocyte differentiation, and p53. Another significant correlation was identified between the second component and RAS, NOTCH, immune/inflammatory response, epithelial–mesenchymal transition (EMT, and angiogenesis pathways. Regularized canonical correlation analysis of sPLS-DA miRNA and gene data combined with the MAGIA2 web-tool highlighted 16 miRNAs and 84 genes that were interconnected in a total of 245 interactions. After feature selection by a smoothed t-statistic support vector machine, we identified three miRNAs and five genes in the miRNA-gene network whose expression result was the most relevant in predicting PFS (Area Under the Curve, AUC = 0.992. Overall, using a well-defined clinical setting and up-to-date bioinformatics tools, we are able to give the proof of principle that an integrative miRNA-mRNA expression could greatly contribute to the refinement of the biology behind a predictive model.

  13. Perturbation of B Cell Gene Expression Persists in HIV-Infected Children Despite Effective Antiretroviral Therapy and Predicts H1N1 Response.

    Science.gov (United States)

    Cotugno, Nicola; De Armas, Lesley; Pallikkuth, Suresh; Rinaldi, Stefano; Issac, Biju; Cagigi, Alberto; Rossi, Paolo; Palma, Paolo; Pahwa, Savita

    2017-01-01

    Despite effective antiretroviral therapy (ART), HIV-infected individuals with apparently similar clinical and immunological characteristics can vary in responsiveness to vaccinations. However, molecular mechanisms responsible for such impairment, as well as biomarkers able to predict vaccine responsiveness in HIV-infected children, remain unknown. Following the hypothesis that a B cell qualitative impairment persists in HIV-infected children (HIV) despite effective ART and phenotypic B cell immune reconstitution, the aim of the current study was to investigate B cell gene expression of HIV compared to age-matched healthy controls (HCs) and to determine whether distinct gene expression patterns could predict the ability to respond to influenza vaccine. To do so, we analyzed prevaccination transcriptional levels of a 96-gene panel in equal numbers of sort-purified B cell subsets (SPBS) isolated from peripheral blood mononuclear cells using multiplexed RT-PCR. Immune responses to H1N1 antigen were determined by hemaglutination inhibition and memory B cell ELISpot assays following trivalent-inactivated influenza vaccination (TIV) for all study participants. Although there were no differences in terms of cell frequencies of SPBS between HIV and HC, the groups were distinguishable based upon gene expression analyses. Indeed, a 28-gene signature, characterized by higher expression of genes involved in the inflammatory response and immune activation was observed in activated memory B cells (CD27 + CD21 - ) from HIV when compared to HC despite long-term viral control (>24 months). Further analysis, taking into account H1N1 responses after TIV in HIV participants, revealed that a 25-gene signature in resting memory (RM) B cells (CD27 + CD21 + ) was able to distinguish vaccine responders from non-responders (NR). In fact, prevaccination RM B cells of responders showed a higher expression of gene sets involved in B cell adaptive immune responses ( APRIL, BTK, BLIMP1 ) and

  14. Imaging gene expression in gene therapy

    International Nuclear Information System (INIS)

    Wiebe, Leonard I.

    1997-01-01

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on 'suicide gene therapy' of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k + ) has been use for 'suicide' in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k + gene expression where the H S V-1 t k + gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([ 18 F]F H P G; [ 18 F]-A C V), and pyrimidine- ([ 123 / 131 I]I V R F U; [ 124 / 131I ]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [ 123 / 131I ]I V R F U imaging with the H S V-1 t k + reporter gene will be presented

  15. Imaging gene expression in gene therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, Leonard I. [Alberta Univ., Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-12-31

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on `suicide gene therapy` of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k{sup +}) has been use for `suicide` in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k{sup +} gene expression where the H S V-1 t k{sup +} gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([{sup 18} F]F H P G; [{sup 18} F]-A C V), and pyrimidine- ([{sup 123}/{sup 131} I]I V R F U; [{sup 124}/{sup 131I}]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [{sup 123}/{sup 131I}]I V R F U imaging with the H S V-1 t k{sup +} reporter gene will be presented

  16. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  17. Prediction and characterisation of a highly conserved, remote and cAMP responsive enhancer that regulates Msx1 gene expression in cardiac neural crest and outflow tract.

    Science.gov (United States)

    Miller, Kerry Ann; Davidson, Scott; Liaros, Angela; Barrow, John; Lear, Marissa; Heine, Danielle; Hoppler, Stefan; MacKenzie, Alasdair

    2008-05-15

    Double knockouts of the Msx1 and Msx2 genes in the mouse result in severe cardiac outflow tract malformations similar to those frequently found in newborn infants. Despite the known role of the Msx genes in cardiac formation little is known of the regulatory systems (ligand receptor, signal transduction and protein-DNA interactions) that regulate the tissue-specific expression of the Msx genes in mammals during the formation of the outflow tract. In the present study we have used a combination of multi-species comparative genomics, mouse transgenic analysis and in-situ hybridisation to predict and validate the existence of a remote ultra-conserved enhancer that supports the expression of the Msx1 gene in migrating mouse cardiac neural crest and the outflow tract primordia. Furthermore, culturing of embryonic explants derived from transgenic lines with agonists of the PKC and PKA signal transduction systems demonstrates that this remote enhancer is influenced by PKA but not PKC dependent gene regulatory systems. These studies demonstrate the efficacy of combining comparative genomics and transgenic analyses and provide a platform for the study of the possible roles of Msx gene mis-regulation in the aetiology of congenital heart malformation.

  18. Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study.

    Science.gov (United States)

    Tang, Xin-Ran; Li, Ying-Qin; Liang, Shao-Bo; Jiang, Wei; Liu, Fang; Ge, Wen-Xiu; Tang, Ling-Long; Mao, Yan-Ping; He, Qing-Mei; Yang, Xiao-Jing; Zhang, Yuan; Wen, Xin; Zhang, Jian; Wang, Ya-Qin; Zhang, Pan-Pan; Sun, Ying; Yun, Jing-Ping; Zeng, Jing; Li, Li; Liu, Li-Zhi; Liu, Na; Ma, Jun

    2018-03-01

    Gene expression patterns can be used as prognostic biomarkers in various types of cancers. We aimed to identify a gene expression pattern for individual distant metastatic risk assessment in patients with locoregionally advanced nasopharyngeal carcinoma. In this multicentre, retrospective, cohort analysis, we included 937 patients with locoregionally advanced nasopharyngeal carcinoma from three Chinese hospitals: the Sun Yat-sen University Cancer Center (Guangzhou, China), the Affiliated Hospital of Guilin Medical University (Guilin, China), and the First People's Hospital of Foshan (Foshan, China). Using microarray analysis, we profiled mRNA gene expression between 24 paired locoregionally advanced nasopharyngeal carcinoma tumours from patients at Sun Yat-sen University Cancer Center with or without distant metastasis after radical treatment. Differentially expressed genes were examined using digital expression profiling in a training cohort (Guangzhou training cohort; n=410) to build a gene classifier using a penalised regression model. We validated the prognostic accuracy of this gene classifier in an internal validation cohort (Guangzhou internal validation cohort, n=204) and two external independent cohorts (Guilin cohort, n=165; Foshan cohort, n=158). The primary endpoint was distant metastasis-free survival. Secondary endpoints were disease-free survival and overall survival. We identified 137 differentially expressed genes between metastatic and non-metastatic locoregionally advanced nasopharyngeal carcinoma tissues. A distant metastasis gene signature for locoregionally advanced nasopharyngeal carcinoma (DMGN) that consisted of 13 genes was generated to classify patients into high-risk and low-risk groups in the training cohort. Patients with high-risk scores in the training cohort had shorter distant metastasis-free survival (hazard ratio [HR] 4·93, 95% CI 2·99-8·16; padvanced nasopharyngeal carcinoma and might be able to predict which patients benefit

  19. Regulation of eucaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Brent, R.; Ptashne, M.S

    1989-05-23

    This patent describes a method of regulating the expression of a gene in a eucaryotic cell. The method consists of: providing in the eucaryotic cell, a peptide, derived from or substantially similar to a peptide of a procaryotic cell able to bind to DNA upstream from or within the gene, the amount of the peptide being sufficient to bind to the gene and thereby control expression of the gene.

  20. Polycistronic gene expression in Aspergillus niger.

    Science.gov (United States)

    Schuetze, Tabea; Meyer, Vera

    2017-09-25

    Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at

  1. Differential Gene Expression and Aging

    Directory of Open Access Journals (Sweden)

    Laurent Seroude

    2002-01-01

    Full Text Available It has been established that an intricate program of gene expression controls progression through the different stages in development. The equally complex biological phenomenon known as aging is genetically determined and environmentally modulated. This review focuses on the genetic component of aging, with a special emphasis on differential gene expression. At least two genetic pathways regulating organism longevity act by modifying gene expression. Many genes are also subjected to age-dependent transcriptional regulation. Some age-related gene expression changes are prevented by caloric restriction, the most robust intervention that slows down the aging process. Manipulating the expression of some age-regulated genes can extend an organism's life span. Remarkably, the activity of many transcription regulatory elements is linked to physiological age as opposed to chronological age, indicating that orderly and tightly controlled regulatory pathways are active during aging.

  2. Gene expression in uninvolved oral mucosa of OSCC patients facilitates identification of markers predictive of OSCC outcomes.

    Directory of Open Access Journals (Sweden)

    Pawadee Lohavanichbutr

    Full Text Available Oral and oropharyngeal squamous cell carcinomas (OSCC are among the most common cancers worldwide, with approximately 60% 5-yr survival rate. To identify potential markers for disease progression, we used Affymetrix U133 plus 2.0 arrays to examine the gene expression profiles of 167 primary tumor samples from OSCC patients, 58 uninvolved oral mucosae from OSCC patients and 45 normal oral mucosae from patients without oral cancer, all enrolled at one of the three University of Washington-affiliated medical centers between 2003 to 2008. We found 2,596 probe sets differentially expressed between 167 tumor samples and 45 normal samples. Among 2,596 probe sets, 71 were significantly and consistently up- or down-regulated in the comparison between normal samples and uninvolved oral samples and between uninvolved oral samples and tumor samples. Cox regression analyses showed that 20 of the 71 probe sets were significantly associated with progression-free survival. The risk score for each patient was calculated from coefficients of a Cox model incorporating these 20 probe sets. The hazard ratio (HR associated with each unit change in the risk score adjusting for age, gender, tumor stage, and high-risk HPV status was 2.7 (95% CI: 2.0-3.8, p = 8.8E-10. The risk scores in an independent dataset of 74 OSCC patients from the MD Anderson Cancer Center was also significantly associated with progression-free survival independent of age, gender, and tumor stage (HR 1.6, 95% CI: 1.1-2.2, p = 0.008. Gene Set Enrichment Analysis showed that the most prominent biological pathway represented by the 71 probe sets was the Integrin cell surface interactions pathway. In conclusion, we identified 71 probe sets in which dysregulation occurred in both uninvolved oral mucosal and cancer samples. Dysregulation of 20 of the 71 probe sets was associated with progression-free survival and was validated in an independent dataset.

  3. Combined serial analysis of gene expression and transcription factor binding site prediction identifies novel-candidate-target genes of Nr2e1 in neocortex development.

    Science.gov (United States)

    Schmouth, Jean-François; Arenillas, David; Corso-Díaz, Ximena; Xie, Yuan-Yun; Bohacec, Slavita; Banks, Kathleen G; Bonaguro, Russell J; Wong, Siaw H; Jones, Steven J M; Marra, Marco A; Simpson, Elizabeth M; Wasserman, Wyeth W

    2015-07-24

    Nr2e1 (nuclear receptor subfamily 2, group e, member 1) encodes a transcription factor important in neocortex development. Previous work has shown that nuclear receptors can have hundreds of target genes, and bind more than 300 co-interacting proteins. However, recognition of the critical role of Nr2e1 in neural stem cells and neocortex development is relatively recent, thus the molecular mechanisms involved for this nuclear receptor are only beginning to be understood. Serial analysis of gene expression (SAGE), has given researchers both qualitative and quantitative information pertaining to biological processes. Thus, in this work, six LongSAGE mouse libraries were generated from laser microdissected tissue samples of dorsal VZ/SVZ (ventricular zone and subventricular zone) from the telencephalon of wild-type (Wt) and Nr2e1-null embryos at the critical development ages E13.5, E15.5, and E17.5. We then used a novel approach, implementing multiple computational methods followed by biological validation to further our understanding of Nr2e1 in neocortex development. In this work, we have generated a list of 1279 genes that are differentially expressed in response to altered Nr2e1 expression during in vivo neocortex development. We have refined this list to 64 candidate direct-targets of NR2E1. Our data suggested distinct roles for Nr2e1 during different neocortex developmental stages. Most importantly, our results suggest a possible novel pathway by which Nr2e1 regulates neurogenesis, which includes Lhx2 as one of the candidate direct-target genes, and SOX9 as a co-interactor. In conclusion, we have provided new candidate interacting partners and numerous well-developed testable hypotheses for understanding the pathways by which Nr2e1 functions to regulate neocortex development.

  4. Predicting protein-protein interactions in Arabidopsis thaliana through integration of orthology, gene ontology and co-expression

    Directory of Open Access Journals (Sweden)

    Vandepoele Klaas

    2009-06-01

    Full Text Available Abstract Background Large-scale identification of the interrelationships between different components of the cell, such as the interactions between proteins, has recently gained great interest. However, unraveling large-scale protein-protein interaction maps is laborious and expensive. Moreover, assessing the reliability of the interactions can be cumbersome. Results In this study, we have developed a computational method that exploits the existing knowledge on protein-protein interactions in diverse species through orthologous relations on the one hand, and functional association data on the other hand to predict and filter protein-protein interactions in Arabidopsis thaliana. A highly reliable set of protein-protein interactions is predicted through this integrative approach making use of existing protein-protein interaction data from yeast, human, C. elegans and D. melanogaster. Localization, biological process, and co-expression data are used as powerful indicators for protein-protein interactions. The functional repertoire of the identified interactome reveals interactions between proteins functioning in well-conserved as well as plant-specific biological processes. We observe that although common mechanisms (e.g. actin polymerization and components (e.g. ARPs, actin-related proteins exist between different lineages, they are active in specific processes such as growth, cancer metastasis and trichome development in yeast, human and Arabidopsis, respectively. Conclusion We conclude that the integration of orthology with functional association data is adequate to predict protein-protein interactions. Through this approach, a high number of novel protein-protein interactions with diverse biological roles is discovered. Overall, we have predicted a reliable set of protein-protein interactions suitable for further computational as well as experimental analyses.

  5. A gene expression signature of Retinoblastoma loss-of-function predicts resistance to neoadjuvant chemotherapy in ER-positive/HER2-positive breast cancer patients.

    Science.gov (United States)

    Risi, Emanuela; Grilli, Andrea; Migliaccio, Ilenia; Biagioni, Chiara; McCartney, Amelia; Guarducci, Cristina; Bonechi, Martina; Benelli, Matteo; Vitale, Stefania; Biganzoli, Laura; Bicciato, Silvio; Di Leo, Angelo; Malorni, Luca

    2018-07-01

    HER2-positive (HER2+) breast cancers show heterogeneous response to chemotherapy, with the ER-positive (ER+) subgroup deriving less benefit. Loss of retinoblastoma tumor suppressor gene (RB1) function has been suggested as a cardinal feature of breast cancers that are more sensitive to chemotherapy and conversely resistant to CDK4/6 inhibitors. We performed a retrospective analysis exploring RBsig, a gene signature of RB loss, as a potential predictive marker of response to neoadjuvant chemotherapy in ER+/HER2+ breast cancer patients. We selected clinical trials of neoadjuvant chemotherapy ± anti-HER2 therapy in HER2+ breast cancer patients with available information on gene expression data, hormone receptor status, and pathological complete response (pCR) rates. RBsig expression was computed in silico and correlated with pCR. Ten studies fulfilled the inclusion criteria and were included in the analysis (514 patients). Overall, of 211 ER+/HER2+ breast cancer patients, 49 achieved pCR (23%). The pCR rate following chemotherapy ± anti-HER2 drugs in patients with RBsig low expression was significantly lower compared to patients with RBsig high expression (16% vs. 30%, respectively; Fisher's exact test p = 0.015). The area under the ROC curve (AUC) was 0.62 (p = 0.005). In the 303 ER-negative (ER-)/HER2+ patients treated with chemotherapy ± anti-HER2 drugs, the pCR rate was 43%. No correlation was found between RBsig expression and pCR rate in this group. Low expression of RBsig identifies a subset of ER+/HER2+ patients with low pCR rates following neoadjuvant chemotherapy ± anti-HER2 therapy. These patients may potentially be spared chemotherapy in favor of anti-HER2, endocrine therapy, and CDK 4/6 inhibitor combinations.

  6. Verification of predicted alternatively spliced Wnt genes reveals two new splice variants (CTNNB1 and LRP5 and altered Axin-1 expression during tumour progression

    Directory of Open Access Journals (Sweden)

    Reich Jens G

    2006-06-01

    Full Text Available Abstract Background Splicing processes might play a major role in carcinogenesis and tumour progression. The Wnt pathway is of crucial relevance for cancer progression. Therefore we focussed on the Wnt/β-catenin signalling pathway in order to validate the expression of sequences predicted as alternatively spliced by bioinformatic methods. Splice variants of its key molecules were selected, which may be critical components for the understanding of colorectal tumour progression and may have the potential to act as biological markers. For some of the Wnt pathway genes the existence of splice variants was either proposed (e.g. β-Catenin and CTNNB1 or described only in non-colon tissues (e.g. GSK3β or hitherto not published (e.g. LRP5. Results Both splice variants – normal and alternative form – of all selected Wnt pathway components were found to be expressed in cell lines as well as in samples derived from tumour, normal and healthy tissues. All splice positions corresponded totally with the bioinformatical prediction as shown by sequencing. Two hitherto not described alternative splice forms (CTNNB1 and LRP5 were detected. Although the underlying EST data used for the bioinformatic analysis suggested a tumour-specific expression neither a qualitative nor a significant quantitative difference between the expression in tumour and healthy tissues was detected. Axin-1 expression was reduced in later stages and in samples from carcinomas forming distant metastases. Conclusion We were first to describe that splice forms of crucial genes of the Wnt-pathway are expressed in human colorectal tissue. Newly described splicefoms were found for β-Catenin, LRP5, GSK3β, Axin-1 and CtBP1. However, the predicted cancer specificity suggested by the origin of the underlying ESTs was neither qualitatively nor significant quantitatively confirmed. That let us to conclude that EST sequence data can give adequate hints for the existence of alternative splicing

  7. Development and validation of a gene profile predicting benefit of postmastectomy radiotherapy in patients with high-risk breast cancer: a study of gene expression in the DBCG82bc cohort.

    Science.gov (United States)

    Tramm, Trine; Mohammed, Hayat; Myhre, Simen; Kyndi, Marianne; Alsner, Jan; Børresen-Dale, Anne-Lise; Sørlie, Therese; Frigessi, Arnoldo; Overgaard, Jens

    2014-10-15

    To identify genes predicting benefit of radiotherapy in patients with high-risk breast cancer treated with systemic therapy and randomized to receive or not receive postmastectomy radiotherapy (PMRT). The study was based on the Danish Breast Cancer Cooperative Group (DBCG82bc) cohort. Gene-expression analysis was performed in a training set of frozen tumor tissue from 191 patients. Genes were identified through the Lasso method with the endpoint being locoregional recurrence (LRR). A weighted gene-expression index (DBCG-RT profile) was calculated and transferred to quantitative real-time PCR (qRT-PCR) in corresponding formalin-fixed, paraffin-embedded (FFPE) samples, before validation in FFPE from 112 additional patients. Seven genes were identified, and the derived DBCG-RT profile divided the 191 patients into "high LRR risk" and "low LRR risk" groups. PMRT significantly reduced risk of LRR in "high LRR risk" patients, whereas "low LRR risk" patients showed no additional reduction in LRR rate. Technical transfer of the DBCG-RT profile to FFPE/qRT-PCR was successful, and the predictive impact was successfully validated in another 112 patients. A DBCG-RT gene profile was identified and validated, identifying patients with very low risk of LRR and no benefit from PMRT. The profile may provide a method to individualize treatment with PMRT. ©2014 American Association for Cancer Research.

  8. Mammalian transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes and are predicted to act as transcriptional activator hubs.

    Science.gov (United States)

    Joshi, Anagha

    2014-12-30

    Transcriptional hotspots are defined as genomic regions bound by multiple factors. They have been identified recently as cell type specific enhancers regulating developmentally essential genes in many species such as worm, fly and humans. The in-depth analysis of hotspots across multiple cell types in same species still remains to be explored and can bring new biological insights. We therefore collected 108 transcription-related factor (TF) ChIP sequencing data sets in ten murine cell types and classified the peaks in each cell type in three groups according to binding occupancy as singletons (low-occupancy), combinatorials (mid-occupancy) and hotspots (high-occupancy). The peaks in the three groups clustered largely according to the occupancy, suggesting priming of genomic loci for mid occupancy irrespective of cell type. We then characterized hotspots for diverse structural functional properties. The genes neighbouring hotspots had a small overlap with hotspot genes in other cell types and were highly enriched for cell type specific function. Hotspots were enriched for sequence motifs of key TFs in that cell type and more than 90% of hotspots were occupied by pioneering factors. Though we did not find any sequence signature in the three groups, the H3K4me1 binding profile had bimodal peaks at hotspots, distinguishing hotspots from mono-modal H3K4me1 singletons. In ES cells, differentially expressed genes after perturbation of activators were enriched for hotspot genes suggesting hotspots primarily act as transcriptional activator hubs. Finally, we proposed that ES hotspots might be under control of SetDB1 and not DNMT for silencing. Transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes. In ES cells, they are predicted to act as transcriptional activator hubs and might be under SetDB1 control for silencing.

  9. Small, synthetic, GC-rich mRNA stem-loop modules 5' proximal to the AUG start-codon predictably tune gene expression in yeast.

    Science.gov (United States)

    Lamping, Erwin; Niimi, Masakazu; Cannon, Richard D

    2013-07-29

    A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5' UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5' UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. An mRNA stem-loop of seven GC-pairs (∆G = -15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (∆G = -4.4 kcal/mol) inhibited

  10. Small, synthetic, GC-rich mRNA stem-loop modules 5′ proximal to the AUG start-codon predictably tune gene expression in yeast

    Science.gov (United States)

    2013-01-01

    Background A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5′ UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Results Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5′ UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. An mRNA stem-loop of seven GC-pairs (∆G = −15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (

  11. Gene expression in early stage cervical cancer

    NARCIS (Netherlands)

    Biewenga, Petra; Buist, Marrije R.; Moerland, Perry D.; van Thernaat, Emiel Ver Loren; van Kampen, Antoine H. C.; ten Kate, Fiebo J. W.; Baas, Frank

    2008-01-01

    Objective. Pelvic lymph node metastases are the main prognostic factor for survival in early stage cervical cancer, yet accurate detection methods before surgery are lacking. In this study, we examined whether gene expression profiling can predict the presence of lymph node metastasis in early stage

  12. Gene expression in colorectal cancer

    DEFF Research Database (Denmark)

    Birkenkamp-Demtroder, Karin; Christensen, Lise Lotte; Olesen, Sanne Harder

    2002-01-01

    Understanding molecular alterations in colorectal cancer (CRC) is needed to define new biomarkers and treatment targets. We used oligonucleotide microarrays to monitor gene expression of about 6,800 known genes and 35,000 expressed sequence tags (ESTs) on five pools (four to six samples in each...... pool) of total RNA from left-sided sporadic colorectal carcinomas. We compared normal tissue to carcinoma tissue from Dukes' stages A-D (noninvasive to distant metastasis) and identified 908 known genes and 4,155 ESTs that changed remarkably from normal to tumor tissue. Based on intensive filtering 226...

  13. Correction of gene expression data

    DEFF Research Database (Denmark)

    Darbani Shirvanehdeh, Behrooz; Stewart, C. Neal, Jr.; Noeparvar, Shahin

    2014-01-01

    This report investigates for the first time the potential inter-treatment bias source of cell number for gene expression studies. Cell-number bias can affect gene expression analysis when comparing samples with unequal total cellular RNA content or with different RNA extraction efficiencies....... For maximal reliability of analysis, therefore, comparisons should be performed at the cellular level. This could be accomplished using an appropriate correction method that can detect and remove the inter-treatment bias for cell-number. Based on inter-treatment variations of reference genes, we introduce...

  14. Expression of Transketolase like gene 1 (TKTL1 predicts disease-free survival in patients with locally advanced rectal cancer receiving neoadjuvant chemoradiotherapy

    Directory of Open Access Journals (Sweden)

    Hofmann Wolf-Karsten

    2011-08-01

    Full Text Available Abstract Background For patients with locally advanced rectal cancer (LARC neoadjuvant chemoradiotherapy is recommended as standard therapy. So far, no predictive or prognostic molecular factors for patients undergoing multimodal treatment are established. Increased angiogenesis and altered tumour metabolism as adaption to hypoxic conditions in cancers play an important role in tumour progression and metastasis. Enhanced expression of Vascular-endothelial-growth-factor-receptor (VEGF-R and Transketolase-like-1 (TKTL1 are related to hypoxic conditions in tumours. In search for potential prognostic molecular markers we investigated the expression of VEGFR-1, VEGFR-2 and TKTL1 in patients with LARC treated with neoadjuvant chemoradiotherapy and cetuximab. Methods Tumour and corresponding normal tissue from pre-therapeutic biopsies of 33 patients (m: 23, f: 10; median age: 61 years with LARC treated in phase-I and II trials with neoadjuvant chemoradiotherapy (cetuximab, irinotecan, capecitabine in combination with radiotherapy were analysed by quantitative PCR. Results Significantly higher expression of VEGFR-1/2 was found in tumour tissue in pre-treatment biopsies as well as in resected specimen after neoadjuvant chemoradiotherapy compared to corresponding normal tissue. High TKTL1 expression significantly correlated with disease free survival. None of the markers had influence on early response parameters such as tumour regression grading. There was no correlation of gene expression between the investigated markers. Conclusion High TKTL-1 expression correlates with poor prognosis in terms of 3 year disease-free survival in patients with LARC treated with intensified neoadjuvant chemoradiotherapy and may therefore serve as a molecular prognostic marker which should be further evaluated in randomised clinical trials.

  15. Adrenal-kidney-gonad complex measurements may not predict gonad-specific changes in gene expression patterns during temperature-dependent sex determination in the red-eared slider turtle (Trachemys scripta elegans).

    Science.gov (United States)

    Ramsey, Mary; Crews, David

    2007-08-01

    Many turtles, including the red-eared slider turtle (Trachemys scripta elegans) have temperature-dependent sex determination in which gonadal sex is determined by temperature during the middle third of incubation. The gonad develops as part of a heterogenous tissue complex that comprises the developing adrenal, kidney, and gonad (AKG complex). Owing to the difficulty in excising the gonad from the adjacent tissues, the AKG complex is often used as tissue source in assays examining gene expression in the developing gonad. However, the gonad is a relatively small component of the AKG, and gene expression in the adrenal-kidney (AK) compartment may interfere with the detection of gonad-specific changes in gene expression, particularly during early key phases of gonadal development and sex determination. In this study, we examine transcript levels as measured by quantitative real-time polymerase chain reaction for five genes important in slider turtle sex determination and differentiation (AR, ERalpha, ERbeta, aromatase, and Sf1) in AKG, AK, and isolated gonad tissues. In all cases, gonad-specific gene expression patterns were attenuated in AKG versus gonad tissue. All five genes were expressed in the AK in addition to the gonad at all stages/temperatures. Inclusion of the AK compartment masked important changes in gonadal gene expression. In addition, AK and gonad expression patterns are not additive, and gonadal gene expression cannot be predicted from intact AKG measurements. (c) 2007 Wiley-Liss, Inc.

  16. Noise minimization in eukaryotic gene expression.

    Directory of Open Access Journals (Sweden)

    Hunter B Fraser

    2004-06-01

    Full Text Available All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or "noise." Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  17. Noise minimization in eukaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-15

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  18. Noise minimization in eukaryotic gene expression

    International Nuclear Information System (INIS)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-01

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection

  19. Gene expression profile of blood cells for the prediction of delayed cerebral ischemia after intracranial aneurysm rupture: a pilot study in humans.

    Science.gov (United States)

    Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel

    2013-01-01

    Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single gene approach of DCI has demonstrated interest in humans. We hypothesized that whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit occurring within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed by microarrays. Transcriptomic analysis was performed using long oligonucleotide microarrays representing 25,000 genes. Quantitative PCR: 1 µg of total RNA extracted was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI. Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in

  20. The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

    Directory of Open Access Journals (Sweden)

    Jean-François Gout

    2010-05-01

    Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.

  1. Gene expression signatures predict outcome in non-muscle invasive bladder carcinoma - a multi-center validation study

    DEFF Research Database (Denmark)

    Andersen, Lars Dyrskjøt; Zieger, Karsten; Real, Francisco X.

    2007-01-01

    and carcinoma in situ (CIS) and for predicting disease recurrence and progression. EXPERIMENTAL DESIGN: We analyzed tumors from 404 patients diagnosed with bladder cancer in hospitals in Denmark, Sweden, England, Spain, and France using custom microarrays. Molecular classifications were compared with pathologic....... CONCLUSION: This multicenter validation study confirms in an independent series the clinical utility of molecular classifiers to predict the outcome of patients initially diagnosed with non-muscle-invasive bladder cancer. This information may be useful to better guide patient treatment....

  2. Transgenic Arabidopsis Gene Expression System

    Science.gov (United States)

    Ferl, Robert; Paul, Anna-Lisa

    2009-01-01

    The Transgenic Arabidopsis Gene Expression System (TAGES) investigation is one in a pair of investigations that use the Advanced Biological Research System (ABRS) facility. TAGES uses Arabidopsis thaliana, thale cress, with sensor promoter-reporter gene constructs that render the plants as biomonitors (an organism used to determine the quality of the surrounding environment) of their environment using real-time nondestructive Green Fluorescent Protein (GFP) imagery and traditional postflight analyses.

  3. Human papillomavirus gene expression

    International Nuclear Information System (INIS)

    Chow, L.T.; Hirochika, H.; Nasseri, M.; Stoler, M.H.; Wolinsky, S.M.; Chin, M.T.; Hirochika, R.; Arvan, D.S.; Broker, T.R.

    1987-01-01

    To determine the role of tissue differentiation on expression of each of the papillomavirus mRNA species identified by electron microscopy, the authors prepared exon-specific RNA probes that could distinguish the alternatively spliced mRNA species. Radioactively labeled single-stranded RNA probes were generated from a dual promoter vector system and individually hybridized to adjacent serial sections of formalin-fixed, paraffin-embedded biopsies of condylomata. Autoradiography showed that each of the message species had a characteristic tissue distribution and relative abundance. The authors have characterized a portion of the regulatory network of the HPVs by showing that the E2 ORF encodes a trans-acting enhancer-stimulating protein, as it does in BPV-1 (Spalholz et al. 1985). The HPV-11 enhancer was mapped to a 150-bp tract near the 3' end of the URR. Portions of this region are duplicated in some aggressive strains of HPV-6 (Boshart and zur Hausen 1986; Rando et al. 1986). To test the possible biological relevance of these duplications, they cloned tandem arrays of the enhancer and demonstrated, using a chloramphenicol acetyltransferase (CAT) assay, that they led to dramatically increased transcription proportional to copy number. Using the CAT assays, the authors found that the E2 proteins of several papillomavirus types can cross-stimulate the enhancers of most other types. This suggests that prior infection of a tissue with one papillomavirus type may provide a helper effect for superinfection and might account fo the HPV-6/HPV-16 coinfections in condylomata that they have observed

  4. Computational prediction of CTCF/cohesin-based intra-TAD loops that insulate chromatin contacts and gene expression in mouse liver.

    Science.gov (United States)

    Matthews, Bryan J; Waxman, David J

    2018-05-14

    CTCF and cohesin are key drivers of 3D-nuclear organization, anchoring the megabase-scale Topologically Associating Domains (TADs) that segment the genome. Here, we present and validate a computational method to predict cohesin-and-CTCF binding sites that form intra-TAD DNA loops. The intra-TAD loop anchors identified are structurally indistinguishable from TAD anchors regarding binding partners, sequence conservation, and resistance to cohesin knockdown; further, the intra-TAD loops retain key functional features of TADs, including chromatin contact insulation, blockage of repressive histone mark spread, and ubiquity across tissues. We propose that intra-TAD loops form by the same loop extrusion mechanism as the larger TAD loops, and that their shorter length enables finer regulatory control in restricting enhancer-promoter interactions, which enables selective, high-level expression of gene targets of super-enhancers and genes located within repressive nuclear compartments. These findings elucidate the role of intra-TAD cohesin-and-CTCF binding in nuclear organization associated with widespread insulation of distal enhancer activity. © 2018, Matthews et al.

  5. Homeobox gene expression in Brachiopoda

    DEFF Research Database (Denmark)

    Altenburger, Andreas; Martinez, Pedro; Wanninger, Andreas

    2011-01-01

    (ectoderm) specification with co-opted functions in notochord formation in chordates and left/right determination in ambulacrarians and vertebrates. The caudal ortholog, TtrCdx, is first expressed in the ectoderm of the gastrulating embryo in the posterior region of the blastopore. Its expression stays......The molecular control that underlies brachiopod ontogeny is largely unknown. In order to contribute to this issue we analyzed the expression pattern of two homeobox containing genes, Not and Cdx, during development of the rhynchonelliform (i.e., articulate) brachiopod Terebratalia transversa...... completion of larval development, which is marked by a three-lobed body with larval setae. Expression starts at gastrulation in two areas lateral to the blastopore and subsequently extends over the animal pole of the gastrula. With elongation of the gastrula, expression at the animal pole narrows to a small...

  6. Use of Artificial Intelligence and Machine Learning Algorithms with Gene Expression Profiling to Predict Recurrent Nonmuscle Invasive Urothelial Carcinoma of the Bladder.

    Science.gov (United States)

    Bartsch, Georg; Mitra, Anirban P; Mitra, Sheetal A; Almal, Arpit A; Steven, Kenneth E; Skinner, Donald G; Fry, David W; Lenehan, Peter F; Worzel, William P; Cote, Richard J

    2016-02-01

    Due to the high recurrence risk of nonmuscle invasive urothelial carcinoma it is crucial to distinguish patients at high risk from those with indolent disease. In this study we used a machine learning algorithm to identify the genes in patients with nonmuscle invasive urothelial carcinoma at initial presentation that were most predictive of recurrence. We used the genes in a molecular signature to predict recurrence risk within 5 years after transurethral resection of bladder tumor. Whole genome profiling was performed on 112 frozen nonmuscle invasive urothelial carcinoma specimens obtained at first presentation on Human WG-6 BeadChips (Illumina®). A genetic programming algorithm was applied to evolve classifier mathematical models for outcome prediction. Cross-validation based resampling and gene use frequencies were used to identify the most prognostic genes, which were combined into rules used in a voting algorithm to predict the sample target class. Key genes were validated by quantitative polymerase chain reaction. The classifier set included 21 genes that predicted recurrence. Quantitative polymerase chain reaction was done for these genes in a subset of 100 patients. A 5-gene combined rule incorporating a voting algorithm yielded 77% sensitivity and 85% specificity to predict recurrence in the training set, and 69% and 62%, respectively, in the test set. A singular 3-gene rule was constructed that predicted recurrence with 80% sensitivity and 90% specificity in the training set, and 71% and 67%, respectively, in the test set. Using primary nonmuscle invasive urothelial carcinoma from initial occurrences genetic programming identified transcripts in reproducible fashion, which were predictive of recurrence. These findings could potentially impact nonmuscle invasive urothelial carcinoma management. Copyright © 2016 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.

  7. An algorithm to discover gene signatures with predictive potential

    Directory of Open Access Journals (Sweden)

    Hallett Robin M

    2010-09-01

    Full Text Available Abstract Background The advent of global gene expression profiling has generated unprecedented insight into our molecular understanding of cancer, including breast cancer. For example, human breast cancer patients display significant diversity in terms of their survival, recurrence, metastasis as well as response to treatment. These patient outcomes can be predicted by the transcriptional programs of their individual breast tumors. Predictive gene signatures allow us to correctly classify human breast tumors into various risk groups as well as to more accurately target therapy to ensure more durable cancer treatment. Results Here we present a novel algorithm to generate gene signatures with predictive potential. The method first classifies the expression intensity for each gene as determined by global gene expression profiling as low, average or high. The matrix containing the classified data for each gene is then used to score the expression of each gene based its individual ability to predict the patient characteristic of interest. Finally, all examined genes are ranked based on their predictive ability and the most highly ranked genes are included in the master gene signature, which is then ready for use as a predictor. This method was used to accurately predict the survival outcomes in a cohort of human breast cancer patients. Conclusions We confirmed the capacity of our algorithm to generate gene signatures with bona fide predictive ability. The simplicity of our algorithm will enable biological researchers to quickly generate valuable gene signatures without specialized software or extensive bioinformatics training.

  8. Gene expression profile of pulpitis.

    Science.gov (United States)

    Galicia, J C; Henson, B R; Parker, J S; Khan, A A

    2016-06-01

    The cost, prevalence and pain associated with endodontic disease necessitate an understanding of the fundamental molecular aspects of its pathogenesis. This study was aimed to identify the genetic contributors to pulpal pain and inflammation. Inflamed pulps were collected from patients diagnosed with irreversible pulpitis (n=20). Normal pulps from teeth extracted for various reasons served as controls (n=20). Pain level was assessed using a visual analog scale (VAS). Genome-wide microarray analysis was performed using Affymetrix GeneTitan Multichannel Instrument. The difference in gene expression levels were determined by the significance analysis of microarray program using a false discovery rate (q-value) of 5%. Genes involved in immune response, cytokine-cytokine receptor interaction and signaling, integrin cell surface interactions, and others were expressed at relatively higher levels in the pulpitis group. Moreover, several genes known to modulate pain and inflammation showed differential expression in asymptomatic and mild pain patients (⩾30 mm on VAS) compared with those with moderate to severe pain. This exploratory study provides a molecular basis for the clinical diagnosis of pulpitis. With an enhanced understanding of pulpal inflammation, future studies on treatment and management of pulpitis and on pain associated with it can have a biological reference to bridge treatment strategies with pulpal biology.

  9. Aberrant Gene Expression in Acute Myeloid Leukaemia

    DEFF Research Database (Denmark)

    Bagger, Frederik Otzen

    model to investigate the role of telomerase in AML, we were able to translate the observed effect into human AML patients and identify specific genes involved, which also predict survival patterns in AML patients. During these studies we have applied methods for investigating differentially expressed......-based gene-lookup webservices, called HemaExplorer and BloodSpot. These web-services support the aim of making data and analysis of haematopoietic cells from mouse and human accessible for researchers without bioinformatics expertise. Finally, in order to aid the analysis of the very limited number...

  10. Assessment of the Prognostic and Treatment-Predictive Performance of the Combined HOXB13:IL17BR-MGI Gene Expression Signature in the Trans-ATAC Cohort

    Science.gov (United States)

    2013-12-01

    Shak S, Tang G, et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med 2004; 351: 2817–26. 7...Barlow WE, Shak S, et al, for The Breast Cancer Intergroup of North America. Prognostic and predictive value of the 21-gene recurrence score assay in

  11. Codon usage and amino acid usage influence genes expression level.

    Science.gov (United States)

    Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

    2018-02-01

    Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.

  12. GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

    Science.gov (United States)

    Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

    2018-01-01

    When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.

  13. Analysis of baseline gene expression levels from ...

    Science.gov (United States)

    The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv

  14. Gene expression profiles in stages II and III colon cancers

    DEFF Research Database (Denmark)

    Thorsteinsson, Morten; Kirkeby, Lene T; Hansen, Raino

    2012-01-01

    PURPOSE: A 128-gene signature has been proposed to predict outcome in patients with stages II and III colorectal cancers. In the present study, we aimed to reproduce and validate the 128-gene signature in external and independent material. METHODS: Gene expression data from the original material...... were retrieved from the Gene Expression Omnibus (GEO) (n¿=¿111) in addition to a Danish data set (n¿=¿37). All patients had stages II and III colon cancers. A Prediction Analysis of Microarray classifier, based on the 128-gene signature and the original training set of stage I (n¿=¿65) and stage IV (n...... correctly predicted as stage IV-like, and the remaining patients were predicted as stage I-like and unclassifiable, respectively. Stage II patients could not be stratified. CONCLUSIONS: The 128-gene signature showed reproducibility in stage III colon cancer, but could not predict recurrence in stage II...

  15. DNA Topoisomerase I Gene Copy Number and mRNA Expression Assessed as Predictive Biomarkers for Adjuvant Irinotecan in Stage II/III Colon Cancer

    DEFF Research Database (Denmark)

    Nygård, Sune Boris; Vainer, Ben; Nielsen, Signe L

    2016-01-01

    FISH and follow-up data were obtained from 534 patients. TOP1 gain was identified in 27 % using a single-probe enumeration strategy (≥ 4 TOP1 signals per cell), and in 31 % when defined by a TOP1/CEN20 ratio ≥ 1.5. The effect of additional irinotecan was not dependent on TOP1 FISH status. TOP1 m......PURPOSE: Prospective-retrospective assessment of the TOP1 gene copy number and TOP1 mRNA expression as predictive biomarkers for adjuvant irinotecan in stage II/III colon cancer (CC). EXPERIMENTAL DESIGN: Formalin-fixed, paraffin-embedded tissue microarrays were obtained from an adjuvant CC trial...... (PETACC3) where patients were randomized to 5-fluorouracil/folinic acid with or without additional irinotecan. TOP1 copy number status was analyzed by fluorescence in situ hybridization (FISH) using a TOP1/CEN20 dual-probe combination. TOP1 mRNA data were available from previous analyses. RESULTS: TOP1...

  16. Using gene expression noise to understand gene regulation

    NARCIS (Netherlands)

    Munsky, B.; Neuert, G.; van Oudenaarden, A.

    2012-01-01

    Phenotypic variation is ubiquitous in biology and is often traceable to underlying genetic and environmental variation. However, even genetically identical cells in identical environments display variable phenotypes. Stochastic gene expression, or gene expression "noise," has been suggested as a

  17. Regulation of methane genes and genome expression

    Energy Technology Data Exchange (ETDEWEB)

    John N. Reeve

    2009-09-09

    At the start of this project, it was known that methanogens were Archaeabacteria (now Archaea) and were therefore predicted to have gene expression and regulatory systems different from Bacteria, but few of the molecular biology details were established. The goals were then to establish the structures and organizations of genes in methanogens, and to develop the genetic technologies needed to investigate and dissect methanogen gene expression and regulation in vivo. By cloning and sequencing, we established the gene and operon structures of all of the “methane” genes that encode the enzymes that catalyze methane biosynthesis from carbon dioxide and hydrogen. This work identified unique sequences in the methane gene that we designated mcrA, that encodes the largest subunit of methyl-coenzyme M reductase, that could be used to identify methanogen DNA and establish methanogen phylogenetic relationships. McrA sequences are now the accepted standard and used extensively as hybridization probes to identify and quantify methanogens in environmental research. With the methane genes in hand, we used northern blot and then later whole-genome microarray hybridization analyses to establish how growth phase and substrate availability regulated methane gene expression in Methanobacterium thermautotrophicus ΔH (now Methanothermobacter thermautotrophicus). Isoenzymes or pairs of functionally equivalent enzymes catalyze several steps in the hydrogen-dependent reduction of carbon dioxide to methane. We established that hydrogen availability determine which of these pairs of methane genes is expressed and therefore which of the alternative enzymes is employed to catalyze methane biosynthesis under different environmental conditions. As were unable to establish a reliable genetic system for M. thermautotrophicus, we developed in vitro transcription as an alternative system to investigate methanogen gene expression and regulation. This led to the discovery that an archaeal protein

  18. High E6 Gene Expression Predicts for Distant Metastasis and Poor Survival in Patients With HPV-Positive Oropharyngeal Squamous Cell Carcinoma

    Energy Technology Data Exchange (ETDEWEB)

    Khwaja, Shariq S.; Baker, Callie; Haynes, Wesley; Spencer, Christopher R.; Gay, Hiram; Thorstad, Wade [Department of Radiation Oncology, Washington University School of Medicine, St. Louis, Missouri (United States); Adkins, Douglas R. [Division of Medical Oncology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, Missouri (United States); Nussenbaum, Brian [Department of Otolaryngology – Head and Neck Surgery, Washington University School of Medicine, St. Louis, Missouri (United States); Chernock, Rebecca D. [Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri (United States); Lewis, James S. [Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee (United States); Wang, Xiaowei, E-mail: xwang@radonc.wustl.edu [Department of Radiation Oncology, Washington University School of Medicine, St. Louis, Missouri (United States)

    2016-07-15

    Purpose: Patients with human papillomavirus (HPV)–positive oropharyngeal squamous cell carcinoma (OPSCC) have a favorable prognosis. As a result, de-escalation clinical trials are under way. However, approximately 10% of patients will experience distant recurrence even with standard-of-care treatment. Here, we sought to identify novel biomarkers to better risk-stratify HPV-positive patients with OPSCC. Methods and Materials: Gene expression profiling by RNA sequencing (RNA-seq) and quantitative polymerase chain reaction was performed on HPV-positive OPSCC primary tumor specimens from patients with and without distant metastasis (DM). Results: RNA-seq analysis of 39 HPV-positive OPSCC specimens revealed that patients with DM had 2-fold higher E6 gene expression levels than did patients without DM (P=.029). This observation was confirmed in a validation cohort comprising 93 patients with HPV-positive OPSCC. The mean normalized E6 expression level in the 17 recurring primary specimens was 13 ± 2 compared with 8 ± 1 in the remaining 76 nonrecurring primaries (P=.001). Receiver operating characteristic analysis established an E6 expression level of 7.3 as a cutoff for worse recurrence-free survival (RFS). Patients from this cohort with high E6 gene expression (E6-high) (n=51, 55%) had more cancer-related deaths (23% vs 2%, P<.001) and DM (26% vs 5%, P<.001) than did patients with low E6 gene expression (E6-low) (n=42, 45%). Kaplan-Meier survival analysis revealed that E6-high had worse RFS (95% vs 69%, P=.004) and cancer-specific survival (97% vs 79%, P=.007). E6-high maintained statistical significance in multivariate regression models balancing surgery, chemotherapy, nodal stage, and smoking status. Gene set enrichment analysis demonstrated that tumors with high E6 expression were associated with P53, epidermal growth factor receptor, activating transcription factor-2, and transforming growth factor-β signaling pathways. Conclusion: High E6 gene expression

  19. A constructive approach to gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2004-01-01

    Recently, experiments on mRNA abundance (gene expression) have revealed that gene expression shows a stationary organization described by a scale-free distribution. Here we propose a constructive approach to gene expression dynamics which restores the scale-free exponent and describes the intermediate state dynamics. This approach requires only one assumption: Markov property

  20. Analysis of multiplex gene expression maps obtained by voxelation.

    Science.gov (United States)

    An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

    2009-04-29

    results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.

  1. Analysis of multiplex gene expression maps obtained by voxelation

    Directory of Open Access Journals (Sweden)

    Smith Desmond J

    2009-04-01

    cortex and corpus callosum. Conclusion The experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.

  2. Improving the Prediction of Prostate Cancer Overall Survival by Supplementing Readily Available Clinical Data with Gene Expression Levels of IGFBP3 and F3 in Formalin-Fixed Paraffin Embedded Core Needle Biopsy Material.

    Directory of Open Access Journals (Sweden)

    Zhuochun Peng

    Full Text Available A previously reported expression signature of three genes (IGFBP3, F3 and VGLL3 was shown to have potential prognostic value in estimating overall and cancer-specific survivals at diagnosis of prostate cancer in a pilot cohort study using freshly frozen Fine Needle Aspiration (FNA samples.We carried out a new cohort study with 241 prostate cancer patients diagnosed from 2004-2007 with a follow-up exceeding 6 years in order to verify the prognostic value of gene expression signature in formalin fixed paraffin embedded (FFPE prostate core needle biopsy tissue samples. The cohort consisted of four patient groups with different survival times and death causes. A four multiplex one-step RT-qPCR test kit, designed and optimized for measuring the expression signature in FFPE core needle biopsy samples, was used. In archive FFPE biopsy samples the expression differences of two genes (IGFBP3 and F3 were measured. The survival time predictions using the current clinical parameters only, such as age at diagnosis, Gleason score, PSA value and tumor stage, and clinical parameters supplemented with the expression levels of IGFBP3 and F3, were compared.When combined with currently used clinical parameters, the gene expression levels of IGFBP3 and F3 are improving the prediction of survival time as compared to using clinical parameters alone.The assessment of IGFBP3 and F3 gene expression levels in FFPE prostate cancer tissue would provide an improved survival prediction for prostate cancer patients at the time of diagnosis.

  3. Detecting microRNA activity from gene expression data

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-05-18

    Abstract Background MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. Results Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. Conclusions We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  4. Detecting microRNA activity from gene expression data.

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-01-01

    BACKGROUND: MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. RESULTS: Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. CONCLUSIONS: We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  5. Bridging cancer biology with the clinic: relative expression of a GRHL2-mediated gene-set pair predicts breast cancer metastasis.

    Directory of Open Access Journals (Sweden)

    Xinan Yang

    Full Text Available Identification and characterization of crucial gene target(s that will allow focused therapeutics development remains a challenge. We have interrogated the putative therapeutic targets associated with the transcription factor Grainy head-like 2 (GRHL2, a critical epithelial regulatory factor. We demonstrate the possibility to define the molecular functions of critical genes in terms of their personalized expression profiles, allowing appropriate functional conclusions to be derived. A novel methodology, relative expression analysis with gene-set pairs (RXA-GSP, is designed to explore the potential clinical utility of cancer-biology discovery. Observing that Grhl2-overexpression leads to increased metastatic potential in vitro, we established a model assuming Grhl2-induced or -inhibited genes confer poor or favorable prognosis respectively for cancer metastasis. Training on public gene expression profiles of 995 breast cancer patients, this method prioritized one gene-set pair (GRHL2, CDH2, FN1, CITED2, MKI67 versus CTNNB1 and CTNNA3 from all 2717 possible gene-set pairs (GSPs. The identified GSP significantly dichotomized 295 independent patients for metastasis-free survival (log-rank tested p = 0.002; severe empirical p = 0.035. It also showed evidence of clinical prognostication in another independent 388 patients collected from three studies (log-rank tested p = 3.3e-6. This GSP is independent of most traditional prognostic indicators, and is only significantly associated with the histological grade of breast cancer (p = 0.0017, a GRHL2-associated clinical character (p = 6.8e-6, Spearman correlation, suggesting that this GSP is reflective of GRHL2-mediated events. Furthermore, a literature review indicates the therapeutic potential of the identified genes. This research demonstrates a novel strategy to integrate both biological experiments and clinical gene expression profiles for extracting and elucidating the genomic

  6. Modulation of gene expression made easy

    DEFF Research Database (Denmark)

    Solem, Christian; Jensen, Peter Ruhdal

    2002-01-01

    A new approach for modulating gene expression, based on randomization of promoter (spacer) sequences, was developed. The method was applied to chromosomal genes in Lactococcus lactis and shown to generate libraries of clones with broad ranges of expression levels of target genes. In one example...... that the method can be applied to modulating the expression of native genes on the chromosome. We constructed a series of strains in which the expression of the las operon, containing the genes pfk, pyk, and ldh, was modulated by integrating a truncated copy of the pfk gene. Importantly, the modulation affected...

  7. Synthetic promoter libraries- tuning of gene expression

    DEFF Research Database (Denmark)

    Hammer, Karin; Mijakovic, Ivan; Jensen, Peter Ruhdal

    2006-01-01

    knockout and strong overexpression. However, applications such as metabolic optimization and control analysis necessitate a continuous set of expression levels with only slight increments in strength to cover a specific window around the wildtype expression level of the studied gene; this requirement can......The study of gene function often requires changing the expression of a gene and evaluating the consequences. In principle, the expression of any given gene can be modulated in a quasi-continuum of discrete expression levels but the traditional approaches are usually limited to two extremes: gene...

  8. Genome-wide prediction and functional validation of promoter motifs regulating gene expression in spore and infection stages of Phytophthora infestans.

    Directory of Open Access Journals (Sweden)

    Sourav Roy

    2013-03-01

    Full Text Available Most eukaryotic pathogens have complex life cycles in which gene expression networks orchestrate the formation of cells specialized for dissemination or host colonization. In the oomycete Phytophthora infestans, the potato late blight pathogen, major shifts in mRNA profiles during developmental transitions were identified using microarrays. We used those data with search algorithms to discover about 100 motifs that are over-represented in promoters of genes up-regulated in hyphae, sporangia, sporangia undergoing zoosporogenesis, swimming zoospores, or germinated cysts forming appressoria (infection structures. Most of the putative stage-specific transcription factor binding sites (TFBSs thus identified had features typical of TFBSs such as position or orientation bias, palindromy, and conservation in related species. Each of six motifs tested in P. infestans transformants using the GUS reporter gene conferred the expected stage-specific expression pattern, and several were shown to bind nuclear proteins in gel-shift assays. Motifs linked to the appressoria-forming stage, including a functionally validated TFBS, were over-represented in promoters of genes encoding effectors and other pathogenesis-related proteins. To understand how promoter and genome architecture influence expression, we also mapped transcription patterns to the P. infestans genome assembly. Adjacent genes were not typically induced in the same stage, including genes transcribed in opposite directions from small intergenic regions, but co-regulated gene pairs occurred more than expected by random chance. These data help illuminate the processes regulating development and pathogenesis, and will enable future attempts to purify the cognate transcription factors.

  9. Predicting tissue-specific expressions based on sequence characteristics

    KAUST Repository

    Paik, Hyojung; Ryu, Tae Woo; Heo, Hyoungsam; Seo, Seungwon; Lee, Doheon; Hur, Cheolgoo

    2011-01-01

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  10. Predicting tissue-specific expressions based on sequence characteristics

    KAUST Repository

    Paik, Hyojung

    2011-04-30

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  11. Adaptive Evolution of Gene Expression in Drosophila

    Directory of Open Access Journals (Sweden)

    Armita Nourmohammad

    2017-08-01

    Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.

  12. Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data

    OpenAIRE

    Ezer, Daphne; Moignard, Victoria; G?ttgens, Berthold; Adryan, Boris

    2016-01-01

    Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete ...

  13. The evolution of gene expression in primates

    OpenAIRE

    Tashakkori Ghanbarian, Avazeh

    2015-01-01

    The evolution of a gene’s expression profile is commonly assumed to be independent of its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between expression of neighboring genes in extant taxa. Indeed, in all eukaryotic genomes, genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their e...

  14. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  15. Gene expression in Pseudomonas aeruginosa swarming motility

    Directory of Open Access Journals (Sweden)

    Déziel Eric

    2010-10-01

    Full Text Available Abstract Background The bacterium Pseudomonas aeruginosa is capable of three types of motilities: swimming, twitching and swarming. The latter is characterized by a fast and coordinated group movement over a semi-solid surface resulting from intercellular interactions and morphological differentiation. A striking feature of swarming motility is the complex fractal-like patterns displayed by migrating bacteria while they move away from their inoculation point. This type of group behaviour is still poorly understood and its characterization provides important information on bacterial structured communities such as biofilms. Using GeneChip® Affymetrix microarrays, we obtained the transcriptomic profiles of both bacterial populations located at the tip of migrating tendrils and swarm center of swarming colonies and compared these profiles to that of a bacterial control population grown on the same media but solidified to not allow swarming motility. Results Microarray raw data were corrected for background noise with the RMA algorithm and quantile normalized. Differentially expressed genes between the three conditions were selected using a threshold of 1.5 log2-fold, which gave a total of 378 selected genes (6.3% of the predicted open reading frames of strain PA14. Major shifts in gene expression patterns are observed in each growth conditions, highlighting the presence of distinct bacterial subpopulations within a swarming colony (tendril tips vs. swarm center. Unexpectedly, microarrays expression data reveal that a minority of genes are up-regulated in tendril tip populations. Among them, we found energy metabolism, ribosomal protein and transport of small molecules related genes. On the other hand, many well-known virulence factors genes were globally repressed in tendril tip cells. Swarm center cells are distinct and appear to be under oxidative and copper stress responses. Conclusions Results reported in this study show that, as opposed to

  16. Peak flood estimation using gene expression programming

    Science.gov (United States)

    Zorn, Conrad R.; Shamseldin, Asaad Y.

    2015-12-01

    As a case study for the Auckland Region of New Zealand, this paper investigates the potential use of gene-expression programming (GEP) in predicting specific return period events in comparison to the established and widely used Regional Flood Estimation (RFE) method. Initially calibrated to 14 gauged sites, the GEP derived model was further validated to 10 and 100 year flood events with a relative errors of 29% and 18%, respectively. This is compared to the RFE method providing 48% and 44% errors for the same flood events. While the effectiveness of GEP in predicting specific return period events is made apparent, it is argued that the derived equations should be used in conjunction with those existing methodologies rather than as a replacement.

  17. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

    2012-05-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  18. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy; Bachkirova, Elena; Rey, Michael

    2013-10-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  19. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  20. Gene expression inference with deep learning.

    Science.gov (United States)

    Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui

    2016-06-15

    Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX CONTACT: xhx@ics.uci.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Vertebrate gene predictions and the problem of large genes

    DEFF Research Database (Denmark)

    Wang, Jun; Li, ShengTing; Zhang, Yong

    2003-01-01

    To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistent...

  2. Gene Expression Analysis of Four Radiation-resistant Bacteria

    OpenAIRE

    Gao, Na; Ma, Bin-Guang; Zhang, Yu-Sheng; Song, Qin; Chen, Ling-Ling; Zhang, Hong-Yu

    2009-01-01

    To investigate the general radiation-resistant mechanisms of bacteria, bioinformatic method was employed to predict highly expressed genes for four radiation-resistant bacteria, i.e. Deinococcus geothermalis (D. geo), Deinococcus radiodurans (D. rad), Kineococcus radiotolerans (K. rad) and Rubrobacter xylanophilus (R. xyl). It is revealed that most of the three reference gene sets, i.e. ribosomal proteins, transcription factors and major chaperones, are generally highly expressed in the four ...

  3. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    Science.gov (United States)

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  4. Determinants of human adipose tissue gene expression

    DEFF Research Database (Denmark)

    Viguerie, Nathalie; Montastier, Emilie; Maoret, Jean-José

    2012-01-01

    weight maintenance diets. For 175 genes, opposite regulation was observed during calorie restriction and weight maintenance phases, independently of variations in body weight. Metabolism and immunity genes showed inverse profiles. During the dietary intervention, network-based analyses revealed strong...... interconnection between expression of genes involved in de novo lipogenesis and components of the metabolic syndrome. Sex had a marked influence on AT expression of 88 transcripts, which persisted during the entire dietary intervention and after control for fat mass. In women, the influence of body mass index...... on expression of a subset of genes persisted during the dietary intervention. Twenty-two genes revealed a metabolic syndrome signature common to men and women. Genetic control of AT gene expression by cis signals was observed for 46 genes. Dietary intervention, sex, and cis genetic variants independently...

  5. Deriving Trading Rules Using Gene Expression Programming

    Directory of Open Access Journals (Sweden)

    Adrian VISOIU

    2011-01-01

    Full Text Available This paper presents how buy and sell trading rules are generated using gene expression programming with special setup. Market concepts are presented and market analysis is discussed with emphasis on technical analysis and quantitative methods. The use of genetic algorithms in deriving trading rules is presented. Gene expression programming is applied in a form where multiple types of operators and operands are used. This gives birth to multiple gene contexts and references between genes in order to keep the linear structure of the gene expression programming chromosome. The setup of multiple gene contexts is presented. The case study shows how to use the proposed gene setup to derive trading rules encoded by Boolean expressions, using a dataset with the reference exchange rates between the Euro and the Romanian leu. The conclusions highlight the positive results obtained in deriving useful trading rules.

  6. Semi-supervised prediction of gene regulatory networks using ...

    Indian Academy of Sciences (India)

    2015-09-28

    Sep 28, 2015 ... Use of computational methods to predict gene regulatory networks (GRNs) from gene expression data is a challenging ... two types of methods differ primarily based on whether ..... negligible, allowing us to draw the qualitative conclusions .... research will be conducted to develop additional biologically.

  7. Profiling Gene Expression in Germinating Brassica Roots.

    Science.gov (United States)

    Park, Myoung Ryoul; Wang, Yi-Hong; Hasenstein, Karl H

    2014-01-01

    Based on previously developed solid-phase gene extraction (SPGE) we examined the mRNA profile in primary roots of Brassica rapa seedlings for highly expressed genes like ACT7 (actin7), TUB (tubulin1), UBQ (ubiquitin), and low expressed GLK (glucokinase) during the first day post-germination. The assessment was based on the mRNA load of the SPGE probe of about 2.1 ng. The number of copies of the investigated genes changed spatially along the length of primary roots. The expression level of all genes differed significantly at each sample position. Among the examined genes ACT7 expression was most even along the root. UBQ was highest at the tip and root-shoot junction (RS). TUB and GLK showed a basipetal gradient. The temporal expression of UBQ was highest in the MZ 9 h after primary root emergence and higher than at any other sample position. Expressions of GLK in EZ and RS increased gradually over time. SPGE extraction is the result of oligo-dT and oligo-dA hybridization and the results illustrate that SPGE can be used for gene expression profiling at high spatial and temporal resolution. SPGE needles can be used within two weeks when stored at 4 °C. Our data indicate that gene expression studies that are based on the entire root miss important differences in gene expression that SPGE is able to resolve for example growth adjustments during gravitropism.

  8. Chromatin loops, gene positioning, and gene expression

    NARCIS (Netherlands)

    Holwerda, S.; de Laat, W.

    2012-01-01

    Technological developments and intense research over the last years have led to a better understanding of the 3D structure of the genome and its influence on genome function inside the cell nucleus. We will summarize topological studies performed on four model gene loci: the alpha- and beta-globin

  9. Using PCR to Target Misconceptions about Gene Expression

    Directory of Open Access Journals (Sweden)

    Leslie K. Wright

    2013-02-01

    Full Text Available We present a PCR-based laboratory exercise that can be used with first- or second-year biology students to help overcome common misconceptions about gene expression. Biology students typically do not have a clear understanding of the difference between genes (DNA and gene expression (mRNA/protein and often believe that genes exist in an organism or cell only when they are expressed. This laboratory exercise allows students to carry out a PCR-based experiment designed to challenge their misunderstanding of the difference between genes and gene expression. Students first transform E. coli with an inducible GFP gene containing plasmid and observe induced and un-induced colonies. The following exercise creates cognitive dissonance when actual PCR results contradict their initial (incorrect predictions of the presence of the GFP gene in transformed cells. Field testing of this laboratory exercise resulted in learning gains on both knowledge and application questions on concepts related to genes and gene expression.

  10. Serial analysis of gene expression (SAGE)

    NARCIS (Netherlands)

    van Ruissen, Fred; Baas, Frank

    2007-01-01

    In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE

  11. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In

  12. Expression of Sox genes in tooth development.

    Science.gov (United States)

    Kawasaki, Katsushige; Kawasaki, Maiko; Watanabe, Momoko; Idrus, Erik; Nagai, Takahiro; Oommen, Shelly; Maeda, Takeyasu; Hagiwara, Nobuko; Que, Jianwen; Sharpe, Paul T; Ohazama, Atsushi

    2015-01-01

    Members of the Sox gene family play roles in many biological processes including organogenesis. We carried out comparative in situ hybridization analysis of seventeen sox genes (Sox1-14, 17, 18, 21) during murine odontogenesis from the epithelial thickening to the cytodifferentiation stages. Localized expression of five Sox genes (Sox6, 9, 13, 14 and 21) was observed in tooth bud epithelium. Sox13 showed restricted expression in the primary enamel knots. At the early bell stage, three Sox genes (Sox8, 11, 17 and 21) were expressed in pre-ameloblasts, whereas two others (Sox5 and 18) showed expression in odontoblasts. Sox genes thus showed a dynamic spatio-temporal expression during tooth development.

  13. Positron emission tomography imaging of gene expression

    International Nuclear Information System (INIS)

    Tang Ganghua

    2001-01-01

    The merging of molecular biology and nuclear medicine is developed into molecular nuclear medicine. Positron emission tomography (PET) of gene expression in molecular nuclear medicine has become an attractive area. Positron emission tomography imaging gene expression includes the antisense PET imaging and the reporter gene PET imaging. It is likely that the antisense PET imaging will lag behind the reporter gene PET imaging because of the numerous issues that have not yet to be resolved with this approach. The reporter gene PET imaging has wide application into animal experimental research and human applications of this approach will likely be reported soon

  14. Gene prediction validation and functional analysis of redundant pathways

    DEFF Research Database (Denmark)

    Sønderkær, Mads

    2011-01-01

    have employed a large mRNA-seq data set to improve and validate ab initio predicted gene models. This direct experimental evidence also provides reliable determinations of UTR regions and polyadenylation sites, which are not easily predicted in plants. Furthermore, once an annotated genome sequence...... is available, gene expression by mRNA-Seq enables acquisition of a more complete overview of gene isoform usage in complex enzymatic pathways enabling the identification of key genes. Metabolism in potatoes This information is useful e.g. for crop improvement based on manipulation of agronomically important...

  15. Gene expression and 18FDG uptake in atherosclerotic carotid plaques

    DEFF Research Database (Denmark)

    Pedersen, Sune Folke; Graebe, Martin; Fisker Hag, Anne Mette

    2010-01-01

    ) and an additional ipsilateral internal carotid artery stenosis of greater than 60% were recruited. FDG uptake in the carotids was determined by PET/computed tomography and expressed as mean and maximal standardized uptake values (SUVmean and SUVmax). The atherosclerotic plaques were subsequently recovered...... by carotid endarterectomy. The gene expression of markers of vulnerability - CD68, IL-18, matrix metalloproteinase 9, cathepsin K, GLUT-1, and hexokinase type II (HK2) - were measured in plaques by quantitative PCR. RESULTS: In a multivariate linear regression model, GLUT-1, CD68, cathepsin K, and HK2 gene...... expression remained in the final model as predictive variables of FDG accumulation calculated as SUVmean (R=0.26, PK, and HK2 gene expression as independent predictive variables of FDG accumulation calculated...

  16. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. A network approach to predict pathogenic genes for Fusarium graminearum.

    Science.gov (United States)

    Liu, Xiaoping; Tang, Wei-Hua; Zhao, Xing-Ming; Chen, Luonan

    2010-10-04

    Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB), which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN) of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other pathogenic fungi, which

  18. Altered expression of HER-2 and the mismatch repair genes MLH1 and MSH2 predicts the outcome of T1 high-grade bladder cancer.

    Science.gov (United States)

    Sanguedolce, Francesca; Cormio, Antonella; Massenio, Paolo; Pedicillo, Maria C; Cagiano, Simona; Fortunato, Francesca; Calò, Beppe; Di Fino, Giuseppe; Carrieri, Giuseppe; Bufo, Pantaleo; Cormio, Luigi

    2018-04-01

    The identification of factors predicting the outcome of stage T1 high-grade bladder cancer (BC) is a major clinical issue. We performed immunohistochemistry to assess the role of human epidermal growth factor receptor-2 (HER-2) and microsatellite instability (MSI) factors MutL homologue 1 (MLH1) and MutS homologue 2 (MSH2) in predicting recurrence and progression of T1 high-grade BCs having undergone transurethral resection of bladder tumor (TURBT) alone or TURBT + intravesical instillations of bacillus Calmette-Guerin (BCG). HER-2 overexpression was a significant predictor of disease-free survival (DFS) in the overall as well as in the two patients' population; as for progression-free survival (PFS), it was significant in the overall but not in the two patients' population. MLH1 was an independent predictor of PFS only in patients treated with BCG and MSH2 failed to predict DFS and PFS in all populations. Most importantly, the higher the number of altered markers the lowers the DFS and PFS. In multivariate Cox proportional-hazards regression analysis, the number of altered molecular markers and BCG treatment were significant predictors (p = 0.0004 and 0.0283, respectively) of DFS, whereas the number of altered molecular markers was the only significant predictor (p = 0.0054) of PFS. Altered expression of the proto-oncogene HER-2 and the two molecular markers of genetic instability MLH1 and MSH2 predicted T1 high-grade BC outcome with the higher the number of altered markers the lower the DFS and PFS. These findings provide grounds for further testing them in predicting the outcome of this challenging disease.

  19. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  20. A comparative gene expression database for invertebrates

    Directory of Open Access Journals (Sweden)

    Ormestad Mattias

    2011-08-01

    Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN projects.

  1. Adaptive Evolution of Gene Expression in Drosophila.

    Science.gov (United States)

    Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

    2017-08-08

    Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Differential gene expression during Trypanosoma cruzi metacyclogenesis

    Directory of Open Access Journals (Sweden)

    Marco Aurelio Krieger

    1999-09-01

    Full Text Available The transformation of epimastigotes into metacyclic trypomastigotes involves changes in the pattern of expressed genes, resulting in important morphological and functional differences between these developmental forms of Trypanosoma cruzi. In order to identify and characterize genes involved in triggering the metacyclogenesis process and in conferring to metacyclic trypomastigotes their stage specific biological properties, we have developed a method allowing the isolation of genes specifically expressed when comparing two close related cell populations (representation of differential expression or RDE. The method is based on the PCR amplification of gene sequences selected by hybridizing and subtracting the populations in such a way that after some cycles of hybridization-amplification genes specific to a given population are highly enriched. The use of this method in the analysis of differential gene expression during T. cruzi metacyclogenesis (6 hr and 24 hr of differentiation and metacyclic trypomastigotes resulted in the isolation of several clones from each time point. Northern blot analysis showed that some genes are transiently expressed (6 hr and 24 hr differentiating cells, while others are present in differentiating cells and in metacyclic trypomastigotes. Nucleotide sequencing of six clones characterized so far showed that they do not display any homology to gene sequences available in the GeneBank.

  3. Gene expression profiling reveals multiple toxicity endpoints induced by hepatotoxicants

    Energy Technology Data Exchange (ETDEWEB)

    Huang Qihong; Jin Xidong; Gaillard, Elias T.; Knight, Brian L.; Pack, Franklin D.; Stoltz, James H.; Jayadev, Supriya; Blanchard, Kerry T

    2004-05-18

    Microarray technology continues to gain increased acceptance in the drug development process, particularly at the stage of toxicology and safety assessment. In the current study, microarrays were used to investigate gene expression changes associated with hepatotoxicity, the most commonly reported clinical liability with pharmaceutical agents. Acetaminophen, methotrexate, methapyrilene, furan and phenytoin were used as benchmark compounds capable of inducing specific but different types of hepatotoxicity. The goal of the work was to define gene expression profiles capable of distinguishing the different subtypes of hepatotoxicity. Sprague-Dawley rats were orally dosed with acetaminophen (single dose, 4500 mg/kg for 6, 24 and 72 h), methotrexate (1 mg/kg per day for 1, 7 and 14 days), methapyrilene (100 mg/kg per day for 3 and 7 days), furan (40 mg/kg per day for 1, 3, 7 and 14 days) or phenytoin (300 mg/kg per day for 14 days). Hepatic gene expression was assessed using toxicology-specific gene arrays containing 684 target genes or expressed sequence tags (ESTs). Principal component analysis (PCA) of gene expression data was able to provide a clear distinction of each compound, suggesting that gene expression data can be used to discern different hepatotoxic agents and toxicity endpoints. Gene expression data were applied to the multiplicity-adjusted permutation test and significantly changed genes were categorized and correlated to hepatotoxic endpoints. Repression of enzymes involved in lipid oxidation (acyl-CoA dehydrogenase, medium chain, enoyl CoA hydratase, very long-chain acyl-CoA synthetase) were associated with microvesicular lipidosis. Likewise, subsets of genes associated with hepatotocellular necrosis, inflammation, hepatitis, bile duct hyperplasia and fibrosis have been identified. The current study illustrates that expression profiling can be used to: (1) distinguish different hepatotoxic endpoints; (2) predict the development of toxic endpoints; and

  4. Stochastic gene expression in Arabidopsis thaliana.

    Science.gov (United States)

    Araújo, Ilka Schultheiß; Pietsch, Jessica Magdalena; Keizer, Emma Mathilde; Greese, Bettina; Balkunde, Rachappa; Fleck, Christian; Hülskamp, Martin

    2017-12-14

    Although plant development is highly reproducible, some stochasticity exists. This developmental stochasticity may be caused by noisy gene expression. Here we analyze the fluctuation of protein expression in Arabidopsis thaliana. Using the photoconvertible KikGR marker, we show that the protein expressions of individual cells fluctuate over time. A dual reporter system was used to study extrinsic and intrinsic noise of marker gene expression. We report that extrinsic noise is higher than intrinsic noise and that extrinsic noise in stomata is clearly lower in comparison to several other tissues/cell types. Finally, we show that cells are coupled with respect to stochastic protein expression in young leaves, hypocotyls and roots but not in mature leaves. Our data indicate that stochasticity of gene expression can vary between tissues/cell types and that it can be coupled in a non-cell-autonomous manner.

  5. Identification of differentially expressed genes in cutaneous squamous cell carcinoma by microarray expression profiling

    Directory of Open Access Journals (Sweden)

    Sterry Wolfram

    2006-08-01

    Full Text Available Abstract Background Carcinogenesis is a multi-step process indicated by several genes up- or down-regulated during tumor progression. This study examined and identified differentially expressed genes in cutaneous squamous cell carcinoma (SCC. Results Three different biopsies of 5 immunosuppressed organ-transplanted recipients each normal skin (all were pooled, actinic keratosis (AK (two were pooled, and invasive SCC and additionally 5 normal skin tissues from immunocompetent patients were analyzed. Thus, total RNA of 15 specimens were used for hybridization with Affymetrix HG-U133A microarray technology containing 22,283 genes. Data analyses were performed by prediction analysis of microarrays using nearest shrunken centroids with the threshold 3.5 and ANOVA analysis was independently performed in order to identify differentially expressed genes (p vs. AK and SCC were observed for 118 genes. Conclusion The majority of identified differentially expressed genes in cutaneous SCC were previously not described.

  6. Reduction in the copy number and expression level of the recurrent human papillomavirus integration gene fragile histidine triad (FHIT predicts the transition of cervical lesions.

    Directory of Open Access Journals (Sweden)

    Liming Wang

    Full Text Available Cervical cancer is the second most common cancer and the third leading cause of cancer death in females worldwide, especially in developing countries. High risk human papillomavirus (HR-HPV infection causes cervical cancer and precancerous cervical intraepithelial neoplasia (CIN. Integration of the HR-HPV genome into the host chromatin is an important step in cervical carcinogenesis. The detection of integrated papillomavirus sequences-PCR (DIPS-PCR allowed us to explore HPV integration in the human genome and to determine the pattern of this integration. We performed DIPS-PCR for 4 cell lines including 3 cervical cancer cell lines and 40 tissue samples. Overall, 32 HR-HPV integration loci were detected in the clinical samples and the HeLa and SiHa cell lines. Among all the integration loci, we identified three recurrent integration loci: 3p14.2 (3 samples, 13q22.1 (2 samples and a SiHa cell line and 8q24 (1 sample and a HeLa cell line. To further explore the effect of HR-HPV integration in the 3p14.2 locus, we used fluorescence in situ hybridization (FISH to determine the copy number of the 3p14.2 locus and immunohistochemistry (IHC to determine the protein expression levels of the related FHIT gene in the clinical samples. Both the 3p14.2 locus copy number and FHIT protein expression levels showed significant decreases when CIN transitioned to cervical cancer. HPV copy number was also evaluated in these clinical samples, and the copy number of HPV increased significantly between CIN and cervical cancer samples. Finally, we employed receiver operating characteristic curve (ROC curve analysis to evaluate the potential of all these indexes in distinguishing CIN and cervical cancer, and the HPV copy number, FHIT copy number and FHIT protein expression levels have good diagnostic efficiencies.

  7. Development of Gene Expression Signatures for Practical Radiation Biodosimetry

    International Nuclear Information System (INIS)

    Paul, Sunirmal; Amundson, Sally A.

    2008-01-01

    Purpose: In a large-scale radiologic emergency, estimates of exposure doses and radiation injury would be required for individuals without physical dosimeters. Current methods are inadequate for the task, so we are developing gene expression profiles for radiation biodosimetry. This approach could provide both an estimate of physical radiation dose and an indication of the extent of individual injury or future risk. Methods and Materials: We used whole genome microarray expression profiling as a discovery platform to identify genes with the potential to predict radiation dose across an exposure range relevant for medical decision making in a radiologic emergency. Human peripheral blood from 10 healthy donors was irradiated ex vivo, and global gene expression was measured both 6 and 24 h after exposure. Results: A 74-gene signature was identified that distinguishes between four radiation doses (0.5, 2, 5, and 8 Gy) and controls. More than one third of these genes are regulated by TP53. A nearest centroid classifier using these same 74 genes correctly predicted 98% of samples taken either 6 h or 24 h after treatment as unexposed, exposed to 0.5, 2, or ≥5 Gy. Expression patterns of five genes (CDKN1A, FDXR, SESN1, BBC3, and PHPT1) from this signature were also confirmed by real-time polymerase chain reaction. Conclusion: The ability of a single gene set to predict radiation dose throughout a window of time without need for individual pre-exposure controls represents an important advance in the development of gene expression for biodosimetry

  8. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  9. Gene expression in periodontal tissues following treatment

    Directory of Open Access Journals (Sweden)

    Eisenacher Martin

    2008-07-01

    Full Text Available Abstract Background In periodontitis, treatment aimed at controlling the periodontal biofilm infection results in a resolution of the clinical and histological signs of inflammation. Although the cell types found in periodontal tissues following treatment have been well described, information on gene expression is limited to few candidate genes. Therefore, the aim of the study was to determine the expression profiles of immune and inflammatory genes in periodontal tissues from sites with severe chronic periodontitis following periodontal therapy in order to identify genes involved in tissue homeostasis. Gingival biopsies from 12 patients with severe chronic periodontitis were taken six to eight weeks following non-surgical periodontal therapy, and from 11 healthy controls. As internal standard, RNA of an immortalized human keratinocyte line (HaCaT was used. Total RNA was subjected to gene expression profiling using a commercially available microarray system focusing on inflammation-related genes. Post-hoc confirmation of selected genes was done by Realtime-PCR. Results Out of the 136 genes analyzed, the 5% most strongly expressed genes compared to healthy controls were Interleukin-12A (IL-12A, Versican (CSPG-2, Matrixmetalloproteinase-1 (MMP-1, Down syndrome critical region protein-1 (DSCR-1, Macrophage inflammatory protein-2β (Cxcl-3, Inhibitor of apoptosis protein-1 (BIRC-1, Cluster of differentiation antigen 38 (CD38, Regulator of G-protein signalling-1 (RGS-1, and Finkel-Biskis-Jinkins murine osteosarcoma virus oncogene (C-FOS; the 5% least strongly expressed genes were Receptor-interacting Serine/Threonine Kinase-2 (RIP-2, Complement component 3 (C3, Prostaglandin-endoperoxide synthase-2 (COX-2, Interleukin-8 (IL-8, Endothelin-1 (EDN-1, Plasminogen activator inhibitor type-2 (PAI-2, Matrix-metalloproteinase-14 (MMP-14, and Interferon regulating factor-7 (IRF-7. Conclusion Gene expression profiles found in periodontal tissues following

  10. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  11. Widespread ectopic expression of olfactory receptor genes

    Directory of Open Access Journals (Sweden)

    Yanai Itai

    2006-05-01

    Full Text Available Abstract Background Olfactory receptors (ORs are the largest gene family in the human genome. Although they are expected to be expressed specifically in olfactory tissues, some ectopic expression has been reported, with special emphasis on sperm and testis. The present study systematically explores the expression patterns of OR genes in a large number of tissues and assesses the potential functional implication of such ectopic expression. Results We analyzed the expression of hundreds of human and mouse OR transcripts, via EST and microarray data, in several dozens of human and mouse tissues. Different tissues had specific, relatively small OR gene subsets which had particularly high expression levels. In testis, average expression was not particularly high, and very few highly expressed genes were found, none corresponding to ORs previously implicated in sperm chemotaxis. Higher expression levels were more common for genes with a non-OR genomic neighbor. Importantly, no correlation in expression levels was detected for human-mouse orthologous pairs. Also, no significant difference in expression levels was seen between intact and pseudogenized ORs, except for the pseudogenes of subfamily 7E which has undergone a human-specific expansion. Conclusion The OR superfamily as a whole, show widespread, locus-dependent and heterogeneous expression, in agreement with a neutral or near neutral evolutionary model for transcription control. These results cannot reject the possibility that small OR subsets might play functional roles in different tissues, however considerable care should be exerted when offering a functional interpretation for ectopic OR expression based only on transcription information.

  12. Regulation of Gene Expression in Protozoa Parasites

    Directory of Open Access Journals (Sweden)

    Consuelo Gomez

    2010-01-01

    Full Text Available Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  13. Regulation of gene expression in protozoa parasites.

    Science.gov (United States)

    Gomez, Consuelo; Esther Ramirez, M; Calixto-Galvez, Mercedes; Medel, Olivia; Rodríguez, Mario A

    2010-01-01

    Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  14. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.; Mallick, B. K.

    2013-01-01

    graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which

  15. Gene Expression and Microarray Investigation of Dendrobium ...

    African Journals Online (AJOL)

    blood glucose > 16.7 mmol/L were used as the model group and treated with Dendrobium mixture. (DEN ... Keywords: Diabetes, Gene expression, Dendrobium mixture, Microarray testing ..... homeostasis in airway smooth muscle. Am J.

  16. Drosophila melanogaster gene expression changes after spaceflight.

    Data.gov (United States)

    National Aeronautics and Space Administration — Gene expression levels were determined in 3rd instar and adult Drosophila melanogaster reared during spaceflight to elucidate the genetic and molecular mechanisms...

  17. Exertional Heat Illness and Human Gene Expression

    National Research Council Canada - National Science Library

    Sonna, L.A; Sawka, M. N; Lilly, C. M

    2007-01-01

    Microarray analysis of gene expression at the level of RNA has generated new insights into the relationship between cellular responses to acute heat shock in vitro, exercise, and exertional heat illness...

  18. Expression Profiling of Tyrosine Kinase Genes

    National Research Council Canada - National Science Library

    Weier, Heinz

    2000-01-01

    ... of these genes parallels the progression of tumors to a more malignant phenotype. We developed a DNA micro-array based screening system to monitor the level of expression of tyrosine kinase (tk...

  19. Regulation of meiotic gene expression in plants

    Directory of Open Access Journals (Sweden)

    Adele eZhou

    2014-08-01

    Full Text Available With the recent advances in genomics and sequencing technologies, databases of transcriptomes representing many cellular processes have been built. Meiotic transcriptomes in plants have been studied in Arabidopsis thaliana, rice (Oryza sativa, wheat (Triticum aestivum, petunia (Petunia hybrida, sunflower (Helianthus annuus, and maize (Zea mays. Studies in all organisms, but particularly in plants, indicate that a very large number of genes are expressed during meiosis, though relatively few of them seem to be required for the completion of meiosis. In this review, we focus on gene expression at the RNA level and analyze the meiotic transcriptome datasets and explore expression patterns of known meiotic genes to elucidate how gene expression could be regulated during meiosis. We also discuss mechanisms, such as chromatin organization and non-coding RNAs, that might be involved in the regulation of meiotic transcription patterns.

  20. Identification of genes preferentially expressed during

    African Journals Online (AJOL)

    雨林木风

    2012-08-16

    Aug 16, 2012 ... The suppression subtractive hybridization (SSH) method conducted to generate ... which showed the lack of genomic information currently available for lily. ..... characterization of genes expressed during somatic embryo.

  1. Mining gene expression data of multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Pi Guo

    Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.

  2. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

    Science.gov (United States)

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928

  3. Evaluation of suitable reference genes for gene expression studies ...

    Indian Academy of Sciences (India)

    2011-12-14

    Dec 14, 2011 ... MADS family of TFs control floral organ identity within each whorl of the flower by activating downstream genes. Measuring gene expression in different tissue types and developmental stages is of fundamental importance in TFs functional research. In last few years, quantitative real-time. PCR (qRT-PCR) ...

  4. PRAME gene expression profile in medulloblastoma

    Directory of Open Access Journals (Sweden)

    Tânia Maria Vulcani-Freitas

    2011-02-01

    Full Text Available Medulloblastoma is the most common malignant tumors of central nervous system in the childhood. The treatment is severe, harmful and, thus, has a dismal prognosis. As PRAME is present in various cancers, including meduloblastoma, and has limited expression in normal tissues, this antigen can be an ideal vaccine target for tumor immunotherapy. In order to find a potential molecular target, we investigated PRAME expression in medulloblastoma fragments and we compare the results with the clinical features of each patient. Analysis of gene expression was performed by real-time quantitative PCR from 37 tumor samples. The Mann-Whitney test was used to analysis the relationship between gene expression and clinical characteristics. Kaplan-Meier curves were used to evaluate survival. PRAME was overexpressed in 84% samples. But no statistical association was found between clinical features and PRAME overexpression. Despite that PRAME gene could be a strong candidate for immunotherapy since it is highly expressed in medulloblastomas.

  5. Comparative gene expression between two yeast species

    Directory of Open Access Journals (Sweden)

    Guan Yuanfang

    2013-01-01

    Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.

  6. Accurate, model-based tuning of synthetic gene expression using introns in S. cerevisiae.

    Directory of Open Access Journals (Sweden)

    Ido Yofe

    2014-06-01

    Full Text Available Introns are key regulators of eukaryotic gene expression and present a potentially powerful tool for the design of synthetic eukaryotic gene expression systems. However, intronic control over gene expression is governed by a multitude of complex, incompletely understood, regulatory mechanisms. Despite this lack of detailed mechanistic understanding, here we show how a relatively simple model enables accurate and predictable tuning of synthetic gene expression system in yeast using several predictive intron features such as transcript folding and sequence motifs. Using only natural Saccharomyces cerevisiae introns as regulators, we demonstrate fine and accurate control over gene expression spanning a 100 fold expression range. These results broaden the engineering toolbox of synthetic gene expression systems and provide a framework in which precise and robust tuning of gene expression is accomplished.

  7. Prediction of epigenetically regulated genes in breast cancer cell lines

    Energy Technology Data Exchange (ETDEWEB)

    Loss, Leandro A; Sadanandam, Anguraj; Durinck, Steffen; Nautiyal, Shivani; Flaucher, Diane; Carlton, Victoria EH; Moorhead, Martin; Lu, Yontao; Gray, Joe W; Faham, Malek; Spellman, Paul; Parvin, Bahram

    2010-05-04

    panel of breast cancer cell lines. Subnetwork enrichment of these genes has identifed 35 common regulators with 6 or more predicted markers. In addition to identifying epigenetically regulated genes, we show evidence of differentially expressed methylation patterns between the basal and luminal subtypes. Our results indicate that the proposed computational protocol is a viable platform for identifying epigenetically regulated genes. Our protocol has generated a list of predictors including COL1A2, TOP2A, TFF1, and VAV3, genes whose key roles in epigenetic regulation is documented in the literature. Subnetwork enrichment of these predicted markers further suggests that epigenetic regulation of individual genes occurs in a coordinated fashion and through common regulators.

  8. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.

    2013-07-18

    The modeling of gene networks from transcriptional expression data is an important tool in biomedical research to reveal signaling pathways and to identify treatment targets. Current gene network modeling is primarily based on the use of Gaussian graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which generate counts of mRNAtranscripts in cell samples.We propose a generalized linear model to fit the discrete gene expression data and assume that the log ratios of the mean expression levels follow a Gaussian distribution.We restrict the gene network structures to decomposable graphs and derive the graphs by selecting the covariance matrix of the Gaussian distribution with the hyper-inverse Wishart priors. Furthermore, we incorporate prior network models based on gene ontology information, which avails existing biological information on the genes of interest. We conduct simulation studies to examine the performance of our discrete graphical model and apply the method to two real datasets for gene network inference. © The Author 2013. Published by Oxford University Press. All rights reserved.

  9. Gene expression profiling in autoimmune diseases

    DEFF Research Database (Denmark)

    Bovin, Lone Frier; Brynskov, Jørn; Hegedüs, Laszlo

    2007-01-01

    A central issue in autoimmune disease is whether the underlying inflammation is a repeated stereotypical process or whether disease specific gene expression is involved. To shed light on this, we analysed whether genes previously found to be differentially regulated in rheumatoid arthritis (RA...

  10. Bayesian assignment of gene ontology terms to gene expression experiments.

    Science.gov (United States)

    Sykacek, P

    2012-09-15

    Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Source code under GPL license is available from the author. peter.sykacek@boku.ac.at.

  11. Bayesian assignment of gene ontology terms to gene expression experiments

    Science.gov (United States)

    Sykacek, P.

    2012-01-01

    Motivation: Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. Results: This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Availability: Source code under GPL license is available from the author. Contact: peter.sykacek@boku.ac.at PMID:22962488

  12. Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?

    Science.gov (United States)

    Kaur, Simranjeet; Pociot, Flemming

    2015-07-13

    Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.

  13. Interdependence of cell growth and gene expression: origins and consequences.

    Science.gov (United States)

    Scott, Matthew; Gunderson, Carl W; Mateescu, Eduard M; Zhang, Zhongge; Hwa, Terence

    2010-11-19

    In bacteria, the rate of cell proliferation and the level of gene expression are intimately intertwined. Elucidating these relations is important both for understanding the physiological functions of endogenous genetic circuits and for designing robust synthetic systems. We describe a phenomenological study that reveals intrinsic constraints governing the allocation of resources toward protein synthesis and other aspects of cell growth. A theory incorporating these constraints can accurately predict how cell proliferation and gene expression affect one another, quantitatively accounting for the effect of translation-inhibiting antibiotics on gene expression and the effect of gratuitous protein expression on cell growth. The use of such empirical relations, analogous to phenomenological laws, may facilitate our understanding and manipulation of complex biological systems before underlying regulatory circuits are elucidated.

  14. Reference Gene Screening for Analyzing Gene Expression Across Goat Tissue

    Directory of Open Access Journals (Sweden)

    Yu Zhang

    2013-12-01

    Full Text Available Real-time quantitative PCR (qRT-PCR is one of the important methods for investigating the changes in mRNA expression levels in cells and tissues. Selection of the proper reference genes is very important when calibrating the results of real-time quantitative PCR. Studies on the selection of reference genes in goat tissues are limited, despite the economic importance of their meat and dairy products. We used real-time quantitative PCR to detect the expression levels of eight reference gene candidates (18S, TBP, HMBS, YWHAZ, ACTB, HPRT1, GAPDH and EEF1A2 in ten tissues types sourced from Boer goats. The optimal reference gene combination was selected according to the results determined by geNorm, NormFinder and Bestkeeper software packages. The analyses showed that tissue is an important variability factor in genes expression stability. When all tissues were considered, 18S, TBP and HMBS is the optimal reference combination for calibrating quantitative PCR analysis of gene expression from goat tissues. Dividing data set by tissues, ACTB was the most stable in stomach, small intestine and ovary, 18S in heart and spleen, HMBS in uterus and lung, TBP in liver, HPRT1 in kidney and GAPDH in muscle. Overall, this study provided valuable information about the goat reference genes that can be used in order to perform a proper normalisation when relative quantification by qRT-PCR studies is undertaken.

  15. Design parameters to control synthetic gene expression in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Mark Welch

    Full Text Available BACKGROUND: Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles. PRINCIPAL FINDINGS: To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well. CONCLUSION: The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.

  16. A gene expression signature associated with survival in metastatic melanoma

    Science.gov (United States)

    Mandruzzato, Susanna; Callegaro, Andrea; Turcatel, Gianluca; Francescato, Samuela; Montesco, Maria C; Chiarion-Sileni, Vanna; Mocellin, Simone; Rossi, Carlo R; Bicciato, Silvio; Wang, Ena; Marincola, Francesco M; Zanovello, Paola

    2006-01-01

    Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM) to identify genes associated with patient survival, and supervised principal components (SPC) to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells. PMID:17129373

  17. A gene expression signature associated with survival in metastatic melanoma

    Directory of Open Access Journals (Sweden)

    Rossi Carlo R

    2006-11-01

    Full Text Available Abstract Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM to identify genes associated with patient survival, and supervised principal components (SPC to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells.

  18. Transcriptome database resource and gene expression atlas for the rose

    Science.gov (United States)

    2012-01-01

    Background For centuries roses have been selected based on a number of traits. Little information exists on the genetic and molecular basis that contributes to these traits, mainly because information on expressed genes for this economically important ornamental plant is scarce. Results Here, we used a combination of Illumina and 454 sequencing technologies to generate information on Rosa sp. transcripts using RNA from various tissues and in response to biotic and abiotic stresses. A total of 80714 transcript clusters were identified and 76611 peptides have been predicted among which 20997 have been clustered into 13900 protein families. BLASTp hits in closely related Rosaceae species revealed that about half of the predicted peptides in the strawberry and peach genomes have orthologs in Rosa dataset. Digital expression was obtained using RNA samples from organs at different development stages and under different stress conditions. qPCR validated the digital expression data for a selection of 23 genes with high or low expression levels. Comparative gene expression analyses between the different tissues and organs allowed the identification of clusters that are highly enriched in given tissues or under particular conditions, demonstrating the usefulness of the digital gene expression analysis. A web interface ROSAseq was created that allows data interrogation by BLAST, subsequent analysis of DNA clusters and access to thorough transcript annotation including best BLAST matches on Fragaria vesca, Prunus persica and Arabidopsis. The rose peptides dataset was used to create the ROSAcyc resource pathway database that allows access to the putative genes and enzymatic pathways. Conclusions The study provides useful information on Rosa expressed genes, with thorough annotation and an overview of expression patterns for transcripts with good accuracy. PMID:23164410

  19. Gene expression analysis of precision-cut human liver slices indicates stable expression of ADME-Tox related genes

    NARCIS (Netherlands)

    Elferink, M. G. L.; Olinga, P.; van Leeuwen, E. M.; Bauerschmidt, S.; Polman, J.; Schoonen, W. G.; Heisterkamp, S. H.; Groothuis, G. M. M.

    2011-01-01

    In the process of drug development it is of high importance to test the safety of new drugs with predictive value for human toxicity. A promising approach of toxicity testing is based on shifts in gene expression profiling of the liver. Toxicity screening based on animal liver cells cannot be

  20. DEEP--a tool for differential expression effector prediction.

    Science.gov (United States)

    Degenhardt, Jost; Haubrock, Martin; Dönitz, Jürgen; Wingender, Edgar; Crass, Torsten

    2007-07-01

    High-throughput methods for measuring transcript abundance, like SAGE or microarrays, are widely used for determining differences in gene expression between different tissue types, dignities (normal/malignant) or time points. Further analysis of such data frequently aims at the identification of gene interaction networks that form the causal basis for the observed properties of the systems under examination. To this end, it is usually not sufficient to rely on the measured gene expression levels alone; rather, additional biological knowledge has to be taken into account in order to generate useful hypotheses about the molecular mechanism leading to the realization of a certain phenotype. We present a method that combines gene expression data with biological expert knowledge on molecular interaction networks, as described by the TRANSPATH database on signal transduction, to predict additional--and not necessarily differentially expressed--genes or gene products which might participate in processes specific for either of the examined tissues or conditions. In a first step, significance values for over-expression in tissue/condition A or B are assigned to all genes in the expression data set. Genes with a significance value exceeding a certain threshold are used as starting points for the reconstruction of a graph with signaling components as nodes and signaling events as edges. In a subsequent graph traversal process, again starting from the previously identified differentially expressed genes, all encountered nodes 'inherit' all their starting nodes' significance values. In a final step, the graph is visualized, the nodes being colored according to a weighted average of their inherited significance values. Each node's, or sub-network's, predominant color, ranging from green (significant for tissue/condition A) over yellow (not significant for either tissue/condition) to red (significant for tissue/condition B), thus gives an immediate visual clue on which molecules

  1. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  2. Expression Study of Banana Pathogenic Resistance Genes

    Directory of Open Access Journals (Sweden)

    Fenny M. Dwivany

    2016-10-01

    Full Text Available Banana is one of the world's most important trade commodities. However, infection of banana pathogenic fungi (Fusarium oxysporum race 4 is one of the major causes of decreasing production in Indonesia. Genetic engineering has become an alternative way to control this problem by isolating genes that involved in plant defense mechanism against pathogens. Two of the important genes are API5 and ChiI1, each gene encodes apoptosis inhibitory protein and chitinase enzymes. The purpose of this study was to study the expression of API5 and ChiI1 genes as candidate pathogenic resistance genes. The amplified fragments were then cloned, sequenced, and confirmed with in silico studies. Based on sequence analysis, it is showed that partial API5 gene has putative transactivation domain and ChiI1 has 9 chitinase family GH19 protein motifs. Data obtained from this study will contribute in banana genetic improvement.

  3. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    Science.gov (United States)

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional

  4. Dlx homeobox gene family expression in osteoclasts.

    Science.gov (United States)

    Lézot, F; Thomas, B L; Blin-Wakkach, C; Castaneda, B; Bolanos, A; Hotton, D; Sharpe, P T; Heymann, D; Carles, G F; Grigoriadis, A E; Berdal, A

    2010-06-01

    Skeletal growth and homeostasis require the finely orchestrated secretion of mineralized tissue matrices by highly specialized cells, balanced with their degradation by osteoclasts. Time- and site-specific expression of Dlx and Msx homeobox genes in the cells secreting these matrices have been identified as important elements in the regulation of skeletal morphology. Such specific expression patterns have also been reported in osteoclasts for Msx genes. The aim of the present study was to establish the expression patterns of Dlx genes in osteoclasts and identify their function in regulating skeletal morphology. The expression patterns of all Dlx genes were examined during the whole osteoclastogenesis using different in vitro models. The results revealed that Dlx1 and Dlx2 are the only Dlx family members with a possible function in osteoclastogenesis as well as in mature osteoclasts. Dlx5 and Dlx6 were detected in the cultures but appear to be markers of monocytes and their derivatives. In vivo, Dlx2 expression in osteoclasts was examined using a Dlx2/LacZ transgenic mouse. Dlx2 is expressed in a subpopulation of osteoclasts in association with tooth, brain, nerve, and bone marrow volumetric growths. Altogether the present data suggest a role for Dlx2 in regulation of skeletal morphogenesis via functions within osteoclasts. (c) 2010 Wiley-Liss, Inc.

  5. Hepatocyte specific expression of human cloned genes

    Energy Technology Data Exchange (ETDEWEB)

    Cortese, R

    1986-01-01

    A large number of proteins are specifically synthesized in the hepatocyte. Only the adult liver expresses the complete repertoire of functions which are required at various stages during development. There is therefore a complex series of regulatory mechanisms responsible for the maintenance of the differentiated state and for the developmental and physiological variations in the pattern of gene expression. Human hepatoma cell lines HepG2 and Hep3B display a pattern of gene expression similar to adult and fetal liver, respectively; in contrast, cultured fibroblasts or HeLa cells do not express most of the liver specific genes. They have used these cell lines for transfection experiments with cloned human liver specific genes. DNA segments coding for alpha1-antitrypsin and retinol binding protein (two proteins synthesized both in fetal and adult liver) are expressed in the hepatoma cell lines HepG2 and Hep3B, but not in HeLa cells or fibroblasts. A DNA segment coding for haptoglobin (a protein synthesized only after birth) is only expressed in the hepatoma cell line HepG2 but not in Hep3B nor in non hepatic cell lines. The information for tissue specific expression is located in the 5' flanking region of all three genes. In vivo competition experiments show that these DNA segments bind to a common, apparently limiting, transacting factor. Conventional techniques (Bal deletions, site directed mutagenesis, etc.) have been used to precisely identify the DNA sequences responsible for these effects. The emerging picture is complex: they have identified multiple, separate transcriptional signals, essential for maximal promoter activation and tissue specific expression. Some of these signals show a negative effect on transcription in fibroblast cell lines.

  6. Gene expression profiles in skeletal muscle after gene electrotransfer

    DEFF Research Database (Denmark)

    Hojman, Pernille; Zibert, John R; Gissel, Hanne

    2007-01-01

    BACKGROUND: Gene transfer by electroporation (DNA electrotransfer) to muscle results in high level long term transgenic expression, showing great promise for treatment of e.g. protein deficiency syndromes. However little is known about the effects of DNA electrotransfer on muscle fibres. We have...... caused down-regulation of structural proteins e.g. sarcospan and catalytic enzymes. Injection of DNA induced down-regulation of intracellular transport proteins e.g. sentrin. The effects on muscle fibres were transient as the expression profiles 3 weeks after treatment were closely related......) followed by a long low voltage pulse (LV, 100 V/cm, 400 ms); a pulse combination optimised for efficient and safe gene transfer. Muscles were transfected with green fluorescent protein (GFP) and excised at 4 hours, 48 hours or 3 weeks after treatment. RESULTS: Differentially expressed genes were...

  7. Gene expression analysis of flax seed development

    Science.gov (United States)

    2011-01-01

    Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise

  8. Gene expression analysis of flax seed development

    Directory of Open Access Journals (Sweden)

    Sharpe Andrew

    2011-04-01

    Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid

  9. Lithium ions induce prestalk-associated gene expression and inhibit prespore gene expression in Dictyostelium discoideum

    NARCIS (Netherlands)

    Peters, Dorien J.M.; Lookeren Campagne, Michiel M. van; Haastert, Peter J.M. van; Spek, Wouter; Schaap, Pauline

    1989-01-01

    We investigated the effect of Li+ on two types of cyclic AMP-regulated gene expression and on basal and cyclic AMP-stimulated inositol 1,4,5-trisphosphate (Ins(1,4,5)P3) levels. Li+ effectively inhibits cyclic AMP-induced prespore gene expression, half-maximal inhibition occurring at about 2mM-LiCl.

  10. Scaling of gene expression data allowing the comparison of different gene expression platforms

    NARCIS (Netherlands)

    van Ruissen, Fred; Schaaf, Gerben J.; Kool, Marcel; Baas, Frank; Ruijter, Jan M.

    2008-01-01

    Serial analysis of gene expression (SAGE) and microarrays have found a widespread application, but much ambiguity exists regarding the amalgamation of the data resulting from these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce

  11. Population genetic variation in gene expression is associated withphenotypic variation in Saccharomyces cerevisiae

    Energy Technology Data Exchange (ETDEWEB)

    Fay, Justin C.; McCullough, Heather L.; Sniegowski, Paul D.; Eisen, Michael B.

    2004-02-25

    The relationship between genetic variation in gene expression and phenotypic variation observable in nature is not well understood. Identifying how many phenotypes are associated with differences in gene expression and how many gene-expression differences are associated with a phenotype is important to understanding the molecular basis and evolution of complex traits. Results: We compared levels of gene expression among nine natural isolates of Saccharomyces cerevisiae grown either in the presence or absence of copper sulfate. Of the nine strains, two show a reduced growth rate and two others are rust colored in the presence of copper sulfate. We identified 633 genes that show significant differences in expression among strains. Of these genes,20 were correlated with resistance to copper sulfate and 24 were correlated with rust coloration. The function of these genes in combination with their expression pattern suggests the presence of both correlative and causative expression differences. But the majority of differentially expressed genes were not correlated with either phenotype and showed the same expression pattern both in the presence and absence of copper sulfate. To determine whether these expression differences may contribute to phenotypic variation under other environmental conditions, we examined one phenotype, freeze tolerance, predicted by the differential expression of the aquaporin gene AQY2. We found freeze tolerance is associated with the expression of AQY2. Conclusions: Gene expression differences provide substantial insight into the molecular basis of naturally occurring traits and can be used to predict environment dependent phenotypic variation.

  12. Radiation-modulated gene expression in C. elegans

    International Nuclear Information System (INIS)

    Nelson, G.A.; Bayeta, E.; Perez, C.; Lloyd, E.; Jones, T.; Smith, A.; Tian, J.

    2003-01-01

    Full text: We use the nematode C. elegans to characterize the genotoxic and cytotoxic effects of ionizing radiation with emphasis effects of charged particle radiation and have described the fluence vs. response relationships for mutation, chromosome aberration and certain developmental errors. These endpoints quantify the biological after repair and compensation pathways have completed their work. In order to address the control of these reactions we have turned to gene expression profiling to identify genes that uniquely respond to high LET species or respond differentially as a function of radiation properties. We have employed whole genome microarray methods to map gene expression following exposure to gamma rays, protons and accelerated iron ions. We found that 599 of 17871 genes analyzed showed differential expression 3 hrs after exposure to 3 Gy of at least one radiation types. 193 were up-regulated, 406 were down-regulated, and 90% were affected by only one species of radiation. Genes whose transcription levels responded significantly mapped to definite statistical clusters that were unique for each radiation type. We are now trying to establish the functional relationships of the genes their relevance to mitigation of radiation-induced damage. Three approaches are being used. First, bioinformatics tools are being used to determine the roles of genes in co-regulated gene sets. Second, we are applying the technique of RNA interference to determine whether our radiation-induced genes affect cell survival (measured in terms of embryo survival) and chromosome aberration (intestinal anaphase bridges). Finally we are focussing on the response of the most strongly-regulated gene in our data set. This is the autosomal gene, F36D3.9, whose predicted structure is that of a cysteine protease resembling cathepsin B. An enzymological approach is being used to characterize this gene at the protein level. This work was supported by NASA Cooperative Agreement NCC9-149

  13. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.

  14. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782

  15. Time course of gene expression during mouse skeletal muscle hypertrophy.

    Science.gov (United States)

    Chaillou, Thomas; Lee, Jonah D; England, Jonathan H; Esser, Karyn A; McCarthy, John J

    2013-10-01

    The purpose of this study was to perform a comprehensive transcriptome analysis during skeletal muscle hypertrophy to identify signaling pathways that are operative throughout the hypertrophic response. Global gene expression patterns were determined from microarray results on days 1, 3, 5, 7, 10, and 14 during plantaris muscle hypertrophy induced by synergist ablation in adult mice. Principal component analysis and the number of differentially expressed genes (cutoffs ≥2-fold increase or ≥50% decrease compared with control muscle) revealed three gene expression patterns during overload-induced hypertrophy: early (1 day), intermediate (3, 5, and 7 days), and late (10 and 14 days) patterns. Based on the robust changes in total RNA content and in the number of differentially expressed genes, we focused our attention on the intermediate gene expression pattern. Ingenuity Pathway Analysis revealed a downregulation of genes encoding components of the branched-chain amino acid degradation pathway during hypertrophy. Among these genes, five were predicted by Ingenuity Pathway Analysis or previously shown to be regulated by the transcription factor Kruppel-like factor-15, which was also downregulated during hypertrophy. Moreover, the integrin-linked kinase signaling pathway was activated during hypertrophy, and the downregulation of muscle-specific micro-RNA-1 correlated with the upregulation of five predicted targets associated with the integrin-linked kinase pathway. In conclusion, we identified two novel pathways that may be involved in muscle hypertrophy, as well as two upstream regulators (Kruppel-like factor-15 and micro-RNA-1) that provide targets for future studies investigating the importance of these pathways in muscle hypertrophy.

  16. Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods.

    Science.gov (United States)

    Wang, Liming; Zhu, L; Luan, R; Wang, L; Fu, J; Wang, X; Sui, L

    2016-10-10

    Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM.

  17. Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods

    Directory of Open Access Journals (Sweden)

    Liming Wang

    Full Text Available Dilated cardiomyopathy (DCM is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs and microRNAs (miRNAs of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family. Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1, potential TFs, as well as potential miRNAs, might be involved in DCM.

  18. Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

    Energy Technology Data Exchange (ETDEWEB)

    Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)

    2013-10-01

    The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.

  19. Comprehensive analysis of gene expression patterns of hedgehog-related genes

    Directory of Open Access Journals (Sweden)

    Baillie David

    2006-10-01

    Full Text Available Abstract Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the

  20. Density based pruning for identification of differentially expressed genes from microarray data

    Directory of Open Access Journals (Sweden)

    Xu Jia

    2010-11-01

    Full Text Available Abstract Motivation Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. Results We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Conclusions Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune

  1. Myocardial gene expression of microRNA-133a and myosin heavy and light chains, in conjunction with clinical parameters, predict regression of left ventricular hypertrophy after valve replacement in patients with aortic stenosis.

    Science.gov (United States)

    Villar, Ana V; Merino, David; Wenner, Mareike; Llano, Miguel; Cobo, Manuel; Montalvo, Cecilia; García, Raquel; Martín-Durán, Rafael; Hurlé, Juan M; Hurlé, María A; Nistal, J Francisco

    2011-07-01

    Left ventricular (LV) reverse remodelling after valve replacement in aortic stenosis (AS) has been classically linked to the hydraulic performance of the replacement device, but myocardial status at the time of surgery has received little attention. To establish predictors of LV mass (LVM) regression 1 year after valve replacement in a surgical cohort of patients with AS based on preoperative clinical and echocardiographic parameters and the myocardial gene expression profile at surgery. Transcript levels of remodelling-related proteins and regulators were determined in LV intraoperative biopsies from 46 patients with AS by RT-PCR. Using multiple linear regression analysis, an equation was developed (adjusted R²=0.73; pregression analysis identified microRNA-133a as a significant positive predictor of LVM normalisation, whereas β-myosin heavy chain and BMI constituted negative predictors. Hypertrophy regression 1 year after pressure overload release is related to the preoperative myocardial expression of remodelling-related genes, in conjunction with the patient's clinical background. In this scenario, miR-133 emerges as a key element of the reverse remodelling process. Postoperative improvement of valve haemodynamics does not predict the degree of hypertrophy regression or LVM normalisation. These results led us to reconsider the current reverse remodelling paradigm and (1) to include criteria of hypertrophy reversibility in the decision algorithm used to decide timing for the operation; and (2) to modify other prevailing factors (overweight, diabetes, etc) known to maintain LV hypertrophy.

  2. Combining gene prediction methods to improve metagenomic gene annotation

    Directory of Open Access Journals (Sweden)

    Rosen Gail L

    2011-01-01

    Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.

  3. Gene expression analysis identifies global gene dosage sensitivity in cancer

    DEFF Research Database (Denmark)

    Fehrmann, Rudolf S. N.; Karjalainen, Juha M.; Krajewska, Malgorzata

    2015-01-01

    Many cancer-associated somatic copy number alterations (SCNAs) are known. Currently, one of the challenges is to identify the molecular downstream effects of these variants. Although several SCNAs are known to change gene expression levels, it is not clear whether each individual SCNA affects gen...

  4. A network approach to predict pathogenic genes for Fusarium graminearum.

    Directory of Open Access Journals (Sweden)

    Xiaoping Liu

    Full Text Available Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB, which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other

  5. The predictive nature of transcript expression levels on protein expression in adult human brain.

    Science.gov (United States)

    Bauernfeind, Amy L; Babbitt, Courtney C

    2017-04-24

    Next generation sequencing methods are the gold standard for evaluating expression of the transcriptome. When determining the biological implications of such studies, the assumption is often made that transcript expression levels correspond to protein levels in a meaningful way. However, the strength of the overall correlation between transcript and protein expression is inconsistent, particularly in brain samples. Following high-throughput transcriptomic (RNA-Seq) and proteomic (liquid chromatography coupled with tandem mass spectrometry) analyses of adult human brain samples, we compared the correlation in the expression of transcripts and proteins that support various biological processes, molecular functions, and that are located in different areas of the cell. Although most categories of transcripts have extremely weak predictive value for the expression of their associated proteins (R 2 values of < 10%), transcripts coding for protein kinases and membrane-associated proteins, including those that are part of receptors or ion transporters, are among those that are most predictive of downstream protein expression levels. The predictive value of transcript expression for corresponding proteins is variable in human brain samples, reflecting the complex regulation of protein expression. However, we found that transcriptomic analyses are appropriate for assessing the expression levels of certain classes of proteins, including those that modify proteins, such as kinases and phosphatases, regulate metabolic and synaptic activity, or are associated with a cellular membrane. These findings can be used to guide the interpretation of gene expression results from primate brain samples.

  6. Metallothionein gene expression in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Deeksha Pal

    2014-01-01

    Full Text Available Introduction: Metallothioneins (MTs are a group of low-molecular weight, cysteine-rich proteins. In general, MT is known to modulate three fundamental processes: (1 the release of gaseous mediators such as hydroxyl radical or nitric oxide, (2 apoptosis and (3 the binding and exchange of heavy metals such as zinc, cadmium or copper. Previous studies have shown a positive correlation between the expression of MT with invasion, metastasis and poor prognosis in various cancers. Most of the previous studies primarily used immunohistochemistry to analyze localization of MT in renal cell carcinoma (RCC. No information is available on the gene expression of MT2A isoform in different types and grades of RCC. Materials and Methods: In the present study, total RNA was isolated from 38 histopathologically confirmed cases of RCC of different types and grades. Corresponding adjacent normal renal parenchyma was taken as control. Real-time polymerase chain reaction (RT PCR analysis was done for the MT2A gene expression using b-actin as an internal control. All statistical calculations were performed using SPSS software. Results: The MT2A gene expression was found to be significantly increased (P < 0.01 in clear cell RCC in comparison with the adjacent normal renal parenchyma. The expression of MT2A was two to three-fold higher in sarcomatoid RCC, whereas there was no change in papillary and collecting duct RCC. MT2A gene expression was significantly higher in lower grade (grades I and II, P < 0.05, while no change was observed in high-grade tumor (grade III and IV in comparison to adjacent normal renal tissue. Conclusion: The first report of the expression of MT2A in different types and grades of RCC and also these data further support the role of MT2A in tumorigenesis.

  7. Gene expression of the endolymphatic sac

    DEFF Research Database (Denmark)

    Friis, Morten; Martin-Bertelsen, Tomas; Friis-Hansen, Lennart

    2011-01-01

    that the endolymphatic sac has multiple and diverse functions in the inner ear. Objectives:The objective of this study was to provide a comprehensive review of the genes expressed in the endolymphatic sac in the rat and perform a functional characterization based on measured mRNA abundance. Methods:Microarray technology...

  8. Shrinkage Approach for Gene Expression Data Analysis

    Czech Academy of Sciences Publication Activity Database

    Haman, Jiří; Valenta, Zdeněk; Kalina, Jan

    2013-01-01

    Roč. 1, č. 1 (2013), s. 65-65 ISSN 1805-8698. [EFMI 2013 Special Topic Conference. 17.04.2013-19.04.2013, Prague] Institutional support: RVO:67985807 Keywords : shrinkage estimation * covariance matrix * high dimensional data * gene expression Subject RIV: IN - Informatics, Computer Science

  9. The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

    Directory of Open Access Journals (Sweden)

    Laurence D Hurst

    2015-12-01

    Full Text Available X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE and data from the Functional Annotation of the Mammalian Genome (FANTOM5 project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds, as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased

  10. The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome

    KAUST Repository

    Hurst, Laurence D.

    2015-12-18

    X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression

  11. Fluid Mechanics, Arterial Disease, and Gene Expression.

    Science.gov (United States)

    Tarbell, John M; Shi, Zhong-Dong; Dunn, Jessilyn; Jo, Hanjoong

    2014-01-01

    This review places modern research developments in vascular mechanobiology in the context of hemodynamic phenomena in the cardiovascular system and the discrete localization of vascular disease. The modern origins of this field are traced, beginning in the 1960s when associations between flow characteristics, particularly blood flow-induced wall shear stress, and the localization of atherosclerotic plaques were uncovered, and continuing to fluid shear stress effects on the vascular lining endothelial) cells (ECs), including their effects on EC morphology, biochemical production, and gene expression. The earliest single-gene studies and genome-wide analyses are considered. The final section moves from the ECs lining the vessel wall to the smooth muscle cells and fibroblasts within the wall that are fluid me chanically activated by interstitial flow that imposes shear stresses on their surfaces comparable with those of flowing blood on EC surfaces. Interstitial flow stimulates biochemical production and gene expression, much like blood flow on ECs.

  12. Gene Expression Commons: an open platform for absolute gene expression profiling.

    Directory of Open Access Journals (Sweden)

    Jun Seita

    Full Text Available Gene expression profiling using microarrays has been limited to comparisons of gene expression between small numbers of samples within individual experiments. However, the unknown and variable sensitivities of each probeset have rendered the absolute expression of any given gene nearly impossible to estimate. We have overcome this limitation by using a very large number (>10,000 of varied microarray data as a common reference, so that statistical attributes of each probeset, such as the dynamic range and threshold between low and high expression, can be reliably discovered through meta-analysis. This strategy is implemented in a web-based platform named "Gene Expression Commons" (https://gexc.stanford.edu/ which contains data of 39 distinct highly purified mouse hematopoietic stem/progenitor/differentiated cell populations covering almost the entire hematopoietic system. Since the Gene Expression Commons is designed as an open platform, investigators can explore the expression level of any gene, search by expression patterns of interest, submit their own microarray data, and design their own working models representing biological relationship among samples.

  13. Comparative gene expression of intestinal metabolizing enzymes.

    Science.gov (United States)

    Shin, Ho-Chul; Kim, Hye-Ryoung; Cho, Hee-Jung; Yi, Hee; Cho, Soo-Min; Lee, Dong-Goo; Abd El-Aty, A M; Kim, Jin-Suk; Sun, Duxin; Amidon, Gordon L

    2009-11-01

    The purpose of this study was to compare the expression profiles of drug-metabolizing enzymes in the intestine of mouse, rat and human. Total RNA was isolated from the duodenum and the mRNA expression was measured using Affymetrix GeneChip oligonucleotide arrays. Detected genes from the intestine of mouse, rat and human were ca. 60% of 22690 sequences, 40% of 8739 and 47% of 12559, respectively. Total genes of metabolizing enzymes subjected in this study were 95, 33 and 68 genes in mouse, rat and human, respectively. Of phase I enzymes, the mouse exhibited abundant gene expressions for Cyp3a25, Cyp4v3, Cyp2d26, followed by Cyp2b20, Cyp2c65 and Cyp4f14, whereas, the rat showed higher expression profiles of Cyp3a9, Cyp2b19, Cyp4f1, Cyp17a1, Cyp2d18, Cyp27a1 and Cyp4f6. However, the highly expressed P450 enzymes were CYP3A4, CYP3A5, CYP4F3, CYP2C18, CYP2C9, CYP2D6, CYP3A7, CYP11B1 and CYP2B6 in the human. For phase II enzymes, glucuronosyltransferase Ugt1a6, glutathione S-transferases Gstp1, Gstm3 and Gsta2, sulfotransferase Sult1b1 and acyltransferase Dgat1 were highly expressed in the mouse. The rat revealed predominant expression of glucuronosyltransferases Ugt1a1 and Ugt1a7, sulfotransferase Sult1b1, acetyltransferase Dlat and acyltransferase Dgat1. On the other hand, in human, glucuronosyltransferases UGT2B15 and UGT2B17, glutathione S-transferases MGST3, GSTP1, GSTA2 and GSTM4, sulfotransferases ST1A3 and SULT1A2, acetyltransferases SAT1 and CRAT, and acyltransferase AGPAT2 were dominantly detected. Therefore, current data indicated substantial interspecies differences in the pattern of intestinal gene expression both for P450 enzymes and phase II drug-metabolizing enzymes. This genomic database is expected to improve our understanding of interspecies variations in estimating intestinal prehepatic clearance of oral drugs.

  14. Structure and expression of thyroglobulin gene

    Energy Technology Data Exchange (ETDEWEB)

    Vassart, G; Brocas, H; Christophe, D; de Martynoff, G; Leriche, A; Mercken, L; Pohl, V; van Heuverswyn, B [Institut de Recherche Interdisciplinaire en Biologie Humaine et Nucleaire (IRIBHN), Faculte de Medecine, Universite libre de Bruxelles, Campus Hopital Erasme, Brussels (Belgium)

    1982-01-01

    Thyroglobulin is composed of two 300000 dalton polypeptide chains, translated from an 8000 base mRNA. Preparation of a full length cDNA and its cloning in E. coli have lead to the demonstration that the polypeptides of thyroglobulin protomers were identical. Used as molecular probes, the cloned cDNA allowed the isolation of a fragment of thyroglobulin gene. Electron microscopic studies have demonstrated that this gene contains more than 90 % intronic material separating small size exons (<200 bp). Sequencing of bovine thyroglobulin structural gene is in progress. Preliminary results show evidence for the existence of repetitive segments. Availability of cloned DNA complementary to bovine and human thyroglobulin mRNA allows the study of genetic defects of thyroglobulin gene expression in the human and in various animal models.

  15. Cerebrovascular gene expression in spontaneously hypertensive rats

    DEFF Research Database (Denmark)

    Grell, Anne-Sofie; Frederiksen, Simona Denise; Edvinsson, Lars

    2017-01-01

    Hypertension is a hemodynamic disorder and one of the most important and well-established risk factors for vascular diseases such as stroke. Blood vessels exposed to chronic shear stress develop structural changes and remodeling of the vascular wall through many complex mechanisms. However......, the molecular mechanisms involved are not fully understood. Hypertension-susceptible genes may provide a novel insight into potential molecular mechanisms of hypertension and secondary complications associated with hypertension. The aim of this exploratory study was to identify gene expression differences......, the identified genes in the middle cerebral arteries from spontaneously hypertensive rats could be possible mediators of the vascular changes and secondary complications associated with hypertension. This study supports the selection of key genes to investigate in the future research of hypertension-induced end...

  16. Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

    Science.gov (United States)

    Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

    2015-01-27

    Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.

  17. Gene expression signatures for colorectal cancer microsatellite status and HNPCC

    DEFF Research Database (Denmark)

    Kruhøffer, M; Jensen, J L; Laiho, P

    2005-01-01

    The majority of microsatellite instable (MSI) colorectal cancers are sporadic, but a subset belongs to the syndrome hereditary non-polyposis colorectal cancer (HNPCC). Microsatellite instability is caused by dysfunction of the mismatch repair (MMR) system that leads to a mutator phenotype, and MSI...... of 101 stage II and III colorectal cancers (34 MSI, 67 microsatellite stable (MSS)) using high-density oligonucleotide microarrays. From these data, we constructed a nine-gene signature capable of separating the mismatch repair proficient and deficient tumours. Subsequently, we demonstrated...... is correlated to prognosis and response to chemotherapy. Gene expression signatures as predictive markers are being developed for many cancers, and the identification of a signature for MMR deficiency would be of interest both clinically and biologically. To address this issue, we profiled the gene expression...

  18. Combining Gene Signatures Improves Prediction of Breast Cancer Survival

    Science.gov (United States)

    Zhao, Xi; Naume, Bjørn; Langerød, Anita; Frigessi, Arnoldo; Kristensen, Vessela N.; Børresen-Dale, Anne-Lise; Lingjærde, Ole Christian

    2011-01-01

    Background Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. Principal Findings To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. Conclusion Combining the predictive strength of multiple gene signatures improves prediction of breast

  19. Combining gene signatures improves prediction of breast cancer survival.

    Directory of Open Access Journals (Sweden)

    Xi Zhao

    Full Text Available BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123 and test set (n = 81, respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014. Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001. The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves

  20. Global gene expression in Escherichia coli biofilms

    DEFF Research Database (Denmark)

    Schembri, Mark; Kjærgaard, K.; Klemm, Per

    2003-01-01

    It is now apparent that microorganisms undergo significant changes during the transition from planktonic to biofilm growth. These changes result in phenotypic adaptations that allow the formation of highly organized and structured sessile communities, which possess enhanced resistance to antimicr......It is now apparent that microorganisms undergo significant changes during the transition from planktonic to biofilm growth. These changes result in phenotypic adaptations that allow the formation of highly organized and structured sessile communities, which possess enhanced resistance...... the transition to biofilm growth, and these included genes expressed under oxygen-limiting conditions, genes encoding (putative) transport proteins, putative oxidoreductases and genes associated with enhanced heavy metal resistance. Of particular interest was the observation that many of the genes altered...... in expression have no current defined function. These genes, as well as those induced by stresses relevant to biofilm growth such as oxygen and nutrient limitation, may be important factors that trigger enhanced resistance mechanisms of sessile communities to antibiotics and hydrodynamic shear forces....

  1. Bioinformatic prediction and functional characterization of human KIAA0100 gene

    Directory of Open Access Journals (Sweden)

    He Cui

    2017-02-01

    Full Text Available Our previous study demonstrated that human KIAA0100 gene was a novel acute monocytic leukemia-associated antigen (MLAA gene. But the functional characterization of human KIAA0100 gene has remained unknown to date. Here, firstly, bioinformatic prediction of human KIAA0100 gene was carried out using online softwares; Secondly, Human KIAA0100 gene expression was downregulated by the clustered regularly interspaced short palindromic repeats (CRISPR/CRISPR-associated (Cas 9 system in U937 cells. Cell proliferation and apoptosis were next evaluated in KIAA0100-knockdown U937 cells. The bioinformatic prediction showed that human KIAA0100 gene was located on 17q11.2, and human KIAA0100 protein was located in the secretory pathway. Besides, human KIAA0100 protein contained a signalpeptide, a transmembrane region, three types of secondary structures (alpha helix, extended strand, and random coil , and four domains from mitochondrial protein 27 (FMP27. The observation on functional characterization of human KIAA0100 gene revealed that its downregulation inhibited cell proliferation, and promoted cell apoptosis in U937 cells. To summarize, these results suggest human KIAA0100 gene possibly comes within mitochondrial genome; moreover, it is a novel anti-apoptotic factor related to carcinogenesis or progression in acute monocytic leukemia, and may be a potential target for immunotherapy against acute monocytic leukemia.

  2. Supervised classification of combined copy number and gene expression data

    Directory of Open Access Journals (Sweden)

    Riccadonna S.

    2007-12-01

    Full Text Available In this paper we apply a predictive profiling method to genome copy number aberrations (CNA in combination with gene expression and clinical data to identify molecular patterns of cancer pathophysiology. Predictive models and optimal feature lists for the platforms are developed by a complete validation SVM-based machine learning system. Ranked list of genome CNA sites (assessed by comparative genomic hybridization arrays – aCGH and of differentially expressed genes (assessed by microarray profiling with Affy HG-U133A chips are computed and combined on a breast cancer dataset for the discrimination of Luminal/ ER+ (Lum/ER+ and Basal-like/ER- classes. Different encodings are developed and applied to the CNA data, and predictive variable selection is discussed. We analyze the combination of profiling information between the platforms, also considering the pathophysiological data. A specific subset of patients is identified that has a different response to classification by chromosomal gains and losses and by differentially expressed genes, corroborating the idea that genomic CNA can represent an independent source for tumor classification.

  3. Decomposition of gene expression state space trajectories.

    Directory of Open Access Journals (Sweden)

    Jessica C Mar

    2009-12-01

    Full Text Available Representing and analyzing complex networks remains a roadblock to creating dynamic network models of biological processes and pathways. The study of cell fate transitions can reveal much about the transcriptional regulatory programs that underlie these phenotypic changes and give rise to the coordinated patterns in expression changes that we observe. The application of gene expression state space trajectories to capture cell fate transitions at the genome-wide level is one approach currently used in the literature. In this paper, we analyze the gene expression dataset of Huang et al. (2005 which follows the differentiation of promyelocytes into neutrophil-like cells in the presence of inducers dimethyl sulfoxide and all-trans retinoic acid. Huang et al. (2005 build on the work of Kauffman (2004 who raised the attractor hypothesis, stating that cells exist in an expression landscape and their expression trajectories converge towards attractive sites in this landscape. We propose an alternative interpretation that explains this convergent behavior by recognizing that there are two types of processes participating in these cell fate transitions-core processes that include the specific differentiation pathways of promyelocytes to neutrophils, and transient processes that capture those pathways and responses specific to the inducer. Using functional enrichment analyses, specific biological examples and an analysis of the trajectories and their core and transient components we provide a validation of our hypothesis using the Huang et al. (2005 dataset.

  4.  DNA microarray-based gene expression profiling in diagnosis, assessing prognosis and predicting response to therapy in colorectal cancer

    Directory of Open Access Journals (Sweden)

    Przemysław Kwiatkowski

    2012-06-01

    Full Text Available  Colorectal cancer is the most common cancer of the gastrointestinal tract. It is considered as a biological model of a certain type of cancerogenesis process in which progression from an early to late stage adenoma and cancer is accompanied by distinct genetic alterations.Clinical and pathological parameters commonly used in clinical practice are often insufficient to determine groups of patients suitable for personalized treatment. Moreover, reliable molecular markers with high prognostic value have not yet been determined. Molecular studies using DNA-based microarrays have identified numerous genes involved in cell proliferation and differentiation during the process of cancerogenesis. Assessment of the genetic profile of colorectal cancer using the microarray technique might be a useful tool in determining the groups of patients with different clinical outcomes who would benefit from additional personalized treatment.The main objective of this study was to present the current state of knowledge on the practical application of gene profiling techniques using microarrays for determining diagnosis, prognosis and response to treatment in colorectal cancer.

  5. Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

    International Nuclear Information System (INIS)

    Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

    2007-01-01

    Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients

  6. Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko; Harushima, Yoshiaki; Fujisawa, Hironori; Mochizuki, Takako; Fujita, Masahiro; Ohyanagi, Hajime; Kurata, Nori

    2015-01-01

    Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue

  7. Creating and validating cis-regulatory maps of tissue-specific gene expression regulation

    Science.gov (United States)

    O'Connor, Timothy R.; Bailey, Timothy L.

    2014-01-01

    Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088

  8. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    International Nuclear Information System (INIS)

    Salem, Tamer Z.; Zhang, Fengrui; Thiem, Suzanne M.

    2013-01-01

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  9. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Salem, Tamer Z. [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbial Molecular Biology, AGERI, Agricultural Research Center, Giza 12619 (Egypt); Division of Biomedical Sciences, Zewail University, Zewail City of Science and Technology, Giza 12588 (Egypt); Zhang, Fengrui [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Thiem, Suzanne M., E-mail: smthiem@msu.edu [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824 (United States)

    2013-01-20

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  10. Identification of high-risk cutaneous melanoma tumors is improved when combining the online American Joint Committee on Cancer Individualized Melanoma Patient Outcome Prediction Tool with a 31-gene expression profile-based classification.

    Science.gov (United States)

    Ferris, Laura K; Farberg, Aaron S; Middlebrook, Brooke; Johnson, Clare E; Lassen, Natalie; Oelschlager, Kristen M; Maetzold, Derek J; Cook, Robert W; Rigel, Darrell S; Gerami, Pedram

    2017-05-01

    A significant proportion of patients with American Joint Committee on Cancer (AJCC)-defined early-stage cutaneous melanoma have disease recurrence and die. A 31-gene expression profile (GEP) that accurately assesses metastatic risk associated with primary cutaneous melanomas has been described. We sought to compare accuracy of the GEP in combination with risk determined using the web-based AJCC Individualized Melanoma Patient Outcome Prediction Tool. GEP results from 205 stage I/II cutaneous melanomas with sufficient clinical data for prognostication using the AJCC tool were classified as low (class 1) or high (class 2) risk. Two 5-year overall survival cutoffs (AJCC 79% and 68%), reflecting survival for patients with stage IIA or IIB disease, respectively, were assigned for binary AJCC risk. Cox univariate analysis revealed significant risk classification of distant metastasis-free and overall survival (hazard ratio range 3.2-9.4, P risk by GEP but low risk by AJCC. Specimens reflect tertiary care center referrals; more effective therapies have been approved for clinical use after accrual. The GEP provides valuable prognostic information and improves identification of high-risk melanomas when used together with the AJCC online prediction tool. Copyright © 2016 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.

  11. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  12. Gene Expression Profiling of Xeroderma Pigmentosum

    Directory of Open Access Journals (Sweden)

    Bowden Nikola A

    2006-05-01

    Full Text Available Abstract Xeroderma pigmentosum (XP is a rare recessive disorder that is characterized by extreme sensitivity to UV light. UV light exposure results in the formation of DNA damage such as cyclobutane dimers and (6-4 photoproducts. Nucleotide excision repair (NER orchestrates the removal of cyclobutane dimers and (6-4 photoproducts as well as some forms of bulky chemical DNA adducts. The disease XP is comprised of 7 complementation groups (XP-A to XP-G, which represent functional deficiencies in seven different genes, all of which are believed to be involved in NER. The main clinical feature of XP is various forms of skin cancers; however, neurological degeneration is present in XPA, XPB, XPD and XPG complementation groups. The relationship between NER and other types of DNA repair processes is now becoming evident but the exact relationships between the different complementation groups remains to be precisely determined. Using gene expression analysis we have identified similarities and differences after UV light exposure between the complementation groups XP-A, XP-C, XP-D, XP-E, XP-F, XP-G and an unaffected control. The results reveal that there is a graded change in gene expression patterns between the mildest, most similar to the control response (XP-E and the severest form (XP-A of the disease, with the exception of XP-D. Distinct differences between the complementation groups with neurological symptoms (XP-A, XP-D and XP-G and without (XP-C, XP-E and XP-F were also identified. Therefore, this analysis has revealed distinct gene expression profiles for the XP complementation groups and the first step towards understanding the neurological symptoms of XP.

  13. Changes in gene expression following androgen receptor blockade ...

    Indian Academy of Sciences (India)

    Madhu urs

    of gene expression in the ventral prostate, it is not clear whether all the gene expression ... These include clusterin, methionine adenosyl transferase IIα, and prostate-specific ..... MAGEE1 melanoma antigen and no similarity was found with the ...

  14. Rubisco activity and gene expression of tropical tree species under ...

    African Journals Online (AJOL)

    Young

    2013-05-15

    May 15, 2013 ... Proteomics analysis associated with gene expression of plants reveal .... Consequently, Rubisco enzyme plays a role in assi- milating into ... technique for examining gene expression encoded at the. mRNA level .... Ammonia.

  15. Gene structure, phylogeny and expression profile of the sucrose ...

    Indian Academy of Sciences (India)

    Gene structure, phylogeny and expression profile of the sucrose synthase gene family in .... 24, 701–713. Bate N. and Twell D. 1998 Functional architecture of a late pollen .... Manzara T. and Gruissem W. 1988 Organization and expression.

  16. Cholinergic regulation of VIP gene expression in human neuroblastoma cells

    DEFF Research Database (Denmark)

    Kristensen, Bo; Georg, Birgitte; Fahrenkrug, Jan

    1997-01-01

    Vasoactive intestinal polypeptide, muscarinic receptor, neuroblastoma cell, mRNA, gene expression, peptide processing......Vasoactive intestinal polypeptide, muscarinic receptor, neuroblastoma cell, mRNA, gene expression, peptide processing...

  17. Prognostic Gene Expression Profiles in Breast Cancer

    DEFF Research Database (Denmark)

    Sørensen, Kristina Pilekær

    Each year approximately 4,800 Danish women are diagnosed with breast cancer. Several clinical and pathological factors are used as prognostic and predictive markers to categorize the patients into groups of high or low risk. Around 90% of all patients are allocated to the high risk group...... clinical courses, and they may be useful as novel prognostic biomarkers in breast cancer. The aim of the present project was to predict the development of metastasis in lymph node negative breast cancer patients by RNA profiling. We collected and analyzed 82 primary breast tumors from patients who...... and the time of event. Previous findings have shown that high expression of the lncRNA HOTAIR is correlated with poor survival in breast cancer. We validated this finding by demonstrating that high HOTAIR expression in our primary tumors was significantly associated with worse prognosis independent...

  18. Allele specific expression in worker reproduction genes in the bumblebee Bombus terrestris

    Directory of Open Access Journals (Sweden)

    Harindra E. Amarasinghe

    2015-07-01

    Full Text Available Methylation has previously been associated with allele specific expression in ants. Recently, we found methylation is important in worker reproduction in the bumblebee Bombus terrestris. Here we searched for allele specific expression in twelve genes associated with worker reproduction in bees. We found allele specific expression in Ecdysone 20 monooxygenase and IMP-L2-like. Although we were unable to confirm a genetic or epigenetic cause for this allele specific expression, the expression patterns of the two genes match those predicted for imprinted genes.

  19. Nuclear AXIN2 represses MYC gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Rennoll, Sherri A.; Konsavage, Wesley M.; Yochum, Gregory S., E-mail: gsy3@psu.edu

    2014-01-03

    Highlights: •AXIN2 localizes to cytoplasmic and nuclear compartments in colorectal cancer cells. •Nuclear AXIN2 represses the activity of Wnt-responsive luciferase reporters. •β-Catenin bridges AXIN2 to TCF transcription factors. •AXIN2 binds the MYC promoter and represses MYC gene expression. -- Abstract: The β-catenin transcriptional coactivator is the key mediator of the canonical Wnt signaling pathway. In the absence of Wnt, β-catenin associates with a cytosolic and multi-protein destruction complex where it is phosphorylated and targeted for proteasomal degradation. In the presence of Wnt, the destruction complex is inactivated and β-catenin translocates into the nucleus. In the nucleus, β-catenin binds T-cell factor (TCF) transcription factors to activate expression of c-MYC (MYC) and Axis inhibition protein 2 (AXIN2). AXIN2 is a member of the destruction complex and, thus, serves in a negative feedback loop to control Wnt/β-catenin signaling. AXIN2 is also present in the nucleus, but its function within this compartment is unknown. Here, we demonstrate that AXIN2 localizes to the nuclei of epithelial cells within normal and colonic tumor tissues as well as colorectal cancer cell lines. In the nucleus, AXIN2 represses expression of Wnt/β-catenin-responsive luciferase reporters and forms a complex with β-catenin and TCF. We demonstrate that AXIN2 co-occupies β-catenin/TCF complexes at the MYC promoter region. When constitutively localized to the nucleus, AXIN2 alters the chromatin structure at the MYC promoter and directly represses MYC gene expression. These findings suggest that nuclear AXIN2 functions as a rheostat to control MYC expression in response to Wnt/β-catenin signaling.

  20. Nuclear AXIN2 represses MYC gene expression

    International Nuclear Information System (INIS)

    Rennoll, Sherri A.; Konsavage, Wesley M.; Yochum, Gregory S.

    2014-01-01

    Highlights: •AXIN2 localizes to cytoplasmic and nuclear compartments in colorectal cancer cells. •Nuclear AXIN2 represses the activity of Wnt-responsive luciferase reporters. •β-Catenin bridges AXIN2 to TCF transcription factors. •AXIN2 binds the MYC promoter and represses MYC gene expression. -- Abstract: The β-catenin transcriptional coactivator is the key mediator of the canonical Wnt signaling pathway. In the absence of Wnt, β-catenin associates with a cytosolic and multi-protein destruction complex where it is phosphorylated and targeted for proteasomal degradation. In the presence of Wnt, the destruction complex is inactivated and β-catenin translocates into the nucleus. In the nucleus, β-catenin binds T-cell factor (TCF) transcription factors to activate expression of c-MYC (MYC) and Axis inhibition protein 2 (AXIN2). AXIN2 is a member of the destruction complex and, thus, serves in a negative feedback loop to control Wnt/β-catenin signaling. AXIN2 is also present in the nucleus, but its function within this compartment is unknown. Here, we demonstrate that AXIN2 localizes to the nuclei of epithelial cells within normal and colonic tumor tissues as well as colorectal cancer cell lines. In the nucleus, AXIN2 represses expression of Wnt/β-catenin-responsive luciferase reporters and forms a complex with β-catenin and TCF. We demonstrate that AXIN2 co-occupies β-catenin/TCF complexes at the MYC promoter region. When constitutively localized to the nucleus, AXIN2 alters the chromatin structure at the MYC promoter and directly represses MYC gene expression. These findings suggest that nuclear AXIN2 functions as a rheostat to control MYC expression in response to Wnt/β-catenin signaling

  1. Molecular mechanisms of curcumin action: gene expression.

    Science.gov (United States)

    Shishodia, Shishir

    2013-01-01

    Curcumin derived from the tropical plant Curcuma longa has a long history of use as a dietary agent, food preservative, and in traditional Asian medicine. It has been used for centuries to treat biliary disorders, anorexia, cough, diabetic wounds, hepatic disorders, rheumatism, and sinusitis. The preventive and therapeutic properties of curcumin are associated with its antioxidant, anti-inflammatory, and anticancer properties. Extensive research over several decades has attempted to identify the molecular mechanisms of curcumin action. Curcumin modulates numerous molecular targets by altering their gene expression, signaling pathways, or through direct interaction. Curcumin regulates the expression of inflammatory cytokines (e.g., TNF, IL-1), growth factors (e.g., VEGF, EGF, FGF), growth factor receptors (e.g., EGFR, HER-2, AR), enzymes (e.g., COX-2, LOX, MMP9, MAPK, mTOR, Akt), adhesion molecules (e.g., ELAM-1, ICAM-1, VCAM-1), apoptosis related proteins (e.g., Bcl-2, caspases, DR, Fas), and cell cycle proteins (e.g., cyclin D1). Curcumin modulates the activity of several transcription factors (e.g., NF-κB, AP-1, STAT) and their signaling pathways. Based on its ability to affect multiple targets, curcumin has the potential for the prevention and treatment of various diseases including cancers, arthritis, allergies, atherosclerosis, aging, neurodegenerative disease, hepatic disorders, obesity, diabetes, psoriasis, and autoimmune diseases. This review summarizes the molecular mechanisms of modulation of gene expression by curcumin. Copyright © 2012 International Union of Biochemistry and Molecular Biology, Inc.

  2. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  3. Expression of minichromosome maintenance genes in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Zhong HB

    2017-11-01

    Full Text Available Hongbin Zhong,1,* Bin Chen,1,* Henrique Neves,2 Jinchun Xing,1 Youxin Ye,1 Ying Lin,1 Guohong Zhuang,3 Shu-Dong Zhang,4 Jiyi Huang,1,5 Hang Fai Kwok2 1Xiang’an Branch, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, People’s Republic of China; 2Faculty of Health Sciences, University of Macau, Taipa, Macau SAR; 3Medical College of Xiamen University, Xiamen, Fujian, People’s Republic of China; 4Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Londonderry, UK; 5The First Clinical School of Fujian Medical University, Fuzhou, Fujian, People’s Republic of China *These authors contributed equally to this work Abstract: Minichromosome maintenance (MCM proteins play an essential role in DNA replication. They have been shown to be overexpressed in various types of cancer. However, the role of this family in renal cell carcinoma (RCC is widely unknown. In this study, we have identified a number of RCC datasets in the Gene Expression Omnibus database and also investigated the correlation between the expression levels of MCM genes and clinicopathological parameters. We found that the expression levels of MCM genes are positively correlated with one another. Expression levels of MCM2, MCM5, MCM6, and MCM7, but not of MCM3 and MCM4, were higher in RCC compared to paired adjacent normal tissue. Only the expression level of MCM4, but not of other MCMs, was positively correlated with tumor grade. In addition, a high-level expression of MCM2 in either primary tumor or metastases of RCC predicted a shorter disease-free survival time, while a high-level expression of MCM4 or MCM6 in primary tumor was also associated with poorer disease-free survival. Interestingly, we also demonstrated that patients with their primary RCC overexpressing 2 or more MCM genes had a shorter disease-free survival time, while those with RCC metastases overexpressing 3 or more MCM genes had a shorter

  4. A Classification Framework Applied to Cancer Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Hussein Hijazi

    2013-01-01

    Full Text Available Classification of cancer based on gene expression has provided insight into possible treatment strategies. Thus, developing machine learning methods that can successfully distinguish among cancer subtypes or normal versus cancer samples is important. This work discusses supervised learning techniques that have been employed to classify cancers. Furthermore, a two-step feature selection method based on an attribute estimation method (e.g., ReliefF and a genetic algorithm was employed to find a set of genes that can best differentiate between cancer subtypes or normal versus cancer samples. The application of different classification methods (e.g., decision tree, k-nearest neighbor, support vector machine (SVM, bagging, and random forest on 5 cancer datasets shows that no classification method universally outperforms all the others. However, k-nearest neighbor and linear SVM generally improve the classification performance over other classifiers. Finally, incorporating diverse types of genomic data (e.g., protein-protein interaction data and gene expression increase the prediction accuracy as compared to using gene expression alone.

  5. Molecular Characterization and Expression Analysis of Equine ( Gene in Horse (

    Directory of Open Access Journals (Sweden)

    Ki-Duk Song

    2014-05-01

    Full Text Available The objective of this study was to determine the molecular characteristics of the horse vascular endothelial growth factor alpha gene (VEGFα by constructing a phylogenetic tree, and to investigate gene expression profiles in tissues and blood leukocytes after exercise for development of suitable biomarkers. Using published amino acid sequences of other vertebrate species (human, chimpanzee, mouse, rat, cow, pig, chicken and dog, we constructed a phylogenetic tree which showed that equine VEGFα belonged to the same clade of the pig VEGFα. Analysis for synonymous (Ks and non-synonymous substitution ratios (Ka revealed that the horse VEGFα underwent positive selection. RNA was extracted from blood samples before and after exercise and different tissue samples of three horses. Expression analyses using reverse transcription-polymerase chain reaction (RT-PCR and quantitative-polymerase chain reaction (qPCR showed ubiquitous expression of VEGFα mRNA in skeletal muscle, kidney, thyroid, lung, appendix, colon, spinal cord, and heart tissues. Analysis of differential expression of VEGFα gene in blood leukocytes after exercise indicated a unimodal pattern. These results will be useful in developing biomarkers that can predict the recovery capacity of racing horses.

  6. Retrotransposons as regulators of gene expression.

    Science.gov (United States)

    Elbarbary, Reyad A; Lucas, Bronwyn A; Maquat, Lynne E

    2016-02-12

    Transposable elements (TEs) are both a boon and a bane to eukaryotic organisms, depending on where they integrate into the genome and how their sequences function once integrated. We focus on two types of TEs: long interspersed elements (LINEs) and short interspersed elements (SINEs). LINEs and SINEs are retrotransposons; that is, they transpose via an RNA intermediate. We discuss how LINEs and SINEs have expanded in eukaryotic genomes and contribute to genome evolution. An emerging body of evidence indicates that LINEs and SINEs function to regulate gene expression by affecting chromatin structure, gene transcription, pre-mRNA processing, or aspects of mRNA metabolism. We also describe how adenosine-to-inosine editing influences SINE function and how ongoing retrotransposition is countered by the body's defense mechanisms. Copyright © 2016, American Association for the Advancement of Science.

  7. Inductive matrix completion for predicting gene-disease associations.

    Science.gov (United States)

    Natarajan, Nagarajan; Dhillon, Inderjit S

    2014-06-15

    Most existing methods for predicting causal disease genes rely on specific type of evidence, and are therefore limited in terms of applicability. More often than not, the type of evidence available for diseases varies-for example, we may know linked genes, keywords associated with the disease obtained by mining text, or co-occurrence of disease symptoms in patients. Similarly, the type of evidence available for genes varies-for example, specific microarray probes convey information only for certain sets of genes. In this article, we apply a novel matrix-completion method called Inductive Matrix Completion to the problem of predicting gene-disease associations; it combines multiple types of evidence (features) for diseases and genes to learn latent factors that explain the observed gene-disease associations. We construct features from different biological sources such as microarray expression data and disease-related textual data. A crucial advantage of the method is that it is inductive; it can be applied to diseases not seen at training time, unlike traditional matrix-completion approaches and network-based inference methods that are transductive. Comparison with state-of-the-art methods on diseases from the Online Mendelian Inheritance in Man (OMIM) database shows that the proposed approach is substantially better-it has close to one-in-four chance of recovering a true association in the top 100 predictions, compared to the recently proposed Catapult method (second best) that has bigdata.ices.utexas.edu/project/gene-disease. © The Author 2014. Published by Oxford University Press.

  8. Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

    Science.gov (United States)

    Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

    2017-10-01

    During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.

  9. Gene expression profiling of cutaneous wound healing

    Directory of Open Access Journals (Sweden)

    Wang Ena

    2007-02-01

    Full Text Available Abstract Background Although the sequence of events leading to wound repair has been described at the cellular and, to a limited extent, at the protein level this process has yet to be fully elucidated. Genome wide transcriptional analysis tools promise to further define the global picture of this complex progression of events. Study Design This study was part of a placebo-controlled double-blind clinical trial in which basal cell carcinomas were treated topically with an immunomodifier – toll-like receptor 7 agonist: imiquimod. The fourteen patients with basal cell carcinoma in the placebo arm of the trial received placebo treatment consisting solely of vehicle cream. A skin punch biopsy was obtained immediately before treatment and at the end of the placebo treatment (after 2, 4 or 8 days. 17.5K cDNA microarrays were utilized to profile the biopsy material. Results Four gene signatures whose expression changed relative to baseline (before wound induction by the pre-treatment biopsy were identified. The largest group was comprised predominantly of inflammatory genes whose expression was increased throughout the study. Two additional signatures were observed which included preferentially pro-inflammatory genes in the early post-treatment biopsies (2 days after pre-treatment biopsies and repair and angiogenesis genes in the later (4 to 8 days biopsies. The fourth and smallest set of genes was down-regulated throughout the study. Early in wound healing the expression of markers of both M1 and M2 macrophages were increased, but later M2 markers predominated. Conclusion The initial response to a cutaneous wound induces powerful transcriptional activation of pro-inflammatory stimuli which may alert the host defense. Subsequently and in the absence of infection, inflammation subsides and it is replaced by angiogenesis and remodeling. Understanding this transition which may be driven by a change from a mixed macrophage population to predominately M2

  10. Differential expression of cell adhesion genes

    DEFF Research Database (Denmark)

    Stein, Wilfred D; Litman, Thomas; Fojo, Tito

    2005-01-01

    that compare cells grown in suspension to similar cells grown attached to one another as aggregates have suggested that it is adhesion to the extracellular matrix of the basal membrane that confers resistance to apoptosis and, hence, resistance to cytotoxins. The genes whose expression correlates with poor...... in cell adhesion and the cytoskeleton. If the proteins involved in tethering cells to the extracellular matrix are important in conferring drug resistance, it may be possible to improve chemotherapy by designing drugs that target these proteins....

  11. Network Completion for Static Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Natsu Nakajima

    2014-01-01

    Full Text Available We tackle the problem of completing and inferring genetic networks under stationary conditions from static data, where network completion is to make the minimum amount of modifications to an initial network so that the completed network is most consistent with the expression data in which addition of edges and deletion of edges are basic modification operations. For this problem, we present a new method for network completion using dynamic programming and least-squares fitting. This method can find an optimal solution in polynomial time if the maximum indegree of the network is bounded by a constant. We evaluate the effectiveness of our method through computational experiments using synthetic data. Furthermore, we demonstrate that our proposed method can distinguish the differences between two types of genetic networks under stationary conditions from lung cancer and normal gene expression data.

  12. Inferring gene expression dynamics via functional regression analysis

    Directory of Open Access Journals (Sweden)

    Leng Xiaoyan

    2008-01-01

    Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.

  13. Gene expression patterns in peripheral blood correlate with the extent of coronary artery disease.

    Directory of Open Access Journals (Sweden)

    Peter R Sinnaeve

    Full Text Available Systemic and local inflammation plays a prominent role in the pathogenesis of atherosclerotic coronary artery disease, but the relationship of whole blood gene expression changes with coronary disease remains unclear. We have investigated whether gene expression patterns in peripheral blood correlate with the severity of coronary disease and whether these patterns correlate with the extent of atherosclerosis in the vascular wall. Patients were selected according to their coronary artery disease index (CADi, a validated angiographical measure of the extent of coronary atherosclerosis that correlates with outcome. RNA was extracted from blood of 120 patients with at least a stenosis greater than 50% (CADi > or = 23 and from 121 controls without evidence of coronary stenosis (CADi = 0. 160 individual genes were found to correlate with CADi (rho > 0.2, P<0.003. Prominent differential expression was observed especially in genes involved in cell growth, apoptosis and inflammation. Using these 160 genes, a partial least squares multivariate regression model resulted in a highly predictive model (r(2 = 0.776, P<0.0001. The expression pattern of these 160 genes in aortic tissue also predicted the severity of atherosclerosis in human aortas, showing that peripheral blood gene expression associated with coronary atherosclerosis mirrors gene expression changes in atherosclerotic arteries. In conclusion, the simultaneous expression pattern of 160 genes in whole blood correlates with the severity of coronary artery disease and mirrors expression changes in the atherosclerotic vascular wall.

  14. Predicting the emotions expressed in music

    DEFF Research Database (Denmark)

    Madsen, Jens

    With the ever-growing popularity and availability of digital music through streaming services and digital download, making sense of the millions of songs, is ever more pertinent. However the traditional approach of creating music systems has treated songs like items in a store, like books...... and movies. However music is special, having origins in a number of evolutionary adaptations. The fundamental needs and goals of a users use of music, was investigated to create the next generation of music systems. People listen to music to regulate their mood and emotions was found to be the most important...... fundamental reason. (Mis)matching peoples mood with the emotions expressed in music was found to be an essential underlying mechanism, people use to regulate their emotions. This formed the basis and overall goal of the thesis, to investigate how to create a predictive model of emotions expressed in music...

  15. Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

    Science.gov (United States)

    Arnaiz, Olivier; Van Dijk, Erwin; Bétermier, Mireille; Lhuillier-Akakpo, Maoussi; de Vanssay, Augustin; Duharcourt, Sandra; Sallet, Erika; Gouzy, Jérôme; Sperling, Linda

    2017-06-26

    The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis

  16. Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko

    2015-12-23

    Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis-eQTLs. Expression

  17. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  18. Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation

    Energy Technology Data Exchange (ETDEWEB)

    Elias, Dwayne A.; Mukhopadhyay, Aindrila; Joachimiak, Marcin P.; Drury, Elliott C.; Redding, Alyssa M.; Yen, Huei-Che B.; Fields, Matthew W.; Hazen, Terry C.; Arkin, Adam P.; Keasling, Jay D.; Wall, Judy D.

    2008-10-27

    Hypothetical and conserved hypothetical genes account for>30percent of sequenced bacterial genomes. For the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough, 347 of the 3634 genes were annotated as conserved hypothetical (9.5percent) along with 887 hypothetical genes (24.4percent). Given the large fraction of the genome, it is plausible that some of these genes serve critical cellular roles. The study goals were to determine which genes were expressed and provide a more functionally based annotation. To accomplish this, expression profiles of 1234 hypothetical and conserved genes were used from transcriptomic datasets of 11 environmental stresses, complemented with shotgun LC-MS/MS and AMT tag proteomic data. Genes were divided into putatively polycistronic operons and those predicted to be monocistronic, then classified by basal expression levels and grouped according to changes in expression for one or multiple stresses. 1212 of these genes were transcribed with 786 producing detectable proteins. There was no evidence for expression of 17 predicted genes. Except for the latter, monocistronic gene annotation was expanded using the above criteria along with matching Clusters of Orthologous Groups. Polycistronic genes were annotated in the same manner with inferences from their proximity to more confidently annotated genes. Two targeted deletion mutants were used as test cases to determine the relevance of the inferred functional annotations.

  19. Interactive visualization of gene regulatory networks with associated gene expression time series data

    NARCIS (Netherlands)

    Westenberg, M.A.; Hijum, van S.A.F.T.; Lulko, A.T.; Kuipers, O.P.; Roerdink, J.B.T.M.; Linsen, L.; Hagen, H.; Hamann, B.

    2008-01-01

    We present GENeVis, an application to visualize gene expression time series data in a gene regulatory network context. This is a network of regulator proteins that regulate the expression of their respective target genes. The networks are represented as graphs, in which the nodes represent genes,

  20. Predicting Hydrologic Function With Aquatic Gene Fragments

    Science.gov (United States)

    Good, S. P.; URycki, D. R.; Crump, B. C.

    2018-03-01

    Recent advances in microbiology techniques, such as genetic sequencing, allow for rapid and cost-effective collection of large quantities of genetic information carried within water samples. Here we posit that the unique composition of aquatic DNA material within a water sample contains relevant information about hydrologic function at multiple temporal scales. In this study, machine learning was used to develop discharge prediction models trained on the relative abundance of bacterial taxa classified into operational taxonomic units (OTUs) based on 16S rRNA gene sequences from six large arctic rivers. We term this approach "genohydrology," and show that OTU relative abundances can be used to predict river discharge at monthly and longer timescales. Based on a single DNA sample from each river, the average Nash-Sutcliffe efficiency (NSE) for predicted mean monthly discharge values throughout the year was 0.84, while the NSE for predicted discharge values across different return intervals was 0.67. These are considerable improvements over predictions based only on the area-scaled mean specific discharge of five similar rivers, which had average NSE values of 0.64 and -0.32 for seasonal and recurrence interval discharge values, respectively. The genohydrology approach demonstrates that genetic diversity within the aquatic microbiome is a large and underutilized data resource with benefits for prediction of hydrologic function.

  1. Ionizing Radiation Affects Gene Expression in Mouse Skin and Bone

    Science.gov (United States)

    Terada, Masahiro; Tahimic, Candice; Sowa, Marianne B.; Schreurs, Ann-Sofie; Shirazi-Fard, Yasaman; Alwood, Joshua; Globus, Ruth K.

    2017-01-01

    Future long-duration space exploration beyond low earth orbit will increase human exposure to space radiation and microgravity conditions as well as associated risks to skeletal health. In animal studies, radiation exposure (greater than 1 Gy) is associated with pathological changes in bone structure, enhanced bone resorption, reduced bone formation and decreased bone mineral density, which can lead to skeletal fragility. Definitive measurements and detection of bone loss typically require large and specialized equipment which can make their application to long duration space missions logistically challenging. Towards the goal of developing non-invasive and less complicated monitoring methods to predict astronauts' health during spaceflight, we examined whether radiation induced gene expression changes in skin may be predictive of the responses of skeletal tissue to radiation exposure. We examined oxidative stress and growth arrest pathways in mouse skin and long bones by measuring gene expression levels via quantitative polymerase chain reaction (qPCR) after exposure to total body irradiation (IR). To investigate the effects of irradiation on gene expression, we used skin and femora (cortical shaft) from the following treatment groups: control (normally loaded, sham-irradiated), and IR (0.5 Gy 56Fe 600 MeV/n and 0.5 Gy 1H 150 MeV/n), euthanized at one and 11 days post-irradiation (IR). To determine the extent of bone loss, tibiae were harvested and cancellous microarchitecture in the proximal tibia quantified ex vivo using microcomputed tomography (microCT). Statistical analysis was performed using Student's t-test. At one day post-IR, expression of FGF18 in skin was significantly greater (3.8X) than sham-irradiated controls, but did not differ at 11 days post IR. Expression levels of other genes associated with antioxidant response (Nfe2l2, FoxO3 and Sod1) and the cell cycle (Trp53, Cdkn1a, Gadd45g) did not significantly differ between the control and IR groups

  2. Positive selection on gene expression in the human brain

    DEFF Research Database (Denmark)

    Khaitovich, Philipp; Tang, Kun; Franz, Henriette

    2006-01-01

    Recent work has shown that the expression levels of genes transcribed in the brains of humans and chimpanzees have changed less than those of genes transcribed in other tissues [1] . However, when gene expression changes are mapped onto the evolutionary lineage in which they occurred, the brain...... shows more changes than other tissues in the human lineage compared to the chimpanzee lineage [1] , [2] and [3] . There are two possible explanations for this: either positive selection drove more gene expression changes to fixation in the human brain than in the chimpanzee brain, or genes expressed...... in the brain experienced less purifying selection in humans than in chimpanzees, i.e. gene expression in the human brain is functionally less constrained. The first scenario would be supported if genes that changed their expression in the brain in the human lineage showed more selective sweeps than other genes...

  3. Radiation Gene-expression Signatures in Primary Breast Cancer Cells.

    Science.gov (United States)

    Minafra, Luigi; Bravatà, Valentina; Cammarata, Francesco P; Russo, Giorgio; Gilardi, Maria C; Forte, Giusi I

    2018-05-01

    In breast cancer (BC) care, radiation therapy (RT) is an efficient treatment to control localized tumor. Radiobiological research is needed to understand molecular differences that affect radiosensitivity of different tumor subtypes and the response variability. The aim of this study was to analyze gene expression profiling (GEP) in primary BC cells following irradiation with doses of 9 Gy and 23 Gy delivered by intraoperative electron radiation therapy (IOERT) in order to define gene signatures of response to high doses of ionizing radiation. We performed GEP by cDNA microarrays and evaluated cell survival after IOERT treatment in primary BC cell cultures. Real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) was performed to validate candidate genes. We showed, for the first time, a 4-gene and a 6-gene signature, as new molecular biomarkers, in two primary BC cell cultures after exposure at 9 Gy and 23 Gy respectively, for which we observed a significantly high survival rate. Gene signatures activated by different doses of ionizing radiation may predict response to RT and contribute to defining a personalized biological-driven treatment plan. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  4. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  5. Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

    Directory of Open Access Journals (Sweden)

    Kouji Satoh

    Full Text Available Rice (Oryza sativa L. is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE genes, 33K annotated non-expressed (ANE genes, and 5.5K non-annotated expressed (NAE genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.

  6. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

    Directory of Open Access Journals (Sweden)

    Clark Taane G

    2010-04-01

    Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes

  7. Gene expression of circulating tumour cells in breast cancer patients

    Directory of Open Access Journals (Sweden)

    Bölke E

    2009-09-01

    Full Text Available Abstract Background The diagnostic tools to predict the prognosis in patients suffering from breast cancer (BC need further improvements. New technological achievements like the gene profiling of circulating tumour cells (CTC could help identify new prognostic markers in the clinical setting. Furthermore, gene expression patterns of CTC might provide important informations on the mechanisms of tumour cell metastasation. Materials and methods We performed realtime-PCR and multiplex-PCR analyses following immunomagnetic separation of CTC. Peripheral blood (PB samples of 63 patients with breast cancer of various stages were analyzed and compared to a control group of 14 healthy individuals. After reverse-transcription, we performed multiplex PCR using primers for the genes ga733.3, muc-1 and c-erbB2. Mammaglobin1, spdef and c-erbB2 were analyzed applying realtime-PCR. Results ga733.2 overexpression was found in 12.7% of breast cancer cases, muc-1 in 15.9%, mgb1 in 9.1% and spdef in 12.1%. In this study, c-erbB2 did not show any significant correlation to BC, possibly due to a highly ambient expression. Besides single gene analyses, gene profiles were additionally evaluated. Highly significant correlations to BC were found in single gene analyses of ga733.2 and muc-1 and in gene profile analyses of ga733.3*muc-1 and GA7 ga733.3*muc-1*mgb1*spdef. Conclusion Our study reveals that the single genes ga733.3, muc-1 and the gene profiles ga733.3*muc-1 and ga733.3*3muc-1*mgb1*spdef can serve as markers for the detection of CTC in BC. The multigene analyses found highly positive levels in BC patients. Our study indicates that not single gene analyses but subtle patterns of multiple genes lead to rising accuracy and low loss of specificity in detection of breast cancer cases.

  8. The utility of optical detection system (qPCR) and bioinformatics methods in reference gene expression analysis

    Science.gov (United States)

    Skarzyńska, Agnieszka; Pawełkowicz, Magdalena; PlÄ der, Wojciech; Przybecki, Zbigniew

    2016-09-01

    Real-time quantitative polymerase chain reaction is consider as the most reliable method for gene expression studies. However, the expression of target gene could be misinterpreted due to improper normalization. Therefore, the crucial step for analysing of qPCR data is selection of suitable reference genes, which should be validated experimentally. In order to choice the gene with stable expression in the designed experiment, we performed reference gene expression analysis. In this study genes described in the literature and novel genes predicted as control genes, based on the in silico analysis of transcriptome data were used. Analysis with geNorm and NormFinder algorithms allow to create the ranking of candidate genes and indicate the best reference for flower morphogenesis study. According to the results, genes CACS and CYCL were characterised the most stable expression, but the least suitable genes were TUA and EF.

  9. Genomic Prediction of Gene Bank Wheat Landraces

    Directory of Open Access Journals (Sweden)

    José Crossa

    2016-07-01

    Full Text Available This study examines genomic prediction within 8416 Mexican landrace accessions and 2403 Iranian landrace accessions stored in gene banks. The Mexican and Iranian collections were evaluated in separate field trials, including an optimum environment for several traits, and in two separate environments (drought, D and heat, H for the highly heritable traits, days to heading (DTH, and days to maturity (DTM. Analyses accounting and not accounting for population structure were performed. Genomic prediction models include genotype × environment interaction (G × E. Two alternative prediction strategies were studied: (1 random cross-validation of the data in 20% training (TRN and 80% testing (TST (TRN20-TST80 sets, and (2 two types of core sets, “diversity” and “prediction”, including 10% and 20%, respectively, of the total collections. Accounting for population structure decreased prediction accuracy by 15–20% as compared to prediction accuracy obtained when not accounting for population structure. Accounting for population structure gave prediction accuracies for traits evaluated in one environment for TRN20-TST80 that ranged from 0.407 to 0.677 for Mexican landraces, and from 0.166 to 0.662 for Iranian landraces. Prediction accuracy of the 20% diversity core set was similar to accuracies obtained for TRN20-TST80, ranging from 0.412 to 0.654 for Mexican landraces, and from 0.182 to 0.647 for Iranian landraces. The predictive core set gave similar prediction accuracy as the diversity core set for Mexican collections, but slightly lower for Iranian collections. Prediction accuracy when incorporating G × E for DTH and DTM for Mexican landraces for TRN20-TST80 was around 0.60, which is greater than without the G × E term. For Iranian landraces, accuracies were 0.55 for the G × E model with TRN20-TST80. Results show promising prediction accuracies for potential use in germplasm enhancement and rapid introgression of exotic germplasm

  10. Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

    KAUST Repository

    Fujii, Chisato

    2015-04-16

    Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.

  11. Early pregnancy peripheral blood gene expression and risk of preterm delivery: a nested case control study

    Directory of Open Access Journals (Sweden)

    Muhie Seid Y

    2009-12-01

    Full Text Available Abstract Background Preterm delivery (PTD is a significant public health problem associated with greater risk of mortality and morbidity in infants and mothers. Pathophysiologic processes that may lead to PTD start early in pregnancy. We investigated early pregnancy peripheral blood global gene expression and PTD risk. Methods As part of a prospective study, ribonucleic acid was extracted from blood samples (collected at 16 weeks gestational age from 14 women who had PTD (cases and 16 women who delivered at term (controls. Gene expressions were measured using the GeneChip® Human Genome U133 Plus 2.0 Array. Student's T-test and fold change analysis were used to identify differentially expressed genes. We used hierarchical clustering and principle components analysis to characterize signature gene expression patterns among cases and controls. Pathway and promoter sequence analyses were used to investigate functions and functional relationships as well as regulatory regions of differentially expressed genes. Results A total of 209 genes, including potential candidate genes (e.g. PTGDS, prostaglandin D2 synthase 21 kDa, were differentially expressed. A set of these genes achieved accurate pre-diagnostic separation of cases and controls. These genes participate in functions related to immune system and inflammation, organ development, metabolism (lipid, carbohydrate and amino acid and cell signaling. Binding sites of putative transcription factors such as EGR1 (early growth response 1, TFAP2A (transcription factor AP2A, Sp1 (specificity protein 1 and Sp3 (specificity protein 3 were over represented in promoter regions of differentially expressed genes. Real-time PCR confirmed microarray expression measurements of selected genes. Conclusions PTD is associated with maternal early pregnancy peripheral blood gene expression changes. Maternal early pregnancy peripheral blood gene expression patterns may be useful for better understanding of PTD

  12. Kinetic models of gene expression including non-coding RNAs

    Energy Technology Data Exchange (ETDEWEB)

    Zhdanov, Vladimir P., E-mail: zhdanov@catalysis.r

    2011-03-15

    In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.

  13. Cross-study analysis of gene expression data for intermediate neuroblastoma identifies two biological subtypes

    International Nuclear Information System (INIS)

    Warnat, Patrick; Oberthuer, André; Fischer, Matthias; Westermann, Frank; Eils, Roland; Brors, Benedikt

    2007-01-01

    Neuroblastoma patients show heterogeneous clinical courses ranging from life-threatening progression to spontaneous regression. Recently, gene expression profiles of neuroblastoma tumours were associated with clinically different phenotypes. However, such data is still rare for important patient subgroups, such as patients with MYCN non-amplified advanced stage disease. Prediction of the individual course of disease and optimal therapy selection in this cohort is challenging. Additional research effort is needed to describe the patterns of gene expression in this cohort and to identify reliable prognostic markers for this subset of patients. We combined gene expression data from two studies in a meta-analysis in order to investigate differences in gene expression of advanced stage (3 or 4) tumours without MYCN amplification that show contrasting outcomes (alive or dead) at five years after initial diagnosis. In addition, a predictive model for outcome was generated. Gene expression profiles from 66 patients were included from two studies using different microarray platforms. In the combined data set, 72 genes were identified as differentially expressed by meta-analysis at a false discovery rate (FDR) of 8.33%. Meta-analysis detected 34 differentially expressed genes that were not found as significant in either single study. Outcome prediction based on data of both studies resulted in a predictive accuracy of 77%. Moreover, the genes that were differentially expressed in subgroups of advanced stage patients without MYCN amplification accurately separated MYCN amplified tumours from low stage tumours without MYCN amplification. Our findings support the hypothesis that neuroblastoma consists of two biologically distinct subgroups that differ by characteristic gene expression patterns, which are associated with divergent clinical outcome

  14. Understanding gene expression in coronary artery disease through ...

    Indian Academy of Sciences (India)

    Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. Prathima ... Table 2. Differentially expressed genes in CAD compared to age and gender matched controls. .... Regulation of nuclear pre-mRNA domain containing 1A.

  15. Improved gene expression signature of testicular carcinoma in situ

    DEFF Research Database (Denmark)

    Almstrup, Kristian; Leffers, Henrik; Lothe, Ragnhild A

    2007-01-01

    on global gene expression in testicular CIS have been previously published. We have merged the two data sets on CIS samples (n = 6) and identified the shared gene expression signature in relation to expression in normal testis. Among the top-20 highest expressed genes, one-third was transcription factors...... development' were significantly altered and could collectively affect cellular pathways like the WNT signalling cascade, which thus may be disrupted in testicular CIS. The merged CIS data from two different microarray platforms, to our knowledge, provide the most precise CIS gene expression signature to date....

  16. Genetic Variants Contribute to Gene Expression Variability in Humans

    Science.gov (United States)

    Hulse, Amanda M.; Cai, James J.

    2013-01-01

    Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies focused on the mean of gene expression level, but not the variance of gene expression level (i.e., gene expression variability). In the present study, we systematically explore genome-wide association between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously fit the means and the variances of gene expression among the three possible genotypes of a biallelic SNP. The genomic loci showing significant association between the variances of gene expression and the genotypes are termed expression variability QTL (evQTL). Using a data set of gene expression in lymphoblastoid cell lines (LCLs) derived from 210 HapMap individuals, we identify cis-acting evQTL involving 218 distinct genes, among which 8 genes, ADCY1, CTNNA2, DAAM2, FERMT2, IL6, PLOD2, SNX7, and TNFRSF11B, are cross-validated using an extra expression data set of the same LCLs. We also identify ∼300 trans-acting evQTL between >13,000 common SNPs and 500 randomly selected representative genes. We employ two distinct scenarios, emphasizing single-SNP and multiple-SNP effects on expression variability, to explain the formation of evQTL. We argue that detecting evQTL may represent a novel method for effectively screening for genetic interactions, especially when the multiple-SNP influence on expression variability is implied. The implication of our results for revealing genetic mechanisms of gene expression variability is discussed. PMID:23150607

  17. Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

    Science.gov (United States)

    Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

    2015-01-01

    The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262

  18. Expression regulation of design process gene in product design

    DEFF Research Database (Denmark)

    Li, Bo; Fang, Lusheng; Li, Bo

    2011-01-01

    To improve the design process efficiency, this paper proposes the principle and methodology that design process gene controls the characteristics of design process under the framework of design process reuse and optimization based on design process gene. First, the concept of design process gene...... is proposed and analyzed, as well as its three categories i.e., the operator gene, the structural gene and the regulator gene. Second, the trigger mechanism that design objectives and constraints trigger the operator gene is constructed. Third, the expression principle of structural gene is analyzed...... with the example of design management gene. Last, the regulation mode that the regulator gene regulates the expression of the structural gene is established and it is illustrated by taking the design process management gene as an example. © (2011) Trans Tech Publications....

  19. Sex hormones and gene expression signatures in peripheral blood from postmenopausal women - the NOWAC postgenome study

    Directory of Open Access Journals (Sweden)

    Rylander Charlotta

    2011-03-01

    Full Text Available Abstract Background Postmenopausal hormone therapy (HT influences endogenous hormone concentrations and increases the risk of breast cancer. Gene expression profiling may reveal the mechanisms behind this relationship. Our objective was to explore potential associations between sex hormones and gene expression in whole blood from a population-based, random sample of postmenopausal women Methods Gene expression, as measured by the Applied Biosystems microarray platform, was compared between hormone therapy (HT users and non-users and between high and low hormone plasma concentrations using both gene-wise analysis and gene set analysis. Gene sets found to be associated with HT use were further analysed for enrichment in functional clusters and network predictions. The gene expression matrix included 285 samples and 16185 probes and was adjusted for significant technical variables. Results Gene-wise analysis revealed several genes significantly associated with different types of HT use. The functional cluster analyses provided limited information on these genes. Gene set analysis revealed 22 gene sets that were enriched between high and low estradiol concentration (HT-users excluded. Among these were seven oestrogen related gene sets, including our gene list associated with systemic estradiol use, which thereby represents a novel oestrogen signature. Seven gene sets were related to immune response. Among the 15 gene sets enriched for progesterone, 11 overlapped with estradiol. No significant gene expression patterns were found for testosterone, follicle stimulating hormone (FSH or sex hormone binding globulin (SHBG. Conclusions Distinct gene expression patterns associated with sex hormones are detectable in a random group of postmenopausal women, as demonstrated by the finding of a novel oestrogen signature.

  20. Dissecting specific and global transcriptional regulation of bacterial gene expression

    NARCIS (Netherlands)

    Gerosa, Luca; Kochanowski, Karl; Heinemann, Matthias; Sauer, Uwe

    Gene expression is regulated by specific transcriptional circuits but also by the global expression machinery as a function of growth. Simultaneous specific and global regulation thus constitutes an additional-but often neglected-layer of complexity in gene expression. Here, we develop an

  1. Bone Metastasis in Advanced Breast Cancer: Analysis of Gene Expression Microarray.

    Science.gov (United States)

    Cosphiadi, Irawan; Atmakusumah, Tubagus D; Siregar, Nurjati C; Muthalib, Abdul; Harahap, Alida; Mansyur, Muchtarruddin

    2018-03-08

    Approximately 30% to 40% of breast cancer recurrences involve bone metastasis (BM). Certain genes have been linked to BM; however, none have been able to predict bone involvement. In this study, we analyzed gene expression profiles in advanced breast cancer patients to elucidate genes that can be used to predict BM. A total of 92 advanced breast cancer patients, including 46 patients with BM and 46 patients without BM, were identified for this study. Immunohistochemistry and gene expression analysis was performed on 81 formalin-fixed paraffin-embedded samples. Data were collected through medical records, and gene expression of 200 selected genes compiled from 6 previous studies was performed using NanoString nCounter. Genetic expression profiles showed that 22 genes were significantly differentially expressed between breast cancer patients with metastasis in bone and other organs (BM+) and non-BM, whereas subjects with only BM showed 17 significantly differentially expressed genes. The following genes were associated with an increasing incidence of BM in the BM+ group: estrogen receptor 1 (ESR1), GATA binding protein 3 (GATA3), and melanophilin with an area under the curve (AUC) of 0.804. In the BM group, the following genes were associated with an increasing incidence of BM: ESR1, progesterone receptor, B-cell lymphoma 2, Rab escort protein, N-acetyltransferase 1, GATA3, annexin A9, and chromosome 9 open reading frame 116. ESR1 and GATA3 showed an increased strength of association with an AUC of 0.928. A combination of the identified 3 genes in BM+ and 8 genes in BM showed better prediction than did each individual gene, and this combination can be used as a training set. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Ecological transition predictably associated with gene degeneration.

    Science.gov (United States)

    Wessinger, Carolyn A; Rausher, Mark D

    2015-02-01

    Gene degeneration or loss can significantly contribute to phenotypic diversification, but may generate genetic constraints on future evolutionary trajectories, potentially restricting phenotypic reversal. Such constraints may manifest as directional evolutionary trends when parallel phenotypic shifts consistently involve gene degeneration or loss. Here, we demonstrate that widespread parallel evolution in Penstemon from blue to red flowers predictably involves the functional inactivation and degeneration of the enzyme flavonoid 3',5'-hydroxylase (F3'5'H), an anthocyanin pathway enzyme required for the production of blue floral pigments. Other types of genetic mutations do not consistently accompany this phenotypic shift. This pattern may be driven by the relatively large mutational target size of degenerative mutations to this locus and the apparent lack of associated pleiotropic effects. The consistent degeneration of F3'5'H may provide a mechanistic explanation for the observed asymmetry in the direction of flower color evolution in Penstemon: Blue to red transitions are common, but reverse transitions have not been observed. Although phenotypic shifts in this system are likely driven by natural selection, internal constraints may generate predictable genetic outcomes and may restrict future evolutionary trajectories. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Gene expression of the mismatch repair gene MSH2 in primary colorectal cancer

    DEFF Research Database (Denmark)

    Jensen, Lars Henrik; Kuramochi, Hidekazu; Crüger, Dorthe Gylling

    2011-01-01

    promoter was only detected in 14 samples and only at a low level with no correlation to gene expression. MSH2 gene expression was not a prognostic factor for overall survival in univariate or multivariate analysis. The gene expression of MSH2 is a potential quantitative marker ready for further clinical...

  4. Using RNA-Seq data to select refence genes for normalizing gene expression in apple roots

    Science.gov (United States)

    Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for t...

  5. Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

    Directory of Open Access Journals (Sweden)

    Takano Eriko

    2011-09-01

    Full Text Available Abstract Background Streptomyces coelicolor, a model organism of antibiotic producing bacteria, has one of the largest genomes of the bacterial kingdom, including 7825 predicted protein coding genes. A large number of these genes, nearly 34%, are functionally orphan (hypothetical proteins with unknown function. However, in gene expression time course data, many of these functionally orphan genes show interesting expression patterns. Results In this paper, we analyzed all functionally orphan genes of Streptomyces coelicolor and identified a list of "high priority" orphans by combining gene expression analysis and additional phylogenetic information (i.e. the level of evolutionary conservation of each protein. Conclusions The prioritized orphan genes are promising candidates to be examined experimentally in the lab for further characterization of their function.

  6. CDX2 gene expression in acute lymphoblastic leukemia

    International Nuclear Information System (INIS)

    Arnaoaut, H.H.; Mokhtar, D.A.; Samy, R.M.; Omar, Sh.A.; Khames, S.A.

    2014-01-01

    CDX genes are classically known as regulators of axial elongation during early embryogenesis. An unsuspected role for CDX genes has been revealed during hematopoietic development. The CDX gene family member CDX2 belongs to the most frequent aberrantly expressed proto-oncogenes in human acute leukemias and is highly leukemogenic in experimental models. We used reversed transcriptase polymerase chain reaction (RT-PCR) to determine the expression level of CDX2 gene in 30 pediatric patients with acute lymphoblastic leukemia (ALL) at diagnosis and 30 healthy volunteers. ALL patients were followed up to detect minimal residual disease (MRD) on days 15 and 42 of induction. We found that CDX2 gene was expressed in 50% of patients and not expressed in controls. Associations between gene expression and different clinical and laboratory data of patients revealed no impact on different findings. With follow up, we could not confirm that CDX2 expression had a prognostic significance.

  7. A comparative analysis of soft computing techniques for gene prediction.

    Science.gov (United States)

    Goel, Neelam; Singh, Shailendra; Aseri, Trilok Chand

    2013-07-01

    The rapid growth of genomic sequence data for both human and nonhuman species has made analyzing these sequences, especially predicting genes in them, very important and is currently the focus of many research efforts. Beside its scientific interest in the molecular biology and genomics community, gene prediction is of considerable importance in human health and medicine. A variety of gene prediction techniques have been developed for eukaryotes over the past few years. This article reviews and analyzes the application of certain soft computing techniques in gene prediction. First, the problem of gene prediction and its challenges are described. These are followed by different soft computing techniques along with their application to gene prediction. In addition, a comparative analysis of different soft computing techniques for gene prediction is given. Finally some limitations of the current research activities and future research directions are provided. Copyright © 2013 Elsevier Inc. All rights reserved.

  8. Developmentally regulated expression of reporter gene in adult ...

    Indian Academy of Sciences (India)

    pression of reporter gene in adult brain specific GAL4 enhancer traps of. Drosophila ... genes based on their expression pattern, thus enabling us to overcome the ... order association and storage centres of olfactory learning and memory, and ...

  9. Expression profiling identifies genes involved in emphysema severity

    Directory of Open Access Journals (Sweden)

    Bowman Rayleen V

    2009-09-01

    Full Text Available Abstract Chronic obstructive pulmonary disease (COPD is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p p Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.

  10. Automated discovery of functional generality of human gene expression programs.

    Directory of Open Access Journals (Sweden)

    Georg K Gerber

    2007-08-01

    Full Text Available An important research problem in computational biology is the identification of expression programs, sets of co-expressed genes orchestrating normal or pathological processes, and the characterization of the functional breadth of these programs. The use of human expression data compendia for discovery of such programs presents several challenges including cellular inhomogeneity within samples, genetic and environmental variation across samples, uncertainty in the numbers of programs and sample populations, and temporal behavior. We developed GeneProgram, a new unsupervised computational framework based on Hierarchical Dirichlet Processes that addresses each of the above challenges. GeneProgram uses expression data to simultaneously organize tissues into groups and genes into overlapping programs with consistent temporal behavior, to produce maps of expression programs, which are sorted by generality scores that exploit the automatically learned groupings. Using synthetic and real gene expression data, we showed that GeneProgram outperformed several popular expression analysis methods. We applied GeneProgram to a compendium of 62 short time-series gene expression datasets exploring the responses of human cells to infectious agents and immune-modulating molecules. GeneProgram produced a map of 104 expression programs, a substantial number of which were significantly enriched for genes involved in key signaling pathways and/or bound by NF-kappaB transcription factors in genome-wide experiments. Further, GeneProgram discovered expression programs that appear to implicate surprising signaling pathways or receptor types in the response to infection, including Wnt signaling and neurotransmitter receptors. We believe the discovered map of expression programs involved in the response to infection will be useful for guiding future biological experiments; genes from programs with low generality scores might serve as new drug targets that exhibit minimal

  11. Analysis of multiplex gene expression maps obtained by voxelation

    OpenAIRE

    An, L; Xie, H; Chin, MH; Obradovic, Z; Smith, DJ; Megalooikonomou, V

    2009-01-01

    Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we presen...

  12. Gene Expression Dynamics Accompanying the Sponge Thermal Stress Response.

    Science.gov (United States)

    Guzman, Christine; Conaco, Cecilia

    2016-01-01

    Marine sponges are important members of coral reef ecosystems. Thus, their responses to changes in ocean chemistry and environmental conditions, particularly to higher seawater temperatures, will have potential impacts on the future of these reefs. To better understand the sponge thermal stress response, we investigated gene expression dynamics in the shallow water sponge, Haliclona tubifera (order Haplosclerida, class Demospongiae), subjected to elevated temperature. Using high-throughput transcriptome sequencing, we show that these conditions result in the activation of various processes that interact to maintain cellular homeostasis. Short-term thermal stress resulted in the induction of heat shock proteins, antioxidants, and genes involved in signal transduction and innate immunity pathways. Prolonged exposure to thermal stress affected the expression of genes involved in cellular damage repair, apoptosis, signaling and transcription. Interestingly, exposure to sublethal temperatures may improve the ability of the sponge to mitigate cellular damage under more extreme stress conditions. These insights into the potential mechanisms of adaptation and resilience of sponges contribute to a better understanding of sponge conservation status and the prediction of ecosystem trajectories under future climate conditions.

  13. Host Gene Expression Analysis in Sri Lankan Melioidosis Patients

    Science.gov (United States)

    2017-06-19

    CCL5 Chemokine (C-C motif) ligand 5 /RANTES. IFNγ Interferon gamma TNFα Tumor necrosis factor alpha HMGB1 High mobility group box 1 protein /high...aim of this study was to analyze gene expression levels of human host factors in melioidosis patients and establish useful correlation with disease...PBMC’s) of study subjects. Gene expression profiles of 25 gene targets including 19 immune response genes and 6 epigenetic factors were analyzed by

  14. Aging: a portrait from gene expression profile in blood cells.

    Science.gov (United States)

    Calabria, Elisa; Mazza, Emilia Maria Cristina; Dyar, Kenneth Allen; Pogliaghi, Silvia; Bruseghini, Paolo; Morandi, Carlo; Salvagno, Gian Luca; Gelati, Matteo; Guidi, Gian Cesare; Bicciato, Silvio; Schiaffino, Stefano; Schena, Federico; Capelli, Carlo

    2016-08-01

    The availability of reliable biomarkers of aging is important not only to monitor the effect of interventions and predict the timing of pathologies associated with aging but also to understand the mechanisms and devise appropriate countermeasures. Blood cells provide an easily available tissue and gene expression profiles from whole blood samples appear to mirror disease states and some aspects of the aging process itself. We report here a microarray analysis of whole blood samples from two cohorts of healthy adult and elderly subjects, aged 43±3 and 68±4 years, respectively, to monitor gene expression changes in the initial phase of the senescence process. A number of significant changes were found in the elderly compared to the adult group, including decreased levels of transcripts coding for components of the mitochondrial respiratory chain, which correlate with a parallel decline in the maximum rate of oxygen consumption (VO2max), as monitored in the same subjects. In addition, blood cells show age-related changes in the expression of several markers of immunosenescence, inflammation and oxidative stress. These findings support the notion that the immune system has a major role in tissue homeostasis and repair, which appears to be impaired since early stages of the aging process.

  15. Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process

    International Nuclear Information System (INIS)

    Chandran, Uma R; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A

    2007-01-01

    Prostate cancer is characterized by heterogeneity in the clinical course that often does not correlate with morphologic features of the tumor. Metastasis reflects the most adverse outcome of prostate cancer, and to date there are no reliable morphologic features or serum biomarkers that can reliably predict which patients are at higher risk of developing metastatic disease. Understanding the differences in the biology of metastatic and organ confined primary tumors is essential for developing new prognostic markers and therapeutic targets. Using Affymetrix oligonucleotide arrays, we analyzed gene expression profiles of 24 androgen-ablation resistant metastatic samples obtained from 4 patients and a previously published dataset of 64 primary prostate tumor samples. Differential gene expression was analyzed after removing potentially uninformative stromal genes, addressing the differences in cellular content between primary and metastatic tumors. The metastatic samples are highly heterogenous in expression; however, differential expression analysis shows that 415 genes are upregulated and 364 genes are downregulated at least 2 fold in every patient with metastasis. The expression profile of metastatic samples reveals changes in expression of a unique set of genes representing both the androgen ablation related pathways and other metastasis related gene networks such as cell adhesion, bone remodelling and cell cycle. The differentially expressed genes include metabolic enzymes, transcription factors such as Forkhead Box M1 (FoxM1) and cell adhesion molecules such as Osteopontin (SPP1). We hypothesize that these genes have a role in the biology of metastatic disease and that they represent potential therapeutic targets for prostate cancer

  16. Concerted down-regulation of immune-system related genes predicts metastasis in colorectal carcinoma

    International Nuclear Information System (INIS)

    Fehlker, Marion; Huska, Matthew R; Jöns, Thomas; Andrade-Navarro, Miguel A; Kemmner, Wolfgang

    2014-01-01

    This study aimed at the identification of prognostic gene expression markers in early primary colorectal carcinomas without metastasis at the time point of surgery by analyzing genome-wide gene expression profiles using oligonucleotide microarrays. Cryo-conserved tumor specimens from 45 patients with early colorectal cancers were examined, with the majority of them being UICC stage II or earlier and with a follow-up time of 41–115 months. Gene expression profiling was performed using Whole Human Genome 4x44K Oligonucleotide Microarrays. Validation of microarray data was performed on five of the genes in a smaller cohort. Using a novel algorithm based on the recursive application of support vector machines (SVMs), we selected a signature of 44 probes that discriminated between patients developing later metastasis and patients with a good prognosis. Interestingly, almost half of the genes was related to the patients’ immune response and showed reduced expression in the metastatic cases. Whereas up to now gene signatures containing genes with various biological functions have been described for prediction of metastasis in CRC, in this study metastasis could be well predicted by a set of gene expression markers consisting exclusively of genes related to the MHC class II complex involved in immune response. Thus, our data emphasize that the proper function of a comprehensive network of immune response genes is of vital importance for the survival of colorectal cancer patients

  17. Heterologous gene expression in filamentous fungi.

    Science.gov (United States)

    Su, Xiaoyun; Schmitz, George; Zhang, Meiling; Mackie, Roderick I; Cann, Isaac K O

    2012-01-01

    Filamentous fungi are critical to production of many commercial enzymes and organic compounds. Fungal-based systems have several advantages over bacterial-based systems for protein production because high-level secretion of enzymes is a common trait of their decomposer lifestyle. Furthermore, in the large-scale production of recombinant proteins of eukaryotic origin, the filamentous fungi become the vehicle of choice due to critical processes shared in gene expression with other eukaryotic organisms. The complexity and relative dearth of understanding of the physiology of filamentous fungi, compared to bacteria, have hindered rapid development of these organisms as highly efficient factories for the production of heterologous proteins. In this review, we highlight several of the known benefits and challenges in using filamentous fungi (particularly Aspergillus spp., Trichoderma reesei, and Neurospora crassa) for the production of proteins, especially heterologous, nonfungal enzymes. We review various techniques commonly employed in recombinant protein production in the filamentous fungi, including transformation methods, selection of gene regulatory elements such as promoters, protein secretion factors such as the signal peptide, and optimization of coding sequence. We provide insights into current models of host genomic defenses such as repeat-induced point mutation and quelling. Furthermore, we examine the regulatory effects of transcript sequences, including introns and untranslated regions, pre-mRNA (messenger RNA) processing, transcript transport, and mRNA stability. We anticipate that this review will become a resource for researchers who aim at advancing the use of these fascinating organisms as protein production factories, for both academic and industrial purposes, and also for scientists with general interest in the biology of the filamentous fungi. Copyright © 2012 Elsevier Inc. All rights reserved.

  18. Association between gene expression profile of the primary tumor and chemotherapy response of metastatic breast cancer

    NARCIS (Netherlands)

    Savci-Heijink, Cemile Dilara; Halfwerk, Hans; Koster, Jan; van de Vijver, Marc Joan

    2017-01-01

    Background: To better predict the likelihood of response to chemotherapy, we have conducted a study comparing the gene expression patterns of primary tumours with their corresponding response to systemic chemotherapy in the metastatic setting. Methods: mRNA expression profiles of breast carcinomas

  19. Rhythmic diel pattern of gene expression in juvenile maize leaf.

    Directory of Open Access Journals (Sweden)

    Maciej Jończyk

    Full Text Available BACKGROUND: Numerous biochemical and physiological parameters of living organisms follow a circadian rhythm. Although such rhythmic behavior is particularly pronounced in plants, which are strictly dependent on the daily photoperiod, data on the molecular aspects of the diurnal cycle in plants is scarce and mostly concerns the model species Arabidopsis thaliana. Here we studied the leaf transcriptome in seedlings of maize, an important C4 crop only distantly related to A. thaliana, throughout a cycle of 10 h darkness and 14 h light to look for rhythmic patterns of gene expression. RESULTS: Using DNA microarrays comprising ca. 43,000 maize-specific probes we found that ca. 12% of all genes showed clear-cut diel rhythms of expression. Cluster analysis identified 35 groups containing from four to ca. 1,000 genes, each comprising genes of similar expression patterns. Perhaps unexpectedly, the most pronounced and most common (concerning the highest number of genes expression maxima were observed towards and during the dark phase. Using Gene Ontology classification several meaningful functional associations were found among genes showing similar diel expression patterns, including massive induction of expression of genes related to gene expression, translation, protein modification and folding at dusk and night. Additionally, we found a clear-cut tendency among genes belonging to individual clusters to share defined transcription factor-binding sequences. CONCLUSIONS: Co-expressed genes belonging to individual clusters are likely to be regulated by common mechanisms. The nocturnal phase of the diurnal cycle involves gross induction of fundamental biochemical processes and should be studied more thoroughly than was appreciated in most earlier physiological studies. Although some general mechanisms responsible for the diel regulation of gene expression might be shared among plants, details of the diurnal regulation of gene expression seem to differ

  20. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

    DEFF Research Database (Denmark)

    Manijak, Mieszko P.; Nielsen, Henrik Bjørn

    2011-01-01

    circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....

  1. PRAME Gene Expression in Acute Leukemia and Its Clinical Significance

    International Nuclear Information System (INIS)

    Ding, Kai; Wang, Xiao-ming; Fu, Rong; Ruan, Er-bao; Liu, Hui; Shao, Zong-hong

    2012-01-01

    To investigate the expression of the preferentially expressed antigen of melanoma (PRAME) gene in acute leukemia and its clinical significance. The level of expressed PRAME mRNA in bone marrow mononuclear cells from 34 patients with acute leukemia (AL) and in 12 bone marrow samples from healthy volunteers was measured via RT-PCR. Correlation analyses between PRAME gene expression and the clinical characteristics (gender, age, white blood count, immunophenotype of leukemia, percentage of blast cells, and karyotype) of the patients were performed. The PRAME gene was expressed in 38.2% of all 34 patients, in 40.7% of the patients with acute myelogenous leukemia (AML, n=27), and in 28.6% of the patients with acute lymphoblastic leukemia (ALL, n=7), but was not expressed in the healthy volunteers. The difference in the expression levels between AML and ALL patients was statistically significant. The rate of gene expression was 80% in M 3 , 33.3% in M 2 , and 28.6% in M 5 . Gene expression was also found to be correlated with CD15 and CD33 expression and abnormal karyotype, but not with age, gender, white blood count or percentage of blast cells. The PRAME gene is highly expressed in acute leukemia and could be a useful marker to monitor minimal residual disease. This gene is also a candidate target for the immunotherapy of acute leukemia

  2. ZCCHC17 is a master regulator of synaptic gene expression in Alzheimer's disease.

    Science.gov (United States)

    Tomljanovic, Zeljko; Patel, Mitesh; Shin, William; Califano, Andrea; Teich, Andrew F

    2018-02-01

    In an effort to better understand the molecular drivers of synaptic and neurophysiologic dysfunction in Alzheimer's disease (AD), we analyzed neuronal gene expression data from human AD brain tissue to identify master regulators of synaptic gene expression. Master regulator analysis identifies ZCCHC17 as normally supporting the expression of a network of synaptic genes, and predicts that ZCCHC17 dysfunction in AD leads to lower expression of these genes. We demonstrate that ZCCHC17 is normally expressed in neurons and is reduced early in the course of AD pathology. We show that ZCCHC17 loss in rat neurons leads to lower expression of the majority of the predicted synaptic targets and that ZCCHC17 drives the expression of a similar gene network in humans and rats. These findings support a conserved function for ZCCHC17 between species and identify ZCCHC17 loss as an important early driver of lower synaptic gene expression in AD. Matlab and R scripts used in this paper are available at https://github.com/afteich/AD_ZCC. aft25@cumc.columbia.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  3. Analyzing kernel matrices for the identification of differentially expressed genes.

    Directory of Open Access Journals (Sweden)

    Xiao-Lei Xia

    Full Text Available One of the most important applications of microarray data is the class prediction of biological samples. For this purpose, statistical tests have often been applied to identify the differentially expressed genes (DEGs, followed by the employment of the state-of-the-art learning machines including the Support Vector Machines (SVM in particular. The SVM is a typical sample-based classifier whose performance comes down to how discriminant samples are. However, DEGs identified by statistical tests are not guaranteed to result in a training dataset composed of discriminant samples. To tackle this problem, a novel gene ranking method namely the Kernel Matrix Gene Selection (KMGS is proposed. The rationale of the method, which roots in the fundamental ideas of the SVM algorithm, is described. The notion of ''the separability of a sample'' which is estimated by performing [Formula: see text]-like statistics on each column of the kernel matrix, is first introduced. The separability of a classification problem is then measured, from which the significance of a specific gene is deduced. Also described is a method of Kernel Matrix Sequential Forward Selection (KMSFS which shares the KMGS method's essential ideas but proceeds in a greedy manner. On three public microarray datasets, our proposed algorithms achieved noticeably competitive performance in terms of the B.632+ error rate.

  4. Hormonal modulation of breast cancer gene expression: implications for intrinsic subtyping in pre-menopausal women

    OpenAIRE

    Sarah M Bernhardt; Pallave Dasari; David Walsh; Amanda R Townsend; Amanda R Townsend; Timothy J Price; Timothy J Price; Wendy V Ingman

    2016-01-01

    Clinics are increasingly adopting gene expression profiling to diagnose breast cancer subtype, providing an intrinsic, molecular portrait of the tumour. For example, the PAM50-based Prosigna test quantifies expression of 50 key genes to classify breast cancer subtype, and this method of classification has been demonstrated to be superior over traditional immunohistochemical methods that detect proteins, to predict risk of disease recurrence. However, these tests were largely developed and val...

  5. Hormonal Modulation of Breast Cancer Gene Expression: Implications for Intrinsic Subtyping in Premenopausal Women

    OpenAIRE

    Bernhardt, Sarah M.; Dasari, Pallave; Walsh, David; Townsend, Amanda R.; Price, Timothy J.; Ingman, Wendy V.

    2016-01-01

    Clinics are increasingly adopting gene-expression profiling to diagnose breast cancer subtype, providing an intrinsic, molecular portrait of the tumor. For example, the PAM50-based Prosigna test quantifies expression of 50 key genes to classify breast cancer subtype, and this method of classification has been demonstrated to be superior over traditional immunohistochemical methods that detect proteins, to predict risk of disease recurrence. However, these tests were largely developed and vali...

  6. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-05-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  7. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-01-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  8. Global gene expression analysis for evaluation and design of biomaterials

    Directory of Open Access Journals (Sweden)

    Nobutaka Hanagata, Taro Takemura and Takashi Minowa

    2010-01-01

    Full Text Available Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data.

  9. Global gene expression analysis for evaluation and design of biomaterials

    International Nuclear Information System (INIS)

    Hanagata, Nobutaka; Takemura, Taro; Minowa, Takashi

    2010-01-01

    Comprehensive gene expression analysis using DNA microarrays has become a widespread technique in molecular biological research. In the biomaterials field, it is used to evaluate the biocompatibility or cellular toxicity of metals, polymers and ceramics. Studies in this field have extracted differentially expressed genes in the context of differences in cellular responses among multiple materials. Based on these genes, the effects of materials on cells at the molecular level have been examined. Expression data ranging from several to tens of thousands of genes can be obtained from DNA microarrays. For this reason, several tens or hundreds of differentially expressed genes are often present in different materials. In this review, we outline the principles of DNA microarrays, and provide an introduction to methods of extracting information which is useful for evaluating and designing biomaterials from comprehensive gene expression data. (topical review)

  10. Redox regulation of photosynthetic gene expression.

    Science.gov (United States)

    Queval, Guillaume; Foyer, Christine H

    2012-12-19

    Redox chemistry and redox regulation are central to the operation of photosynthesis and respiration. However, the roles of different oxidants and antioxidants in the regulation of photosynthetic or respiratory gene expression remain poorly understood. Leaf transcriptome profiles of a range of Arabidopsis thaliana genotypes that are deficient in either hydrogen peroxide processing enzymes or in low molecular weight antioxidant were therefore compared to determine how different antioxidant systems that process hydrogen peroxide influence transcripts encoding proteins targeted to the chloroplasts or mitochondria. Less than 10 per cent overlap was observed in the transcriptome patterns of leaves that are deficient in either photorespiratory (catalase (cat)2) or chloroplastic (thylakoid ascorbate peroxidase (tapx)) hydrogen peroxide processing. Transcripts encoding photosystem II (PSII) repair cycle components were lower in glutathione-deficient leaves, as were the thylakoid NAD(P)H (nicotinamide adenine dinucleotide (phosphate)) dehydrogenases (NDH) mRNAs. Some thylakoid NDH mRNAs were also less abundant in tAPX-deficient and ascorbate-deficient leaves. Transcripts encoding the external and internal respiratory NDHs were increased by low glutathione and low ascorbate. Regulation of transcripts encoding specific components of the photosynthetic and respiratory electron transport chains by hydrogen peroxide, ascorbate and glutathione may serve to balance non-cyclic and cyclic electron flow pathways in relation to oxidant production and reductant availability.

  11. Cell cycle gene expression under clinorotation

    Science.gov (United States)

    Artemenko, Olga

    2016-07-01

    Cyclins and cyclin-dependent kinase (CDK) are main regulators of the cell cycle of eukaryotes. It's assumes a significant change of their level in cells under microgravity conditions and by other physical factors actions. The clinorotation use enables to determine the influence of gravity on simulated events in the cell during the cell cycle - exit from the state of quiet stage and promotion presynthetic phase (G1) and DNA synthesis phase (S) of the cell cycle. For the clinorotation effect study on cell proliferation activity is the necessary studies of molecular mechanisms of cell cycle regulation and development of plants under altered gravity condition. The activity of cyclin D, which is responsible for the events of the cell cycle in presynthetic phase can be controlled by the action of endogenous as well as exogenous factors, but clinorotation is one of the factors that influence on genes expression that regulate the cell cycle.These data can be used as a model for further research of cyclin - CDK complex for study of molecular mechanisms regulation of growth and proliferation. In this investigation we tried to summarize and analyze known literature and own data we obtained relatively the main regulators of the cell cycle in altered gravity condition.

  12. Social Regulation of Gene Expression in Threespine Sticklebacks.

    Directory of Open Access Journals (Sweden)

    Anna K Greenwood

    Full Text Available Identifying genes that are differentially expressed in response to social interactions is informative for understanding the molecular basis of social behavior. To address this question, we described changes in gene expression as a result of differences in the extent of social interactions. We housed threespine stickleback (Gasterosteus aculeatus females in either group conditions or individually for one week, then measured levels of gene expression in three brain regions using RNA-sequencing. We found that numerous genes in the hindbrain/cerebellum had altered expression in response to group or individual housing. However, relatively few genes were differentially expressed in either the diencephalon or telencephalon. The list of genes upregulated in fish from social groups included many genes related to neural development and cell adhesion as well as genes with functions in sensory signaling, stress, and social and reproductive behavior. The list of genes expressed at higher levels in individually-housed fish included several genes previously identified as regulated by social interactions in other animals. The identified genes are interesting targets for future research on the molecular mechanisms of normal social interactions.

  13. Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

    Directory of Open Access Journals (Sweden)

    Benjamin Mayne

    2016-10-01

    Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.

  14. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  15. A stochastic approach to multi-gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2005-01-01

    In the last years, tens of thousands gene expression profiles for cells of several organisms have been monitored. Gene expression is a complex transcriptional process where mRNA molecules are translated into proteins, which control most of the cell functions. In this process, the correlation among genes is crucial to determine the specific functions of genes. Here, we propose a novel multi-dimensional stochastic approach to deal with the gene correlation phenomena. Interestingly, our stochastic framework suggests that the study of the gene correlation requires only one theoretical assumption-Markov property-and the experimental transition probability, which characterizes the gene correlation system. Finally, a gene expression experiment is proposed for future applications of the model

  16. Gene expression differences between Noccaea caerulescens ecotypes help to identify candidate genes for metal phytoremediation.

    Science.gov (United States)

    Halimaa, Pauliina; Lin, Ya-Fen; Ahonen, Viivi H; Blande, Daniel; Clemens, Stephan; Gyenesei, Attila; Häikiö, Elina; Kärenlampi, Sirpa O; Laiho, Asta; Aarts, Mark G M; Pursiheimo, Juha-Pekka; Schat, Henk; Schmidt, Holger; Tuomainen, Marjo H; Tervahauta, Arja I

    2014-03-18

    Populations of Noccaea caerulescens show tremendous differences in their capacity to hyperaccumulate and hypertolerate metals. To explore the differences that could contribute to these traits, we undertook SOLiD high-throughput sequencing of the root transcriptomes of three phenotypically well-characterized N. caerulescens accessions, i.e., Ganges, La Calamine, and Monte Prinzera. Genes with possible contribution to zinc, cadmium, and nickel hyperaccumulation and hypertolerance were predicted. The most significant differences between the accessions were related to metal ion (di-, trivalent inorganic cation) transmembrane transporter activity, iron and calcium ion binding, (inorganic) anion transmembrane transporter activity, and antioxidant activity. Analysis of correlation between the expression profile of each gene and the metal-related characteristics of the accessions disclosed both previously characterized (HMA4, HMA3) and new candidate genes (e.g., for nickel IRT1, ZIP10, and PDF2.3) as possible contributors to the hyperaccumulation/tolerance phenotype. A number of unknown Noccaea-specific transcripts also showed correlation with Zn(2+), Cd(2+), or Ni(2+) hyperaccumulation/tolerance. This study shows that N. caerulescens populations have evolved great diversity in the expression of metal-related genes, facilitating adaptation to various metalliferous soils. The information will be helpful in the development of improved plants for metal phytoremediation.

  17. Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

    Directory of Open Access Journals (Sweden)

    Lucie Kosinová

    Full Text Available The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3 in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information

  18. Modeling insertional mutagenesis using gene length and expression in murine embryonic stem cells.

    Directory of Open Access Journals (Sweden)

    Alex S Nord

    2007-07-01

    Full Text Available High-throughput mutagenesis of the mammalian genome is a powerful means to facilitate analysis of gene function. Gene trapping in embryonic stem cells (ESCs is the most widely used form of insertional mutagenesis in mammals. However, the rules governing its efficiency are not fully understood, and the effects of vector design on the likelihood of gene-trapping events have not been tested on a genome-wide scale.In this study, we used public gene-trap data to model gene-trap likelihood. Using the association of gene length and gene expression with gene-trap likelihood, we constructed spline-based regression models that characterize which genes are susceptible and which genes are resistant to gene-trapping techniques. We report results for three classes of gene-trap vectors, showing that both length and expression are significant determinants of trap likelihood for all vectors. Using our models, we also quantitatively identified hotspots of gene-trap activity, which represent loci where the high likelihood of vector insertion is controlled by factors other than length and expression. These formalized statistical models describe a high proportion of the variance in the likelihood of a gene being trapped by expression-dependent vectors and a lower, but still significant, proportion of the variance for vectors that are predicted to be independent of endogenous gene expression.The findings of significant expression and length effects reported here further the understanding of the determinants of vector insertion. Results from this analysis can be applied to help identify other important determinants of this important biological phenomenon and could assist planning of large-scale mutagenesis efforts.

  19. Transcriptomic epidemiology of smoking: the effect of smoking on gene expression in lymphocytes

    Directory of Open Access Journals (Sweden)

    Almasy Laura

    2010-07-01

    Full Text Available Abstract Background This investigation offers insights into system-wide pathological processes induced in response to cigarette smoke exposure by determining its influences at the gene expression level. Methods We obtained genome-wide quantitative transcriptional profiles from 1,240 individuals from the San Antonio Family Heart Study, including 297 current smokers. Using lymphocyte samples, we identified 20,413 transcripts with significantly detectable expression levels, including both known and predicted genes. Correlation between smoking and gene expression levels was determined using a regression model that allows for residual genetic effects. Results With a conservative false-discovery rate of 5% we identified 323 unique genes (342 transcripts whose expression levels were significantly correlated with smoking behavior. These genes showed significant over-representation within a range of functional categories that correspond well with known smoking-related pathologies, including immune response, cell death, cancer, natural killer cell signaling and xenobiotic metabolism. Conclusions Our results indicate that not only individual genes but entire networks of gene interaction are influenced by cigarette smoking. This is the largest in vivo transcriptomic epidemiological study of smoking to date and reveals the significant and comprehensive influence of cigarette smoke, as an environmental variable, on the expression of genes. The central importance of this manuscript is to provide a summary of the relationships between gene expression and smoking in this exceptionally large cross-sectional data set.

  20. Identification and validation of suitable endogenous reference genes for gene expression studies in human peripheral blood

    Directory of Open Access Journals (Sweden)

    Turner Renee J

    2009-08-01

    Full Text Available Abstract Background Gene expression studies require appropriate normalization methods. One such method uses stably expressed reference genes. Since suitable reference genes appear to be unique for each tissue, we have identified an optimal set of the most stably expressed genes in human blood that can be used for normalization. Methods Whole-genome Affymetrix Human 2.0 Plus arrays were examined from 526 samples of males and females ages 2 to 78, including control subjects and patients with Tourette syndrome, stroke, migraine, muscular dystrophy, and autism. The top 100 most stably expressed genes with a broad range of expression levels were identified. To validate the best candidate genes, we performed quantitative RT-PCR on a subset of 10 genes (TRAP1, DECR1, FPGS, FARP1, MAPRE2, PEX16, GINS2, CRY2, CSNK1G2 and A4GALT, 4 commonly employed reference genes (GAPDH, ACTB, B2M and HMBS and PPIB, previously reported to be stably expressed in blood. Expression stability and ranking analysis were performed using GeNorm and NormFinder algorithms. Results Reference genes were ranked based on their expression stability and the minimum number of genes needed for nomalization as calculated using GeNorm showed that the fewest, most stably expressed genes needed for acurate normalization in RNA expression studies of human whole blood is a combination of TRAP1, FPGS, DECR1 and PPIB. We confirmed the ranking of the best candidate control genes by using an alternative algorithm (NormFinder. Conclusion The reference genes identified in this study are stably expressed in whole blood of humans of both genders with multiple disease conditions and ages 2 to 78. Importantly, they also have different functions within cells and thus should be expressed independently of each other. These genes should be useful as normalization genes for microarray and RT-PCR whole blood studies of human physiology, metabolism and disease.

  1. Characterization of differentially expressed genes using high-dimensional co-expression networks

    DEFF Research Database (Denmark)

    Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

    2010-01-01

    We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...

  2. Gene Expression Measurement Module (GEMM) - a fully automated, miniaturized instrument for measuring gene expression in space

    Science.gov (United States)

    Karouia, Fathi; Ricco, Antonio; Pohorille, Andrew; Peyvan, Kianoosh

    2012-07-01

    The capability to measure gene expression on board spacecrafts opens the doors to a large number of experiments on the influence of space environment on biological systems that will profoundly impact our ability to conduct safe and effective space travel, and might also shed light on terrestrial physiology or biological function and human disease and aging processes. Measurements of gene expression will help us to understand adaptation of terrestrial life to conditions beyond the planet of origin, identify deleterious effects of the space environment on a wide range of organisms from microbes to humans, develop effective countermeasures against these effects, determine metabolic basis of microbial pathogenicity and drug resistance, test our ability to sustain and grow in space organisms that can be used for life support and in situ resource utilization during long-duration space exploration, and monitor both the spacecraft environment and crew health. These and other applications hold significant potential for discoveries in space biology, biotechnology and medicine. Accordingly, supported by funding from the NASA Astrobiology Science and Technology Instrument Development Program, we are developing a fully automated, miniaturized, integrated fluidic system for small spacecraft capable of in-situ measuring microbial expression of thousands of genes from multiple samples. The instrument will be capable of (1) lysing bacterial cell walls, (2) extracting and purifying RNA released from cells, (3) hybridizing it on a microarray and (4) providing electrochemical readout, all in a microfluidics cartridge. The prototype under development is suitable for deployment on nanosatellite platforms developed by the NASA Small Spacecraft Office. The first target application is to cultivate and measure gene expression of the photosynthetic bacterium Synechococcus elongatus, i.e. a cyanobacterium known to exhibit remarkable metabolic diversity and resilience to adverse conditions

  3. Gene Expression Analysis to Assess the Relevance of Rodent Models to Human Lung Injury.

    Science.gov (United States)

    Sweeney, Timothy E; Lofgren, Shane; Khatri, Purvesh; Rogers, Angela J

    2017-08-01

    The relevance of animal models to human diseases is an area of intense scientific debate. The degree to which mouse models of lung injury recapitulate human lung injury has never been assessed. Integrating data from both human and animal expression studies allows for increased statistical power and identification of conserved differential gene expression across organisms and conditions. We sought comprehensive integration of gene expression data in experimental acute lung injury (ALI) in rodents compared with humans. We performed two separate gene expression multicohort analyses to determine differential gene expression in experimental animal and human lung injury. We used correlational and pathway analyses combined with external in vitro gene expression data to identify both potential drivers of underlying inflammation and therapeutic drug candidates. We identified 21 animal lung tissue datasets and three human lung injury bronchoalveolar lavage datasets. We show that the metasignatures of animal and human experimental ALI are significantly correlated despite these widely varying experimental conditions. The gene expression changes among mice and rats across diverse injury models (ozone, ventilator-induced lung injury, LPS) are significantly correlated with human models of lung injury (Pearson r = 0.33-0.45, P human lung injury. Predicted therapeutic targets, peptide ligand signatures, and pathway analyses are also all highly overlapping. Gene expression changes are similar in animal and human experimental ALI, and provide several physiologic and therapeutic insights to the disease.

  4. Differentially expressed genes in iron-induced prion protein conversion

    International Nuclear Information System (INIS)

    Kim, Minsun; Kim, Eun-hee; Choi, Bo-Ran; Woo, Hee-Jong

    2016-01-01

    The conversion of the cellular prion protein (PrP C ) to the protease-resistant isoform is the key event in chronic neurodegenerative diseases, including transmissible spongiform encephalopathies (TSEs). Increased iron in prion-related disease has been observed due to the prion protein-ferritin complex. Additionally, the accumulation and conversion of recombinant PrP (rPrP) is specifically derived from Fe(III) but not Fe(II). Fe(III)-mediated PK-resistant PrP (PrP res ) conversion occurs within a complex cellular environment rather than via direct contact between rPrP and Fe(III). In this study, differentially expressed genes correlated with prion degeneration by Fe(III) were identified using Affymetrix microarrays. Following Fe(III) treatment, 97 genes were differentially expressed, including 85 upregulated genes and 12 downregulated genes (≥1.5-fold change in expression). However, Fe(II) treatment produced moderate alterations in gene expression without inducing dramatic alterations in gene expression profiles. Moreover, functional grouping of identified genes indicated that the differentially regulated genes were highly associated with cell growth, cell maintenance, and intra- and extracellular transport. These findings showed that Fe(III) may influence the expression of genes involved in PrP folding by redox mechanisms. The identification of genes with altered expression patterns in neural cells may provide insights into PrP conversion mechanisms during the development and progression of prion-related diseases. - Highlights: • Differential genes correlated with prion degeneration by Fe(III) were identified. • Genes were identified in cell proliferation and intra- and extracellular transport. • In PrP degeneration, redox related genes were suggested. • Cbr2, Rsad2, Slc40a1, Amph and Mvd were expressed significantly.

  5. Gene expression profile data for mouse facial development

    Directory of Open Access Journals (Sweden)

    Sonia M. Leach

    2017-08-01

    Full Text Available This article contains data related to the research articles "Spatial and Temporal Analysis of Gene Expression during Growth and Fusion of the Mouse Facial Prominences" (Feng et al., 2009 [1] and “Systems Biology of facial development: contributions of ectoderm and mesenchyme” (Hooper et al., 2017 In press [2]. Embryonic mammalian craniofacial development is a complex process involving the growth, morphogenesis, and fusion of distinct facial prominences into a functional whole. Aberrant gene regulation during this process can lead to severe craniofacial birth defects, including orofacial clefting. As a means to understand the genes involved in facial development, we had previously dissected the embryonic mouse face into distinct prominences: the mandibular, maxillary or nasal between E10.5 and E12.5. The prominences were then processed intact, or separated into ectoderm and mesenchyme layers, prior analysis of RNA expression using microarrays (Feng et al., 2009, Hooper et al., 2017 in press [1,2]. Here, individual gene expression profiles have been built from these datasets that illustrate the timing of gene expression in whole prominences or in the separated tissue layers. The data profiles are presented as an indexed and clickable list of the genes each linked to a graphical image of that gene׳s expression profile in the ectoderm, mesenchyme, or intact prominence. These data files will enable investigators to obtain a rapid assessment of the relative expression level of any gene on the array with respect to time, tissue, prominence, and expression trajectory.

  6. Stably Expressed Genes Involved in Basic Cellular Functions.

    Directory of Open Access Journals (Sweden)

    Kejian Wang

    Full Text Available Stably Expressed Genes (SEGs whose expression varies within a narrow range may be involved in core cellular processes necessary for basic functions. To identify such genes, we re-analyzed existing RNA-Seq gene expression profiles across 11 organs at 4 developmental stages (from immature to old age in both sexes of F344 rats (n = 4/group; 320 samples. Expression changes (calculated as the maximum expression / minimum expression for each gene of >19000 genes across organs, ages, and sexes ranged from 2.35 to >109-fold, with a median of 165-fold. The expression of 278 SEGs was found to vary ≤4-fold and these genes were significantly involved in protein catabolism (proteasome and ubiquitination, RNA transport, protein processing, and the spliceosome. Such stability of expression was further validated in human samples where the expression variability of the homologous human SEGs was significantly lower than that of other genes in the human genome. It was also found that the homologous human SEGs were generally less subject to non-synonymous mutation than other genes, as would be expected of stably expressed genes. We also found that knockout of SEG homologs in mouse models was more likely to cause complete preweaning lethality than non-SEG homologs, corroborating the fundamental roles played by SEGs in biological development. Such stably expressed genes and pathways across life-stages suggest that tight control of these processes is important in basic cellular functions and that perturbation by endogenous (e.g., genetics or exogenous agents (e.g., drugs, environmental factors may cause serious adverse effects.

  7. ANALYSES ON DIFFERENTIALLY EXPRESSED GENES ASSOCIATED WITH HUMAN BREAST CANCER

    Institute of Scientific and Technical Information of China (English)

    MENG Xu-li; DING Xiao-wen; XU Xiao-hong

    2006-01-01

    Objective: To investigate the molecular etiology of breast cancer by way of studying the differential expression and initial function of the related genes in the occurrence and development of breast cancer. Methods: Two hundred and eighty-eight human tumor related genes were chosen for preparation of the oligochips probe. mRNA was extracted from 16 breast cancer tissues and the corresponding normal breast tissues, and cDNA probe was prepared through reverse-transcription and hybridized with the gene chip. A laser focused fluorescent scanner was used to scan the chip. The different gene expressions were thereafter automatically compared and analyzed between the two sample groups. Cy3/Cy5>3.5 meant significant up-regulation. Cy3/Cy5<0.25 meant significant down-regulation. Results: The comparison between the breast cancer tissues and their corresponding normal tissues showed that 84 genes had differential expression in the Chip. Among the differently expressed genes, there were 4 genes with significant down-regulation and 6 with significant up-regulation. Compared with normal breast tissues, differentially expressed genes did partially exist in the breast cancer tissues. Conclusion: Changes in multi-gene expression regulations take place during the occurrence and development of breast cancer; and the research on related genes can help understanding the mechanism of tumor occurrence.

  8. Regulation of mitochondrial gene expression, the epigenetic enigma

    NARCIS (Netherlands)

    Mposhi, Archibold; van der Wijst, Monique G. P.; Faber, Klaas Nico; Rots, Marianne G.

    2017-01-01

    Epigenetics provides an important layer of information on top of the DNA sequence and is essential for establishing gene expression profiles. Extensive studies have shown that nuclear DNA methylation and histone modifications influence nuclear gene expression. However, it remains unclear whether

  9. Expression of KLK2 gene in prostate cancer

    Directory of Open Access Journals (Sweden)

    Sajad Shafai

    2018-01-01

    Conclusion: The expression of KLK2 gene in people with prostate cancer is the higher than the healthy person; finally, according to the results, it could be mentioned that the KLK2 gene considered as a useful factor in prostate cancer, whose expression is associated with progression and development of the prostate cancer.

  10. Comparative genomics of the relationship between gene structure and expression

    NARCIS (Netherlands)

    Ren, X.

    2006-01-01

    The relationship between the structure of genes and their expression is a relatively new aspect of genome organization and regulation. With more genome sequences and expression data becoming available, bioinformatics approaches can help the further elucidation of the relationships between gene

  11. The gene expressions of DNA methylation/demethylation enzymes ...

    African Journals Online (AJOL)

    user

    2011-01-31

    Jan 31, 2011 ... A decrease in mRNA levels for cytochrome c oxidase (COX) subunits was observed in skeletal muscle of hypothyroid rats. However, the precise expression mechanisms of the related genes in hypothyroid state still remain unclear. This study investigated gene expressions of DNA methyltransferases.

  12. Genome polymorphism markers and stress genes expression for ...

    African Journals Online (AJOL)

    SAM

    2014-06-11

    Jun 11, 2014 ... RNA extraction and purification for SOD and PAL gene expression. Fresh leaf tissues (100 mg), from ... Data analysis. Gelquant program for quantification of protein, DNA and RNA gel. (version 1.8.2) was used for .... by reprogramming the expression of endogenous genes. Higher level of these antioxidant ...

  13. Genome organization and expression of the rat ACBP gene family

    DEFF Research Database (Denmark)

    Mandrup, S; Andreasen, P H; Knudsen, J

    1993-01-01

    pool former. We have molecularly cloned and characterized the rat ACBP gene family which comprises one expressed and four processed pseudogenes. One of these was shown to exist in two allelic forms. A comprehensive computer-aided analysis of the promoter region of the expressed ACBP gene revealed...

  14. Effects of heat stress on gene expression in eggplant ( Solanum ...

    African Journals Online (AJOL)

    In order to identify differentially expressed genes involved in heat shock response, cDNA amplified fragment length polymorphism (cDNA-AFLP) and quantitative real-time polymerase chain reaction (QPCR) were used to study gene expression of eggplant seedlings subjected to 0, 6 and 12 h at 43°C. A total of 53 of over ...

  15. RNA preparation and characterization for gene expression studies

    DEFF Research Database (Denmark)

    Stangegaard, Michael

    2009-01-01

    Much information can be obtained from knowledge of the relative expression level of each gene in the transcriptome. With the current advances in technology as little as a single cell is required as starting material for gene expression experiments. The mRNA from a single cell may be linearly...

  16. The gene expressions of DNA methylation/demethylation enzymes ...

    African Journals Online (AJOL)

    A decrease in mRNA levels for cytochrome c oxidase (COX) subunits was observed in skeletal muscle of hypothyroid rats. However, the precise expression mechanisms of the related genes in hypothyroid state still remain unclear. This study investigated gene expressions of DNA methyltransferases (Dnmts), DNA ...

  17. Microarray analysis of the gene expression profile in triethylene ...

    African Journals Online (AJOL)

    Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  18. Elucidating gene function and function evolution through comparison of co-expression networks in plants

    Directory of Open Access Journals (Sweden)

    Marek eMutwil

    2014-08-01

    Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.

  19. Fungal and plant gene expression in arbuscular mycorrhizal symbiosis.

    Science.gov (United States)

    Balestrini, Raffaella; Lanfranco, Luisa

    2006-11-01

    Arbuscular mycorrhizas (AMs) are a unique example of symbiosis between two eukaryotes, soil fungi and plants. This association induces important physiological changes in each partner that lead to reciprocal benefits, mainly in nutrient supply. The symbiosis results from modifications in plant and fungal cell organization caused by specific changes in gene expression. Recently, much effort has gone into studying these gene expression patterns to identify a wider spectrum of genes involved. We aim in this review to describe AM symbiosis in terms of current knowledge on plant and fungal gene expression profiles.

  20. Expression and clinical significance of Pax6 gene in retinoblastoma

    Directory of Open Access Journals (Sweden)

    Hai-Dong Huang

    2013-07-01

    Full Text Available AIM: To discuss the expression and clinical significance of Pax6 gene in retinoblastoma(Rb. METHODS: Totally 15 cases of fresh Rb organizations were selected as observation group and 15 normal retinal organizations as control group. Western-Blot and reverse transcriptase polymerase chain reaction(RT-PCRmethods were used to detect Pax6 protein and Pax6 mRNA expressions of the normal retina organizations and Rb organizations. At the same time, Western Blot method was used to detect the Pax6 gene downstream MATH5 and BRN3b differentiation gene protein level expression. After the comparison between two groups, the expression and clinical significance of Pax6 gene in Rb were discussed. RESULTS: In the observation group, average value of mRNA expression of Pax6 gene was 0.99±0.03; average value of Pax6 gene protein expression was 2.07±0.15; average value of BRN3b protein expression was 0.195±0.016; average value of MATH5 protein expression was 0.190±0.031. They were significantly higher than the control group, and the differences were statistically significant(PCONCLUSION: Abnormal expression of Pax6 gene is likely to accelerate the occurrence of Rb.

  1. Gene expression in cerebral ischemia: a new approach for neuroprotection.

    Science.gov (United States)

    Millán, Mónica; Arenillas, Juan

    2006-01-01

    Cerebral ischemia is one of the strongest stimuli for gene induction in the brain. Hundreds of genes have been found to be induced by brain ischemia. Many genes are involved in neurodestructive functions such as excitotoxicity, inflammatory response and neuronal apoptosis. However, cerebral ischemia is also a powerful reformatting and reprogramming stimulus for the brain through neuroprotective gene expression. Several genes may participate in both cellular responses. Thus, isolation of candidate genes for neuroprotection strategies and interpretation of expression changes have been proven difficult. Nevertheless, many studies are being carried out to improve the knowledge of the gene activation and protein expression following ischemic stroke, as well as in the development of new therapies that modify biochemical, molecular and genetic changes underlying cerebral ischemia. Owing to the complexity of the process involving numerous critical genes expressed differentially in time, space and concentration, ongoing therapeutic efforts should be based on multiple interventions at different levels. By modification of the acute gene expression induced by ischemia or the apoptotic gene program, gene therapy is a promising treatment but is still in a very experimental phase. Some hurdles will have to be overcome before these therapies can be introduced into human clinical stroke trials. Copyright 2006 S. Karger AG, Basel.

  2. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-01-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  3. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-05-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  4. Gene expression profiling of placentas affected by pre-eclampsia

    DEFF Research Database (Denmark)

    Hoegh, Anne Mette; Borup, Rehannah; Nielsen, Finn Cilius

    2010-01-01

    Several studies point to the placenta as the primary cause of pre-eclampsia. Our objective was to identify placental genes that may contribute to the development of pre-eclampsia. RNA was purified from tissue biopsies from eleven pre-eclamptic placentas and eighteen normal controls. Messenger RNA...... expression from pooled samples was analysed by microarrays. Verification of the expression of selected genes was performed using real-time PCR. A surprisingly low number of genes (21 out of 15,000) were identified as differentially expressed. Among these were genes not previously associated with pre-eclampsia...... as bradykinin B1 receptor and a 14-3-3 protein, but also genes that have already been connected with pre-eclampsia, for example, inhibin beta A subunit and leptin. A low number of genes were repeatedly identified as differentially expressed, because they may represent the endpoint of a cascade of events...

  5. Predicting Expressive Dynamics in Piano Performances using Neural Networks

    NARCIS (Netherlands)

    van Herwaarden, Sam; Grachten, Maarten; de Haas, W. Bas

    2014-01-01

    This paper presents a model for predicting expressive accentuation in piano performances with neural networks. Using Restricted Boltzmann Machines (RBMs), features are learned from performance data, after which these features are used to predict performed loudness. During feature learning, data

  6. Isolation and characterization of LHY homolog gene expressed in ...

    African Journals Online (AJOL)

    STORAGESEVER

    2008-05-02

    May 2, 2008 ... responsible in negative feedback loop reaction of central oscillator in plant circadian clock system. The level of gene expression was found to be high four hours after dawn in flowering shoots and flower. This paper reported the isolation and characterization of the gene. Key words: LHY gene, circadian ...

  7. Gene mining a marama bean expressed sequence tags (ESTs ...

    African Journals Online (AJOL)

    The authors reported the identification of genes associated with embryonic development and microsatellite sequences. The future direction will entail characterization of these genes using gene over-expression and mutant assays. Key words: Namibia, simple sequence repeats (SSR), data mining, homology searches, ...

  8. Expression profiles of genes involved in tanshinone biosynthesis of ...

    Indian Academy of Sciences (India)

    Expression profiles of genes involved in tanshinone biosynthesis of two. Salvia miltiorrhiza genotypes with different tanshinone contents. Zhenqiao Song, Jianhua Wang and Xingfeng Li. J. Genet. 95, 433–439. Table 1. S. miltiorrhiza genes and primer pairs used for qRT-PCR. Gene. GenBank accession. Primer name.

  9. Differentially expressed genes in the midgut of Silkworm infected ...

    African Journals Online (AJOL)

    In this report, we employed suppression subtractive hybridization to compare differentially expressed genes in the midguts of CPV-infected and normal silkworm larvae. 36 genes and 20 novel ESTs were obtained from 2 reciprocal subtractive libraries. Three up-regulated genes (ferritin, rpL11 and alkaline nuclease) and 3 ...

  10. Cloning an expressed gene shared by the human sex chromosomes

    International Nuclear Information System (INIS)

    Darling, S.M.; Banting, G.S.; Pym, B.; Wolfe, J.; Goodfellow, P.N.

    1986-01-01

    The existence of genes shared by mammalian sex chromosomes has been predicted on both evolutionary and functional grounds. However, the only experimental evidence for such genes in humans is the cell-surface antigen encoded by loci on the X and Y chromosomes (MIC2X and MIC2Y, respectively), which is recognized by the monoclonal antibody 12E7. Using the bacteriophage λgt11 expression system in Escherichia coli and immunoscreening techniques, the authors have isolated a cDNA clone whose primary product is recognized by 12E7. Southern blot analysis using somatic cell hybrids containing only the human X or Y chromosomes shows that the sequences reacting with the cDNA clone are localized to the sex chromosomes. In addition, the clone hybridizes to DNAs isolated from mouse cells that have been transfected with human DNA and selected for 12E7 expression on the fluorescence-activated cell sorter. The authors conclude that the cDNA clone encodes the 12E7 antigen, which is the primary product of the MIC2 loci. The clone was used to explore sequence homology between MIC2X and MIC2Y; these loci are closely related, if not identical

  11. Adaptive differences in gene expression in European flounder ( Platichthys flesus )

    DEFF Research Database (Denmark)

    Larsen, Peter Foged; Eg Nielsen, Einar; Williams, T.D.

    2007-01-01

    levels of neutral genetic divergence, a high number of genes were significantly differentially expressed between North Sea and Baltic Sea flounders maintained in a long-term reciprocal transplantation experiment mimicking natural salinities. Several of the differentially regulated genes could be directly...... linked to fitness traits. These findings demonstrate that flounders, despite little neutral genetic divergence between populations, are differently adapted to local environmental conditions and imply that adaptation in gene expression could be common in other marine organisms with similar low levels...

  12. Gene Expression and the Diversity of Identified Neurons

    OpenAIRE

    Buck, L.; Stein, R.; Palazzolo, M.; Anderson, D. J.; Axel, R.

    1983-01-01

    Nervous systems consist of diverse populations of neurons that are anatomically and functionally distinct. The diversity of neurons and the precision with which they are interconnected suggest that specific genes or sets of genes are activated in some neurons but not expressed in others. Experimentally, this problem may be considered at two levels. First, what is the total number of genes expressed in the brain, and how are they distributed among the different populations of neurons? Second, ...

  13. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  14. Rethinking cell-cycle-dependent gene expression in Schizosaccharomyces pombe.

    Science.gov (United States)

    Cooper, Stephen

    2017-11-01

    Three studies of gene expression during the division cycle of Schizosaccharomyces pombe led to the proposal that a large number of genes are expressed at particular times during the S. pombe cell cycle. Yet only a small fraction of genes proposed to be expressed in a cell-cycle-dependent manner are reproducible in all three published studies. In addition to reproducibility problems, questions about expression amplitudes, cell-cycle timing of expression, synchronization artifacts, and the problem with methods for synchronizing cells must be considered. These problems and complications prompt the idea that caution should be used before accepting the conclusion that there are a large number of genes expressed in a cell-cycle-dependent manner in S. pombe.

  15. Automated Protocol for Large-Scale Modeling of Gene Expression Data.

    Science.gov (United States)

    Hall, Michelle Lynn; Calkins, David; Sherman, Woody

    2016-11-28

    With the continued rise of phenotypic- and genotypic-based screening projects, computational methods to analyze, process, and ultimately make predictions in this field take on growing importance. Here we show how automated machine learning workflows can produce models that are predictive of differential gene expression as a function of a compound structure using data from A673 cells as a proof of principle. In particular, we present predictive models with an average accuracy of greater than 70% across a highly diverse ∼1000 gene expression profile. In contrast to the usual in silico design paradigm, where one interrogates a particular target-based response, this work opens the opportunity for virtual screening and lead optimization for desired multitarget gene expression profiles.

  16. Validation of reference genes for quantifying changes in gene expression in virus-infected tobacco.

    Science.gov (United States)

    Baek, Eseul; Yoon, Ju-Yeon; Palukaitis, Peter

    2017-10-01

    To facilitate quantification of gene expression changes in virus-infected tobacco plants, eight housekeeping genes were evaluated for their stability of expression during infection by one of three systemically-infecting viruses (cucumber mosaic virus, potato virus X, potato virus Y) or a hypersensitive-response-inducing virus (tobacco mosaic virus; TMV) limited to the inoculated leaf. Five reference-gene validation programs were used to establish the order of the most stable genes for the systemically-infecting viruses as ribosomal protein L25 > β-Tubulin > Actin, and the least stable genes Ubiquitin-conjugating enzyme (UCE) genes were EF1α > Cysteine protease > Actin, and the least stable genes were GAPDH genes, three defense responsive genes were examined to compare their relative changes in gene expression caused by each virus. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. A Gene Expression Classifier of Node-Positive Colorectal Cancer

    Directory of Open Access Journals (Sweden)

    Paul F. Meeh

    2009-10-01

    Full Text Available We used digital long serial analysis of gene expression to discover gene expression differences between node-negative and node-positive colorectal tumors and developed a multigene classifier able to discriminate between these two tumor types. We prepared and sequenced long serial analysis of gene expression libraries from one node-negative and one node-positive colorectal tumor, sequenced to a depth of 26,060 unique tags, and identified 262 tags significantly differentially expressed between these two tumors (P < 2 x 10-6. We confirmed the tag-to-gene assignments and differential expression of 31 genes by quantitative real-time polymerase chain reaction, 12 of which were elevated in the node-positive tumor. We analyzed the expression levels of these 12 upregulated genes in a validation panel of 23 additional tumors and developed an optimized seven-gene logistic regression classifier. The classifier discriminated between node-negative and node-positive tumors with 86% sensitivity and 80% specificity. Receiver operating characteristic analysis of the classifier revealed an area under the curve of 0.86. Experimental manipulation of the function of one classification gene, Fibronectin, caused profound effects on invasion and migration of colorectal cancer cells in vitro. These results suggest that the development of node-positive colorectal cancer occurs in part through elevated epithelial FN1 expression and suggest novel strategies for the diagnosis and treatment of advanced disease.

  18. With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

    Science.gov (United States)

    Chapman, Joanne R; Waldenström, Jonas

    2015-01-01

    The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.

  19. Gene expression profiling of resting and activated vascular smooth muscle cells by serial analysis of gene expression and clustering analysis

    NARCIS (Netherlands)

    Beauchamp, Nicholas J.; van Achterberg, Tanja A. E.; Engelse, Marten A.; Pannekoek, Hans; de Vries, Carlie J. M.

    2003-01-01

    Migration and proliferation of vascular smooth muscle cells (SMCs) are key events in atherosclerosis. However, little is known about alterations in gene expression upon transition of the quiescent, contractile SMC to the proliferative SMC. We performed serial analysis of gene expression (SAGE) of

  20. Altered global gene expression profiles in human gastrointestinal epithelial Caco2 cells exposed to nanosilver

    Directory of Open Access Journals (Sweden)

    Saura C. Sahu

    Full Text Available Extensive consumer exposure to food- and cosmetics-related consumer products containing nanosilver is of public safety concern. Therefore, there is a need for suitable in vitro models and sensitive predictive rapid screening methods to assess their toxicity. Toxicogenomic profile showing subtle changes in gene expressions following nanosilver exposure is a sensitive toxicological endpoint for this purpose. We evaluated the Caco2 cells and global gene expression profiles as tools for predictive rapid toxicity screening of nanosilver. We evaluated and compared the gene expression profiles of Caco-2 cells exposed to 20 nm and 50 nm nanosilver at a concentration 2.5 μg/ml. The global gene expression analysis of Caco2 cells exposed to 20 nm nanosilver showed that a total of 93 genes were altered at 4 h exposure, out of which 90 genes were up-regulated and 3 genes were down-regulated. The 24 h exposure of 20 nm silver altered 15 genes in Caco2 cells, out of which 14 were up-regulated and one was down-regulated. The most pronounced changes in gene expression were detected at 4 h. The greater size (50 nm nanosilver at 4 h exposure altered more genes by more different pathways than the smaller (20 nm one. Metallothioneins and heat shock proteins were highly up-regulated as a result of exposure to both the nanosilvers. The cellular pathways affected by the nanosilver exposure is likely to lead to increased toxicity. The results of our study presented here suggest that the toxicogenomic characterization of Caco2 cells is a valuable in vitro tool for assessing toxicity of nanomaterials such as nanosilver. Keywords: Nanosilver, Silver nanoparticles, Nanoparticles, Toxicogenomics, DNA microarray, Global gene expression profiles, Caco2 cells

  1. Clinical value of prognosis gene expression signatures in colorectal cancer: a systematic review.

    Directory of Open Access Journals (Sweden)

    Rebeca Sanz-Pamplona

    Full Text Available INTRODUCTION: The traditional staging system is inadequate to identify those patients with stage II colorectal cancer (CRC at high risk of recurrence or with stage III CRC at low risk. A number of gene expression signatures to predict CRC prognosis have been proposed, but none is routinely used in the clinic. The aim of this work was to assess the prediction ability and potential clinical usefulness of these signatures in a series of independent datasets. METHODS: A literature review identified 31 gene expression signatures that used gene expression data to predict prognosis in CRC tissue. The search was based on the PubMed database and was restricted to papers published from January 2004 to December 2011. Eleven CRC gene expression datasets with outcome information were identified and downloaded from public repositories. Random Forest classifier was used to build predictors from the gene lists. Matthews correlation coefficient was chosen as a measure of classification accuracy and its associated p-value was used to assess association with prognosis. For clinical usefulness evaluation, positive and negative post-tests probabilities were computed in stage II and III samples. RESULTS: Five gene signatures showed significant association with prognosis and provided reasonable prediction accuracy in their own training datasets. Nevertheless, all signatures showed low reproducibility in independent data. Stratified analyses by stage or microsatellite instability status showed significant association but limited discrimination ability, especially in stage II tumors. From a clinical perspective, the most predictive signatures showed a minor but significant improvement over the classical staging system. CONCLUSIONS: The published signatures show low prediction accuracy but moderate clinical usefulness. Although gene expression data may inform prognosis, better strategies for signature validation are needed to encourage their widespread use in the clinic.

  2. EVALUATION OF THE PROGNOSTIC VALUE OF nm23 GENE EXPRESSION IN BREAST CANCER

    Institute of Scientific and Technical Information of China (English)

    刘红; 毛慧生; 傅西林; 方志沂; 冯玉梅; 范宇; 李树玲

    2002-01-01

    Objective: To investigate the expression of nm23 gene and evaluate its prognostic value in breast cancer. Methods: nm23 expressions were detected in 101 breast cancer patients (group 1) by immunohistochemistry. RT-PCR and immunohistochemistry were used to measure expressions of nm23 gene in another 68 patients with breast cancer (group 2). Results: nm23 gene expression in group 1 was inversely associated with distant metastasis and lymph node metastasis (P<0.05). In 44 patients with negative lymph node, 9 cases progressed to distant metastasis, 7 of them (77.8%) showed low expression of nm23 gene (P<0.05). In 57 patients with positive lymph node, 24 our of 29 patients who had no distant metastasis (82.8%) expressed nm23 gene at high level (P<0.05). Meanwhile, there were 6 patients with distant metastasis in the group 2, all of thenm expressed nm23 gene mRNA at low level. Conclusion: The results showed that nm23 gene might play an independent role in predicting prognosis of breast cancer.

  3. The Expression of Genes Encoding Secreted Proteins in Medicago truncatula A17 Inoculated Roots

    Directory of Open Access Journals (Sweden)

    LUCIA KUSUMAWATI

    2013-09-01

    Full Text Available Subtilisin-like serine protease (MtSBT, serine carboxypeptidase (MtSCP, MtN5, non-specific lipid transfer protein (MtnsLTP, early nodulin2-like protein (MtENOD2-like, FAD-binding domain containing protein (MtFAD-BP1, and rhicadhesin receptor protein (MtRHRE1 were among 34 proteins found in the supernatant of M. truncatula 2HA and sickle cell suspension cultures. This study investigated the expression of genes encoding those proteins in roots and developing nodules. Two methods were used: quantitative real time RT-PCR and gene expression analysis (with promoter:GUS fusion in roots. Those proteins are predicted as secreted proteins which is indirectly supported by the findings that promoter:GUS fusions of six of the seven genes encoding secreted proteins were strongly expressed in the vascular bundle of transgenic hairy roots. All six genes have expressed in 14-day old nodule. The expression levels of the selected seven genes were quantified in Sinorhizobium-inoculated and control plants using quantitative real time RT-PCR. In conclusion, among seven genes encoding secreted proteins analyzed, the expression level of only one gene, MtN5, was up-regulated significantly in inoculated root segments compared to controls. The expression of MtSBT1, MtSCP1, MtnsLTP, MtFAD-BP1, MtRHRE1 and MtN5 were higher in root tip than in other tissues examined.

  4. Transcriptome profiling in conifers and the PiceaGenExpress database show patterns of diversification within gene families and interspecific conservation in vascular gene expression

    Directory of Open Access Journals (Sweden)

    Raherison Elie

    2012-08-01

    Full Text Available Abstract Background Conifers have very large genomes (13 to 30 G