gene expression prediction: Topics by WorldWideScience.org

Sample records for gene expression prediction

Clinicopathologic and gene expression parameters predict liver cancer prognosis

International Nuclear Information System (INIS)

Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

2011-01-01

The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome
A deep auto-encoder model for gene expression prediction.

Science.gov (United States)

Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

2017-11-17

Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.
Embryo quality predictive models based on cumulus cells gene expression

Directory of Open Access Journals (Sweden)

Devjak R

2016-06-01

Full Text Available Since the introduction of in vitro fertilization (IVF in clinical practice of infertility treatment, the indicators for high quality embryos were investigated. Cumulus cells (CC have a specific gene expression profile according to the developmental potential of the oocyte they are surrounding, and therefore, specific gene expression could be used as a biomarker. The aim of our study was to combine more than one biomarker to observe improvement in prediction value of embryo development. In this study, 58 CC samples from 17 IVF patients were analyzed. This study was approved by the Republic of Slovenia National Medical Ethics Committee. Gene expression analysis [quantitative real time polymerase chain reaction (qPCR] for five genes, analyzed according to embryo quality level, was performed. Two prediction models were tested for embryo quality prediction: a binary logistic and a decision tree model. As the main outcome, gene expression levels for five genes were taken and the area under the curve (AUC for two prediction models were calculated. Among tested genes, AMHR2 and LIF showed significant expression difference between high quality and low quality embryos. These two genes were used for the construction of two prediction models: the binary logistic model yielded an AUC of 0.72 ± 0.08 and the decision tree model yielded an AUC of 0.73 ± 0.03. Two different prediction models yielded similar predictive power to differentiate high and low quality embryos. In terms of eventual clinical decision making, the decision tree model resulted in easy-to-interpret rules that are highly applicable in clinical practice.
Prediction of highly expressed genes in microbes based on chromatin accessibility

Directory of Open Access Journals (Sweden)

Ussery David W

2007-02-01

Full Text Available Abstract Background It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed genes in microbial genomes. We compare these predictions with those based on codon adaptation index (CAI values, and also with experimental data for 6 different microbial genomes, with a particular interest in experimental data from Escherichia coli. Moreover, position preference is examined further in 328 sequenced microbial genomes. Results We find that absolute gene expression levels are correlated with the position preference in many microbial genomes. It is postulated that in these regions, the DNA may be more accessible to the transcriptional machinery. Moreover, ribosomal proteins and ribosomal RNA are encoded by DNA having significantly lower position preference values than other genes in fast-replicating microbes. Conclusion This insight into DNA structure-dependent gene expression in microbes may be exploited for predicting the expression of non-translated genes such as non-coding RNAs that may not be predicted by any of the conventional codon usage bias approaches.
Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles

Directory of Open Access Journals (Sweden)

Liying Yang

2016-01-01

Full Text Available Background. Precisely predicting cancer is crucial for cancer treatment. Gene expression profiles make it possible to analyze patterns between genes and cancers on the genome-wide scale. Gene expression data analysis, however, is confronted with enormous challenges for its characteristics, such as high dimensionality, small sample size, and low Signal-to-Noise Ratio. Results. This paper proposes a method, termed RS_SVM, to predict gene expression profiles via aggregating SVM trained on random subspaces. After choosing gene features through statistical analysis, RS_SVM randomly selects feature subsets to yield random subspaces and training SVM classifiers accordingly and then aggregates SVM classifiers to capture the advantage of ensemble learning. Experiments on eight real gene expression datasets are performed to validate the RS_SVM method. Experimental results show that RS_SVM achieved better classification accuracy and generalization performance in contrast with single SVM, K-nearest neighbor, decision tree, Bagging, AdaBoost, and the state-of-the-art methods. Experiments also explored the effect of subspace size on prediction performance. Conclusions. The proposed RS_SVM method yielded superior performance in analyzing gene expression profiles, which demonstrates that RS_SVM provides a good channel for such biological data.
Blood Gene Expression Predicts Bronchiolitis Obliterans Syndrome

Directory of Open Access Journals (Sweden)

Richard Danger

2018-01-01

Full Text Available Bronchiolitis obliterans syndrome (BOS, the main manifestation of chronic lung allograft dysfunction, leads to poor long-term survival after lung transplantation. Identifying predictors of BOS is essential to prevent the progression of dysfunction before irreversible damage occurs. By using a large set of 107 samples from lung recipients, we performed microarray gene expression profiling of whole blood to identify early biomarkers of BOS, including samples from 49 patients with stable function for at least 3 years, 32 samples collected at least 6 months before BOS diagnosis (prediction group, and 26 samples at or after BOS diagnosis (diagnosis group. An independent set from 25 lung recipients was used for validation by quantitative PCR (13 stables, 11 in the prediction group, and 8 in the diagnosis group. We identified 50 transcripts differentially expressed between stable and BOS recipients. Three genes, namely POU class 2 associating factor 1 (POU2AF1, T-cell leukemia/lymphoma protein 1A (TCL1A, and B cell lymphocyte kinase, were validated as predictive biomarkers of BOS more than 6 months before diagnosis, with areas under the curve of 0.83, 0.77, and 0.78 respectively. These genes allow stratification based on BOS risk (log-rank test p < 0.01 and are not associated with time posttransplantation. This is the first published large-scale gene expression analysis of blood after lung transplantation. The three-gene blood signature could provide clinicians with new tools to improve follow-up and adapt treatment of patients likely to develop BOS.
Prediction of highly expressed genes in microbes based on chromatin accessibility

DEFF Research Database (Denmark)

Willenbrock, Hanni; Ussery, David

2007-01-01

BACKGROUND: It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed...
Predicting cellular growth from gene expression signatures.

Directory of Open Access Journals (Sweden)

Edoardo M Airoldi

2009-01-01

Full Text Available Maintaining balanced growth in a changing environment is a fundamental systems-level challenge for cellular physiology, particularly in microorganisms. While the complete set of regulatory and functional pathways supporting growth and cellular proliferation are not yet known, portions of them are well understood. In particular, cellular proliferation is governed by mechanisms that are highly conserved from unicellular to multicellular organisms, and the disruption of these processes in metazoans is a major factor in the development of cancer. In this paper, we develop statistical methodology to identify quantitative aspects of the regulatory mechanisms underlying cellular proliferation in Saccharomyces cerevisiae. We find that the expression levels of a small set of genes can be exploited to predict the instantaneous growth rate of any cellular culture with high accuracy. The predictions obtained in this fashion are robust to changing biological conditions, experimental methods, and technological platforms. The proposed model is also effective in predicting growth rates for the related yeast Saccharomyces bayanus and the highly diverged yeast Schizosaccharomyces pombe, suggesting that the underlying regulatory signature is conserved across a wide range of unicellular evolution. We investigate the biological significance of the gene expression signature that the predictions are based upon from multiple perspectives: by perturbing the regulatory network through the Ras/PKA pathway, observing strong upregulation of growth rate even in the absence of appropriate nutrients, and discovering putative transcription factor binding sites, observing enrichment in growth-correlated genes. More broadly, the proposed methodology enables biological insights about growth at an instantaneous time scale, inaccessible by direct experimental methods. Data and tools enabling others to apply our methods are available at http://function.princeton.edu/growthrate.
Gene expression variation to predict 10-year survival in lymph-node-negative breast cancer

International Nuclear Information System (INIS)

Karlsson, Elin; Delle, Ulla; Danielsson, Anna; Olsson, Björn; Abel, Frida; Karlsson, Per; Helou, Khalil

2008-01-01

It is of great significance to find better markers to correctly distinguish between high-risk and low-risk breast cancer patients since the majority of breast cancer cases are at present being overtreated. 46 tumours from node-negative breast cancer patients were studied with gene expression microarrays. A t-test was carried out in order to find a set of genes where the expression might predict clinical outcome. Two classifiers were used for evaluation of the gene lists, a correlation-based classifier and a Voting Features Interval (VFI) classifier. We then evaluated the predictive accuracy of this expression signature on tumour sets from two similar studies on lymph-node negative patients. They had both developed gene expression signatures superior to current methods in classifying node-negative breast tumours. These two signatures were also tested on our material. A list of 51 genes whose expression profiles could predict clinical outcome with high accuracy in our material (96% or 89% accuracy in cross-validation, depending on type of classifier) was developed. When tested on two independent data sets, the expression signature based on the 51 identified genes had good predictive qualities in one of the data sets (74% accuracy), whereas their predictive value on the other data set were poor, presumably due to the fact that only 23 of the 51 genes were found in that material. We also found that previously developed expression signatures could predict clinical outcome well to moderately well in our material (72% and 61%, respectively). The list of 51 genes derived in this study might have potential for clinical utility as a prognostic gene set, and may include candidate genes of potential relevance for clinical outcome in breast cancer. According to the predictions by this expression signature, 30 of the 46 patients may have benefited from different adjuvant treatment than they recieved. The research on these tumours was approved by the Medical Faculty Research
Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans.

Science.gov (United States)

Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B

2017-11-24

Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression-on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.
A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

Science.gov (United States)

Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

2015-01-01

Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.
Adipose gene expression prior to weight loss can differentiate and weakly predict dietary responders.

Directory of Open Access Journals (Sweden)

David M Mutch

Full Text Available BACKGROUND: The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. METHODOLOGY/PRINCIPAL FINDINGS: The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8-12 kgs weight loss could always be differentiated from non-responders (<4 kgs weight loss. We also assessed whether this differentiation was sufficient for prediction. Using a bottom-up (i.e. black-box approach, standard class prediction algorithms were able to predict dietary responders with up to 61.1%+/-8.1% accuracy. Using a top-down approach (i.e. using differentially expressed genes to build a classifier improved prediction accuracy to 80.9%+/-2.2%. CONCLUSION: Adipose gene expression profiling prior to the consumption of a low-fat diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition.
Predictive modelling of gene expression from transcriptional regulatory elements.

Science.gov (United States)

Budden, David M; Hurley, Daniel G; Crampin, Edmund J

2015-07-01

Predictive modelling of gene expression provides a powerful framework for exploring the regulatory logic underpinning transcriptional regulation. Recent studies have demonstrated the utility of such models in identifying dysregulation of gene and miRNA expression associated with abnormal patterns of transcription factor (TF) binding or nucleosomal histone modifications (HMs). Despite the growing popularity of such approaches, a comparative review of the various modelling algorithms and feature extraction methods is lacking. We define and compare three methods of quantifying pairwise gene-TF/HM interactions and discuss their suitability for integrating the heterogeneous chromatin immunoprecipitation (ChIP)-seq binding patterns exhibited by TFs and HMs. We then construct log-linear and ϵ-support vector regression models from various mouse embryonic stem cell (mESC) and human lymphoblastoid (GM12878) data sets, considering both ChIP-seq- and position weight matrix- (PWM)-derived in silico TF-binding. The two algorithms are evaluated both in terms of their modelling prediction accuracy and ability to identify the established regulatory roles of individual TFs and HMs. Our results demonstrate that TF-binding and HMs are highly predictive of gene expression as measured by mRNA transcript abundance, irrespective of algorithm or cell type selection and considering both ChIP-seq and PWM-derived TF-binding. As we encourage other researchers to explore and develop these results, our framework is implemented using open-source software and made available as a preconfigured bootable virtual environment. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

Directory of Open Access Journals (Sweden)

Teng Shaolei

2013-01-01

Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.
Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans

Directory of Open Access Journals (Sweden)

Assaf Gottlieb

2017-11-01

Full Text Available Abstract Background Genome-wide association studies are useful for discovering genotype–phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into “gene level” effects. Methods Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression—on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. Results We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Conclusions Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort
Cell-specific prediction and application of drug-induced gene expression profiles.

Science.gov (United States)

Hodos, Rachel; Zhang, Ping; Lee, Hao-Chih; Duan, Qiaonan; Wang, Zichen; Clark, Neil R; Ma'ayan, Avi; Wang, Fei; Kidd, Brian; Hu, Jianying; Sontag, David; Dudley, Joel

2018-01-01

Gene expression profiling of in vitro drug perturbations is useful for many biomedical discovery applications including drug repurposing and elucidation of drug mechanisms. However, limited data availability across cell types has hindered our capacity to leverage or explore the cell-specificity of these perturbations. While recent efforts have generated a large number of drug perturbation profiles across a variety of human cell types, many gaps remain in this combinatorial drug-cell space. Hence, we asked whether it is possible to fill these gaps by predicting cell-specific drug perturbation profiles using available expression data from related conditions--i.e. from other drugs and cell types. We developed a computational framework that first arranges existing profiles into a three-dimensional array (or tensor) indexed by drugs, genes, and cell types, and then uses either local (nearest-neighbors) or global (tensor completion) information to predict unmeasured profiles. We evaluate prediction accuracy using a variety of metrics, and find that the two methods have complementary performance, each superior in different regions in the drug-cell space. Predictions achieve correlations of 0.68 with true values, and maintain accurate differentially expressed genes (AUC 0.81). Finally, we demonstrate that the predicted profiles add value for making downstream associations with drug targets and therapeutic classes.
Intra- and interspecies gene expression models for predicting drug response in canine osteosarcoma.

Science.gov (United States)

Fowles, Jared S; Brown, Kristen C; Hess, Ann M; Duval, Dawn L; Gustafson, Daniel L

2016-02-19

Genomics-based predictors of drug response have the potential to improve outcomes associated with cancer therapy. Osteosarcoma (OS), the most common primary bone cancer in dogs, is commonly treated with adjuvant doxorubicin or carboplatin following amputation of the affected limb. We evaluated the use of gene-expression based models built in an intra- or interspecies manner to predict chemosensitivity and treatment outcome in canine OS. Models were built and evaluated using microarray gene expression and drug sensitivity data from human and canine cancer cell lines, and canine OS tumor datasets. The "COXEN" method was utilized to filter gene signatures between human and dog datasets based on strong co-expression patterns. Models were built using linear discriminant analysis via the misclassification penalized posterior algorithm. The best doxorubicin model involved genes identified in human lines that were co-expressed and trained on canine OS tumor data, which accurately predicted clinical outcome in 73 % of dogs (p = 0.0262, binomial). The best carboplatin model utilized canine lines for gene identification and model training, with canine OS tumor data for co-expression. Dogs whose treatment matched our predictions had significantly better clinical outcomes than those that didn't (p = 0.0006, Log Rank), and this predictor significantly associated with longer disease free intervals in a Cox multivariate analysis (hazard ratio = 0.3102, p = 0.0124). Our data show that intra- and interspecies gene expression models can successfully predict response in canine OS, which may improve outcome in dogs and serve as pre-clinical validation for similar methods in human cancer research.
Prediction of metastasis from low-malignant breast cancer by gene expression profiling

DEFF Research Database (Denmark)

Thomassen, Mads; Tan, Qihua; Eiriksdottir, Freyja

2007-01-01

examined in these studies is the low-risk patients for whom outcome is very difficult to predict with currently used methods. These patients do not receive adjuvant treatment according to the guidelines of the Danish Breast Cancer Cooperative Group (DBCG). In this study, 26 tumors from low-risk patients...... with different characteristics and risk, expression-based classification specifically developed in low-risk patients have higher predictive power in this group.......Promising results for prediction of outcome in breast cancer have been obtained by genome wide gene expression profiling. Some studies have suggested that an extensive overtreatment of breast cancer patients might be reduced by risk assessment with gene expression profiling. A patient group hardly...
Predictive value of MSH2 gene expression in colorectal cancer treated with capecitabine

DEFF Research Database (Denmark)

Jensen, Lars H; Danenberg, Kathleen D; Danenberg, Peter V

2007-01-01

was associated with a hazard ratio of 0.5 (95% confidence interval, 0.23-1.11; P = 0.083) in survival analysis. CONCLUSION: The higher gene expression of MSH2 in responders and the trend for predicting overall survival indicates a predictive value of this marker in the treatment of advanced CRC with capecitabine.......PURPOSE: The objective of the present study was to evaluate the gene expression of the DNA mismatch repair gene MSH2 as a predictive marker in advanced colorectal cancer (CRC) treated with first-line capecitabine. PATIENTS AND METHODS: Microdissection of paraffin-embedded tumor tissue, RNA...
Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.

Directory of Open Access Journals (Sweden)

Mika Gustafsson

Full Text Available BACKGROUND: To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem, however, is how to assess and discriminate among the various algorithms which have been developed for this purpose. Therefore, the DREAM project invited the year 2008 to a challenge for predicting gene expression values, and here we present the algorithm with best performance. METHODOLOGY/PRINCIPAL FINDINGS: We develop an algorithm by exploring various regression schemes with different model selection procedures. It turns out that the most effective scheme is based on least squares, with a penalty term of a recently developed form called the "elastic net". Key components in the algorithm are the integration of expression data from other experimental conditions than those presented for the challenge and the utilization of transcription factor binding data for guiding the inference process towards known interactions. Of importance is also a cross-validation procedure where each form of external data is used only to the extent it increases the expected performance. CONCLUSIONS/SIGNIFICANCE: Our algorithm proves both the possibility to extract information from large-scale expression data concerning prediction of gene levels, as well as the benefits of integrating different data sources for improving the inference. We believe the former is an important message to those still hesitating on the possibilities for computational approaches, while the latter is part of an important way forward for the future development of the field of computational systems biology.

Predicting spatial and temporal gene expression using an integrative model of transcription factor occupancy and chromatin state.

Directory of Open Access Journals (Sweden)

Bartek Wilczynski

Full Text Available Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhancer elements converge to regulate a gene's expression remains elusive. This is in large part due to the number of different biological processes involved in mediating regulation as well as limited availability of experimental measurements for many of them. Here, we used a Bayesian approach to model diverse experimental regulatory data, leading to accurate predictions of both spatial and temporal aspects of gene expression. We integrated whole-embryo information on transcription factor recruitment to multiple cis-regulatory modules, insulator binding and histone modification status in the vicinity of individual gene loci, at a genome-wide scale during Drosophila development. The model uses Bayesian networks to represent the relation between transcription factor occupancy and enhancer activity in specific tissues and stages. All parameters are optimized in an Expectation Maximization procedure providing a model capable of predicting tissue- and stage-specific activity of new, previously unassayed genes. Performing the optimization with subsets of input data demonstrated that neither enhancer occupancy nor chromatin state alone can explain all gene expression patterns, but taken together allow for accurate predictions of spatio-temporal activity. Model predictions were validated using the expression patterns of more than 600 genes recently made available by the BDGP consortium, demonstrating an average 15-fold enrichment of genes expressed in the predicted tissue over a naïve model. We further validated the model by experimentally testing the expression of 20 predicted target genes of unknown expression, resulting in an accuracy of 95% for temporal
Clustering gene expression data based on predicted differential effects of GV interaction.

Science.gov (United States)

Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

2005-02-01

Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.
Using gene co-expression network analysis to predict biomarkers for chronic lymphocytic leukemia

Directory of Open Access Journals (Sweden)

Borlawsky Tara B

2010-10-01

Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH mutational status was found to be associated with patient survival outcome, and biomarkers linked to the IgVH status has been a focus in the CLL prognosis research field. However, biomarkers highly correlated with IgVH mutational status which can accurately predict the survival outcome are yet to be discovered. Results In this paper, we investigate the use of gene co-expression network analysis to identify potential biomarkers for CLL. Specifically we focused on the co-expression network involving ZAP70, a well characterized biomarker for CLL. We selected 23 microarray datasets corresponding to multiple types of cancer from the Gene Expression Omnibus (GEO and used the frequent network mining algorithm CODENSE to identify highly connected gene co-expression networks spanning the entire genome, then evaluated the genes in the co-expression network in which ZAP70 is involved. We then applied a set of feature selection methods to further select genes which are capable of predicting IgVH mutation status from the ZAP70 co-expression network. Conclusions We have identified a set of genes that are potential CLL prognostic biomarkers IL2RB, CD8A, CD247, LAG3 and KLRK1, which can predict CLL patient IgVH mutational status with high accuracies. Their prognostic capabilities were cross-validated by applying these biomarker candidates to classify patients into different outcome groups using a CLL microarray datasets with clinical information.
A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP Inhibitors

Science.gov (United States)

2017-02-01

affecting the function of Fanconi Anemia (FA) genes ( FANCA /B/C/D2/E/F/G/I/J/L/M, PALB2) or DNA damage response genes involved in HR 5 (ATM, ATR...Award Number: W81XWH-10-1-0585 TITLE: A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP Inhibitors...To) 15 July 2010 – 2 Nov.2016 4. TITLE AND SUBTITLE A Gene Expression Profile of BRCAness That Predicts for Responsiveness to Platinum and PARP
Multiple Suboptimal Solutions for Prediction Rules in Gene Expression Data

Directory of Open Access Journals (Sweden)

Osamu Komori

2013-01-01

Full Text Available This paper discusses mathematical and statistical aspects in analysis methods applied to microarray gene expressions. We focus on pattern recognition to extract informative features embedded in the data for prediction of phenotypes. It has been pointed out that there are severely difficult problems due to the unbalance in the number of observed genes compared with the number of observed subjects. We make a reanalysis of microarray gene expression published data to detect many other gene sets with almost the same performance. We conclude in the current stage that it is not possible to extract only informative genes with high performance in the all observed genes. We investigate the reason why this difficulty still exists even though there are actively proposed analysis methods and learning algorithms in statistical machine learning approaches. We focus on the mutual coherence or the absolute value of the Pearson correlations between two genes and describe the distributions of the correlation for the selected set of genes and the total set. We show that the problem of finding informative genes in high dimensional data is ill-posed and that the difficulty is closely related with the mutual coherence.
Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets

Directory of Open Access Journals (Sweden)

Karacali Bilge

2007-10-01

Full Text Available Abstract Background Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a all genes on the microarray platform and b a list of known disease-related genes (a priori selection. We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms. Results Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection, however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across-datasets to samples profiled on the same microarray platform. Conclusion Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine
Prediction of Associations between microRNAs and Gene Expression in Glioma Biology.

Directory of Open Access Journals (Sweden)

Stefan Wuchty

Full Text Available Despite progress in the determination of miR interactions, their regulatory role in cancer is only beginning to be unraveled. Utilizing gene expression data from 27 glioblastoma samples we found that the mere knowledge of physical interactions between specific mRNAs and miRs can be used to determine associated regulatory interactions, allowing us to identify 626 associated interactions, involving 128 miRs that putatively modulate the expression of 246 mRNAs. Experimentally determining the expression of miRs, we found an over-representation of over(under-expressed miRs with various predicted mRNA target sequences. Such significantly associated miRs that putatively bind over-expressed genes strongly tend to have binding sites nearby the 3'UTR of the corresponding mRNAs, suggesting that the presence of the miRs near the translation stop site may be a factor in their regulatory ability. Our analysis predicted a significant association between miR-128 and the protein kinase WEE1, which we subsequently validated experimentally by showing that the over-expression of the naturally under-expressed miR-128 in glioma cells resulted in the inhibition of WEE1 in glioblastoma cells.
EvoCor: a platform for predicting functionally related genes using phylogenetic and expression profiles.

Science.gov (United States)

Dittmar, W James; McIver, Lauren; Michalak, Pawel; Garner, Harold R; Valdez, Gregorio

2014-07-01

The wealth of publicly available gene expression and genomic data provides unique opportunities for computational inference to discover groups of genes that function to control specific cellular processes. Such genes are likely to have co-evolved and be expressed in the same tissues and cells. Unfortunately, the expertise and computational resources required to compare tens of genomes and gene expression data sets make this type of analysis difficult for the average end-user. Here, we describe the implementation of a web server that predicts genes involved in affecting specific cellular processes together with a gene of interest. We termed the server 'EvoCor', to denote that it detects functional relationships among genes through evolutionary analysis and gene expression correlation. This web server integrates profiles of sequence divergence derived by a Hidden Markov Model (HMM) and tissue-wide gene expression patterns to determine putative functional linkages between pairs of genes. This server is easy to use and freely available at http://pilot-hmm.vbi.vt.edu/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

Science.gov (United States)

Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

2012-07-15

Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.
Adipose Gene Expression Prior to Weight Loss Can Differentiate and Weakly Predict Dietary Responders

Science.gov (United States)

Mutch, David M.; Temanni, M. Ramzi; Henegar, Corneliu; Combes, Florence; Pelloux, Véronique; Holst, Claus; Sørensen, Thorkild I. A.; Astrup, Arne; Martinez, J. Alfredo; Saris, Wim H. M.; Viguerie, Nathalie; Langin, Dominique; Zucker, Jean-Daniel; Clément, Karine

2007-01-01

Background The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. Methodology/Principal Findings The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB) trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8–12 kgs weight loss) could always be differentiated from non-responders (diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition. PMID:18094752
Effectiveness of gene expression profiling for response prediction of rectal cancer to preoperative radiotherapy

International Nuclear Information System (INIS)

Ojima, Eiki; Inoue, Yasuhiro; Miki, Chikao; Kusunoki, Masato; Mori, Masaki

2007-01-01

Our aim was to determine whether the expression levels of specific genes could predict clinical radiosensitivity in human colorectal cancer. Radioresistant colorectal cancer cell lines were established by repeated X-ray exposure (total, 100 Gy), and the gene expressions of the parent and radioresistant cell lines were compared in a microarray analysis. To verify the microarray data, we carried out a reverse transcriptase-polymerase chain reaction analysis of identified genes in clinical samples from 30 irradiated rectal cancer patients. A comparison of the intensity data for the parent and three radioresistant cell lines revealed 17 upregulated and 142 downregulated genes in all radioresistant cell lines. Next, we focused on two upregulated genes, PTMA (prothymosin α) and EIF5a2 (eukaryotic translation initiation factor 5A), in the radioresistant cell lines. In clinical samples, the expression of PTMA was significantly higher in the minor effect group than in the major effect group (P=0.004), but there were no significant differences in EIF5a2 expression between the two groups. We identified radiation-related genes in colorectal cancer and demonstrated that PTMA may play an important role in radiosensitivity. Our findings suggest that PTMA may be a novel marker for predicting the effectiveness of radiotherapy in clinical cases. (author)
Gene expression patterns in formalin-fixed, paraffin-embedded core biopsies predict docetaxel chemosensitivity in breast cancer patients.

Science.gov (United States)

Chang, Jenny C; Makris, Andreas; Gutierrez, M Carolina; Hilsenbeck, Susan G; Hackett, James R; Jeong, Jennie; Liu, Mei-Lan; Baker, Joffre; Clark-Langone, Kim; Baehner, Frederick L; Sexton, Krsytal; Mohsin, Syed; Gray, Tara; Alvarez, Laura; Chamness, Gary C; Osborne, C Kent; Shak, Steven

2008-03-01

Previously, we had identified gene expression patterns that predicted response to neoadjuvant docetaxel. Other studies have validated that a high Recurrence Score (RS) by the 21-gene RT-PCR assay is predictive of worse prognosis but better response to chemotherapy. We investigated whether tumor expression of these 21 genes and other candidate genes can predict response to docetaxel. Core biopsies from 97 patients were obtained before treatment with neoadjuvant docetaxel (4 cycles, 100 mg/m2 q3 weeks). Three 10-microm FFPE sections were submitted for quantitative RT-PCR assays of 192 genes that were selected from our previous work and the literature. Of the 97 patients, 81 (84%) had sufficient invasive cancer, 80 (82%) had sufficient RNA for QRTPCR assay, and 72 (74%) had clinical response data. Mean age was 48.5 years, and the median tumor size was 6 cm. Clinical complete responses (CR) were observed in 12 (17%), partial responses in 41 (57%), stable disease in 17 (24%), and progressive disease in 2 patients (3%). A significant relationship (P<0.05) between gene expression and CR was observed for 14 genes, including CYBA. CR was associated with lower expression of the ER gene group and higher expression of the proliferation gene group from the 21 gene assay. Of note, CR was more likely with a high RS (P=0.008). We have established molecular profiles of sensitivity to docetaxel. RT-PCR technology provides a potential platform for a predictive test of docetaxel chemosensitivity using small amounts of routinely processed material.
Testing the predictive value of peripheral gene expression for nonremission following citalopram treatment for major depression.

Science.gov (United States)

Guilloux, Jean-Philippe; Bassi, Sabrina; Ding, Ying; Walsh, Chris; Turecki, Gustavo; Tseng, George; Cyranowski, Jill M; Sibille, Etienne

2015-02-01

Major depressive disorder (MDD) in general, and anxious-depression in particular, are characterized by poor rates of remission with first-line treatments, contributing to the chronic illness burden suffered by many patients. Prospective research is needed to identify the biomarkers predicting nonremission prior to treatment initiation. We collected blood samples from a discovery cohort of 34 adult MDD patients with co-occurring anxiety and 33 matched, nondepressed controls at baseline and after 12 weeks (of citalopram plus psychotherapy treatment for the depressed cohort). Samples were processed on gene arrays and group differences in gene expression were investigated. Exploratory analyses suggest that at pretreatment baseline, nonremitting patients differ from controls with gene function and transcription factor analyses potentially related to elevated inflammation and immune activation. In a second phase, we applied an unbiased machine learning prediction model and corrected for model-selection bias. Results show that baseline gene expression predicted nonremission with 79.4% corrected accuracy with a 13-gene model. The same gene-only model predicted nonremission after 8 weeks of citalopram treatment with 76% corrected accuracy in an independent validation cohort of 63 MDD patients treated with citalopram at another institution. Together, these results demonstrate the potential, but also the limitations, of baseline peripheral blood-based gene expression to predict nonremission after citalopram treatment. These results not only support their use in future prediction tools but also suggest that increased accuracy may be obtained with the inclusion of additional predictors (eg, genetics and clinical scales).
Prediction of lymphatic metastasis based on gene expression profile analysis after brachytherapy for early-stage oral tongue carcinoma

International Nuclear Information System (INIS)

Watanabe, Hiroshi; Mogushi, Kaoru; Miura, Masahiko; Yoshimura, Ryo-ichi; Kurabayashi, Tohru; Shibuya, Hitoshi; Tanaka, Hiroshi; Noda, Shuhei; Iwakawa, Mayumi; Imai, Takashi

2008-01-01

Background and purpose: The management of lymphatic metastasis of early-stage oral tongue carcinoma patients is crucial for its prognosis. The purpose of this study was to evaluate the predictive ability of lymphatic metastasis after brachytherapy (BRT) for early-stage tongue carcinoma based on gene expression profiling. Patients and methods: Pre-therapeutic biopsies from 39 patients with T1 or T2 tongue cancer were analyzed for gene expression signatures using Codelink Uniset Human 20K Bioarray. All patients were treated with low dose-rate BRT for their primary lesions and underwent strict follow-up under a wait-and-see policy for cervical lymphatic metastasis. Candidate genes were selected for predicting lymph-node status in the reference group by the permutation test. Predictive accuracy was further evaluated by the prediction strength (PS) scoring system using an independent validation group. Results: We selected a set of 19 genes whose expression differed significantly between classes with or without lymphatic metastasis in the reference group. The lymph-node status in the validation group was predicted by the PS scoring system with an accuracy of 76%. Conclusions: Gene expression profiling using 19 genes in primary tumor tissues may allow prediction of lymphatic metastasis after BRT for early-stage oral tongue carcinoma
Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

Science.gov (United States)

Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

2009-02-01

Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Expression of estrogen-related gene markers in breast cancer tissue predicts aromatase inhibitor responsiveness.

Directory of Open Access Journals (Sweden)

Irene Moy

Full Text Available Aromatase inhibitors (AIs are the most effective class of drugs in the endocrine treatment of breast cancer, with an approximate 50% treatment response rate. Our objective was to determine whether intratumoral expression levels of estrogen-related genes are predictive of AI responsiveness in postmenopausal women with breast cancer. Primary breast carcinomas were obtained from 112 women who received AI therapy after failing adjuvant tamoxifen therapy and developing recurrent breast cancer. Tumor ERα and PR protein expression were analyzed by immunohistochemistry (IHC. Messenger RNA (mRNA levels of 5 estrogen-related genes-AKR1C3, aromatase, ERα, and 2 estradiol/ERα target genes, BRCA1 and PR-were measured by real-time PCR. Tumor protein and mRNA levels were compared with breast cancer progression rates to determine predictive accuracy. Responsiveness to AI therapy-defined as the combined complete response, partial response, and stable disease rates for at least 6 months-was 51%; rates were 56% in ERα-IHC-positive and 14% in ERα-IHC-negative tumors. Levels of ERα, PR, or BRCA1 mRNA were independently predictive for responsiveness to AI. In cross-validated analyses, a combined measurement of tumor ERα and PR mRNA levels yielded a more superior specificity (36% and identical sensitivity (96% to the current clinical practice (ERα/PR-IHC. In patients with ERα/PR-IHC-negative tumors, analysis of mRNA expression revealed either non-significant trends or statistically significant positive predictive values for AI responsiveness. In conclusion, expression levels of estrogen-related mRNAs are predictive for AI responsiveness in postmenopausal women with breast cancer, and mRNA expression analysis may improve patient selection.
Establishment of a 12-gene expression signature to predict colon cancer prognosis

Directory of Open Access Journals (Sweden)

Dalong Sun

2018-06-01

Full Text Available A robust and accurate gene expression signature is essential to assist oncologists to determine which subset of patients at similar Tumor-Lymph Node-Metastasis (TNM stage has high recurrence risk and could benefit from adjuvant therapies. Here we applied a two-step supervised machine-learning method and established a 12-gene expression signature to precisely predict colon adenocarcinoma (COAD prognosis by using COAD RNA-seq transcriptome data from The Cancer Genome Atlas (TCGA. The predictive performance of the 12-gene signature was validated with two independent gene expression microarray datasets: GSE39582 includes 566 COAD cases for the development of six molecular subtypes with distinct clinical, molecular and survival characteristics; GSE17538 is a dataset containing 232 colon cancer patients for the generation of a metastasis gene expression profile to predict recurrence and death in COAD patients. The signature could effectively separate the poor prognosis patients from good prognosis group (disease specific survival (DSS: Kaplan Meier (KM Log Rank p = 0.0034; overall survival (OS: KM Log Rank p = 0.0336 in GSE17538. For patients with proficient mismatch repair system (pMMR in GSE39582, the signature could also effectively distinguish high risk group from low risk group (OS: KM Log Rank p = 0.005; Relapse free survival (RFS: KM Log Rank p = 0.022. Interestingly, advanced stage patients were significantly enriched in high 12-gene score group (Fisher’s exact test p = 0.0003. After stage stratification, the signature could still distinguish poor prognosis patients in GSE17538 from good prognosis within stage II (Log Rank p = 0.01 and stage II & III (Log Rank p = 0.017 in the outcome of DFS. Within stage III or II/III pMMR patients treated with Adjuvant Chemotherapies (ACT and patients with higher 12-gene score showed poorer prognosis (III, OS: KM Log Rank p = 0.046; III & II, OS: KM Log Rank p = 0.041. Among stage II/III pMMR patients
Neighboring Genes Show Correlated Evolution in Gene Expression

Science.gov (United States)

Ghanbarian, Avazeh T.; Hurst, Laurence D.

2015-01-01

When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543
Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes

Directory of Open Access Journals (Sweden)

Zhang Michael Q

2011-05-01

Full Text Available Abstract Background Gene expression is regulated at both the DNA sequence level and through modification of chromatin. However, the effect of chromatin on tissue/cell-type specific gene regulation (TCSR is largely unknown. In this paper, we present a method to elucidate the relationship between histone modification/variation (HMV and TCSR. Results A classifier for differentiating CD4+ T cell-specific genes from housekeeping genes using HMV data was built. We found HMV in both promoter and gene body regions to be predictive of genes which are targets of TCSR. For example, the histone modification types H3K4me3 and H3K27ac were identified as the most predictive for CpG-related promoters, whereas H3K4me3 and H3K79me3 were the most predictive for nonCpG-related promoters. However, genes targeted by TCSR can be predicted using other type of HMVs as well. Such redundancy implies that multiple type of underlying regulatory elements, such as enhancers or intragenic alternative promoters, which can regulate gene expression in a tissue/cell-type specific fashion, may be marked by the HMVs. Finally, we show that the predictive power of HMV for TCSR is not limited to protein-coding genes in CD4+ T cells, as we successfully predicted TCSR targeted genes in muscle cells, as well as microRNA genes with expression specific to CD4+ T cells, by the same classifier which was trained on HMV data of protein-coding genes in CD4+ T cells. Conclusion We have begun to understand the HMV patterns that guide gene expression in both tissue/cell-type specific and ubiquitous manner.
Predicting survival in patients with metastatic kidney cancer by gene-expression profiling in the primary tumor.

Science.gov (United States)

Vasselli, James R; Shih, Joanna H; Iyengar, Shuba R; Maranchie, Jodi; Riss, Joseph; Worrell, Robert; Torres-Cabala, Carlos; Tabios, Ray; Mariotti, Andra; Stearman, Robert; Merino, Maria; Walther, McClellan M; Simon, Richard; Klausner, Richard D; Linehan, W Marston

2003-06-10

To identify potential molecular determinants of tumor biology and possible clinical outcomes, global gene-expression patterns were analyzed in the primary tumors of patients with metastatic renal cell cancer by using cDNA microarrays. We used grossly dissected tumor masses that included tumor, blood vessels, connective tissue, and infiltrating immune cells to obtain a gene-expression "profile" from each primary tumor. Two patterns of gene expression were found within this uniformly staged patient population, which correlated with a significant difference in overall survival between the two patient groups. Subsets of genes most significantly associated with survival were defined, and vascular cell adhesion molecule-1 (VCAM-1) was the gene most predictive for survival. Therefore, despite the complex biological nature of metastatic cancer, basic clinical behavior as defined by survival may be determined by the gene-expression patterns expressed within the compilation of primary gross tumor cells. We conclude that survival in patients with metastatic renal cell cancer can be correlated with the expression of various genes based solely on the expression profile in the primary kidney tumor.

Expression Pattern Similarities Support the Prediction of Orthologs Retaining Common Functions after Gene Duplication Events1[OPEN

Science.gov (United States)

Haberer, Georg; Panda, Arup; Das Laha, Shayani; Ghosh, Tapas Chandra; Schäffner, Anton R.

2016-01-01

The identification of functionally equivalent, orthologous genes (functional orthologs) across genomes is necessary for accurate transfer of experimental knowledge from well-characterized organisms to others. This frequently relies on automated, coding sequence-based approaches such as OrthoMCL, Inparanoid, and KOG, which usually work well for one-to-one homologous states. However, this strategy does not reliably work for plants due to the occurrence of extensive gene/genome duplication. Frequently, for one query gene, multiple orthologous genes are predicted in the other genome, and it is not clear a priori from sequence comparison and similarity which one preserves the ancestral function. We have studied 11 organ-dependent and stress-induced gene expression patterns of 286 Arabidopsis lyrata duplicated gene groups and compared them with the respective Arabidopsis (Arabidopsis thaliana) genes to predict putative expressologs and nonexpressologs based on gene expression similarity. Promoter sequence divergence as an additional tool to substantiate functional orthology only partially overlapped with expressolog classification. By cloning eight A. lyrata homologs and complementing them in the respective four Arabidopsis loss-of-function mutants, we experimentally proved that predicted expressologs are indeed functional orthologs, while nonexpressologs or nonfunctionalized orthologs are not. Our study demonstrates that even a small set of gene expression data in addition to sequence homologies are instrumental in the assignment of functional orthologs in the presence of multiple orthologs. PMID:27303025
Muscle myeloid type I interferon gene expression may predict therapeutic responses to rituximab in myositis patients.

Science.gov (United States)

Nagaraju, Kanneboyina; Ghimbovschi, Svetlana; Rayavarapu, Sree; Phadke, Aditi; Rider, Lisa G; Hoffman, Eric P; Miller, Frederick W

2016-09-01

To identify muscle gene expression patterns that predict rituximab responses and assess the effects of rituximab on muscle gene expression in PM and DM. In an attempt to understand the molecular mechanism of response and non-response to rituximab therapy, we performed Affymetrix gene expression array analyses on muscle biopsy specimens taken before and after rituximab therapy from eight PM and two DM patients in the Rituximab in Myositis study. We also analysed selected muscle-infiltrating cell phenotypes in these biopsies by immunohistochemical staining. Partek and Ingenuity pathway analyses assessed the gene pathways and networks. Myeloid type I IFN signature genes were expressed at higher levels at baseline in the skeletal muscle of rituximab responders than in non-responders, whereas classic non-myeloid IFN signature genes were expressed at higher levels in non-responders at baseline. Also, rituximab responders have a greater reduction of the myeloid and non-myeloid type I IFN signatures than non-responders. The decrease in the type I IFN signature following administration of rituximab may be associated with the decreases in muscle-infiltrating CD19(+) B cells and CD68(+) macrophages in responders. Our findings suggest that high levels of myeloid type I IFN gene expression in skeletal muscle predict responses to rituximab in PM/DM and that rituximab responders also have a greater decrease in the expression of these genes. These data add further evidence to recent studies defining the type I IFN signature as both a predictor of therapeutic responses and a biomarker of myositis disease activity. Published by Oxford University Press on behalf British Society for Rheumatology 2016. This work is written by US Government employees and is in the public domain in the US.
Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

Directory of Open Access Journals (Sweden)

Pingzhao Hu

2006-01-01

biological pathways. In particular, we observed that by integrating information from the insulin signalling pathway into our prediction model, we achieved better prediction of prostate cancer. Conclusions: Our data integration methodology provides an efficient way to identify biologically sound and statistically significant pathways from gene expression data. The significant gene expression phenotypes identified in our study have the potential to characterize complex genetic alterations in prostate cancer.
A statistical method for predicting splice variants between two groups of samples using GeneChip® expression array data

Directory of Open Access Journals (Sweden)

Olson James M

2006-04-01

Full Text Available Abstract Background Alternative splicing of pre-messenger RNA results in RNA variants with combinations of selected exons. It is one of the essential biological functions and regulatory components in higher eukaryotic cells. Some of these variants are detectable with the Affymetrix GeneChip® that uses multiple oligonucleotide probes (i.e. probe set, since the target sequences for the multiple probes are adjacent within each gene. Hybridization intensity from a probe correlates with abundance of the corresponding transcript. Although the multiple-probe feature in the current GeneChip® was designed to assess expression values of individual genes, it also measures transcriptional abundance for a sub-region of a gene sequence. This additional capacity motivated us to develop a method to predict alternative splicing, taking advance of extensive repositories of GeneChip® gene expression array data. Results We developed a two-step approach to predict alternative splicing from GeneChip® data. First, we clustered the probes from a probe set into pseudo-exons based on similarity of probe intensities and physical adjacency. A pseudo-exon is defined as a sequence in the gene within which multiple probes have comparable probe intensity values. Second, for each pseudo-exon, we assessed the statistical significance of the difference in probe intensity between two groups of samples. Differentially expressed pseudo-exons are predicted to be alternatively spliced. We applied our method to empirical data generated from GeneChip® Hu6800 arrays, which include 7129 probe sets and twenty probes per probe set. The dataset consists of sixty-nine medulloblastoma (27 metastatic and 42 non-metastatic samples and four cerebellum samples as normal controls. We predicted that 577 genes would be alternatively spliced when we compared normal cerebellum samples to medulloblastomas, and predicted that thirteen genes would be alternatively spliced when we compared metastatic
Gene expression markers in circulating tumor cells may predict bone metastasis and response to hormonal treatment in breast cancer.

Science.gov (United States)

Wang, Haiying; Molina, Julian; Jiang, John; Ferber, Matthew; Pruthi, Sandhya; Jatkoe, Timothy; Derecho, Carlo; Rajpurohit, Yashoda; Zheng, Jian; Wang, Yixin

2013-11-01

Circulating tumor cells (CTCs) have recently attracted attention due to their potential as prognostic and predictive markers for the clinical management of metastatic breast cancer patients. The isolation of CTCs from patients may enable the molecular characterization of these cells, which may help establish a minimally invasive assay for the prediction of metastasis and further optimization of treatment. Molecular markers of proven clinical value may therefore be useful in predicting disease aggressiveness and response to treatment. In our earlier study, we identified a gene signature in breast cancer that appears to be significantly associated with bone metastasis. Among the genes that constitute this signature, trefoil factor 1 (TFF1) was identified as the most differentially expressed gene associated with bone metastasis. In this study, we investigated 25 candidate gene markers in the CTCs of metastatic breast cancer patients with different metastatic sites. The panel of the 25 markers was investigated in 80 baseline samples (first blood draw of CTCs) and 30 follow-up samples. In addition, 40 healthy blood donors (HBDs) were analyzed as controls. The assay was performed using quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) with RNA extracted from CTCs captured by the CellSearch system. Our study indicated that 12 of the genes were uniquely expressed in CTCs and 10 were highly expressed in the CTCs obtained from patients compared to those obtained from HBDs. Among these genes, the expression of keratin 19 was highly correlated with the CTC count. The TFF1 expression in CTCs was a strong predictor of bone metastasis and the patients with a high expression of estrogen receptor β in CTCs exhibited a better response to hormonal treatment. Molecular characterization of these genes in CTCs may provide a better understanding of the mechanism underlying tumor metastasis and identify gene markers in CTCs for predicting disease progression and
A Computational Gene Expression Score for Predicting Immune Injury in Renal Allografts.

Directory of Open Access Journals (Sweden)

Tara K Sigdel

Full Text Available Whole genome microarray meta-analyses of 1030 kidney, heart, lung and liver allograft biopsies identified a common immune response module (CRM of 11 genes that define acute rejection (AR across different engrafted tissues. We evaluated if the CRM genes can provide a molecular microscope to quantify graft injury in acute rejection (AR and predict risk of progressive interstitial fibrosis and tubular atrophy (IFTA in histologically normal kidney biopsies.Computational modeling was done on tissue qPCR based gene expression measurements for the 11 CRM genes in 146 independent renal allografts from 122 unique patients with AR (n = 54 and no-AR (n = 92. 24 demographically matched patients with no-AR had 6 and 24 month paired protocol biopsies; all had histologically normal 6 month biopsies, and 12 had evidence of progressive IFTA (pIFTA on their 24 month biopsies. Results were correlated with demographic, clinical and pathology variables.The 11 gene qPCR based tissue CRM score (tCRM was significantly increased in AR (5.68 ± 0.91 when compared to STA (1.29 ± 0.28; p < 0.001 and pIFTA (7.94 ± 2.278 versus 2.28 ± 0.66; p = 0.04, with greatest significance for CXCL9 and CXCL10 in AR (p <0.001 and CD6 (p<0.01, CXCL9 (p<0.05, and LCK (p<0.01 in pIFTA. tCRM was a significant independent correlate of biopsy confirmed AR (p < 0.001; AUC of 0.900; 95% CI = 0.705-903. Gene expression modeling of 6 month biopsies across 7/11 genes (CD6, INPP5D, ISG20, NKG7, PSMB9, RUNX3, and TAP1 significantly (p = 0.037 predicted the development of pIFTA at 24 months.Genome-wide tissue gene expression data mining has supported the development of a tCRM-qPCR based assay for evaluating graft immune inflammation. The tCRM score quantifies injury in AR and stratifies patients at increased risk of future pIFTA prior to any perturbation of graft function or histology.
Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

Science.gov (United States)

Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

2009-01-01

Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary
The functional landscape of mouse gene expression

Directory of Open Access Journals (Sweden)

Zhang Wen

2004-12-01

Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.
Response-predictive gene expression profiling of glioma progenitor cells in vitro.

Directory of Open Access Journals (Sweden)

Sylvia Moeckel

Full Text Available High-grade gliomas are amongst the most deadly human tumors. Treatment results are disappointing. Still, in several trials around 20% of patients respond to therapy. To date, diagnostic strategies to identify patients that will profit from a specific therapy do not exist.In this study, we used serum-free short-term treated in vitro cell cultures to predict treatment response in vitro. This approach allowed us (a to enrich specimens for brain tumor initiating cells and (b to confront cells with a therapeutic agent before expression profiling.As a proof of principle we analyzed gene expression in 18 short-term serum-free cultures of high-grade gliomas enhanced for brain tumor initiating cells (BTIC before and after in vitro treatment with the tyrosine kinase inhibitor Sunitinib. Profiles from treated progenitor cells allowed to predict therapy-induced impairment of proliferation in vitro.For the tyrosine kinase inhibitor Sunitinib used in this dataset, the approach revealed additional predictive information in comparison to the evaluation of classical signaling analysis.
Detecting microRNA activity from gene expression data

LENUS (Irish Health Repository)

Madden, Stephen F

2010-05-18

Abstract Background MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. Results Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. Conclusions We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.
Detecting microRNA activity from gene expression data.

LENUS (Irish Health Repository)

Madden, Stephen F

2010-01-01

BACKGROUND: MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. RESULTS: Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. CONCLUSIONS: We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.
Exploring gene expression signatures for predicting disease free survival after resection of colorectal cancer liver metastases.

Directory of Open Access Journals (Sweden)

Nikol Snoeren

Full Text Available BACKGROUND AND OBJECTIVES: This study was designed to identify and validate gene signatures that can predict disease free survival (DFS in patients undergoing a radical resection for their colorectal liver metastases (CRLM. METHODS: Tumor gene expression profiles were collected from 119 patients undergoing surgery for their CRLM in the Paul Brousse Hospital (France and the University Medical Center Utrecht (The Netherlands. Patients were divided into high and low risk groups. A randomly selected training set was used to find predictive gene signatures. The ability of these gene signatures to predict DFS was tested in an independent validation set comprising the remaining patients. Furthermore, 5 known clinical risk scores were tested in our complete patient cohort. RESULT: No gene signature was found that significantly predicted DFS in the validation set. In contrast, three out of five clinical risk scores were able to predict DFS in our patient cohort. CONCLUSIONS: No gene signature was found that could predict DFS in patients undergoing CRLM resection. Three out of five clinical risk scores were able to predict DFS in our patient cohort. These results emphasize the need for validating risk scores in independent patient groups and suggest improved designs for future studies.
An algorithm to discover gene signatures with predictive potential

Directory of Open Access Journals (Sweden)

Hallett Robin M

2010-09-01

Full Text Available Abstract Background The advent of global gene expression profiling has generated unprecedented insight into our molecular understanding of cancer, including breast cancer. For example, human breast cancer patients display significant diversity in terms of their survival, recurrence, metastasis as well as response to treatment. These patient outcomes can be predicted by the transcriptional programs of their individual breast tumors. Predictive gene signatures allow us to correctly classify human breast tumors into various risk groups as well as to more accurately target therapy to ensure more durable cancer treatment. Results Here we present a novel algorithm to generate gene signatures with predictive potential. The method first classifies the expression intensity for each gene as determined by global gene expression profiling as low, average or high. The matrix containing the classified data for each gene is then used to score the expression of each gene based its individual ability to predict the patient characteristic of interest. Finally, all examined genes are ranked based on their predictive ability and the most highly ranked genes are included in the master gene signature, which is then ready for use as a predictor. This method was used to accurately predict the survival outcomes in a cohort of human breast cancer patients. Conclusions We confirmed the capacity of our algorithm to generate gene signatures with bona fide predictive ability. The simplicity of our algorithm will enable biological researchers to quickly generate valuable gene signatures without specialized software or extensive bioinformatics training.
Can survival prediction be improved by merging gene expression data sets?

Directory of Open Access Journals (Sweden)

Haleh Yasrebi

Full Text Available BACKGROUND: High-throughput gene expression profiling technologies generating a wealth of data, are increasingly used for characterization of tumor biopsies for clinical trials. By applying machine learning algorithms to such clinically documented data sets, one hopes to improve tumor diagnosis, prognosis, as well as prediction of treatment response. However, the limited number of patients enrolled in a single trial study limits the power of machine learning approaches due to over-fitting. One could partially overcome this limitation by merging data from different studies. Nevertheless, such data sets differ from each other with regard to technical biases, patient selection criteria and follow-up treatment. It is therefore not clear at all whether the advantage of increased sample size outweighs the disadvantage of higher heterogeneity of merged data sets. Here, we present a systematic study to answer this question specifically for breast cancer data sets. We use survival prediction based on Cox regression as an assay to measure the added value of merged data sets. RESULTS: Using time-dependent Receiver Operating Characteristic-Area Under the Curve (ROC-AUC and hazard ratio as performance measures, we see in overall no significant improvement or deterioration of survival prediction with merged data sets as compared to individual data sets. This apparently was due to the fact that a few genes with strong prognostic power were not available on all microarray platforms and thus were not retained in the merged data sets. Surprisingly, we found that the overall best performance was achieved with a single-gene predictor consisting of CYB5D1. CONCLUSIONS: Merging did not deteriorate performance on average despite (a The diversity of microarray platforms used. (b The heterogeneity of patients cohorts. (c The heterogeneity of breast cancer disease. (d Substantial variation of time to death or relapse. (e The reduced number of genes in the merged data
Computational Prediction of MicroRNAs from Toxoplasma gondii Potentially Regulating the Hosts’ Gene Expression

Directory of Open Access Journals (Sweden)

Müşerref Duygu Saçar

2014-10-01

Full Text Available MicroRNAs (miRNAs were discovered two decades ago, yet there is still a great need for further studies elucidating their genesis and targeting in different phyla. Since experimental discovery and validation of miRNAs is difficult, computational predictions are indispensable and today most computational approaches employ machine learning. Toxoplasma gondii, a parasite residing within the cells of its hosts like human, uses miRNAs for its post-transcriptional gene regulation. It may also regulate its hosts’ gene expression, which has been shown in brain cancer. Since previous studies have shown that overexpressed miRNAs within the host are causal for disease onset, we hypothesized that T. gondii could export miRNAs into its host cell. We computationally predicted all hairpins from the genome of T. gondii and used mouse and human models to filter possible candidates. These were then further compared to known miRNAs in human and rodents and their expression was examined for T. gondii grown in mouse and human hosts, respectively. We found that among the millions of potential hairpins in T. gondii, only a few thousand pass filtering using a human or mouse model and that even fewer of those are expressed. Since they are expressed and differentially expressed in rodents and human, we suggest that there is a chance that T. gondii may export miRNAs into its hosts for direct regulation.
Gene expression profiles in paraffin-embedded core biopsy tissue predict response to chemotherapy in women with locally advanced breast cancer.

Science.gov (United States)

Gianni, Luca; Zambetti, Milvia; Clark, Kim; Baker, Joffre; Cronin, Maureen; Wu, Jenny; Mariani, Gabriella; Rodriguez, Jaime; Carcangiu, Marialuisa; Watson, Drew; Valagussa, Pinuccia; Rouzier, Roman; Symmans, W Fraser; Ross, Jeffrey S; Hortobagyi, Gabriel N; Pusztai, Lajos; Shak, Steven

2005-10-10

We sought to identify gene expression markers that predict the likelihood of chemotherapy response. We also tested whether chemotherapy response is correlated with the 21-gene Recurrence Score assay that quantifies recurrence risk. Patients with locally advanced breast cancer received neoadjuvant paclitaxel and doxorubicin. RNA was extracted from the pretreatment formalin-fixed paraffin-embedded core biopsies. The expression of 384 genes was quantified using reverse transcriptase polymerase chain reaction and correlated with pathologic complete response (pCR). The performance of genes predicting for pCR was tested in patients from an independent neoadjuvant study where gene expression was obtained using DNA microarrays. Of 89 assessable patients (mean age, 49.9 years; mean tumor size, 6.4 cm), 11 (12%) had a pCR. Eighty-six genes correlated with pCR (unadjusted P < .05); pCR was more likely with higher expression of proliferation-related genes and immune-related genes, and with lower expression of estrogen receptor (ER) -related genes. In 82 independent patients treated with neoadjuvant paclitaxel and doxorubicin, DNA microarray data were available for 79 of the 86 genes. In univariate analysis, 24 genes correlated with pCR with P < .05 (false discovery, four genes) and 32 genes showed correlation with P < .1 (false discovery, eight genes). The Recurrence Score was positively associated with the likelihood of pCR (P = .005), suggesting that the patients who are at greatest recurrence risk are more likely to have chemotherapy benefit. Quantitative expression of ER-related genes, proliferation genes, and immune-related genes are strong predictors of pCR in women with locally advanced breast cancer receiving neoadjuvant anthracyclines and paclitaxel.
An enhanced deterministic K-Means clustering algorithm for cancer subtype prediction from gene expression data.

Science.gov (United States)

Nidheesh, N; Abdul Nazeer, K A; Ameer, P M

2017-12-01

Clustering algorithms with steps involving randomness usually give different results on different executions for the same dataset. This non-deterministic nature of algorithms such as the K-Means clustering algorithm limits their applicability in areas such as cancer subtype prediction using gene expression data. It is hard to sensibly compare the results of such algorithms with those of other algorithms. The non-deterministic nature of K-Means is due to its random selection of data points as initial centroids. We propose an improved, density based version of K-Means, which involves a novel and systematic method for selecting initial centroids. The key idea of the algorithm is to select data points which belong to dense regions and which are adequately separated in feature space as the initial centroids. We compared the proposed algorithm to a set of eleven widely used single clustering algorithms and a prominent ensemble clustering algorithm which is being used for cancer data classification, based on the performances on a set of datasets comprising ten cancer gene expression datasets. The proposed algorithm has shown better overall performance than the others. There is a pressing need in the Biomedical domain for simple, easy-to-use and more accurate Machine Learning tools for cancer subtype prediction. The proposed algorithm is simple, easy-to-use and gives stable results. Moreover, it provides comparatively better predictions of cancer subtypes from gene expression data. Copyright © 2017 Elsevier Ltd. All rights reserved.
Prediction of the contact sensitizing potential of chemicals using analysis of gene expression changes in human THP-1 monocytes.

Science.gov (United States)

Arkusz, Joanna; Stępnik, Maciej; Sobala, Wojciech; Dastych, Jarosław

2010-11-10

The aim of this study was to find differentially regulated genes in THP-1 monocytic cells exposed to sensitizers and nonsensitizers and to investigate if such genes could be reliable markers for an in vitro predictive method for the identification of skin sensitizing chemicals. Changes in expression of 35 genes in the THP-1 cell line following treatment with chemicals of different sensitizing potential (from nonsensitizers to extreme sensitizers) were assessed using real-time PCR. Verification of 13 candidate genes by testing a large number of chemicals (an additional 22 sensitizers and 8 nonsensitizers) revealed that prediction of contact sensitization potential was possible based on evaluation of changes in three genes: IL8, HMOX1 and PAIMP1. In total, changes in expression of these genes allowed correct detection of sensitization potential of 21 out of 27 (78%) test sensitizers. The gene expression levels inside potency groups varied and did not allow estimation of sensitization potency of test chemicals. Results of this study indicate that evaluation of changes in expression of proposed biomarkers in THP-1 cells could be a valuable model for preliminary screening of chemicals to discriminate an appreciable majority of sensitizers from nonsensitizers. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Comprehensive analysis of gene expression patterns of hedgehog-related genes

Directory of Open Access Journals (Sweden)

Baillie David

2006-10-01

Full Text Available Abstract Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the
Comparative analysis of codon usage patterns and identification of predicted highly expressed genes in five Salmonella genomes

Directory of Open Access Journals (Sweden)

Mondal U

2008-01-01

Full Text Available Purpose: To anlyse codon usage patterns of five complete genomes of Salmonella , predict highly expressed genes, examine horizontally transferred pathogenicity-related genes to detect their presence in the strains, and scrutinize the nature of highly expressed genes to infer upon their lifestyle. Methods: Protein coding genes, ribosomal protein genes, and pathogenicity-related genes were analysed with Codon W and CAI (codon adaptation index Calculator. Results: Translational efficiency plays a role in codon usage variation in Salmonella genes. Low bias was noticed in most of the genes. GC3 (guanine cytosine at third position composition does not influence codon usage variation in the genes of these Salmonella strains. Among the cluster of orthologous groups (COGs, translation, ribosomal structure biogenesis [J], and energy production and conversion [C] contained the highest number of potentially highly expressed (PHX genes. Correspondence analysis reveals the conserved nature of the genes. Highly expressed genes were detected. Conclusions: Selection for translational efficiency is the major source of variation of codon usage in the genes of Salmonella . Evolution of pathogenicity-related genes as a unit suggests their ability to infect and exist as a pathogen. Presence of a lot of PHX genes in the information and storage-processing category of COGs indicated their lifestyle and revealed that they were not subjected to genome reduction.

No specific gene expression signature in human granulosa and cumulus cells for prediction of oocyte fertilisation and embryo implantation.

Directory of Open Access Journals (Sweden)

Tanja Burnik Papler

Full Text Available In human IVF procedures objective and reliable biomarkers of oocyte and embryo quality are needed in order to increase the use of single embryo transfer (SET and thus prevent multiple pregnancies. During folliculogenesis there is an intense bi-directional communication between oocyte and follicular cells. For this reason gene expression profile of follicular cells could be an important indicator and biomarker of oocyte and embryo quality. The objective of this study was to identify gene expression signature(s in human granulosa (GC and cumulus (CC cells predictive of successful embryo implantation and oocyte fertilization. Forty-one patients were included in the study and individual GC and CC samples were collected; oocytes were cultivated separately, allowing a correlation with IVF outcome and elective SET was performed. Gene expression analysis was performed using microarrays, followed by a quantitative real-time PCR validation. After statistical analysis of microarray data, there were no significantly differentially expressed genes (FDR<0,05 between non-fertilized and fertilized oocytes and non-implanted and implanted embryos in either of the cell type. Furthermore, the results of quantitative real-time PCR were in consent with microarray data as there were no significant differences in gene expression of genes selected for validation. In conclusion, we did not find biomarkers for prediction of oocyte fertilization and embryo implantation in IVF procedures in the present study.
Gene expression signatures that predict radiation exposure in mice and humans.

Directory of Open Access Journals (Sweden)

Holly K Dressman

2007-04-01

Full Text Available The capacity to assess environmental inputs to biological phenotypes is limited by methods that can accurately and quantitatively measure these contributions. One such example can be seen in the context of exposure to ionizing radiation.We have made use of gene expression analysis of peripheral blood (PB mononuclear cells to develop expression profiles that accurately reflect prior radiation exposure. We demonstrate that expression profiles can be developed that not only predict radiation exposure in mice but also distinguish the level of radiation exposure, ranging from 50 cGy to 1,000 cGy. Likewise, a molecular signature of radiation response developed solely from irradiated human patient samples can predict and distinguish irradiated human PB samples from nonirradiated samples with an accuracy of 90%, sensitivity of 85%, and specificity of 94%. We further demonstrate that a radiation profile developed in the mouse can correctly distinguish PB samples from irradiated and nonirradiated human patients with an accuracy of 77%, sensitivity of 82%, and specificity of 75%. Taken together, these data demonstrate that molecular profiles can be generated that are highly predictive of different levels of radiation exposure in mice and humans.We suggest that this approach, with additional refinement, could provide a method to assess the effects of various environmental inputs into biological phenotypes as well as providing a more practical application of a rapid molecular screening test for the diagnosis of radiation exposure.
Tumour gene expression predicts response to cetuximab in patients with KRAS wild-type metastatic colorectal cancer.

Science.gov (United States)

Baker, J B; Dutta, D; Watson, D; Maddala, T; Munneke, B M; Shak, S; Rowinsky, E K; Xu, L-A; Harbison, C T; Clark, E A; Mauro, D J; Khambata-Ford, S

2011-02-01

Although it is accepted that metastatic colorectal cancers (mCRCs) that carry activating mutations in KRAS are unresponsive to anti-epidermal growth factor receptor (EGFR) monoclonal antibodies, a significant fraction of KRAS wild-type (wt) mCRCs are also unresponsive to anti-EGFR therapy. Genes encoding EGFR ligands amphiregulin (AREG) and epiregulin (EREG) are promising gene expression-based markers but have not been incorporated into a test to dichotomise KRAS wt mCRC patients with respect to sensitivity to anti-EGFR treatment. We used RT-PCR to test 110 candidate gene expression markers in primary tumours from 144 KRAS wt mCRC patients who received monotherapy with the anti-EGFR antibody cetuximab. Results were correlated with multiple clinical endpoints: disease control, objective response, and progression-free survival (PFS). Expression of many of the tested candidate genes, including EREG and AREG, strongly associate with all clinical endpoints. Using multivariate analysis with two-layer five-fold cross-validation, we constructed a four-gene predictive classifier. Strikingly, patients below the classifier cutpoint had PFS and disease control rates similar to those of patients with KRAS mutant mCRC. Gene expression appears to identify KRAS wt mCRC patients who receive little benefit from cetuximab. It will be important to test this model in an independent validation study.
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

Science.gov (United States)

Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

2015-01-01

There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.
A hemocyte gene expression signature correlated with predictive capacity of oysters to survive Vibrio infections

Directory of Open Access Journals (Sweden)

Rosa Rafael

2012-06-01

Full Text Available Abstract Background The complex balance between environmental and host factors is an important determinant of susceptibility to infection. Disturbances of this equilibrium may result in multifactorial diseases as illustrated by the summer mortality syndrome, a worldwide and complex phenomenon that affects the oysters, Crassostrea gigas. The summer mortality syndrome reveals a physiological intolerance making this oyster species susceptible to diseases. Exploration of genetic basis governing the oyster resistance or susceptibility to infections is thus a major goal for understanding field mortality events. In this context, we used high-throughput genomic approaches to identify genetic traits that may characterize inherent survival capacities in C. gigas. Results Using digital gene expression (DGE, we analyzed the transcriptomes of hemocytes (immunocompetent cells of oysters able or not able to survive infections by Vibrio species shown to be involved in summer mortalities. Hemocytes were nonlethally collected from oysters before Vibrio experimental infection, and two DGE libraries were generated from individuals that survived or did not survive. Exploration of DGE data and microfluidic qPCR analyses at individual level showed an extraordinary polymorphism in gene expressions, but also a set of hemocyte-expressed genes whose basal mRNA levels discriminate oyster capacity to survive infections by the pathogenic V. splendidus LGP32. Finally, we identified a signature of 14 genes that predicted oyster survival capacity. Their expressions are likely driven by distinct transcriptional regulation processes associated or not associated to gene copy number variation (CNV. Conclusions We provide here for the first time in oyster a gene expression survival signature that represents a useful tool for understanding mortality events and for assessing genetic traits of interest for disease resistance selection programs.
Genetic architecture of gene expression in the chicken

Directory of Open Access Journals (Sweden)

Stanley Dragana

2013-01-01

Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.
Radiation-induced gene expression in human subcutaneous fibroblasts is predictive of radiation-induced fibrosis

DEFF Research Database (Denmark)

Rødningen, Olaug Kristin; Børresen-Dale, Anne-Lise; Alsner, Jan

2008-01-01

BACKGROUND AND PURPOSE: Breast cancer patients show a large variation in normal tissue reactions after ionizing radiation (IR) therapy. One of the most common long-term adverse effects of ionizing radiotherapy is radiation-induced fibrosis (RIF), and several attempts have been made over the last...... years to develop predictive assays for RIF. Our aim was to identify basal and radiation-induced transcriptional profiles in fibroblasts from breast cancer patients that might be related to the individual risk of RIF in these patients. MATERIALS AND METHODS: Fibroblast cell lines from 31 individuals......-treated fibroblasts. Transcriptional differences in basal and radiation-induced gene expression profiles were investigated using 15K cDNA microarrays, and results analyzed by both SAM and PAM. RESULTS: Sixty differentially expressed genes were identified by applying SAM on 10 patients with the highest risk of RIF...
Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

Science.gov (United States)

Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

2015-01-01

The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262
Creating and validating cis-regulatory maps of tissue-specific gene expression regulation

Science.gov (United States)

O'Connor, Timothy R.; Bailey, Timothy L.

2014-01-01

Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088
HPV and high-risk gene expression profiles predict response to chemoradiotherapy in head and neck cancer, independent of clinical factors

International Nuclear Information System (INIS)

Jong, Monique C. de; Pramana, Jimmy; Knegjens, Joost L.; Balm, Alfons J.M.; Brekel, Michiel W.M. van den; Hauptmann, Michael; Begg, Adrian C.; Rasch, Coen R.N.

2010-01-01

Purpose: The purpose of this study was to combine gene expression profiles and clinical factors to provide a better prediction model of local control after chemoradiotherapy for advanced head and neck cancer. Material and methods: Gene expression data were available for a series of 92 advanced stage head and neck cancer patients treated with primary chemoradiotherapy. The effect of the Chung high-risk and Slebos HPV expression profiles on local control was analyzed in a model with age at diagnosis, gender, tumor site, tumor volume, T-stage and N-stage and HPV profile status. Results: Among 75 patients included in the study, the only factors significantly predicting local control were tumor site (oral cavity vs. Pharynx, hazard ratio 4.2 [95% CI 1.4-12.5]), Chung gene expression status (high vs. Low risk profile, hazard ratio 4.4 [95% CI 1.5-13.3]) and HPV profile (negative vs. Positive profile, hazard ratio 6.2 [95% CI 1.7-22.5]). Conclusions: Chung high-risk expression profile and a negative HPV expression profile were significantly associated with increased risk of local recurrence after chemoradiotherapy in advanced pharynx and oral cavity tumors, independent of clinical factors.
GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

Science.gov (United States)

Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

2018-01-01

When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.
Gene expression programming for prediction of scour depth downstream of sills

Science.gov (United States)

Azamathulla, H. Md.

2012-08-01

SummaryLocal scour is crucial in the degradation of river bed and the stability of grade control structures, stilling basins, aprons, ski-jump bucket spillways, bed sills, weirs, check dams, etc. This short communication presents gene-expression programming (GEP), which is an extension to genetic programming (GP), as an alternative approach to predict scour depth downstream of sills. Published data were compiled from the literature for the scour depth downstream of sills. The proposed GEP approach gives satisfactory results (R2 = 0.967 and RMSE = 0.088) compared to the existing predictors (Chinnarasri and Kositgittiwong, 2008) with R2 = 0.87 and RMSE = 2.452 for relative scour depth.
Polycistronic gene expression in Aspergillus niger.

Science.gov (United States)

Schuetze, Tabea; Meyer, Vera

2017-09-25

Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at
The accuracy of survival time prediction for patients with glioma is improved by measuring mitotic spindle checkpoint gene expression.

Directory of Open Access Journals (Sweden)

Li Bie

Full Text Available Identification of gene expression changes that improve prediction of survival time across all glioma grades would be clinically useful. Four Affymetrix GeneChip datasets from the literature, containing data from 771 glioma samples representing all WHO grades and eight normal brain samples, were used in an ANOVA model to screen for transcript changes that correlated with grade. Observations were confirmed and extended using qPCR assays on RNA derived from 38 additional glioma samples and eight normal samples for which survival data were available. RNA levels of eight major mitotic spindle assembly checkpoint (SAC genes (BUB1, BUB1B, BUB3, CENPE, MAD1L1, MAD2L1, CDC20, TTK significantly correlated with glioma grade and six also significantly correlated with survival time. In particular, the level of BUB1B expression was highly correlated with survival time (p<0.0001, and significantly outperformed all other measured parameters, including two standards; WHO grade and MIB-1 (Ki-67 labeling index. Measurement of the expression levels of a small set of SAC genes may complement histological grade and other clinical parameters for predicting survival time.
Using PCR to Target Misconceptions about Gene Expression

Directory of Open Access Journals (Sweden)

Leslie K. Wright

2013-02-01

Full Text Available We present a PCR-based laboratory exercise that can be used with first- or second-year biology students to help overcome common misconceptions about gene expression. Biology students typically do not have a clear understanding of the difference between genes (DNA and gene expression (mRNA/protein and often believe that genes exist in an organism or cell only when they are expressed. This laboratory exercise allows students to carry out a PCR-based experiment designed to challenge their misunderstanding of the difference between genes and gene expression. Students first transform E. coli with an inducible GFP gene containing plasmid and observe induced and un-induced colonies. The following exercise creates cognitive dissonance when actual PCR results contradict their initial (incorrect predictions of the presence of the GFP gene in transformed cells. Field testing of this laboratory exercise resulted in learning gains on both knowledge and application questions on concepts related to genes and gene expression.
FocusHeuristics - expression-data-driven network optimization and disease gene prediction.

Science.gov (United States)

Ernst, Mathias; Du, Yang; Warsow, Gregor; Hamed, Mohamed; Endlich, Nicole; Endlich, Karlhans; Murua Escobar, Hugo; Sklarz, Lisa-Madeleine; Sender, Sina; Junghanß, Christian; Möller, Steffen; Fuellen, Georg; Struckmann, Stephan

2017-02-16

To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes.
Conservation of transcription factor binding events predicts gene expression across species

Science.gov (United States)

Hemberg, Martin; Kreiman, Gabriel

2011-01-01

Recent technological advances have made it possible to determine the genome-wide binding sites of transcription factors (TFs). Comparisons across species have suggested a relatively low degree of evolutionary conservation of experimentally defined TF binding events (TFBEs). Using binding data for six different TFs in hepatocytes and embryonic stem cells from human and mouse, we demonstrate that evolutionary conservation of TFBEs within orthologous proximal promoters is closely linked to function, defined as expression of the target genes. We show that (i) there is a significantly higher degree of conservation of TFBEs when the target gene is expressed in both species; (ii) there is increased conservation of binding events for groups of TFs compared to individual TFs; and (iii) conserved TFBEs have a greater impact on the expression of their target genes than non-conserved ones. These results link conservation of structural elements (TFBEs) to conservation of function (gene expression) and suggest a higher degree of functional conservation than implied by previous studies. PMID:21622661
Development of Gene Expression Signatures for Practical Radiation Biodosimetry

International Nuclear Information System (INIS)

Paul, Sunirmal; Amundson, Sally A.

2008-01-01

Purpose: In a large-scale radiologic emergency, estimates of exposure doses and radiation injury would be required for individuals without physical dosimeters. Current methods are inadequate for the task, so we are developing gene expression profiles for radiation biodosimetry. This approach could provide both an estimate of physical radiation dose and an indication of the extent of individual injury or future risk. Methods and Materials: We used whole genome microarray expression profiling as a discovery platform to identify genes with the potential to predict radiation dose across an exposure range relevant for medical decision making in a radiologic emergency. Human peripheral blood from 10 healthy donors was irradiated ex vivo, and global gene expression was measured both 6 and 24 h after exposure. Results: A 74-gene signature was identified that distinguishes between four radiation doses (0.5, 2, 5, and 8 Gy) and controls. More than one third of these genes are regulated by TP53. A nearest centroid classifier using these same 74 genes correctly predicted 98% of samples taken either 6 h or 24 h after treatment as unexposed, exposed to 0.5, 2, or ≥5 Gy. Expression patterns of five genes (CDKN1A, FDXR, SESN1, BBC3, and PHPT1) from this signature were also confirmed by real-time polymerase chain reaction. Conclusion: The ability of a single gene set to predict radiation dose throughout a window of time without need for individual pre-exposure controls represents an important advance in the development of gene expression for biodosimetry
Gene expression profiles in stages II and III colon cancers

DEFF Research Database (Denmark)

Thorsteinsson, Morten; Kirkeby, Lene T; Hansen, Raino

2012-01-01

PURPOSE: A 128-gene signature has been proposed to predict outcome in patients with stages II and III colorectal cancers. In the present study, we aimed to reproduce and validate the 128-gene signature in external and independent material. METHODS: Gene expression data from the original material...... were retrieved from the Gene Expression Omnibus (GEO) (n¿=¿111) in addition to a Danish data set (n¿=¿37). All patients had stages II and III colon cancers. A Prediction Analysis of Microarray classifier, based on the 128-gene signature and the original training set of stage I (n¿=¿65) and stage IV (n...... correctly predicted as stage IV-like, and the remaining patients were predicted as stage I-like and unclassifiable, respectively. Stage II patients could not be stratified. CONCLUSIONS: The 128-gene signature showed reproducibility in stage III colon cancer, but could not predict recurrence in stage II...
Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

Energy Technology Data Exchange (ETDEWEB)

Shibayama, Masaki [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Maak, Matthias; Nitsche, Ulrich [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany); Gotoh, Kengo [Sysmex Corporation, Central Research Laboratories, Kobe 651-2271 (Japan); Rosenberg, Robert; Janssen, Klaus-Peter, E-mail: klaus-peter.janssen@lrz.tum.de [Chirurgische Klinik, Klinikum Rechts der Isar der TUM, München 81657 (Germany)

2011-07-07

Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer.

Prediction of Metastasis and Recurrence in Colorectal Cancer Based on Gene Expression Analysis: Ready for the Clinic?

International Nuclear Information System (INIS)

Shibayama, Masaki; Maak, Matthias; Nitsche, Ulrich; Gotoh, Kengo; Rosenberg, Robert; Janssen, Klaus-Peter

2011-01-01

Cancers of the colon and rectum, which rank among the most frequent human tumors, are currently treated by surgical resection in locally restricted tumor stages. However, disease recurrence and formation of local and distant metastasis frequently occur even in cases with successful curative resection of the primary tumor (R0). Recent technological advances in molecular diagnostic analysis have led to a wealth of knowledge about the changes in gene transcription in all stages of colorectal tumors. Differential gene expression, or transcriptome analysis, has been proposed by many groups to predict disease recurrence, clinical outcome, and also response to therapy, in addition to the well-established clinico-pathological factors. However, the clinical usability of gene expression profiling as a reliable and robust prognostic tool that allows evidence-based clinical decisions is currently under debate. In this review, we will discuss the most recent data on the prognostic significance and potential clinical application of genome wide expression analysis in colorectal cancer
Accurate, model-based tuning of synthetic gene expression using introns in S. cerevisiae.

Directory of Open Access Journals (Sweden)

Ido Yofe

2014-06-01

Full Text Available Introns are key regulators of eukaryotic gene expression and present a potentially powerful tool for the design of synthetic eukaryotic gene expression systems. However, intronic control over gene expression is governed by a multitude of complex, incompletely understood, regulatory mechanisms. Despite this lack of detailed mechanistic understanding, here we show how a relatively simple model enables accurate and predictable tuning of synthetic gene expression system in yeast using several predictive intron features such as transcript folding and sequence motifs. Using only natural Saccharomyces cerevisiae introns as regulators, we demonstrate fine and accurate control over gene expression spanning a 100 fold expression range. These results broaden the engineering toolbox of synthetic gene expression systems and provide a framework in which precise and robust tuning of gene expression is accomplished.
Pancreatic cancer circulating tumour cells express a cell motility gene signature that predicts survival after surgery

International Nuclear Information System (INIS)

Sergeant, Gregory; Eijsden, Rudy van; Roskams, Tania; Van Duppen, Victor; Topal, Baki

2012-01-01

(95% CI) = 1.366 (1.004 – 1.861)). Pancreatic CTC isolated from blood samples using FACS-based negative depletion, express a cell motility gene signature. Expression of this newly defined cell motility gene signature in the primary tumour can predict survival of patients undergoing surgical resection for pancreatic cancer. Clinical trials.gov NCT00495924
Prediction of the prognosis of breast cancer in routine histologic specimens using a simplified, low-cost gene expression signature

DEFF Research Database (Denmark)

Marcell, S.A.; Balazs, A.; Emese, A.

2013-01-01

Prediction of the prognosis of breast cancer in routine histologic specimens using a simplified, low-cost gene expression signature Background: Grade 2 breast carcinomas do not form a uniform prognostic group. Aim: To extend the number of patients and the investigated genes of a previously...... grade 2 breast carcinomas into prognostic groups. Gene expression was investigated by polymerase chain reaction in 249 formalin-fixed, paraffin-embedded breast tumors. The results were correlated with relapse-free survival. Results: Histologically grade 2 carcinomas were split into good and a poor...... identified prognostic signature described by the authors that reflect chromosomal instability in order to refine characterization of grade 2 breast cancers and identify driver genes. Methods: Using publicly available databases, the authors selected 9 target and 3 housekeeping genes that are capable to divide...
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

Science.gov (United States)

Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

2014-01-01

Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
Automated Protocol for Large-Scale Modeling of Gene Expression Data.

Science.gov (United States)

Hall, Michelle Lynn; Calkins, David; Sherman, Woody

2016-11-28

With the continued rise of phenotypic- and genotypic-based screening projects, computational methods to analyze, process, and ultimately make predictions in this field take on growing importance. Here we show how automated machine learning workflows can produce models that are predictive of differential gene expression as a function of a compound structure using data from A673 cells as a proof of principle. In particular, we present predictive models with an average accuracy of greater than 70% across a highly diverse ∼1000 gene expression profile. In contrast to the usual in silico design paradigm, where one interrogates a particular target-based response, this work opens the opportunity for virtual screening and lead optimization for desired multitarget gene expression profiles.
High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

Directory of Open Access Journals (Sweden)

Clark Taane G

2010-04-01

Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes
Predicting tissue-specific expressions based on sequence characteristics

KAUST Repository

Paik, Hyojung; Ryu, Tae Woo; Heo, Hyoungsam; Seo, Seungwon; Lee, Doheon; Hur, Cheolgoo

2011-01-01

In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.
Predicting tissue-specific expressions based on sequence characteristics

KAUST Repository

Paik, Hyojung

2011-04-30

In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.
Gene expression and gene therapy imaging

International Nuclear Information System (INIS)

Rome, Claire; Couillaud, Franck; Moonen, Chrit T.W.

2007-01-01

The fast growing field of molecular imaging has achieved major advances in imaging gene expression, an important element of gene therapy. Gene expression imaging is based on specific probes or contrast agents that allow either direct or indirect spatio-temporal evaluation of gene expression. Direct evaluation is possible with, for example, contrast agents that bind directly to a specific target (e.g., receptor). Indirect evaluation may be achieved by using specific substrate probes for a target enzyme. The use of marker genes, also called reporter genes, is an essential element of MI approaches for gene expression in gene therapy. The marker gene may not have a therapeutic role itself, but by coupling the marker gene to a therapeutic gene, expression of the marker gene reports on the expression of the therapeutic gene. Nuclear medicine and optical approaches are highly sensitive (detection of probes in the picomolar range), whereas MRI and ultrasound imaging are less sensitive and require amplification techniques and/or accumulation of contrast agents in enlarged contrast particles. Recently developed MI techniques are particularly relevant for gene therapy. Amongst these are the possibility to track gene therapy vectors such as stem cells, and the techniques that allow spatiotemporal control of gene expression by non-invasive heating (with MRI guided focused ultrasound) and the use of temperature sensitive promoters. (orig.)
Gene expression and 18FDG uptake in atherosclerotic carotid plaques

DEFF Research Database (Denmark)

Pedersen, Sune Folke; Graebe, Martin; Fisker Hag, Anne Mette

2010-01-01

) and an additional ipsilateral internal carotid artery stenosis of greater than 60% were recruited. FDG uptake in the carotids was determined by PET/computed tomography and expressed as mean and maximal standardized uptake values (SUVmean and SUVmax). The atherosclerotic plaques were subsequently recovered...... by carotid endarterectomy. The gene expression of markers of vulnerability - CD68, IL-18, matrix metalloproteinase 9, cathepsin K, GLUT-1, and hexokinase type II (HK2) - were measured in plaques by quantitative PCR. RESULTS: In a multivariate linear regression model, GLUT-1, CD68, cathepsin K, and HK2 gene...... expression remained in the final model as predictive variables of FDG accumulation calculated as SUVmean (R=0.26, PK, and HK2 gene expression as independent predictive variables of FDG accumulation calculated...
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence

OpenAIRE

Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

2015-01-01

Background: There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. Methods: All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinform...
Predicting response to primary chemotherapy: gene expression profiling of paraffin-embedded core biopsy tissue.

Science.gov (United States)

Mina, Lida; Soule, Sharon E; Badve, Sunil; Baehner, Fredrick L; Baker, Joffre; Cronin, Maureen; Watson, Drew; Liu, Mei-Lan; Sledge, George W; Shak, Steve; Miller, Kathy D

2007-06-01

Primary chemotherapy provides an ideal opportunity to correlate gene expression with response to treatment. We used paraffin-embedded core biopsies from a completed phase II trial to identify genes that correlate with response to primary chemotherapy. Patients with newly diagnosed stage II or III breast cancer were treated with sequential doxorubicin 75 mg/M2 q2 wks x 3 and docetaxel 40 mg/M2 weekly x 6; treatment order was randomly assigned. Pretreatment core biopsy samples were interrogated for genes that might correlate with pathologic complete response (pCR). In addition to the individual genes, the correlation of the Oncotype DX Recurrence Score with pCR was examined. Of 70 patients enrolled in the parent trial, core biopsies samples with sufficient RNA for gene analyses were available from 45 patients; 9 (20%) had inflammatory breast cancer (IBC). Six (14%) patients achieved a pCR. Twenty-two of the 274 candidate genes assessed correlated with pCR (p < 0.05). Genes correlating with pCR could be grouped into three large clusters: angiogenesis-related genes, proliferation related genes, and invasion-related genes. Expression of estrogen receptor (ER)-related genes and Recurrence Score did not correlate with pCR. In an exploratory analysis we compared gene expression in IBC to non-inflammatory breast cancer; twenty-four (9%) of the genes were differentially expressed (p < 0.05), 5 were upregulated and 19 were downregulated in IBC. Gene expression analysis on core biopsy samples is feasible and identifies candidate genes that correlate with pCR to primary chemotherapy. Gene expression in IBC differs significantly from noninflammatory breast cancer.
Integrating circadian activity and gene expression profiles to predict chronotoxicity of Drosophila suzukii response to insecticides.

Science.gov (United States)

Hamby, Kelly A; Kwok, Rosanna S; Zalom, Frank G; Chiu, Joanna C

2013-01-01

Native to Southeast Asia, Drosophila suzukii (Matsumura) is a recent invader that infests intact ripe and ripening fruit, leading to significant crop losses in the U.S., Canada, and Europe. Since current D. suzukii management strategies rely heavily on insecticide usage and insecticide detoxification gene expression is under circadian regulation in the closely related Drosophila melanogaster, we set out to determine if integrative analysis of daily activity patterns and detoxification gene expression can predict chronotoxicity of D. suzukii to insecticides. Locomotor assays were performed under conditions that approximate a typical summer or winter day in Watsonville, California, where D. suzukii was first detected in North America. As expected, daily activity patterns of D. suzukii appeared quite different between 'summer' and 'winter' conditions due to differences in photoperiod and temperature. In the 'summer', D. suzukii assumed a more bimodal activity pattern, with maximum activity occurring at dawn and dusk. In the 'winter', activity was unimodal and restricted to the warmest part of the circadian cycle. Expression analysis of six detoxification genes and acute contact bioassays were performed at multiple circadian times, but only in conditions approximating Watsonville summer, the cropping season, when most insecticide applications occur. Five of the genes tested exhibited rhythmic expression, with the majority showing peak expression at dawn (ZT0, 6am). We observed significant differences in the chronotoxicity of D. suzukii towards malathion, with highest susceptibility at ZT0 (6am), corresponding to peak expression of cytochrome P450s that may be involved in bioactivation of malathion. High activity levels were not found to correlate with high insecticide susceptibility as initially hypothesized. Chronobiology and chronotoxicity of D. suzukii provide valuable insights for monitoring and control efforts, because insect activity as well as insecticide timing
DEEP--a tool for differential expression effector prediction.

Science.gov (United States)

Degenhardt, Jost; Haubrock, Martin; Dönitz, Jürgen; Wingender, Edgar; Crass, Torsten

2007-07-01

High-throughput methods for measuring transcript abundance, like SAGE or microarrays, are widely used for determining differences in gene expression between different tissue types, dignities (normal/malignant) or time points. Further analysis of such data frequently aims at the identification of gene interaction networks that form the causal basis for the observed properties of the systems under examination. To this end, it is usually not sufficient to rely on the measured gene expression levels alone; rather, additional biological knowledge has to be taken into account in order to generate useful hypotheses about the molecular mechanism leading to the realization of a certain phenotype. We present a method that combines gene expression data with biological expert knowledge on molecular interaction networks, as described by the TRANSPATH database on signal transduction, to predict additional--and not necessarily differentially expressed--genes or gene products which might participate in processes specific for either of the examined tissues or conditions. In a first step, significance values for over-expression in tissue/condition A or B are assigned to all genes in the expression data set. Genes with a significance value exceeding a certain threshold are used as starting points for the reconstruction of a graph with signaling components as nodes and signaling events as edges. In a subsequent graph traversal process, again starting from the previously identified differentially expressed genes, all encountered nodes 'inherit' all their starting nodes' significance values. In a final step, the graph is visualized, the nodes being colored according to a weighted average of their inherited significance values. Each node's, or sub-network's, predominant color, ranging from green (significant for tissue/condition A) over yellow (not significant for either tissue/condition) to red (significant for tissue/condition B), thus gives an immediate visual clue on which molecules
Bone Metastasis in Advanced Breast Cancer: Analysis of Gene Expression Microarray.

Science.gov (United States)

Cosphiadi, Irawan; Atmakusumah, Tubagus D; Siregar, Nurjati C; Muthalib, Abdul; Harahap, Alida; Mansyur, Muchtarruddin

2018-03-08

Approximately 30% to 40% of breast cancer recurrences involve bone metastasis (BM). Certain genes have been linked to BM; however, none have been able to predict bone involvement. In this study, we analyzed gene expression profiles in advanced breast cancer patients to elucidate genes that can be used to predict BM. A total of 92 advanced breast cancer patients, including 46 patients with BM and 46 patients without BM, were identified for this study. Immunohistochemistry and gene expression analysis was performed on 81 formalin-fixed paraffin-embedded samples. Data were collected through medical records, and gene expression of 200 selected genes compiled from 6 previous studies was performed using NanoString nCounter. Genetic expression profiles showed that 22 genes were significantly differentially expressed between breast cancer patients with metastasis in bone and other organs (BM+) and non-BM, whereas subjects with only BM showed 17 significantly differentially expressed genes. The following genes were associated with an increasing incidence of BM in the BM+ group: estrogen receptor 1 (ESR1), GATA binding protein 3 (GATA3), and melanophilin with an area under the curve (AUC) of 0.804. In the BM group, the following genes were associated with an increasing incidence of BM: ESR1, progesterone receptor, B-cell lymphoma 2, Rab escort protein, N-acetyltransferase 1, GATA3, annexin A9, and chromosome 9 open reading frame 116. ESR1 and GATA3 showed an increased strength of association with an AUC of 0.928. A combination of the identified 3 genes in BM+ and 8 genes in BM showed better prediction than did each individual gene, and this combination can be used as a training set. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
The identification of functional motifs in temporal gene expression analysis

Directory of Open Access Journals (Sweden)

Michael G. Surette

2005-01-01

Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.
Gene Expression Differences Predict Treatment Outcome of Merkel Cell Carcinoma Patients

Directory of Open Access Journals (Sweden)

Loren Masterson

2014-01-01

Full Text Available Due to the rarity of Merkel cell carcinoma (MCC, prospective clinical trials have not been practical. This study aimed to identify biomarkers with prognostic significance. While sixty-two patients were identified who were treated for MCC at our institution, only seventeen patients had adequate formalin-fixed paraffin-embedded archival tissue and followup to be included in the study. Patients were stratified into good, moderate, or poor prognosis. Laser capture microdissection was used to isolate tumor cells for subsequent RNA isolation and gene expression analysis with Affymetrix GeneChip Human Exon 1.0 ST arrays. Among the 191 genes demonstrating significant differential expression between prognostic groups, keratin 20 and neurofilament protein have previously been identified in studies of MCC and were significantly upregulated in tumors from patients with a poor prognosis. Immunohistochemistry further established that keratin 20 was overexpressed in the poor prognosis tumors. In addition, novel genes of interest such as phospholipase A2 group X, kinesin family member 3A, tumor protein D52, mucin 1, and KIT were upregulated in specimens from patients with poor prognosis. Our pilot study identified several gene expression differences which could be used in the future as prognostic biomarkers in MCC patients.
Noise minimization in eukaryotic gene expression.

Directory of Open Access Journals (Sweden)

Hunter B Fraser

2004-06-01

Full Text Available All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or "noise." Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.
Noise minimization in eukaryotic gene expression

Energy Technology Data Exchange (ETDEWEB)

Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

2004-01-15

All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

Noise minimization in eukaryotic gene expression

International Nuclear Information System (INIS)

Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

2004-01-01

All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection
Vascular Gene Expression: A Hypothesis

Directory of Open Access Journals (Sweden)

Angélica Concepción eMartínez-Navarro

2013-07-01

Full Text Available The phloem is the conduit through which photoassimilates are distributed from autotrophic to heterotrophic tissues and is involved in the distribution of signaling molecules that coordinate plant growth and responses to the environment. Phloem function depends on the coordinate expression of a large array of genes. We have previously identified conserved motifs in upstream regions of the Arabidopsis genes, encoding the homologs of pumpkin phloem sap mRNAs, displaying expression in vascular tissues. This tissue-specific expression in Arabidopsis is predicted by the overrepresentation of GA/CT-rich motifs in gene promoters. In this work we have searched for common motifs in upstream regions of the homologous genes from plants considered to possess a primitive vascular tissue (a lycophyte, as well as from others that lack a true vascular tissue (a bryophyte, and finally from chlorophytes. Both lycophyte and bryophyte display motifs similar to those found in Arabidopsis with a significantly low E-value, while the chlorophytes showed either a different conserved motif or no conserved motif at all. These results suggest that these same genes are expressed coordinately in non- vascular plants; this coordinate expression may have been one of the prerequisites for the development of conducting tissues in plants. We have also analyzed the phylogeny of conserved proteins that may be involved in phloem function and development. The presence of CmPP16, APL, FT and YDA in chlorophytes suggests the recruitment of ancient regulatory networks for the development of the vascular tissue during evolution while OPS is a novel protein specific to vascular plants.
Gene expression

International Nuclear Information System (INIS)

Hildebrand, C.E.; Crawford, B.D.; Walters, R.A.; Enger, M.D.

1983-01-01

We prepared probes for isolating functional pieces of the metallothionein locus. The probes enabled a variety of experiments, eventually revealing two mechanisms for metallothionein gene expression, the order of the DNA coding units at the locus, and the location of the gene site in its chromosome. Once the switch regulating metallothionein synthesis was located, it could be joined by recombinant DNA methods to other, unrelated genes, then reintroduced into cells by gene-transfer techniques. The expression of these recombinant genes could then be induced by exposing the cells to Zn 2+ or Cd 2+ . We would thus take advantage of the clearly defined switching properties of the metallothionein gene to manipulate the expression of other, perhaps normally constitutive, genes. Already, despite an incomplete understanding of how the regulatory switch of the metallothionein locus operates, such experiments have been performed successfully
Gene Expression Analysis of Four Radiation-resistant Bacteria

OpenAIRE

Gao, Na; Ma, Bin-Guang; Zhang, Yu-Sheng; Song, Qin; Chen, Ling-Ling; Zhang, Hong-Yu

2009-01-01

To investigate the general radiation-resistant mechanisms of bacteria, bioinformatic method was employed to predict highly expressed genes for four radiation-resistant bacteria, i.e. Deinococcus geothermalis (D. geo), Deinococcus radiodurans (D. rad), Kineococcus radiotolerans (K. rad) and Rubrobacter xylanophilus (R. xyl). It is revealed that most of the three reference gene sets, i.e. ribosomal proteins, transcription factors and major chaperones, are generally highly expressed in the four ...
Prediction of essential proteins based on subcellular localization and gene expression correlation.

Science.gov (United States)

Fan, Yetian; Tang, Xiwei; Hu, Xiaohua; Wu, Wei; Ping, Qing

2017-12-01

Essential proteins are indispensable to the survival and development process of living organisms. To understand the functional mechanisms of essential proteins, which can be applied to the analysis of disease and design of drugs, it is important to identify essential proteins from a set of proteins first. As traditional experimental methods designed to test out essential proteins are usually expensive and laborious, computational methods, which utilize biological and topological features of proteins, have attracted more attention in recent years. Protein-protein interaction networks, together with other biological data, have been explored to improve the performance of essential protein prediction. The proposed method SCP is evaluated on Saccharomyces cerevisiae datasets and compared with five other methods. The results show that our method SCP outperforms the other five methods in terms of accuracy of essential protein prediction. In this paper, we propose a novel algorithm named SCP, which combines the ranking by a modified PageRank algorithm based on subcellular compartments information, with the ranking by Pearson correlation coefficient (PCC) calculated from gene expression data. Experiments show that subcellular localization information is promising in boosting essential protein prediction.
Perturbation of B Cell Gene Expression Persists in HIV-Infected Children Despite Effective Antiretroviral Therapy and Predicts H1N1 Response.

Science.gov (United States)

Cotugno, Nicola; De Armas, Lesley; Pallikkuth, Suresh; Rinaldi, Stefano; Issac, Biju; Cagigi, Alberto; Rossi, Paolo; Palma, Paolo; Pahwa, Savita

2017-01-01

Despite effective antiretroviral therapy (ART), HIV-infected individuals with apparently similar clinical and immunological characteristics can vary in responsiveness to vaccinations. However, molecular mechanisms responsible for such impairment, as well as biomarkers able to predict vaccine responsiveness in HIV-infected children, remain unknown. Following the hypothesis that a B cell qualitative impairment persists in HIV-infected children (HIV) despite effective ART and phenotypic B cell immune reconstitution, the aim of the current study was to investigate B cell gene expression of HIV compared to age-matched healthy controls (HCs) and to determine whether distinct gene expression patterns could predict the ability to respond to influenza vaccine. To do so, we analyzed prevaccination transcriptional levels of a 96-gene panel in equal numbers of sort-purified B cell subsets (SPBS) isolated from peripheral blood mononuclear cells using multiplexed RT-PCR. Immune responses to H1N1 antigen were determined by hemaglutination inhibition and memory B cell ELISpot assays following trivalent-inactivated influenza vaccination (TIV) for all study participants. Although there were no differences in terms of cell frequencies of SPBS between HIV and HC, the groups were distinguishable based upon gene expression analyses. Indeed, a 28-gene signature, characterized by higher expression of genes involved in the inflammatory response and immune activation was observed in activated memory B cells (CD27 + CD21 - ) from HIV when compared to HC despite long-term viral control (>24 months). Further analysis, taking into account H1N1 responses after TIV in HIV participants, revealed that a 25-gene signature in resting memory (RM) B cells (CD27 + CD21 + ) was able to distinguish vaccine responders from non-responders (NR). In fact, prevaccination RM B cells of responders showed a higher expression of gene sets involved in B cell adaptive immune responses ( APRIL, BTK, BLIMP1 ) and
Identification of genes showing differential expression profile ...

Indian Academy of Sciences (India)

3Department of Natural Sciences, International Christian University, Mitaka, Tokyo 181-8585, Japan ... the changes of expression predicted from gene function suggested association ... ate School of Science and Technology, Niigata University.
Codon usage and amino acid usage influence genes expression level.

Science.gov (United States)

Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

2018-02-01

Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
Gene Expression Commons: an open platform for absolute gene expression profiling.

Directory of Open Access Journals (Sweden)

Jun Seita

Full Text Available Gene expression profiling using microarrays has been limited to comparisons of gene expression between small numbers of samples within individual experiments. However, the unknown and variable sensitivities of each probeset have rendered the absolute expression of any given gene nearly impossible to estimate. We have overcome this limitation by using a very large number (>10,000 of varied microarray data as a common reference, so that statistical attributes of each probeset, such as the dynamic range and threshold between low and high expression, can be reliably discovered through meta-analysis. This strategy is implemented in a web-based platform named "Gene Expression Commons" (https://gexc.stanford.edu/ which contains data of 39 distinct highly purified mouse hematopoietic stem/progenitor/differentiated cell populations covering almost the entire hematopoietic system. Since the Gene Expression Commons is designed as an open platform, investigators can explore the expression level of any gene, search by expression patterns of interest, submit their own microarray data, and design their own working models representing biological relationship among samples.
Cross-study analysis of gene expression data for intermediate neuroblastoma identifies two biological subtypes

International Nuclear Information System (INIS)

Warnat, Patrick; Oberthuer, André; Fischer, Matthias; Westermann, Frank; Eils, Roland; Brors, Benedikt

2007-01-01

Neuroblastoma patients show heterogeneous clinical courses ranging from life-threatening progression to spontaneous regression. Recently, gene expression profiles of neuroblastoma tumours were associated with clinically different phenotypes. However, such data is still rare for important patient subgroups, such as patients with MYCN non-amplified advanced stage disease. Prediction of the individual course of disease and optimal therapy selection in this cohort is challenging. Additional research effort is needed to describe the patterns of gene expression in this cohort and to identify reliable prognostic markers for this subset of patients. We combined gene expression data from two studies in a meta-analysis in order to investigate differences in gene expression of advanced stage (3 or 4) tumours without MYCN amplification that show contrasting outcomes (alive or dead) at five years after initial diagnosis. In addition, a predictive model for outcome was generated. Gene expression profiles from 66 patients were included from two studies using different microarray platforms. In the combined data set, 72 genes were identified as differentially expressed by meta-analysis at a false discovery rate (FDR) of 8.33%. Meta-analysis detected 34 differentially expressed genes that were not found as significant in either single study. Outcome prediction based on data of both studies resulted in a predictive accuracy of 77%. Moreover, the genes that were differentially expressed in subgroups of advanced stage patients without MYCN amplification accurately separated MYCN amplified tumours from low stage tumours without MYCN amplification. Our findings support the hypothesis that neuroblastoma consists of two biologically distinct subgroups that differ by characteristic gene expression patterns, which are associated with divergent clinical outcome
Population genetic variation in gene expression is associated withphenotypic variation in Saccharomyces cerevisiae

Energy Technology Data Exchange (ETDEWEB)

Fay, Justin C.; McCullough, Heather L.; Sniegowski, Paul D.; Eisen, Michael B.

2004-02-25

The relationship between genetic variation in gene expression and phenotypic variation observable in nature is not well understood. Identifying how many phenotypes are associated with differences in gene expression and how many gene-expression differences are associated with a phenotype is important to understanding the molecular basis and evolution of complex traits. Results: We compared levels of gene expression among nine natural isolates of Saccharomyces cerevisiae grown either in the presence or absence of copper sulfate. Of the nine strains, two show a reduced growth rate and two others are rust colored in the presence of copper sulfate. We identified 633 genes that show significant differences in expression among strains. Of these genes,20 were correlated with resistance to copper sulfate and 24 were correlated with rust coloration. The function of these genes in combination with their expression pattern suggests the presence of both correlative and causative expression differences. But the majority of differentially expressed genes were not correlated with either phenotype and showed the same expression pattern both in the presence and absence of copper sulfate. To determine whether these expression differences may contribute to phenotypic variation under other environmental conditions, we examined one phenotype, freeze tolerance, predicted by the differential expression of the aquaporin gene AQY2. We found freeze tolerance is associated with the expression of AQY2. Conclusions: Gene expression differences provide substantial insight into the molecular basis of naturally occurring traits and can be used to predict environment dependent phenotypic variation.
The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

Directory of Open Access Journals (Sweden)

Jean-François Gout

2010-05-01

Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.
Semi-supervised prediction of gene regulatory networks using ...

Indian Academy of Sciences (India)

2015-09-28

Sep 28, 2015 ... Use of computational methods to predict gene regulatory networks (GRNs) from gene expression data is a challenging ... two types of methods differ primarily based on whether ..... negligible, allowing us to draw the qualitative conclusions .... research will be conducted to develop additional biologically.
Predicting Recurrence and Progression of Noninvasive Papillary Bladder Cancer at Initial Presentation Based on Quantitative Gene Expression Profiles

DEFF Research Database (Denmark)

Birkhahn, M.; Mitra, A.P.; Williams, Johan

2010-01-01

% specificity. Since this is a small retrospective study using medium-throughput profiling, larger confirmatory studies are needed. Conclusions: Gene expression profiling across relevant cancer pathways appears to be a promising approach for Ta bladder tumor outcome prediction at initial diagnosis......Background: Currently, tumor grade is the best predictor of outcome at first presentation of noninvasive papillary (Ta) bladder cancer. However, reliable predictors of Ta tumor recurrence and progression for individual patients, which could optimize treatment and follow-up schedules based...... on specific tumor biology, are yet to be identified. Objective: To identify genes predictive for recurrence and progression in Ta bladder cancer at first presentation using a quantitative, pathway-specific approach. Design, setting, and participants: Retrospective study of patients with Ta G2/3 bladder tumors...
Large clusters of co-expressed genes in the Drosophila genome.

Science.gov (United States)

Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

2002-12-12

Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.
Analysis of multiplex gene expression maps obtained by voxelation

Directory of Open Access Journals (Sweden)

Smith Desmond J

2009-04-01

cortex and corpus callosum. Conclusion The experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.
Analysis of multiplex gene expression maps obtained by voxelation.

Science.gov (United States)

An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

2009-04-29

results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.
Validation of differential gene expression algorithms: Application comparing fold-change estimation to hypothesis testing

Directory of Open Access Journals (Sweden)

Bickel David R

2010-01-01

Full Text Available Abstract Background Sustained research on the problem of determining which genes are differentially expressed on the basis of microarray data has yielded a plethora of statistical algorithms, each justified by theory, simulation, or ad hoc validation and yet differing in practical results from equally justified algorithms. Recently, a concordance method that measures agreement among gene lists have been introduced to assess various aspects of differential gene expression detection. This method has the advantage of basing its assessment solely on the results of real data analyses, but as it requires examining gene lists of given sizes, it may be unstable. Results Two methodologies for assessing predictive error are described: a cross-validation method and a posterior predictive method. As a nonparametric method of estimating prediction error from observed expression levels, cross validation provides an empirical approach to assessing algorithms for detecting differential gene expression that is fully justified for large numbers of biological replicates. Because it leverages the knowledge that only a small portion of genes are differentially expressed, the posterior predictive method is expected to provide more reliable estimates of algorithm performance, allaying concerns about limited biological replication. In practice, the posterior predictive method can assess when its approximations are valid and when they are inaccurate. Under conditions in which its approximations are valid, it corroborates the results of cross validation. Both comparison methodologies are applicable to both single-channel and dual-channel microarrays. For the data sets considered, estimating prediction error by cross validation demonstrates that empirical Bayes methods based on hierarchical models tend to outperform algorithms based on selecting genes by their fold changes or by non-hierarchical model-selection criteria. (The latter two approaches have comparable
Prediction of graft-versus-host disease in humans by donor gene-expression profiling.

Directory of Open Access Journals (Sweden)

Chantal Baron

2007-01-01

Full Text Available BACKGROUND: Graft-versus-host disease (GVHD results from recognition of host antigens by donor T cells following allogeneic hematopoietic cell transplantation (AHCT. Notably, histoincompatibility between donor and recipient is necessary but not sufficient to elicit GVHD. Therefore, we tested the hypothesis that some donors may be "stronger alloresponders" than others, and consequently more likely to elicit GVHD. METHODS AND FINDINGS: To this end, we measured the gene-expression profiles of CD4(+ and CD8(+ T cells from 50 AHCT donors with microarrays. We report that pre-AHCT gene-expression profiling segregates donors whose recipient suffered from GVHD or not. Using quantitative PCR, established statistical tests, and analysis of multiple independent training-test datasets, we found that for chronic GVHD the "dangerous donor" trait (occurrence of GVHD in the recipient is under polygenic control and is shaped by the activity of genes that regulate transforming growth factor-beta signaling and cell proliferation. CONCLUSIONS: These findings strongly suggest that the donor gene-expression profile has a dominant influence on the occurrence of GVHD in the recipient. The ability to discriminate strong and weak alloresponders using gene-expression profiling could pave the way to personalized transplantation medicine.
Imaging gene expression in gene therapy

International Nuclear Information System (INIS)

Wiebe, Leonard I.

1997-01-01

Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on 'suicide gene therapy' of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k + ) has been use for 'suicide' in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k + gene expression where the H S V-1 t k + gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([ 18 F]F H P G; [ 18 F]-A C V), and pyrimidine- ([ 123 / 131 I]I V R F U; [ 124 / 131I ]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [ 123 / 131I ]I V R F U imaging with the H S V-1 t k + reporter gene will be presented

Identification of differentially expressed genes in cutaneous squamous cell carcinoma by microarray expression profiling

Directory of Open Access Journals (Sweden)

Sterry Wolfram

2006-08-01

Full Text Available Abstract Background Carcinogenesis is a multi-step process indicated by several genes up- or down-regulated during tumor progression. This study examined and identified differentially expressed genes in cutaneous squamous cell carcinoma (SCC. Results Three different biopsies of 5 immunosuppressed organ-transplanted recipients each normal skin (all were pooled, actinic keratosis (AK (two were pooled, and invasive SCC and additionally 5 normal skin tissues from immunocompetent patients were analyzed. Thus, total RNA of 15 specimens were used for hybridization with Affymetrix HG-U133A microarray technology containing 22,283 genes. Data analyses were performed by prediction analysis of microarrays using nearest shrunken centroids with the threshold 3.5 and ANOVA analysis was independently performed in order to identify differentially expressed genes (p vs. AK and SCC were observed for 118 genes. Conclusion The majority of identified differentially expressed genes in cutaneous SCC were previously not described.
Gene prediction validation and functional analysis of redundant pathways

DEFF Research Database (Denmark)

Sønderkær, Mads

2011-01-01

have employed a large mRNA-seq data set to improve and validate ab initio predicted gene models. This direct experimental evidence also provides reliable determinations of UTR regions and polyadenylation sites, which are not easily predicted in plants. Furthermore, once an annotated genome sequence...... is available, gene expression by mRNA-Seq enables acquisition of a more complete overview of gene isoform usage in complex enzymatic pathways enabling the identification of key genes. Metabolism in potatoes This information is useful e.g. for crop improvement based on manipulation of agronomically important...
Multiclass Prediction with Partial Least Square Regression for Gene Expression Data: Applications in Breast Cancer Intrinsic Taxonomy

Directory of Open Access Journals (Sweden)

Chi-Cheng Huang

2013-01-01

Full Text Available Multiclass prediction remains an obstacle for high-throughput data analysis such as microarray gene expression profiles. Despite recent advancements in machine learning and bioinformatics, most classification tools were limited to the applications of binary responses. Our aim was to apply partial least square (PLS regression for breast cancer intrinsic taxonomy, of which five distinct molecular subtypes were identified. The PAM50 signature genes were used as predictive variables in PLS analysis, and the latent gene component scores were used in binary logistic regression for each molecular subtype. The 139 prototypical arrays for PAM50 development were used as training dataset, and three independent microarray studies with Han Chinese origin were used for independent validation (n=535. The agreement between PAM50 centroid-based single sample prediction (SSP and PLS-regression was excellent (weighted Kappa: 0.988 within the training samples, but deteriorated substantially in independent samples, which could attribute to much more unclassified samples by PLS-regression. If these unclassified samples were removed, the agreement between PAM50 SSP and PLS-regression improved enormously (weighted Kappa: 0.829 as opposed to 0.541 when unclassified samples were analyzed. Our study ascertained the feasibility of PLS-regression in multi-class prediction, and distinct clinical presentations and prognostic discrepancies were observed across breast cancer molecular subtypes.
Combining Gene Signatures Improves Prediction of Breast Cancer Survival

Science.gov (United States)

Zhao, Xi; Naume, Bjørn; Langerød, Anita; Frigessi, Arnoldo; Kristensen, Vessela N.; Børresen-Dale, Anne-Lise; Lingjærde, Ole Christian

2011-01-01

Background Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. Principal Findings To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. Conclusion Combining the predictive strength of multiple gene signatures improves prediction of breast
Combining gene signatures improves prediction of breast cancer survival.

Directory of Open Access Journals (Sweden)

Xi Zhao

Full Text Available BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123 and test set (n = 81, respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014. Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001. The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves
A Machine Learned Classifier That Uses Gene Expression Data to Accurately Predict Estrogen Receptor Status

Science.gov (United States)

Bastani, Meysam; Vos, Larissa; Asgarian, Nasimeh; Deschenes, Jean; Graham, Kathryn; Mackey, John; Greiner, Russell

2013-01-01

Background Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER) status of the tumor. However, the standard for determining this status, immunohistochemical analysis of formalin-fixed paraffin embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results. Methods To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin fixed tumor. Results This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status. Conclusions Our efficient and parsimonious classifier lends itself to high throughput, highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions. PMID:24312637
Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data

OpenAIRE

Ezer, Daphne; Moignard, Victoria; G?ttgens, Berthold; Adryan, Boris

2016-01-01

Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete ...
A gene expression signature associated with survival in metastatic melanoma

Science.gov (United States)

Mandruzzato, Susanna; Callegaro, Andrea; Turcatel, Gianluca; Francescato, Samuela; Montesco, Maria C; Chiarion-Sileni, Vanna; Mocellin, Simone; Rossi, Carlo R; Bicciato, Silvio; Wang, Ena; Marincola, Francesco M; Zanovello, Paola

2006-01-01

Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM) to identify genes associated with patient survival, and supervised principal components (SPC) to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells. PMID:17129373
A gene expression signature associated with survival in metastatic melanoma

Directory of Open Access Journals (Sweden)

Rossi Carlo R

2006-11-01

Full Text Available Abstract Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM to identify genes associated with patient survival, and supervised principal components (SPC to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells.
Clinical value of prognosis gene expression signatures in colorectal cancer: a systematic review.

Directory of Open Access Journals (Sweden)

Rebeca Sanz-Pamplona

Full Text Available INTRODUCTION: The traditional staging system is inadequate to identify those patients with stage II colorectal cancer (CRC at high risk of recurrence or with stage III CRC at low risk. A number of gene expression signatures to predict CRC prognosis have been proposed, but none is routinely used in the clinic. The aim of this work was to assess the prediction ability and potential clinical usefulness of these signatures in a series of independent datasets. METHODS: A literature review identified 31 gene expression signatures that used gene expression data to predict prognosis in CRC tissue. The search was based on the PubMed database and was restricted to papers published from January 2004 to December 2011. Eleven CRC gene expression datasets with outcome information were identified and downloaded from public repositories. Random Forest classifier was used to build predictors from the gene lists. Matthews correlation coefficient was chosen as a measure of classification accuracy and its associated p-value was used to assess association with prognosis. For clinical usefulness evaluation, positive and negative post-tests probabilities were computed in stage II and III samples. RESULTS: Five gene signatures showed significant association with prognosis and provided reasonable prediction accuracy in their own training datasets. Nevertheless, all signatures showed low reproducibility in independent data. Stratified analyses by stage or microsatellite instability status showed significant association but limited discrimination ability, especially in stage II tumors. From a clinical perspective, the most predictive signatures showed a minor but significant improvement over the classical staging system. CONCLUSIONS: The published signatures show low prediction accuracy but moderate clinical usefulness. Although gene expression data may inform prognosis, better strategies for signature validation are needed to encourage their widespread use in the clinic.
Imaging gene expression in gene therapy

Energy Technology Data Exchange (ETDEWEB)

Wiebe, Leonard I. [Alberta Univ., Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

1997-12-31

Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on `suicide gene therapy` of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k{sup +}) has been use for `suicide` in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k{sup +} gene expression where the H S V-1 t k{sup +} gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([{sup 18} F]F H P G; [{sup 18} F]-A C V), and pyrimidine- ([{sup 123}/{sup 131} I]I V R F U; [{sup 124}/{sup 131I}]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [{sup 123}/{sup 131I}]I V R F U imaging with the H S V-1 t k{sup +} reporter gene will be presented
Transcriptome database resource and gene expression atlas for the rose

Science.gov (United States)

2012-01-01

Background For centuries roses have been selected based on a number of traits. Little information exists on the genetic and molecular basis that contributes to these traits, mainly because information on expressed genes for this economically important ornamental plant is scarce. Results Here, we used a combination of Illumina and 454 sequencing technologies to generate information on Rosa sp. transcripts using RNA from various tissues and in response to biotic and abiotic stresses. A total of 80714 transcript clusters were identified and 76611 peptides have been predicted among which 20997 have been clustered into 13900 protein families. BLASTp hits in closely related Rosaceae species revealed that about half of the predicted peptides in the strawberry and peach genomes have orthologs in Rosa dataset. Digital expression was obtained using RNA samples from organs at different development stages and under different stress conditions. qPCR validated the digital expression data for a selection of 23 genes with high or low expression levels. Comparative gene expression analyses between the different tissues and organs allowed the identification of clusters that are highly enriched in given tissues or under particular conditions, demonstrating the usefulness of the digital gene expression analysis. A web interface ROSAseq was created that allows data interrogation by BLAST, subsequent analysis of DNA clusters and access to thorough transcript annotation including best BLAST matches on Fragaria vesca, Prunus persica and Arabidopsis. The rose peptides dataset was used to create the ROSAcyc resource pathway database that allows access to the putative genes and enzymatic pathways. Conclusions The study provides useful information on Rosa expressed genes, with thorough annotation and an overview of expression patterns for transcripts with good accuracy. PMID:23164410
An expression meta-analysis of predicted microRNA targets identifies a diagnostic signature for lung cancer

Directory of Open Access Journals (Sweden)

Liang Yu

2008-12-01

Full Text Available Abstract Background Patients diagnosed with lung adenocarcinoma (AD and squamous cell carcinoma (SCC, two major histologic subtypes of lung cancer, currently receive similar standard treatments, but resistance to adjuvant chemotherapy is prevalent. Identification of differentially expressed genes marking AD and SCC may prove to be of diagnostic value and help unravel molecular basis of their histogenesis and biologies, and deliver more effective and specific systemic therapy. Methods MiRNA target genes were predicted by union of miRanda, TargetScan, and PicTar, followed by screening for matched gene symbols in NCBI human sequences and Gene Ontology (GO terms using the PANTHER database that was also used for analyzing the significance of biological processes and pathways within each ontology term. Microarray data were extracted from Gene Expression Omnibus repository, and tumor subtype prediction by gene expression used Prediction Analysis of Microarrays. Results Computationally predicted target genes of three microRNAs, miR-34b/34c/449, that were detected in human lung, testis, and fallopian tubes but not in other normal tissues, were filtered by representation of GO terms and their ability to classify lung cancer subtypes, followed by a meta-analysis of microarray data to classify AD and SCC. Expression of a minimal set of 17 predicted miR-34b/34c/449 target genes derived from the developmental process GO category was identified from a training set to classify 41 AD and 17 SCC, and correctly predicted in average 87% of 354 AD and 82% of 282 SCC specimens from total 9 independent published datasets. The accuracy of prediction still remains comparable when classifying 103 AD and 79 SCC samples from another 4 published datasets that have only 14 to 16 of the 17 genes available for prediction (84% and 85% for AD and SCC, respectively. Expression of this signature in two published datasets of epithelial cells obtained at bronchoscopy from cigarette
Allele specific expression in worker reproduction genes in the bumblebee Bombus terrestris

Directory of Open Access Journals (Sweden)

Harindra E. Amarasinghe

2015-07-01

Full Text Available Methylation has previously been associated with allele specific expression in ants. Recently, we found methylation is important in worker reproduction in the bumblebee Bombus terrestris. Here we searched for allele specific expression in twelve genes associated with worker reproduction in bees. We found allele specific expression in Ecdysone 20 monooxygenase and IMP-L2-like. Although we were unable to confirm a genetic or epigenetic cause for this allele specific expression, the expression patterns of the two genes match those predicted for imprinted genes.
Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

Science.gov (United States)

Osato, Naoki

2018-01-19

Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

Science.gov (United States)

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
Supervised classification of combined copy number and gene expression data

Directory of Open Access Journals (Sweden)

Riccadonna S.

2007-12-01

Full Text Available In this paper we apply a predictive profiling method to genome copy number aberrations (CNA in combination with gene expression and clinical data to identify molecular patterns of cancer pathophysiology. Predictive models and optimal feature lists for the platforms are developed by a complete validation SVM-based machine learning system. Ranked list of genome CNA sites (assessed by comparative genomic hybridization arrays – aCGH and of differentially expressed genes (assessed by microarray profiling with Affy HG-U133A chips are computed and combined on a breast cancer dataset for the discrimination of Luminal/ ER+ (Lum/ER+ and Basal-like/ER- classes. Different encodings are developed and applied to the CNA data, and predictive variable selection is discussed. We analyze the combination of profiling information between the platforms, also considering the pathophysiological data. A specific subset of patients is identified that has a different response to classification by chromosomal gains and losses and by differentially expressed genes, corroborating the idea that genomic CNA can represent an independent source for tumor classification.
Gene expression analysis of precision-cut human liver slices indicates stable expression of ADME-Tox related genes

NARCIS (Netherlands)

Elferink, M. G. L.; Olinga, P.; van Leeuwen, E. M.; Bauerschmidt, S.; Polman, J.; Schoonen, W. G.; Heisterkamp, S. H.; Groothuis, G. M. M.

2011-01-01

In the process of drug development it is of high importance to test the safety of new drugs with predictive value for human toxicity. A promising approach of toxicity testing is based on shifts in gene expression profiling of the liver. Toxicity screening based on animal liver cells cannot be
Time course of gene expression during mouse skeletal muscle hypertrophy.

Science.gov (United States)

Chaillou, Thomas; Lee, Jonah D; England, Jonathan H; Esser, Karyn A; McCarthy, John J

2013-10-01

The purpose of this study was to perform a comprehensive transcriptome analysis during skeletal muscle hypertrophy to identify signaling pathways that are operative throughout the hypertrophic response. Global gene expression patterns were determined from microarray results on days 1, 3, 5, 7, 10, and 14 during plantaris muscle hypertrophy induced by synergist ablation in adult mice. Principal component analysis and the number of differentially expressed genes (cutoffs ≥2-fold increase or ≥50% decrease compared with control muscle) revealed three gene expression patterns during overload-induced hypertrophy: early (1 day), intermediate (3, 5, and 7 days), and late (10 and 14 days) patterns. Based on the robust changes in total RNA content and in the number of differentially expressed genes, we focused our attention on the intermediate gene expression pattern. Ingenuity Pathway Analysis revealed a downregulation of genes encoding components of the branched-chain amino acid degradation pathway during hypertrophy. Among these genes, five were predicted by Ingenuity Pathway Analysis or previously shown to be regulated by the transcription factor Kruppel-like factor-15, which was also downregulated during hypertrophy. Moreover, the integrin-linked kinase signaling pathway was activated during hypertrophy, and the downregulation of muscle-specific micro-RNA-1 correlated with the upregulation of five predicted targets associated with the integrin-linked kinase pathway. In conclusion, we identified two novel pathways that may be involved in muscle hypertrophy, as well as two upstream regulators (Kruppel-like factor-15 and micro-RNA-1) that provide targets for future studies investigating the importance of these pathways in muscle hypertrophy.
PREDICTION OF THE COURSE OF OSTEOARTHROSIS FROM mTOR (MAMMALIAN TARGET OF RAPAMYCIN GENE EXPRESSION

Directory of Open Access Journals (Sweden)

E V Chetina

2012-01-01

Results. Analysis of gene expression in the outpatients with OA identified two subgroups: in one subgroup (n = 13 mTOR expression was considerably much less than that in the control group; the expression of ATG1 and p21 did not differ greatly from the control and that of caspase 3 and TNF-α was significantly higher. The other outpatients (n = 20 and all the examined patients needing endoprosthetic replacement were ascertained to have a higher gene expression of mTOR, ATG1, p21, caspase 3, and TNF-α than in the control group. Before endoprosthetic replacement, severe joint destruction in patients with OA was associated with enhanced gene expression of mTOR, ATG1, p21, and caspase 3. Conclusion. In early-stage disease, increased mTOR gene expression may serve as a prognostic marker of the severity of the disease and articular cartilage destruction.

ZCCHC17 is a master regulator of synaptic gene expression in Alzheimer's disease.

Science.gov (United States)

Tomljanovic, Zeljko; Patel, Mitesh; Shin, William; Califano, Andrea; Teich, Andrew F

2018-02-01

In an effort to better understand the molecular drivers of synaptic and neurophysiologic dysfunction in Alzheimer's disease (AD), we analyzed neuronal gene expression data from human AD brain tissue to identify master regulators of synaptic gene expression. Master regulator analysis identifies ZCCHC17 as normally supporting the expression of a network of synaptic genes, and predicts that ZCCHC17 dysfunction in AD leads to lower expression of these genes. We demonstrate that ZCCHC17 is normally expressed in neurons and is reduced early in the course of AD pathology. We show that ZCCHC17 loss in rat neurons leads to lower expression of the majority of the predicted synaptic targets and that ZCCHC17 drives the expression of a similar gene network in humans and rats. These findings support a conserved function for ZCCHC17 between species and identify ZCCHC17 loss as an important early driver of lower synaptic gene expression in AD. Matlab and R scripts used in this paper are available at https://github.com/afteich/AD_ZCC. aft25@cumc.columbia.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Prediction of epigenetically regulated genes in breast cancer cell lines

Energy Technology Data Exchange (ETDEWEB)

Loss, Leandro A; Sadanandam, Anguraj; Durinck, Steffen; Nautiyal, Shivani; Flaucher, Diane; Carlton, Victoria EH; Moorhead, Martin; Lu, Yontao; Gray, Joe W; Faham, Malek; Spellman, Paul; Parvin, Bahram

2010-05-04

panel of breast cancer cell lines. Subnetwork enrichment of these genes has identifed 35 common regulators with 6 or more predicted markers. In addition to identifying epigenetically regulated genes, we show evidence of differentially expressed methylation patterns between the basal and luminal subtypes. Our results indicate that the proposed computational protocol is a viable platform for identifying epigenetically regulated genes. Our protocol has generated a list of predictors including COL1A2, TOP2A, TFF1, and VAV3, genes whose key roles in epigenetic regulation is documented in the literature. Subnetwork enrichment of these predicted markers further suggests that epigenetic regulation of individual genes occurs in a coordinated fashion and through common regulators.
A machine learned classifier that uses gene expression data to accurately predict estrogen receptor status.

Directory of Open Access Journals (Sweden)

Meysam Bastani

Full Text Available BACKGROUND: Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER status of the tumor. However, the standard for determining this status, immunohistochemical analysis of formalin-fixed paraffin embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results. METHODS: To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin fixed tumor. RESULTS: This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status. CONCLUSIONS: Our efficient and parsimonious classifier lends itself to high throughput, highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions.
A 65‑gene signature for prognostic prediction in colon adenocarcinoma.

Science.gov (United States)

Jiang, Hui; Du, Jun; Gu, Jiming; Jin, Liugen; Pu, Yong; Fei, Bojian

2018-04-01

The aim of the present study was to examine the molecular factors associated with the prognosis of colon cancer. Gene expression datasets were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus databases to screen differentially expressed genes (DEGs) between colon cancer samples and normal samples. Survival‑related genes were selected from the DEGs using the Cox regression method. A co‑expression network of survival‑related genes was then constructed, and functional clusters were extracted from this network. The significantly enriched functions and pathways of the genes in the network were identified. Using Bayesian discriminant analysis, a prognostic prediction system was established to distinguish the positive from negative prognostic samples. The discrimination efficacy of the system was validated in the GSE17538 dataset using Kaplan‑Meier survival analysis. A total of 636 and 1,892 DEGs between the colon cancer samples and normal samples were screened from the TCGA and GSE44861 dataset, respectively. There were 155 survival‑related genes selected. The co‑expression network of survival‑related genes included 138 genes, 534 lines (connections) and five functional clusters, including the signaling pathway, cellular response to cAMP, and immune system process functional clusters. The molecular function, cellular components and biological processes were the significantly enriched functions. The peroxisome proliferator‑activated receptor signaling pathway, Wnt signaling pathway, B cell receptor signaling pathway, and cytokine‑cytokine receptor interactions were the significant pathways. A prognostic prediction system based on a 65‑gene signature was established using this co‑expression network. Its discriminatory effect was validated in the TCGA dataset (P=3.56e‑12) and the GSE17538 dataset (P=1.67e‑6). The 65‑gene signature included kallikrein‑related peptidase 6 (KLK6), collagen type XI α1 (COL11A1), cartilage
Gene expression in early stage cervical cancer

NARCIS (Netherlands)

Biewenga, Petra; Buist, Marrije R.; Moerland, Perry D.; van Thernaat, Emiel Ver Loren; van Kampen, Antoine H. C.; ten Kate, Fiebo J. W.; Baas, Frank

2008-01-01

Objective. Pelvic lymph node metastases are the main prognostic factor for survival in early stage cervical cancer, yet accurate detection methods before surgery are lacking. In this study, we examined whether gene expression profiling can predict the presence of lymph node metastasis in early stage
A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress.

Science.gov (United States)

Cheng, Ching-Hsue; Chan, Chia-Pang; Yang, Jun-He

2018-01-01

The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.
Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation

Energy Technology Data Exchange (ETDEWEB)

Elias, Dwayne A.; Mukhopadhyay, Aindrila; Joachimiak, Marcin P.; Drury, Elliott C.; Redding, Alyssa M.; Yen, Huei-Che B.; Fields, Matthew W.; Hazen, Terry C.; Arkin, Adam P.; Keasling, Jay D.; Wall, Judy D.

2008-10-27

Hypothetical and conserved hypothetical genes account for>30percent of sequenced bacterial genomes. For the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough, 347 of the 3634 genes were annotated as conserved hypothetical (9.5percent) along with 887 hypothetical genes (24.4percent). Given the large fraction of the genome, it is plausible that some of these genes serve critical cellular roles. The study goals were to determine which genes were expressed and provide a more functionally based annotation. To accomplish this, expression profiles of 1234 hypothetical and conserved genes were used from transcriptomic datasets of 11 environmental stresses, complemented with shotgun LC-MS/MS and AMT tag proteomic data. Genes were divided into putatively polycistronic operons and those predicted to be monocistronic, then classified by basal expression levels and grouped according to changes in expression for one or multiple stresses. 1212 of these genes were transcribed with 786 producing detectable proteins. There was no evidence for expression of 17 predicted genes. Except for the latter, monocistronic gene annotation was expanded using the above criteria along with matching Clusters of Orthologous Groups. Polycistronic genes were annotated in the same manner with inferences from their proximity to more confidently annotated genes. Two targeted deletion mutants were used as test cases to determine the relevance of the inferred functional annotations.
Predicting multi-level drug response with gene expression profile in multiple myeloma using hierarchical ordinal regression.

Science.gov (United States)

Zhang, Xinyan; Li, Bingzong; Han, Huiying; Song, Sha; Xu, Hongxia; Hong, Yating; Yi, Nengjun; Zhuang, Wenzhuo

2018-05-10

Multiple myeloma (MM), like other cancers, is caused by the accumulation of genetic abnormalities. Heterogeneity exists in the patients' response to treatments, for example, bortezomib. This urges efforts to identify biomarkers from numerous molecular features and build predictive models for identifying patients that can benefit from a certain treatment scheme. However, previous studies treated the multi-level ordinal drug response as a binary response where only responsive and non-responsive groups are considered. It is desirable to directly analyze the multi-level drug response, rather than combining the response to two groups. In this study, we present a novel method to identify significantly associated biomarkers and then develop ordinal genomic classifier using the hierarchical ordinal logistic model. The proposed hierarchical ordinal logistic model employs the heavy-tailed Cauchy prior on the coefficients and is fitted by an efficient quasi-Newton algorithm. We apply our hierarchical ordinal regression approach to analyze two publicly available datasets for MM with five-level drug response and numerous gene expression measures. Our results show that our method is able to identify genes associated with the multi-level drug response and to generate powerful predictive models for predicting the multi-level response. The proposed method allows us to jointly fit numerous correlated predictors and thus build efficient models for predicting the multi-level drug response. The predictive model for the multi-level drug response can be more informative than the previous approaches. Thus, the proposed approach provides a powerful tool for predicting multi-level drug response and has important impact on cancer studies.
Density based pruning for identification of differentially expressed genes from microarray data

Directory of Open Access Journals (Sweden)

Xu Jia

2010-11-01

Full Text Available Abstract Motivation Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. Results We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Conclusions Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune
Integrative miRNA-Gene Expression Analysis Enables Refinement of Associated Biology and Prediction of Response to Cetuximab in Head and Neck Squamous Cell Cancer

Directory of Open Access Journals (Sweden)

Loris De Cecco

2017-01-01

Full Text Available This paper documents the process by which we, through gene and miRNA expression profiling of the same samples of head and neck squamous cell carcinomas (HNSCC and an integrative miRNA-mRNA expression analysis, were able to identify candidate biomarkers of progression-free survival (PFS in patients treated with cetuximab-based approaches. Through sparse partial least square–discriminant analysis (sPLS-DA and supervised analysis, 36 miRNAs were identified in two components that clearly separated long- and short-PFS patients. Gene set enrichment analysis identified a significant correlation between the miRNA first-component and EGFR signaling, keratinocyte differentiation, and p53. Another significant correlation was identified between the second component and RAS, NOTCH, immune/inflammatory response, epithelial–mesenchymal transition (EMT, and angiogenesis pathways. Regularized canonical correlation analysis of sPLS-DA miRNA and gene data combined with the MAGIA2 web-tool highlighted 16 miRNAs and 84 genes that were interconnected in a total of 245 interactions. After feature selection by a smoothed t-statistic support vector machine, we identified three miRNAs and five genes in the miRNA-gene network whose expression result was the most relevant in predicting PFS (Area Under the Curve, AUC = 0.992. Overall, using a well-defined clinical setting and up-to-date bioinformatics tools, we are able to give the proof of principle that an integrative miRNA-mRNA expression could greatly contribute to the refinement of the biology behind a predictive model.
Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

Directory of Open Access Journals (Sweden)

Jing Zhao

Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.
Synergistic interactions of biotic and abiotic environmental stressors on gene expression.

Science.gov (United States)

Altshuler, Ianina; McLeod, Anne M; Colbourne, John K; Yan, Norman D; Cristescu, Melania E

2015-03-01

Understanding the response of organisms to multiple stressors is critical for predicting if populations can adapt to rapid environmental change. Natural and anthropogenic stressors often interact, complicating general predictions. In this study, we examined the interactive and cumulative effects of two common environmental stressors, lowered calcium concentration, an anthropogenic stressor, and predator presence, a natural stressor, on the water flea Daphnia pulex. We analyzed expression changes of five genes involved in calcium homeostasis - cuticle proteins (Cutie, Icp2), calbindin (Calb), and calcium pump and channel (Serca and Ip3R) - using real-time quantitative PCR (RT-qPCR) in a full factorial experiment. We observed strong synergistic interactions between low calcium concentration and predator presence. While the Ip3R gene was not affected by the stressors, the other four genes were affected in their transcriptional levels by the combination of the stressors. Transcriptional patterns of genes that code for cuticle proteins (Cutie and Icp2) and a sarcoplasmic calcium pump (Serca) only responded to the combination of stressors, changing their relative expression levels in a synergistic response, while a calcium-binding protein (Calb) responded to low calcium stress and the combination of both stressors. The expression pattern of these genes (Cutie, Icp2, and Serca) were nonlinear, yet they were dose dependent across the calcium gradient. Multiple stressors can have complex, often unexpected effects on ecosystems. This study demonstrates that the dominant interaction for the set of tested genes appears to be synergism. We argue that gene expression patterns can be used to understand and predict the type of interaction expected when organisms are exposed simultaneously to natural and anthropogenic stressors.
Blood-based gene-expression predictors of PTSD risk and resilience among deployed marines: a pilot study.

Science.gov (United States)

Glatt, Stephen J; Tylee, Daniel S; Chandler, Sharon D; Pazol, Joel; Nievergelt, Caroline M; Woelk, Christopher H; Baker, Dewleen G; Lohr, James B; Kremen, William S; Litz, Brett T; Tsuang, Ming T

2013-06-01

Susceptibility to PTSD is determined by both genes and environment. Similarly, gene-expression levels in peripheral blood are influenced by both genes and environment, and expression levels of many genes show good correspondence between peripheral blood and brain. Therefore, our objectives were to test the following hypotheses: (1) pre-trauma expression levels of a gene subset (particularly immune-system genes) in peripheral blood would differ between trauma-exposed Marines who later developed PTSD and those who did not; (2) a predictive biomarker panel of the eventual emergence of PTSD among high-risk individuals could be developed based on gene expression in readily assessable peripheral blood cells; and (3) a predictive panel based on expression of individual exons would surpass the accuracy of a model based on expression of full-length gene transcripts. Gene-expression levels were assayed in peripheral blood samples from 50 U.S. Marines (25 eventual PTSD cases and 25 non-PTSD comparison subjects) prior to their deployment overseas to war-zones in Iraq or Afghanistan. The panel of biomarkers dysregulated in peripheral blood cells of eventual PTSD cases prior to deployment was significantly enriched for immune genes, achieved 70% prediction accuracy in an independent sample based on the expression of 23 full-length transcripts, and attained 80% accuracy in an independent sample based on the expression of one exon from each of five genes. If the observed profiles of pre-deployment mRNA-expression in eventual PTSD cases can be further refined and replicated, they could suggest avenues for early intervention and prevention among individuals at high risk for trauma exposure. Copyright © 2013 Wiley Periodicals, Inc.
A network-based predictive gene-expression signature for adjuvant chemotherapy benefit in stage II colorectal cancer.

Science.gov (United States)

Cao, Bangrong; Luo, Liping; Feng, Lin; Ma, Shiqi; Chen, Tingqing; Ren, Yuan; Zha, Xiao; Cheng, Shujun; Zhang, Kaitai; Chen, Changmin

2017-12-13

The clinical benefit of adjuvant chemotherapy for stage II colorectal cancer (CRC) is controversial. This study aimed to explore novel gene signature to predict outcome benefit of postoperative 5-Fu-based therapy in stage II CRC. Gene-expression profiles of stage II CRCs from two datasets with 5-Fu-based adjuvant chemotherapy (training dataset, n = 212; validation dataset, n = 85) were analyzed to identify the indicator. A systemic approach by integrating gene-expression and protein-protein interaction (PPI) network was implemented to develop the predictive signature. Kaplan-Meier curves and Cox proportional hazards model were used to determine the survival benefit of adjuvant chemotherapy. Experiments with shRNA knock-down were carried out to confirm the signature identified in this study. In the training dataset, we identified 44 PPI sub-modules, by which we separate patients into two clusters (1 and 2) having different chemotherapeutic benefit. A predictor of 11 PPI sub-modules (11-PPI-Mod) was established to discriminate the two sub-groups, with an overall accuracy of 90.1%. This signature was independently validated in an external validation dataset. Kaplan-Meier curves showed an improved outcome for patients who received adjuvant chemotherapy in Cluster 1 sub-group, but even worse survival for those in Cluster 2 sub-group. Similar results were found in both the training and the validation dataset. Multivariate Cox regression revealed an interaction effect between 11-PPI-Mod signature and adjuvant therapy treatment in the training dataset (RFS, p = 0.007; OS, p = 0.006) and the validation dataset (RFS, p = 0.002). From the signature, we found that PTGES gene was up-regulated in CRC cells which were more resistant to 5-Fu. Knock-down of PTGES indicated a growth inhibition and up-regulation of apoptotic markers induced by 5-Fu in CRC cells. Only a small proportion of stage II CRC patients could benefit from adjuvant therapy. The 11-PPI-Mod as
Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

Science.gov (United States)

Arnaiz, Olivier; Van Dijk, Erwin; Bétermier, Mireille; Lhuillier-Akakpo, Maoussi; de Vanssay, Augustin; Duharcourt, Sandra; Sallet, Erika; Gouzy, Jérôme; Sperling, Linda

2017-06-26

The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis
A network approach to predict pathogenic genes for Fusarium graminearum.

Science.gov (United States)

Liu, Xiaoping; Tang, Wei-Hua; Zhao, Xing-Ming; Chen, Luonan

2010-10-04

Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB), which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN) of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other pathogenic fungi, which
A network approach to predict pathogenic genes for Fusarium graminearum.

Directory of Open Access Journals (Sweden)

Xiaoping Liu

Full Text Available Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB, which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other
Identification of a robust gene signature that predicts breast cancer outcome in independent data sets

International Nuclear Information System (INIS)

Korkola, James E; Waldman, Frederic M; Blaveri, Ekaterina; DeVries, Sandy; Moore, Dan H II; Hwang, E Shelley; Chen, Yunn-Yi; Estep, Anne LH; Chew, Karen L; Jensen, Ronald H

2007-01-01

Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients. We profiled 162 breast tumors using expression microarrays to stratify tumors based on gene expression. A subset of 55 tumors with extensive follow-up was used to identify gene sets that predicted outcome. The predictive gene set was further tested in previously published data sets. We used different statistical methods to identify three gene sets associated with disease free survival. A fourth gene set, consisting of 21 genes in common to all three sets, also had the ability to predict patient outcome. To validate the predictive utility of this derived gene set, it was tested in two published data sets from other groups. This gene set resulted in significant separation of patients on the basis of survival in these data sets, correctly predicting outcome in 62–65% of patients. By comparing outcome prediction within subgroups based on ER status, grade, and nodal status, we found that our gene set was most effective in predicting outcome in ER positive and node negative tumors. This robust gene selection with extensive validation has identified a predictive gene set that may have clinical utility for outcome prediction in breast cancer patients
Interdependence of cell growth and gene expression: origins and consequences.

Science.gov (United States)

Scott, Matthew; Gunderson, Carl W; Mateescu, Eduard M; Zhang, Zhongge; Hwa, Terence

2010-11-19

In bacteria, the rate of cell proliferation and the level of gene expression are intimately intertwined. Elucidating these relations is important both for understanding the physiological functions of endogenous genetic circuits and for designing robust synthetic systems. We describe a phenomenological study that reveals intrinsic constraints governing the allocation of resources toward protein synthesis and other aspects of cell growth. A theory incorporating these constraints can accurately predict how cell proliferation and gene expression affect one another, quantitatively accounting for the effect of translation-inhibiting antibiotics on gene expression and the effect of gratuitous protein expression on cell growth. The use of such empirical relations, analogous to phenomenological laws, may facilitate our understanding and manipulation of complex biological systems before underlying regulatory circuits are elucidated.
Five putative nucleoside triphosphate diphosphohydrolase genes are expressed in Trichomonas vaginalis.

Science.gov (United States)

Frasson, Amanda Piccoli; Dos Santos, Odelta; Meirelles, Lúcia Collares; Macedo, Alexandre José; Tasca, Tiana

2016-01-01

Trichomonas vaginalis is a protozoan that parasitizes the human urogenital tract causing trichomoniasis, the most common non-viral sexually transmitted disease. The parasite has unique genomic characteristics such as a large genome size and expanded gene families. Ectonucleoside triphosphate diphosphohydrolase (E-NTPDase) is an enzyme responsible for hydrolyzing nucleoside tri- and diphosphates and has already been biochemically characterized in T. vaginalis. Considering the important role of this enzyme in the production of extracellular adenosine for parasite uptake, we evaluated the gene expression of five putative NTPDases in T. vaginalis. We showed that all five putative TvNTPDase genes (TvNTPDase1-5) were expressed by both fresh clinical and long-term grown isolates. The amino acid alignment predicted the presence of the five crucial apyrase conserved regions, transmembrane domains, signal peptides, phosphorylation and catalytic sites. Moreover, a phylogenetic analysis showed that TvNTPDase sequences make up a clade with NTPDases intracellularly located. Biochemical NTPDase activity (ATP and ADP hydrolysis) is responsive to the serum-restrictive conditions and the gene expression of TvNTPDases was mostly increased, mainly TvNTPDase2 and TvNTPDase4, although there was not a clear pattern of expression among them. In summary, the present report demonstrates the gene expression patterns of predicted NTPDases in T. vaginalis. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Interpreting expression data with metabolic flux models: predicting Mycobacterium tuberculosis mycolic acid production.

Directory of Open Access Journals (Sweden)

Caroline Colijn

2009-08-01

Full Text Available Metabolism is central to cell physiology, and metabolic disturbances play a role in numerous disease states. Despite its importance, the ability to study metabolism at a global scale using genomic technologies is limited. In principle, complete genome sequences describe the range of metabolic reactions that are possible for an organism, but cannot quantitatively describe the behaviour of these reactions. We present a novel method for modeling metabolic states using whole cell measurements of gene expression. Our method, which we call E-Flux (as a combination of flux and expression, extends the technique of Flux Balance Analysis by modeling maximum flux constraints as a function of measured gene expression. In contrast to previous methods for metabolically interpreting gene expression data, E-Flux utilizes a model of the underlying metabolic network to directly predict changes in metabolic flux capacity. We applied E-Flux to Mycobacterium tuberculosis, the bacterium that causes tuberculosis (TB. Key components of mycobacterial cell walls are mycolic acids which are targets for several first-line TB drugs. We used E-Flux to predict the impact of 75 different drugs, drug combinations, and nutrient conditions on mycolic acid biosynthesis capacity in M. tuberculosis, using a public compendium of over 400 expression arrays. We tested our method using a model of mycolic acid biosynthesis as well as on a genome-scale model of M. tuberculosis metabolism. Our method correctly predicts seven of the eight known fatty acid inhibitors in this compendium and makes accurate predictions regarding the specificity of these compounds for fatty acid biosynthesis. Our method also predicts a number of additional potential modulators of TB mycolic acid biosynthesis. E-Flux thus provides a promising new approach for algorithmically predicting metabolic state from gene expression data.
Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues.

Science.gov (United States)

Wheeler, Heather E; Shah, Kaanan P; Brenner, Jonathon; Garcia, Tzintzuni; Aquino-Michaels, Keston; Cox, Nancy J; Nicolae, Dan L; Im, Hae Kyung

2016-11-01

Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).
Gene expression patterns in peripheral blood correlate with the extent of coronary artery disease.

Directory of Open Access Journals (Sweden)

Peter R Sinnaeve

Full Text Available Systemic and local inflammation plays a prominent role in the pathogenesis of atherosclerotic coronary artery disease, but the relationship of whole blood gene expression changes with coronary disease remains unclear. We have investigated whether gene expression patterns in peripheral blood correlate with the severity of coronary disease and whether these patterns correlate with the extent of atherosclerosis in the vascular wall. Patients were selected according to their coronary artery disease index (CADi, a validated angiographical measure of the extent of coronary atherosclerosis that correlates with outcome. RNA was extracted from blood of 120 patients with at least a stenosis greater than 50% (CADi > or = 23 and from 121 controls without evidence of coronary stenosis (CADi = 0. 160 individual genes were found to correlate with CADi (rho > 0.2, P<0.003. Prominent differential expression was observed especially in genes involved in cell growth, apoptosis and inflammation. Using these 160 genes, a partial least squares multivariate regression model resulted in a highly predictive model (r(2 = 0.776, P<0.0001. The expression pattern of these 160 genes in aortic tissue also predicted the severity of atherosclerosis in human aortas, showing that peripheral blood gene expression associated with coronary atherosclerosis mirrors gene expression changes in atherosclerotic arteries. In conclusion, the simultaneous expression pattern of 160 genes in whole blood correlates with the severity of coronary artery disease and mirrors expression changes in the atherosclerotic vascular wall.
Inferring metabolic states in uncharacterized environments using gene-expression measurements.

Directory of Open Access Journals (Sweden)

Sergio Rossell

Full Text Available The large size of metabolic networks entails an overwhelming multiplicity in the possible steady-state flux distributions that are compatible with stoichiometric constraints. This space of possibilities is largest in the frequent situation where the nutrients available to the cells are unknown. These two factors: network size and lack of knowledge of nutrient availability, challenge the identification of the actual metabolic state of living cells among the myriad possibilities. Here we address this challenge by developing a method that integrates gene-expression measurements with genome-scale models of metabolism as a means of inferring metabolic states. Our method explores the space of alternative flux distributions that maximize the agreement between gene expression and metabolic fluxes, and thereby identifies reactions that are likely to be active in the culture from which the gene-expression measurements were taken. These active reactions are used to build environment-specific metabolic models and to predict actual metabolic states. We applied our method to model the metabolic states of Saccharomyces cerevisiae growing in rich media supplemented with either glucose or ethanol as the main energy source. The resulting models comprise about 50% of the reactions in the original model, and predict environment-specific essential genes with high sensitivity. By minimizing the sum of fluxes while forcing our predicted active reactions to carry flux, we predicted the metabolic states of these yeast cultures that are in large agreement with what is known about yeast physiology. Most notably, our method predicts the Crabtree effect in yeast cells growing in excess glucose, a long-known phenomenon that could not have been predicted by traditional constraint-based modeling approaches. Our method is of immediate practical relevance for medical and industrial applications, such as the identification of novel drug targets, and the development of
A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress

Science.gov (United States)

2018-01-01

The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies. PMID:29765399
A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress

Directory of Open Access Journals (Sweden)

Ching-Hsue Cheng

2018-01-01

Full Text Available The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i the proposed model is different from the previous models lacking the concept of time series; (ii the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.
Construction and evaluation of yeast expression networks by database-guided predictions

Directory of Open Access Journals (Sweden)

Katharina Papsdorf

2016-05-01

Full Text Available DNA-Microarrays are powerful tools to obtain expression data on the genome-wide scale. We performed microarray experiments to elucidate the transcriptional networks, which are up- or down-regulated in response to the expression of toxic polyglutamine proteins in yeast. Such experiments initially generate hit lists containing differentially expressed genes. To look into transcriptional responses, we constructed networks from these genes. We therefore developed an algorithm, which is capable of dealing with very small numbers of microarrays by clustering the hits based on co-regulatory relationships obtained from the SPELL database. Here, we evaluate this algorithm according to several criteria and further develop its statistical capabilities. Initially, we define how the number of SPELL-derived co-regulated genes and the number of input hits influences the quality of the networks. We then show the ability of our networks to accurately predict further differentially expressed genes. Including these predicted genes into the networks improves the network quality and allows quantifying the predictive strength of the networks based on a newly implemented scoring method. We find that this approach is useful for our own experimental data sets and also for many other data sets which we tested from the SPELL microarray database. Furthermore, the clusters obtained by the described algorithm greatly improve the assignment to biological processes and transcription factors for the individual clusters. Thus, the described clustering approach, which will be available through the ClusterEx web interface, and the evaluation parameters derived from it represent valuable tools for the fast and informative analysis of yeast microarray data.
The SKP1-like gene family of Arabidopsis exhibits a high degree of differential gene expression and gene product interaction during development.

Directory of Open Access Journals (Sweden)

Mohammad H Dezfulian

Full Text Available The Arabidopsis thaliana genome encodes several families of polypeptides that are known or predicted to participate in the formation of the SCF-class of E3-ubiquitin ligase complexes. One such gene family encodes the Skp1-like class of polypeptide subunits, where 21 genes have been identified and are known to be expressed in Arabidopsis. Phylogenetic analysis based on deduced polypeptide sequence organizes the family of ASK proteins into 7 clades. The complexity of the ASK gene family, together with the close structural similarity among its members raises the prospect of significant functional redundancy among select paralogs. We have assessed the potential for functional redundancy within the ASK gene family by analyzing an expanded set of criteria that define redundancy with higher resolution. The criteria used include quantitative expression of locus-specific transcripts using qRT-PCR, assessment of the sub-cellular localization of individual ASK:YFP auto-fluorescent fusion proteins expressed in vivo as well as the in planta assessment of individual ASK-F-Box protein interactions using bimolecular fluorescent complementation techniques in combination with confocal imagery in live cells. The results indicate significant functional divergence of steady state transcript abundance and protein-protein interaction specificity involving ASK proteins in a pattern that is poorly predicted by sequence-based phylogeny. The information emerging from this and related studies will prove important for defining the functional intersection of expression, localization and gene product interaction that better predicts the formation of discrete SCF complexes, as a prelude to investigating their molecular mode of action.
Gene Expression Profiling to Predict Clinical Outcome of Breast Cancer: reproducing, analyzing and extending the Nature publication by vhVeer et al

NARCIS (Netherlands)

Li R.; Visser, H.M.

2010-01-01

Chemotherapy and hormonal therapy as adjuvant systemic therapies to inhibit breast cancer recurrence are not necessary for each patient. In Veer's paper "Gene expression profiling predicts clinical outcome of breast cancer" (Nature 2002, PMID: 11823860), they introduced a method based on DNA
Development and validation of a gene profile predicting benefit of postmastectomy radiotherapy in patients with high-risk breast cancer: a study of gene expression in the DBCG82bc cohort.

Science.gov (United States)

Tramm, Trine; Mohammed, Hayat; Myhre, Simen; Kyndi, Marianne; Alsner, Jan; Børresen-Dale, Anne-Lise; Sørlie, Therese; Frigessi, Arnoldo; Overgaard, Jens

2014-10-15

To identify genes predicting benefit of radiotherapy in patients with high-risk breast cancer treated with systemic therapy and randomized to receive or not receive postmastectomy radiotherapy (PMRT). The study was based on the Danish Breast Cancer Cooperative Group (DBCG82bc) cohort. Gene-expression analysis was performed in a training set of frozen tumor tissue from 191 patients. Genes were identified through the Lasso method with the endpoint being locoregional recurrence (LRR). A weighted gene-expression index (DBCG-RT profile) was calculated and transferred to quantitative real-time PCR (qRT-PCR) in corresponding formalin-fixed, paraffin-embedded (FFPE) samples, before validation in FFPE from 112 additional patients. Seven genes were identified, and the derived DBCG-RT profile divided the 191 patients into "high LRR risk" and "low LRR risk" groups. PMRT significantly reduced risk of LRR in "high LRR risk" patients, whereas "low LRR risk" patients showed no additional reduction in LRR rate. Technical transfer of the DBCG-RT profile to FFPE/qRT-PCR was successful, and the predictive impact was successfully validated in another 112 patients. A DBCG-RT gene profile was identified and validated, identifying patients with very low risk of LRR and no benefit from PMRT. The profile may provide a method to individualize treatment with PMRT. ©2014 American Association for Cancer Research.
Semi-supervised prediction of gene regulatory networks using machine learning algorithms.

Science.gov (United States)

Patel, Nihir; Wang, Jason T L

2015-10-01

Use of computational methods to predict gene regulatory networks (GRNs) from gene expression data is a challenging task. Many studies have been conducted using unsupervised methods to fulfill the task; however, such methods usually yield low prediction accuracies due to the lack of training data. In this article, we propose semi-supervised methods for GRN prediction by utilizing two machine learning algorithms, namely, support vector machines (SVM) and random forests (RF). The semi-supervised methods make use of unlabelled data for training. We investigated inductive and transductive learning approaches, both of which adopt an iterative procedure to obtain reliable negative training data from the unlabelled data. We then applied our semi-supervised methods to gene expression data of Escherichia coli and Saccharomyces cerevisiae, and evaluated the performance of our methods using the expression data. Our analysis indicated that the transductive learning approach outperformed the inductive learning approach for both organisms. However, there was no conclusive difference identified in the performance of SVM and RF. Experimental results also showed that the proposed semi-supervised methods performed better than existing supervised methods for both organisms.
Alu Elements as Novel Regulators of Gene Expression in Type 1 Diabetes Susceptibility Genes?

Science.gov (United States)

Kaur, Simranjeet; Pociot, Flemming

2015-07-13

Despite numerous studies implicating Alu repeat elements in various diseases, there is sparse information available with respect to the potential functional and biological roles of the repeat elements in Type 1 diabetes (T1D). Therefore, we performed a genome-wide sequence analysis of T1D candidate genes to identify embedded Alu elements within these genes. We observed significant enrichment of Alu elements within the T1D genes (p-value genes harboring Alus revealed significant enrichment for immune-mediated processes (p-value genes harboring inverted Alus (IRAlus) within their 3' untranslated regions (UTRs) that are known to regulate the expression of host mRNAs by generating double stranded RNA duplexes. Our in silico analysis predicted the formation of duplex structures by IRAlus within the 3'UTRs of T1D genes. We propose that IRAlus might be involved in regulating the expression levels of the host T1D genes.
Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

KAUST Repository

Othoum, Ghofran K

2013-05-01

Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using
Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

KAUST Repository

Othoum, Ghofran K

2013-01-01

Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using
Machine learning approaches to supporting the identification of photoreceptor-enriched genes based on expression data

Directory of Open Access Journals (Sweden)

Simpson David

2006-03-01

Full Text Available Abstract Background Retinal photoreceptors are highly specialised cells, which detect light and are central to mammalian vision. Many retinal diseases occur as a result of inherited dysfunction of the rod and cone photoreceptor cells. Development and maintenance of photoreceptors requires appropriate regulation of the many genes specifically or highly expressed in these cells. Over the last decades, different experimental approaches have been developed to identify photoreceptor enriched genes. Recent progress in RNA analysis technology has generated large amounts of gene expression data relevant to retinal development. This paper assesses a machine learning methodology for supporting the identification of photoreceptor enriched genes based on expression data. Results Based on the analysis of publicly-available gene expression data from the developing mouse retina generated by serial analysis of gene expression (SAGE, this paper presents a predictive methodology comprising several in silico models for detecting key complex features and relationships encoded in the data, which may be useful to distinguish genes in terms of their functional roles. In order to understand temporal patterns of photoreceptor gene expression during retinal development, a two-way cluster analysis was firstly performed. By clustering SAGE libraries, a hierarchical tree reflecting relationships between developmental stages was obtained. By clustering SAGE tags, a more comprehensive expression profile for photoreceptor cells was revealed. To demonstrate the usefulness of machine learning-based models in predicting functional associations from the SAGE data, three supervised classification models were compared. The results indicated that a relatively simple instance-based model (KStar model performed significantly better than relatively more complex algorithms, e.g. neural networks. To deal with the problem of functional class imbalance occurring in the dataset, two data re
Elucidating gene function and function evolution through comparison of co-expression networks in plants

Directory of Open Access Journals (Sweden)

Marek eMutwil

2014-08-01

Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.
Analysis of baseline gene expression levels from ...

Science.gov (United States)

The use of gene expression profiling to predict chemical mode of action would be enhanced by better characterization of variance due to individual, environmental, and technical factors. Meta-analysis of microarray data from untreated or vehicle-treated animals within the control arm of toxicogenomics studies has yielded useful information on baseline fluctuations in gene expression. A dataset of control animal microarray expression data was assembled by a working group of the Health and Environmental Sciences Institute's Technical Committee on the Application of Genomics in Mechanism Based Risk Assessment in order to provide a public resource for assessments of variability in baseline gene expression. Data from over 500 Affymetrix microarrays from control rat liver and kidney were collected from 16 different institutions. Thirty-five biological and technical factors were obtained for each animal, describing a wide range of study characteristics, and a subset were evaluated in detail for their contribution to total variability using multivariate statistical and graphical techniques. The study factors that emerged as key sources of variability included gender, organ section, strain, and fasting state. These and other study factors were identified as key descriptors that should be included in the minimal information about a toxicogenomics study needed for interpretation of results by an independent source. Genes that are the most and least variable, gender-selectiv
FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

DEFF Research Database (Denmark)

Manijak, Mieszko P.; Nielsen, Henrik Bjørn

2011-01-01

circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....
Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

Science.gov (United States)

Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

2015-01-27

Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
Radiation-modulated gene expression in C. elegans

International Nuclear Information System (INIS)

Nelson, G.A.; Bayeta, E.; Perez, C.; Lloyd, E.; Jones, T.; Smith, A.; Tian, J.

2003-01-01

Full text: We use the nematode C. elegans to characterize the genotoxic and cytotoxic effects of ionizing radiation with emphasis effects of charged particle radiation and have described the fluence vs. response relationships for mutation, chromosome aberration and certain developmental errors. These endpoints quantify the biological after repair and compensation pathways have completed their work. In order to address the control of these reactions we have turned to gene expression profiling to identify genes that uniquely respond to high LET species or respond differentially as a function of radiation properties. We have employed whole genome microarray methods to map gene expression following exposure to gamma rays, protons and accelerated iron ions. We found that 599 of 17871 genes analyzed showed differential expression 3 hrs after exposure to 3 Gy of at least one radiation types. 193 were up-regulated, 406 were down-regulated, and 90% were affected by only one species of radiation. Genes whose transcription levels responded significantly mapped to definite statistical clusters that were unique for each radiation type. We are now trying to establish the functional relationships of the genes their relevance to mitigation of radiation-induced damage. Three approaches are being used. First, bioinformatics tools are being used to determine the roles of genes in co-regulated gene sets. Second, we are applying the technique of RNA interference to determine whether our radiation-induced genes affect cell survival (measured in terms of embryo survival) and chromosome aberration (intestinal anaphase bridges). Finally we are focussing on the response of the most strongly-regulated gene in our data set. This is the autosomal gene, F36D3.9, whose predicted structure is that of a cysteine protease resembling cathepsin B. An enzymological approach is being used to characterize this gene at the protein level. This work was supported by NASA Cooperative Agreement NCC9-149

Prediction of metabolic flux distribution from gene expression data based on the flux minimization principle.

Directory of Open Access Journals (Sweden)

Hyun-Seob Song

Full Text Available Prediction of possible flux distributions in a metabolic network provides detailed phenotypic information that links metabolism to cellular physiology. To estimate metabolic steady-state fluxes, the most common approach is to solve a set of macroscopic mass balance equations subjected to stoichiometric constraints while attempting to optimize an assumed optimal objective function. This assumption is justifiable in specific cases but may be invalid when tested across different conditions, cell populations, or other organisms. With an aim to providing a more consistent and reliable prediction of flux distributions over a wide range of conditions, in this article we propose a framework that uses the flux minimization principle to predict active metabolic pathways from mRNA expression data. The proposed algorithm minimizes a weighted sum of flux magnitudes, while biomass production can be bounded to fit an ample range from very low to very high values according to the analyzed context. We have formulated the flux weights as a function of the corresponding enzyme reaction's gene expression value, enabling the creation of context-specific fluxes based on a generic metabolic network. In case studies of wild-type Saccharomyces cerevisiae, and wild-type and mutant Escherichia coli strains, our method achieved high prediction accuracy, as gauged by correlation coefficients and sums of squared error, with respect to the experimentally measured values. In contrast to other approaches, our method was able to provide quantitative predictions for both model organisms under a variety of conditions. Our approach requires no prior knowledge or assumption of a context-specific metabolic functionality and does not require trial-and-error parameter adjustments. Thus, our framework is of general applicability for modeling the transcription-dependent metabolism of bacteria and yeasts.
A Classification Framework Applied to Cancer Gene Expression Profiles

Directory of Open Access Journals (Sweden)

Hussein Hijazi

2013-01-01

Full Text Available Classification of cancer based on gene expression has provided insight into possible treatment strategies. Thus, developing machine learning methods that can successfully distinguish among cancer subtypes or normal versus cancer samples is important. This work discusses supervised learning techniques that have been employed to classify cancers. Furthermore, a two-step feature selection method based on an attribute estimation method (e.g., ReliefF and a genetic algorithm was employed to find a set of genes that can best differentiate between cancer subtypes or normal versus cancer samples. The application of different classification methods (e.g., decision tree, k-nearest neighbor, support vector machine (SVM, bagging, and random forest on 5 cancer datasets shows that no classification method universally outperforms all the others. However, k-nearest neighbor and linear SVM generally improve the classification performance over other classifiers. Finally, incorporating diverse types of genomic data (e.g., protein-protein interaction data and gene expression increase the prediction accuracy as compared to using gene expression alone.
Concerted down-regulation of immune-system related genes predicts metastasis in colorectal carcinoma

International Nuclear Information System (INIS)

Fehlker, Marion; Huska, Matthew R; Jöns, Thomas; Andrade-Navarro, Miguel A; Kemmner, Wolfgang

2014-01-01

This study aimed at the identification of prognostic gene expression markers in early primary colorectal carcinomas without metastasis at the time point of surgery by analyzing genome-wide gene expression profiles using oligonucleotide microarrays. Cryo-conserved tumor specimens from 45 patients with early colorectal cancers were examined, with the majority of them being UICC stage II or earlier and with a follow-up time of 41–115 months. Gene expression profiling was performed using Whole Human Genome 4x44K Oligonucleotide Microarrays. Validation of microarray data was performed on five of the genes in a smaller cohort. Using a novel algorithm based on the recursive application of support vector machines (SVMs), we selected a signature of 44 probes that discriminated between patients developing later metastasis and patients with a good prognosis. Interestingly, almost half of the genes was related to the patients’ immune response and showed reduced expression in the metastatic cases. Whereas up to now gene signatures containing genes with various biological functions have been described for prediction of metastasis in CRC, in this study metastasis could be well predicted by a set of gene expression markers consisting exclusively of genes related to the MHC class II complex involved in immune response. Thus, our data emphasize that the proper function of a comprehensive network of immune response genes is of vital importance for the survival of colorectal cancer patients
Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

Science.gov (United States)

Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

2014-05-01

Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

Science.gov (United States)

Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

2018-04-23

Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis
Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities

Science.gov (United States)

Fang, Xin; Sastry, Anand; Mih, Nathan; Kim, Donghyuk; Tan, Justin; Lloyd, Colton J.; Gao, Ye; Yang, Laurence; Palsson, Bernhard O.

2017-01-01

Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN—probably the best characterized TRN—several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism’s TRN from disparate data types. PMID:28874552
Gene expression analysis predicts insect venom anaphylaxis in indolent systemic mastocytosis

NARCIS (Netherlands)

Niedoszytko, M.; Bruinenberg, M.; van Doormaal, J. J.; de Monchy, J. G. R.; Nedoszytko, B.; Koppelman, G. H.; Nawijn, M. C.; Wijmenga, C.; Jassem, E.; Oude Elberink, J. N. G.

P>Background: Anaphylaxis to insect venom (Hymenoptera) is most severe in patients with mastocytosis and may even lead to death. However, not all patients with mastocytosis suffer from anaphylaxis. The aim of the study was to analyze differences in gene expression between patients with indolent
Gene expression profiling reveals multiple toxicity endpoints induced by hepatotoxicants

Energy Technology Data Exchange (ETDEWEB)

Huang Qihong; Jin Xidong; Gaillard, Elias T.; Knight, Brian L.; Pack, Franklin D.; Stoltz, James H.; Jayadev, Supriya; Blanchard, Kerry T

2004-05-18

Microarray technology continues to gain increased acceptance in the drug development process, particularly at the stage of toxicology and safety assessment. In the current study, microarrays were used to investigate gene expression changes associated with hepatotoxicity, the most commonly reported clinical liability with pharmaceutical agents. Acetaminophen, methotrexate, methapyrilene, furan and phenytoin were used as benchmark compounds capable of inducing specific but different types of hepatotoxicity. The goal of the work was to define gene expression profiles capable of distinguishing the different subtypes of hepatotoxicity. Sprague-Dawley rats were orally dosed with acetaminophen (single dose, 4500 mg/kg for 6, 24 and 72 h), methotrexate (1 mg/kg per day for 1, 7 and 14 days), methapyrilene (100 mg/kg per day for 3 and 7 days), furan (40 mg/kg per day for 1, 3, 7 and 14 days) or phenytoin (300 mg/kg per day for 14 days). Hepatic gene expression was assessed using toxicology-specific gene arrays containing 684 target genes or expressed sequence tags (ESTs). Principal component analysis (PCA) of gene expression data was able to provide a clear distinction of each compound, suggesting that gene expression data can be used to discern different hepatotoxic agents and toxicity endpoints. Gene expression data were applied to the multiplicity-adjusted permutation test and significantly changed genes were categorized and correlated to hepatotoxic endpoints. Repression of enzymes involved in lipid oxidation (acyl-CoA dehydrogenase, medium chain, enoyl CoA hydratase, very long-chain acyl-CoA synthetase) were associated with microvesicular lipidosis. Likewise, subsets of genes associated with hepatotocellular necrosis, inflammation, hepatitis, bile duct hyperplasia and fibrosis have been identified. The current study illustrates that expression profiling can be used to: (1) distinguish different hepatotoxic endpoints; (2) predict the development of toxic endpoints; and
Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

Directory of Open Access Journals (Sweden)

Benjamin Mayne

2016-10-01

Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.
Regulation of methane genes and genome expression

Energy Technology Data Exchange (ETDEWEB)

John N. Reeve

2009-09-09

At the start of this project, it was known that methanogens were Archaeabacteria (now Archaea) and were therefore predicted to have gene expression and regulatory systems different from Bacteria, but few of the molecular biology details were established. The goals were then to establish the structures and organizations of genes in methanogens, and to develop the genetic technologies needed to investigate and dissect methanogen gene expression and regulation in vivo. By cloning and sequencing, we established the gene and operon structures of all of the “methane” genes that encode the enzymes that catalyze methane biosynthesis from carbon dioxide and hydrogen. This work identified unique sequences in the methane gene that we designated mcrA, that encodes the largest subunit of methyl-coenzyme M reductase, that could be used to identify methanogen DNA and establish methanogen phylogenetic relationships. McrA sequences are now the accepted standard and used extensively as hybridization probes to identify and quantify methanogens in environmental research. With the methane genes in hand, we used northern blot and then later whole-genome microarray hybridization analyses to establish how growth phase and substrate availability regulated methane gene expression in Methanobacterium thermautotrophicus ΔH (now Methanothermobacter thermautotrophicus). Isoenzymes or pairs of functionally equivalent enzymes catalyze several steps in the hydrogen-dependent reduction of carbon dioxide to methane. We established that hydrogen availability determine which of these pairs of methane genes is expressed and therefore which of the alternative enzymes is employed to catalyze methane biosynthesis under different environmental conditions. As were unable to establish a reliable genetic system for M. thermautotrophicus, we developed in vitro transcription as an alternative system to investigate methanogen gene expression and regulation. This led to the discovery that an archaeal protein
The rapamycin-regulated gene expression signature determines prognosis for breast cancer

Directory of Open Access Journals (Sweden)

Tsavachidis Spiridon

2009-09-01

Full Text Available Abstract Background Mammalian target of rapamycin (mTOR is a serine/threonine kinase involved in multiple intracellular signaling pathways promoting tumor growth. mTOR is aberrantly activated in a significant portion of breast cancers and is a promising target for treatment. Rapamycin and its analogues are in clinical trials for breast cancer treatment. Patterns of gene expression (metagenes may also be used to simulate a biologic process or effects of a drug treatment. In this study, we tested the hypothesis that the gene-expression signature regulated by rapamycin could predict disease outcome for patients with breast cancer. Results Colony formation and sulforhodamine B (IC50 in vitro and in vivo gene expression data identified a signature, termed rapamycin metagene index (RMI, of 31 genes upregulated by rapamycin treatment in vitro as well as in vivo (false discovery rate of 10%. In the Miller dataset, RMI did not correlate with tumor size or lymph node status. High (>75th percentile RMI was significantly associated with longer survival (P = 0.015. On multivariate analysis, RMI (P = 0.029, tumor size (P = 0.015 and lymph node status (P = 0.001 were prognostic. In van 't Veer study, RMI was not associated with the time to develop distant metastasis (P = 0.41. In the Wang dataset, RMI predicted time to disease relapse (P = 0.009. Conclusion Rapamycin-regulated gene expression signature predicts clinical outcome in breast cancer. This supports the central role of mTOR signaling in breast cancer biology and provides further impetus to pursue mTOR-targeted therapies for breast cancer treatment.
EVALUATION OF THE PROGNOSTIC VALUE OF nm23 GENE EXPRESSION IN BREAST CANCER

Institute of Scientific and Technical Information of China (English)

刘红; 毛慧生; 傅西林; 方志沂; 冯玉梅; 范宇; 李树玲

2002-01-01

Objective: To investigate the expression of nm23 gene and evaluate its prognostic value in breast cancer. Methods: nm23 expressions were detected in 101 breast cancer patients (group 1) by immunohistochemistry. RT-PCR and immunohistochemistry were used to measure expressions of nm23 gene in another 68 patients with breast cancer (group 2). Results: nm23 gene expression in group 1 was inversely associated with distant metastasis and lymph node metastasis (P<0.05). In 44 patients with negative lymph node, 9 cases progressed to distant metastasis, 7 of them (77.8%) showed low expression of nm23 gene (P<0.05). In 57 patients with positive lymph node, 24 our of 29 patients who had no distant metastasis (82.8%) expressed nm23 gene at high level (P<0.05). Meanwhile, there were 6 patients with distant metastasis in the group 2, all of thenm expressed nm23 gene mRNA at low level. Conclusion: The results showed that nm23 gene might play an independent role in predicting prognosis of breast cancer.
Aberrant Gene Expression in Acute Myeloid Leukaemia

DEFF Research Database (Denmark)

Bagger, Frederik Otzen

model to investigate the role of telomerase in AML, we were able to translate the observed effect into human AML patients and identify specific genes involved, which also predict survival patterns in AML patients. During these studies we have applied methods for investigating differentially expressed......-based gene-lookup webservices, called HemaExplorer and BloodSpot. These web-services support the aim of making data and analysis of haematopoietic cells from mouse and human accessible for researchers without bioinformatics expertise. Finally, in order to aid the analysis of the very limited number...
Functional requirements for bacteriophage growth: gene essentiality and expression in mycobacteriophage Giles.

Science.gov (United States)

Dedrick, Rebekah M; Marinelli, Laura J; Newton, Gerald L; Pogliano, Kit; Pogliano, Joseph; Hatfull, Graham F

2013-05-01

Bacteriophages represent a majority of all life forms, and the vast, dynamic population with early origins is reflected in their enormous genetic diversity. A large number of bacteriophage genomes have been sequenced. They are replete with novel genes without known relatives. We know little about their functions, which genes are required for lytic growth, and how they are expressed. Furthermore, the diversity is such that even genes with required functions - such as virion proteins and repressors - cannot always be recognized. Here we describe a functional genomic dissection of mycobacteriophage Giles, in which the virion proteins are identified, genes required for lytic growth are determined, the repressor is identified, and the transcription patterns determined. We find that although all of the predicted phage genes are expressed either in lysogeny or in lytic growth, 45% of the predicted genes are non-essential for lytic growth. We also describe genes required for DNA replication, show that recombination is required for lytic growth, and that Giles encodes a novel repressor. RNAseq analysis reveals abundant expression of a small non-coding RNA in a lysogen and in late lytic growth, although it is non-essential for lytic growth and does not alter lysogeny. © 2013 Blackwell Publishing Ltd.
Differential Gene Expression and Aging

Directory of Open Access Journals (Sweden)

Laurent Seroude

2002-01-01

Full Text Available It has been established that an intricate program of gene expression controls progression through the different stages in development. The equally complex biological phenomenon known as aging is genetically determined and environmentally modulated. This review focuses on the genetic component of aging, with a special emphasis on differential gene expression. At least two genetic pathways regulating organism longevity act by modifying gene expression. Many genes are also subjected to age-dependent transcriptional regulation. Some age-related gene expression changes are prevented by caloric restriction, the most robust intervention that slows down the aging process. Manipulating the expression of some age-regulated genes can extend an organism's life span. Remarkably, the activity of many transcription regulatory elements is linked to physiological age as opposed to chronological age, indicating that orderly and tightly controlled regulatory pathways are active during aging.
Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

Energy Technology Data Exchange (ETDEWEB)

Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)

2013-10-01

The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.
Google goes cancer: improving outcome prediction for cancer patients by network-based ranking of marker genes.

Directory of Open Access Journals (Sweden)

Christof Winter

Full Text Available Predicting the clinical outcome of cancer patients based on the expression of marker genes in their tumors has received increasing interest in the past decade. Accurate predictors of outcome and response to therapy could be used to personalize and thereby improve therapy. However, state of the art methods used so far often found marker genes with limited prediction accuracy, limited reproducibility, and unclear biological relevance. To address this problem, we developed a novel computational approach to identify genes prognostic for outcome that couples gene expression measurements from primary tumor samples with a network of known relationships between the genes. Our approach ranks genes according to their prognostic relevance using both expression and network information in a manner similar to Google's PageRank. We applied this method to gene expression profiles which we obtained from 30 patients with pancreatic cancer, and identified seven candidate marker genes prognostic for outcome. Compared to genes found with state of the art methods, such as Pearson correlation of gene expression with survival time, we improve the prediction accuracy by up to 7%. Accuracies were assessed using support vector machine classifiers and Monte Carlo cross-validation. We then validated the prognostic value of our seven candidate markers using immunohistochemistry on an independent set of 412 pancreatic cancer samples. Notably, signatures derived from our candidate markers were independently predictive of outcome and superior to established clinical prognostic factors such as grade, tumor size, and nodal status. As the amount of genomic data of individual tumors grows rapidly, our algorithm meets the need for powerful computational approaches that are key to exploit these data for personalized cancer therapies in clinical practice.
Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.

Science.gov (United States)

Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin

2013-09-22

High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.
CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer.

Science.gov (United States)

Nguyen, Minh Nam; Choi, Tae Gyu; Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo

2015-10-13

Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies.
Gene expression signature of normal cell-of-origin predicts ovarian tumor outcomes.

Directory of Open Access Journals (Sweden)

Melissa A Merritt

Full Text Available The potential role of the cell-of-origin in determining the tumor phenotype has been raised, but not adequately examined. We hypothesized that distinct cells-of-origin may play a role in determining ovarian tumor phenotype and outcome. Here we describe a new cell culture medium for in vitro culture of paired normal human ovarian (OV and fallopian tube (FT epithelial cells from donors without cancer. While these cells have been cultured individually for short periods of time, to our knowledge this is the first long-term culture of both cell types from the same donors. Through analysis of the gene expression profiles of the cultured OV/FT cells we identified a normal cell-of-origin gene signature that classified primary ovarian cancers into OV-like and FT-like subgroups; this classification correlated with significant differences in clinical outcomes. The identification of a prognostically significant gene expression signature derived solely from normal untransformed cells is consistent with the hypothesis that the normal cell-of-origin may be a source of ovarian tumor heterogeneity and the associated differences in tumor outcome.

A novel mutual information-based Boolean network inference method from time-series gene expression data.

Directory of Open Access Journals (Sweden)

Shohag Barman

Full Text Available Inferring a gene regulatory network from time-series gene expression data in systems biology is a challenging problem. Many methods have been suggested, most of which have a scalability limitation due to the combinatorial cost of searching a regulatory set of genes. In addition, they have focused on the accurate inference of a network structure only. Therefore, there is a pressing need to develop a network inference method to search regulatory genes efficiently and to predict the network dynamics accurately.In this study, we employed a Boolean network model with a restricted update rule scheme to capture coarse-grained dynamics, and propose a novel mutual information-based Boolean network inference (MIBNI method. Given time-series gene expression data as an input, the method first identifies a set of initial regulatory genes using mutual information-based feature selection, and then improves the dynamics prediction accuracy by iteratively swapping a pair of genes between sets of the selected regulatory genes and the other genes. Through extensive simulations with artificial datasets, MIBNI showed consistently better performance than six well-known existing methods, REVEAL, Best-Fit, RelNet, CST, CLR, and BIBN in terms of both structural and dynamics prediction accuracy. We further tested the proposed method with two real gene expression datasets for an Escherichia coli gene regulatory network and a fission yeast cell cycle network, and also observed better results using MIBNI compared to the six other methods.Taken together, MIBNI is a promising tool for predicting both the structure and the dynamics of a gene regulatory network.
High-throughput Microarray Detection of Vomeronasal Receptor Gene Expression in Rodents

Directory of Open Access Journals (Sweden)

Xiaohong Zhang

2010-11-01

Full Text Available We performed comprehensive data mining to explore the vomeronasal receptor (V1R & V2R repertoires in mouse and rat using the mm5 and rn3 genome, respectively. This bioinformatic analysis was followed by investigation of gene expression using a custom designed high-density oligonucleotide array containing all of these receptors and other selected genes of interest. This array enabled us to detect the specific expression of V1R and V2Rs which were previously identified solely based on computational prediction from gene sequence data, thereby establishing that these genes are indeed part of the vomeronasal system, especially the V2Rs. 168 V1Rs and 98 V2Rs were detected to be highly enriched in mouse vomeronasal organ (VNO, and 108 V1Rs and 87 V2Rs in rat VNO. We monitored the expression profile of mouse VR genes in other non-VNO tissues with the result that some VR genes were re-designated as VR-like genes based on their non-olfactory expression pattern. Temporal expression profiles for mouse VR genes were characterized and their patterns were classified, revealing the developmental dynamics of these so-called pheromone receptors. We found numerous patterns of temporal expression which indicate possible behavior-related functions. The uneven composition of VR genes in certain patterns suggests a functional differentiation between the two types of VR genes. We found the coherence between VR genes and transcription factors in terms of their temporal expression patterns. In situ hybridization experiments were performed to evaluate the cell number change over time for selected receptor genes.
Phylogenomic detection and functional prediction of genes potentially important for plant meiosis.

Science.gov (United States)

Zhang, Luoyan; Kong, Hongzhi; Ma, Hong; Yang, Ji

2018-02-15

Meiosis is a specialized type of cell division necessary for sexual reproduction in eukaryotes. A better understanding of the cytological procedures of meiosis has been achieved by comprehensive cytogenetic studies in plants, while the genetic mechanisms regulating meiotic progression remain incompletely understood. The increasing accumulation of complete genome sequences and large-scale gene expression datasets has provided a powerful resource for phylogenomic inference and unsupervised identification of genes involved in plant meiosis. By integrating sequence homology and expression data, 164, 131, 124 and 162 genes potentially important for meiosis were identified in the genomes of Arabidopsis thaliana, Oryza sativa, Selaginella moellendorffii and Pogonatum aloides, respectively. The predicted genes were assigned to 45 meiotic GO terms, and their functions were related to different processes occurring during meiosis in various organisms. Most of the predicted meiotic genes underwent lineage-specific duplication events during plant evolution, with about 30% of the predicted genes retaining only a single copy in higher plant genomes. The results of this study provided clues to design experiments for better functional characterization of meiotic genes in plants, promoting the phylogenomic approach to the evolutionary dynamics of the plant meiotic machineries. Copyright © 2017 Elsevier B.V. All rights reserved.
Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues.

Directory of Open Access Journals (Sweden)

Heather E Wheeler

2016-11-01

Full Text Available Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR < 0.1 in the DGN whole blood cohort. However, current sample sizes (n ≤ 922 do not allow us to compute distal h2. Bayesian Sparse Linear Mixed Model (BSLMM analysis provides strong evidence that the genetic contribution to local expression traits is dominated by a handful of genetic variants rather than by the collective contribution of a large number of variants each of modest size. In other words, the local architecture of gene expression traits is sparse rather than polygenic across all 40 tissues (from DGN and GTEx examined. This result is confirmed by the sparsity of optimal performing gene expression predictors via elastic net modeling. To further explore the tissue context specificity, we decompose the expression traits into cross-tissue and tissue-specific components using a novel Orthogonal Tissue Decomposition (OTD approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan.
Transcriptome profiling in conifers and the PiceaGenExpress database show patterns of diversification within gene families and interspecific conservation in vascular gene expression

Directory of Open Access Journals (Sweden)

Raherison Elie

2012-08-01

Full Text Available Abstract Background Conifers have very large genomes (13 to 30 Gigabases that are mostly uncharacterized although extensive cDNA resources have recently become available. This report presents a global overview of transcriptome variation in a conifer tree and documents conservation and diversity of gene expression patterns among major vegetative tissues. Results An oligonucleotide microarray was developed from Picea glauca and P. sitchensis cDNA datasets. It represents 23,853 unique genes and was shown to be suitable for transcriptome profiling in several species. A comparison of secondary xylem and phelloderm tissues showed that preferential expression in these vascular tissues was highly conserved among Picea spp. RNA-Sequencing strongly confirmed tissue preferential expression and provided a robust validation of the microarray design. A small database of transcription profiles called PiceaGenExpress was developed from over 150 hybridizations spanning eight major tissue types. In total, transcripts were detected for 92% of the genes on the microarray, in at least one tissue. Non-annotated genes were predominantly expressed at low levels in fewer tissues than genes of known or predicted function. Diversity of expression within gene families may be rapidly assessed from PiceaGenExpress. In conifer trees, dehydrins and late embryogenesis abundant (LEA osmotic regulation proteins occur in large gene families compared to angiosperms. Strong contrasts and low diversity was observed in the dehydrin family, while diverse patterns suggested a greater degree of diversification among LEAs. Conclusion Together, the oligonucleotide microarray and the PiceaGenExpress database represent the first resource of this kind for gymnosperm plants. The spruce transcriptome analysis reported here is expected to accelerate genetic studies in the large and important group comprised of conifer trees.
Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

Directory of Open Access Journals (Sweden)

Kouji Satoh

Full Text Available Rice (Oryza sativa L. is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE genes, 33K annotated non-expressed (ANE genes, and 5.5K non-annotated expressed (NAE genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.
Sex-biased gene expression in dioecious garden asparagus (Asparagus officinalis).

Science.gov (United States)

Harkess, Alex; Mercati, Francesco; Shan, Hong-Yan; Sunseri, Francesco; Falavigna, Agostino; Leebens-Mack, Jim

2015-08-01

Sex chromosomes have evolved independently in phylogenetically diverse flowering plant lineages. The genes governing sex determination in dioecious species remain unknown, but theory predicts that the linkage of genes influencing male and female function will spur the origin and early evolution of sex chromosomes. For example, in an XY system, the origin of an active Y may be spurred by the linkage of female suppressing and male promoting genes. Garden asparagus (Asparagus officinalis) serves as a model for plant sex chromosome evolution, given that it has recently evolved an XX/XY sex chromosome system. In order to elucidate the molecular basis of gender differences and sex determination, we used RNA-sequencing (RNA-Seq) to identify differentially expressed genes between female (XX), male (XY) and supermale (YY) individuals. We identified 570 differentially expressed genes, and showed that significantly more genes exhibited male-biased than female-biased expression in garden asparagus. In the context of anther development, we identified genes involved in pollen microspore and tapetum development that were specifically expressed in males and supermales. Comparative analysis of genes in the Arabidopsis thaliana, Zea mays and Oryza sativa anther development pathways shows that anther sterility in females probably occurs through interruption of tapetum development before microspore meiosis. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Endosomal gene expression: a new indicator for prostate cancer patient prognosis?

LENUS (Irish Health Repository)

Johnson, Ian R D

2015-11-10

Prostate cancer continues to be a major cause of morbidity and mortality in men, but a method for accurate prognosis in these patients is yet to be developed. The recent discovery of altered endosomal biogenesis in prostate cancer has identified a fundamental change in the cell biology of this cancer, which holds great promise for the identification of novel biomarkers that can predict disease outcomes. Here we have identified significantly altered expression of endosomal genes in prostate cancer compared to non-malignant tissue in mRNA microarrays and confirmed these findings by qRT-PCR on fresh-frozen tissue. Importantly, we identified endosomal gene expression patterns that were predictive of patient outcomes. Two endosomal tri-gene signatures were identified from a previously published microarray cohort and had a significant capacity to stratify patient outcomes. The expression of APPL1, RAB5A, EEA1, PDCD6IP, NOX4 and SORT1 were altered in malignant patient tissue, when compared to indolent and normal prostate tissue. These findings support the initiation of a case-control study using larger cohorts of prostate tissue, with documented patient outcomes, to determine if different combinations of these new biomarkers can accurately predict disease status and clinical progression in prostate cancer patients.
Design parameters to control synthetic gene expression in Escherichia coli.

Directory of Open Access Journals (Sweden)

Mark Welch

Full Text Available BACKGROUND: Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles. PRINCIPAL FINDINGS: To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well. CONCLUSION: The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.
Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

KAUST Repository

Fujii, Chisato

2015-04-16

Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.
Early gene expression profiles of patients with chronic hepatitis C treated with pegylated interferon-alfa and ribavirin.

Science.gov (United States)

Younossi, Zobair M; Baranova, Ancha; Afendy, Arian; Collantes, Rochelle; Stepanova, Maria; Manyam, Ganiraju; Bakshi, Anita; Sigua, Christopher L; Chan, Joanne P; Iverson, Ayuko A; Santini, Christopher D; Chang, Sheng-Yung P

2009-03-01

Responsiveness to hepatitis C virus (HCV) therapy depends on viral and host factors. Our aim was to assess sustained virologic response (SVR)-associated early gene expression in patients with HCV receiving pegylated interferon-alpha2a (PEG-IFN-alpha2a) or PEG-IFN-alpha2b and ribavirin with the duration based on genotypes. Blood samples were collected into PAXgene tubes prior to treatment as well as 1, 7, 28, and 56 days after treatment. From the peripheral blood cells, total RNA was extracted, quantified, and used for one-step reverse transcription polymerase chain reaction to profile 154 messenger RNAs. Expression levels of messenger RNAs were normalized with six "housekeeping" genes and a reference RNA. Multiple regression and stepwise selection were performed to assess differences in gene expression at different time points, and predictive performance was evaluated for each model. A total of 68 patients were enrolled in the study and treated with combination therapy. The results of gene expression showed that SVR could be predicted by the gene expression of signal transducer and activator of transcription-6 (STAT-6) and suppressor of cytokine signaling-1 in the pretreatment samples. After 24 hours, SVR was predicted by the expression of interferon-dependent genes, and this dependence continued to be prominent throughout the treatment. Early gene expression during anti-HCV therapy may elucidate important molecular pathways that may be influencing the probability of achieving virologic response.
Gene expression inference with deep learning.

Science.gov (United States)

Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui

2016-06-15

Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX CONTACT: xhx@ics.uci.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
A gene signature in histologically normal surgical margins is predictive of oral carcinoma recurrence

International Nuclear Information System (INIS)

Reis, Patricia P; Simpson, Colleen; Goldstein, David; Brown, Dale; Gilbert, Ralph; Gullane, Patrick; Irish, Jonathan; Jurisica, Igor; Kamel-Reid, Suzanne; Waldron, Levi; Perez-Ordonez, Bayardo; Pintilie, Melania; Galloni, Natalie Naranjo; Xuan, Yali; Cervigne, Nilva K; Warner, Giles C; Makitie, Antti A

2011-01-01

Oral Squamous Cell Carcinoma (OSCC) is a major cause of cancer death worldwide, which is mainly due to recurrence leading to treatment failure and patient death. Histological status of surgical margins is a currently available assessment for recurrence risk in OSCC; however histological status does not predict recurrence, even in patients with histologically negative margins. Therefore, molecular analysis of histologically normal resection margins and the corresponding OSCC may aid in identifying a gene signature predictive of recurrence. We used a meta-analysis of 199 samples (OSCCs and normal oral tissues) from five public microarray datasets, in addition to our microarray analysis of 96 OSCCs and histologically normal margins from 24 patients, to train a gene signature for recurrence. Validation was performed by quantitative real-time PCR using 136 samples from an independent cohort of 30 patients. We identified 138 significantly over-expressed genes (> 2-fold, false discovery rate of 0.01) in OSCC. By penalized likelihood Cox regression, we identified a 4-gene signature with prognostic value for recurrence in our training set. This signature comprised the invasion-related genes MMP1, COL4A1, P4HA2, and THBS2. Over-expression of this 4-gene signature in histologically normal margins was associated with recurrence in our training cohort (p = 0.0003, logrank test) and in our independent validation cohort (p = 0.04, HR = 6.8, logrank test). Gene expression alterations occur in histologically normal margins in OSCC. Over-expression of the 4-gene signature in histologically normal surgical margins was validated and highly predictive of recurrence in an independent patient cohort. Our findings may be applied to develop a molecular test, which would be clinically useful to help predict which patients are at a higher risk of local recurrence
Inductive matrix completion for predicting gene-disease associations.

Science.gov (United States)

Natarajan, Nagarajan; Dhillon, Inderjit S

2014-06-15

Most existing methods for predicting causal disease genes rely on specific type of evidence, and are therefore limited in terms of applicability. More often than not, the type of evidence available for diseases varies-for example, we may know linked genes, keywords associated with the disease obtained by mining text, or co-occurrence of disease symptoms in patients. Similarly, the type of evidence available for genes varies-for example, specific microarray probes convey information only for certain sets of genes. In this article, we apply a novel matrix-completion method called Inductive Matrix Completion to the problem of predicting gene-disease associations; it combines multiple types of evidence (features) for diseases and genes to learn latent factors that explain the observed gene-disease associations. We construct features from different biological sources such as microarray expression data and disease-related textual data. A crucial advantage of the method is that it is inductive; it can be applied to diseases not seen at training time, unlike traditional matrix-completion approaches and network-based inference methods that are transductive. Comparison with state-of-the-art methods on diseases from the Online Mendelian Inheritance in Man (OMIM) database shows that the proposed approach is substantially better-it has close to one-in-four chance of recovering a true association in the top 100 predictions, compared to the recently proposed Catapult method (second best) that has bigdata.ices.utexas.edu/project/gene-disease. © The Author 2014. Published by Oxford University Press.
Gene expression signature in organized and growth arrested mammaryacini predicts good outcome in breast cancer

Energy Technology Data Exchange (ETDEWEB)

Fournier, Marcia V.; Martin, Katherine J.; Kenny, Paraic A.; Xhaja, Kris; Bosch, Irene; Yaswen, Paul; Bissell, Mina J.

2006-02-08

To understand how non-malignant human mammary epithelial cells (HMEC) transit from a disorganized proliferating to an organized growth arrested state, and to relate this process to the changes that occur in breast cancer, we studied gene expression changes in non-malignant HMEC grown in three-dimensional cultures, and in a previously published panel of microarray data for 295 breast cancer samples. We hypothesized that the gene expression pattern of organized and growth arrested mammary acini would share similarities with breast tumors with good prognoses. Using Affymetrix HG-U133A microarrays, we analyzed the expression of 22,283 gene transcripts in two HMEC cell lines, 184 (finite life span) and HMT3522 S1 (immortal non-malignant), on successive days post-seeding in a laminin-rich extracellular matrix assay. Both HMECs underwent growth arrest in G0/G1 and differentiated into polarized acini between days 5 and 7. We identified gene expression changes with the same temporal pattern in both lines. We show that genes that are significantly lower in the organized, growth arrested HMEC than in their proliferating counterparts can be used to classify breast cancer patients into poor and good prognosis groups with high accuracy. This study represents a novel unsupervised approach to identifying breast cancer markers that may be of use clinically.
Patterns of gene expression in a scleractinian coral undergoing natural bleaching.

Science.gov (United States)

Seneca, Francois O; Forêt, Sylvain; Ball, Eldon E; Smith-Keune, Carolyn; Miller, David J; van Oppen, Madeleine J H

2010-10-01

Coral bleaching is a major threat to coral reefs worldwide and is predicted to intensify with increasing global temperature. This study represents the first investigation of gene expression in an Indo-Pacific coral species undergoing natural bleaching which involved the loss of algal symbionts. Quantitative real-time polymerase chain reaction experiments were conducted to select and evaluate coral internal control genes (ICGs), and to investigate selected coral genes of interest (GOIs) for changes in gene expression in nine colonies of the scleractinian coral Acropora millepora undergoing bleaching at Magnetic Island, Great Barrier Reef, Australia. Among the six ICGs tested, glyceraldehyde 3-phosphate dehydrogenase and the ribosomal protein genes S7 and L9 exhibited the most constant expression levels between samples from healthy-looking colonies and samples from the same colonies when severely bleached a year later. These ICGs were therefore utilised for normalisation of expression data for seven selected GOIs. Of the seven GOIs, homologues of catalase, C-type lectin and chromoprotein genes were significantly up-regulated as a result of bleaching by factors of 1.81, 1.46 and 1.61 (linear mixed models analysis of variance, P coral bleaching response genes. In contrast, three genes, including one putative ICG, showed highly variable levels of expression between coral colonies. Potential variation in microhabitat, gene function unrelated to the stress response and individualised stress responses may influence such differences between colonies and need to be better understood when designing and interpreting future studies of gene expression in natural coral populations.
Classification across gene expression microarray studies

Directory of Open Access Journals (Sweden)

Kuner Ruprecht

2009-12-01

Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In
Cancer-Predicting Gene Expression Changes in Colonic Mucosa of Western Diet Fed Mlh1 +/- Mice

Science.gov (United States)

Dermadi Bebek, Denis; Valo, Satu; Reyhani, Nima; Ollila, Saara; Päivärinta, Essi; Peltomäki, Päivi; Mutanen, Marja; Nyström, Minna

2013-01-01

Colorectal cancer (CRC) is the second most common cause of cancer-related deaths in the Western world and interactions between genetic and environmental factors, including diet, are suggested to play a critical role in its etiology. We conducted a long-term feeding experiment in the mouse to address gene expression and methylation changes arising in histologically normal colonic mucosa as putative cancer-predisposing events available for early detection. The expression of 94 growth-regulatory genes previously linked to human CRC was studied at two time points (5 weeks and 12 months of age) in the heterozygote Mlh1 +/- mice, an animal model for human Lynch syndrome (LS), and wild type Mlh1 +/+ littermates, fed by either Western-style (WD) or AIN-93G control diet. In mice fed with WD, proximal colon mucosa, the predominant site of cancer formation in LS, exhibited a significant expression decrease in tumor suppressor genes, Dkk1, Hoxd1, Slc5a8, and Socs1, the latter two only in the Mlh1 +/- mice. Reduced mRNA expression was accompanied by increased promoter methylation of the respective genes. The strongest expression decrease (7.3 fold) together with a significant increase in its promoter methylation was seen in Dkk1, an antagonist of the canonical Wnt signaling pathway. Furthermore, the inactivation of Dkk1 seems to predispose to neoplasias in the proximal colon. This and the fact that Mlh1 which showed only modest methylation was still expressed in both Mlh1 +/- and Mlh1 +/+ mice indicate that the expression decreases and the inactivation of Dkk1 in particular is a prominent early marker for colon oncogenesis. PMID:24204690
Cancer-predicting gene expression changes in colonic mucosa of Western diet fed Mlh1+/- mice.

Directory of Open Access Journals (Sweden)

Marjaana Pussila

Full Text Available Colorectal cancer (CRC is the second most common cause of cancer-related deaths in the Western world and interactions between genetic and environmental factors, including diet, are suggested to play a critical role in its etiology. We conducted a long-term feeding experiment in the mouse to address gene expression and methylation changes arising in histologically normal colonic mucosa as putative cancer-predisposing events available for early detection. The expression of 94 growth-regulatory genes previously linked to human CRC was studied at two time points (5 weeks and 12 months of age in the heterozygote Mlh1(+/- mice, an animal model for human Lynch syndrome (LS, and wild type Mlh1(+/+ littermates, fed by either Western-style (WD or AIN-93G control diet. In mice fed with WD, proximal colon mucosa, the predominant site of cancer formation in LS, exhibited a significant expression decrease in tumor suppressor genes, Dkk1, Hoxd1, Slc5a8, and Socs1, the latter two only in the Mlh1(+/- mice. Reduced mRNA expression was accompanied by increased promoter methylation of the respective genes. The strongest expression decrease (7.3 fold together with a significant increase in its promoter methylation was seen in Dkk1, an antagonist of the canonical Wnt signaling pathway. Furthermore, the inactivation of Dkk1 seems to predispose to neoplasias in the proximal colon. This and the fact that Mlh1 which showed only modest methylation was still expressed in both Mlh1(+/- and Mlh1(+/+ mice indicate that the expression decreases and the inactivation of Dkk1 in particular is a prominent early marker for colon oncogenesis.
Scaling of gene expression data allowing the comparison of different gene expression platforms

NARCIS (Netherlands)

van Ruissen, Fred; Schaaf, Gerben J.; Kool, Marcel; Baas, Frank; Ruijter, Jan M.

2008-01-01

Serial analysis of gene expression (SAGE) and microarrays have found a widespread application, but much ambiguity exists regarding the amalgamation of the data resulting from these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce

cis sequence effects on gene expression

Directory of Open Access Journals (Sweden)

Jacobs Kevin

2007-08-01

Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.
Computational prediction and experimental validation of Ciona intestinalis microRNA genes

Directory of Open Access Journals (Sweden)

Pasquinelli Amy E

2007-11-01

Full Text Available Abstract Background This study reports the first collection of validated microRNA genes in the sea squirt, Ciona intestinalis. MicroRNAs are processed from hairpin precursors to ~22 nucleotide RNAs that base pair to target mRNAs and inhibit expression. As a member of the subphylum Urochordata (Tunicata whose larval form has a notochord, the sea squirt is situated at the emergence of vertebrates, and therefore may provide information about the evolution of molecular regulators of early development. Results In this study, computational methods were used to predict 14 microRNA gene families in Ciona intestinalis. The microRNA prediction algorithm utilizes configurable microRNA sequence conservation and stem-loop specificity parameters, grouping by miRNA family, and phylogenetic conservation to the related species, Ciona savignyi. The expression for 8, out of 9 attempted, of the putative microRNAs in the adult tissue of Ciona intestinalis was validated by Northern blot analyses. Additionally, a target prediction algorithm was implemented, which identified a high confidence list of 240 potential target genes. Over half of the predicted targets can be grouped into the gene ontology categories of metabolism, transport, regulation of transcription, and cell signaling. Conclusion The computational techniques implemented in this study can be applied to other organisms and serve to increase the understanding of the origins of non-coding RNAs, embryological and cellular developmental pathways, and the mechanisms for microRNA-controlled gene regulatory networks.
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

Directory of Open Access Journals (Sweden)

Paules Richard S

2007-11-01

Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying
Predicting Recurrence and Progression of Noninvasive Papillary Bladder Cancer at Initial Presentation Based on Quantitative Gene Expression Profiles

DEFF Research Database (Denmark)

Birkhahn, M.; Mitra, A.P.; Williams, Johan

2010-01-01

Background: Currently, tumor grade is the best predictor of outcome at first presentation of noninvasive papillary (Ta) bladder cancer. However, reliable predictors of Ta tumor recurrence and progression for individual patients, which could optimize treatment and follow-up schedules based...... on specific tumor biology, are yet to be identified. Objective: To identify genes predictive for recurrence and progression in Ta bladder cancer at first presentation using a quantitative, pathway-specific approach. Design, setting, and participants: Retrospective study of patients with Ta G2/3 bladder tumors...... at initial presentation with three distinct clinical outcomes: absence of recurrence (n = 16), recurrence without progression (n = 16), and progression to carcinoma in situ or invasive disease (n = 16). Measurements: Expressions of 24 genes that feature in relevant pathways that are deregulated in bladder...
A Pathway Based Classification Method for Analyzing Gene Expression for Alzheimer's Disease Diagnosis.

Science.gov (United States)

Voyle, Nicola; Keohane, Aoife; Newhouse, Stephen; Lunnon, Katie; Johnston, Caroline; Soininen, Hilkka; Kloszewska, Iwona; Mecocci, Patrizia; Tsolaki, Magda; Vellas, Bruno; Lovestone, Simon; Hodges, Angela; Kiddle, Steven; Dobson, Richard Jb

2016-01-01

Recent studies indicate that gene expression levels in blood may be able to differentiate subjects with Alzheimer's disease (AD) from normal elderly controls and mild cognitively impaired (MCI) subjects. However, there is limited replicability at the single marker level. A pathway-based interpretation of gene expression may prove more robust. This study aimed to investigate whether a case/control classification model built on pathway level data was more robust than a gene level model and may consequently perform better in test data. The study used two batches of gene expression data from the AddNeuroMed (ANM) and Dementia Case Registry (DCR) cohorts. Our study used Illumina Human HT-12 Expression BeadChips to collect gene expression from blood samples. Random forest modeling with recursive feature elimination was used to predict case/control status. Age and APOE ɛ4 status were used as covariates for all analysis. Gene and pathway level models performed similarly to each other and to a model based on demographic information only. Any potential increase in concordance from the novel pathway level approach used here has not lead to a greater predictive ability in these datasets. However, we have only tested one method for creating pathway level scores. Further, we have been able to benchmark pathways against genes in datasets that had been extensively harmonized. Further work should focus on the use of alternative methods for creating pathway level scores, in particular those that incorporate pathway topology, and the use of an endophenotype based approach.
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

Science.gov (United States)

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.
Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

Science.gov (United States)

Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

2014-01-01

We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782
Directional gene expression and antisense transcripts in sexual and asexual stages of Plasmodium falciparum

Directory of Open Access Journals (Sweden)

López-Barragán María J

2011-11-01

Full Text Available Abstract Background It has been shown that nearly a quarter of the initial predicted gene models in the Plasmodium falciparum genome contain errors. Although there have been efforts to obtain complete cDNA sequences to correct the errors, the coverage of cDNA sequences on the predicted genes is still incomplete, and many gene models for those expressed in sexual or mosquito stages have not been validated. Antisense transcripts have widely been reported in P. falciparum; however, the extent and pattern of antisense transcripts in different developmental stages remain largely unknown. Results We have sequenced seven bidirectional libraries from ring, early and late trophozoite, schizont, gametocyte II, gametocyte V, and ookinete, and four strand-specific libraries from late trophozoite, schizont, gametocyte II, and gametocyte V of the 3D7 parasites. Alignment of the cDNA sequences to the 3D7 reference genome revealed stage-specific antisense transcripts and novel intron-exon splicing junctions. Sequencing of strand-specific cDNA libraries suggested that more genes are expressed in one direction in gametocyte than in schizont. Alternatively spliced genes, antisense transcripts, and stage-specific expressed genes were also characterized. Conclusions It is necessary to continue to sequence cDNA from different developmental stages, particularly those of non-erythrocytic stages. The presence of antisense transcripts in some gametocyte and ookinete genes suggests that these antisense RNA may play an important role in gene expression regulation and parasite development. Future gene expression studies should make use of directional cDNA libraries. Antisense transcripts may partly explain the observed discrepancy between levels of mRNA and protein expression.
Modulation of gene expression made easy

DEFF Research Database (Denmark)

Solem, Christian; Jensen, Peter Ruhdal

2002-01-01

A new approach for modulating gene expression, based on randomization of promoter (spacer) sequences, was developed. The method was applied to chromosomal genes in Lactococcus lactis and shown to generate libraries of clones with broad ranges of expression levels of target genes. In one example...... that the method can be applied to modulating the expression of native genes on the chromosome. We constructed a series of strains in which the expression of the las operon, containing the genes pfk, pyk, and ldh, was modulated by integrating a truncated copy of the pfk gene. Importantly, the modulation affected...
Ionizing Radiation Affects Gene Expression in Mouse Skin and Bone

Science.gov (United States)

Terada, Masahiro; Tahimic, Candice; Sowa, Marianne B.; Schreurs, Ann-Sofie; Shirazi-Fard, Yasaman; Alwood, Joshua; Globus, Ruth K.

2017-01-01

Future long-duration space exploration beyond low earth orbit will increase human exposure to space radiation and microgravity conditions as well as associated risks to skeletal health. In animal studies, radiation exposure (greater than 1 Gy) is associated with pathological changes in bone structure, enhanced bone resorption, reduced bone formation and decreased bone mineral density, which can lead to skeletal fragility. Definitive measurements and detection of bone loss typically require large and specialized equipment which can make their application to long duration space missions logistically challenging. Towards the goal of developing non-invasive and less complicated monitoring methods to predict astronauts' health during spaceflight, we examined whether radiation induced gene expression changes in skin may be predictive of the responses of skeletal tissue to radiation exposure. We examined oxidative stress and growth arrest pathways in mouse skin and long bones by measuring gene expression levels via quantitative polymerase chain reaction (qPCR) after exposure to total body irradiation (IR). To investigate the effects of irradiation on gene expression, we used skin and femora (cortical shaft) from the following treatment groups: control (normally loaded, sham-irradiated), and IR (0.5 Gy 56Fe 600 MeV/n and 0.5 Gy 1H 150 MeV/n), euthanized at one and 11 days post-irradiation (IR). To determine the extent of bone loss, tibiae were harvested and cancellous microarchitecture in the proximal tibia quantified ex vivo using microcomputed tomography (microCT). Statistical analysis was performed using Student's t-test. At one day post-IR, expression of FGF18 in skin was significantly greater (3.8X) than sham-irradiated controls, but did not differ at 11 days post IR. Expression levels of other genes associated with antioxidant response (Nfe2l2, FoxO3 and Sod1) and the cell cycle (Trp53, Cdkn1a, Gadd45g) did not significantly differ between the control and IR groups
The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome

KAUST Repository

Hurst, Laurence D.

2015-12-18

X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression
The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

Directory of Open Access Journals (Sweden)

Laurence D Hurst

2015-12-01

Full Text Available X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE and data from the Functional Annotation of the Mammalian Genome (FANTOM5 project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds, as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased
The predictive nature of transcript expression levels on protein expression in adult human brain.

Science.gov (United States)

Bauernfeind, Amy L; Babbitt, Courtney C

2017-04-24

Next generation sequencing methods are the gold standard for evaluating expression of the transcriptome. When determining the biological implications of such studies, the assumption is often made that transcript expression levels correspond to protein levels in a meaningful way. However, the strength of the overall correlation between transcript and protein expression is inconsistent, particularly in brain samples. Following high-throughput transcriptomic (RNA-Seq) and proteomic (liquid chromatography coupled with tandem mass spectrometry) analyses of adult human brain samples, we compared the correlation in the expression of transcripts and proteins that support various biological processes, molecular functions, and that are located in different areas of the cell. Although most categories of transcripts have extremely weak predictive value for the expression of their associated proteins (R 2 values of < 10%), transcripts coding for protein kinases and membrane-associated proteins, including those that are part of receptors or ion transporters, are among those that are most predictive of downstream protein expression levels. The predictive value of transcript expression for corresponding proteins is variable in human brain samples, reflecting the complex regulation of protein expression. However, we found that transcriptomic analyses are appropriate for assessing the expression levels of certain classes of proteins, including those that modify proteins, such as kinases and phosphatases, regulate metabolic and synaptic activity, or are associated with a cellular membrane. These findings can be used to guide the interpretation of gene expression results from primate brain samples.
Integration of steady-state and temporal gene expression data for the inference of gene regulatory networks.

Science.gov (United States)

Wang, Yi Kan; Hurley, Daniel G; Schnell, Santiago; Print, Cristin G; Crampin, Edmund J

2013-01-01

We develop a new regression algorithm, cMIKANA, for inference of gene regulatory networks from combinations of steady-state and time-series gene expression data. Using simulated gene expression datasets to assess the accuracy of reconstructing gene regulatory networks, we show that steady-state and time-series data sets can successfully be combined to identify gene regulatory interactions using the new algorithm. Inferring gene networks from combined data sets was found to be advantageous when using noisy measurements collected with either lower sampling rates or a limited number of experimental replicates. We illustrate our method by applying it to a microarray gene expression dataset from human umbilical vein endothelial cells (HUVECs) which combines time series data from treatment with growth factor TNF and steady state data from siRNA knockdown treatments. Our results suggest that the combination of steady-state and time-series datasets may provide better prediction of RNA-to-RNA interactions, and may also reveal biological features that cannot be identified from dynamic or steady state information alone. Finally, we consider the experimental design of genomics experiments for gene regulatory network inference and show that network inference can be improved by incorporating steady-state measurements with time-series data.
A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors.

Science.gov (United States)

Loboda, Andrey; Nebozhyn, Michael; Klinghoffer, Rich; Frazier, Jason; Chastain, Michael; Arthur, William; Roberts, Brian; Zhang, Theresa; Chenard, Melissa; Haines, Brian; Andersen, Jannik; Nagashima, Kumiko; Paweletz, Cloud; Lynch, Bethany; Feldman, Igor; Dai, Hongyue; Huang, Pearl; Watters, James

2010-06-30

Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90%) sensitivity but relatively low (50%) specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical utility in lung and breast tumors.
A gene expression signature of RAS pathway dependence predicts response to PI3K and RAS pathway inhibitors and expands the population of RAS pathway activated tumors

Directory of Open Access Journals (Sweden)

Paweletz Cloud

2010-06-01

Full Text Available Abstract Background Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. Methods We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. Results The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90% sensitivity but relatively low (50% specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. Conclusions These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical
Improving the Prediction of Prostate Cancer Overall Survival by Supplementing Readily Available Clinical Data with Gene Expression Levels of IGFBP3 and F3 in Formalin-Fixed Paraffin Embedded Core Needle Biopsy Material.

Directory of Open Access Journals (Sweden)

Zhuochun Peng

Full Text Available A previously reported expression signature of three genes (IGFBP3, F3 and VGLL3 was shown to have potential prognostic value in estimating overall and cancer-specific survivals at diagnosis of prostate cancer in a pilot cohort study using freshly frozen Fine Needle Aspiration (FNA samples.We carried out a new cohort study with 241 prostate cancer patients diagnosed from 2004-2007 with a follow-up exceeding 6 years in order to verify the prognostic value of gene expression signature in formalin fixed paraffin embedded (FFPE prostate core needle biopsy tissue samples. The cohort consisted of four patient groups with different survival times and death causes. A four multiplex one-step RT-qPCR test kit, designed and optimized for measuring the expression signature in FFPE core needle biopsy samples, was used. In archive FFPE biopsy samples the expression differences of two genes (IGFBP3 and F3 were measured. The survival time predictions using the current clinical parameters only, such as age at diagnosis, Gleason score, PSA value and tumor stage, and clinical parameters supplemented with the expression levels of IGFBP3 and F3, were compared.When combined with currently used clinical parameters, the gene expression levels of IGFBP3 and F3 are improving the prediction of survival time as compared to using clinical parameters alone.The assessment of IGFBP3 and F3 gene expression levels in FFPE prostate cancer tissue would provide an improved survival prediction for prostate cancer patients at the time of diagnosis.
High-Throughput Gene Expression Profiles to Define Drug Similarity and Predict Compound Activity.

Science.gov (United States)

De Wolf, Hans; Cougnaud, Laure; Van Hoorde, Kirsten; De Bondt, An; Wegner, Joerg K; Ceulemans, Hugo; Göhlmann, Hinrich

2018-04-01

By adding biological information, beyond the chemical properties and desired effect of a compound, uncharted compound areas and connections can be explored. In this study, we add transcriptional information for 31K compounds of Janssen's primary screening deck, using the HT L1000 platform and assess (a) the transcriptional connection score for generating compound similarities, (b) machine learning algorithms for generating target activity predictions, and (c) the scaffold hopping potential of the resulting hits. We demonstrate that the transcriptional connection score is best computed from the significant genes only and should be interpreted within its confidence interval for which we provide the stats. These guidelines help to reduce noise, increase reproducibility, and enable the separation of specific and promiscuous compounds. The added value of machine learning is demonstrated for the NR3C1 and HSP90 targets. Support Vector Machine models yielded balanced accuracy values ≥80% when the expression values from DDIT4 & SERPINE1 and TMEM97 & SPR were used to predict the NR3C1 and HSP90 activity, respectively. Combining both models resulted in 22 new and confirmed HSP90-independent NR3C1 inhibitors, providing two scaffolds (i.e., pyrimidine and pyrazolo-pyrimidine), which could potentially be of interest in the treatment of depression (i.e., inhibiting the glucocorticoid receptor (i.e., NR3C1), while leaving its chaperone, HSP90, unaffected). As such, the initial hit rate increased by a factor 300, as less, but more specific chemistry could be screened, based on the upfront computed activity predictions.
Evaluating Transcription Factor Activity Changes by Scoring Unexplained Target Genes in Expression Data.

Directory of Open Access Journals (Sweden)

Evi Berchtold

Full Text Available Several methods predict activity changes of transcription factors (TFs from a given regulatory network and measured expression data. But available gene regulatory networks are incomplete and contain many condition-dependent regulations that are not relevant for the specific expression measurement. It is not known which combination of active TFs is needed to cause a change in the expression of a target gene. A method to systematically evaluate the inferred activity changes is missing. We present such an evaluation strategy that indicates for how many target genes the observed expression changes can be explained by a given set of active TFs. To overcome the problem that the exact combination of active TFs needed to activate a gene is typically not known, we assume a gene to be explained if there exists any combination for which the predicted active TFs can possibly explain the observed change of the gene. We introduce the i-score (inconsistency score, which quantifies how many genes could not be explained by the set of activity changes of TFs. We observe that, even for these minimal requirements, published methods yield many unexplained target genes, i.e. large i-scores. This holds for all methods and all expression datasets we evaluated. We provide new optimization methods to calculate the best possible (minimal i-score given the network and measured expression data. The evaluation of this optimized i-score on a large data compendium yields many unexplained target genes for almost every case. This indicates that currently available regulatory networks are still far from being complete. Both the presented Act-SAT and Act-A* methods produce optimal sets of TF activity changes, which can be used to investigate the difficult interplay of expression and network data. A web server and a command line tool to calculate our i-score and to find the active TFs associated with the minimal i-score is available from https://services.bio.ifi.lmu.de/i-score.
A seven-gene CpG-island methylation panel predicts breast cancer progression

International Nuclear Information System (INIS)

Li, Yan; Melnikov, Anatoliy A.; Levenson, Victor; Guerra, Emanuela; Simeone, Pasquale; Alberti, Saverio; Deng, Youping

2015-01-01

DNA methylation regulates gene expression, through the inhibition/activation of gene transcription of methylated/unmethylated genes. Hence, DNA methylation profiling can capture pivotal features of gene expression in cancer tissues from patients at the time of diagnosis. In this work, we analyzed a breast cancer case series, to identify DNA methylation determinants of metastatic versus non-metastatic tumors. CpG-island methylation was evaluated on a 56-gene cancer-specific biomarker microarray in metastatic versus non-metastatic breast cancers in a multi-institutional case series of 123 breast cancer patients. Global statistical modeling and unsupervised hierarchical clustering were applied to identify a multi-gene binary classifier with high sensitivity and specificity. Network analysis was utilized to quantify the connectivity of the identified genes. Seven genes (BRCA1, DAPK1, MSH2, CDKN2A, PGR, PRKCDBP, RANKL) were found informative for prognosis of metastatic diffusion and were used to calculate classifier accuracy versus the entire data-set. Individual-gene performances showed sensitivities of 63–79 %, 53–84 % specificities, positive predictive values of 59–83 % and negative predictive values of 63–80 %. When modelled together, these seven genes reached a sensitivity of 93 %, 100 % specificity, a positive predictive value of 100 % and a negative predictive value of 93 %, with high statistical power. Unsupervised hierarchical clustering independently confirmed these findings, in close agreement with the accuracy measurements. Network analyses indicated tight interrelationship between the identified genes, suggesting this to be a functionally-coordinated module, linked to breast cancer progression. Our findings identify CpG-island methylation profiles with deep impact on clinical outcome, paving the way for use as novel prognostic assays in clinical settings. The online version of this article (doi:10.1186/s12885-015-1412-9) contains supplementary

Using gene expression noise to understand gene regulation

NARCIS (Netherlands)

Munsky, B.; Neuert, G.; van Oudenaarden, A.

2012-01-01

Phenotypic variation is ubiquitous in biology and is often traceable to underlying genetic and environmental variation. However, even genetically identical cells in identical environments display variable phenotypes. Stochastic gene expression, or gene expression "noise," has been suggested as a
An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

Directory of Open Access Journals (Sweden)

Ao Li

2009-04-01

Full Text Available Motivation: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. Methods: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS is introduced to automatically determine the boundary threshold. Results: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.
The utility of optical detection system (qPCR) and bioinformatics methods in reference gene expression analysis

Science.gov (United States)

Skarzyńska, Agnieszka; Pawełkowicz, Magdalena; PlÄ der, Wojciech; Przybecki, Zbigniew

2016-09-01

Real-time quantitative polymerase chain reaction is consider as the most reliable method for gene expression studies. However, the expression of target gene could be misinterpreted due to improper normalization. Therefore, the crucial step for analysing of qPCR data is selection of suitable reference genes, which should be validated experimentally. In order to choice the gene with stable expression in the designed experiment, we performed reference gene expression analysis. In this study genes described in the literature and novel genes predicted as control genes, based on the in silico analysis of transcriptome data were used. Analysis with geNorm and NormFinder algorithms allow to create the ranking of candidate genes and indicate the best reference for flower morphogenesis study. According to the results, genes CACS and CYCL were characterised the most stable expression, but the least suitable genes were TUA and EF.
Adrenal-kidney-gonad complex measurements may not predict gonad-specific changes in gene expression patterns during temperature-dependent sex determination in the red-eared slider turtle (Trachemys scripta elegans).

Science.gov (United States)

Ramsey, Mary; Crews, David

2007-08-01

Many turtles, including the red-eared slider turtle (Trachemys scripta elegans) have temperature-dependent sex determination in which gonadal sex is determined by temperature during the middle third of incubation. The gonad develops as part of a heterogenous tissue complex that comprises the developing adrenal, kidney, and gonad (AKG complex). Owing to the difficulty in excising the gonad from the adjacent tissues, the AKG complex is often used as tissue source in assays examining gene expression in the developing gonad. However, the gonad is a relatively small component of the AKG, and gene expression in the adrenal-kidney (AK) compartment may interfere with the detection of gonad-specific changes in gene expression, particularly during early key phases of gonadal development and sex determination. In this study, we examine transcript levels as measured by quantitative real-time polymerase chain reaction for five genes important in slider turtle sex determination and differentiation (AR, ERalpha, ERbeta, aromatase, and Sf1) in AKG, AK, and isolated gonad tissues. In all cases, gonad-specific gene expression patterns were attenuated in AKG versus gonad tissue. All five genes were expressed in the AK in addition to the gonad at all stages/temperatures. Inclusion of the AK compartment masked important changes in gonadal gene expression. In addition, AK and gonad expression patterns are not additive, and gonadal gene expression cannot be predicted from intact AKG measurements. (c) 2007 Wiley-Liss, Inc.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods.

Science.gov (United States)

Wang, Liming; Zhu, L; Luan, R; Wang, L; Fu, J; Wang, X; Sui, L

2016-10-10

Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods

Directory of Open Access Journals (Sweden)

Liming Wang

Full Text Available Dilated cardiomyopathy (DCM is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs and microRNAs (miRNAs of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family. Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1, potential TFs, as well as potential miRNAs, might be involved in DCM.
Gene Expression Differences in Peripheral Blood of Parkinson's Disease Patients with Distinct Progression Profiles.

Directory of Open Access Journals (Sweden)

Raquel Pinho

Full Text Available The prognosis of neurodegenerative disorders is clinically challenging due to the inexistence of established biomarkers for predicting disease progression. Here, we performed an exploratory cross-sectional, case-control study aimed at determining whether gene expression differences in peripheral blood may be used as a signature of Parkinson's disease (PD progression, thereby shedding light into potential molecular mechanisms underlying disease development. We compared transcriptional profiles in the blood from 34 PD patients who developed postural instability within ten years with those of 33 patients who did not develop postural instability within this time frame. Our study identified >200 differentially expressed genes between the two groups. The expression of several of the genes identified was previously found deregulated in animal models of PD and in PD patients. Relevant genes were selected for validation by real-time PCR in a subset of patients. The genes validated were linked to nucleic acid metabolism, mitochondria, immune response and intracellular-transport. Interestingly, we also found deregulation of these genes in a dopaminergic cell model of PD, a simple paradigm that can now be used to further dissect the role of these molecular players on dopaminergic cell loss. Altogether, our study provides preliminary evidence that expression changes in specific groups of genes and pathways, detected in peripheral blood samples, may be correlated with differential PD progression. Our exploratory study suggests that peripheral gene expression profiling may prove valuable for assisting in prediction of PD prognosis, and identifies novel culprits possibly involved in dopaminergic cell death. Given the exploratory nature of our study, further investigations using independent, well-characterized cohorts will be essential in order to validate our candidates as predictors of PD prognosis and to definitively confirm the value of gene expression
Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms

Directory of Open Access Journals (Sweden)

Deling Wang

2018-03-01

Full Text Available Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS, random forest (RF, and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer.
Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms.

Science.gov (United States)

Wang, Deling; Li, Jia-Rui; Zhang, Yu-Hang; Chen, Lei; Huang, Tao; Cai, Yu-Dong

2018-03-12

Breast cancer is one of the most common malignancies in women. Patient-derived tumor xenograft (PDX) model is a cutting-edge approach for drug research on breast cancer. However, PDX still exhibits differences from original human tumors, thereby challenging the molecular understanding of tumorigenesis. In particular, gene expression changes after tissues are transplanted from human to mouse model. In this study, we propose a novel computational method by incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), random forest (RF), and rough set-based rule learning, to identify genes with significant expression differences between PDX and original human tumors. First, 831 breast tumors, including 657 PDX and 174 human tumors, were collected. Based on MCFS and RF, 32 genes were then identified to be informative for the prediction of PDX and human tumors and can be used to construct a prediction model. The prediction model exhibits a Matthews coefficient correlation value of 0.777. Seven interpretable interactions within the informative gene were detected based on the rough set-based rule learning. Furthermore, the seven interpretable interactions can be well supported by previous experimental studies. Our study not only presents a method for identifying informative genes with differential expression but also provides insights into the mechanism through which gene expression changes after being transplanted from human tumor into mouse model. This work would be helpful for research and drug development for breast cancer.
In Silico Analysis of Microarray-Based Gene Expression Profiles Predicts Tumor Cell Response to Withanolides

Directory of Open Access Journals (Sweden)

Thomas Efferth

2012-05-01

Full Text Available Withania somnifera (L. Dunal (Indian ginseng, winter cherry, Solanaceae is widely used in traditional medicine. Roots are either chewed or used to prepare beverages (aqueous decocts. The major secondary metabolites of Withania somnifera are the withanolides, which are C-28-steroidal lactone triterpenoids. Withania somnifera extracts exert chemopreventive and anticancer activities in vitro and in vivo. The aims of the present in silico study were, firstly, to investigate whether tumor cells develop cross-resistance between standard anticancer drugs and withanolides and, secondly, to elucidate the molecular determinants of sensitivity and resistance of tumor cells towards withanolides. Using IC50 concentrations of eight different withanolides (withaferin A, withaferin A diacetate, 3-azerininylwithaferin A, withafastuosin D diacetate, 4-B-hydroxy-withanolide E, isowithanololide E, withafastuosin E, and withaperuvin and 19 established anticancer drugs, we analyzed the cross-resistance profile of 60 tumor cell lines. The cell lines revealed cross-resistance between the eight withanolides. Consistent cross-resistance between withanolides and nitrosoureas (carmustin, lomustin, and semimustin was also observed. Then, we performed transcriptomic microarray-based COMPARE and hierarchical cluster analyses of mRNA expression to identify mRNA expression profiles predicting sensitivity or resistance towards withanolides. Genes from diverse functional groups were significantly associated with response of tumor cells to withaferin A diacetate, e.g. genes functioning in DNA damage and repair, stress response, cell growth regulation, extracellular matrix components, cell adhesion and cell migration, constituents of the ribosome, cytoskeletal organization and regulation, signal transduction, transcription factors, and others.
Hierarchy in gene expression is predictive of risk, progression, and outcome in adult acute myeloid leukemia

Science.gov (United States)

Tripathi, Shubham; Deem, Michael W.

2015-02-01

Cancer progresses with a change in the structure of the gene network in normal cells. We define a measure of organizational hierarchy in gene networks of affected cells in adult acute myeloid leukemia (AML) patients. With a retrospective cohort analysis based on the gene expression profiles of 116 AML patients, we find that the likelihood of future cancer relapse and the level of clinical risk are directly correlated with the level of organization in the cancer related gene network. We also explore the variation of the level of organization in the gene network with cancer progression. We find that this variation is non-monotonic, which implies the fitness landscape in the evolution of AML cancer cells is non-trivial. We further find that the hierarchy in gene expression at the time of diagnosis may be a useful biomarker in AML prognosis.
Hierarchy in gene expression is predictive of risk, progression, and outcome in adult acute myeloid leukemia

International Nuclear Information System (INIS)

Tripathi, Shubham; Deem, Michael W

2015-01-01

Cancer progresses with a change in the structure of the gene network in normal cells. We define a measure of organizational hierarchy in gene networks of affected cells in adult acute myeloid leukemia (AML) patients. With a retrospective cohort analysis based on the gene expression profiles of 116 AML patients, we find that the likelihood of future cancer relapse and the level of clinical risk are directly correlated with the level of organization in the cancer related gene network. We also explore the variation of the level of organization in the gene network with cancer progression. We find that this variation is non-monotonic, which implies the fitness landscape in the evolution of AML cancer cells is non-trivial. We further find that the hierarchy in gene expression at the time of diagnosis may be a useful biomarker in AML prognosis. (paper)
Characterization of differentially expressed genes using high-dimensional co-expression networks

DEFF Research Database (Denmark)

Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

2010-01-01

We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...
Regulation of eucaryotic gene expression

Energy Technology Data Exchange (ETDEWEB)

Brent, R.; Ptashne, M.S

1989-05-23

This patent describes a method of regulating the expression of a gene in a eucaryotic cell. The method consists of: providing in the eucaryotic cell, a peptide, derived from or substantially similar to a peptide of a procaryotic cell able to bind to DNA upstream from or within the gene, the amount of the peptide being sufficient to bind to the gene and thereby control expression of the gene.
A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data.

Science.gov (United States)

Kang, Tianyu; Ding, Wei; Zhang, Luoyan; Ziemek, Daniel; Zarringhalam, Kourosh

2017-12-19

Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database. In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.
Personality and gene expression: Do individual differences exist in the leukocyte transcriptome?

Science.gov (United States)

Vedhara, Kavita; Gill, Sana; Eldesouky, Lameese; Campbell, Bruce K; Arevalo, Jesusa M G; Ma, Jeffrey; Cole, Steven W

2015-02-01

The temporal and situational stability of personality has led generations of researchers to hypothesize that personality may have enduring effects on health, but the biological mechanisms of such relationships remain poorly understood. In the present study, we utilized a functional genomics approach to examine the relationship between the 5 major dimensions of personality and patterns of gene expression as predicted by 'behavioural immune response' theory. We specifically focussed on two sets of genes previously linked to stress, threat, and adverse socio-environmental conditions: pro-inflammatory genes and genes involved in Type I interferon and antibody responses. An opportunity sample of 121 healthy individuals was recruited (86 females; mean age 24 years). Individuals completed a validated measure of personality; questions relating to current health behaviours; and provided a 5ml sample of peripheral blood for gene expression analysis. Extraversion was associated with increased expression of pro-inflammatory genes and Conscientiousness was associated with reduced expression of pro-inflammatory genes. Both associations were independent of health behaviours, negative affect, and leukocyte subset distributions. Antiviral and antibody-related gene expression was not associated with any personality dimension. The present data shed new light on the long-observed epidemiological associations between personality, physical health, and human longevity. Further research is required to elucidate the biological mechanisms underlying these associations. Copyright © 2014 Elsevier Ltd. All rights reserved.
Transcriptomic epidemiology of smoking: the effect of smoking on gene expression in lymphocytes

Directory of Open Access Journals (Sweden)

Almasy Laura

2010-07-01

Full Text Available Abstract Background This investigation offers insights into system-wide pathological processes induced in response to cigarette smoke exposure by determining its influences at the gene expression level. Methods We obtained genome-wide quantitative transcriptional profiles from 1,240 individuals from the San Antonio Family Heart Study, including 297 current smokers. Using lymphocyte samples, we identified 20,413 transcripts with significantly detectable expression levels, including both known and predicted genes. Correlation between smoking and gene expression levels was determined using a regression model that allows for residual genetic effects. Results With a conservative false-discovery rate of 5% we identified 323 unique genes (342 transcripts whose expression levels were significantly correlated with smoking behavior. These genes showed significant over-representation within a range of functional categories that correspond well with known smoking-related pathologies, including immune response, cell death, cancer, natural killer cell signaling and xenobiotic metabolism. Conclusions Our results indicate that not only individual genes but entire networks of gene interaction are influenced by cigarette smoking. This is the largest in vivo transcriptomic epidemiological study of smoking to date and reveals the significant and comprehensive influence of cigarette smoke, as an environmental variable, on the expression of genes. The central importance of this manuscript is to provide a summary of the relationships between gene expression and smoking in this exceptionally large cross-sectional data set.
Evaluation of gene expression profile of keratinocytes in response to JP-8 jet fuel

International Nuclear Information System (INIS)

Espinoza, Luis A.; Li Peijun; Lee, Richard Y.; Wang Yue; Boulares, A. Hamid; Clarke, Robert; Smulson, Mark E.

2004-01-01

The skin is the principal barrier against any environmental insult. Therefore, there is a high risk for a large number of military and civilian personnel exposed to jet fuel JP-8 to suffer percutaneous absorption of this fuel. This paper reports the use of cDNA microarray to identify the gene expression profile in normal human epidermal keratinocytes exposed to JP-8 for 24-h and 7-day periods. The effects of JP-8 exposure on keratinocytes at these two different periods induced a set of genes with altered expression in response to this type of insult. Microarray data were visualized using a novel algorithm based on simple statistical analyses to reduce data dimensionality and identify subsets of discriminant genes. Predictive neural networks were built using a multiplayer perceptron to carry out a proper classification task in microarray data in the untreated versus JP-8-treated samples. The pattern of expressions in response to JP-8 provides evidences that detoxificant-related and cell growth regulator genes with the most variability in the level of expression may be useful genetic markers in adverse health effects of personnel exposed to JP-8. The approaches in our analysis provide a simple, safe, novel, and effective method that is reliable in identifying and analyzing gene expression in samples treated with JP-8 or over potential toxic agents. Gene expression data from these studies can be used to build accurate predictive models that separate different molecular profiles. The data establish the use and effectiveness of these approaches for future prospective studies
Identifying Growth Conditions for Nicotiana benthimiana Resulting in Predictable Gene Expression of Promoter-Gus Fusion

Science.gov (United States)

Sandoval, V.; Barton, K.; Longhurst, A.

2012-12-01

Revoluta (Rev) is a transcription factor that establishes leaf polarity inArabidopsis thaliana. Through previous work in Dr. Barton's Lab, it is known that Revoluta binds to the ZPR3 promoter, thus activating the ZPR3 gene product inArabidopsis thaliana. Using this knowledge, two separate DNA constructs were made, one carrying revgene and in the other, the ZPR3 promoter fussed with the GUS gene. When inoculated in Nicotiana benthimiana (tobacco), the pMDC32 plasmid produces the Rev protein. Rev binds to the ZPR3 promoter thereby activating the transcription of the GUS gene, which can only be expressed in the presence of Rev. When GUS protein comes in contact with X-Gluc it produce the blue stain seen (See Figure 1). In the past, variability has been seen of GUS expression on tobacco therefore we hypothesized that changing the growing conditions and leaf age might improve how well it's expressed.
Array2BIO: from microarray expression data to functional annotation of co-regulated genes

Directory of Open Access Journals (Sweden)

Rasley Amy

2006-06-01

Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

Altered global gene expression profiles in human gastrointestinal epithelial Caco2 cells exposed to nanosilver

Directory of Open Access Journals (Sweden)

Saura C. Sahu

Full Text Available Extensive consumer exposure to food- and cosmetics-related consumer products containing nanosilver is of public safety concern. Therefore, there is a need for suitable in vitro models and sensitive predictive rapid screening methods to assess their toxicity. Toxicogenomic profile showing subtle changes in gene expressions following nanosilver exposure is a sensitive toxicological endpoint for this purpose. We evaluated the Caco2 cells and global gene expression profiles as tools for predictive rapid toxicity screening of nanosilver. We evaluated and compared the gene expression profiles of Caco-2 cells exposed to 20 nm and 50 nm nanosilver at a concentration 2.5 μg/ml. The global gene expression analysis of Caco2 cells exposed to 20 nm nanosilver showed that a total of 93 genes were altered at 4 h exposure, out of which 90 genes were up-regulated and 3 genes were down-regulated. The 24 h exposure of 20 nm silver altered 15 genes in Caco2 cells, out of which 14 were up-regulated and one was down-regulated. The most pronounced changes in gene expression were detected at 4 h. The greater size (50 nm nanosilver at 4 h exposure altered more genes by more different pathways than the smaller (20 nm one. Metallothioneins and heat shock proteins were highly up-regulated as a result of exposure to both the nanosilvers. The cellular pathways affected by the nanosilver exposure is likely to lead to increased toxicity. The results of our study presented here suggest that the toxicogenomic characterization of Caco2 cells is a valuable in vitro tool for assessing toxicity of nanomaterials such as nanosilver. Keywords: Nanosilver, Silver nanoparticles, Nanoparticles, Toxicogenomics, DNA microarray, Global gene expression profiles, Caco2 cells
Customized oligonucleotide microarray gene expression-based classification of neuroblastoma patients outperforms current clinical risk stratification.

Science.gov (United States)

Oberthuer, André; Berthold, Frank; Warnat, Patrick; Hero, Barbara; Kahlert, Yvonne; Spitz, Rüdiger; Ernestus, Karen; König, Rainer; Haas, Stefan; Eils, Roland; Schwab, Manfred; Brors, Benedikt; Westermann, Frank; Fischer, Matthias

2006-11-01

To develop a gene expression-based classifier for neuroblastoma patients that reliably predicts courses of the disease. Two hundred fifty-one neuroblastoma specimens were analyzed using a customized oligonucleotide microarray comprising 10,163 probes for transcripts with differential expression in clinical subgroups of the disease. Subsequently, the prediction analysis for microarrays (PAM) was applied to a first set of patients with maximally divergent clinical courses (n = 77). The classification accuracy was estimated by a complete 10-times-repeated 10-fold cross validation, and a 144-gene predictor was constructed from this set. This classifier's predictive power was evaluated in an independent second set (n = 174) by comparing results of the gene expression-based classification with those of risk stratification systems of current trials from Germany, Japan, and the United States. The first set of patients was accurately predicted by PAM (cross-validated accuracy, 99%). Within the second set, the PAM classifier significantly separated cohorts with distinct courses (3-year event-free survival [EFS] 0.86 +/- 0.03 [favorable; n = 115] v 0.52 +/- 0.07 [unfavorable; n = 59] and 3-year overall survival 0.99 +/- 0.01 v 0.84 +/- 0.05; both P model, the PAM predictor classified patients of the second set more accurately than risk stratification of current trials from Germany, Japan, and the United States (P < .001; hazard ratio, 4.756 [95% CI, 2.544 to 8.893]). Integration of gene expression-based class prediction of neuroblastoma patients may improve risk estimation of current neuroblastoma trials.
Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

Directory of Open Access Journals (Sweden)

Takano Eriko

2011-09-01

Full Text Available Abstract Background Streptomyces coelicolor, a model organism of antibiotic producing bacteria, has one of the largest genomes of the bacterial kingdom, including 7825 predicted protein coding genes. A large number of these genes, nearly 34%, are functionally orphan (hypothetical proteins with unknown function. However, in gene expression time course data, many of these functionally orphan genes show interesting expression patterns. Results In this paper, we analyzed all functionally orphan genes of Streptomyces coelicolor and identified a list of "high priority" orphans by combining gene expression analysis and additional phylogenetic information (i.e. the level of evolutionary conservation of each protein. Conclusions The prioritized orphan genes are promising candidates to be examined experimentally in the lab for further characterization of their function.
Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.

Science.gov (United States)

Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P

2015-04-23

With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.
Altered gene expression in blood and sputum in COPD frequent exacerbators in the ECLIPSE cohort.

Directory of Open Access Journals (Sweden)

Dave Singh

Full Text Available Patients with chronic obstructive pulmonary disease (COPD who are defined as frequent exacerbators suffer with 2 or more exacerbations every year. The molecular mechanisms responsible for this phenotype are poorly understood. We investigated gene expression profile patterns associated with frequent exacerbations in sputum and blood cells in a well-characterised cohort. Samples from subjects from the ECLIPSE COPD cohort were used; sputum and blood samples from 138 subjects were used for microarray gene expression analysis, while blood samples from 438 subjects were used for polymerase chain reaction (PCR testing. Using microarray, 150 genes were differentially expressed in blood (>±1.5 fold change, p≤0.01 between frequent compared to non-exacerbators. In sputum cells, only 6 genes were differentially expressed. The differentially regulated genes in blood included downregulation of those involved in lymphocyte signalling and upregulation of pro-apoptotic signalling genes. Multivariate analysis of the microarray data followed by confirmatory PCR analysis identified 3 genes that predicted frequent exacerbations; B3GNT, LAF4 and ARHGEF10. The sensitivity and specificity of these 3 genes to predict the frequent exacerbator phenotype was 88% and 33% respectively. There are alterations in systemic immune function associated with frequent exacerbations; down-regulation of lymphocyte function and a shift towards pro-apoptosis mechanisms are apparent in patients with frequent exacerbations.
Predictive gene lists for breast cancer prognosis: A topographic visualisation study

Directory of Open Access Journals (Sweden)

Lowe David

2008-04-01

Full Text Available Abstract Background The controversy surrounding the non-uniqueness of predictive gene lists (PGL of small selected subsets of genes from very large potential candidates as available in DNA microarray experiments is now widely acknowledged 1. Many of these studies have focused on constructing discriminative semi-parametric models and as such are also subject to the issue of random correlations of sparse model selection in high dimensional spaces. In this work we outline a different approach based around an unsupervised patient-specific nonlinear topographic projection in predictive gene lists. Methods We construct nonlinear topographic projection maps based on inter-patient gene-list relative dissimilarities. The Neuroscale, the Stochastic Neighbor Embedding(SNE and the Locally Linear Embedding(LLE techniques have been used to construct two-dimensional projective visualisation plots of 70 dimensional PGLs per patient, classifiers are also constructed to identify the prognosis indicator of each patient using the resulting projections from those visualisation techniques and investigate whether a-posteriori two prognosis groups are separable on the evidence of the gene lists. A literature-proposed predictive gene list for breast cancer is benchmarked against a separate gene list using the above methods. Generalisation ability is investigated by using the mapping capability of Neuroscale to visualise the follow-up study, but based on the projections derived from the original dataset. Results The results indicate that small subsets of patient-specific PGLs have insufficient prognostic dissimilarity to permit a distinction between two prognosis patients. Uncertainty and diversity across multiple gene expressions prevents unambiguous or even confident patient grouping. Comparative projections across different PGLs provide similar results. Conclusion The random correlation effect to an arbitrary outcome induced by small subset selection from very high
Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: Prediction and validation

Directory of Open Access Journals (Sweden)

Lahiri Ansuman

2011-09-01

Full Text Available Abstract Background HIP1 Protein Interactor (HIPPI is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS, present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. Results We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Conclusions Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a
Identifying the Gene Signatures from Gene-Pathway Bipartite Network Guarantees the Robust Model Performance on Predicting the Cancer Prognosis

Directory of Open Access Journals (Sweden)

Li He

2014-01-01

Full Text Available For the purpose of improving the prediction of cancer prognosis in the clinical researches, various algorithms have been developed to construct the predictive models with the gene signatures detected by DNA microarrays. Due to the heterogeneity of the clinical samples, the list of differentially expressed genes (DEGs generated by the statistical methods or the machine learning algorithms often involves a number of false positive genes, which are not associated with the phenotypic differences between the compared clinical conditions, and subsequently impacts the reliability of the predictive models. In this study, we proposed a strategy, which combined the statistical algorithm with the gene-pathway bipartite networks, to generate the reliable lists of cancer-related DEGs and constructed the models by using support vector machine for predicting the prognosis of three types of cancers, namely, breast cancer, acute myeloma leukemia, and glioblastoma. Our results demonstrated that, combined with the gene-pathway bipartite networks, our proposed strategy can efficiently generate the reliable cancer-related DEG lists for constructing the predictive models. In addition, the model performance in the swap analysis was similar to that in the original analysis, indicating the robustness of the models in predicting the cancer outcomes.
Resistance gene expression determines the in vitro chemosensitivity of non-small cell lung cancer (NSCLC)

International Nuclear Information System (INIS)

Glaysher, Sharon; Modi, Paul; Rahamim, Joe; Smith, Mark E; Amer, Khalid; Addis, Bruce; Poole, Matthew; Narayanan, Ajit; Gulliford, Tim J; Andreotti, Peter E; Cree, Ian A; Yiannakis, Dennis; Gabriel, Francis G; Johnson, Penny; Polak, Marta E; Knight, Louise A; Goldthorpe, Zoe; Peregrin, Katharine; Gyi, Mya

2009-01-01

NSCLC exhibits considerable heterogeneity in its sensitivity to chemotherapy and similar heterogeneity is noted in vitro in a variety of model systems. This study has tested the hypothesis that the molecular basis of the observed in vitro chemosensitivity of NSCLC lies within the known resistance mechanisms inherent to these patients' tumors. The chemosensitivity of a series of 49 NSCLC tumors was assessed using the ATP-based tumor chemosensitivity assay (ATP-TCA) and compared with quantitative expression of resistance genes measured by RT-PCR in a Taqman Array™ following extraction of RNA from formalin-fixed paraffin-embedded (FFPE) tissue. There was considerable heterogeneity between tumors within the ATP-TCA, and while this showed no direct correlation with individual gene expression, there was strong correlation of multi-gene signatures for many of the single agents and combinations tested. For instance, docetaxel activity showed some dependence on the expression of drug pumps, while cisplatin activity showed some dependence on DNA repair enzyme expression. Activity of both drugs was influenced more strongly still by the expression of anti- and pro-apoptotic genes by the tumor for both docetaxel and cisplatin. The doublet combinations of cisplatin with gemcitabine and cisplatin with docetaxel showed gene expression signatures incorporating resistance mechanisms for both agents. Genes predicted to be involved in known mechanisms drug sensitivity and resistance correlate well with in vitro chemosensitivity and may allow the definition of predictive signatures to guide individualized chemotherapy in lung cancer
Gene expression signatures for colorectal cancer microsatellite status and HNPCC

DEFF Research Database (Denmark)

Kruhøffer, M; Jensen, J L; Laiho, P

2005-01-01

The majority of microsatellite instable (MSI) colorectal cancers are sporadic, but a subset belongs to the syndrome hereditary non-polyposis colorectal cancer (HNPCC). Microsatellite instability is caused by dysfunction of the mismatch repair (MMR) system that leads to a mutator phenotype, and MSI...... of 101 stage II and III colorectal cancers (34 MSI, 67 microsatellite stable (MSS)) using high-density oligonucleotide microarrays. From these data, we constructed a nine-gene signature capable of separating the mismatch repair proficient and deficient tumours. Subsequently, we demonstrated...... is correlated to prognosis and response to chemotherapy. Gene expression signatures as predictive markers are being developed for many cancers, and the identification of a signature for MMR deficiency would be of interest both clinically and biologically. To address this issue, we profiled the gene expression...
Inferring gene expression dynamics via functional regression analysis

Directory of Open Access Journals (Sweden)

Leng Xiaoyan

2008-01-01

Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.
Synthetic promoter libraries- tuning of gene expression

DEFF Research Database (Denmark)

Hammer, Karin; Mijakovic, Ivan; Jensen, Peter Ruhdal

2006-01-01

knockout and strong overexpression. However, applications such as metabolic optimization and control analysis necessitate a continuous set of expression levels with only slight increments in strength to cover a specific window around the wildtype expression level of the studied gene; this requirement can......The study of gene function often requires changing the expression of a gene and evaluating the consequences. In principle, the expression of any given gene can be modulated in a quasi-continuum of discrete expression levels but the traditional approaches are usually limited to two extremes: gene...
MicroRNA expression, target genes, and signaling pathways in infants with a ventricular septal defect.

Science.gov (United States)

Chai, Hui; Yan, Zhaoyuan; Huang, Ke; Jiang, Yuanqing; Zhang, Lin

2018-02-01

This study aimed to systematically investigate the relationship between miRNA expression and the occurrence of ventricular septal defect (VSD), and characterize the miRNA target genes and pathways that can lead to VSD. The miRNAs that were differentially expressed in blood samples from VSD and normal infants were screened and validated by implementing miRNA microarrays and qRT-PCR. The target genes regulated by differentially expressed miRNAs were predicted using three target gene databases. The functions and signaling pathways of the target genes were enriched using the GO database and KEGG database, respectively. The transcription and protein expression of specific target genes in critical pathways were compared in the VSD and normal control groups using qRT-PCR and western blotting, respectively. Compared with the normal control group, the VSD group had 22 differentially expressed miRNAs; 19 were downregulated and three were upregulated. The 10,677 predicted target genes participated in many biological functions related to cardiac development and morphogenesis. Four target genes (mGLUR, Gq, PLC, and PKC) were involved in the PKC pathway and four (ECM, FAK, PI3 K, and PDK1) were involved in the PI3 K-Akt pathway. The transcription and protein expression of these eight target genes were significantly upregulated in the VSD group. The 22 miRNAs that were dysregulated in the VSD group were mainly downregulated, which may result in the dysregulation of several key genes and biological functions related to cardiac development. These effects could also be exerted via the upregulation of eight specific target genes, the subsequent over-activation of the PKC and PI3 K-Akt pathways, and the eventual abnormal cardiac development and VSD.
Expression of genes encoding multi-transmembrane proteins in specific primate taste cell populations.

Directory of Open Access Journals (Sweden)

Bryan D Moyer

Full Text Available BACKGROUND: Using fungiform (FG and circumvallate (CV taste buds isolated by laser capture microdissection and analyzed using gene arrays, we previously constructed a comprehensive database of gene expression in primates, which revealed over 2,300 taste bud-associated genes. Bioinformatics analyses identified hundreds of genes predicted to encode multi-transmembrane domain proteins with no previous association with taste function. A first step in elucidating the roles these gene products play in gustation is to identify the specific taste cell types in which they are expressed. METHODOLOGY/PRINCIPAL FINDINGS: Using double label in situ hybridization analyses, we identified seven new genes expressed in specific taste cell types, including sweet, bitter, and umami cells (TRPM5-positive, sour cells (PKD2L1-positive, as well as other taste cell populations. Transmembrane protein 44 (TMEM44, a protein with seven predicted transmembrane domains with no homology to GPCRs, is expressed in a TRPM5-negative and PKD2L1-negative population that is enriched in the bottom portion of taste buds and may represent developmentally immature taste cells. Calcium homeostasis modulator 1 (CALHM1, a component of a novel calcium channel, along with family members CALHM2 and CALHM3; multiple C2 domains; transmembrane 1 (MCTP1, a calcium-binding transmembrane protein; and anoctamin 7 (ANO7, a member of the recently identified calcium-gated chloride channel family, are all expressed in TRPM5 cells. These proteins may modulate and effect calcium signalling stemming from sweet, bitter, and umami receptor activation. Synaptic vesicle glycoprotein 2B (SV2B, a regulator of synaptic vesicle exocytosis, is expressed in PKD2L1 cells, suggesting that this taste cell population transmits tastant information to gustatory afferent nerve fibers via exocytic neurotransmitter release. CONCLUSIONS/SIGNIFICANCE: Identification of genes encoding multi-transmembrane domain proteins
Gene expression profiling in Ishikawa cells: A fingerprint for estrogen active compounds

International Nuclear Information System (INIS)

Boehme, Kathleen; Simon, Stephanie; Mueller, Stefan O.

2009-01-01

Several anthropogenous and naturally occurring substances, referred to as estrogen active compounds (EACs), are able to interfere with hormone and in particular estrogen receptor signaling. EACs can either cause adverse health effects in humans and wildlife populations or have beneficial effects on estrogen-dependent diseases. The aim of this study was to examine global gene expression profiles in estrogen receptor (ER)-proficient Ishikawa plus and ER-deficient Ishikawa minus endometrial cancer cells treated with selected well-known EACs (Diethylstilbestrol, Genistein, Zearalenone, Resveratrol, Bisphenol A and o,p'-DDT). We also investigated the effect of the pure antiestrogen ICI 182,780 (ICI) on the expression patterns caused by these compounds. Transcript levels were quantified 24 h after compound treatment using Illumina BeadChip Arrays. We identified 87 genes with similar expression changes in response to all EAC treatments in Ishikawa plus. ICI lowered the magnitude or reversed the expression of these genes, indicating ER dependent regulation. Apart from estrogenic gene regulation, Bisphenol A, o,p'-DDT, Zearalenone, Genistein and Resveratrol displayed similarities to ICI in their expression patterns, suggesting mixed estrogenic/antiestrogenic properties. In particular, the predominant antiestrogenic expression response of Resveratrol could be clearly distinguished from the other test compounds, indicating a distinct mechanism of action. Divergent gene expression patterns of the phytoestrogens, as well as weaker estrogenic gene expression regulation determined for the anthropogenous chemicals Bisphenol A and o,p'-DDT, warrants a careful assessment of potential detrimental and/or beneficial effects of EACs. The characteristic expression fingerprints and the identified subset of putative marker genes can be used for screening chemicals with an unknown mode of action and for predicting their potential to exert endocrine disrupting effects
Adaptive Evolution of Gene Expression in Drosophila.

Science.gov (United States)

Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

2017-08-08

Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Adaptive Evolution of Gene Expression in Drosophila

Directory of Open Access Journals (Sweden)

Armita Nourmohammad

2017-08-01

Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.
Manipulation of colony environment modulates honey bee aggression and brain gene expression.

Science.gov (United States)

Rittschof, C C; Robinson, G E

2013-11-01

The social environment plays an essential role in shaping behavior for most animals. Social effects on behavior are often linked to changes in brain gene expression. In the honey bee (Apis mellifera L.), social modulation of individual aggression allows colonies to adjust the intensity with which they defend their hive in response to predation threat. Previous research has showed social effects on both aggression and aggression-related brain gene expression in honey bees, caused by alarm pheromone and unknown factors related to colony genotype. For example, some bees from less aggressive genetic stock reared in colonies with genetic predispositions toward increased aggression show both increased aggression and more aggressive-like brain gene expression profiles. We tested the hypothesis that exposure to a colony environment influenced by high levels of predation threat results in increased aggression and aggressive-like gene expression patterns in individual bees. We assessed gene expression using four marker genes. Experimentally induced predation threats modified behavior, but the effect was opposite of our predictions: disturbed colonies showed decreased aggression. Disturbed colonies also decreased foraging activity, suggesting that they did not habituate to threats; other explanations for this finding are discussed. Bees in disturbed colonies also showed changes in brain gene expression, some of which paralleled behavioral findings. These results show that bee aggression and associated molecular processes are subject to complex social influences. © 2013 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Systematic assessment of multi-gene predictors of pan-cancer cell line sensitivity to drugs exploiting gene expression data [version 1; referees: 2 approved

Directory of Open Access Journals (Sweden)

Linh Nguyen

2016-12-01

Full Text Available Background: Selected gene mutations are routinely used to guide the selection of cancer drugs for a given patient tumour. Large pharmacogenomic data sets were introduced to discover more of these single-gene markers of drug sensitivity. Very recently, machine learning regression has been used to investigate how well cancer cell line sensitivity to drugs is predicted depending on the type of molecular profile. The latter has revealed that gene expression data is the most predictive profile in the pan-cancer setting. However, no study to date has exploited GDSC data to systematically compare the performance of machine learning models based on multi-gene expression data against that of widely-used single-gene markers based on genomics data. Methods: Here we present this systematic comparison using Random Forest (RF classifiers exploiting the expression levels of 13,321 genes and an average of 501 tested cell lines per drug. To account for time-dependent batch effects in IC50 measurements, we employ independent test sets generated with more recent GDSC data than that used to train the predictors and show that this is a more realistic validation than K-fold cross-validation. Results and Discussion: Across 127 GDSC drugs, our results show that the single-gene markers unveiled by the MANOVA analysis tend to achieve higher precision than these RF-based multi-gene models, at the cost of generally having a poor recall (i.e. correctly detecting only a small part of the cell lines sensitive to the drug. Regarding overall classification performance, about two thirds of the drugs are better predicted by multi-gene RF classifiers. Among the drugs with the most predictive of these models, we found pyrimethamine, sunitinib and 17-AAG. Conclusions: We now know that this type of models can predict in vitro tumour response to these drugs. These models can thus be further investigated on in vivo tumour models.
Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study.

Science.gov (United States)

Tang, Xin-Ran; Li, Ying-Qin; Liang, Shao-Bo; Jiang, Wei; Liu, Fang; Ge, Wen-Xiu; Tang, Ling-Long; Mao, Yan-Ping; He, Qing-Mei; Yang, Xiao-Jing; Zhang, Yuan; Wen, Xin; Zhang, Jian; Wang, Ya-Qin; Zhang, Pan-Pan; Sun, Ying; Yun, Jing-Ping; Zeng, Jing; Li, Li; Liu, Li-Zhi; Liu, Na; Ma, Jun

2018-03-01

Gene expression patterns can be used as prognostic biomarkers in various types of cancers. We aimed to identify a gene expression pattern for individual distant metastatic risk assessment in patients with locoregionally advanced nasopharyngeal carcinoma. In this multicentre, retrospective, cohort analysis, we included 937 patients with locoregionally advanced nasopharyngeal carcinoma from three Chinese hospitals: the Sun Yat-sen University Cancer Center (Guangzhou, China), the Affiliated Hospital of Guilin Medical University (Guilin, China), and the First People's Hospital of Foshan (Foshan, China). Using microarray analysis, we profiled mRNA gene expression between 24 paired locoregionally advanced nasopharyngeal carcinoma tumours from patients at Sun Yat-sen University Cancer Center with or without distant metastasis after radical treatment. Differentially expressed genes were examined using digital expression profiling in a training cohort (Guangzhou training cohort; n=410) to build a gene classifier using a penalised regression model. We validated the prognostic accuracy of this gene classifier in an internal validation cohort (Guangzhou internal validation cohort, n=204) and two external independent cohorts (Guilin cohort, n=165; Foshan cohort, n=158). The primary endpoint was distant metastasis-free survival. Secondary endpoints were disease-free survival and overall survival. We identified 137 differentially expressed genes between metastatic and non-metastatic locoregionally advanced nasopharyngeal carcinoma tissues. A distant metastasis gene signature for locoregionally advanced nasopharyngeal carcinoma (DMGN) that consisted of 13 genes was generated to classify patients into high-risk and low-risk groups in the training cohort. Patients with high-risk scores in the training cohort had shorter distant metastasis-free survival (hazard ratio [HR] 4·93, 95% CI 2·99-8·16; padvanced nasopharyngeal carcinoma and might be able to predict which patients benefit

Modeling insertional mutagenesis using gene length and expression in murine embryonic stem cells.

Directory of Open Access Journals (Sweden)

Alex S Nord

2007-07-01

Full Text Available High-throughput mutagenesis of the mammalian genome is a powerful means to facilitate analysis of gene function. Gene trapping in embryonic stem cells (ESCs is the most widely used form of insertional mutagenesis in mammals. However, the rules governing its efficiency are not fully understood, and the effects of vector design on the likelihood of gene-trapping events have not been tested on a genome-wide scale.In this study, we used public gene-trap data to model gene-trap likelihood. Using the association of gene length and gene expression with gene-trap likelihood, we constructed spline-based regression models that characterize which genes are susceptible and which genes are resistant to gene-trapping techniques. We report results for three classes of gene-trap vectors, showing that both length and expression are significant determinants of trap likelihood for all vectors. Using our models, we also quantitatively identified hotspots of gene-trap activity, which represent loci where the high likelihood of vector insertion is controlled by factors other than length and expression. These formalized statistical models describe a high proportion of the variance in the likelihood of a gene being trapped by expression-dependent vectors and a lower, but still significant, proportion of the variance for vectors that are predicted to be independent of endogenous gene expression.The findings of significant expression and length effects reported here further the understanding of the determinants of vector insertion. Results from this analysis can be applied to help identify other important determinants of this important biological phenomenon and could assist planning of large-scale mutagenesis efforts.
Diagnosis of partial body radiation exposure in mice using peripheral blood gene expression profiles.

Directory of Open Access Journals (Sweden)

Sarah K Meadows

2010-07-01

Full Text Available In the event of a terrorist-mediated attack in the United States using radiological or improvised nuclear weapons, it is expected that hundreds of thousands of people could be exposed to life-threatening levels of ionizing radiation. We have recently shown that genome-wide expression analysis of the peripheral blood (PB can generate gene expression profiles that can predict radiation exposure and distinguish the dose level of exposure following total body irradiation (TBI. However, in the event a radiation-mass casualty scenario, many victims will have heterogeneous exposure due to partial shielding and it is unknown whether PB gene expression profiles would be useful in predicting the status of partially irradiated individuals. Here, we identified gene expression profiles in the PB that were characteristic of anterior hemibody-, posterior hemibody- and single limb-irradiation at 0.5 Gy, 2 Gy and 10 Gy in C57Bl6 mice. These PB signatures predicted the radiation status of partially irradiated mice with a high level of accuracy (range 79-100% compared to non-irradiated mice. Interestingly, PB signatures of partial body irradiation were poorly predictive of radiation status by site of injury (range 16-43%, suggesting that the PB molecular response to partial body irradiation was anatomic site specific. Importantly, PB gene signatures generated from TBI-treated mice failed completely to predict the radiation status of partially irradiated animals or non-irradiated controls. These data demonstrate that partial body irradiation, even to a single limb, generates a characteristic PB signature of radiation injury and thus may necessitate the use of multiple signatures, both partial body and total body, to accurately assess the status of an individual exposed to radiation.
Differential gene expression between African American and European American colorectal cancer patients.

Directory of Open Access Journals (Sweden)

Biljana Jovov

Full Text Available The incidence and mortality of colorectal cancer (CRC is higher in African Americans (AAs than other ethnic groups in the U. S., but reasons for the disparities are unknown. We performed gene expression profiling of sporadic CRCs from AAs vs. European Americans (EAs to assess the contribution to CRC disparities. We evaluated the gene expression of 43 AA and 43 EA CRC tumors matched by stage and 40 matching normal colorectal tissues using the Agilent human whole genome 4x44K cDNA arrays. Gene and pathway analyses were performed using Significance Analysis of Microarrays (SAM, Ten-fold cross validation, and Ingenuity Pathway Analysis (IPA. SAM revealed that 95 genes were differentially expressed between AA and EA patients at a false discovery rate of ≤5%. Using IPA we determined that most prominent disease and pathway associations of differentially expressed genes were related to inflammation and immune response. Ten-fold cross validation demonstrated that following 10 genes can predict ethnicity with an accuracy of 94%: CRYBB2, PSPH, ADAL, VSIG10L, C17orf81, ANKRD36B, ZNF835, ARHGAP6, TRNT1 and WDR8. Expression of these 10 genes was validated by qRT-PCR in an independent test set of 28 patients (10 AA, 18 EA. Our results are the first to implicate differential gene expression in CRC racial disparities and indicate prominent difference in CRC inflammation between AA and EA patients. Differences in susceptibility to inflammation support the existence of distinct tumor microenvironments in these two patient populations.
Double-filter identification of vascular-expressed genes using Arabidopsis plants with vascular hypertrophy and hypotrophy.

Science.gov (United States)

Ckurshumova, Wenzislava; Scarpella, Enrico; Goldstein, Rochelle S; Berleth, Thomas

2011-08-01

Genes expressed in vascular tissues have been identified by several strategies, usually with a focus on mature vascular cells. In this study, we explored the possibility of using two opposite types of altered tissue compositions in combination with a double-filter selection to identify genes with a high probability of vascular expression in early organ primordia. Specifically, we generated full-transcriptome microarray profiles of plants with (a) genetically strongly reduced and (b) pharmacologically vastly increased vascular tissues and identified a reproducible cohort of 158 transcripts that fulfilled the dual requirement of being underrepresented in (a) and overrepresented in (b). In order to assess the predictive value of our identification scheme for vascular gene expression, we determined the expression patterns of genes in two unbiased subsamples. First, we assessed the expression patterns of all twenty annotated transcription factor genes from the cohort of 158 genes and found that seventeen of the twenty genes were preferentially expressed in leaf vascular cells. Remarkably, fifteen of these seventeen vascular genes were clearly expressed already very early in leaf vein development. Twelve genes with published leaf expression patterns served as a second subsample to monitor the representation of vascular genes in our cohort. Of those twelve genes, eleven were preferentially expressed in leaf vascular tissues. Based on these results we propose that our compendium of 158 genes represents a sample that is highly enriched for genes expressed in vascular tissues and that our approach is particularly suited to detect genes expressed in vascular cell lineages at early stages of their inception. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Gene expression profile of blood cells for the prediction of delayed cerebral ischemia after intracranial aneurysm rupture: a pilot study in humans.

Science.gov (United States)

Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel

2013-01-01

Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single gene approach of DCI has demonstrated interest in humans. We hypothesized that whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit occurring within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed by microarrays. Transcriptomic analysis was performed using long oligonucleotide microarrays representing 25,000 genes. Quantitative PCR: 1 µg of total RNA extracted was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI. Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in
Vertebrate gene predictions and the problem of large genes

DEFF Research Database (Denmark)

Wang, Jun; Li, ShengTing; Zhang, Yong

2003-01-01

To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistent...
General theory for integrated analysis of growth, gene, and protein expression in biofilms.

Science.gov (United States)

Zhang, Tianyu; Pabst, Breana; Klapper, Isaac; Stewart, Philip S

2013-01-01

A theory for analysis and prediction of spatial and temporal patterns of gene and protein expression within microbial biofilms is derived. The theory integrates phenomena of solute reaction and diffusion, microbial growth, mRNA or protein synthesis, biomass advection, and gene transcript or protein turnover. Case studies illustrate the capacity of the theory to simulate heterogeneous spatial patterns and predict microbial activities in biofilms that are qualitatively different from those of planktonic cells. Specific scenarios analyzed include an inducible GFP or fluorescent protein reporter, a denitrification gene repressed by oxygen, an acid stress response gene, and a quorum sensing circuit. It is shown that the patterns of activity revealed by inducible stable fluorescent proteins or reporter unstable proteins overestimate the region of activity. This is due to advective spreading and finite protein turnover rates. In the cases of a gene induced by either limitation for a metabolic substrate or accumulation of a metabolic product, maximal expression is predicted in an internal stratum of the biofilm. A quorum sensing system that includes an oxygen-responsive negative regulator exhibits behavior that is distinct from any stage of a batch planktonic culture. Though here the analyses have been limited to simultaneous interactions of up to two substrates and two genes, the framework applies to arbitrarily large networks of genes and metabolites. Extension of reaction-diffusion modeling in biofilms to the analysis of individual genes and gene networks is an important advance that dovetails with the growing toolkit of molecular and genetic experimental techniques.
Gene expression in Pseudomonas aeruginosa swarming motility

Directory of Open Access Journals (Sweden)

Déziel Eric

2010-10-01

Full Text Available Abstract Background The bacterium Pseudomonas aeruginosa is capable of three types of motilities: swimming, twitching and swarming. The latter is characterized by a fast and coordinated group movement over a semi-solid surface resulting from intercellular interactions and morphological differentiation. A striking feature of swarming motility is the complex fractal-like patterns displayed by migrating bacteria while they move away from their inoculation point. This type of group behaviour is still poorly understood and its characterization provides important information on bacterial structured communities such as biofilms. Using GeneChip® Affymetrix microarrays, we obtained the transcriptomic profiles of both bacterial populations located at the tip of migrating tendrils and swarm center of swarming colonies and compared these profiles to that of a bacterial control population grown on the same media but solidified to not allow swarming motility. Results Microarray raw data were corrected for background noise with the RMA algorithm and quantile normalized. Differentially expressed genes between the three conditions were selected using a threshold of 1.5 log2-fold, which gave a total of 378 selected genes (6.3% of the predicted open reading frames of strain PA14. Major shifts in gene expression patterns are observed in each growth conditions, highlighting the presence of distinct bacterial subpopulations within a swarming colony (tendril tips vs. swarm center. Unexpectedly, microarrays expression data reveal that a minority of genes are up-regulated in tendril tip populations. Among them, we found energy metabolism, ribosomal protein and transport of small molecules related genes. On the other hand, many well-known virulence factors genes were globally repressed in tendril tip cells. Swarm center cells are distinct and appear to be under oxidative and copper stress responses. Conclusions Results reported in this study show that, as opposed to
Fitness Effects of Network Non-Linearity Induced by Gene Expression Noise

Science.gov (United States)

Ray, Christian; Cooper, Tim; Balazsi, Gabor

2012-02-01

In the non-equilibrium dynamics of growing microbial cells, metabolic enzymes can create non-linearities in metabolite concentration because of non-linear degradation (utilization): an enzyme can saturate in the process of metabolite utilization. Increasing metabolite production past the saturation point then results in an ultrasensitive metabolite response. If the production rate of a metabolite depends on a second enzyme or other protein-mediated process, uncorrelated gene expression noise can thus cause transient metabolite concentration bursts. Such bursts are physiologically unnecessary and may represent a source of selection against the ultrasensitive switch, especially if the fluctuating metabolic intermediate is toxic. Selection may therefore favor correlated gene expression fluctuations for enzymes in the same pathway, such as by same-operon membership in bacteria. Using a modified experimental lac operon system, we are undertaking a combined theoretical-experimental approach to demonstrate that (i) the lac operon has an implicit ultrasensitive switch that we predict is avoided by gene expression correlations induced by same-operon membership; (ii) bacterial growth rates are sensitive to crossing the ultrasensitive threshold. Our results suggest that correlations in intrinsic gene expression noise are exploited by evolution to ameliorate the detrimental effects of nonlinearities in metabolite concentrations.
Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS)

KAUST Repository

Lawton, Jennifer

2012-03-29

Background: The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s) of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required.Results: The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages.Conclusions: In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub-family A and B protein
Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS

Directory of Open Access Journals (Sweden)

Lawton Jennifer

2012-03-01

Full Text Available Abstract Background The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required. Results The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages. Conclusions In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub
Characterization and gene expression analysis of the cir multi-gene family of plasmodium chabaudi chabaudi (AS)

KAUST Repository

Lawton, Jennifer; Brugat, Thibaut; Yan, Yam Xue; Reid, Adam James; Bö hme, Ulrike; Otto, Thomas Dan; Pain, Arnab; Jackson, Andrew; Berriman, Matthew; Cunningham, Deirdre; Preiser, Peter; Langhorne, Jean

2012-01-01

Background: The pir genes comprise the largest multi-gene family in Plasmodium, with members found in P. vivax, P. knowlesi and the rodent malaria species. Despite comprising up to 5% of the genome, little is known about the functions of the proteins encoded by pir genes. P. chabaudi causes chronic infection in mice, which may be due to antigenic variation. In this model, pir genes are called cirs and may be involved in this mechanism, allowing evasion of host immune responses. In order to fully understand the role(s) of CIR proteins during P. chabaudi infection, a detailed characterization of the cir gene family was required.Results: The cir repertoire was annotated and a detailed bioinformatic characterization of the encoded CIR proteins was performed. Two major sub-families were identified, which have been named A and B. Members of each sub-family displayed different amino acid motifs, and were thus predicted to have undergone functional divergence. In addition, the expression of the entire cir repertoire was analyzed via RNA sequencing and microarray. Up to 40% of the cir gene repertoire was expressed in the parasite population during infection, and dominant cir transcripts could be identified. In addition, some differences were observed in the pattern of expression between the cir subgroups at the peak of P. chabaudi infection. Finally, specific cir genes were expressed at different time points during asexual blood stages.Conclusions: In conclusion, the large number of cir genes and their expression throughout the intraerythrocytic cycle of development indicates that CIR proteins are likely to be important for parasite survival. In particular, the detection of dominant cir transcripts at the peak of P. chabaudi infection supports the idea that CIR proteins are expressed, and could perform important functions in the biology of this parasite. Further application of the methodologies described here may allow the elucidation of CIR sub-family A and B protein
Hormonal Modulation of Breast Cancer Gene Expression: Implications for Intrinsic Subtyping in Premenopausal Women

OpenAIRE

Bernhardt, Sarah M.; Dasari, Pallave; Walsh, David; Townsend, Amanda R.; Price, Timothy J.; Ingman, Wendy V.

2016-01-01

Clinics are increasingly adopting gene-expression profiling to diagnose breast cancer subtype, providing an intrinsic, molecular portrait of the tumor. For example, the PAM50-based Prosigna test quantifies expression of 50 key genes to classify breast cancer subtype, and this method of classification has been demonstrated to be superior over traditional immunohistochemical methods that detect proteins, to predict risk of disease recurrence. However, these tests were largely developed and vali...
Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

Directory of Open Access Journals (Sweden)

Clive H Glover

2006-11-01

Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42 showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.
Gene Expression Analysis to Assess the Relevance of Rodent Models to Human Lung Injury.

Science.gov (United States)

Sweeney, Timothy E; Lofgren, Shane; Khatri, Purvesh; Rogers, Angela J

2017-08-01

The relevance of animal models to human diseases is an area of intense scientific debate. The degree to which mouse models of lung injury recapitulate human lung injury has never been assessed. Integrating data from both human and animal expression studies allows for increased statistical power and identification of conserved differential gene expression across organisms and conditions. We sought comprehensive integration of gene expression data in experimental acute lung injury (ALI) in rodents compared with humans. We performed two separate gene expression multicohort analyses to determine differential gene expression in experimental animal and human lung injury. We used correlational and pathway analyses combined with external in vitro gene expression data to identify both potential drivers of underlying inflammation and therapeutic drug candidates. We identified 21 animal lung tissue datasets and three human lung injury bronchoalveolar lavage datasets. We show that the metasignatures of animal and human experimental ALI are significantly correlated despite these widely varying experimental conditions. The gene expression changes among mice and rats across diverse injury models (ozone, ventilator-induced lung injury, LPS) are significantly correlated with human models of lung injury (Pearson r = 0.33-0.45, P human lung injury. Predicted therapeutic targets, peptide ligand signatures, and pathway analyses are also all highly overlapping. Gene expression changes are similar in animal and human experimental ALI, and provide several physiologic and therapeutic insights to the disease.
Subclinical pregnancy toxemia induced gene expression changes in ovine placenta and uterus

Directory of Open Access Journals (Sweden)

Ramanathan K Kasimanickam

2016-08-01

Full Text Available The objective was to elucidate gene expression differences in uterus, caruncle and cotyledon of ewes with subclinical pregnancy toxemia (SCPT and healthy ewes, and to identify associated biological functions and pathways involved in pregnancy toxemia. On Day 136 (±1 day post breeding ewes (n=18 had body condition score (BCS; 1 to 5; 1, emaciated; 5, obese assessed and blood samples were collected for plasma glucose and β-hydroxybutyrate (BHBA analyses. The ewes were euthanized and tissue samples were collected from the gravid uterus and placentomes. Based on BCS (2.0 ± 0.02, glucose (2.4 ± 0.33 and BHBA (0.97 ± 0.06 concentrations, ewes (n=10 were grouped as healthy (n=5 and subclinical SCPT (n=5 ewes. The mRNA expressions were determined by quantitative PCR method and prediction of miRNA partners and target genes for the predicted miRNA were identified using miRDB (http://mirdb.org/miRDB/. Top ranked target genes were used to identify associated biological functions and pathways in response to subclinical pregnancy toxemia using PANTHER. The angiogenesis genes VEGF and PlGF, and AdipoQ, AdipoR2, PPARG, LEP, IGF1, IGF2, IL1b and TNFα mRNA expressions were lower in abundances; whereas hypoxia genes eNOS, HIF1a, and HIF 2a, and sFlt1 and KDR mRNA expressions were greater in abundances in uterus and placenta of SCPT ewes compared to healthy ewes (P<0.05. The predicted miRNA and associated target genes contributed to several biological processes, including apoptosis, biological adhesion, biological regulation, cellular component biogenesis, cellular process, developmental process, immune system process, localization, metabolic process, multicellular organismal process, reproduction, and response to stimulus. The target genes were involved in several pathways including angiogenesis, cytoskeletal regulation, hypoxia response via HIF activation, interleukin signaling, ubiquitin proteasome and VEGF signaling pathway. In conclusion, genes
Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

Science.gov (United States)

Xu, Pingzhen

2018-01-01

Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

Directory of Open Access Journals (Sweden)

Boris P Hejblum

2015-06-01

Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.
Expression of the human growth hormone variant gene in cultured fibroblasts and transgenic mice

International Nuclear Information System (INIS)

Selden, R.F.; Wagner, T.E.; Blethen, S.; Yun, J.S.; Rowe, M.E.; Goodman, H.M.

1988-01-01

The nucleotide sequence of the human growth hormone variant gene, one of the five members of the growth hormone gene family, predicts that it encodes a growth hormone-like protein. As a first step in determining whether this gene is functional in humans, the authors have expressed a mouse methallothionein I/human growth hormone variant fusion gene in mouse L cells and in transgenic mice. The growth hormone variant protein expressed in transiently transfected L cells is distinct from growth hormone itself with respect to reactivity with anti-growth hormone monoclonal antibodies, behavior during column chromatography, and isoelectric point. Transgenic mice expressing the growth hormone variant protein are 1.4- to 1.9-fold larger than nontransgenic controls, suggesting that the protein has growth-promoting properties
Resistance gene expression determines the in vitro chemosensitivity of non-small cell lung cancer (NSCLC

Directory of Open Access Journals (Sweden)

Amer Khalid

2009-08-01

Full Text Available Abstract Background NSCLC exhibits considerable heterogeneity in its sensitivity to chemotherapy and similar heterogeneity is noted in vitro in a variety of model systems. This study has tested the hypothesis that the molecular basis of the observed in vitro chemosensitivity of NSCLC lies within the known resistance mechanisms inherent to these patients' tumors. Methods The chemosensitivity of a series of 49 NSCLC tumors was assessed using the ATP-based tumor chemosensitivity assay (ATP-TCA and compared with quantitative expression of resistance genes measured by RT-PCR in a Taqman Array™ following extraction of RNA from formalin-fixed paraffin-embedded (FFPE tissue. Results There was considerable heterogeneity between tumors within the ATP-TCA, and while this showed no direct correlation with individual gene expression, there was strong correlation of multi-gene signatures for many of the single agents and combinations tested. For instance, docetaxel activity showed some dependence on the expression of drug pumps, while cisplatin activity showed some dependence on DNA repair enzyme expression. Activity of both drugs was influenced more strongly still by the expression of anti- and pro-apoptotic genes by the tumor for both docetaxel and cisplatin. The doublet combinations of cisplatin with gemcitabine and cisplatin with docetaxel showed gene expression signatures incorporating resistance mechanisms for both agents. Conclusion Genes predicted to be involved in known mechanisms drug sensitivity and resistance correlate well with in vitro chemosensitivity and may allow the definition of predictive signatures to guide individualized chemotherapy in lung cancer.

Presymptomatic Diagnosis of Celiac Disease in Predisposed Children: The Role of Gene Expression Profile.

Science.gov (United States)

Galatola, Martina; Cielo, Donatella; Panico, Camilla; Stellato, Pio; Malamisura, Basilio; Carbone, Lorenzo; Gianfrani, Carmen; Troncone, Riccardo; Greco, Luigi; Auricchio, Renata

2017-09-01

The prevalence of celiac disease (CD) has increased significantly in recent years, and risk prediction and early diagnosis have become imperative especially in at-risk families. In a previous study, we identified individuals with CD based on the expression profile of a set of candidate genes in peripheral blood monocytes. Here we evaluated the expression of a panel of CD candidate genes in peripheral blood mononuclear cells from at-risk infants long time before any symptom or production of antibodies. We analyzed the gene expression of a set of 9 candidate genes, associated with CD, in 22 human leukocyte antigen predisposed children from at-risk families for CD, studied from birth to 6 years of age. Nine of them developed CD (patients) and 13 did not (controls). We analyzed gene expression at 3 different time points (age matched in the 2 groups): 4-19 months before diagnosis, at the time of CD diagnosis, and after at least 1 year of a gluten-free diet. At similar age points, controls were also evaluated. Three genes (KIAA, TAGAP [T-cell Activation GTPase Activating Protein], and SH2B3 [SH2B Adaptor Protein 3]) were overexpressed in patients, compared with controls, at least 9 months before CD diagnosis. At a stepwise discriminant analysis, 4 genes (RGS1 [Regulator of G-protein signaling 1], TAGAP, TNFSF14 [Tumor Necrosis Factor (Ligand) Superfamily member 14], and SH2B3) differentiate patients from controls before serum antibodies production and clinical symptoms. Multivariate equation correctly classified CD from non-CD children in 95.5% of patients. The expression of a small set of candidate genes in peripheral blood mononuclear cells can predict CD at least 9 months before the appearance of any clinical and serological signs of the disease.
Molecular Characterization and Expression Analysis of Equine ( Gene in Horse (

Directory of Open Access Journals (Sweden)

Ki-Duk Song

2014-05-01

Full Text Available The objective of this study was to determine the molecular characteristics of the horse vascular endothelial growth factor alpha gene (VEGFα by constructing a phylogenetic tree, and to investigate gene expression profiles in tissues and blood leukocytes after exercise for development of suitable biomarkers. Using published amino acid sequences of other vertebrate species (human, chimpanzee, mouse, rat, cow, pig, chicken and dog, we constructed a phylogenetic tree which showed that equine VEGFα belonged to the same clade of the pig VEGFα. Analysis for synonymous (Ks and non-synonymous substitution ratios (Ka revealed that the horse VEGFα underwent positive selection. RNA was extracted from blood samples before and after exercise and different tissue samples of three horses. Expression analyses using reverse transcription-polymerase chain reaction (RT-PCR and quantitative-polymerase chain reaction (qPCR showed ubiquitous expression of VEGFα mRNA in skeletal muscle, kidney, thyroid, lung, appendix, colon, spinal cord, and heart tissues. Analysis of differential expression of VEGFα gene in blood leukocytes after exercise indicated a unimodal pattern. These results will be useful in developing biomarkers that can predict the recovery capacity of racing horses.
aeGEPUCI: a database of gene expression in the dengue vector mosquito, Aedes aegypti

Directory of Open Access Journals (Sweden)

James Anthony A

2010-10-01

Full Text Available Abstract Background Aedes aegypti is the principal vector of dengue and yellow fever viruses. The availability of the sequenced and annotated genome enables genome-wide analyses of gene expression in this mosquito. The large amount of data resulting from these analyses requires efficient cataloguing before it becomes useful as the basis for new insights into gene expression patterns and studies of the underlying molecular mechanisms for generating these patterns. Findings We provide a publicly-accessible database and data-mining tool, aeGEPUCI, that integrates 1 microarray analyses of sex- and stage-specific gene expression in Ae. aegypti, 2 functional gene annotation, 3 genomic sequence data, and 4 computational sequence analysis tools. The database can be used to identify genes expressed in particular stages and patterns of interest, and to analyze putative cis-regulatory elements (CREs that may play a role in coordinating these patterns. The database is accessible from the address http://www.aegep.bio.uci.edu. Conclusions The combination of gene expression, function and sequence data coupled with integrated sequence analysis tools allows for identification of expression patterns and streamlines the development of CRE predictions and experiments to assess how patterns of expression are coordinated at the molecular level.
The rules of gene expression in plants: Organ identity and gene body methylation are key factors for regulation of gene expression in Arabidopsis thaliana

Directory of Open Access Journals (Sweden)

Gutiérrez Rodrigo A

2008-09-01

Full Text Available Abstract Background Microarray technology is a widely used approach for monitoring genome-wide gene expression. For Arabidopsis, there are over 1,800 microarray hybridizations representing many different experimental conditions on Affymetrix™ ATH1 gene chips alone. This huge amount of data offers a unique opportunity to infer the principles that govern the regulation of gene expression in plants. Results We used bioinformatics methods to analyze publicly available data obtained using the ATH1 chip from Affymetrix. A total of 1887 ATH1 hybridizations were normalized and filtered to eliminate low-quality hybridizations. We classified and compared control and treatment hybridizations and determined differential gene expression. The largest differences in gene expression were observed when comparing samples obtained from different organs. On average, ten-fold more genes were differentially expressed between organs as compared to any other experimental variable. We defined "gene responsiveness" as the number of comparisons in which a gene changed its expression significantly. We defined genes with the highest and lowest responsiveness levels as hypervariable and housekeeping genes, respectively. Remarkably, housekeeping genes were best distinguished from hypervariable genes by differences in methylation status in their transcribed regions. Moreover, methylation in the transcribed region was inversely correlated (R2 = 0.8 with gene responsiveness on a genome-wide scale. We provide an example of this negative relationship using genes encoding TCA cycle enzymes, by contrasting their regulatory responsiveness to nitrate and methylation status in their transcribed regions. Conclusion Our results indicate that the Arabidopsis transcriptome is largely established during development and is comparatively stable when faced with external perturbations. We suggest a novel functional role for DNA methylation in the transcribed region as a key determinant
Global map of physical interactions among differentially expressed genes in multiple sclerosis relapses and remissions.

Science.gov (United States)

Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat

2011-09-15

to report new sets of genes that according to their gene expression and physical interactions are predicted to be differentially expressed in MS versus healthy subjects, and in MS patients in relapse versus remission. Some of these genes may be useful biomarkers for diagnosing MS and predicting relapses in MS patients.
A 7 gene expression score predicts for radiation response in cancer cervix

International Nuclear Information System (INIS)

Rajkumar, Thangarajan; Vijayalakshmi, Neelakantan; Sabitha, Kesavan; Shirley, Sundersingh; Selvaluxmy, Ganesharaja; Bose, Mayil Vahanan; Nambaru, Lavanya

2009-01-01

Cervical cancer is the most common cancer among Indian women. The current recommendations are to treat the stage IIB, IIIA, IIIB and IVA with radical radiotherapy and weekly cisplatin based chemotherapy. However, Radiotherapy alone can help cure more than 60% of stage IIB and up to 40% of stage IIIB patients. Archival RNA samples from 15 patients who had achieved complete remission and stayed disease free for more than 36 months (No Evidence of Disease or NED group) and 10 patients who had failed radical radiotherapy (Failed group) were included in the study. The RNA were amplified, labelled and hybridized to Stanford microarray chips and analyzed using BRB Array Tools software and Significance Analysis of Microarray (SAM) analysis. 20 genes were selected for further validation using Relative Quantitation (RQ) Taqman assay in a Taqman Low-Density Array (TLDA) format. The RQ value was calculated, using each of the NED sample once as a calibrator. A scoring system was developed based on the RQ value for the genes. Using a seven gene based scoring system, it was possible to distinguish between the tumours which were likely to respond to the radiotherapy and those likely to fail. The mean score ± 2 SE (standard error of mean) was used and at a cut-off score of greater than 5.60, the sensitivity, specificity, Positive predictive value (PPV) and Negative predictive value (NPV) were 0.64, 1.0, 1.0, 0.67, respectively, for the low risk group. We have identified a 7 gene signature which could help identify patients with cervical cancer who can be treated with radiotherapy alone. However, this needs to be validated in a larger patient population
Expression of Sox genes in tooth development.

Science.gov (United States)

Kawasaki, Katsushige; Kawasaki, Maiko; Watanabe, Momoko; Idrus, Erik; Nagai, Takahiro; Oommen, Shelly; Maeda, Takeyasu; Hagiwara, Nobuko; Que, Jianwen; Sharpe, Paul T; Ohazama, Atsushi

2015-01-01

Members of the Sox gene family play roles in many biological processes including organogenesis. We carried out comparative in situ hybridization analysis of seventeen sox genes (Sox1-14, 17, 18, 21) during murine odontogenesis from the epithelial thickening to the cytodifferentiation stages. Localized expression of five Sox genes (Sox6, 9, 13, 14 and 21) was observed in tooth bud epithelium. Sox13 showed restricted expression in the primary enamel knots. At the early bell stage, three Sox genes (Sox8, 11, 17 and 21) were expressed in pre-ameloblasts, whereas two others (Sox5 and 18) showed expression in odontoblasts. Sox genes thus showed a dynamic spatio-temporal expression during tooth development.
Profiling Gene Expression in Germinating Brassica Roots.

Science.gov (United States)

Park, Myoung Ryoul; Wang, Yi-Hong; Hasenstein, Karl H

2014-01-01

Based on previously developed solid-phase gene extraction (SPGE) we examined the mRNA profile in primary roots of Brassica rapa seedlings for highly expressed genes like ACT7 (actin7), TUB (tubulin1), UBQ (ubiquitin), and low expressed GLK (glucokinase) during the first day post-germination. The assessment was based on the mRNA load of the SPGE probe of about 2.1 ng. The number of copies of the investigated genes changed spatially along the length of primary roots. The expression level of all genes differed significantly at each sample position. Among the examined genes ACT7 expression was most even along the root. UBQ was highest at the tip and root-shoot junction (RS). TUB and GLK showed a basipetal gradient. The temporal expression of UBQ was highest in the MZ 9 h after primary root emergence and higher than at any other sample position. Expressions of GLK in EZ and RS increased gradually over time. SPGE extraction is the result of oligo-dT and oligo-dA hybridization and the results illustrate that SPGE can be used for gene expression profiling at high spatial and temporal resolution. SPGE needles can be used within two weeks when stored at 4 °C. Our data indicate that gene expression studies that are based on the entire root miss important differences in gene expression that SPGE is able to resolve for example growth adjustments during gravitropism.
Stage-specific gene expression during sexual development in Phytophthora infestans

DEFF Research Database (Denmark)

Fabritius, Anna-Liisa; Cvitanich, Cristina; Judelson, Howard S.

2002-01-01

Eight genes that are upregulated during sexual development in the heterothallic oomycete, Phytophthora infestans, were identified by suppression subtractive hybridization. Two genes showed very low but detectable expression in vegetative hyphae and became induced about 40- to >100-fold early...... revealed that the predicted products of three of the genes had similarity to proteins that influence RNA stability, namely a ribonuclease activator, the pumilio family of RNA-binding proteins and RNase H. The products of two other mating-induced genes resembled two types of Phytophthora proteins previously...
Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

Directory of Open Access Journals (Sweden)

Lucie Kosinová

Full Text Available The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3 in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information
Spatio Temporal Expression Pattern of an Insecticidal Gene (cry2A in Transgenic Cotton Lines

Directory of Open Access Journals (Sweden)

Allah BAKHSH

2012-11-01

Full Text Available The production of transgenic plants with stable, high-level transgene expression is important for the success of crop improvement programs based on genetic engineering. The present study was conducted to evaluate genomic integration and spatio temporal expression of an insecticidal gene (cry2A in pre-existing transgenic lines of cotton. Genomic integration of cry2A was evaluated using various molecular approaches. The expression levels of cry2A were determined at vegetative and reproductive stages of cotton at regular intervals. These lines showed a stable integration of insecticidal gene in advance lines of transgenic cotton whereas gene expression was found variable with at various growth stages as well as in different plant parts throughout the season. The leaves of transgenic cotton were found to have maximum expression of cry2A gene followed by squares, bolls, anthers and petals. The protein level in fruiting part was less as compared to other parts showing inconsistency in gene expression. It was concluded that for culturing of transgenic crops, strategies should be developed to ensure the foreign genes expression efficient, consistent and in a predictable manner.
Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

Science.gov (United States)

Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

2018-02-23

Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.
In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer.

Science.gov (United States)

Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

2013-10-04

Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC. Copyright © 2013 Elsevier Inc. All rights reserved.
Reduction in WT1 gene expression during early treatment predicts the outcome in patients with acute myeloid leukemia.

Science.gov (United States)

Andersson, Charlotta; Li, Xingru; Lorenz, Fryderyk; Golovleva, Irina; Wahlin, Anders; Li, Aihong

2012-12-01

Wilms tumor gene 1 (WT1) expression has been suggested as an applicable minimal residual disease marker in acute myeloid leukemia (AML). We evaluated the use of this marker in 43 adult AML patients. Quantitative assessment of WT1 gene transcripts was performed using real-time quantitative-polymerase chain reaction assay. Samples from both the peripheral blood and the bone marrow were analyzed at diagnosis and during follow-up. A strong correlation was observed between WT1 normalized with 2 different control genes (β-actin and ABL1, P0.05). A≥1-log reduction in WT1 expression in bone marrow samples taken freedom from relapse (P=0.010) when β-actin was used as control gene. Furthermore, a reduction in WT1 expression by ≥2 logs in peripheral blood samples taken at a later time point significantly correlated with a better outcome for overall survival (P=0.004) and freedom from relapse (P=0.012). This result was achieved when normalizing against both β-actin and ABL1. These results therefore suggest that WT1 gene expression can provide useful information for minimal residual disease detection in adult AML patients and that combined use of control genes can give more informative results.
Functional Associations by Response Overlap (FARO, a functional genomics approach matching gene expression phenotypes.

Directory of Open Access Journals (Sweden)

Henrik Bjørn Nielsen

2007-08-01

Full Text Available The systematic comparison of transcriptional responses of organisms is a powerful tool in functional genomics. For example, mutants may be characterized by comparing their transcript profiles to those obtained in other experiments querying the effects on gene expression of many experimental factors including treatments, mutations and pathogen infections. Similarly, drugs may be discovered by the relationship between the transcript profiles effectuated or impacted by a candidate drug and by the target disease. The integration of such data enables systems biology to predict the interplay between experimental factors affecting a biological system. Unfortunately, direct comparisons of gene expression profiles obtained in independent, publicly available microarray experiments are typically compromised by substantial, experiment-specific biases. Here we suggest a novel yet conceptually simple approach for deriving 'Functional Association(s by Response Overlap' (FARO between microarray gene expression studies. The transcriptional response is defined by the set of differentially expressed genes independent from the magnitude or direction of the change. This approach overcomes the limited comparability between studies that is typical for methods that rely on correlation in gene expression. We apply FARO to a compendium of 242 diverse Arabidopsis microarray experimental factors, including phyto-hormones, stresses and pathogens, growth conditions/stages, tissue types and mutants. We also use FARO to confirm and further delineate the functions of Arabidopsis MAP kinase 4 in disease and stress responses. Furthermore, we find that a large, well-defined set of genes responds in opposing directions to different stress conditions and predict the effects of different stress combinations. This demonstrates the usefulness of our approach for exploiting public microarray data to derive biologically meaningful associations between experimental factors. Finally, our
Simple Comparative Analyses of Differentially Expressed Gene Lists May Overestimate Gene Overlap.

Science.gov (United States)

Lawhorn, Chelsea M; Schomaker, Rachel; Rowell, Jonathan T; Rueppell, Olav

2018-04-16

Comparing the overlap between sets of differentially expressed genes (DEGs) within or between transcriptome studies is regularly used to infer similarities between biological processes. Significant overlap between two sets of DEGs is usually determined by a simple test. The number of potentially overlapping genes is compared to the number of genes that actually occur in both lists, treating every gene as equal. However, gene expression is controlled by transcription factors that bind to a variable number of transcription factor binding sites, leading to variation among genes in general variability of their expression. Neglecting this variability could therefore lead to inflated estimates of significant overlap between DEG lists. With computer simulations, we demonstrate that such biases arise from variation in the control of gene expression. Significant overlap commonly arises between two lists of DEGs that are randomly generated, assuming that the control of gene expression is variable among genes but consistent between corresponding experiments. More overlap is observed when transcription factors are specific to their binding sites and when the number of genes is considerably higher than the number of different transcription factors. In contrast, overlap between two DEG lists is always lower than expected when the genetic architecture of expression is independent between the two experiments. Thus, the current methods for determining significant overlap between DEGs are potentially confounding biologically meaningful overlap with overlap that arises due to variability in control of expression among genes, and more sophisticated approaches are needed.
Methods for monitoring multiple gene expression

Energy Technology Data Exchange (ETDEWEB)

Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

2012-05-01

The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Methods for monitoring multiple gene expression

Energy Technology Data Exchange (ETDEWEB)

Berka, Randy; Bachkirova, Elena; Rey, Michael

2013-10-01

The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Gene expression profiles associated with anaemia and ITPA genotypes in patients with chronic hepatitis C (CH-C).

Science.gov (United States)

Birerdinc, A; Estep, M; Afendy, A; Stepanova, M; Younossi, I; Baranova, A; Younossi, Z M

2012-06-01

Anaemia is a common side effect of ribavirin (RBV) which is used for the treatment of hepatitis C. Inosine triphosphatase gene polymorphism (C to A) protects against RBV-induced anaemia. The aim of our study was to genotype patients for inosine triphosphatase gene polymorphism rs1127354 SNP (CC or CA) and associate treatment-induced anaemia with gene expression profile and genotypes. We used 67 hepatitis C patients with available gene expression, clinical, laboratory data and whole-blood samples. Whole blood was used to determine inosine triphosphatase gene polymorphism rs1127354 genotypes (CC or CA). The cohort with inosine triphosphatase gene polymorphism CA genotype revealed a distinct pattern of protection against anaemia and a lower drop in haemoglobin. A variation in the propensity of CC carriers to develop anaemia prompted us to look for additional predictors of anaemia during pegylated interferon (PEG-IFN) and RBV. Pretreatment blood samples of patients receiving a full course of PEG-IFN and RBV were used to assess expression of 153 genes previously implicated in host response to viral infections. The gene expression data were analysed according to presence of anaemia and inosine triphosphatase gene polymorphism genotypes. Thirty-six genes were associated with treatment-related anaemia, six of which are involved in the response to hypoxia pathway (HIF1A, AIF1, RHOC, PTEN, LCK and PDGFB). There was a substantial overlap between sustained virological response (SVR)-predicting and anaemia-related genes; however, of the nine JAK-STAT pathway-related genes associated with SVR, none were implicated in anaemia. These observations exclude the direct involvement of antiviral response in the development of anaemia associated with PEG-IFN and RBV treatment, whereas another, distinct component within the SVR-associated gene expression response may predict anaemia. We have identified baseline gene expression signatures associated with RBV-induced anaemia and identified
The ordering of expression among a few genes can provide simple cancer biomarkers and signal BRCA1 mutations

Directory of Open Access Journals (Sweden)

Parmigiani Giovanni

2009-08-01

Full Text Available Abstract Background A major challenge in computational biology is to extract knowledge about the genetic nature of disease from high-throughput data. However, an important obstacle to both biological understanding and clinical applications is the "black box" nature of the decision rules provided by most machine learning approaches, which usually involve many genes combined in a highly complex fashion. Achieving biologically relevant results argues for a different strategy. A promising alternative is to base prediction entirely upon the relative expression ordering of a small number of genes. Results We present a three-gene version of "relative expression analysis" (RXA, a rigorous and systematic comparison with earlier approaches in a variety of cancer studies, a clinically relevant application to predicting germline BRCA1 mutations in breast cancer and a cross-study validation for predicting ER status. In the BRCA1 study, RXA yields high accuracy with a simple decision rule: in tumors carrying mutations, the expression of a "reference gene" falls between the expression of two differentially expressed genes, PPP1CB and RNF14. An analysis of the protein-protein interactions among the triplet of genes and BRCA1 suggests that the classifier has a biological foundation. Conclusion RXA has the potential to identify genomic "marker interactions" with plausible biological interpretation and direct clinical applicability. It provides a general framework for understanding the roles of the genes involved in decision rules, as illustrated for the difficult and clinically relevant problem of identifying BRCA1 mutation carriers.

Determinants of human adipose tissue gene expression

DEFF Research Database (Denmark)

Viguerie, Nathalie; Montastier, Emilie; Maoret, Jean-José

2012-01-01

weight maintenance diets. For 175 genes, opposite regulation was observed during calorie restriction and weight maintenance phases, independently of variations in body weight. Metabolism and immunity genes showed inverse profiles. During the dietary intervention, network-based analyses revealed strong...... interconnection between expression of genes involved in de novo lipogenesis and components of the metabolic syndrome. Sex had a marked influence on AT expression of 88 transcripts, which persisted during the entire dietary intervention and after control for fat mass. In women, the influence of body mass index...... on expression of a subset of genes persisted during the dietary intervention. Twenty-two genes revealed a metabolic syndrome signature common to men and women. Genetic control of AT gene expression by cis signals was observed for 46 genes. Dietary intervention, sex, and cis genetic variants independently...
Altered Expression of Genes Implicated in Xylan Biosynthesis Affects Penetration Resistance against Powdery Mildew.

Science.gov (United States)

Chowdhury, Jamil; Lück, Stefanie; Rajaraman, Jeyaraman; Douchkov, Dimitar; Shirley, Neil J; Schwerdt, Julian G; Schweizer, Patrick; Fincher, Geoffrey B; Burton, Rachel A; Little, Alan

2017-01-01

Heteroxylan has recently been identified as an important component of papillae, which are formed during powdery mildew infection of barley leaves. Deposition of heteroxylan near the sites of attempted fungal penetration in the epidermal cell wall is believed to enhance the physical resistance to the fungal penetration peg and hence to improve pre-invasion resistance. Several glycosyltransferase (GT) families are implicated in the assembly of heteroxylan in the plant cell wall, and are likely to work together in a multi-enzyme complex. Members of key GT families reported to be involved in heteroxylan biosynthesis are up-regulated in the epidermal layer of barley leaves during powdery mildew infection. Modulation of their expression leads to altered susceptibility levels, suggesting that these genes are important for penetration resistance. The highest level of resistance was achieved when a GT43 gene was co-expressed with a GT47 candidate gene, both of which have been predicted to be involved in xylan backbone biosynthesis. Altering the expression level of several candidate heteroxylan synthesis genes can significantly alter disease susceptibility. This is predicted to occur through changes in the amount and structure of heteroxylan in barley papillae.
LINE FUSION GENES: a database of LINE expression in human genes

Directory of Open Access Journals (Sweden)

Park Hong-Seog

2006-06-01

Full Text Available Abstract Background Long Interspersed Nuclear Elements (LINEs are the most abundant retrotransposons in humans. About 79% of human genes are estimated to contain at least one segment of LINE per transcription unit. Recent studies have shown that LINE elements can affect protein sequences, splicing patterns and expression of human genes. Description We have developed a database, LINE FUSION GENES, for elucidating LINE expression throughout the human gene database. We searched the 28,171 genes listed in the NCBI database for LINE elements and analyzed their structures and expression patterns. The results show that the mRNA sequences of 1,329 genes were affected by LINE expression. The LINE expression types were classified on the basis of LINEs in the 5' UTR, exon or 3' UTR sequences of the mRNAs. Our database provides further information, such as the tissue distribution and chromosomal location of the genes, and the domain structure that is changed by LINE integration. We have linked all the accession numbers to the NCBI data bank to provide mRNA sequences for subsequent users. Conclusion We believe that our work will interest genome scientists and might help them to gain insight into the implications of LINE expression for human evolution and disease. Availability http://www.primate.or.kr/line
Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family.

Science.gov (United States)

Guo, Chunlei; Guo, Rongrong; Xu, Xiaozhao; Gao, Min; Li, Xiaoqin; Song, Junyang; Zheng, Yi; Wang, Xiping

2014-04-01

WRKY proteins comprise a large family of transcription factors that play important roles in plant defence regulatory networks, including responses to various biotic and abiotic stresses. To date, no large-scale study of WRKY genes has been undertaken in grape (Vitis vinifera L.). In this study, a total of 59 putative grape WRKY genes (VvWRKY) were identified and renamed on the basis of their respective chromosome distribution. A multiple sequence alignment analysis using all predicted grape WRKY genes coding sequences, together with those from Arabidopsis thaliana and tomato (Solanum lycopersicum), indicated that the 59 VvWRKY genes can be classified into three main groups (I-III). An evaluation of the duplication events suggested that several WRKY genes arose before the divergence of the grape and Arabidopsis lineages. Moreover, expression profiles derived from semiquantitative PCR and real-time quantitative PCR analyses showed distinct expression patterns in various tissues and in response to different treatments. Four VvWRKY genes showed a significantly higher expression in roots or leaves, 55 responded to varying degrees to at least one abiotic stress treatment, and the expression of 38 were altered following powdery mildew (Erysiphe necator) infection. Most VvWRKY genes were downregulated in response to abscisic acid or salicylic acid treatments, while the expression of a subset was upregulated by methyl jasmonate or ethylene treatments.
Expression of minichromosome maintenance genes in renal cell carcinoma

Directory of Open Access Journals (Sweden)

Zhong HB

2017-11-01

Full Text Available Hongbin Zhong,1,* Bin Chen,1,* Henrique Neves,2 Jinchun Xing,1 Youxin Ye,1 Ying Lin,1 Guohong Zhuang,3 Shu-Dong Zhang,4 Jiyi Huang,1,5 Hang Fai Kwok2 1Xiang’an Branch, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, People’s Republic of China; 2Faculty of Health Sciences, University of Macau, Taipa, Macau SAR; 3Medical College of Xiamen University, Xiamen, Fujian, People’s Republic of China; 4Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Londonderry, UK; 5The First Clinical School of Fujian Medical University, Fuzhou, Fujian, People’s Republic of China *These authors contributed equally to this work Abstract: Minichromosome maintenance (MCM proteins play an essential role in DNA replication. They have been shown to be overexpressed in various types of cancer. However, the role of this family in renal cell carcinoma (RCC is widely unknown. In this study, we have identified a number of RCC datasets in the Gene Expression Omnibus database and also investigated the correlation between the expression levels of MCM genes and clinicopathological parameters. We found that the expression levels of MCM genes are positively correlated with one another. Expression levels of MCM2, MCM5, MCM6, and MCM7, but not of MCM3 and MCM4, were higher in RCC compared to paired adjacent normal tissue. Only the expression level of MCM4, but not of other MCMs, was positively correlated with tumor grade. In addition, a high-level expression of MCM2 in either primary tumor or metastases of RCC predicted a shorter disease-free survival time, while a high-level expression of MCM4 or MCM6 in primary tumor was also associated with poorer disease-free survival. Interestingly, we also demonstrated that patients with their primary RCC overexpressing 2 or more MCM genes had a shorter disease-free survival time, while those with RCC metastases overexpressing 3 or more MCM genes had a shorter
Predictive gene signatures: molecular markers distinguishing colon adenomatous polyp and carcinoma.

Directory of Open Access Journals (Sweden)

Janice E Drew

Full Text Available Cancers exhibit abnormal molecular signatures associated with disease initiation and progression. Molecular signatures could improve cancer screening, detection, drug development and selection of appropriate drug therapies for individual patients. Typically only very small amounts of tissue are available from patients for analysis and biopsy samples exhibit broad heterogeneity that cannot be captured using a single marker. This report details application of an in-house custom designed GenomeLab System multiplex gene expression assay, the hCellMarkerPlex, to assess predictive gene signatures of normal, adenomatous polyp and carcinoma colon tissue using archived tissue bank material. The hCellMarkerPlex incorporates twenty-one gene markers: epithelial (EZR, KRT18, NOX1, SLC9A2, proliferation (PCNA, CCND1, MS4A12, differentiation (B4GANLT2, CDX1, CDX2, apoptotic (CASP3, NOX1, NTN1, fibroblast (FSP1, COL1A1, structural (ACTG2, CNN1, DES, gene transcription (HDAC1, stem cell (LGR5, endothelial (VWF and mucin production (MUC2. Gene signatures distinguished normal, adenomatous polyp and carcinoma. Individual gene targets significantly contributing to molecular tissue types, classifier genes, were further characterised using real-time PCR, in-situ hybridisation and immunohistochemistry revealing aberrant epithelial expression of MS4A12, LGR5 CDX2, NOX1 and SLC9A2 prior to development of carcinoma. Identified gene signatures identify aberrant epithelial expression of genes prior to cancer development using in-house custom designed gene expression multiplex assays. This approach may be used to assist in objective classification of disease initiation, staging, progression and therapeutic responses using biopsy material.
In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

Energy Technology Data Exchange (ETDEWEB)

Pandi, Narayanan Sathiya, E-mail: sathiyapandi@gmail.com; Suganya, Sivagurunathan; Rajendran, Suriliyandi

2013-10-04

Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC.
In silico analysis of stomach lineage specific gene set expression pattern in gastric cancer

International Nuclear Information System (INIS)

Pandi, Narayanan Sathiya; Suganya, Sivagurunathan; Rajendran, Suriliyandi

2013-01-01

Highlights: •Identified stomach lineage specific gene set (SLSGS) was found to be under expressed in gastric tumors. •Elevated expression of SLSGS in gastric tumor is a molecular predictor of metabolic type gastric cancer. •In silico pathway scanning identified estrogen-α signaling is a putative regulator of SLSGS in gastric cancer. •Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. -- Abstract: Stomach lineage specific gene products act as a protective barrier in the normal stomach and their expression maintains the normal physiological processes, cellular integrity and morphology of the gastric wall. However, the regulation of stomach lineage specific genes in gastric cancer (GC) is far less clear. In the present study, we sought to investigate the role and regulation of stomach lineage specific gene set (SLSGS) in GC. SLSGS was identified by comparing the mRNA expression profiles of normal stomach tissue with other organ tissue. The obtained SLSGS was found to be under expressed in gastric tumors. Functional annotation analysis revealed that the SLSGS was enriched for digestive function and gastric epithelial maintenance. Employing a single sample prediction method across GC mRNA expression profiles identified the under expression of SLSGS in proliferative type and invasive type gastric tumors compared to the metabolic type gastric tumors. Integrative pathway activation prediction analysis revealed a close association between estrogen-α signaling and SLSGS expression pattern in GC. Elevated expression of SLSGS in GC is associated with an overall increase in the survival of GC patients. In conclusion, our results highlight that estrogen mediated regulation of SLSGS in gastric tumor is a molecular predictor of metabolic type GC and prognostic factor in GC
Gene Expression Profiles for Predicting Metastasis in Breast Cancer: A Cross-Study Comparison of Classification Methods

Directory of Open Access Journals (Sweden)

Mark Burton

2012-01-01

Full Text Available Machine learning has increasingly been used with microarray gene expression data and for the development of classifiers using a variety of methods. However, method comparisons in cross-study datasets are very scarce. This study compares the performance of seven classification methods and the effect of voting for predicting metastasis outcome in breast cancer patients, in three situations: within the same dataset or across datasets on similar or dissimilar microarray platforms. Combining classification results from seven classifiers into one voting decision performed significantly better during internal validation as well as external validation in similar microarray platforms than the underlying classification methods. When validating between different microarray platforms, random forest, another voting-based method, proved to be the best performing method. We conclude that voting based classifiers provided an advantage with respect to classifying metastasis outcome in breast cancer patients.
Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

KAUST Repository

Horiuchi, Youko

2015-12-23

Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis-eQTLs. Expression
Social Regulation of Gene Expression in Threespine Sticklebacks.

Directory of Open Access Journals (Sweden)

Anna K Greenwood

Full Text Available Identifying genes that are differentially expressed in response to social interactions is informative for understanding the molecular basis of social behavior. To address this question, we described changes in gene expression as a result of differences in the extent of social interactions. We housed threespine stickleback (Gasterosteus aculeatus females in either group conditions or individually for one week, then measured levels of gene expression in three brain regions using RNA-sequencing. We found that numerous genes in the hindbrain/cerebellum had altered expression in response to group or individual housing. However, relatively few genes were differentially expressed in either the diencephalon or telencephalon. The list of genes upregulated in fish from social groups included many genes related to neural development and cell adhesion as well as genes with functions in sensory signaling, stress, and social and reproductive behavior. The list of genes expressed at higher levels in individually-housed fish included several genes previously identified as regulated by social interactions in other animals. The identified genes are interesting targets for future research on the molecular mechanisms of normal social interactions.
Building prognostic models for breast cancer patients using clinical variables and hundreds of gene expression signatures

Directory of Open Access Journals (Sweden)

Liu Yufeng

2011-01-01

Full Text Available Abstract Background Multiple breast cancer gene expression profiles have been developed that appear to provide similar abilities to predict outcome and may outperform clinical-pathologic criteria; however, the extent to which seemingly disparate profiles provide additive prognostic information is not known, nor do we know whether prognostic profiles perform equally across clinically defined breast cancer subtypes. We evaluated whether combining the prognostic powers of standard breast cancer clinical variables with a large set of gene expression signatures could improve on our ability to predict patient outcomes. Methods Using clinical-pathological variables and a collection of 323 gene expression "modules", including 115 previously published signatures, we build multivariate Cox proportional hazards models using a dataset of 550 node-negative systemically untreated breast cancer patients. Models predictive of pathological complete response (pCR to neoadjuvant chemotherapy were also built using this approach. Results We identified statistically significant prognostic models for relapse-free survival (RFS at 7 years for the entire population, and for the subgroups of patients with ER-positive, or Luminal tumors. Furthermore, we found that combined models that included both clinical and genomic parameters improved prognostication compared with models with either clinical or genomic variables alone. Finally, we were able to build statistically significant combined models for pathological complete response (pCR predictions for the entire population. Conclusions Integration of gene expression signatures and clinical-pathological factors is an improved method over either variable type alone. Highly prognostic models could be created when using all patients, and for the subset of patients with lymph node-negative and ER-positive breast cancers. Other variables beyond gene expression and clinical-pathological variables, like gene mutation status or DNA
Systematic assessment of multi-gene predictors of pan-cancer cell line sensitivity to drugs exploiting gene expression data [version 2; referees: 2 approved

Directory of Open Access Journals (Sweden)

Linh Nguyen

2017-03-01

Full Text Available Background: Selected gene mutations are routinely used to guide the selection of cancer drugs for a given patient tumour. Large pharmacogenomic data sets, such as those by Genomics of Drug Sensitivity in Cancer (GDSC consortium, were introduced to discover more of these single-gene markers of drug sensitivity. Very recently, machine learning regression has been used to investigate how well cancer cell line sensitivity to drugs is predicted depending on the type of molecular profile. The latter has revealed that gene expression data is the most predictive profile in the pan-cancer setting. However, no study to date has exploited GDSC data to systematically compare the performance of machine learning models based on multi-gene expression data against that of widely-used single-gene markers based on genomics data. Methods: Here we present this systematic comparison using Random Forest (RF classifiers exploiting the expression levels of 13,321 genes and an average of 501 tested cell lines per drug. To account for time-dependent batch effects in IC50 measurements, we employ independent test sets generated with more recent GDSC data than that used to train the predictors and show that this is a more realistic validation than standard k-fold cross-validation. Results and Discussion: Across 127 GDSC drugs, our results show that the single-gene markers unveiled by the MANOVA analysis tend to achieve higher precision than these RF-based multi-gene models, at the cost of generally having a poor recall (i.e. correctly detecting only a small part of the cell lines sensitive to the drug. Regarding overall classification performance, about two thirds of the drugs are better predicted by the multi-gene RF classifiers. Among the drugs with the most predictive of these models, we found pyrimethamine, sunitinib and 17-AAG. Conclusions: Thanks to this unbiased validation, we now know that this type of models can predict in vitro tumour response to some of these
Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

Energy Technology Data Exchange (ETDEWEB)

Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

2016-11-11

Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies
VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

Science.gov (United States)

Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

2013-12-16

Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis
A constructive approach to gene expression dynamics

International Nuclear Information System (INIS)

Ochiai, T.; Nacher, J.C.; Akutsu, T.

2004-01-01

Recently, experiments on mRNA abundance (gene expression) have revealed that gene expression shows a stationary organization described by a scale-free distribution. Here we propose a constructive approach to gene expression dynamics which restores the scale-free exponent and describes the intermediate state dynamics. This approach requires only one assumption: Markov property
Effects of Ionizing Radiation on Murine Gene Expression in Skin and Bone

Science.gov (United States)

Terada, Masahiro; Schreurs, Ann-Sofie; Shirazi-Fard, Yasaman; Alwood, Joshua; Tahimic, Candice; Sowa, Marianne B.; Globus, Ruth K.

2017-01-01

Long duration spaceflight causes a negative calcium balance and reduces bone density in astronauts. The potential for exposure to space radiation to contribute to lasting decrements in bone mass is not yet understood. Sustained changes to bone mass have a relatively long latency for development, however skin is a radiation sensitive organ and changes in skin gene expression may serve as an early radiation biomarker of exposures and may correlate with adverse effects on skeletal tissue. Previous studies have shown that FGF18 gene expression levels of hair follicles collected from astronauts on the ISS rose over time. In the hair follicle, FGF18 signaling mediates radioresistance in the telogen by arresting the cell cycle, and FGF18 has the potential to function as a radioprotector. In bone, FGF18 appears to regulate cell proliferation and differentiation positively during osteogenesis and negatively during chondrogenesis. Cellular defense responses to radiation are shared by a variety of organs, hence in this study, we examined whether radiation induced gene expression changes in skin may be predictive of the responses of skeletal tissue to radiation exposure. We have examined oxidative stress and growth arrest pathways in mouse skin and long bones by measuring gene expression levels via quantitative polymerase chain reaction (qPCR) after exposure to total body irradiation (TBI). To investigate the effects of irradiation on gene expression, we used skin and femora (cortical shaft) from the following treatment groups: control (normally loaded, sham-irradiated), and TBI (0.5 Gy Fe-56 600 MeV/n and 0.5 Gy H-1 150 MeV/n). Animals were euthanized one and 11 days post-IR. Statistical analysis was performed via a Student's ttest. In skin samples one day after IR, skin expression of FGF18 was significantly greater (3.8X) than sham-irradiated controls (3.8X), but did not differ 11 days post TBI. Expression levels of other radiation related genes (Nfe2l2, Trp53, Cdkn1a, FoxO3
Stably Expressed Genes Involved in Basic Cellular Functions.

Directory of Open Access Journals (Sweden)

Kejian Wang

Full Text Available Stably Expressed Genes (SEGs whose expression varies within a narrow range may be involved in core cellular processes necessary for basic functions. To identify such genes, we re-analyzed existing RNA-Seq gene expression profiles across 11 organs at 4 developmental stages (from immature to old age in both sexes of F344 rats (n = 4/group; 320 samples. Expression changes (calculated as the maximum expression / minimum expression for each gene of >19000 genes across organs, ages, and sexes ranged from 2.35 to >109-fold, with a median of 165-fold. The expression of 278 SEGs was found to vary ≤4-fold and these genes were significantly involved in protein catabolism (proteasome and ubiquitination, RNA transport, protein processing, and the spliceosome. Such stability of expression was further validated in human samples where the expression variability of the homologous human SEGs was significantly lower than that of other genes in the human genome. It was also found that the homologous human SEGs were generally less subject to non-synonymous mutation than other genes, as would be expected of stably expressed genes. We also found that knockout of SEG homologs in mouse models was more likely to cause complete preweaning lethality than non-SEG homologs, corroborating the fundamental roles played by SEGs in biological development. Such stably expressed genes and pathways across life-stages suggest that tight control of these processes is important in basic cellular functions and that perturbation by endogenous (e.g., genetics or exogenous agents (e.g., drugs, environmental factors may cause serious adverse effects.
Hormonal modulation of breast cancer gene expression: implications for intrinsic subtyping in pre-menopausal women

OpenAIRE

Sarah M Bernhardt; Pallave Dasari; David Walsh; Amanda R Townsend; Amanda R Townsend; Timothy J Price; Timothy J Price; Wendy V Ingman

2016-01-01

Clinics are increasingly adopting gene expression profiling to diagnose breast cancer subtype, providing an intrinsic, molecular portrait of the tumour. For example, the PAM50-based Prosigna test quantifies expression of 50 key genes to classify breast cancer subtype, and this method of classification has been demonstrated to be superior over traditional immunohistochemical methods that detect proteins, to predict risk of disease recurrence. However, these tests were largely developed and val...
Lithium ions induce prestalk-associated gene expression and inhibit prespore gene expression in Dictyostelium discoideum

NARCIS (Netherlands)

Peters, Dorien J.M.; Lookeren Campagne, Michiel M. van; Haastert, Peter J.M. van; Spek, Wouter; Schaap, Pauline

1989-01-01

We investigated the effect of Li+ on two types of cyclic AMP-regulated gene expression and on basal and cyclic AMP-stimulated inositol 1,4,5-trisphosphate (Ins(1,4,5)P3) levels. Li+ effectively inhibits cyclic AMP-induced prespore gene expression, half-maximal inhibition occurring at about 2mM-LiCl.

Gene expression profiling of fast- and slow- growing gonadotroph non-functioning pituitary adenomas

DEFF Research Database (Denmark)

Falch, Camilla Maria; Sundaram, Arvind Y M; Øystese, Kristin Astrid

2018-01-01

Objective Reliable biomarkers associated with aggressiveness of non-functioning gonadotroph adenomas (GAs) are lacking. As the growth of tumor remnants is highly variable, molecular markers for growth potential prediction are necessary. We hypothesized that fast- and slow - growing GAs present......, GPM6A and six EMT-related genes (SPAG9, SKIL, MTDH, HOOK1, CNOT6L and PRKACB). MTDH, but not EMCN, demonstrated involvement in cell migration and association with EMT-markers. Conclusions Fast- and slow- growing GAs present different gene expression profiles and genes related to EMT have higher...... expression in fast-growing tumors. In addition to MTDH, identified as an important contributor to aggressiveness, the other genes might represent markers for tumor growth potential and possible targets for drug therapy. ....
Gene expression signatures of radiation response are specific, durable and accurate in mice and humans.

Directory of Open Access Journals (Sweden)

Sarah K Meadows

2008-04-01

Full Text Available Previous work has demonstrated the potential for peripheral blood (PB gene expression profiling for the detection of disease or environmental exposures.We have sought to determine the impact of several variables on the PB gene expression profile of an environmental exposure, ionizing radiation, and to determine the specificity of the PB signature of radiation versus other genotoxic stresses. Neither genotype differences nor the time of PB sampling caused any lessening of the accuracy of PB signatures to predict radiation exposure, but sex difference did influence the accuracy of the prediction of radiation exposure at the lowest level (50 cGy. A PB signature of sepsis was also generated and both the PB signature of radiation and the PB signature of sepsis were found to be 100% specific at distinguishing irradiated from septic animals. We also identified human PB signatures of radiation exposure and chemotherapy treatment which distinguished irradiated patients and chemotherapy-treated individuals within a heterogeneous population with accuracies of 90% and 81%, respectively.We conclude that PB gene expression profiles can be identified in mice and humans that are accurate in predicting medical conditions, are specific to each condition and remain highly accurate over time.
Peak flood estimation using gene expression programming

Science.gov (United States)

Zorn, Conrad R.; Shamseldin, Asaad Y.

2015-12-01

As a case study for the Auckland Region of New Zealand, this paper investigates the potential use of gene-expression programming (GEP) in predicting specific return period events in comparison to the established and widely used Regional Flood Estimation (RFE) method. Initially calibrated to 14 gauged sites, the GEP derived model was further validated to 10 and 100 year flood events with a relative errors of 29% and 18%, respectively. This is compared to the RFE method providing 48% and 44% errors for the same flood events. While the effectiveness of GEP in predicting specific return period events is made apparent, it is argued that the derived equations should be used in conjunction with those existing methodologies rather than as a replacement.
Gene expression alterations associated with outcome in aromatase inhibitor-treated ER+ early-stage breast cancer patients

DEFF Research Database (Denmark)

Gravgaard Thomsen, Karina Hedelund; Lyng, Maria Bibi; Elias, Daniel

2015-01-01

predictive of outcome of ER+ breast cancer patients treated with AIs are needed. Global gene expression analysis was performed on ER+ primary breast cancers from patients treated with adjuvant AI monotherapy; half experienced recurrence (median follow-up 6.7 years). Gene expression alterations were validated...... by qRT-PCR, and functional studies evaluating the effect of siRNA-mediated gene knockdown on cell growth were performed. Twenty-six genes, including TFF3, DACH1, RGS5, and GHR, were shown to exhibit altered expression in tumors from patients with recurrence versus non-recurrent (fold change ≥1.5, p ....05), and the gene expression alterations were confirmed using qRT-PCR. Ten of these 26 genes could be linked in a network associated with cellular proliferation, growth, and development. TFF3, which encodes for trefoil factor 3 and is an estrogen-responsive oncogene shown to play a functional role in tamoxifen...
Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

Directory of Open Access Journals (Sweden)

Yang Jie

2017-01-01

Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.
Prediction of drug efficacy for cancer treatment based on comparative analysis of chemosensitivity and gene expression data

DEFF Research Database (Denmark)

Wan, Peng; Li, Qiyuan; Larsen, Jens Erik Pontoppidan

2012-01-01

The NCI60 database is the largest available collection of compounds with measured anti-cancer activity. The strengths and limitations for using the NCI60 database as a source of new anti-cancer agents are explored and discussed in relation to previous studies. We selected a sub-set of 2333...... and in a data set of expression profiles of 1901 genes for the corresponding tumor cell lines. Five clusters were identified based on the gene expression data using self-organizing maps (SOM), comprising leukemia, melanoma, ovarian and prostate, basal breast, and luminal breast cancer cells, respectively....... The strong difference in gene expression between basal and luminal breast cancer cells was reflected clearly in the chemosensitivity data. Although most compounds in the data set were of low potency, high efficacy compounds that showed specificity with respect to tissue of origin could be found. Furthermore...
Validation of commonly used reference genes for sleep-related gene expression studies

Directory of Open Access Journals (Sweden)

Castro Rosa MRPS

2009-05-01

Full Text Available Abstract Background Sleep is a restorative process and is essential for maintenance of mental and physical health. In an attempt to understand the complexity of sleep, multidisciplinary strategies, including genetic approaches, have been applied to sleep research. Although quantitative real time PCR has been used in previous sleep-related gene expression studies, proper validation of reference genes is currently lacking. Thus, we examined the effect of total or paradoxical sleep deprivation (TSD or PSD on the expression stability of the following frequently used reference genes in brain and blood: beta-actin (b-actin, beta-2-microglobulin (B2M, glyceraldehyde-3-phosphate dehydrogenase (GAPDH, and hypoxanthine guanine phosphoribosyl transferase (HPRT. Results Neither TSD nor PSD affected the expression stability of all tested genes in both tissues indicating that b-actin, B2M, GAPDH and HPRT are appropriate reference genes for the sleep-related gene expression studies. In order to further verify these results, the relative expression of brain derived neurotrophic factor (BDNF and glycerol-3-phosphate dehydrogenase1 (GPD1 was evaluated in brain and blood, respectively. The normalization with each of four reference genes produced similar pattern of expression in control and sleep deprived rats, but subtle differences in the magnitude of expression fold change were observed which might affect the statistical significance. Conclusion This study demonstrated that sleep deprivation does not alter the expression stability of commonly used reference genes in brain and blood. Nonetheless, the use of multiple reference genes in quantitative RT-PCR is required for the accurate results.
Cloning and expression analysis of a novel ammonium transporter gene from eichhornia

International Nuclear Information System (INIS)

Li, Y.; Yan, G.; Zheng, L.

2014-01-01

In order to explore the molecular mechanism for Eichhornia crassipes to transport ammonium from outside, we cloned a novel ammonium transporter (EcAMT) gene from E. crassipes and identified its function by using yeast complementation experiment. The full-length cDNA of EcAMT contains a 1506 nucletide-long open reading frame which encodes a protein of 501 amino acids. Bioinformatics analysis predicted that EcAMT had 8 transmembrane regions. The expressions of EcAMT gene under three different nitrogen conditions were evaluated by quantitative reverse transcriptase PCR (qRT-PCR) and the results showed that the expression of EcAMT gene was up-regulated under nitrogen starvation. Our study results revealed some molecular mechanism of E. crassipes to absorb the ammonium in eutrophic water. (author)
Subclinical Pregnancy Toxemia-Induced Gene Expression Changes in Ovine Placenta and Uterus.

Science.gov (United States)

Kasimanickam, Ramanathan K

2016-01-01

The objective was to elucidate gene expression differences in uterus, caruncle, and cotyledon of ewes with subclinical pregnancy toxemia (SCPT) and healthy ewes, and to identify associated biological functions and pathways involved in pregnancy toxemia. On Day 136 (±1 day) post-breeding, ewes (n = 18) had body condition score (BCS; 1-5; 1, emaciated; 5, obese) assessed, and blood samples were collected for plasma glucose and β-hydroxybutyrate (BHBA) analyses. The ewes were euthanized, and tissue samples were collected from the gravid uterus and placentomes. Based on BCS (2.0 ± 0.02), glucose (2.4 ± 0.33), and BHBA (0.97 ± 0.06) concentrations, ewes (n = 10) were grouped as healthy (n = 5) and subclinical SCPT (n = 5) ewes. The mRNA expressions were determined by quantitative PCR method, and prediction of miRNA partners and target genes for the predicted miRNA were identified using miRDB (http://mirdb.org/miRDB/). Top ranked target genes were used to identify associated biological functions and pathways in response to SPCT using PANTHER. The angiogenesis genes VEGF and PlGF, and AdipoQ, AdipoR2, PPARG, LEP, IGF1, IGF2, IL1b, and TNFα mRNA expressions were lower in abundances, whereas hypoxia genes eNOS, HIF1a, and HIF 2a, and sFlt1 and KDR mRNA expressions were greater in abundances in uterus and placenta of SCPT ewes compared to healthy ewes (P influence placental vascular development and angiogenesis as noted in this study set the course for hemodynamic changes and hence have a major impact on the rate of transplacental nutrient exchange, fetal growth, and health of the dam.
Gene Expression Ratios Lead to Accurate and Translatable Predictors of DR5 Agonism across Multiple Tumor Lineages.

Directory of Open Access Journals (Sweden)

Anupama Reddy

Full Text Available Death Receptor 5 (DR5 agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq and across model systems (in vitro to in vivo. Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response.
Gene Expression Ratios Lead to Accurate and Translatable Predictors of DR5 Agonism across Multiple Tumor Lineages.

Science.gov (United States)

Reddy, Anupama; Growney, Joseph D; Wilson, Nick S; Emery, Caroline M; Johnson, Jennifer A; Ward, Rebecca; Monaco, Kelli A; Korn, Joshua; Monahan, John E; Stump, Mark D; Mapa, Felipa A; Wilson, Christopher J; Steiger, Janine; Ledell, Jebediah; Rickles, Richard J; Myer, Vic E; Ettenberg, Seth A; Schlegel, Robert; Sellers, William R; Huet, Heather A; Lehár, Joseph

2015-01-01

Death Receptor 5 (DR5) agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq) and across model systems (in vitro to in vivo). Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response.
The Expression of Genes Encoding Secreted Proteins in Medicago truncatula A17 Inoculated Roots

Directory of Open Access Journals (Sweden)

LUCIA KUSUMAWATI

2013-09-01

Full Text Available Subtilisin-like serine protease (MtSBT, serine carboxypeptidase (MtSCP, MtN5, non-specific lipid transfer protein (MtnsLTP, early nodulin2-like protein (MtENOD2-like, FAD-binding domain containing protein (MtFAD-BP1, and rhicadhesin receptor protein (MtRHRE1 were among 34 proteins found in the supernatant of M. truncatula 2HA and sickle cell suspension cultures. This study investigated the expression of genes encoding those proteins in roots and developing nodules. Two methods were used: quantitative real time RT-PCR and gene expression analysis (with promoter:GUS fusion in roots. Those proteins are predicted as secreted proteins which is indirectly supported by the findings that promoter:GUS fusions of six of the seven genes encoding secreted proteins were strongly expressed in the vascular bundle of transgenic hairy roots. All six genes have expressed in 14-day old nodule. The expression levels of the selected seven genes were quantified in Sinorhizobium-inoculated and control plants using quantitative real time RT-PCR. In conclusion, among seven genes encoding secreted proteins analyzed, the expression level of only one gene, MtN5, was up-regulated significantly in inoculated root segments compared to controls. The expression of MtSBT1, MtSCP1, MtnsLTP, MtFAD-BP1, MtRHRE1 and MtN5 were higher in root tip than in other tissues examined.
Stochastic gene expression in Arabidopsis thaliana.

Science.gov (United States)

Araújo, Ilka Schultheiß; Pietsch, Jessica Magdalena; Keizer, Emma Mathilde; Greese, Bettina; Balkunde, Rachappa; Fleck, Christian; Hülskamp, Martin

2017-12-14

Although plant development is highly reproducible, some stochasticity exists. This developmental stochasticity may be caused by noisy gene expression. Here we analyze the fluctuation of protein expression in Arabidopsis thaliana. Using the photoconvertible KikGR marker, we show that the protein expressions of individual cells fluctuate over time. A dual reporter system was used to study extrinsic and intrinsic noise of marker gene expression. We report that extrinsic noise is higher than intrinsic noise and that extrinsic noise in stomata is clearly lower in comparison to several other tissues/cell types. Finally, we show that cells are coupled with respect to stochastic protein expression in young leaves, hypocotyls and roots but not in mature leaves. Our data indicate that stochasticity of gene expression can vary between tissues/cell types and that it can be coupled in a non-cell-autonomous manner.
Prediction and characterisation of a highly conserved, remote and cAMP responsive enhancer that regulates Msx1 gene expression in cardiac neural crest and outflow tract.

Science.gov (United States)

Miller, Kerry Ann; Davidson, Scott; Liaros, Angela; Barrow, John; Lear, Marissa; Heine, Danielle; Hoppler, Stefan; MacKenzie, Alasdair

2008-05-15

Double knockouts of the Msx1 and Msx2 genes in the mouse result in severe cardiac outflow tract malformations similar to those frequently found in newborn infants. Despite the known role of the Msx genes in cardiac formation little is known of the regulatory systems (ligand receptor, signal transduction and protein-DNA interactions) that regulate the tissue-specific expression of the Msx genes in mammals during the formation of the outflow tract. In the present study we have used a combination of multi-species comparative genomics, mouse transgenic analysis and in-situ hybridisation to predict and validate the existence of a remote ultra-conserved enhancer that supports the expression of the Msx1 gene in migrating mouse cardiac neural crest and the outflow tract primordia. Furthermore, culturing of embryonic explants derived from transgenic lines with agonists of the PKC and PKA signal transduction systems demonstrates that this remote enhancer is influenced by PKA but not PKC dependent gene regulatory systems. These studies demonstrate the efficacy of combining comparative genomics and transgenic analyses and provide a platform for the study of the possible roles of Msx gene mis-regulation in the aetiology of congenital heart malformation.
Bronchial airway gene expression in smokers with lung or head and neck cancer

International Nuclear Information System (INIS)

Van Dyck, Eric; Nazarov, Petr V; Muller, Arnaud; Nicot, Nathalie; Bosseler, Manon; Pierson, Sandrine; Van Moer, Kris; Palissot, Valérie; Mascaux, Céline; Knolle, Ulrich; Ninane, Vincent; Nati, Romain; Bremnes, Roy M; Vallar, Laurent; Berchem, Guy; Schlesser, Marc

2014-01-01

Cigarette smoking is the major cause of cancers of the respiratory tract, including non-small cell lung cancer (NSCLC) and head and neck cancer (HNC). In order to better understand carcinogenesis of the lung and upper airways, we have compared the gene expression profiles of tumor-distant, histologically normal bronchial biopsy specimens obtained from current smokers with NSCLC or HNC (SC, considered as a single group), as well as nonsmokers (NS) and smokers without cancer (SNC). RNA from a total of 97 biopsies was used for gene expression profiling (Affymetrix HG-U133 Plus 2.0 array). Differentially expressed genes were used to compare NS, SNC, and SC, and functional analysis was carried out using Ingenuity Pathway Analysis (IPA). Smoking-related cancer of the respiratory tract was found to affect the expression of genes encoding xenobiotic biotransformation proteins, as well as proteins associated with crucial inflammation/immunity pathways and other processes that protect the airway from the chemicals in cigarette smoke or contribute to carcinogenesis. Finally, we used the prediction analysis for microarray (PAM) method to identify gene signatures of cigarette smoking and cancer, and uncovered a 15-gene signature that distinguished between SNC and SC with an accuracy of 83%. Thus, gene profiling of histologically normal bronchial biopsy specimens provided insight into cigarette-induced carcinogenesis of the respiratory tract and gene signatures of cancer in smokers
Network statistics of genetically-driven gene co-expression modules in mouse crosses

Directory of Open Access Journals (Sweden)

Marie-Pier eScott-Boyer

2013-12-01

Full Text Available In biology, networks are used in different contexts as ways to represent relationships between entities, such as for instance interactions between genes, proteins or metabolites. Despite progress in the analysis of such networks and their potential to better understand the collective impact of genes on complex traits, one remaining challenge is to establish the biologic validity of gene co-expression networks and to determine what governs their organization. We used WGCNA to construct and analyze seven gene expression datasets from several tissues of mouse recombinant inbred strains (RIS. For six out of the 7 networks, we found that linkage to module QTLs (mQTLs could be established for 29.3% of gene co-expression modules detected in the several mouse RIS. For about 74.6% of such genetically-linked modules, the mQTL was on the same chromosome as the one contributing most genes to the module, with genes originating from that chromosome showing higher connectivity than other genes in the modules. Such modules (that we considered as genetically-driven had network statistic properties (density, centralization and heterogeneity that set them apart from other modules in the network. Altogether, a sizeable portion of gene co-expression modules detected in mouse RIS panels had genetic determinants as their main organizing principle. In addition to providing a biologic interpretation validation for these modules, these genetic determinants imparted on them particular properties that set them apart from other modules in the network, to the point that they can be predicted to a large extent on the basis of their network statistics.
Deriving Trading Rules Using Gene Expression Programming

Directory of Open Access Journals (Sweden)

Adrian VISOIU

2011-01-01

Full Text Available This paper presents how buy and sell trading rules are generated using gene expression programming with special setup. Market concepts are presented and market analysis is discussed with emphasis on technical analysis and quantitative methods. The use of genetic algorithms in deriving trading rules is presented. Gene expression programming is applied in a form where multiple types of operators and operands are used. This gives birth to multiple gene contexts and references between genes in order to keep the linear structure of the gene expression programming chromosome. The setup of multiple gene contexts is presented. The case study shows how to use the proposed gene setup to derive trading rules encoded by Boolean expressions, using a dataset with the reference exchange rates between the Euro and the Romanian leu. The conclusions highlight the positive results obtained in deriving useful trading rules.
MicroRNA-124-3p expression and its prospective functional pathways in hepatocellular carcinoma: A quantitative polymerase chain reaction, gene expression omnibus and bioinformatics study.

Science.gov (United States)

He, Rong-Quan; Yang, Xia; Liang, Liang; Chen, Gang; Ma, Jie

2018-04-01

The present study aimed to explore the potential clinical significance of microRNA (miR)-124-3p expression in the hepatocarcinogenesis and development of hepatocellular carcinoma (HCC), as well as the potential target genes of functional HCC pathways. Reverse transcription-quantitative polymerase chain reaction was performed to evaluate the expression of miR-124-3p in 101 HCC and adjacent non-cancerous tissue samples. Additionally, the association between miR-124-3p expression and clinical parameters was also analyzed. Differentially expressed genes identified following miR-124-3p transfection, the prospective target genes predicted in silico and the key genes of HCC obtained from Natural Language Processing (NLP) were integrated to obtain potential target genes of miR-124-3p in HCC. Relevant signaling pathways were assessed with protein-protein interaction (PPI) networks, Gene Ontology (GO) enrichment analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) and Protein Annotation Through Evolutionary Relationships (PANTHER) pathway enrichment analysis. miR-124-3p expression was significantly reduced in HCC tissues compared with expression in adjacent non-cancerous liver tissues. In HCC, miR-124-3p was demonstrated to be associated with clinical stage. The mean survival time of the low miR-124-3p expression group was reduced compared with that of the high expression group. A total of 132 genes overlapped from differentially expressed genes, miR-124-3p predicted target genes and NLP identified genes. PPI network construction revealed a total of 109 nodes and 386 edges, and 20 key genes were identified. The major enriched terms of three GO categories included regulation of cell proliferation, positive regulation of cellular biosynthetic processes, cell leading edge, cytosol and cell projection, protein kinase activity, transcription activator activity and enzyme binding. KEGG analysis revealed pancreatic cancer, prostate cancer and non-small cell lung cancer as the
Gene Expression Dynamics Accompanying the Sponge Thermal Stress Response.

Science.gov (United States)

Guzman, Christine; Conaco, Cecilia

2016-01-01

Marine sponges are important members of coral reef ecosystems. Thus, their responses to changes in ocean chemistry and environmental conditions, particularly to higher seawater temperatures, will have potential impacts on the future of these reefs. To better understand the sponge thermal stress response, we investigated gene expression dynamics in the shallow water sponge, Haliclona tubifera (order Haplosclerida, class Demospongiae), subjected to elevated temperature. Using high-throughput transcriptome sequencing, we show that these conditions result in the activation of various processes that interact to maintain cellular homeostasis. Short-term thermal stress resulted in the induction of heat shock proteins, antioxidants, and genes involved in signal transduction and innate immunity pathways. Prolonged exposure to thermal stress affected the expression of genes involved in cellular damage repair, apoptosis, signaling and transcription. Interestingly, exposure to sublethal temperatures may improve the ability of the sponge to mitigate cellular damage under more extreme stress conditions. These insights into the potential mechanisms of adaptation and resilience of sponges contribute to a better understanding of sponge conservation status and the prediction of ecosystem trajectories under future climate conditions.
Differential peripheral blood gene expression profile based on Her2 expression on primary tumors of breast cancer patients.

Directory of Open Access Journals (Sweden)

Oana Tudoran

Full Text Available Breast cancer prognosis and treatment is highly dependent on the molecular features of the primary tumors. These tumors release specific molecules into the environment that trigger characteristic responses into the circulatory cells. In this study we investigated the expression pattern of 84 genes known to be involved in breast cancer signaling in the peripheral blood of breast cancer patients with ER-, PR- primary tumors. The patients were grouped according to Her2 expression on the primary tumors in Her2+ and Her2- cohorts. Transcriptional analysis revealed 15 genes to be differentially expressed between the two groups highlighting that Her2 signaling in primary tumors could be associated with specific blood gene expression. We found CCNA1 to be up-regulated, while ERBB2, RASSF1, CDH1, MKI67, GATA3, GLI1, SFN, PTGS2, JUN, NOTCH1, CTNNB1, KRT8, SRC, and HIC1 genes were down-regulated in the blood of triple negative breast cancer patients compared to Her2+ cohort. IPA network analysis predicts that the identified genes are interconnected and regulate each other. These genes code for cell cycle regulators, cell adhesion molecules, transcription factors or signal transducers that modulate immune signaling, several genes being also associated with cancer progression and treatment response. These results indicate an altered immune signaling in the peripheral blood of triple negative breast cancer patients. The involvement of the immune system is necessary in favorable treatment response, therefore these results could explain the low response rates observed for triple negative breast cancer patients.

Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

Science.gov (United States)

Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

2017-10-01

During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.
Differential gene expression during Trypanosoma cruzi metacyclogenesis

Directory of Open Access Journals (Sweden)

Marco Aurelio Krieger

1999-09-01

Full Text Available The transformation of epimastigotes into metacyclic trypomastigotes involves changes in the pattern of expressed genes, resulting in important morphological and functional differences between these developmental forms of Trypanosoma cruzi. In order to identify and characterize genes involved in triggering the metacyclogenesis process and in conferring to metacyclic trypomastigotes their stage specific biological properties, we have developed a method allowing the isolation of genes specifically expressed when comparing two close related cell populations (representation of differential expression or RDE. The method is based on the PCR amplification of gene sequences selected by hybridizing and subtracting the populations in such a way that after some cycles of hybridization-amplification genes specific to a given population are highly enriched. The use of this method in the analysis of differential gene expression during T. cruzi metacyclogenesis (6 hr and 24 hr of differentiation and metacyclic trypomastigotes resulted in the isolation of several clones from each time point. Northern blot analysis showed that some genes are transiently expressed (6 hr and 24 hr differentiating cells, while others are present in differentiating cells and in metacyclic trypomastigotes. Nucleotide sequencing of six clones characterized so far showed that they do not display any homology to gene sequences available in the GeneBank.
Conditional gene expression in the mouse using a Sleeping Beauty gene-trap transposon

Directory of Open Access Journals (Sweden)

Hackett Perry B

2006-06-01

Full Text Available Abstract Background Insertional mutagenesis techniques with transposable elements have been popular among geneticists studying model organisms from E. coli to Drosophila and, more recently, the mouse. One such element is the Sleeping Beauty (SB transposon that has been shown in several studies to be an effective insertional mutagen in the mouse germline. SB transposon vector studies have employed different functional elements and reporter molecules to disrupt and report the expression of endogenous mouse genes. We sought to generate a transposon system that would be capable of reporting the expression pattern of a mouse gene while allowing for conditional expression of a gene of interest in a tissue- or temporal-specific pattern. Results Here we report the systematic development and testing of a transposon-based gene-trap system incorporating the doxycycline-repressible Tet-Off (tTA system that is capable of activating the expression of genes under control of a Tet response element (TRE promoter. We demonstrate that the gene trap system is fully functional in vitro by introducing the "gene-trap tTA" vector into human cells by transposition and identifying clones that activate expression of a TRE-luciferase transgene in a doxycycline-dependent manner. In transgenic mice, we mobilize gene-trap tTA vectors, discover parameters that can affect germline mobilization rates, and identify candidate gene insertions to demonstrate the in vivo functionality of the vector system. We further demonstrate that the gene-trap can act as a reporter of endogenous gene expression and it can be coupled with bioluminescent imaging to identify genes with tissue-specific expression patterns. Conclusion Akin to the GAL4/UAS system used in the fly, we have made progress developing a tool for mutating and revealing the expression of mouse genes by generating the tTA transactivator in the presence of a secondary TRE-regulated reporter molecule. A vector like the gene
A comparative gene expression database for invertebrates

Directory of Open Access Journals (Sweden)

Ormestad Mattias

2011-08-01

Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN projects.
Genetic Variants Contribute to Gene Expression Variability in Humans

Science.gov (United States)

Hulse, Amanda M.; Cai, James J.

2013-01-01

Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies focused on the mean of gene expression level, but not the variance of gene expression level (i.e., gene expression variability). In the present study, we systematically explore genome-wide association between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously fit the means and the variances of gene expression among the three possible genotypes of a biallelic SNP. The genomic loci showing significant association between the variances of gene expression and the genotypes are termed expression variability QTL (evQTL). Using a data set of gene expression in lymphoblastoid cell lines (LCLs) derived from 210 HapMap individuals, we identify cis-acting evQTL involving 218 distinct genes, among which 8 genes, ADCY1, CTNNA2, DAAM2, FERMT2, IL6, PLOD2, SNX7, and TNFRSF11B, are cross-validated using an extra expression data set of the same LCLs. We also identify ∼300 trans-acting evQTL between >13,000 common SNPs and 500 randomly selected representative genes. We employ two distinct scenarios, emphasizing single-SNP and multiple-SNP effects on expression variability, to explain the formation of evQTL. We argue that detecting evQTL may represent a novel method for effectively screening for genetic interactions, especially when the multiple-SNP influence on expression variability is implied. The implication of our results for revealing genetic mechanisms of gene expression variability is discussed. PMID:23150607
Correction of gene expression data

DEFF Research Database (Denmark)

Darbani Shirvanehdeh, Behrooz; Stewart, C. Neal, Jr.; Noeparvar, Shahin

2014-01-01

This report investigates for the first time the potential inter-treatment bias source of cell number for gene expression studies. Cell-number bias can affect gene expression analysis when comparing samples with unequal total cellular RNA content or with different RNA extraction efficiencies....... For maximal reliability of analysis, therefore, comparisons should be performed at the cellular level. This could be accomplished using an appropriate correction method that can detect and remove the inter-treatment bias for cell-number. Based on inter-treatment variations of reference genes, we introduce...
Gene expression in colorectal cancer

DEFF Research Database (Denmark)

Birkenkamp-Demtroder, Karin; Christensen, Lise Lotte; Olesen, Sanne Harder

2002-01-01

Understanding molecular alterations in colorectal cancer (CRC) is needed to define new biomarkers and treatment targets. We used oligonucleotide microarrays to monitor gene expression of about 6,800 known genes and 35,000 expressed sequence tags (ESTs) on five pools (four to six samples in each...... pool) of total RNA from left-sided sporadic colorectal carcinomas. We compared normal tissue to carcinoma tissue from Dukes' stages A-D (noninvasive to distant metastasis) and identified 908 known genes and 4,155 ESTs that changed remarkably from normal to tumor tissue. Based on intensive filtering 226...
Multiscale Embedded Gene Co-expression Network Analysis.

Directory of Open Access Journals (Sweden)

Won-Min Song

2015-11-01

Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Multiscale Embedded Gene Co-expression Network Analysis.

Science.gov (United States)

Song, Won-Min; Zhang, Bin

2015-11-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Common changes in global gene expression induced by RNA polymerase inhibitors in Shigella flexneri.

Directory of Open Access Journals (Sweden)

Hua Fu

Full Text Available Characterization of expression profile of organisms in response to antimicrobials provides important information on the potential mechanism of action of the drugs. The special expression signature can be used to predict whether other drugs act on the same target. Here, the common response of Shigella flexneri to two inhibitors of RNA polymerase was examined using gene expression profiling. Consistent with similar effects of the two drugs, the gene expression profiles indicated that responses of the bacteria to these drugs were roughly the same, with 225 genes affected commonly. Of them, 88 were induced and 137 were repressed. Real-time PCR was performed for selected genes to verify the microarray results. Analysis of the expression data revealed that more than 30% of the plasmid-encoded genes on the array were up-regulated by the antibiotics including virF regulon, other virulence-related genes, and genes responsible for plasmid replication, maintenance, and transfer. In addition, some chromosome-encoded genes involved in virulence and genes acquired from horizontal transfer were also significantly up-regulated. However, the expression of genes encoding the beta-subunit of RNA polymerase was increased moderately. The repressed genes include those that code for products associated with the ribosome, citrate cycle, glycolysis, thiamine biosynthesis, purine metabolism, fructose metabolism, mannose metabolism, and cold shock proteins. This study demonstrates that the two antibiotics induce rapid cessation of RNA synthesis resulting in inhibition of translation components. It also indicates that the production of virulence factors involved in intercellular dissemination, tissue invasion and inflammatory destruction may be enhanced through derepressing horizontal transfer genes by the drugs.
Kinetic models of gene expression including non-coding RNAs

Energy Technology Data Exchange (ETDEWEB)

Zhdanov, Vladimir P., E-mail: zhdanov@catalysis.r

2011-03-15

In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.
Gene expression profile associated with superimposed non-alcoholic fatty liver disease and hepatic fibrosis in patients with chronic hepatitis C.

Science.gov (United States)

Younossi, Zobair M; Afendy, Arian; Stepanova, Maria; Hossain, Noreen; Younossi, Issah; Ankrah, Kathy; Gramlich, Terry; Baranova, Ancha

2009-10-01

Hepatic steatosis occurs in 40-70% of patients chronically infected with hepatitis C virus [chronic hepatitis C (CH-C)]. Hepatic steatosis in CH-C is associated with progressive liver disease and a low response rate to antiviral therapy. Gene expression profiles were examined in CH-C patients with and without hepatic steatosis, non-alcoholic steatohepatitis (NASH) and fibrosis. This study included 65 CH-C patients who were not receiving antiviral treatment. Total RNA was extracted from peripheral blood mononuclear cells, quantified and used for one-step reverse transcriptase-polymerase chain reaction to profile 153 mRNAs that were normalized with six 'housekeeping' genes and a reference RNA. Multiple regression and stepwise selection assessed differences in gene expression and the models' performances were evaluated. Models predicting the grade of hepatic steatosis in patients with CH-C genotype 3 involved two genes: SOCS1 and IFITM1, which progressively changed their expression level with the increasing grade of steatosis. On the other hand, models predicting hepatic steatosis in non-genotype 3 patients highlighted MIP-1 cytokine encoding genes: CCL3 and CCL4 as well as IFNAR and PRKRIR. Expression levels of PRKRIR and SMAD3 differentiated patients with and without superimposed NASH only in the non-genotype 3 cohort (area under the receiver operating characteristic curve=0.822, P-value 0.006]. Gene expression signatures related to hepatic fibrosis were not genotype specific. Gene expression might predict moderate to severe hepatic steatosis, NASH and fibrosis in patients with CH-C, providing potential insights into the pathogenesis of hepatic steatosis and fibrosis in these patients.
Predicting acute cardiac rejection from donor heart and pre-transplant recipient blood gene expression.

Science.gov (United States)

Hollander, Zsuzsanna; Chen, Virginia; Sidhu, Keerat; Lin, David; Ng, Raymond T; Balshaw, Robert; Cohen-Freue, Gabriela V; Ignaszewski, Andrew; Imai, Carol; Kaan, Annemarie; Tebbutt, Scott J; Wilson-McManus, Janet E; McMaster, Robert W; Keown, Paul A; McManus, Bruce M

2013-02-01

Acute rejection in cardiac transplant patients remains a contributory factor to limited survival of implanted hearts. Currently, there are no biomarkers in clinical use that can predict, at the time of transplantation, the likelihood of post-transplant acute cellular rejection. Such a development would be of great value in personalizing immunosuppressive treatment. Recipient age, donor age, cold ischemic time, warm ischemic time, panel-reactive antibody, gender mismatch, blood type mismatch and human leukocyte antigens (HLA-A, -B and -DR) mismatch between recipients and donors were tested in 53 heart transplant patients for their power to predict post-transplant acute cellular rejection. Donor transplant biopsy and recipient pre-transplant blood were also examined for the presence of genomic biomarkers in 7 rejection and 11 non-rejection patients, using non-targeted data mining techniques. The biomarker based on the 8 clinical variables had an area under the receiver operating characteristic curve (AUC) of 0.53. The pre-transplant recipient blood gene-based panel did not yield better performance, but the donor heart tissue gene-based panel had an AUC = 0.78. A combination of 25 probe sets from the transplant donor biopsy and 18 probe sets from the pre-transplant recipient whole blood had an AUC = 0.90. Biologic pathways implicated include VEGF- and EGFR-signaling, and MAPK. Based on this study, the best predictive biomarker panel contains genes from recipient whole blood and donor myocardial tissue. This panel provides clinically relevant prediction power and, if validated, may personalize immunosuppressive treatment and rejection monitoring. Copyright © 2013 International Society for Heart and Lung Transplantation. Published by Elsevier Inc. All rights reserved.
The evolution of gene expression in primates

OpenAIRE

Tashakkori Ghanbarian, Avazeh

2015-01-01

The evolution of a gene’s expression profile is commonly assumed to be independent of its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between expression of neighboring genes in extant taxa. Indeed, in all eukaryotic genomes, genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their e...
Early pregnancy peripheral blood gene expression and risk of preterm delivery: a nested case control study

Directory of Open Access Journals (Sweden)

Muhie Seid Y

2009-12-01

Full Text Available Abstract Background Preterm delivery (PTD is a significant public health problem associated with greater risk of mortality and morbidity in infants and mothers. Pathophysiologic processes that may lead to PTD start early in pregnancy. We investigated early pregnancy peripheral blood global gene expression and PTD risk. Methods As part of a prospective study, ribonucleic acid was extracted from blood samples (collected at 16 weeks gestational age from 14 women who had PTD (cases and 16 women who delivered at term (controls. Gene expressions were measured using the GeneChip® Human Genome U133 Plus 2.0 Array. Student's T-test and fold change analysis were used to identify differentially expressed genes. We used hierarchical clustering and principle components analysis to characterize signature gene expression patterns among cases and controls. Pathway and promoter sequence analyses were used to investigate functions and functional relationships as well as regulatory regions of differentially expressed genes. Results A total of 209 genes, including potential candidate genes (e.g. PTGDS, prostaglandin D2 synthase 21 kDa, were differentially expressed. A set of these genes achieved accurate pre-diagnostic separation of cases and controls. These genes participate in functions related to immune system and inflammation, organ development, metabolism (lipid, carbohydrate and amino acid and cell signaling. Binding sites of putative transcription factors such as EGR1 (early growth response 1, TFAP2A (transcription factor AP2A, Sp1 (specificity protein 1 and Sp3 (specificity protein 3 were over represented in promoter regions of differentially expressed genes. Real-time PCR confirmed microarray expression measurements of selected genes. Conclusions PTD is associated with maternal early pregnancy peripheral blood gene expression changes. Maternal early pregnancy peripheral blood gene expression patterns may be useful for better understanding of PTD
Analysis of gene expression profile microarray data in complex regional pain syndrome.

Science.gov (United States)

Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

2017-09-01

The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
Expression changes in the stroma of prostate cancer predict subsequent relapse.

Directory of Open Access Journals (Sweden)

Zhenyu Jia

Full Text Available Biomarkers are needed to address overtreatment that occurs for the majority of prostate cancer patients that would not die of the disease but receive radical treatment. A possible barrier to biomarker discovery may be the polyclonal/multifocal nature of prostate tumors as well as cell-type heterogeneity between patient samples. Tumor-adjacent stroma (tumor microenvironment is less affected by genetic alteration and might therefore yield more consistent biomarkers in response to tumor aggressiveness. To this end we compared Affymetrix gene expression profiles in stroma near tumor and identified a set of 115 probe sets for which the expression levels were significantly correlated with time-to-relapse. We also compared patients that chemically relapsed shortly after prostatectomy (<1 year, and patients that did not relapse in the first four years after prostatectomy. We identified 131 differentially expressed microarray probe sets between these two categories. 19 probe sets (15 genes overlapped between the two gene lists with p<0.0001. We developed a PAM-based classifier by training on samples containing stroma near tumor: 9 rapid relapse patient samples and 9 indolent patient samples. We then tested the classifier on 47 different samples, containing 90% or more stroma. The classifier predicted the risk status of patients with an average accuracy of 87%. This is the first general tumor microenvironment-based prognostic classifier. These results indicate that the prostate cancer microenvironment exhibits reproducible changes useful for predicting outcomes for patients.
Expression of DNA repair genes in ovarian cancer samples: biological and clinical considerations.

Science.gov (United States)

Ganzinelli, M; Mariani, P; Cattaneo, D; Fossati, R; Fruscio, R; Corso, S; Ricci, F; Broggini, M; Damia, G

2011-05-01

The purpose of this study was to investigate retrospectively the mRNA expression of genes involved in different DNA repair pathways implicated in processing platinum-induced damage in 171 chemotherapy-naïve ovarian tumours and correlate the expression of the different genes with clinical parameters. The expression of genes involved in DNA repair pathways (PARP1, ERCC1, XPA, XPF, XPG, BRCA1, FANCA, FANCC, FANCD2, FANCF and PolEta), and in DNA damage transduction (Chk1 and Claspin) was measured by RT-PCR in 13 stage I borderline and 77 stage I and 88 III ovarian carcinomas. ERCC1, XPA, XPF and XPG genes were significantly less expressed in stage III than in stage I carcinoma; BRCA1, FANCA, FANCC, FANCD2 gene expressions were low in borderline tumours, higher in stage I carcinomas and lower in stage III samples. High levels of ERCC1, XPA, FANCC, XPG and PolEta correlated with an increase in Overall Survival (OS) and Progression Free Survival (PFS), whilst high BRCA1 levels were associated with PFS on univariate analysis. With multivariate analyses no genes retained an association when adjusted by stage, grade and residual tumour. A tendency towards a better PFS was observed in patients with the highest level of ERCC1 and BRCA1 after platinum-based therapy than those given both platinum and taxol. The expression of DNA repair genes differed in borderline stage I, stage I and stage III ovarian carcinomas. The role of DNA repair genes in predicting the response in ovarian cancer patients seems far from being established. Copyright © 2010 Elsevier Ltd. All rights reserved.
Genome-wide identification and expression analysis of NBS-encoding genes in Malus x domestica and expansion of NBS genes family in Rosaceae.

Directory of Open Access Journals (Sweden)

Preeti Arya

Full Text Available Nucleotide binding site leucine-rich repeats (NBS-LRR disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR and coiled coil (CC (1 ∶ 1 was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple.
Genome-wide identification and expression analysis of NBS-encoding genes in Malus x domestica and expansion of NBS genes family in Rosaceae.

Science.gov (United States)

Arya, Preeti; Kumar, Gulshan; Acharya, Vishal; Singh, Anil K

2014-01-01

Nucleotide binding site leucine-rich repeats (NBS-LRR) disease resistance proteins play an important role in plant defense against pathogen attack. A number of recent studies have been carried out to identify and characterize NBS-LRR gene families in many important plant species. In this study, we identified NBS-LRR gene family comprising of 1015 NBS-LRRs using highly stringent computational methods. These NBS-LRRs were characterized on the basis of conserved protein motifs, gene duplication events, chromosomal locations, phylogenetic relationships and digital gene expression analysis. Surprisingly, equal distribution of Toll/interleukin-1 receptor (TIR) and coiled coil (CC) (1 ∶ 1) was detected in apple while the unequal distribution was reported in majority of all other known plant genome studies. Prediction of gene duplication events intriguingly revealed that not only tandem duplication but also segmental duplication may equally be responsible for the expansion of the apple NBS-LRR gene family. Gene expression profiling using expressed sequence tags database of apple and quantitative real-time PCR (qRT-PCR) revealed the expression of these genes in wide range of tissues and disease conditions, respectively. Taken together, this study will provide a blueprint for future efforts towards improvement of disease resistance in apple.

Identification and expression analysis of cold and freezing stress responsive genes of Brassica oleracea.

Science.gov (United States)

Ahmed, Nasar Uddin; Jung, Hee-Jeong; Park, Jong-In; Cho, Yong-Gu; Hur, Yoonkang; Nou, Ill-Sup

2015-01-10

Cold and freezing stress is a major environmental constraint to the production of Brassica crops. Enhancement of tolerance by exploiting cold and freezing tolerance related genes offers the most efficient approach to address this problem. Cold-induced transcriptional profiling is a promising approach to the identification of potential genes related to cold and freezing stress tolerance. In this study, 99 highly expressed genes were identified from a whole genome microarray dataset of Brassica rapa. Blast search analysis of the Brassica oleracea database revealed the corresponding homologous genes. To validate their expression, pre-selected cold tolerant and susceptible cabbage lines were analyzed. Out of 99 BoCRGs, 43 were differentially expressed in response to varying degrees of cold and freezing stress in the contrasting cabbage lines. Among the differentially expressed genes, 18 were highly up-regulated in the tolerant lines, which is consistent with their microarray expression. Additionally, 12 BoCRGs were expressed differentially after cold stress treatment in two contrasting cabbage lines, and BoCRG54, 56, 59, 62, 70, 72 and 99 were predicted to be involved in cold regulatory pathways. Taken together, the cold-responsive genes identified in this study provide additional direction for elucidating the regulatory network of low temperature stress tolerance and developing cold and freezing stress resistant Brassica crops. Copyright © 2014 Elsevier B.V. All rights reserved.
Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

International Nuclear Information System (INIS)

Salem, Tamer Z.; Zhang, Fengrui; Thiem, Suzanne M.

2013-01-01

Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.
Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

Energy Technology Data Exchange (ETDEWEB)

Salem, Tamer Z. [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbial Molecular Biology, AGERI, Agricultural Research Center, Giza 12619 (Egypt); Division of Biomedical Sciences, Zewail University, Zewail City of Science and Technology, Giza 12588 (Egypt); Zhang, Fengrui [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Thiem, Suzanne M., E-mail: smthiem@msu.edu [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824 (United States)

2013-01-20

Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.
Integrated analyses of microRNAs demonstrate their widespread influence on gene expression in high-grade serous ovarian carcinoma.

Science.gov (United States)

Creighton, Chad J; Hernandez-Herrera, Anadulce; Jacobsen, Anders; Levine, Douglas A; Mankoo, Parminder; Schultz, Nikolaus; Du, Ying; Zhang, Yiqun; Larsson, Erik; Sheridan, Robert; Xiao, Weimin; Spellman, Paul T; Getz, Gad; Wheeler, David A; Perou, Charles M; Gibbs, Richard A; Sander, Chris; Hayes, D Neil; Gunaratne, Preethi H

2012-01-01

The Cancer Genome Atlas (TCGA) Network recently comprehensively catalogued the molecular aberrations in 487 high-grade serous ovarian cancers, with much remaining to be elucidated regarding the microRNAs (miRNAs). Here, using TCGA ovarian data, we surveyed the miRNAs, in the context of their predicted gene targets. Integration of miRNA and gene patterns yielded evidence that proximal pairs of miRNAs are processed from polycistronic primary transcripts, and that intronic miRNAs and their host gene mRNAs derive from common transcripts. Patterns of miRNA expression revealed multiple tumor subtypes and a set of 34 miRNAs predictive of overall patient survival. In a global analysis, miRNA:mRNA pairs anti-correlated in expression across tumors showed a higher frequency of in silico predicted target sites in the mRNA 3'-untranslated region (with less frequency observed for coding sequence and 5'-untranslated regions). The miR-29 family and predicted target genes were among the most strongly anti-correlated miRNA:mRNA pairs; over-expression of miR-29a in vitro repressed several anti-correlated genes (including DNMT3A and DNMT3B) and substantially decreased ovarian cancer cell viability. This study establishes miRNAs as having a widespread impact on gene expression programs in ovarian cancer, further strengthening our understanding of miRNA biology as it applies to human cancer. As with gene transcripts, miRNAs exhibit high diversity reflecting the genomic heterogeneity within a clinically homogeneous disease population. Putative miRNA:mRNA interactions, as identified using integrative analysis, can be validated. TCGA data are a valuable resource for the identification of novel tumor suppressive miRNAs in ovarian as well as other cancers.
Combining gene prediction methods to improve metagenomic gene annotation

Directory of Open Access Journals (Sweden)

Rosen Gail L

2011-01-01

Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.
A gene expression signature of Retinoblastoma loss-of-function predicts resistance to neoadjuvant chemotherapy in ER-positive/HER2-positive breast cancer patients.

Science.gov (United States)

Risi, Emanuela; Grilli, Andrea; Migliaccio, Ilenia; Biagioni, Chiara; McCartney, Amelia; Guarducci, Cristina; Bonechi, Martina; Benelli, Matteo; Vitale, Stefania; Biganzoli, Laura; Bicciato, Silvio; Di Leo, Angelo; Malorni, Luca

2018-07-01

HER2-positive (HER2+) breast cancers show heterogeneous response to chemotherapy, with the ER-positive (ER+) subgroup deriving less benefit. Loss of retinoblastoma tumor suppressor gene (RB1) function has been suggested as a cardinal feature of breast cancers that are more sensitive to chemotherapy and conversely resistant to CDK4/6 inhibitors. We performed a retrospective analysis exploring RBsig, a gene signature of RB loss, as a potential predictive marker of response to neoadjuvant chemotherapy in ER+/HER2+ breast cancer patients. We selected clinical trials of neoadjuvant chemotherapy ± anti-HER2 therapy in HER2+ breast cancer patients with available information on gene expression data, hormone receptor status, and pathological complete response (pCR) rates. RBsig expression was computed in silico and correlated with pCR. Ten studies fulfilled the inclusion criteria and were included in the analysis (514 patients). Overall, of 211 ER+/HER2+ breast cancer patients, 49 achieved pCR (23%). The pCR rate following chemotherapy ± anti-HER2 drugs in patients with RBsig low expression was significantly lower compared to patients with RBsig high expression (16% vs. 30%, respectively; Fisher's exact test p = 0.015). The area under the ROC curve (AUC) was 0.62 (p = 0.005). In the 303 ER-negative (ER-)/HER2+ patients treated with chemotherapy ± anti-HER2 drugs, the pCR rate was 43%. No correlation was found between RBsig expression and pCR rate in this group. Low expression of RBsig identifies a subset of ER+/HER2+ patients with low pCR rates following neoadjuvant chemotherapy ± anti-HER2 therapy. These patients may potentially be spared chemotherapy in favor of anti-HER2, endocrine therapy, and CDK 4/6 inhibitor combinations.
Verification of predicted alternatively spliced Wnt genes reveals two new splice variants (CTNNB1 and LRP5 and altered Axin-1 expression during tumour progression

Directory of Open Access Journals (Sweden)

Reich Jens G

2006-06-01

Full Text Available Abstract Background Splicing processes might play a major role in carcinogenesis and tumour progression. The Wnt pathway is of crucial relevance for cancer progression. Therefore we focussed on the Wnt/β-catenin signalling pathway in order to validate the expression of sequences predicted as alternatively spliced by bioinformatic methods. Splice variants of its key molecules were selected, which may be critical components for the understanding of colorectal tumour progression and may have the potential to act as biological markers. For some of the Wnt pathway genes the existence of splice variants was either proposed (e.g. β-Catenin and CTNNB1 or described only in non-colon tissues (e.g. GSK3β or hitherto not published (e.g. LRP5. Results Both splice variants – normal and alternative form – of all selected Wnt pathway components were found to be expressed in cell lines as well as in samples derived from tumour, normal and healthy tissues. All splice positions corresponded totally with the bioinformatical prediction as shown by sequencing. Two hitherto not described alternative splice forms (CTNNB1 and LRP5 were detected. Although the underlying EST data used for the bioinformatic analysis suggested a tumour-specific expression neither a qualitative nor a significant quantitative difference between the expression in tumour and healthy tissues was detected. Axin-1 expression was reduced in later stages and in samples from carcinomas forming distant metastases. Conclusion We were first to describe that splice forms of crucial genes of the Wnt-pathway are expressed in human colorectal tissue. Newly described splicefoms were found for β-Catenin, LRP5, GSK3β, Axin-1 and CtBP1. However, the predicted cancer specificity suggested by the origin of the underlying ESTs was neither qualitatively nor significant quantitatively confirmed. That let us to conclude that EST sequence data can give adequate hints for the existence of alternative splicing
Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array

Directory of Open Access Journals (Sweden)

Sugnet Charles

2006-12-01

Full Text Available Abstract Background Alternative splicing is a mechanism for increasing protein diversity by excluding or including exons during post-transcriptional processing. Alternatively spliced proteins are particularly relevant in oncology since they may contribute to the etiology of cancer, provide selective drug targets, or serve as a marker set for cancer diagnosis. While conventional identification of splice variants generally targets individual genes, we present here a new exon-centric array (GeneChip Human Exon 1.0 ST that allows genome-wide identification of differential splice variation, and concurrently provides a flexible and inclusive analysis of gene expression. Results We analyzed 20 paired tumor-normal colon cancer samples using a microarray designed to detect over one million putative exons that can be virtually assembled into potential gene-level transcripts according to various levels of prior supporting evidence. Analysis of high confidence (empirically supported transcripts identified 160 differentially expressed genes, with 42 genes occupying a network impacting cell proliferation and another twenty nine genes with unknown functions. A more speculative analysis, including transcripts based solely on computational prediction, produced another 160 differentially expressed genes, three-fourths of which have no previous annotation. We also present a comparison of gene signal estimations from the Exon 1.0 ST and the U133 Plus 2.0 arrays. Novel splicing events were predicted by experimental algorithms that compare the relative contribution of each exon to the cognate transcript intensity in each tissue. The resulting candidate splice variants were validated with RT-PCR. We found nine genes that were differentially spliced between colon tumors and normal colon tissues, several of which have not been previously implicated in cancer. Top scoring candidates from our analysis were also found to substantially overlap with EST-based bioinformatic
Widespread ectopic expression of olfactory receptor genes

Directory of Open Access Journals (Sweden)

Yanai Itai

2006-05-01

Full Text Available Abstract Background Olfactory receptors (ORs are the largest gene family in the human genome. Although they are expected to be expressed specifically in olfactory tissues, some ectopic expression has been reported, with special emphasis on sperm and testis. The present study systematically explores the expression patterns of OR genes in a large number of tissues and assesses the potential functional implication of such ectopic expression. Results We analyzed the expression of hundreds of human and mouse OR transcripts, via EST and microarray data, in several dozens of human and mouse tissues. Different tissues had specific, relatively small OR gene subsets which had particularly high expression levels. In testis, average expression was not particularly high, and very few highly expressed genes were found, none corresponding to ORs previously implicated in sperm chemotaxis. Higher expression levels were more common for genes with a non-OR genomic neighbor. Importantly, no correlation in expression levels was detected for human-mouse orthologous pairs. Also, no significant difference in expression levels was seen between intact and pseudogenized ORs, except for the pseudogenes of subfamily 7E which has undergone a human-specific expansion. Conclusion The OR superfamily as a whole, show widespread, locus-dependent and heterogeneous expression, in agreement with a neutral or near neutral evolutionary model for transcription control. These results cannot reject the possibility that small OR subsets might play functional roles in different tissues, however considerable care should be exerted when offering a functional interpretation for ectopic OR expression based only on transcription information.
Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

Science.gov (United States)

Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

2013-01-01

The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
Functional redundancy and/or ongoing pseudogenization among F-box protein genes expressed in Arabidopsis male gametophyte.

Science.gov (United States)

Ikram, Sobia; Durandet, Monique; Vesa, Simona; Pereira, Serge; Guerche, Philippe; Bonhomme, Sandrine

2014-06-01

F-box protein genes family is one of the largest gene families in plants, with almost 700 predicted genes in the model plant Arabidopsis. F-box proteins are key components of the ubiquitin proteasome system that allows targeted protein degradation. Transcriptome analyses indicate that half of these F-box protein genes are found expressed in microspore and/or pollen, i.e., during male gametogenesis. To assess the role of F-box protein genes during this crucial developmental step, we selected 34 F-box protein genes recorded as highly and specifically expressed in pollen and isolated corresponding insertion mutants. We checked the expression level of each selected gene by RT-PCR and confirmed pollen expression for 25 genes, but specific expression for only 10 of the 34 F-box protein genes. In addition, we tested the expression level of selected F-box protein genes in 24 mutant lines and showed that 11 of them were null mutants. Transmission analysis of the mutations to the progeny showed that none of the single mutations was gametophytic lethal. These unaffected transmission efficiencies suggested leaky mutations or functional redundancy among F-box protein genes. Cytological observation of the gametophytes in the mutants confirmed these results. Combinations of mutations in F-box protein genes from the same subfamily did not lead to transmission defect either, further highlighting functional redundancy and/or a high proportion of pseudogenes among these F-box protein genes.
Gene expression changes in peripheral blood mononuclear cells in occupational exposure to nickel.

Science.gov (United States)

Bonin, Serena; Larese, Francesca Filon; Trevisan, Giusto; Avian, Andrea; Rui, Francesca; Stanta, Giorgio; Bovenzi, Massimo

2011-02-01

Allergic contact dermatitis is preceded by a clinically silent phase of sensitisation. In this study, we investigated whether the expression levels of six genes were related to nickel exposure and/or nickel sensitisation, and whether they could predict allergic manifestations to nickel. The mRNA expression level of six genes involved in cell growth (PIM1 and ETS2), metabolism/synthesis (HSD11B1 and PRDX4), apoptosis (CASP8) and signal transduction (CISH) was investigated by means of quantitative real-time RT-PCR in a cohort of 110 subjects, including healthy controls (n=51), nickel-exposed workers (n=23) and patients allergic to nickel (n=36). Our findings show that the expression levels of the analysed genes did not differ between allergic patients and healthy controls, while higher expression levels of ETS2 and CASP8 were detected in the nickel-exposed workers. Changes in ETS2 and CASP8 expression are likely to be related to nickel exposure rather than to allergy. © 2011 John Wiley & Sons A/S.
Dynamic association rules for gene expression data analysis.

Science.gov (United States)

Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

2015-10-14

The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed
Gene expression in periodontal tissues following treatment

Directory of Open Access Journals (Sweden)

Eisenacher Martin

2008-07-01

Full Text Available Abstract Background In periodontitis, treatment aimed at controlling the periodontal biofilm infection results in a resolution of the clinical and histological signs of inflammation. Although the cell types found in periodontal tissues following treatment have been well described, information on gene expression is limited to few candidate genes. Therefore, the aim of the study was to determine the expression profiles of immune and inflammatory genes in periodontal tissues from sites with severe chronic periodontitis following periodontal therapy in order to identify genes involved in tissue homeostasis. Gingival biopsies from 12 patients with severe chronic periodontitis were taken six to eight weeks following non-surgical periodontal therapy, and from 11 healthy controls. As internal standard, RNA of an immortalized human keratinocyte line (HaCaT was used. Total RNA was subjected to gene expression profiling using a commercially available microarray system focusing on inflammation-related genes. Post-hoc confirmation of selected genes was done by Realtime-PCR. Results Out of the 136 genes analyzed, the 5% most strongly expressed genes compared to healthy controls were Interleukin-12A (IL-12A, Versican (CSPG-2, Matrixmetalloproteinase-1 (MMP-1, Down syndrome critical region protein-1 (DSCR-1, Macrophage inflammatory protein-2β (Cxcl-3, Inhibitor of apoptosis protein-1 (BIRC-1, Cluster of differentiation antigen 38 (CD38, Regulator of G-protein signalling-1 (RGS-1, and Finkel-Biskis-Jinkins murine osteosarcoma virus oncogene (C-FOS; the 5% least strongly expressed genes were Receptor-interacting Serine/Threonine Kinase-2 (RIP-2, Complement component 3 (C3, Prostaglandin-endoperoxide synthase-2 (COX-2, Interleukin-8 (IL-8, Endothelin-1 (EDN-1, Plasminogen activator inhibitor type-2 (PAI-2, Matrix-metalloproteinase-14 (MMP-14, and Interferon regulating factor-7 (IRF-7. Conclusion Gene expression profiles found in periodontal tissues following
Gene expression profiles in skeletal muscle after gene electrotransfer

DEFF Research Database (Denmark)

Hojman, Pernille; Zibert, John R; Gissel, Hanne

2007-01-01

BACKGROUND: Gene transfer by electroporation (DNA electrotransfer) to muscle results in high level long term transgenic expression, showing great promise for treatment of e.g. protein deficiency syndromes. However little is known about the effects of DNA electrotransfer on muscle fibres. We have...... caused down-regulation of structural proteins e.g. sarcospan and catalytic enzymes. Injection of DNA induced down-regulation of intracellular transport proteins e.g. sentrin. The effects on muscle fibres were transient as the expression profiles 3 weeks after treatment were closely related......) followed by a long low voltage pulse (LV, 100 V/cm, 400 ms); a pulse combination optimised for efficient and safe gene transfer. Muscles were transfected with green fluorescent protein (GFP) and excised at 4 hours, 48 hours or 3 weeks after treatment. RESULTS: Differentially expressed genes were...
Gene expression patterns in CD4+ peripheral blood cells in healthy subjects and stage IV melanoma patients.

Science.gov (United States)

Felts, Sara J; Van Keulen, Virginia P; Scheid, Adam D; Allen, Kathleen S; Bradshaw, Renee K; Jen, Jin; Peikert, Tobias; Middha, Sumit; Zhang, Yuji; Block, Matthew S; Markovic, Svetomir N; Pease, Larry R

2015-11-01

Melanoma patients exhibit changes in immune responsiveness in the local tumor environment, draining lymph nodes, and peripheral blood. Immune-targeting therapies are revolutionizing melanoma patient care increasingly, and studies show that patients derive clinical benefit from these newer agents. Nonetheless, predicting which patients will benefit from these costly therapies remains a challenge. In an effort to capture individual differences in immune responsiveness, we are analyzing patterns of gene expression in human peripheral blood cells using RNAseq. Focusing on CD4+ peripheral blood cells, we describe multiple categories of immune regulating genes, which are expressed in highly ordered patterns shared by cohorts of healthy subjects and stage IV melanoma patients. Despite displaying conservation in overall transcriptome structure, CD4+ peripheral blood cells from melanoma patients differ quantitatively from healthy subjects in the expression of more than 2000 genes. Moreover, 1300 differentially expressed genes are found in transcript response patterns following activation of CD4+ cells ex vivo, suggesting that widespread functional discrepancies differentiate the immune systems of healthy subjects and melanoma patients. While our analysis reveals that the transcriptome architecture characteristic of healthy subjects is maintained in cancer patients, the genes expressed differentially among individuals and across cohorts provide opportunities for understanding variable immune states as well as response potentials, thus establishing a foundation for predicting individual responses to stimuli such as immunotherapeutic agents.
Comparative gene expression between two yeast species

Directory of Open Access Journals (Sweden)

Guan Yuanfang

2013-01-01

Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.
Bioinformatic prediction and functional characterization of human KIAA0100 gene

Directory of Open Access Journals (Sweden)

He Cui

2017-02-01

Full Text Available Our previous study demonstrated that human KIAA0100 gene was a novel acute monocytic leukemia-associated antigen (MLAA gene. But the functional characterization of human KIAA0100 gene has remained unknown to date. Here, firstly, bioinformatic prediction of human KIAA0100 gene was carried out using online softwares; Secondly, Human KIAA0100 gene expression was downregulated by the clustered regularly interspaced short palindromic repeats (CRISPR/CRISPR-associated (Cas 9 system in U937 cells. Cell proliferation and apoptosis were next evaluated in KIAA0100-knockdown U937 cells. The bioinformatic prediction showed that human KIAA0100 gene was located on 17q11.2, and human KIAA0100 protein was located in the secretory pathway. Besides, human KIAA0100 protein contained a signalpeptide, a transmembrane region, three types of secondary structures (alpha helix, extended strand, and random coil , and four domains from mitochondrial protein 27 (FMP27. The observation on functional characterization of human KIAA0100 gene revealed that its downregulation inhibited cell proliferation, and promoted cell apoptosis in U937 cells. To summarize, these results suggest human KIAA0100 gene possibly comes within mitochondrial genome; moreover, it is a novel anti-apoptotic factor related to carcinogenesis or progression in acute monocytic leukemia, and may be a potential target for immunotherapy against acute monocytic leukemia.
Transcription factor binding site enrichment analysis predicts drivers of altered gene expression in nonalcoholic steatohepatitis

Czech Academy of Sciences Publication Activity Database

Lake, A.D.; Chaput, A.L.; Novák, Petr; Cherrington, N.J.; Smith, C.L.

2016-01-01

Roč. 122, December 15 (2016), s. 62-71 ISSN 0006-2952 Institutional support: RVO:60077344 Keywords : Transcription factor * Liver * Gene expression * Bioinformatics Subject RIV: CE - Biochemistry Impact factor: 4.581, year: 2016
Interactive visualization of gene regulatory networks with associated gene expression time series data

NARCIS (Netherlands)

Westenberg, M.A.; Hijum, van S.A.F.T.; Lulko, A.T.; Kuipers, O.P.; Roerdink, J.B.T.M.; Linsen, L.; Hagen, H.; Hamann, B.

2008-01-01

We present GENeVis, an application to visualize gene expression time series data in a gene regulatory network context. This is a network of regulator proteins that regulate the expression of their respective target genes. The networks are represented as graphs, in which the nodes represent genes,

Effects of sample size on robustness and prediction accuracy of a prognostic gene signature

Directory of Open Access Journals (Sweden)

Kim Seon-Young

2009-05-01

Full Text Available Abstract Background Few overlap between independently developed gene signatures and poor inter-study applicability of gene signatures are two of major concerns raised in the development of microarray-based prognostic gene signatures. One recent study suggested that thousands of samples are needed to generate a robust prognostic gene signature. Results A data set of 1,372 samples was generated by combining eight breast cancer gene expression data sets produced using the same microarray platform and, using the data set, effects of varying samples sizes on a few performances of a prognostic gene signature were investigated. The overlap between independently developed gene signatures was increased linearly with more samples, attaining an average overlap of 16.56% with 600 samples. The concordance between predicted outcomes by different gene signatures also was increased with more samples up to 94.61% with 300 samples. The accuracy of outcome prediction also increased with more samples. Finally, analysis using only Estrogen Receptor-positive (ER+ patients attained higher prediction accuracy than using both patients, suggesting that sub-type specific analysis can lead to the development of better prognostic gene signatures Conclusion Increasing sample sizes generated a gene signature with better stability, better concordance in outcome prediction, and better prediction accuracy. However, the degree of performance improvement by the increased sample size was different between the degree of overlap and the degree of concordance in outcome prediction, suggesting that the sample size required for a study should be determined according to the specific aims of the study.
Serial analysis of gene expression (SAGE)

NARCIS (Netherlands)

van Ruissen, Fred; Baas, Frank

2007-01-01

In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE
Characterization and Expression Analysis of a Retinoblastoma-Related Gene from Chinese Wild Vitis pseudoreticulata.

Science.gov (United States)

Wen, Zhifeng; Gao, Min; Jiao, Chen; Wang, Qian; Xu, Hui; Walter, Monika; Xu, Weirong; Bassett, Carole; Wang, Xiping

2012-01-01

Retinoblastoma-related (RBR) genes, a conserved gene family in higher eukaryotes, play important roles in cell differentiation, development, and mammalian cell death; however, little is known of their function in plants. In this study, a RBR gene was isolated from the Chinese wild grape, Vitis pseudoreticulata W. T. Wang clone "Baihe-35-1", and designated as VpRBR . The cDNA sequence of VpRBR was 3,030 bp and contained an open reading frame of 3,024 bp. Conceptual translation of this gene indicated a composition of 1,007 amino acids with a predicted molecular mass of 117.3 kDa. The predicted protein showed a retinoblastoma-associated protein domain A from amino acid residues 416 to 579, and domain B from residues 726 to 855. The result of expression analysis indicated that VpRBR was expressed in tissues, leaves, stem, tendrils, flower, and grape skin at different expression levels. Further quantitative reverse transcription-PCR (qRT-PCR) data indicated that VpRBR levels were higher in Erysiphe necator-treated "Baihe-35-1" and "Baihe-13-1", two resistant clones of Chinese wild V. pseudoreticulata , than in E. necator-treated "Hunan-1", a susceptible clone of V. pseudoreticulata . Furthermore, the expression of VpRBR in response to salicylic acid (SA), methyl jasmonate (MeJA), and ethylene (Eth) in grape leaves was also investigated. Taken together, these data indicate that VpRBR may contribute to some aspect of powdery mildew resistance in grape.
An Interactive Database of Cocaine-Responsive Gene Expression

Directory of Open Access Journals (Sweden)

Willard M. Freeman

2002-01-01

Full Text Available The postgenomic era of large-scale gene expression studies is inundating drug abuse researchers and many other scientists with findings related to gene expression. This information is distributed across many different journals, and requires laborious literature searches. Here, we present an interactive database that combines existing information related to cocaine-mediated changes in gene expression in an easy-to-use format. The database is limited to statistically significant changes in mRNA or protein expression after cocaine administration. The Flash-based program is integrated into a Web page, and organizes changes in gene expression based on neuroanatomical region, general function, and gene name. Accompanying each gene is a description of the gene, links to the original publications, and a link to the appropriate OMIM (Online Mendelian Inheritance in Man entry. The nature of this review allows for timely modifications and rapid inclusion of new publications, and should help researchers build second-generation hypotheses on the role of gene expression changes in the physiology and behavior of cocaine abuse. Furthermore, this method of organizing large volumes of scientific information can easily be adapted to assist researchers in fields outside of drug abuse.
CDX2 gene expression in acute lymphoblastic leukemia

International Nuclear Information System (INIS)

Arnaoaut, H.H.; Mokhtar, D.A.; Samy, R.M.; Omar, Sh.A.; Khames, S.A.

2014-01-01

CDX genes are classically known as regulators of axial elongation during early embryogenesis. An unsuspected role for CDX genes has been revealed during hematopoietic development. The CDX gene family member CDX2 belongs to the most frequent aberrantly expressed proto-oncogenes in human acute leukemias and is highly leukemogenic in experimental models. We used reversed transcriptase polymerase chain reaction (RT-PCR) to determine the expression level of CDX2 gene in 30 pediatric patients with acute lymphoblastic leukemia (ALL) at diagnosis and 30 healthy volunteers. ALL patients were followed up to detect minimal residual disease (MRD) on days 15 and 42 of induction. We found that CDX2 gene was expressed in 50% of patients and not expressed in controls. Associations between gene expression and different clinical and laboratory data of patients revealed no impact on different findings. With follow up, we could not confirm that CDX2 expression had a prognostic significance.
Identification of reference genes in human myelomonocytic cells for gene expression studies in altered gravity.

Science.gov (United States)

Thiel, Cora S; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Unverdorben, Felix; Buttron, Isabell; Lauber, Beatrice; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E; Ullrich, Oliver

2015-01-01

Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes ("housekeeping genes") are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity.
Inferring gene networks from discrete expression data

KAUST Repository

Zhang, L.

2013-07-18

The modeling of gene networks from transcriptional expression data is an important tool in biomedical research to reveal signaling pathways and to identify treatment targets. Current gene network modeling is primarily based on the use of Gaussian graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which generate counts of mRNAtranscripts in cell samples.We propose a generalized linear model to fit the discrete gene expression data and assume that the log ratios of the mean expression levels follow a Gaussian distribution.We restrict the gene network structures to decomposable graphs and derive the graphs by selecting the covariance matrix of the Gaussian distribution with the hyper-inverse Wishart priors. Furthermore, we incorporate prior network models based on gene ontology information, which avails existing biological information on the genes of interest. We conduct simulation studies to examine the performance of our discrete graphical model and apply the method to two real datasets for gene network inference. © The Author 2013. Published by Oxford University Press. All rights reserved.
Reference Gene Screening for Analyzing Gene Expression Across Goat Tissue

Directory of Open Access Journals (Sweden)

Yu Zhang

2013-12-01

Full Text Available Real-time quantitative PCR (qRT-PCR is one of the important methods for investigating the changes in mRNA expression levels in cells and tissues. Selection of the proper reference genes is very important when calibrating the results of real-time quantitative PCR. Studies on the selection of reference genes in goat tissues are limited, despite the economic importance of their meat and dairy products. We used real-time quantitative PCR to detect the expression levels of eight reference gene candidates (18S, TBP, HMBS, YWHAZ, ACTB, HPRT1, GAPDH and EEF1A2 in ten tissues types sourced from Boer goats. The optimal reference gene combination was selected according to the results determined by geNorm, NormFinder and Bestkeeper software packages. The analyses showed that tissue is an important variability factor in genes expression stability. When all tissues were considered, 18S, TBP and HMBS is the optimal reference combination for calibrating quantitative PCR analysis of gene expression from goat tissues. Dividing data set by tissues, ACTB was the most stable in stomach, small intestine and ovary, 18S in heart and spleen, HMBS in uterus and lung, TBP in liver, HPRT1 in kidney and GAPDH in muscle. Overall, this study provided valuable information about the goat reference genes that can be used in order to perform a proper normalisation when relative quantification by qRT-PCR studies is undertaken.
Cloning, expression and characterization of COI1 gene (AsCOI1 from Aquilaria sinensis (Lour. Gilg

Directory of Open Access Journals (Sweden)

Yongcui Liao

2015-09-01

Full Text Available Aquilaria sinensis, a kind of typically wounding-induced medicinal plant with a great economical value, is widely used in the production of traditional Chinese medicine, perfume and incense. Coronatine-insensitive protein 1 (COI1 acts as a receptor in jasmonate (JA signaling pathway, and regulates the expression of JA-responsive genes in plant defense. However, little is known about the COI1 gene in A. sinensis. Here, based on the transcriptome data, a full-length cDNA sequence of COI1 (termed as AsCOI1 was firstly cloned by RT–PCR and rapid-amplification of cDNA ends (RACE strategies. AsCOI1 is 2330 bp in length (GenBank accession No. KM189194, and contains a complete open frame (ORF of 1839 bp. The deduced protein was composed of 612 amino acids, with a predicted molecular weight of 68.93 kDa and an isoelectric point of 6.56, and was predicted to possess F-box and LRRs domains. Combining bioinformatics prediction with subcellular localization experiment analysis, AsCOI1 was appeared to locate in nucleus. AsCOI1 gene was highly expressed in roots and stems, the major organs of agarwood formation. Methyl jasmonate (MeJA, mechanical wounding and heat stress could significantly induce the expression level of AsCOI1 gene. AsCOI1 is an early wound-responsive gene, and it likely plays some role in agarwood formation.
Differential cytokine gene expression according to outcome in a hamster model of leptospirosis.

Directory of Open Access Journals (Sweden)

Frédérique Vernel-Pauillac

Full Text Available BACKGROUND: Parameters predicting the evolution of leptospirosis would be useful for clinicians, as well as to better understand severe leptospirosis, but are scarce and rarely validated. Because severe leptospirosis includes septic shock, similarities with predictors evidenced for sepsis and septic shock were studied in a hamster model. METHODOLOGY/PRINCIPAL FINDINGS: Using an LD50 model of leptospirosis in hamsters, we first determined that 3 days post-infection was a time-point that allowed studying the regulation of immune gene expression and represented the onset of the clinical signs of the disease. In the absence of tools to assess serum concentrations of immune effectors in hamsters, we determined mRNA levels of various immune genes, especially cytokines, together with leptospiraemia at this particular time-point. We found differential expression of both pro- and anti-inflammatory mediators, with significantly higher expression levels of tumor necrosis factor alpha, interleukin 1alpha, cyclo-oxygenase 2 and interleukin 10 genes in nonsurvivors compared to survivors. Higher leptospiraemia was also observed in nonsurvivors. Lastly, we demonstrated the relevance of these results by comparing their respective expression levels using a LD100 model or an isogenic high-passage nonvirulent variant. CONCLUSIONS/SIGNIFICANCE: Up-regulated gene expression of both pro- and anti-inflammatory immune effectors in hamsters with fatal outcome in an LD50 model of leptospirosis, together with a higher Leptospira burden, suggest that these gene expression levels could be predictors of adverse outcome in leptospirosis.
Studying the Complex Expression Dependences between Sets of Coexpressed Genes

Directory of Open Access Journals (Sweden)

Mario Huerta

2014-01-01

Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.
Microarray-based analysis of differential gene expression between infective and noninfective larvae of Strongyloides stercoralis.

Directory of Open Access Journals (Sweden)

Roshan Ramanathan

2011-05-01

Full Text Available Differences between noninfective first-stage (L1 and infective third-stage (L3i larvae of parasitic nematode Strongyloides stercoralis at the molecular level are relatively uncharacterized. DNA microarrays were developed and utilized for this purpose.Oligonucleotide hybridization probes for the array were designed to bind 3,571 putative mRNA transcripts predicted by analysis of 11,335 expressed sequence tags (ESTs obtained as part of the Nematode EST project. RNA obtained from S. stercoralis L3i and L1 was co-hybridized to each array after labeling the individual samples with different fluorescent tags. Bioinformatic predictions of gene function were developed using a novel cDNA Annotation System software. We identified 935 differentially expressed genes (469 L3i-biased; 466 L1-biased having two-fold expression differences or greater and microarray signals with a p value<0.01. Based on a functional analysis, L1 larvae have a larger number of genes putatively involved in transcription (p = 0.004, and L3i larvae have biased expression of putative heat shock proteins (such as hsp-90. Genes with products known to be immunoreactive in S. stercoralis-infected humans (such as SsIR and NIE had L3i biased expression. Abundantly expressed L3i contigs of interest included S. stercoralis orthologs of cytochrome oxidase ucr 2.1 and hsp-90, which may be potential chemotherapeutic targets. The S. stercoralis ortholog of fatty acid and retinol binding protein-1, successfully used in a vaccine against Ancylostoma ceylanicum, was identified among the 25 most highly expressed L3i genes. The sperm-containing glycoprotein domain, utilized in a vaccine against the nematode Cooperia punctata, was exclusively found in L3i biased genes and may be a valuable S. stercoralis target of interest.A new DNA microarray tool for the examination of S. stercoralis biology has been developed and provides new and valuable insights regarding differences between infective and
Gene expression of the mismatch repair gene MSH2 in primary colorectal cancer

DEFF Research Database (Denmark)

Jensen, Lars Henrik; Kuramochi, Hidekazu; Crüger, Dorthe Gylling

2011-01-01

promoter was only detected in 14 samples and only at a low level with no correlation to gene expression. MSH2 gene expression was not a prognostic factor for overall survival in univariate or multivariate analysis. The gene expression of MSH2 is a potential quantitative marker ready for further clinical...
Association between gene expression profile of the primary tumor and chemotherapy response of metastatic breast cancer

NARCIS (Netherlands)

Savci-Heijink, Cemile Dilara; Halfwerk, Hans; Koster, Jan; van de Vijver, Marc Joan

2017-01-01

Background: To better predict the likelihood of response to chemotherapy, we have conducted a study comparing the gene expression patterns of primary tumours with their corresponding response to systemic chemotherapy in the metastatic setting. Methods: mRNA expression profiles of breast carcinomas
Sex hormones and gene expression signatures in peripheral blood from postmenopausal women - the NOWAC postgenome study

Directory of Open Access Journals (Sweden)

Rylander Charlotta

2011-03-01

Full Text Available Abstract Background Postmenopausal hormone therapy (HT influences endogenous hormone concentrations and increases the risk of breast cancer. Gene expression profiling may reveal the mechanisms behind this relationship. Our objective was to explore potential associations between sex hormones and gene expression in whole blood from a population-based, random sample of postmenopausal women Methods Gene expression, as measured by the Applied Biosystems microarray platform, was compared between hormone therapy (HT users and non-users and between high and low hormone plasma concentrations using both gene-wise analysis and gene set analysis. Gene sets found to be associated with HT use were further analysed for enrichment in functional clusters and network predictions. The gene expression matrix included 285 samples and 16185 probes and was adjusted for significant technical variables. Results Gene-wise analysis revealed several genes significantly associated with different types of HT use. The functional cluster analyses provided limited information on these genes. Gene set analysis revealed 22 gene sets that were enriched between high and low estradiol concentration (HT-users excluded. Among these were seven oestrogen related gene sets, including our gene list associated with systemic estradiol use, which thereby represents a novel oestrogen signature. Seven gene sets were related to immune response. Among the 15 gene sets enriched for progesterone, 11 overlapped with estradiol. No significant gene expression patterns were found for testosterone, follicle stimulating hormone (FSH or sex hormone binding globulin (SHBG. Conclusions Distinct gene expression patterns associated with sex hormones are detectable in a random group of postmenopausal women, as demonstrated by the finding of a novel oestrogen signature.
Analysis of antisense expression by whole genome tiling microarrays and siRNAs suggests mis-annotation of Arabidopsis orphan protein-coding genes.

Directory of Open Access Journals (Sweden)

Casey R Richardson

2010-05-01

Full Text Available MicroRNAs (miRNAs and trans-acting small-interfering RNAs (tasi-RNAs are small (20-22 nt long RNAs (smRNAs generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery.We explored rice (Oryza sativa sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis 'orphan' hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the "ancient" (deeply conserved class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for "new" rapidly-evolving MIRNA genes.Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other
Positive selection on gene expression in the human brain

DEFF Research Database (Denmark)

Khaitovich, Philipp; Tang, Kun; Franz, Henriette

2006-01-01

Recent work has shown that the expression levels of genes transcribed in the brains of humans and chimpanzees have changed less than those of genes transcribed in other tissues [1] . However, when gene expression changes are mapped onto the evolutionary lineage in which they occurred, the brain...... shows more changes than other tissues in the human lineage compared to the chimpanzee lineage [1] , [2] and [3] . There are two possible explanations for this: either positive selection drove more gene expression changes to fixation in the human brain than in the chimpanzee brain, or genes expressed...... in the brain experienced less purifying selection in humans than in chimpanzees, i.e. gene expression in the human brain is functionally less constrained. The first scenario would be supported if genes that changed their expression in the brain in the human lineage showed more selective sweeps than other genes...
Pathway analysis of gene signatures predicting metastasis of node-negative primary breast cancer

International Nuclear Information System (INIS)

Yu, Jack X; Sieuwerts, Anieta M; Zhang, Yi; Martens, John WM; Smid, Marcel; Klijn, Jan GM; Wang, Yixin; Foekens, John A

2007-01-01

Published prognostic gene signatures in breast cancer have few genes in common. Here we provide a rationale for this observation by studying the prognostic power and the underlying biological pathways of different gene signatures. Gene signatures to predict the development of metastases in estrogen receptor-positive and estrogen receptor-negative tumors were identified using 500 re-sampled training sets and mapping to Gene Ontology Biological Process to identify over-represented pathways. The Global Test program confirmed that gene expression profilings in the common pathways were associated with the metastasis of the patients. The apoptotic pathway and cell division, or cell growth regulation and G-protein coupled receptor signal transduction, were most significantly associated with the metastatic capability of estrogen receptor-positive or estrogen-negative tumors, respectively. A gene signature derived of the common pathways predicted metastasis in an independent cohort. Mapping of the pathways represented by different published prognostic signatures showed that they share 53% of the identified pathways. We show that divergent gene sets classifying patients for the same clinical endpoint represent similar biological processes and that pathway-derived signatures can be used to predict prognosis. Furthermore, our study reveals that the underlying biology related to aggressiveness of estrogen receptor subgroups of breast cancer is quite different
Gene expression differences between Noccaea caerulescens ecotypes help to identify candidate genes for metal phytoremediation.

Science.gov (United States)

Halimaa, Pauliina; Lin, Ya-Fen; Ahonen, Viivi H; Blande, Daniel; Clemens, Stephan; Gyenesei, Attila; Häikiö, Elina; Kärenlampi, Sirpa O; Laiho, Asta; Aarts, Mark G M; Pursiheimo, Juha-Pekka; Schat, Henk; Schmidt, Holger; Tuomainen, Marjo H; Tervahauta, Arja I

2014-03-18

Populations of Noccaea caerulescens show tremendous differences in their capacity to hyperaccumulate and hypertolerate metals. To explore the differences that could contribute to these traits, we undertook SOLiD high-throughput sequencing of the root transcriptomes of three phenotypically well-characterized N. caerulescens accessions, i.e., Ganges, La Calamine, and Monte Prinzera. Genes with possible contribution to zinc, cadmium, and nickel hyperaccumulation and hypertolerance were predicted. The most significant differences between the accessions were related to metal ion (di-, trivalent inorganic cation) transmembrane transporter activity, iron and calcium ion binding, (inorganic) anion transmembrane transporter activity, and antioxidant activity. Analysis of correlation between the expression profile of each gene and the metal-related characteristics of the accessions disclosed both previously characterized (HMA4, HMA3) and new candidate genes (e.g., for nickel IRT1, ZIP10, and PDF2.3) as possible contributors to the hyperaccumulation/tolerance phenotype. A number of unknown Noccaea-specific transcripts also showed correlation with Zn(2+), Cd(2+), or Ni(2+) hyperaccumulation/tolerance. This study shows that N. caerulescens populations have evolved great diversity in the expression of metal-related genes, facilitating adaptation to various metalliferous soils. The information will be helpful in the development of improved plants for metal phytoremediation.
A stochastic approach to multi-gene expression dynamics

International Nuclear Information System (INIS)

Ochiai, T.; Nacher, J.C.; Akutsu, T.

2005-01-01

In the last years, tens of thousands gene expression profiles for cells of several organisms have been monitored. Gene expression is a complex transcriptional process where mRNA molecules are translated into proteins, which control most of the cell functions. In this process, the correlation among genes is crucial to determine the specific functions of genes. Here, we propose a novel multi-dimensional stochastic approach to deal with the gene correlation phenomena. Interestingly, our stochastic framework suggests that the study of the gene correlation requires only one theoretical assumption-Markov property-and the experimental transition probability, which characterizes the gene correlation system. Finally, a gene expression experiment is proposed for future applications of the model

Assays for noninvasive imaging of reporter gene expression

International Nuclear Information System (INIS)

Gambhir, S.S.; Barrio, J.R.; Herschman, H.R.; Phelps, M.E.

1999-01-01

Repeated, noninvasive imaging of reporter gene expression is emerging as a valuable tool for monitoring the expression of genes in animals and humans. Monitoring of organ/cell transplantation in living animals and humans, and the assessment of environmental, behavioral, and pharmacologic modulation of gene expression in transgenic animals should soon be possible. The earliest clinical application is likely to be monitoring human gene therapy in tumors transduced with the herpes simplex virus type 1 thymidine kinase (HSV1-tk) suicide gene. Several candidate assays for imaging reporter gene expression have been studied, utilizing cytosine deaminase (CD), HSV1-tk, and dopamine 2 receptor (D2R) as reporter genes. For the HSV1-tk reporter gene, both uracil nucleoside derivatives (e.g., 5-iodo-2'-fluoro-2'-deoxy-1-β-D-arabinofuranosyl-5-iodouracil [FIAU] labeled with 124 I, 131 I ) and acycloguanosine derivatives {e.g., 8-[ 18 F]fluoro-9-[[2-hydroxy-1-(hydroxymethyl)ethoxy]methyl]guanine (8-[ 18 F]-fluoroganciclovir) ([ 18 F]FGCV), 9-[(3-[ 18 F]fluoro-1-hydroxy-2-propoxy)methyl]guanine ([ 18 F]FHPG)} have been investigated as reporter probes. For the D2R reporter gene, a derivative of spiperone {3-(2'-[ 18 F]-Fluoroethyl)spiperone ([ 18 F]FESP)} has been used with positron emission tomography (PET) imaging. In this review, the principles and specific assays for imaging reporter gene expression are presented and discussed. Specific examples utilizing adenoviral-mediated delivery of a reporter gene as well as tumors expressing reporter genes are discussed
PRAME gene expression profile in medulloblastoma

Directory of Open Access Journals (Sweden)

Tânia Maria Vulcani-Freitas

2011-02-01

Full Text Available Medulloblastoma is the most common malignant tumors of central nervous system in the childhood. The treatment is severe, harmful and, thus, has a dismal prognosis. As PRAME is present in various cancers, including meduloblastoma, and has limited expression in normal tissues, this antigen can be an ideal vaccine target for tumor immunotherapy. In order to find a potential molecular target, we investigated PRAME expression in medulloblastoma fragments and we compare the results with the clinical features of each patient. Analysis of gene expression was performed by real-time quantitative PCR from 37 tumor samples. The Mann-Whitney test was used to analysis the relationship between gene expression and clinical characteristics. Kaplan-Meier curves were used to evaluate survival. PRAME was overexpressed in 84% samples. But no statistical association was found between clinical features and PRAME overexpression. Despite that PRAME gene could be a strong candidate for immunotherapy since it is highly expressed in medulloblastomas.
An Individual-Based Diploid Model Predicts Limited Conditions Under Which Stochastic Gene Expression Becomes Advantageous

KAUST Repository

Matsumoto, Tomotaka; Mineta, Katsuhiko; Osada, Naoki; Araki, Hitoshi

2015-01-01

Recent studies suggest the existence of a stochasticity in gene expression (SGE) in many organisms, and its non-negligible effect on their phenotype and fitness. To date, however, how SGE affects the key parameters of population genetics
Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

Science.gov (United States)

Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

2017-09-01

The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative
Paired hormone response elements predict caveolin-1 as a glucocorticoid target gene.

Directory of Open Access Journals (Sweden)

Marinus F van Batenburg

2010-01-01

Full Text Available Glucocorticoids act in part via glucocorticoid receptor binding to hormone response elements (HREs, but their direct target genes in vivo are still largely unknown. We developed the criterion that genomic occurrence of paired HREs at an inter-HRE distance less than 200 bp predicts hormone responsiveness, based on synergy of multiple HREs, and HRE information from known target genes. This criterion predicts a substantial number of novel responsive genes, when applied to genomic regions 10 kb upstream of genes. Multiple-tissue in situ hybridization showed that mRNA expression of 6 out of 10 selected genes was induced in a tissue-specific manner in mice treated with a single dose of corticosterone, with the spleen being the most responsive organ. Caveolin-1 was strongly responsive in several organs, and the HRE pair in its upstream region showed increased occupancy by glucocorticoid receptor in response to corticosterone. Our approach allowed for discovery of novel tissue specific glucocorticoid target genes, which may exemplify responses underlying the permissive actions of glucocorticoids.
Expression profiling to predict the clinical behaviour of ovarian cancer fails independent evaluation

International Nuclear Information System (INIS)

Gevaert, Olivier; De Smet, Frank; Van Gorp, Toon; Pochet, Nathalie; Engelen, Kristof; Amant, Frederic; De Moor, Bart; Timmerman, Dirk; Vergote, Ignace

2008-01-01

In a previously published pilot study we explored the performance of microarrays in predicting clinical behaviour of ovarian tumours. For this purpose we performed microarray analysis on 20 patients and estimated that we could predict advanced stage disease with 100% accuracy and the response to platin-based chemotherapy with 76.92% accuracy using leave-one-out cross validation techniques in combination with Least Squares Support Vector Machines (LS-SVMs). In the current study we evaluate whether tumour characteristics in an independent set of 49 patients can be predicted using the pilot data set with principal component analysis or LS-SVMs. The results of the principal component analysis suggest that the gene expression data from stage I, platin-sensitive advanced stage and platin-resistant advanced stage tumours in the independent data set did not correspond to their respective classes in the pilot study. Additionally, LS-SVM models built using the data from the pilot study – although they only misclassified one of four stage I tumours and correctly classified all 45 advanced stage tumours – were not able to predict resistance to platin-based chemotherapy. Furthermore, models based on the pilot data and on previously published gene sets related to ovarian cancer outcomes, did not perform significantly better than our models. We discuss possible reasons for failure of the model for predicting response to platin-based chemotherapy and conclude that existing results based on gene expression patterns of ovarian tumours need to be thoroughly scrutinized before these results can be accepted to reflect the true performance of microarray technology
mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

Directory of Open Access Journals (Sweden)

Hala Alshamlan

2015-01-01

Full Text Available An artificial bee colony (ABC is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR, and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO. The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.
mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

Science.gov (United States)

Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

2015-01-01

An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.
SIGNATURE: A workbench for gene expression signature analysis

Directory of Open Access Journals (Sweden)

Chang Jeffrey T

2011-11-01

Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.
Mining gene expression data of multiple sclerosis.

Directory of Open Access Journals (Sweden)

Pi Guo

Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.
Bayesian mixture models for assessment of gene differential behaviour and prediction of pCR through the integration of copy number and gene expression data.

Directory of Open Access Journals (Sweden)

Filippo Trentini

Full Text Available We consider modeling jointly microarray RNA expression and DNA copy number data. We propose Bayesian mixture models that define latent Gaussian probit scores for the DNA and RNA, and integrate between the two platforms via a regression of the RNA probit scores on the DNA probit scores. Such a regression conveniently allows us to include additional sample specific covariates such as biological conditions and clinical outcomes. The two developed methods are aimed respectively to make inference on differential behaviour of genes in patients showing different subtypes of breast cancer and to predict the pathological complete response (pCR of patients borrowing strength across the genomic platforms. Posterior inference is carried out via MCMC simulations. We demonstrate the proposed methodology using a published data set consisting of 121 breast cancer patients.
Systematic analysis of gene expression pattern in has-miR-197 over-expressed human uterine leiomyoma cells.

Science.gov (United States)

Ling, Jing; Wu, Xiaoli; Fu, Ziyi; Tan, Jie; Xu, Qing

2015-10-01

, FBLN2, C10orf35, HOXD12, CACNG7, and LOC100134279. Our study explored gene expression patterns after miR-197 overexpression and confirmed 17 dominantly dys-regulated genes, which could expand the insights into the function of miR-197 and the molecular mechanisms during the development and progression of uterine leiomyomas. This study might afford new clues for understanding the pathogenesis of uterine leiomyomas, and it could likely provide a unique method for diagnosing or predicting prognosis in the clinical treatment of leiomyoma. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

DEFF Research Database (Denmark)

Thomassen, Mads; Tan, Qihua; Kruse, Torben

2008-01-01

ABSTRACT: BACKGROUND: Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent stud...
Plasticity-Related Gene Expression During Eszopiclone-Induced Sleep.

Science.gov (United States)

Gerashchenko, Dmitry; Pasumarthi, Ravi K; Kilduff, Thomas S

2017-07-01

Experimental evidence suggests that restorative processes depend on synaptic plasticity changes in the brain during sleep. We used the expression of plasticity-related genes to assess synaptic plasticity changes during drug-induced sleep. We first characterized sleep induced by eszopiclone in mice during baseline conditions and during the recovery from sleep deprivation. We then compared the expression of 18 genes and two miRNAs critically involved in synaptic plasticity in these mice. Gene expression was assessed in the cerebral cortex and hippocampus by the TaqMan reverse transcription polymerase chain reaction and correlated with sleep parameters. Eszopiclone reduced the latency to nonrapid eye movement (NREM) sleep and increased NREM sleep amounts. Eszopiclone had no effect on slow wave activity (SWA) during baseline conditions but reduced the SWA increase during recovery sleep (RS) after sleep deprivation. Gene expression analyses revealed three distinct patterns: (1) four genes had higher expression either in the cortex or hippocampus in the group of mice with increased amounts of wakefulness; (2) a large proportion of plasticity-related genes (7 out of 18 genes) had higher expression during RS in the cortex but not in the hippocampus; and (3) six genes and the two miRNAs showed no significant changes across conditions. Even at a relatively high dose (20 mg/kg), eszopiclone did not reduce the expression of plasticity-related genes during RS period in the cortex. These results indicate that gene expression associated with synaptic plasticity occurs in the cortex in the presence of a hypnotic medication. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Evaluation of suitable reference genes for gene expression studies in bovine muscular tissue

Directory of Open Access Journals (Sweden)

Dunner Susana

2008-09-01

Full Text Available Abstract Background Real-time reverse transcriptase quantitative polymerase chain reaction (real-time RTqPCR is a technique used to measure mRNA species copy number as a way to determine key genes involved in different biological processes. However, the expression level of these key genes may vary among tissues or cells not only as a consequence of differential expression but also due to different factors, including choice of reference genes to normalize the expression levels of the target genes; thus the selection of reference genes is critical for expression studies. For this purpose, ten candidate reference genes were investigated in bovine muscular tissue. Results The value of stability of ten candidate reference genes included in three groups was estimated: the so called 'classical housekeeping' genes (18S, GAPDH and ACTB, a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS and a third set of novel genes (SF3A1, EEF1A2 and CASC3. Three different statistical algorithms were used to rank the genes by their stability measures as produced by geNorm, NormFinder and Bestkeeper. The three methods tend to agree on the most stably expressed genes and the least in muscular tissue. EEF1A2 and HMBS followed by SF3A1, ACTB, and CASC3 can be considered as stable reference genes, and B2M, RPII, UBC and GAPDH would not be appropriate. Although the rRNA-18S stability measure seems to be within the range of acceptance, its use is not recommended because its synthesis regulation is not representative of mRNA levels. Conclusion Based on geNorm algorithm, we propose the use of three genes SF3A1, EEF1A2 and HMBS as references for normalization of real-time RTqPCR in muscle expression studies.
Expression profiling identifies genes involved in emphysema severity

Directory of Open Access Journals (Sweden)

Bowman Rayleen V

2009-09-01

Full Text Available Abstract Chronic obstructive pulmonary disease (COPD is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p p Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.
Decoupling Linear and Nonlinear Associations of Gene Expression

KAUST Repository

Itakura, Alan

2013-05-01

The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.
Decoupling Linear and Nonlinear Associations of Gene Expression

KAUST Repository

Itakura, Alan

2013-01-01

The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.
The effects of MicroRNA transfections on global patterns of gene expression in ovarian cancer cells are functionally coordinated

Directory of Open Access Journals (Sweden)

Shahab Shubin W

2012-08-01

Full Text Available Abstract Background MicroRNAs (miRNAs are a class of small RNAs that have been linked to a number of diseases including cancer. The potential application of miRNAs in the diagnostics and therapeutics of ovarian and other cancers is an area of intense interest. A current challenge is the inability to accurately predict the functional consequences of exogenous modulations in the levels of potentially therapeutic miRNAs. Methods In an initial effort to systematically address this issue, we conducted miRNA transfection experiments using two miRNAs (miR-7, miR-128. We monitored the consequent changes in global patterns of gene expression by microarray and quantitative (real-time polymerase chain reaction. Network analysis of the expression data was used to predict the consequence of each transfection on cellular function and these predictions were experimentally tested. Results While ~20% of the changes in expression patterns of hundreds to thousands of genes could be attributed to direct miRNA-mRNA interactions, the majority of the changes are indirect, involving the downstream consequences of miRNA-mediated changes in regulatory gene expression. The changes in gene expression induced by individual miRNAs are functionally coordinated but distinct between the two miRNAs. MiR-7 transfection into ovarian cancer cells induces changes in cell adhesion and other developmental networks previously associated with epithelial-mesenchymal transitions (EMT and other processes linked with metastasis. In contrast, miR-128 transfection induces changes in cell cycle control and other processes commonly linked with cellular replication. Conclusions The functionally coordinated patterns of gene expression displayed by different families of miRNAs have the potential to provide clinicians with a strategy to treat cancers from a systems rather than a single gene perspective.
Gene expression patterns associated with p53 status in breast cancer

International Nuclear Information System (INIS)

Troester, Melissa A; Herschkowitz, Jason I; Oh, Daniel S; He, Xiaping; Hoadley, Katherine A; Barbier, Claire S; Perou, Charles M

2006-01-01

Breast cancer subtypes identified in genomic studies have different underlying genetic defects. Mutations in the tumor suppressor p53 occur more frequently in estrogen receptor (ER) negative, basal-like and HER2-amplified tumors than in luminal, ER positive tumors. Thus, because p53 mutation status is tightly linked to other characteristics of prognostic importance, it is difficult to identify p53's independent prognostic effects. The relation between p53 status and subtype can be better studied by combining data from primary tumors with data from isogenic cell line pairs (with and without p53 function). The p53-dependent gene expression signatures of four cell lines (MCF-7, ZR-75-1, and two immortalized human mammary epithelial cell lines) were identified by comparing p53-RNAi transduced cell lines to their parent cell lines. Cell lines were treated with vehicle only or doxorubicin to identify p53 responses in both non-induced and induced states. The cell line signatures were compared with p53-mutation associated genes in breast tumors. Each cell line displayed distinct patterns of p53-dependent gene expression, but cell type specific (basal vs. luminal) commonalities were evident. Further, a common gene expression signature associated with p53 loss across all four cell lines was identified. This signature showed overlap with the signature of p53 loss/mutation status in primary breast tumors. Moreover, the common cell-line tumor signature excluded genes that were breast cancer subtype-associated, but not downstream of p53. To validate the biological relevance of the common signature, we demonstrated that this gene set predicted relapse-free, disease-specific, and overall survival in independent test data. In the presence of breast cancer heterogeneity, experimental and biologically-based methods for assessing gene expression in relation to p53 status provide prognostic and biologically-relevant gene lists. Our biologically-based refinements excluded genes

Bayesian assignment of gene ontology terms to gene expression experiments

Science.gov (United States)

Sykacek, P.

2012-01-01

Motivation: Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. Results: This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Availability: Source code under GPL license is available from the author. Contact: peter.sykacek@boku.ac.at PMID:22962488
Bayesian assignment of gene ontology terms to gene expression experiments.

Science.gov (United States)

Sykacek, P

2012-09-15

Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Source code under GPL license is available from the author. peter.sykacek@boku.ac.at.
Genome-Wide Identification of the Alba Gene Family in Plants and Stress-Responsive Expression of the Rice Alba Genes.

Science.gov (United States)

Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan

2018-03-28

Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa , Zea mays , Sorghum bicolor , Cicer arietinum , and Vitis vinifera , and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii , Physcomitrella patens , and Amborella trichopoda , revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice ( OsAlba ), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
Gene expression profile data for mouse facial development

Directory of Open Access Journals (Sweden)

Sonia M. Leach

2017-08-01

Full Text Available This article contains data related to the research articles "Spatial and Temporal Analysis of Gene Expression during Growth and Fusion of the Mouse Facial Prominences" (Feng et al., 2009 [1] and “Systems Biology of facial development: contributions of ectoderm and mesenchyme” (Hooper et al., 2017 In press [2]. Embryonic mammalian craniofacial development is a complex process involving the growth, morphogenesis, and fusion of distinct facial prominences into a functional whole. Aberrant gene regulation during this process can lead to severe craniofacial birth defects, including orofacial clefting. As a means to understand the genes involved in facial development, we had previously dissected the embryonic mouse face into distinct prominences: the mandibular, maxillary or nasal between E10.5 and E12.5. The prominences were then processed intact, or separated into ectoderm and mesenchyme layers, prior analysis of RNA expression using microarrays (Feng et al., 2009, Hooper et al., 2017 in press [1,2]. Here, individual gene expression profiles have been built from these datasets that illustrate the timing of gene expression in whole prominences or in the separated tissue layers. The data profiles are presented as an indexed and clickable list of the genes each linked to a graphical image of that gene׳s expression profile in the ectoderm, mesenchyme, or intact prominence. These data files will enable investigators to obtain a rapid assessment of the relative expression level of any gene on the array with respect to time, tissue, prominence, and expression trajectory.
Integrated olfactory receptor and microarray gene expression databases

Directory of Open Access Journals (Sweden)

Crasto Chiquito J

2007-06-01

Full Text Available Abstract Background Gene expression patterns of olfactory receptors (ORs are an important component of the signal encoding mechanism in the olfactory system since they determine the interactions between odorant ligands and sensory neurons. We have developed the Olfactory Receptor Microarray Database (ORMD to house OR gene expression data. ORMD is integrated with the Olfactory Receptor Database (ORDB, which is a key repository of OR gene information. Both databases aim to aid experimental research related to olfaction. Description ORMD is a Web-accessible database that provides a secure data repository for OR microarray experiments. It contains both publicly available and private data; accessing the latter requires authenticated login. The ORMD is designed to allow users to not only deposit gene expression data but also manage their projects/experiments. For example, contributors can choose whether to make their datasets public. For each experiment, users can download the raw data files and view and export the gene expression data. For each OR gene being probed in a microarray experiment, a hyperlink to that gene in ORDB provides access to genomic and proteomic information related to the corresponding olfactory receptor. Individual ORs archived in ORDB are also linked to ORMD, allowing users access to the related microarray gene expression data. Conclusion ORMD serves as a data repository and project management system. It facilitates the study of microarray experiments of gene expression in the olfactory system. In conjunction with ORDB, ORMD integrates gene expression data with the genomic and functional data of ORs, and is thus a useful resource for both olfactory researchers and the public.
Gene expression analysis of flax seed development

Science.gov (United States)

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise
Gene expression analysis of flax seed development

Directory of Open Access Journals (Sweden)

Sharpe Andrew

2011-04-01

Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid
Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

Science.gov (United States)

Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

2013-12-01

MAPK signal transduction modules play crucial roles in regulating many biological processes in plants, which are composed of three classes of hierarchically organized protein kinases, namely MAPKKKs, MAPKKs, and MAPKs. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and rootstock-scion interaction process. Moreover, MAPK and MAPKK genes were performed expression profile analyses in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple. © 2013.
Two pheromone precursor genes are transcriptionally expressed in the homothallic ascomycete Sordaria macrospora.

Science.gov (United States)

Pöggeler, S

2000-06-01

In order to analyze the involvement of pheromones in cell recognition and mating in a homothallic fungus, two putative pheromone precursor genes, named ppg1 and ppg2, were isolated from a genomic library of Sordaria macrospora. The ppg1 gene is predicted to encode a precursor pheromone that is processed by a Kex2-like protease to yield a pheromone that is structurally similar to the alpha-factor of the yeast Saccharomyces cerevisiae. The ppg2 gene encodes a 24-amino-acid polypeptide that contains a putative farnesylated and carboxy methylated C-terminal cysteine residue. The sequences of the predicted pheromones display strong structural similarity to those encoded by putative pheromones of heterothallic filamentous ascomycetes. Both genes are expressed during the life cycle of S. macrospora. This is the first description of pheromone precursor genes encoded by a homothallic fungus. Southern-hybridization experiments indicated that ppg1 and ppg2 homologues are also present in other homothallic ascomycetes.
EBF factors drive expression of multiple classes of target genes governing neuronal development.

Science.gov (United States)

Green, Yangsook S; Vetter, Monica L

2011-04-30

Early B cell factor (EBF) family members are transcription factors known to have important roles in several aspects of vertebrate neurogenesis, including commitment, migration and differentiation. Knowledge of how EBF family members contribute to neurogenesis is limited by a lack of detailed understanding of genes that are transcriptionally regulated by these factors. We performed a microarray screen in Xenopus animal caps to search for targets of EBF transcriptional activity, and identified candidate targets with multiple roles, including transcription factors of several classes. We determined that, among the most upregulated candidate genes with expected neuronal functions, most require EBF activity for some or all of their expression, and most have overlapping expression with ebf genes. We also found that the candidate target genes that had the most strongly overlapping expression patterns with ebf genes were predicted to be direct transcriptional targets of EBF transcriptional activity. The identification of candidate targets that are transcription factor genes, including nscl-1, emx1 and aml1, improves our understanding of how EBF proteins participate in the hierarchy of transcription control during neuronal development, and suggests novel mechanisms by which EBF activity promotes migration and differentiation. Other candidate targets, including pcdh8 and kcnk5, expand our knowledge of the types of terminal differentiated neuronal functions that EBF proteins regulate.
Correlation of in vitro lymphocyte radiosensitivity and gene expression with late normal tissue reactions following curative radiotherapy for breast cancer

International Nuclear Information System (INIS)

Finnon, Paul; Kabacik, Sylwia; MacKay, Alan; Raffy, Claudine; A’Hern, Roger; Owen, Roger; Badie, Christophe; Yarnold, John; Bouffler, Simon

2012-01-01

Background and purpose: Identification of mechanisms of late normal tissue responses to curative radiotherapy that discriminate individuals with marked or mild responses would aid response prediction. This study aimed to identify differences in gene expression, apoptosis, residual DNA double strand breaks and chromosomal damage after in vitro irradiation of lymphocytes in a series of patients with marked (31 cases) or mild (28 controls) late adverse reaction to adjuvant breast radiotherapy. Materials and methods: Gene expression arrays, residual γH2AX, apoptosis, G2 chromosomal radiosensitivity and G0 micronucleus assay were used to compare case and control lymphocyte radiation responses. Results: Five hundred and thirty genes were up-regulated and 819 down-regulated by ionising radiation. Irradiated samples were identified with an overall cross-validated error rate of 3.4%. Prediction analyses to classify cases and controls using unirradiated (0 Gy), irradiated (4 Gy) or radiation response (4–0 Gy) expression profiles correctly identified samples with, respectively, 25%, 22% or 18.5% error rates. Significant inter-sample variation was observed for all cellular endpoints but cases and controls could not be distinguished. Conclusions: Variation in lymphocyte radiosensitivity does not necessarily correlate with normal tissue response to radiotherapy. Gene expression analysis can predict of radiation exposure and may in the future help prediction of normal tissue radiosensitivity.
Supplementary Material for: Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

KAUST Repository

Horiuchi, Youko; Harushima, Yoshiaki; Fujisawa, Hironori; Mochizuki, Takako; Fujita, Masahiro; Ohyanagi, Hajime; Kurata, Nori

2015-01-01

Abstract Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis
Computational prediction of CTCF/cohesin-based intra-TAD loops that insulate chromatin contacts and gene expression in mouse liver.

Science.gov (United States)

Matthews, Bryan J; Waxman, David J

2018-05-14

CTCF and cohesin are key drivers of 3D-nuclear organization, anchoring the megabase-scale Topologically Associating Domains (TADs) that segment the genome. Here, we present and validate a computational method to predict cohesin-and-CTCF binding sites that form intra-TAD DNA loops. The intra-TAD loop anchors identified are structurally indistinguishable from TAD anchors regarding binding partners, sequence conservation, and resistance to cohesin knockdown; further, the intra-TAD loops retain key functional features of TADs, including chromatin contact insulation, blockage of repressive histone mark spread, and ubiquity across tissues. We propose that intra-TAD loops form by the same loop extrusion mechanism as the larger TAD loops, and that their shorter length enables finer regulatory control in restricting enhancer-promoter interactions, which enables selective, high-level expression of gene targets of super-enhancers and genes located within repressive nuclear compartments. These findings elucidate the role of intra-TAD cohesin-and-CTCF binding in nuclear organization associated with widespread insulation of distal enhancer activity. © 2018, Matthews et al.
Effect of chemical mutagens and carcinogens on gene expression profiles in human TK6 cells.

Directory of Open Access Journals (Sweden)

Lode Godderis

Full Text Available Characterization of toxicogenomic signatures of carcinogen exposure holds significant promise for mechanistic and predictive toxicology. In vitro transcriptomic studies allow the comparison of the response to chemicals with diverse mode of actions under controlled experimental conditions. We conducted an in vitro study in TK6 cells to characterize gene expression signatures of exposure to 15 genotoxic carcinogens frequently used in European industries. We also examined the dose-responsive changes in gene expression, and perturbation of biochemical pathways in response to these carcinogens. TK6 cells were exposed at 3 dose levels for 24 h with and without S9 human metabolic mix. Since S9 had an impact on gene expression (885 genes, we analyzed the gene expression data from cells cultures incubated with S9 and without S9 independently. The ribosome pathway was affected by all chemical-dose combinations. However in general, no similar gene expression was observed among carcinogens. Further, pathways, i.e. cell cycle, DNA repair mechanisms, RNA degradation, that were common within sets of chemical-dose combination were suggested by clustergram. Linear trends in dose-response of gene expression were observed for Trichloroethylene, Benz[a]anthracene, Epichlorohydrin, Benzene, and Hydroquinone. The significantly altered genes were involved in the regulation of (anti- apoptosis, maintenance of cell survival, tumor necrosis factor-related pathways and immune response, in agreement with several other studies. Similarly in S9+ cultures, Benz[a]pyrene, Styrene and Trichloroethylene each modified over 1000 genes at high concentrations. Our findings expand our understanding of the transcriptomic response to genotoxic carcinogens, revealing the alteration of diverse sets of genes and pathways involved in cellular homeostasis and cell cycle control.
Identification of suitable reference genes for gene expression studies of shoulder instability.

Directory of Open Access Journals (Sweden)

Mariana Ferreira Leal

Full Text Available Shoulder instability is a common shoulder injury, and patients present with plastic deformation of the glenohumeral capsule. Gene expression analysis may be a useful tool for increasing the general understanding of capsule deformation, and reverse-transcription quantitative polymerase chain reaction (RT-qPCR has become an effective method for such studies. Although RT-qPCR is highly sensitive and specific, it requires the use of suitable reference genes for data normalization to guarantee meaningful and reproducible results. In the present study, we evaluated the suitability of a set of reference genes using samples from the glenohumeral capsules of individuals with and without shoulder instability. We analyzed the expression of six commonly used reference genes (ACTB, B2M, GAPDH, HPRT1, TBP and TFRC in the antero-inferior, antero-superior and posterior portions of the glenohumeral capsules of cases and controls. The stability of the candidate reference gene expression was determined using four software packages: NormFinder, geNorm, BestKeeper and DataAssist. Overall, HPRT1 was the best single reference gene, and HPRT1 and B2M composed the best pair of reference genes from different analysis groups, including simultaneous analysis of all tissue samples. GenEx software was used to identify the optimal number of reference genes to be used for normalization and demonstrated that the accumulated standard deviation resulting from the use of 2 reference genes was similar to that resulting from the use of 3 or more reference genes. To identify the optimal combination of reference genes, we evaluated the expression of COL1A1. Although the use of different reference gene combinations yielded variable normalized quantities, the relative quantities within sample groups were similar and confirmed that no obvious differences were observed when using 2, 3 or 4 reference genes. Consequently, the use of 2 stable reference genes for normalization, especially
Naringenin Regulates Expression of Genes Involved in Cell Wall Synthesis in Herbaspirillum seropedicae▿

Science.gov (United States)

Tadra-Sfeir, M. Z.; Souza, E. M.; Faoro, H.; Müller-Santos, M.; Baura, V. A.; Tuleski, T. R.; Rigo, L. U.; Yates, M. G.; Wassem, R.; Pedrosa, F. O.; Monteiro, R. A.

2011-01-01

Five thousand mutants of Herbaspirillum seropedicae SmR1 carrying random insertions of transposon pTnMod-OGmKmlacZ were screened for differential expression of LacZ in the presence of naringenin. Among the 16 mutants whose expression was regulated by naringenin were genes predicted to be involved in the synthesis of exopolysaccharides, lipopolysaccharides, and auxin. These loci are probably involved in establishing interactions with host plants. PMID:21257805
Naringenin regulates expression of genes involved in cell wall synthesis in Herbaspirillum seropedicae.

Science.gov (United States)

Tadra-Sfeir, M Z; Souza, E M; Faoro, H; Müller-Santos, M; Baura, V A; Tuleski, T R; Rigo, L U; Yates, M G; Wassem, R; Pedrosa, F O; Monteiro, R A

2011-03-01

Five thousand mutants of Herbaspirillum seropedicae SmR1 carrying random insertions of transposon pTnMod-OGmKmlacZ were screened for differential expression of LacZ in the presence of naringenin. Among the 16 mutants whose expression was regulated by naringenin were genes predicted to be involved in the synthesis of exopolysaccharides, lipopolysaccharides, and auxin. These loci are probably involved in establishing interactions with host plants.
Concordance of gene expression in human protein complexes reveals tissue specificity and pathology

DEFF Research Database (Denmark)

Börnigen, Daniela; Pers, Tune Hannes; Thorrez, Lieven

2013-01-01

Disease-causing variants in human genes usually lead to phenotypes specific to only a few tissues. Here, we present a method for predicting tissue specificity based on quantitative deregulation of protein complexes. The underlying assumption is that the degree of coordinated expression among prot...
Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process

International Nuclear Information System (INIS)

Chandran, Uma R; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A

2007-01-01

Prostate cancer is characterized by heterogeneity in the clinical course that often does not correlate with morphologic features of the tumor. Metastasis reflects the most adverse outcome of prostate cancer, and to date there are no reliable morphologic features or serum biomarkers that can reliably predict which patients are at higher risk of developing metastatic disease. Understanding the differences in the biology of metastatic and organ confined primary tumors is essential for developing new prognostic markers and therapeutic targets. Using Affymetrix oligonucleotide arrays, we analyzed gene expression profiles of 24 androgen-ablation resistant metastatic samples obtained from 4 patients and a previously published dataset of 64 primary prostate tumor samples. Differential gene expression was analyzed after removing potentially uninformative stromal genes, addressing the differences in cellular content between primary and metastatic tumors. The metastatic samples are highly heterogenous in expression; however, differential expression analysis shows that 415 genes are upregulated and 364 genes are downregulated at least 2 fold in every patient with metastasis. The expression profile of metastatic samples reveals changes in expression of a unique set of genes representing both the androgen ablation related pathways and other metastasis related gene networks such as cell adhesion, bone remodelling and cell cycle. The differentially expressed genes include metabolic enzymes, transcription factors such as Forkhead Box M1 (FoxM1) and cell adhesion molecules such as Osteopontin (SPP1). We hypothesize that these genes have a role in the biology of metastatic disease and that they represent potential therapeutic targets for prostate cancer
Relative codon adaptation: a generic codon bias index for prediction of gene expression.

Science.gov (United States)

Fox, Jesse M; Erill, Ivan

2010-06-01

The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.

Gene expression results in lipopolysaccharide-stimulated monocytes depend significantly on the choice of reference genes

Directory of Open Access Journals (Sweden)

Øvstebø Reidun

2010-05-01

Full Text Available Abstract Background Gene expression in lipopolysaccharide (LPS-stimulated monocytes is mainly studied by quantitative real-time reverse transcription PCR (RT-qPCR using GAPDH (glyceraldehyde 3-phosphate dehydrogenase or ACTB (beta-actin as reference gene for normalization. Expression of traditional reference genes has been shown to vary substantially under certain conditions leading to invalid results. To investigate whether traditional reference genes are stably expressed in LPS-stimulated monocytes or if RT-qPCR results are dependent on the choice of reference genes, we have assessed and evaluated gene expression stability of twelve candidate reference genes in this model system. Results Twelve candidate reference genes were quantified by RT-qPCR in LPS-stimulated, human monocytes and evaluated using the programs geNorm, Normfinder and BestKeeper. geNorm ranked PPIB (cyclophilin B, B2M (beta-2-microglobulin and PPIA (cyclophilin A as the best combination for gene expression normalization in LPS-stimulated monocytes. Normfinder suggested TBP (TATA-box binding protein and B2M as the best combination. Compared to these combinations, normalization using GAPDH alone resulted in significantly higher changes of TNF-α (tumor necrosis factor-alpha and IL10 (interleukin 10 expression. Moreover, a significant difference in TNF-α expression between monocytes stimulated with equimolar concentrations of LPS from N. meningitides and E. coli, respectively, was identified when using the suggested combinations of reference genes for normalization, but stayed unrecognized when employing a single reference gene, ACTB or GAPDH. Conclusions Gene expression levels in LPS-stimulated monocytes based on RT-qPCR results differ significantly when normalized to a single gene or a combination of stably expressed reference genes. Proper evaluation of reference gene stabiliy is therefore mandatory before reporting RT-qPCR results in LPS-stimulated monocytes.
Differentially expressed genes in iron-induced prion protein conversion

International Nuclear Information System (INIS)

Kim, Minsun; Kim, Eun-hee; Choi, Bo-Ran; Woo, Hee-Jong

2016-01-01

The conversion of the cellular prion protein (PrP C ) to the protease-resistant isoform is the key event in chronic neurodegenerative diseases, including transmissible spongiform encephalopathies (TSEs). Increased iron in prion-related disease has been observed due to the prion protein-ferritin complex. Additionally, the accumulation and conversion of recombinant PrP (rPrP) is specifically derived from Fe(III) but not Fe(II). Fe(III)-mediated PK-resistant PrP (PrP res ) conversion occurs within a complex cellular environment rather than via direct contact between rPrP and Fe(III). In this study, differentially expressed genes correlated with prion degeneration by Fe(III) were identified using Affymetrix microarrays. Following Fe(III) treatment, 97 genes were differentially expressed, including 85 upregulated genes and 12 downregulated genes (≥1.5-fold change in expression). However, Fe(II) treatment produced moderate alterations in gene expression without inducing dramatic alterations in gene expression profiles. Moreover, functional grouping of identified genes indicated that the differentially regulated genes were highly associated with cell growth, cell maintenance, and intra- and extracellular transport. These findings showed that Fe(III) may influence the expression of genes involved in PrP folding by redox mechanisms. The identification of genes with altered expression patterns in neural cells may provide insights into PrP conversion mechanisms during the development and progression of prion-related diseases. - Highlights: • Differential genes correlated with prion degeneration by Fe(III) were identified. • Genes were identified in cell proliferation and intra- and extracellular transport. • In PrP degeneration, redox related genes were suggested. • Cbr2, Rsad2, Slc40a1, Amph and Mvd were expressed significantly.
Regulation of meiotic gene expression in plants

Directory of Open Access Journals (Sweden)

Adele eZhou

2014-08-01

Full Text Available With the recent advances in genomics and sequencing technologies, databases of transcriptomes representing many cellular processes have been built. Meiotic transcriptomes in plants have been studied in Arabidopsis thaliana, rice (Oryza sativa, wheat (Triticum aestivum, petunia (Petunia hybrida, sunflower (Helianthus annuus, and maize (Zea mays. Studies in all organisms, but particularly in plants, indicate that a very large number of genes are expressed during meiosis, though relatively few of them seem to be required for the completion of meiosis. In this review, we focus on gene expression at the RNA level and analyze the meiotic transcriptome datasets and explore expression patterns of known meiotic genes to elucidate how gene expression could be regulated during meiosis. We also discuss mechanisms, such as chromatin organization and non-coding RNAs, that might be involved in the regulation of meiotic transcription patterns.
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

Directory of Open Access Journals (Sweden)

Tintle Nathan L

2012-08-01

Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Construction of a novel multi-gene assay (42-gene classifier) for prediction of late recurrence in ER-positive breast cancer patients.

Science.gov (United States)

Tsunashima, Ryo; Naoi, Yasuto; Shimazu, Kenzo; Kagara, Naofumi; Shimoda, Masashi; Tanei, Tomonori; Miyake, Tomohiro; Kim, Seung Jin; Noguchi, Shinzaburo

2018-05-04

Prediction models for late (> 5 years) recurrence in ER-positive breast cancer need to be developed for the accurate selection of patients for extended hormonal therapy. We attempted to develop such a prediction model focusing on the differences in gene expression between breast cancers with early and late recurrence. For the training set, 779 ER-positive breast cancers treated with tamoxifen alone for 5 years were selected from the databases (GSE6532, GSE12093, GSE17705, and GSE26971). For the validation set, 221 ER-positive breast cancers treated with adjuvant hormonal therapy for 5 years with or without chemotherapy at our hospital were included. Gene expression was assayed by DNA microarray analysis (Affymetrix U133 plus 2.0). With the 42 genes differentially expressed in early and late recurrence breast cancers in the training set, a prediction model (42GC) for late recurrence was constructed. The patients classified by 42GC into the late recurrence-like group showed a significantly (P = 0.006) higher late recurrence rate as expected but a significantly (P = 1.62 × E-13) lower rate for early recurrence than non-late recurrence-like group. These observations were confirmed for the validation set, i.e., P = 0.020 for late recurrence and P = 5.70 × E-5 for early recurrence. We developed a unique prediction model (42GC) for late recurrence by focusing on the biological differences between breast cancers with early and late recurrence. Interestingly, patients in the late recurrence-like group by 42GC were at low risk for early recurrence.
Temporal gene expression variation associated with eyespot size plasticity in Bicyclus anynana.

Directory of Open Access Journals (Sweden)

Jeffrey C Oliver

Full Text Available Seasonal polyphenism demonstrates an organism's ability to respond to predictable environmental variation with alternative phenotypes, each presumably better suited to its respective environment. However, the molecular mechanisms linking environmental variation to alternative phenotypes via shifts in development remain relatively unknown. Here we investigate temporal gene expression variation in the seasonally polyphenic butterfly Bicyclus anynana. This species shows drastic changes in eyespot size depending on the temperature experienced during larval development. The wet season form (larvae reared over 24°C has large ventral wing eyespots while the dry season form (larvae reared under 19°C has much smaller eyespots. We compared the expression of three proteins, Notch, Engrailed, and Distal-less, in the future eyespot centers of the two forms to determine if eyespot size variation is associated with heterochronic shifts in the onset of their expression. For two of these proteins, Notch and Engrailed, expression in eyespot centers occurred earlier in dry season than in wet season larvae, while Distal-less showed no temporal difference between the two forms. These results suggest that differences between dry and wet season adult wings could be due to a delay in the onset of expression of these eyespot-associated genes. Early in eyespot development, Notch and Engrailed may be functioning as repressors rather than activators of the eyespot gene network. Alternatively, temporal variation in the onset of early expressed genes between forms may have no functional consequences to eyespot size regulation and may indicate the presence of an 'hourglass' model of development in butterfly eyespots.
Gene prediction using the Self-Organizing Map: automatic generation of multiple gene models.

Science.gov (United States)

Mahony, Shaun; McInerney, James O; Smith, Terry J; Golden, Aaron

2004-03-05

Many current gene prediction methods use only one model to represent protein-coding regions in a genome, and so are less likely to predict the location of genes that have an atypical sequence composition. It is likely that future improvements in gene finding will involve the development of methods that can adequately deal with intra-genomic compositional variation. This work explores a new approach to gene-prediction, based on the Self-Organizing Map, which has the ability to automatically identify multiple gene models within a genome. The current implementation, named RescueNet, uses relative synonymous codon usage as the indicator of protein-coding potential. While its raw accuracy rate can be less than other methods, RescueNet consistently identifies some genes that other methods do not, and should therefore be of interest to gene-prediction software developers and genome annotation teams alike. RescueNet is recommended for use in conjunction with, or as a complement to, other gene prediction methods.
Fate of a redundant gamma-globin gene in the atelid clade of New World monkeys: implications concerning fetal globin gene expression.

Science.gov (United States)

Meireles, C M; Schneider, M P; Sampaio, M I; Schneider, H; Slightom, J L; Chiu, C H; Neiswanger, K; Gumucio, D L; Czelusniak, J; Goodman, M

1995-01-01

Conclusive evidence was provided that gamma 1, the upstream of the two linked simian gamma-globin loci (5'-gamma 1-gamma 2-3'), is a pseudogene in a major group of New World monkeys. Sequence analysis of PCR-amplified genomic fragments of predicted sizes revealed that all extant genera of the platyrrhine family Atelidae [Lagothrix (woolly monkeys), Brachyteles (woolly spider monkeys), Ateles (spider monkeys), and Alouatta (howler monkeys)] share a large deletion that removed most of exon 2, all of intron 2 and exon 3, and much of the 3' flanking sequence of gamma 1. The fact that two functional gamma-globin genes were not present in early ancestors of the Atelidae (and that gamma 1 was the dispensible gene) suggests that for much or even all of their evolution, platyrrhines have had gamma 2 as the primary fetally expressed gamma-globin gene, in contrast to catarrhines (e.g., humans and chimpanzees) that have gamma 1 as the primary fetally expressed gamma-globin gene. Results from promoter sequences further suggest that all three platyrrhine families (Atelidae, Cebidae, and Pitheciidae) have gamma 2 rather than gamma 1 as their primary fetally expressed gamma-globin gene. The implications of this suggestion were explored in terms of how gene redundancy, regulatory mutations, and distance of each gamma-globin gene from the locus control region were possibly involved in the acquisition and maintenance of fetal, rather than embryonic, expression. Images Fig. 2 PMID:7535927
PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches

Science.gov (United States)

Fujibuchi, Wataru; Anderson, John S. J.; Landsman, David

2001-01-01

Consensus pattern and matrix-based searches designed to predict cis-acting transcriptional regulatory sequences have historically been subject to large numbers of false positives. We sought to decrease false positives by incorporating expression profile data into a consensus pattern-based search method. We have systematically analyzed the expression phenotypes of over 6000 yeast genes, across 121 expression profile experiments, and correlated them with the distribution of 14 known regulatory elements over sequences upstream of the genes. Our method is based on a metric we term probabilistic element assessment (PEA), which is a ranking of potential sites based on sequence similarity in the upstream regions of genes with similar expression phenotypes. For eight of the 14 known elements that we examined, our method had a much higher selectivity than a naïve consensus pattern search. Based on our analysis, we have developed a web-based tool called PROSPECT, which allows consensus pattern-based searching of gene clusters obtained from microarray data. PMID:11574681
Prognostic significance of glucose transporter-1 (GLUT1) gene expression in rectal cancer after preoperative chemoradiotherapy

International Nuclear Information System (INIS)

Saigusa, Susumu; Toiyama, Yuji; Tanaka, Koji; Okugawa, Yoshinaga; Fujikawa, Hiroyuki; Matsushita, Kohei; Uchida, Keiichi; Inoue, Yasuhiro; Kusunoki, Masato

2012-01-01

Most cancer cells exhibit increased glycolysis. The elevated glucose transporter 1 (GLUT1) expression has been reported to be associated with resistance to therapeutic agents and a poor prognosis. We wondered whether GLUT1 expression was associated with the clinical outcome in rectal cancer after preoperative chemoradiotherapy (CRT), and whether glycolysis inhibition could represent a novel anticancer treatment. We obtained total RNA from residual cancer cells using microdissection from a total of 52 rectal cancer specimens from patients who underwent preoperative CRT. We performed transcriptional analyzes, and studied the association of the GLUT1 gene expression levels with the clinical outcomes. In addition, we examined each proliferative response of three selected colorectal cancer cell lines to a glycolysis inhibitor, 3-bromopyruvic acid (3-BrPA), with regard to their expression of the GLUT1 gene. An elevated GLUT1 gene expression was associated with a high postoperative stage, the presence of lymph node metastasis, and distant recurrence. Moreover, elevated GLUT1 gene expression independently predicted both the recurrence-free and overall survival. In the in vitro studies, we observed that 3-BrPA significantly suppressed the proliferation of colon cancer cells with high GLUT1 gene expression, compared with those with low expression. An elevated GLUT1 expression may be a useful predictor of distant recurrence and poor prognosis in rectal cancer patients after preoperative CRT. (author)
Accurate Gene Expression-Based Biodosimetry Using a Minimal Set of Human Gene Transcripts

Energy Technology Data Exchange (ETDEWEB)

Tucker, James D., E-mail: jtucker@biology.biosci.wayne.edu [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Joiner, Michael C. [Department of Radiation Oncology, Wayne State University, Detroit, Michigan (United States); Thomas, Robert A.; Grever, William E.; Bakhmutsky, Marina V. [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Chinkhota, Chantelle N.; Smolinski, Joseph M. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States); Divine, George W. [Department of Public Health Sciences, Henry Ford Hospital, Detroit, Michigan (United States); Auner, Gregory W. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States)

2014-03-15

Purpose: Rapid and reliable methods for conducting biological dosimetry are a necessity in the event of a large-scale nuclear event. Conventional biodosimetry methods lack the speed, portability, ease of use, and low cost required for triaging numerous victims. Here we address this need by showing that polymerase chain reaction (PCR) on a small number of gene transcripts can provide accurate and rapid dosimetry. The low cost and relative ease of PCR compared with existing dosimetry methods suggest that this approach may be useful in mass-casualty triage situations. Methods and Materials: Human peripheral blood from 60 adult donors was acutely exposed to cobalt-60 gamma rays at doses of 0 (control) to 10 Gy. mRNA expression levels of 121 selected genes were obtained 0.5, 1, and 2 days after exposure by reverse-transcriptase real-time PCR. Optimal dosimetry at each time point was obtained by stepwise regression of dose received against individual gene transcript expression levels. Results: Only 3 to 4 different gene transcripts, ASTN2, CDKN1A, GDF15, and ATM, are needed to explain ≥0.87 of the variance (R{sup 2}). Receiver-operator characteristics, a measure of sensitivity and specificity, of 0.98 for these statistical models were achieved at each time point. Conclusions: The actual and predicted radiation doses agree very closely up to 6 Gy. Dosimetry at 8 and 10 Gy shows some effect of saturation, thereby slightly diminishing the ability to quantify higher exposures. Analyses of these gene transcripts may be advantageous for use in a field-portable device designed to assess exposures in mass casualty situations or in clinical radiation emergencies.
Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

Directory of Open Access Journals (Sweden)

Qiusheng Kong

Full Text Available Gene expression analysis in watermelon (Citrullus lanatus fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC, β-actin (ClACT, and alpha tubulin 5 (ClTUA5 as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1, a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.
Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

Science.gov (United States)

Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Liqiang; Cheng, Fei; Huang, Yuan; Bie, Zhilong

2015-01-01

Gene expression analysis in watermelon (Citrullus lanatus) fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR) is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC), β-actin (ClACT), and alpha tubulin 5 (ClTUA5) as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND) was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1), a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.
Novel gene sets improve set-level classification of prokaryotic gene expression data.

Science.gov (United States)

Holec, Matěj; Kuželka, Ondřej; Železný, Filip

2015-10-28

Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.
A gene expression profile indicative of early stage HER2 targeted therapy response.

Science.gov (United States)

O'Neill, Fiona; Madden, Stephen F; Clynes, Martin; Crown, John; Doolan, Padraig; Aherne, Sinéad T; O'Connor, Robert

2013-07-01

Efficacious application of HER2-targetting agents requires the identification of novel predictive biomarkers. Lapatinib, afatinib and neratinib are tyrosine kinase inhibitors (TKIs) of HER2 and EGFR growth factor receptors. A panel of breast cancer cell lines was treated with these agents, trastuzumab, gefitinib and cytotoxic therapies and the expression pattern of a specific panel of genes using RT-PCR was investigated as a potential marker of early drug response to HER2-targeting therapies. Treatment of HER2 TKI-sensitive SKBR3 and BT474 cell lines with lapatinib, afatinib and neratinib induced an increase in the expression of RB1CC1, ERBB3, FOXO3a and NR3C1. The response directly correlated with the degree of sensitivity. This expression pattern switched from up-regulated to down-regulated in the HER2 expressing, HER2-TKI insensitive cell line MDAMB453. Expression of the CCND1 gene demonstrated an inversely proportional response to drug exposure. A similar expression pattern was observed following the treatment with both neratinib and afatinib. These patterns were retained following exposure to traztuzumab and lapatinib plus capecitabine. In contrast, gefitinib, dasatinib and epirubicin treatment resulted in a completely different expression pattern change. In these HER2-expressing cell line models, lapatinib, neratinib, afatinib and trastuzumab treatment generated a characteristic and specific gene expression response, proportionate to the sensitivity of the cell lines to the HER2 inhibitor.Characterisation of the induced changes in expression levels of these genes may therefore give a valuable, very early predictor of the likely extent and specificity of tumour HER2 inhibitor response in patients, potentially guiding more specific use of these agents.
Differential neutrophil gene expression in early bovine pregnancy

Directory of Open Access Journals (Sweden)

Kizaki Keiichiro

2013-02-01

Full Text Available Abstract Background In food production animals, especially cattle, the diagnosis of gestation is important because the timing of gestation directly affects the running of farms. Various methods have been used to detect gestation, but none of them are ideal because of problems with the timing of detection or the accuracy, simplicity, or cost of the method. A new method for detecting gestation, which involves assessing interferon-tau (IFNT-stimulated gene expression in peripheral blood leukocytes (PBL, was recently proposed. PBL fractionation methods were used to examine whether the expression profiles of various PBL populations could be used as reliable diagnostic markers of bovine gestation. Methods PBL were collected on days 0 (just before artificial insemination, 7, 14, 17, 21, and 28 of gestation. The gene expression levels of the PBL were assessed with microarray analysis and/or quantitative real-time reverse transcription (q PCR. PBL fractions were collected by flow cytometry or density gradient cell separation using Histopaque 1083 or Ficoll-Conray solutions. The expression levels of four IFNT-stimulated genes, interferon-stimulated protein 15 kDa (ISG15, myxovirus-resistance (MX 1 and 2, and 2′-5′-oligoadenylate synthetase (OAS1, were then analyzed in each fraction through day 28 of gestation using qPCR. Results Microarray analysis detected 72 and 28 genes in whole PBL that were significantly higher on days 14 and 21 of gestation, respectively, than on day 0. The upregulated genes included IFNT-stimulated genes. The expression levels of these genes increased with the progression of gestation until day 21. In flow cytometry experiments, on day 14 the expression levels of all of the genes were significantly higher in the granulocyte fraction than in the other fractions. Their expression gradually decreased through day 28 of gestation. Strong correlations were observed between the expression levels of the four genes in the granulocyte
Expression of an isoflavone reductase-like gene enhanced by pollen tube growth in pistils of Solanum tuberosum.

Science.gov (United States)

van Eldik, G J; Ruiter, R K; Colla, P H; van Herpen, M M; Schrauwen, J A; Wullems, G J

1997-03-01

Successful sexual reproduction relies on gene products delivered by the pistil to create an environment suitable for pollen tube growth. These compounds are either produced before pollination or formed during the interactions between pistil and pollen tubes. Here we describe the pollination-enhanced expression of the cp100 gene in pistils of Solanum tuberosum. Temporal analysis of gene expression revealed an enhanced expression already one hour after pollination and lasts more than 72 h. Increase in expression also occurred after touching the stigma and was not restricted to the site of touch but spread into the style. The predicted CP100 protein shows similarity to leguminous isoflavone reductases (IFRs), but belongs to a family of IFR-like NAD(P)H-dependent oxidoreductases present in various plant species.
Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

Directory of Open Access Journals (Sweden)

Meizhen eWang

2016-01-01

Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.
The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection.

Science.gov (United States)

Tang, Zaixiang; Shen, Yueping; Zhang, Xinyan; Yi, Nengjun

2017-01-01

Large-scale "omics" data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, there are considerable challenges in analyzing high-dimensional molecular data, including the large number of potential molecular predictors, limited number of samples, and small effect of each predictor. We propose new Bayesian hierarchical generalized linear models, called spike-and-slab lasso GLMs, for prognostic prediction and detection of associated genes using large-scale molecular data. The proposed model employs a spike-and-slab mixture double-exponential prior for coefficients that can induce weak shrinkage on large coefficients, and strong shrinkage on irrelevant coefficients. We have developed a fast and stable algorithm to fit large-scale hierarchal GLMs by incorporating expectation-maximization (EM) steps into the fast cyclic coordinate descent algorithm. The proposed approach integrates nice features of two popular methods, i.e., penalized lasso and Bayesian spike-and-slab variable selection. The performance of the proposed method is assessed via extensive simulation studies. The results show that the proposed approach can provide not only more accurate estimates of the parameters, but also better prediction. We demonstrate the proposed procedure on two cancer data sets: a well-known breast cancer data set consisting of 295 tumors, and expression data of 4919 genes; and the ovarian cancer data set from TCGA with 362 tumors, and expression data of 5336 genes. Our analyses show that the proposed procedure can generate powerful models for predicting outcomes and detecting associated genes. The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Copyright © 2017 by the Genetics Society of America.
Identification of differentially expressed genes in flax (Linum usitatissimum L.) under saline-alkaline stress by digital gene expression.

Science.gov (United States)

Yu, Ying; Huang, Wengong; Chen, Hongyu; Wu, Guangwen; Yuan, Hongmei; Song, Xixia; Kang, Qinghua; Zhao, Dongsheng; Jiang, Weidong; Liu, Yan; Wu, Jianzhong; Cheng, Lili; Yao, Yubo; Guan, Fengzhi

2014-10-01

The salinization and alkalization of soil are widespread environmental problems, and alkaline salt stress is more destructive than neutral salt stress. Therefore, understanding the mechanism of plant tolerance to saline-alkaline stress has become a major challenge. However, little attention has been paid to the mechanism of plant alkaline salt tolerance. In this study, gene expression profiling of flax was analyzed under alkaline-salt stress (AS2), neutral salt stress (NSS) and alkaline stress (AS) by digital gene expression. Three-week-old flax seedlings were placed in 25 mM Na2CO3 (pH11.6) (AS2), 50mM NaCl (NSS) and NaOH (pH11.6) (AS) for 18 h. There were 7736, 1566 and 454 differentially expressed genes in AS2, NSS and AS compared to CK, respectively. The GO category gene enrichment analysis revealed that photosynthesis was particularly affected in AS2, carbohydrate metabolism was particularly affected in NSS, and the response to biotic stimulus was particularly affected in AS. We also analyzed the expression pattern of five categories of genes including transcription factors, signaling transduction proteins, phytohormones, reactive oxygen species proteins and transporters under these three stresses. Some key regulatory gene families involved in abiotic stress, such as WRKY, MAPKKK, ABA, PrxR and ion channels, were differentially expressed. Compared with NSS and AS, AS2 triggered more differentially expressed genes and special pathways, indicating that the mechanism of AS2 was more complex than NSS and AS. To the best of our knowledge, this was the first transcriptome analysis of flax in response to saline-alkaline stress. These data indicate that common and diverse features of saline-alkaline stress provide novel insights into the molecular mechanisms of plant saline-alkaline tolerance and offer a number of candidate genes as potential markers of tolerance to saline-alkaline stress. Copyright © 2014 Elsevier B.V. All rights reserved.

Interplay of Noisy Gene Expression and Dynamics Explains Patterns of Bacterial Operon Organization

Science.gov (United States)

Igoshin, Oleg

2011-03-01

Bacterial chromosomes are organized into operons -- sets of genes co-transcribed into polycistronic messenger RNA. Hypotheses explaining the emergence and maintenance of operons include proportional co-regulation, horizontal transfer of intact ``selfish'' operons, emergence via gene duplication, and co-production of physically interacting proteins to speed their association. We hypothesized an alternative: operons can reduce or increase intrinsic gene expression noise in a manner dependent on the post-translational interactions, thereby resulting in selection for or against operons in depending on the network architecture. We devised five classes of two-gene network modules and show that the effects of operons on intrinsic noise depend on class membership. Two classes exhibit decreased noise with co-transcription, two others reveal increased noise, and the remaining one does not show a significant difference. To test our modeling predictions we employed bioinformatic analysis to determine the relationship gene expression noise and operon organization. The results confirm the overrepresentation of noise-minimizing operon architectures and provide evidence against other hypotheses. Our results thereby suggest a central role for gene expression noise in selecting for or maintaining operons in bacterial chromosomes. This demonstrates how post-translational network dynamics may provide selective pressure for organizing bacterial chromosomes, and has practical consequences for designing synthetic gene networks. This work is supported by National Institutes of Health grant 1R01GM096189-01.
Improved gene expression signature of testicular carcinoma in situ

DEFF Research Database (Denmark)

Almstrup, Kristian; Leffers, Henrik; Lothe, Ragnhild A

2007-01-01

on global gene expression in testicular CIS have been previously published. We have merged the two data sets on CIS samples (n = 6) and identified the shared gene expression signature in relation to expression in normal testis. Among the top-20 highest expressed genes, one-third was transcription factors...... development' were significantly altered and could collectively affect cellular pathways like the WNT signalling cascade, which thus may be disrupted in testicular CIS. The merged CIS data from two different microarray platforms, to our knowledge, provide the most precise CIS gene expression signature to date....
G-cimp status prediction of glioblastoma samples using mRNA expression data.

Science.gov (United States)

Baysan, Mehmet; Bozdag, Serdar; Cam, Margaret C; Kotliarova, Svetlana; Ahn, Susie; Walling, Jennifer; Killian, Jonathan K; Stevenson, Holly; Meltzer, Paul; Fine, Howard A

2012-01-01

Glioblastoma Multiforme (GBM) is a tumor with high mortality and no known cure. The dramatic molecular and clinical heterogeneity seen in this tumor has led to attempts to define genetically similar subgroups of GBM with the hope of developing tumor specific therapies targeted to the unique biology within each of these subgroups. Recently, a subset of relatively favorable prognosis GBMs has been identified. These glioma CpG island methylator phenotype, or G-CIMP tumors, have distinct genomic copy number aberrations, DNA methylation patterns, and (mRNA) expression profiles compared to other GBMs. While the standard method for identifying G-CIMP tumors is based on genome-wide DNA methylation data, such data is often not available compared to the more widely available gene expression data. In this study, we have developed and evaluated a method to predict the G-CIMP status of GBM samples based solely on gene expression data.
Identification, Expression Analysis, and Target Prediction of Flax Genotroph MicroRNAs Under Normal and Nutrient Stress Conditions

Science.gov (United States)

Melnikova, Nataliya V.; Dmitriev, Alexey A.; Belenikin, Maxim S.; Koroban, Nadezhda V.; Speranskaya, Anna S.; Krinitsina, Anastasia A.; Krasnov, George S.; Lakunina, Valentina A.; Snezhkina, Anastasiya V.; Sadritdinova, Asiya F.; Kishlyan, Natalya V.; Rozhmina, Tatiana A.; Klimina, Kseniya M.; Amosova, Alexandra V.; Zelenin, Alexander V.; Muravenko, Olga V.; Bolsheva, Nadezhda L.; Kudryavtseva, Anna V.

2016-01-01

Cultivated flax (Linum usitatissimum L.) is an important plant valuable for industry. Some flax lines can undergo heritable phenotypic and genotypic changes (LIS-1 insertion being the most common) in response to nutrient stress and are called plastic lines. Offspring of plastic lines, which stably inherit the changes, are called genotrophs. MicroRNAs (miRNAs) are involved in a crucial regulatory mechanism of gene expression. They have previously been assumed to take part in nutrient stress response and can, therefore, participate in genotroph formation. In the present study, we performed high-throughput sequencing of small RNAs (sRNAs) extracted from flax plants grown under normal, phosphate deficient and nutrient excess conditions to identify miRNAs and evaluate their expression. Our analysis revealed expression of 96 conserved miRNAs from 21 families in flax. Moreover, 475 novel potential miRNAs were identified for the first time, and their targets were predicted. However, none of the identified miRNAs were transcribed from LIS-1. Expression of seven miRNAs (miR168, miR169, miR395, miR398, miR399, miR408, and lus-miR-N1) with up- or down-regulation under nutrient stress (on the basis of high-throughput sequencing data) was evaluated on extended sampling using qPCR. Reference gene search identified ETIF3H and ETIF3E genes as most suitable for this purpose. Down-regulation of novel potential lus-miR-N1 and up-regulation of conserved miR399 were revealed under the phosphate deficient conditions. In addition, the negative correlation of expression of lus-miR-N1 and its predicted target, ubiquitin-activating enzyme E1 gene, as well as, miR399 and its predicted target, ubiquitin-conjugating enzyme E2 gene, was observed. Thus, in our study, miRNAs expressed in flax plastic lines and genotrophs were identified and their expression and expression of their targets was evaluated using high-throughput sequencing and qPCR for the first time. These data provide new insights
The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

Science.gov (United States)

Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

2011-02-01

The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionary conserved genes in vertebrates. These genes are preferentially expressed in previtelloogenic oocytes, and thus, they
APRIL is a novel clinical chemo-resistance biomarker in colorectal adenocarcinoma identified by gene expression profiling

International Nuclear Information System (INIS)

Petty, Russell D; Wang, Weiguang; Gilbert, Fiona; Semple, Scot; Collie-Duguid, Elaina SR; Samuel, Leslie M; Murray, Graeme I; MacDonald, Graham; O'Kelly, Terrence; Loudon, Malcolm; Binnie, Norman; Aly, Emad; McKinlay, Aileen

2009-01-01

5-Fluorouracil(5FU) and oral analogues, such as capecitabine, remain one of the most useful agents for the treatment of colorectal adenocarcinoma. Low toxicity and convenience of administration facilitate use, however clinical resistance is a major limitation. Investigation has failed to fully explain the molecular mechanisms of resistance and no clinically useful predictive biomarkers for 5FU resistance have been identified. We investigated the molecular mechanisms of clinical 5FU resistance in colorectal adenocarcinoma patients in a prospective biomarker discovery project utilising gene expression profiling. The aim was to identify novel 5FU resistance mechanisms and qualify these as candidate biomarkers and therapeutic targets. Putative treatment specific gene expression changes were identified in a transcriptomics study of rectal adenocarcinomas, biopsied and profiled before and after pre-operative short-course radiotherapy or 5FU based chemo-radiotherapy, using microarrays. Tumour from untreated controls at diagnosis and resection identified treatment-independent gene expression changes. Candidate 5FU chemo-resistant genes were identified by comparison of gene expression data sets from these clinical specimens with gene expression signatures from our previous studies of colorectal cancer cell lines, where parental and daughter lines resistant to 5FU were compared. A colorectal adenocarcinoma tissue microarray (n = 234, resected tumours) was used as an independent set to qualify candidates thus identified. APRIL/TNFSF13 mRNA was significantly upregulated following 5FU based concurrent chemo-radiotherapy and in 5FU resistant colorectal adenocarcinoma cell lines but not in radiotherapy alone treated colorectal adenocarcinomas. Consistent withAPRIL's known function as an autocrine or paracrine secreted molecule, stromal but not tumour cell protein expression by immunohistochemistry was correlated with poor prognosis (p = 0.019) in the independent set
Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.

Directory of Open Access Journals (Sweden)

Neutelings Godfrey

2010-04-01

Full Text Available Abstract Background Quantitative real-time PCR (qRT-PCR is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs. Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L. Results Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups. qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59. LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both ge
Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

Science.gov (United States)

Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

2010-04-19

Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and Norm
Differential gene expression during the moult cycle of Antarctic krill (Euphausia superba

Directory of Open Access Journals (Sweden)

Gaten Edward

2010-10-01

Full Text Available Abstract Background All crustaceans periodically moult to renew their exoskeleton. In krill this involves partial digestion and resorption of the old exoskeleton and synthesis of new cuticle. Molecular events that underlie the moult cycle are poorly understood in calcifying crustaceans and even less so in non-calcifying organisms such as krill. To address this we constructed an Antarctic krill cDNA microarray in order to generate gene expression profiles across the moult cycle and identify possible activation pathways. Results A total of 26 different cuticle genes were identified that showed differential gene expression across the moult cycle. Almost all cuticle genes were up regulated during premoult and down regulated during late intermoult. There were a number of transcripts with significant sequence homology to genes potentially involved in the synthesis, breakdown and resorption of chitin. During early premoult glutamine synthetase, a gene involved in generating an amino acid used in the synthesis of glucosamine, a constituent of chitin, was up regulated more than twofold. Mannosyltransferase 1, a member of the glycosyltransferase family of enzymes that includes chitin synthase was also up regulated during early premoult. Transcripts homologous to a β-N-acetylglucosaminidase (β-NAGase precursor were expressed at a higher level during late intermoult (prior to apolysis than during premoult. This observation coincided with the up regulation during late intermoult, of a coatomer subunit epsilon involved in the production of vesicles that maybe used to transport the β-NAGase precursors into the exuvial cleft. Trypsin, known to activate the β-NAGase precursor, was up regulated more than fourfold during premoult. The up regulation of a predicted oligopeptide transporter during premoult may allow the transport of chitin breakdown products across the newly synthesised epi- and exocuticle layers. Conclusion We have identified many genes
Expression analysis of the Theileria parva subtelomere-encoded variable secreted protein gene family.

Directory of Open Access Journals (Sweden)

Jacqueline Schmuckli-Maurer

Full Text Available The intracellular protozoan parasite Theileria parva transforms bovine lymphocytes inducing uncontrolled proliferation. Proteins released from the parasite are assumed to contribute to phenotypic changes of the host cell and parasite persistence. With 85 members, genes encoding subtelomeric variable secreted proteins (SVSPs form the largest gene family in T. parva. The majority of SVSPs contain predicted signal peptides, suggesting secretion into the host cell cytoplasm.We analysed SVSP expression in T. parva-transformed cell lines established in vitro by infection of T or B lymphocytes with cloned T. parva parasites. Microarray and quantitative real-time PCR analysis revealed mRNA expression for a wide range of SVSP genes. The pattern of mRNA expression was largely defined by the parasite genotype and not by host background or cell type, and found to be relatively stable in vitro over a period of two months. Interestingly, immunofluorescence analysis carried out on cell lines established from a cloned parasite showed that expression of a single SVSP encoded by TP03_0882 is limited to only a small percentage of parasites. Epitope-tagged TP03_0882 expressed in mammalian cells was found to translocate into the nucleus, a process that could be attributed to two different nuclear localisation signals.Our analysis reveals a complex pattern of Theileria SVSP mRNA expression, which depends on the parasite genotype. Whereas in cell lines established from a cloned parasite transcripts can be found corresponding to a wide range of SVSP genes, only a minority of parasites appear to express a particular SVSP protein. The fact that a number of SVSPs contain functional nuclear localisation signals suggests that proteins released from the parasite could contribute to phenotypic changes of the host cell. This initial characterisation will facilitate future studies on the regulation of SVSP gene expression and the potential biological role of these enigmatic
Time warping of evolutionary distant temporal gene expression data based on noise suppression

Directory of Open Access Journals (Sweden)

Papatsenko Dmitri

2009-10-01

Full Text Available Abstract Background Comparative analysis of genome wide temporal gene expression data has a broad potential area of application, including evolutionary biology, developmental biology, and medicine. However, at large evolutionary distances, the construction of global alignments and the consequent comparison of the time-series data are difficult. The main reason is the accumulation of variability in expression profiles of orthologous genes, in the course of evolution. Results We applied Pearson distance matrices, in combination with other noise-suppression techniques and data filtering to improve alignments. This novel framework enhanced the capacity to capture the similarities between the temporal gene expression datasets separated by large evolutionary distances. We aligned and compared the temporal gene expression data in budding (Saccharomyces cerevisiae and fission (Schizosaccharomyces pombe yeast, which are separated by more then ~400 myr of evolution. We found that the global alignment (time warping properly matched the duration of cell cycle phases in these distant organisms, which was measured in prior studies. At the same time, when applied to individual ortholog pairs, this alignment procedure revealed groups of genes with distinct alignments, different from the global alignment. Conclusion Our alignment-based predictions of differences in the cell cycle phases between the two yeast species were in a good agreement with the existing data, thus supporting the computational strategy adopted in this study. We propose that the existence of the alternative alignments, specific to distinct groups of genes, suggests presence of different synchronization modes between the two organisms and possible functional decoupling of particular physiological gene networks in the course of evolution.
The Medicago truncatula gene expression atlas web server

Directory of Open Access Journals (Sweden)

Tang Yuhong

2009-12-01

Full Text Available Abstract Background Legumes (Leguminosae or Fabaceae play a major role in agriculture. Transcriptomics studies in the model legume species, Medicago truncatula, are instrumental in helping to formulate hypotheses about the role of legume genes. With the rapid growth of publically available Affymetrix GeneChip Medicago Genome Array GeneChip data from a great range of tissues, cell types, growth conditions, and stress treatments, the legume research community desires an effective bioinformatics system to aid efforts to interpret the Medicago genome through functional genomics. We developed the Medicago truncatula Gene Expression Atlas (MtGEA web server for this purpose. Description The Medicago truncatula Gene Expression Atlas (MtGEA web server is a centralized platform for analyzing the Medicago transcriptome. Currently, the web server hosts gene expression data from 156 Affymetrix GeneChip® Medicago genome arrays in 64 different experiments, covering a broad range of developmental and environmental conditions. The server enables flexible, multifaceted analyses of transcript data and provides a range of additional information about genes, including different types of annotation and links to the genome sequence, which help users formulate hypotheses about gene function. Transcript data can be accessed using Affymetrix probe identification number, DNA sequence, gene name, functional description in natural language, GO and KEGG annotation terms, and InterPro domain number. Transcripts can also be discovered through co-expression or differential expression analysis. Flexible tools to select a subset of experiments and to visualize and compare expression profiles of multiple genes have been implemented. Data can be downloaded, in part or full, in a tabular form compatible with common analytical and visualization software. The web server will be updated on a regular basis to incorporate new gene expression data and genome annotation, and is accessible
A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

Directory of Open Access Journals (Sweden)

Simone de Jong

Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.
Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

KAUST Repository

Abusamra, Heba

2016-07-20

The native nature of high dimension low sample size of gene expression data make the classification task more challenging. Therefore, feature (gene) selection become an apparent need. Selecting a meaningful and relevant genes for classifier not only decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used leukemia gene expression dataset [1]. The effectiveness of the selected features were evaluated by four different classification methods; support vector machines, k-nearest neighbor, random forest, and linear discriminate analysis. The method evaluate the importance and relevance of each gene cluster by summing the expression level for each gene belongs to this cluster. The gene cluster consider important, if it satisfies conditions depend on thresholds and percentage otherwise eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a), after applying our feature selection methodology we end up with specific 1117 genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold condition, number reduced to 58 genes have be tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows distinct pattern of refines genes discriminating between two classes of leukemia.
A strategy for full interrogation of prognostic gene expression patterns: exploring the biology of diffuse large B cell lymphoma.

Directory of Open Access Journals (Sweden)

Lisa M Rimsza

Full Text Available Gene expression profiling yields quantitative data on gene expression used to create prognostic models that accurately predict patient outcome in diffuse large B cell lymphoma (DLBCL. Often, data are analyzed with genes classified by whether they fall above or below the median expression level. We sought to determine whether examining multiple cut-points might be a more powerful technique to investigate the association of gene expression with outcome.We explored gene expression profiling data using variable cut-point analysis for 36 genes with reported prognostic value in DLBCL. We plotted two-group survival logrank test statistics against corresponding cut-points of the gene expression levels and smooth estimates of the hazard ratio of death versus gene expression levels. To facilitate comparisons we also standardized the expression of each of the genes by the fraction of patients that would be identified by any cut-point. A multiple comparison adjusted permutation p-value identified 3 different patterns of significance: 1 genes with significant cut-point points below the median, whose loss is associated with poor outcome (e.g. HLA-DR; 2 genes with significant cut-points above the median, whose over-expression is associated with poor outcome (e.g. CCND2; and 3 genes with significant cut-points on either side of the median, (e.g. extracellular molecules such as FN1.Variable cut-point analysis with permutation p-value calculation can be used to identify significant genes that would not otherwise be identified with median cut-points and may suggest biological patterns of gene effects.
Positron emission tomography imaging of gene expression

International Nuclear Information System (INIS)

Tang Ganghua

2001-01-01

The merging of molecular biology and nuclear medicine is developed into molecular nuclear medicine. Positron emission tomography (PET) of gene expression in molecular nuclear medicine has become an attractive area. Positron emission tomography imaging gene expression includes the antisense PET imaging and the reporter gene PET imaging. It is likely that the antisense PET imaging will lag behind the reporter gene PET imaging because of the numerous issues that have not yet to be resolved with this approach. The reporter gene PET imaging has wide application into animal experimental research and human applications of this approach will likely be reported soon
Radiation Gene-expression Signatures in Primary Breast Cancer Cells.

Science.gov (United States)

Minafra, Luigi; Bravatà, Valentina; Cammarata, Francesco P; Russo, Giorgio; Gilardi, Maria C; Forte, Giusi I

2018-05-01

In breast cancer (BC) care, radiation therapy (RT) is an efficient treatment to control localized tumor. Radiobiological research is needed to understand molecular differences that affect radiosensitivity of different tumor subtypes and the response variability. The aim of this study was to analyze gene expression profiling (GEP) in primary BC cells following irradiation with doses of 9 Gy and 23 Gy delivered by intraoperative electron radiation therapy (IOERT) in order to define gene signatures of response to high doses of ionizing radiation. We performed GEP by cDNA microarrays and evaluated cell survival after IOERT treatment in primary BC cell cultures. Real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) was performed to validate candidate genes. We showed, for the first time, a 4-gene and a 6-gene signature, as new molecular biomarkers, in two primary BC cell cultures after exposure at 9 Gy and 23 Gy respectively, for which we observed a significantly high survival rate. Gene signatures activated by different doses of ionizing radiation may predict response to RT and contribute to defining a personalized biological-driven treatment plan. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Understanding gene expression in coronary artery disease through ...

Indian Academy of Sciences (India)

Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. Prathima ... Table 2. Differentially expressed genes in CAD compared to age and gender matched controls. .... Regulation of nuclear pre-mRNA domain containing 1A.
Gene expression profile of pulpitis.

Science.gov (United States)

Galicia, J C; Henson, B R; Parker, J S; Khan, A A

2016-06-01

The cost, prevalence and pain associated with endodontic disease necessitate an understanding of the fundamental molecular aspects of its pathogenesis. This study was aimed to identify the genetic contributors to pulpal pain and inflammation. Inflamed pulps were collected from patients diagnosed with irreversible pulpitis (n=20). Normal pulps from teeth extracted for various reasons served as controls (n=20). Pain level was assessed using a visual analog scale (VAS). Genome-wide microarray analysis was performed using Affymetrix GeneTitan Multichannel Instrument. The difference in gene expression levels were determined by the significance analysis of microarray program using a false discovery rate (q-value) of 5%. Genes involved in immune response, cytokine-cytokine receptor interaction and signaling, integrin cell surface interactions, and others were expressed at relatively higher levels in the pulpitis group. Moreover, several genes known to modulate pain and inflammation showed differential expression in asymptomatic and mild pain patients (⩾30 mm on VAS) compared with those with moderate to severe pain. This exploratory study provides a molecular basis for the clinical diagnosis of pulpitis. With an enhanced understanding of pulpal inflammation, future studies on treatment and management of pulpitis and on pain associated with it can have a biological reference to bridge treatment strategies with pulpal biology.
Mel-18, a mammalian Polycomb gene, regulates angiogenic gene expression of endothelial cells.

Science.gov (United States)

Jung, Ji-Hye; Choi, Hyun-Jung; Maeng, Yong-Sun; Choi, Jung-Yeon; Kim, Minhyung; Kwon, Ja-Young; Park, Yong-Won; Kim, Young-Myeong; Hwang, Daehee; Kwon, Young-Guen

2010-10-01

Mel-18 is a mammalian homolog of Polycomb group (PcG) genes. Microarray analysis revealed that Mel-18 expression was induced during endothelial progenitor cell (EPC) differentiation and correlates with the expression of EC-specific protein markers. Overexpression of Mel-18 promoted EPC differentiation and angiogenic activity of ECs. Accordingly, silencing Mel-18 inhibited EC migration and tube formation in vitro. Gene expression profiling showed that Mel-18 regulates angiogenic genes including kinase insert domain receptor (KDR), claudin 5, and angiopoietin-like 2. Our findings demonstrate, for the first time, that Mel-18 plays a significant role in the angiogenic function of ECs by regulating endothelial gene expression. Copyright © 2010 Elsevier Inc. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.