WorldWideScience

Sample records for approaches gene expression

  1. A constructive approach to gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2004-01-01

    Recently, experiments on mRNA abundance (gene expression) have revealed that gene expression shows a stationary organization described by a scale-free distribution. Here we propose a constructive approach to gene expression dynamics which restores the scale-free exponent and describes the intermediate state dynamics. This approach requires only one assumption: Markov property

  2. A stochastic approach to multi-gene expression dynamics

    International Nuclear Information System (INIS)

    Ochiai, T.; Nacher, J.C.; Akutsu, T.

    2005-01-01

    In the last years, tens of thousands gene expression profiles for cells of several organisms have been monitored. Gene expression is a complex transcriptional process where mRNA molecules are translated into proteins, which control most of the cell functions. In this process, the correlation among genes is crucial to determine the specific functions of genes. Here, we propose a novel multi-dimensional stochastic approach to deal with the gene correlation phenomena. Interestingly, our stochastic framework suggests that the study of the gene correlation requires only one theoretical assumption-Markov property-and the experimental transition probability, which characterizes the gene correlation system. Finally, a gene expression experiment is proposed for future applications of the model

  3. Modeling gene expression measurement error: a quasi-likelihood approach

    Directory of Open Access Journals (Sweden)

    Strimmer Korbinian

    2003-03-01

    Full Text Available Abstract Background Using suitable error models for gene expression measurements is essential in the statistical analysis of microarray data. However, the true probabilistic model underlying gene expression intensity readings is generally not known. Instead, in currently used approaches some simple parametric model is assumed (usually a transformed normal distribution or the empirical distribution is estimated. However, both these strategies may not be optimal for gene expression data, as the non-parametric approach ignores known structural information whereas the fully parametric models run the risk of misspecification. A further related problem is the choice of a suitable scale for the model (e.g. observed vs. log-scale. Results Here a simple semi-parametric model for gene expression measurement error is presented. In this approach inference is based an approximate likelihood function (the extended quasi-likelihood. Only partial knowledge about the unknown true distribution is required to construct this function. In case of gene expression this information is available in the form of the postulated (e.g. quadratic variance structure of the data. As the quasi-likelihood behaves (almost like a proper likelihood, it allows for the estimation of calibration and variance parameters, and it is also straightforward to obtain corresponding approximate confidence intervals. Unlike most other frameworks, it also allows analysis on any preferred scale, i.e. both on the original linear scale as well as on a transformed scale. It can also be employed in regression approaches to model systematic (e.g. array or dye effects. Conclusions The quasi-likelihood framework provides a simple and versatile approach to analyze gene expression data that does not make any strong distributional assumptions about the underlying error model. For several simulated as well as real data sets it provides a better fit to the data than competing models. In an example it also

  4. Clustering gene expression regulators: new approach to disease subtyping.

    Directory of Open Access Journals (Sweden)

    Mikhail Pyatnitskiy

    Full Text Available One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms, that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient.

  5. Gene expression and gene therapy imaging

    International Nuclear Information System (INIS)

    Rome, Claire; Couillaud, Franck; Moonen, Chrit T.W.

    2007-01-01

    The fast growing field of molecular imaging has achieved major advances in imaging gene expression, an important element of gene therapy. Gene expression imaging is based on specific probes or contrast agents that allow either direct or indirect spatio-temporal evaluation of gene expression. Direct evaluation is possible with, for example, contrast agents that bind directly to a specific target (e.g., receptor). Indirect evaluation may be achieved by using specific substrate probes for a target enzyme. The use of marker genes, also called reporter genes, is an essential element of MI approaches for gene expression in gene therapy. The marker gene may not have a therapeutic role itself, but by coupling the marker gene to a therapeutic gene, expression of the marker gene reports on the expression of the therapeutic gene. Nuclear medicine and optical approaches are highly sensitive (detection of probes in the picomolar range), whereas MRI and ultrasound imaging are less sensitive and require amplification techniques and/or accumulation of contrast agents in enlarged contrast particles. Recently developed MI techniques are particularly relevant for gene therapy. Amongst these are the possibility to track gene therapy vectors such as stem cells, and the techniques that allow spatiotemporal control of gene expression by non-invasive heating (with MRI guided focused ultrasound) and the use of temperature sensitive promoters. (orig.)

  6. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  7. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    The native nature of high dimension low sample size of gene expression data make the classification task more challenging. Therefore, feature (gene) selection become an apparent need. Selecting a meaningful and relevant genes for classifier not only decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used leukemia gene expression dataset [1]. The effectiveness of the selected features were evaluated by four different classification methods; support vector machines, k-nearest neighbor, random forest, and linear discriminate analysis. The method evaluate the importance and relevance of each gene cluster by summing the expression level for each gene belongs to this cluster. The gene cluster consider important, if it satisfies conditions depend on thresholds and percentage otherwise eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a), after applying our feature selection methodology we end up with specific 1117 genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold condition, number reduced to 58 genes have be tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows distinct pattern of refines genes discriminating between two classes of leukemia.

  8. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    Science.gov (United States)

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  9. Determining Physical Mechanisms of Gene Expression Regulation from Single Cell Gene Expression Data

    OpenAIRE

    Ezer, Daphne; Moignard, Victoria; G?ttgens, Berthold; Adryan, Boris

    2016-01-01

    Many genes are expressed in bursts, which can contribute to cell-to-cell heterogeneity. It is now possible to measure this heterogeneity with high throughput single cell gene expression assays (single cell qPCR and RNA-seq). These experimental approaches generate gene expression distributions which can be used to estimate the kinetic parameters of gene expression bursting, namely the rate that genes turn on, the rate that genes turn off, and the rate of transcription. We construct a complete ...

  10. A robust approach based on Weibull distribution for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Gong Binsheng

    2011-05-01

    Full Text Available Abstract Background Clustering is a widely used technique for analysis of gene expression data. Most clustering methods group genes based on the distances, while few methods group genes according to the similarities of the distributions of the gene expression levels. Furthermore, as the biological annotation resources accumulated, an increasing number of genes have been annotated into functional categories. As a result, evaluating the performance of clustering methods in terms of the functional consistency of the resulting clusters is of great interest. Results In this paper, we proposed the WDCM (Weibull Distribution-based Clustering Method, a robust approach for clustering gene expression data, in which the gene expressions of individual genes are considered as the random variables following unique Weibull distributions. Our WDCM is based on the concept that the genes with similar expression profiles have similar distribution parameters, and thus the genes are clustered via the Weibull distribution parameters. We used the WDCM to cluster three cancer gene expression data sets from the lung cancer, B-cell follicular lymphoma and bladder carcinoma and obtained well-clustered results. We compared the performance of WDCM with k-means and Self Organizing Map (SOM using functional annotation information given by the Gene Ontology (GO. The results showed that the functional annotation ratios of WDCM are higher than those of the other methods. We also utilized the external measure Adjusted Rand Index to validate the performance of the WDCM. The comparative results demonstrate that the WDCM provides the better clustering performance compared to k-means and SOM algorithms. The merit of the proposed WDCM is that it can be applied to cluster incomplete gene expression data without imputing the missing values. Moreover, the robustness of WDCM is also evaluated on the incomplete data sets. Conclusions The results demonstrate that our WDCM produces clusters

  11. Enhanced gene ranking approaches using modified trace ratio algorithm for gene expression data

    Directory of Open Access Journals (Sweden)

    Shruti Mishra

    Full Text Available Microarray technology enables the understanding and investigation of gene expression levels by analyzing high dimensional datasets that contain few samples. Over time, microarray expression data have been collected for studying the underlying biological mechanisms of disease. One such application for understanding the mechanism is by constructing a gene regulatory network (GRN. One of the foremost key criteria for GRN discovery is gene selection. Choosing a generous set of genes for the structure of the network is highly desirable. For this role, two suitable methods were proposed for selection of appropriate genes. The first approach comprises a gene selection method called Information gain, where the dataset is reformed and fused with another distinct algorithm called Trace Ratio (TR. Our second method is the implementation of our projected modified TR algorithm, where the scoring base for finding weight matrices has been re-designed. Both the methods' efficiency was shown with different classifiers that include variants of the Artificial Neural Network classifier, such as Resilient Propagation, Quick Propagation, Back Propagation, Manhattan Propagation and Radial Basis Function Neural Network and also the Support Vector Machine (SVM classifier. In the study, it was confirmed that both of the proposed methods worked well and offered high accuracy with a lesser number of iterations as compared to the original Trace Ratio algorithm. Keywords: Gene regulatory network, Gene selection, Information gain, Trace ratio, Canonical correlation analysis, Classification

  12. Vaccine-induced modulation of gene expression in turbot peritoneal cells. A microarray approach.

    Science.gov (United States)

    Fontenla, Francisco; Blanco-Abad, Verónica; Pardo, Belén G; Folgueira, Iria; Noia, Manuel; Gómez-Tato, Antonio; Martínez, Paulino; Leiro, José M; Lamas, Jesús

    2016-07-01

    We used a microarray approach to examine changes in gene expression in turbot peritoneal cells after injection of the fish with vaccines containing the ciliate parasite Philasterides dicentrarchi as antigen and one of the following adjuvants: chitosan-PVMMA microspheres, Freund́s complete adjuvant, aluminium hydroxide gel or Matrix-Q (Isconova, Sweden). We identified 374 genes that were differentially expressed in all groups of fish. Forty-two genes related to tight junctions and focal adhesions and/or actin cytoskeleton were differentially expressed in free peritoneal cells. The profound changes in gene expression related to cell adherence and cytoskeleton may be associated with cell migration and also with the formation of cell-vaccine masses and their attachment to the peritoneal wall. Thirty-five genes related to apoptosis were differentially expressed. Although most of the proteins coded by these genes have a proapoptotic effect, others are antiapoptotic, indicating that both types of signals occur in peritoneal leukocytes of vaccinated fish. Interestingly, many of the genes related to lymphocytes and lymphocyte activity were downregulated in the groups injected with vaccine. We also observed decreased expression of genes related to antigen presentation, suggesting that macrophages (which were abundant in the peritoneal cavity after vaccination) did not express these during the early inflammatory response in the peritoneal cavity. Finally, several genes that participate in the inflammatory response were differentially expressed, and most participated in resolution of inflammation, indicating that an M2 macrophage response is generated in the peritoneal cavity of fish one day post vaccination. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Functional Associations by Response Overlap (FARO, a functional genomics approach matching gene expression phenotypes.

    Directory of Open Access Journals (Sweden)

    Henrik Bjørn Nielsen

    2007-08-01

    Full Text Available The systematic comparison of transcriptional responses of organisms is a powerful tool in functional genomics. For example, mutants may be characterized by comparing their transcript profiles to those obtained in other experiments querying the effects on gene expression of many experimental factors including treatments, mutations and pathogen infections. Similarly, drugs may be discovered by the relationship between the transcript profiles effectuated or impacted by a candidate drug and by the target disease. The integration of such data enables systems biology to predict the interplay between experimental factors affecting a biological system. Unfortunately, direct comparisons of gene expression profiles obtained in independent, publicly available microarray experiments are typically compromised by substantial, experiment-specific biases. Here we suggest a novel yet conceptually simple approach for deriving 'Functional Association(s by Response Overlap' (FARO between microarray gene expression studies. The transcriptional response is defined by the set of differentially expressed genes independent from the magnitude or direction of the change. This approach overcomes the limited comparability between studies that is typical for methods that rely on correlation in gene expression. We apply FARO to a compendium of 242 diverse Arabidopsis microarray experimental factors, including phyto-hormones, stresses and pathogens, growth conditions/stages, tissue types and mutants. We also use FARO to confirm and further delineate the functions of Arabidopsis MAP kinase 4 in disease and stress responses. Furthermore, we find that a large, well-defined set of genes responds in opposing directions to different stress conditions and predict the effects of different stress combinations. This demonstrates the usefulness of our approach for exploiting public microarray data to derive biologically meaningful associations between experimental factors. Finally, our

  14. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  15. Polycistronic gene expression in Aspergillus niger.

    Science.gov (United States)

    Schuetze, Tabea; Meyer, Vera

    2017-09-25

    Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at

  16. GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

    Science.gov (United States)

    Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

    2018-01-01

    When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.

  17. A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

    Science.gov (United States)

    Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

    2015-01-01

    Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.

  18. Gene expression in cerebral ischemia: a new approach for neuroprotection.

    Science.gov (United States)

    Millán, Mónica; Arenillas, Juan

    2006-01-01

    Cerebral ischemia is one of the strongest stimuli for gene induction in the brain. Hundreds of genes have been found to be induced by brain ischemia. Many genes are involved in neurodestructive functions such as excitotoxicity, inflammatory response and neuronal apoptosis. However, cerebral ischemia is also a powerful reformatting and reprogramming stimulus for the brain through neuroprotective gene expression. Several genes may participate in both cellular responses. Thus, isolation of candidate genes for neuroprotection strategies and interpretation of expression changes have been proven difficult. Nevertheless, many studies are being carried out to improve the knowledge of the gene activation and protein expression following ischemic stroke, as well as in the development of new therapies that modify biochemical, molecular and genetic changes underlying cerebral ischemia. Owing to the complexity of the process involving numerous critical genes expressed differentially in time, space and concentration, ongoing therapeutic efforts should be based on multiple interventions at different levels. By modification of the acute gene expression induced by ischemia or the apoptotic gene program, gene therapy is a promising treatment but is still in a very experimental phase. Some hurdles will have to be overcome before these therapies can be introduced into human clinical stroke trials. Copyright 2006 S. Karger AG, Basel.

  19. Modulation of gene expression made easy

    DEFF Research Database (Denmark)

    Solem, Christian; Jensen, Peter Ruhdal

    2002-01-01

    A new approach for modulating gene expression, based on randomization of promoter (spacer) sequences, was developed. The method was applied to chromosomal genes in Lactococcus lactis and shown to generate libraries of clones with broad ranges of expression levels of target genes. In one example...... that the method can be applied to modulating the expression of native genes on the chromosome. We constructed a series of strains in which the expression of the las operon, containing the genes pfk, pyk, and ldh, was modulated by integrating a truncated copy of the pfk gene. Importantly, the modulation affected...

  20. Bayesian assignment of gene ontology terms to gene expression experiments

    Science.gov (United States)

    Sykacek, P.

    2012-01-01

    Motivation: Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. Results: This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Availability: Source code under GPL license is available from the author. Contact: peter.sykacek@boku.ac.at PMID:22962488

  1. Bayesian assignment of gene ontology terms to gene expression experiments.

    Science.gov (United States)

    Sykacek, P

    2012-09-15

    Gene expression assays allow for genome scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem of such approaches is the difficulty of interpreting the biological implication of the resulting gene lists. This lead to an increased interest in methods for inferring high-level biological information. A common approach for representing high level information is by inferring gene ontology (GO) terms which may be attributed to the expression data experiment. This article proposes a probabilistic model for GO term inference. Modelling assumes that gene annotations to GO terms are available and gene involvement in an experiment is represented by a posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches for expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that advantages of the proposed probabilistic GO term inference over statistical test-based approaches are in particular evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high level assignments like pathways. Source code under GPL license is available from the author. peter.sykacek@boku.ac.at.

  2. Synthetic promoter libraries- tuning of gene expression

    DEFF Research Database (Denmark)

    Hammer, Karin; Mijakovic, Ivan; Jensen, Peter Ruhdal

    2006-01-01

    knockout and strong overexpression. However, applications such as metabolic optimization and control analysis necessitate a continuous set of expression levels with only slight increments in strength to cover a specific window around the wildtype expression level of the studied gene; this requirement can......The study of gene function often requires changing the expression of a gene and evaluating the consequences. In principle, the expression of any given gene can be modulated in a quasi-continuum of discrete expression levels but the traditional approaches are usually limited to two extremes: gene...

  3. More powerful significant testing for time course gene expression data using functional principal component analysis approaches.

    Science.gov (United States)

    Wu, Shuang; Wu, Hulin

    2013-01-16

    One of the fundamental problems in time course gene expression data analysis is to identify genes associated with a biological process or a particular stimulus of interest, like a treatment or virus infection. Most of the existing methods for this problem are designed for data with longitudinal replicates. But in reality, many time course gene experiments have no replicates or only have a small number of independent replicates. We focus on the case without replicates and propose a new method for identifying differentially expressed genes by incorporating the functional principal component analysis (FPCA) into a hypothesis testing framework. The data-driven eigenfunctions allow a flexible and parsimonious representation of time course gene expression trajectories, leaving more degrees of freedom for the inference compared to that using a prespecified basis. Moreover, the information of all genes is borrowed for individual gene inferences. The proposed approach turns out to be more powerful in identifying time course differentially expressed genes compared to the existing methods. The improved performance is demonstrated through simulation studies and a real data application to the Saccharomyces cerevisiae cell cycle data.

  4. A synbio approach for selection of highly expressed gene variants in Gram-positive bacteria

    DEFF Research Database (Denmark)

    Ferro, Roberto; Rennig, Maja; Hernández Rollán, Cristina

    2018-01-01

    with a long history in food fermentation. We have developed a synbio approach for increasing gene expression in two Gram-positive bacteria. First of all, the gene of interest was coupled to an antibiotic resistance gene to create a growth-based selection system. We then randomised the translation initiation...... region (TIR) preceding the gene of interest and selected clones that produced high protein titres, as judged by their ability to survive on high concentrations of antibiotic. Using this approach, we were able to significantly increase production of two industrially relevant proteins; sialidase in B....... subtilis and tyrosine ammonia lyase in L. lactis. Gram-positive bacteria are widely used to produce industrial enzymes. High titres are necessary to make the production economically feasible. The synbio approach presented here is a simple and inexpensive way to increase protein titres, which can be carried...

  5. Machine learning approaches to supporting the identification of photoreceptor-enriched genes based on expression data

    Directory of Open Access Journals (Sweden)

    Simpson David

    2006-03-01

    Full Text Available Abstract Background Retinal photoreceptors are highly specialised cells, which detect light and are central to mammalian vision. Many retinal diseases occur as a result of inherited dysfunction of the rod and cone photoreceptor cells. Development and maintenance of photoreceptors requires appropriate regulation of the many genes specifically or highly expressed in these cells. Over the last decades, different experimental approaches have been developed to identify photoreceptor enriched genes. Recent progress in RNA analysis technology has generated large amounts of gene expression data relevant to retinal development. This paper assesses a machine learning methodology for supporting the identification of photoreceptor enriched genes based on expression data. Results Based on the analysis of publicly-available gene expression data from the developing mouse retina generated by serial analysis of gene expression (SAGE, this paper presents a predictive methodology comprising several in silico models for detecting key complex features and relationships encoded in the data, which may be useful to distinguish genes in terms of their functional roles. In order to understand temporal patterns of photoreceptor gene expression during retinal development, a two-way cluster analysis was firstly performed. By clustering SAGE libraries, a hierarchical tree reflecting relationships between developmental stages was obtained. By clustering SAGE tags, a more comprehensive expression profile for photoreceptor cells was revealed. To demonstrate the usefulness of machine learning-based models in predicting functional associations from the SAGE data, three supervised classification models were compared. The results indicated that a relatively simple instance-based model (KStar model performed significantly better than relatively more complex algorithms, e.g. neural networks. To deal with the problem of functional class imbalance occurring in the dataset, two data re

  6. Ratiometric Gas Reporting: A Nondisruptive Approach To Monitor Gene Expression in Soils.

    Science.gov (United States)

    Cheng, Hsiao-Ying; Masiello, Caroline A; Del Valle, Ilenne; Gao, Xiaodong; Bennett, George N; Silberg, Jonathan J

    2018-03-16

    Fluorescent proteins are ubiquitous tools that are used to monitor the dynamic functions of natural and synthetic genetic circuits. However, these visual reporters can only be used in transparent settings, a limitation that complicates nondisruptive measurements of gene expression within many matrices, such as soils and sediments. We describe a new ratiometric gas reporting method for nondisruptively monitoring gene expression within hard-to-image environmental matrices. With this approach, C 2 H 4 is continuously synthesized by ethylene forming enzyme to provide information on viable cell number, and CH 3 Br is conditionally synthesized by placing a methyl halide transferase gene under the control of a conditional promoter. We show that ratiometric gas reporting enables the creation of Escherichia coli biosensors that report on acylhomoserine lactone (AHL) autoinducers used for quorum sensing by Gram-negative bacteria. Using these biosensors, we find that an agricultural soil decreases the bioavailable concentration of a long-chain AHL up to 100-fold. We also demonstrate that these biosensors can be used in soil to nondisruptively monitor AHLs synthesized by Rhizobium leguminosarum and degraded by Bacillus thuringiensis. Finally, we show that this new reporting approach can be used in Shewanella oneidensis, a bacterium that lives in sediments.

  7. A Link-Based Cluster Ensemble Approach For Improved Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    P.Balaji

    2015-01-01

    Full Text Available Abstract It is difficult from possibilities to select a most suitable effective way of clustering algorithm and its dataset for a defined set of gene expression data because we have a huge number of ways and huge number of gene expressions. At present many researchers are preferring to use hierarchical clustering in different forms this is no more totally optimal. Cluster ensemble research can solve this type of problem by automatically merging multiple data partitions from a wide range of different clusterings of any dimensions to improve both the quality and robustness of the clustering result. But we have many existing ensemble approaches using an association matrix to condense sample-cluster and co-occurrence statistics and relations within the ensemble are encapsulated only at raw level while the existing among clusters are totally discriminated. Finding these missing associations can greatly expand the capability of those ensemble methodologies for microarray data clustering. We propose general K-means cluster ensemble approach for the clustering of general categorical data into required number of partitions.

  8. Classification of gene expression data: A hubness-aware semi-supervised approach.

    Science.gov (United States)

    Buza, Krisztian

    2016-04-01

    Classification of gene expression data is the common denominator of various biomedical recognition tasks. However, obtaining class labels for large training samples may be difficult or even impossible in many cases. Therefore, semi-supervised classification techniques are required as semi-supervised classifiers take advantage of unlabeled data. Gene expression data is high-dimensional which gives rise to the phenomena known under the umbrella of the curse of dimensionality, one of its recently explored aspects being the presence of hubs or hubness for short. Therefore, hubness-aware classifiers have been developed recently, such as Naive Hubness-Bayesian k-Nearest Neighbor (NHBNN). In this paper, we propose a semi-supervised extension of NHBNN which follows the self-training schema. As one of the core components of self-training is the certainty score, we propose a new hubness-aware certainty score. We performed experiments on publicly available gene expression data. These experiments show that the proposed classifier outperforms its competitors. We investigated the impact of each of the components (classification algorithm, semi-supervised technique, hubness-aware certainty score) separately and showed that each of these components are relevant to the performance of the proposed approach. Our results imply that our approach may increase classification accuracy and reduce computational costs (i.e., runtime). Based on the promising results presented in the paper, we envision that hubness-aware techniques will be used in various other biomedical machine learning tasks. In order to accelerate this process, we made an implementation of hubness-aware machine learning techniques publicly available in the PyHubs software package (http://www.biointelligence.hu/pyhubs) implemented in Python, one of the most popular programming languages of data science. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  9. Positron emission tomography imaging of gene expression

    International Nuclear Information System (INIS)

    Tang Ganghua

    2001-01-01

    The merging of molecular biology and nuclear medicine is developed into molecular nuclear medicine. Positron emission tomography (PET) of gene expression in molecular nuclear medicine has become an attractive area. Positron emission tomography imaging gene expression includes the antisense PET imaging and the reporter gene PET imaging. It is likely that the antisense PET imaging will lag behind the reporter gene PET imaging because of the numerous issues that have not yet to be resolved with this approach. The reporter gene PET imaging has wide application into animal experimental research and human applications of this approach will likely be reported soon

  10. Analysis of multiplex gene expression maps obtained by voxelation

    Directory of Open Access Journals (Sweden)

    Smith Desmond J

    2009-04-01

    Full Text Available Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. Results To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in

  11. Analysis of multiplex gene expression maps obtained by voxelation.

    Science.gov (United States)

    An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

    2009-04-29

    Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental

  12. Positron emission tomography and gene therapy: basic concepts and experimental approaches for in vivo gene expression imaging.

    Science.gov (United States)

    Peñuelas, Iván; Boán, JoséF; Martí-Climent, Josep M; Sangro, Bruno; Mazzolini, Guillermo; Prieto, Jesús; Richter, José A

    2004-01-01

    More than two decades of intense research have allowed gene therapy to move from the laboratory to the clinical setting, where its use for the treatment of human pathologies has been considerably increased in the last years. However, many crucial questions remain to be solved in this challenging field. In vivo imaging with positron emission tomography (PET) by combination of the appropriate PET reporter gene and PET reporter probe could provide invaluable qualitative and quantitative information to answer multiple unsolved questions about gene therapy. PET imaging could be used to define parameters not available by other techniques that are of substantial interest not only for the proper understanding of the gene therapy process, but also for its future development and clinical application in humans. This review focuses on the molecular biology basis of gene therapy and molecular imaging, describing the fundamentals of in vivo gene expression imaging by PET, and the application of PET to gene therapy, as a technology that can be used in many different ways. It could be applied to avoid invasive procedures for gene therapy monitoring; accurately diagnose the pathology for better planning of the most adequate therapeutic approach; as treatment evaluation to image the functional effects of gene therapy at the biochemical level; as a quantitative noninvasive way to monitor the location, magnitude and persistence of gene expression over time; and would also help to a better understanding of vector biology and pharmacology devoted to the development of safer and more efficient vectors.

  13. Experimental and Modeling Approaches for Understanding the Effect of Gene Expression Noise in Biological Development

    Directory of Open Access Journals (Sweden)

    David M. Holloway

    2018-04-01

    Full Text Available Biological development involves numerous chemical and physical processes which must act in concert to reliably produce a cell, a tissue, or a body. To be successful, the developing organism must be robust to variability at many levels, such as the environment (e.g., temperature, moisture, upstream information (such as long-range positional information gradients, or intrinsic noise due to the stochastic nature of low concentration chemical kinetics. The latter is especially relevant to the regulation of gene expression in cell differentiation. The temporal stochasticity of gene expression has been studied in single celled organisms for nearly two decades, but only recently have techniques become available to gather temporally-resolved data across spatially-distributed gene expression patterns in developing multicellular organisms. These demonstrate temporal noisy “bursting” in the number of gene transcripts per cell, raising the question of how the transcript number defining a particular cell type is produced, such that one cell type can reliably be distinguished from a neighboring cell of different type along a tissue boundary. Stochastic spatio-temporal modeling of tissue-wide expression patterns can identify signatures for specific types of gene regulation, which can be used to extract regulatory mechanism information from experimental time series. This Perspective focuses on using this type of approach to study gene expression noise during the anterior-posterior segmentation of the fruit fly embryo. Advances in experimental and theoretical techniques will lead to an increasing quantification of expression noise that can be used to understand how regulatory mechanisms contribute to embryonic robustness across a range of developmental processes.

  14. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes

    DEFF Research Database (Denmark)

    de Jong, Simone; Boks, Marco P M; Fuller, Tova F

    2012-01-01

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood...... of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co......, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes...

  15. A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.

    Directory of Open Access Journals (Sweden)

    Mohammad Manir Hossain Mollah

    Full Text Available Identifying genes that are differentially expressed (DE between two or more conditions with multiple patterns of expression is one of the primary objectives of gene expression data analysis. Several statistical approaches, including one-way analysis of variance (ANOVA, are used to identify DE genes. However, most of these methods provide misleading results for two or more conditions with multiple patterns of expression in the presence of outlying genes. In this paper, an attempt is made to develop a hybrid one-way ANOVA approach that unifies the robustness and efficiency of estimation using the minimum β-divergence method to overcome some problems that arise in the existing robust methods for both small- and large-sample cases with multiple patterns of expression.The proposed method relies on a β-weight function, which produces values between 0 and 1. The β-weight function with β = 0.2 is used as a measure of outlier detection. It assigns smaller weights (≥ 0 to outlying expressions and larger weights (≤ 1 to typical expressions. The distribution of the β-weights is used to calculate the cut-off point, which is compared to the observed β-weight of an expression to determine whether that gene expression is an outlier. This weight function plays a key role in unifying the robustness and efficiency of estimation in one-way ANOVA.Analyses of simulated gene expression profiles revealed that all eight methods (ANOVA, SAM, LIMMA, EBarrays, eLNN, KW, robust BetaEB and proposed perform almost identically for m = 2 conditions in the absence of outliers. However, the robust BetaEB method and the proposed method exhibited considerably better performance than the other six methods in the presence of outliers. In this case, the BetaEB method exhibited slightly better performance than the proposed method for the small-sample cases, but the the proposed method exhibited much better performance than the BetaEB method for both the small- and large

  16. Shrinkage Approach for Gene Expression Data Analysis

    Czech Academy of Sciences Publication Activity Database

    Haman, Jiří; Valenta, Zdeněk; Kalina, Jan

    2013-01-01

    Roč. 1, č. 1 (2013), s. 65-65 ISSN 1805-8698. [EFMI 2013 Special Topic Conference. 17.04.2013-19.04.2013, Prague] Institutional support: RVO:67985807 Keywords : shrinkage estimation * covariance matrix * high dimensional data * gene expression Subject RIV: IN - Informatics, Computer Science

  17. Gene expression inference with deep learning.

    Science.gov (United States)

    Chen, Yifei; Li, Yi; Narayan, Rajiv; Subramanian, Aravind; Xie, Xiaohui

    2016-06-15

    Large-scale gene expression profiling has been widely used to characterize cellular states in response to various disease conditions, genetic perturbations, etc. Although the cost of whole-genome expression profiles has been dropping steadily, generating a compendium of expression profiling over thousands of samples is still very expensive. Recognizing that gene expressions are often highly correlated, researchers from the NIH LINCS program have developed a cost-effective strategy of profiling only ∼1000 carefully selected landmark genes and relying on computational methods to infer the expression of remaining target genes. However, the computational approach adopted by the LINCS program is currently based on linear regression (LR), limiting its accuracy since it does not capture complex nonlinear relationship between expressions of genes. We present a deep learning method (abbreviated as D-GEX) to infer the expression of target genes from the expression of landmark genes. We used the microarray-based Gene Expression Omnibus dataset, consisting of 111K expression profiles, to train our model and compare its performance to those from other methods. In terms of mean absolute error averaged across all genes, deep learning significantly outperforms LR with 15.33% relative improvement. A gene-wise comparative analysis shows that deep learning achieves lower error than LR in 99.97% of the target genes. We also tested the performance of our learned model on an independent RNA-Seq-based GTEx dataset, which consists of 2921 expression profiles. Deep learning still outperforms LR with 6.57% relative improvement, and achieves lower error in 81.31% of the target genes. D-GEX is available at https://github.com/uci-cbcl/D-GEX CONTACT: xhx@ics.uci.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  19. A transgenic approach to study argininosuccinate synthetase gene expression

    Science.gov (United States)

    2014-01-01

    Background Argininosuccinate synthetase (ASS) participates in urea, nitric oxide and arginine production. Besides transcriptional regulation, a post-transcriptional regulation affecting nuclear precursor RNA stability has been reported. To study whether such post-transcriptional regulation underlines particular temporal and spatial ASS expression, and to investigate how human ASS gene behaves in a mouse background, a transgenic mouse system using a modified bacterial artificial chromosome carrying the human ASS gene tagged with EGFP was employed. Results Two lines of ASS-EGFP transgenic mice were generated: one with EGFP under transcriptional control similar to that of the endogenous ASS gene, another with EGFP under both transcriptional and post-transcriptional regulation as that of the endogenous ASS mRNA. EGFP expression in the liver, the organ for urea production, and in the intestine and kidney that are responsible for arginine biosynthesis, was examined. Organs taken from embryos E14.5 stage to young adult were examined under a fluorescence microscope either directly or after cryosectioning. The levels of EGFP and endogenous mouse Ass mRNAs were also quantified by S1 nuclease mapping. EGFP fluorescence and EGFP mRNA levels in both the liver and kidney were found to increase progressively from embryonic stage toward birth. In contrast, EGFP expression in the intestine was higher in neonates and started to decline at about 3 weeks after birth. Comparison between the EGFP profiles of the two transgenic lines indicated the developmental and tissue-specific regulation was mainly controlled at the transcriptional level. The ASS transgene was of human origin. EGFP expression in the liver followed essentially the mouse Ass pattern as evidenced by zonation distribution of fluorescence and the level of EGFP mRNA at birth. However, in the small intestine, Ass mRNA level declined sharply at 3 week of age, and yet substantial EGFP mRNA was still detectable at this stage

  20. Backward-stochastic-differential-equation approach to modeling of gene expression.

    Science.gov (United States)

    Shamarova, Evelina; Chertovskih, Roman; Ramos, Alexandre F; Aguiar, Paulo

    2017-03-01

    In this article, we introduce a backward method to model stochastic gene expression and protein-level dynamics. The protein amount is regarded as a diffusion process and is described by a backward stochastic differential equation (BSDE). Unlike many other SDE techniques proposed in the literature, the BSDE method is backward in time; that is, instead of initial conditions it requires the specification of end-point ("final") conditions, in addition to the model parametrization. To validate our approach we employ Gillespie's stochastic simulation algorithm (SSA) to generate (forward) benchmark data, according to predefined gene network models. Numerical simulations show that the BSDE method is able to correctly infer the protein-level distributions that preceded a known final condition, obtained originally from the forward SSA. This makes the BSDE method a powerful systems biology tool for time-reversed simulations, allowing, for example, the assessment of the biological conditions (e.g., protein concentrations) that preceded an experimentally measured event of interest (e.g., mitosis, apoptosis, etc.).

  1. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  2. Functional clustering of time series gene expression data by Granger causality

    Science.gov (United States)

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  3. Inferring gene expression dynamics via functional regression analysis

    Directory of Open Access Journals (Sweden)

    Leng Xiaoyan

    2008-01-01

    Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.

  4. A synbio approach for selection of highly expressed gene variants in Gram-positive bacteria.

    Science.gov (United States)

    Ferro, Roberto; Rennig, Maja; Hernández-Rollán, Cristina; Daley, Daniel O; Nørholm, Morten H H

    2018-03-08

    The market for recombinant proteins is on the rise, and Gram-positive strains are widely exploited for this purpose. Bacillus subtilis is a profitable host for protein production thanks to its ability to secrete large amounts of proteins, and Lactococcus lactis is an attractive production organism with a long history in food fermentation. We have developed a synbio approach for increasing gene expression in two Gram-positive bacteria. First of all, the gene of interest was coupled to an antibiotic resistance gene to create a growth-based selection system. We then randomised the translation initiation region (TIR) preceding the gene of interest and selected clones that produced high protein titres, as judged by their ability to survive on high concentrations of antibiotic. Using this approach, we were able to significantly increase production of two industrially relevant proteins; sialidase in B. subtilis and tyrosine ammonia lyase in L. lactis. Gram-positive bacteria are widely used to produce industrial enzymes. High titres are necessary to make the production economically feasible. The synbio approach presented here is a simple and inexpensive way to increase protein titres, which can be carried out in any laboratory within a few days. It could also be implemented as a tool for applications beyond TIR libraries, such as screening of synthetic, homologous or domain-shuffled genes.

  5. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  6. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  7. A re-assessment of gene-tag classification approaches for describing var gene expression patterns during human Plasmodium falciparum malaria parasite infections.

    Science.gov (United States)

    Githinji, George; Bull, Peter C

    2017-01-01

    PfEMP1 are variant parasite antigens that are inserted on the surface of Plasmodium falciparum infected erythrocytes (IE). Through interactions with various host molecules, PfEMP1 mediate IE sequestration in tissues and play a key role in the pathology of severe malaria. PfEMP1 is encoded by a diverse multi-gene family called var . Previous studies have shown that that expression of specific subsets of var genes are associated with low levels of host immunity and severe malaria. However, in most clinical studies to date, full-length var gene sequences were unavailable and various approaches have been used to make comparisons between var gene expression profiles in different parasite isolates using limited information. Several studies have relied on the classification of a 300 - 500 base-pair "DBLα tag" region in the DBLα domain located at the 5' end of most var genes. We assessed the relationship between various DBLα tag classification methods, and sequence features that are only fully assessable through full-length var gene sequences. We compared these different sequence features in full-length var gene from six fully sequenced laboratory isolates. These comparisons show that despite a long history of recombination,   DBLα sequence tag classification can provide functional information on important features of full-length var genes. Notably, a specific subset of DBLα tags previously defined as "group A-like" is associated with CIDRα1 domains proposed to bind to endothelial protein C receptor. This analysis helps to bring together different sources of data that have been used to assess var gene expression in clinical parasite isolates.

  8. Gene expression

    International Nuclear Information System (INIS)

    Hildebrand, C.E.; Crawford, B.D.; Walters, R.A.; Enger, M.D.

    1983-01-01

    We prepared probes for isolating functional pieces of the metallothionein locus. The probes enabled a variety of experiments, eventually revealing two mechanisms for metallothionein gene expression, the order of the DNA coding units at the locus, and the location of the gene site in its chromosome. Once the switch regulating metallothionein synthesis was located, it could be joined by recombinant DNA methods to other, unrelated genes, then reintroduced into cells by gene-transfer techniques. The expression of these recombinant genes could then be induced by exposing the cells to Zn 2+ or Cd 2+ . We would thus take advantage of the clearly defined switching properties of the metallothionein gene to manipulate the expression of other, perhaps normally constitutive, genes. Already, despite an incomplete understanding of how the regulatory switch of the metallothionein locus operates, such experiments have been performed successfully

  9. Simple Comparative Analyses of Differentially Expressed Gene Lists May Overestimate Gene Overlap.

    Science.gov (United States)

    Lawhorn, Chelsea M; Schomaker, Rachel; Rowell, Jonathan T; Rueppell, Olav

    2018-04-16

    Comparing the overlap between sets of differentially expressed genes (DEGs) within or between transcriptome studies is regularly used to infer similarities between biological processes. Significant overlap between two sets of DEGs is usually determined by a simple test. The number of potentially overlapping genes is compared to the number of genes that actually occur in both lists, treating every gene as equal. However, gene expression is controlled by transcription factors that bind to a variable number of transcription factor binding sites, leading to variation among genes in general variability of their expression. Neglecting this variability could therefore lead to inflated estimates of significant overlap between DEG lists. With computer simulations, we demonstrate that such biases arise from variation in the control of gene expression. Significant overlap commonly arises between two lists of DEGs that are randomly generated, assuming that the control of gene expression is variable among genes but consistent between corresponding experiments. More overlap is observed when transcription factors are specific to their binding sites and when the number of genes is considerably higher than the number of different transcription factors. In contrast, overlap between two DEG lists is always lower than expected when the genetic architecture of expression is independent between the two experiments. Thus, the current methods for determining significant overlap between DEGs are potentially confounding biologically meaningful overlap with overlap that arises due to variability in control of expression among genes, and more sophisticated approaches are needed.

  10. The approaches to mathematical modeling of recA, umuD genes expression in bacteria Escherichia coli after UV-irradiation

    International Nuclear Information System (INIS)

    Belov, O.V.

    2006-01-01

    The modern data of recA, umuD genes expression of the system of SOS-repair at classical object of radiation genetic researches - bacteria Escherichia coli, after ultraviolet irradiation are presented. Essentially a new method of analysis of SOS-genes expression is considered. It was shown that using this method it is possible to determine the character of induction of some SOS-genes more precisely. The possible approach to the mathematical description of SOS-response of cells by construction of the system of the differential equations is presented

  11. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  12. A new approach to enhance the performance of decision tree for classifying gene expression data.

    Science.gov (United States)

    Hassan, Md; Kotagiri, Ramamohanarao

    2013-12-20

    Gene expression data classification is a challenging task due to the large dimensionality and very small number of samples. Decision tree is one of the popular machine learning approaches to address such classification problems. However, the existing decision tree algorithms use a single gene feature at each node to split the data into its child nodes and hence might suffer from poor performance specially when classifying gene expression dataset. By using a new decision tree algorithm where, each node of the tree consists of more than one gene, we enhance the classification performance of traditional decision tree classifiers. Our method selects suitable genes that are combined using a linear function to form a derived composite feature. To determine the structure of the tree we use the area under the Receiver Operating Characteristics curve (AUC). Experimental analysis demonstrates higher classification accuracy using the new decision tree compared to the other existing decision trees in literature. We experimentally compare the effect of our scheme against other well known decision tree techniques. Experiments show that our algorithm can substantially boost the classification performance of the decision tree.

  13. System Biology Approach: Gene Network Analysis for Muscular Dystrophy.

    Science.gov (United States)

    Censi, Federica; Calcagnini, Giovanni; Mattei, Eugenio; Giuliani, Alessandro

    2018-01-01

    Phenotypic changes at different organization levels from cell to entire organism are associated to changes in the pattern of gene expression. These changes involve the entire genome expression pattern and heavily rely upon correlation patterns among genes. The classical approach used to analyze gene expression data builds upon the application of supervised statistical techniques to detect genes differentially expressed among two or more phenotypes (e.g., normal vs. disease). The use of an a posteriori, unsupervised approach based on principal component analysis (PCA) and the subsequent construction of gene correlation networks can shed a light on unexpected behaviour of gene regulation system while maintaining a more naturalistic view on the studied system.In this chapter we applied an unsupervised method to discriminate DMD patient and controls. The genes having the highest absolute scores in the discrimination between the groups were then analyzed in terms of gene expression networks, on the basis of their mutual correlation in the two groups. The correlation network structures suggest two different modes of gene regulation in the two groups, reminiscent of important aspects of DMD pathogenesis.

  14. Neighboring Genes Show Correlated Evolution in Gene Expression

    Science.gov (United States)

    Ghanbarian, Avazeh T.; Hurst, Laurence D.

    2015-01-01

    When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking. PMID:25743543

  15. The rules of gene expression in plants: Organ identity and gene body methylation are key factors for regulation of gene expression in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Gutiérrez Rodrigo A

    2008-09-01

    Full Text Available Abstract Background Microarray technology is a widely used approach for monitoring genome-wide gene expression. For Arabidopsis, there are over 1,800 microarray hybridizations representing many different experimental conditions on Affymetrix™ ATH1 gene chips alone. This huge amount of data offers a unique opportunity to infer the principles that govern the regulation of gene expression in plants. Results We used bioinformatics methods to analyze publicly available data obtained using the ATH1 chip from Affymetrix. A total of 1887 ATH1 hybridizations were normalized and filtered to eliminate low-quality hybridizations. We classified and compared control and treatment hybridizations and determined differential gene expression. The largest differences in gene expression were observed when comparing samples obtained from different organs. On average, ten-fold more genes were differentially expressed between organs as compared to any other experimental variable. We defined "gene responsiveness" as the number of comparisons in which a gene changed its expression significantly. We defined genes with the highest and lowest responsiveness levels as hypervariable and housekeeping genes, respectively. Remarkably, housekeeping genes were best distinguished from hypervariable genes by differences in methylation status in their transcribed regions. Moreover, methylation in the transcribed region was inversely correlated (R2 = 0.8 with gene responsiveness on a genome-wide scale. We provide an example of this negative relationship using genes encoding TCA cycle enzymes, by contrasting their regulatory responsiveness to nitrate and methylation status in their transcribed regions. Conclusion Our results indicate that the Arabidopsis transcriptome is largely established during development and is comparatively stable when faced with external perturbations. We suggest a novel functional role for DNA methylation in the transcribed region as a key determinant

  16. Gene Expression Commons: an open platform for absolute gene expression profiling.

    Directory of Open Access Journals (Sweden)

    Jun Seita

    Full Text Available Gene expression profiling using microarrays has been limited to comparisons of gene expression between small numbers of samples within individual experiments. However, the unknown and variable sensitivities of each probeset have rendered the absolute expression of any given gene nearly impossible to estimate. We have overcome this limitation by using a very large number (>10,000 of varied microarray data as a common reference, so that statistical attributes of each probeset, such as the dynamic range and threshold between low and high expression, can be reliably discovered through meta-analysis. This strategy is implemented in a web-based platform named "Gene Expression Commons" (https://gexc.stanford.edu/ which contains data of 39 distinct highly purified mouse hematopoietic stem/progenitor/differentiated cell populations covering almost the entire hematopoietic system. Since the Gene Expression Commons is designed as an open platform, investigators can explore the expression level of any gene, search by expression patterns of interest, submit their own microarray data, and design their own working models representing biological relationship among samples.

  17. Identification and Cloning of Differentially Expressed SOUL and ELIP Genes in Saffron Stigmas Using a Subtractive Hybridization Approach.

    Directory of Open Access Journals (Sweden)

    Oussama Ahrazem

    Full Text Available Using a subtractive hybridization approach, differentially expressed genes involved in the light response in saffron stigmas were identified. Twenty-two differentially expressed transcript-derived fragments were cloned and sequenced. Two of them were highly induced by light and had sequence similarity to early inducible proteins (ELIP and SOUL heme-binding proteins. Using these sequences, we searched for other family members expressed in saffron stigma. ELIP and SOUL are represented by small gene families in saffron, with four and five members, respectively. The expression of these genes was analyzed during the development of the stigma and in light and dark conditions. ELIP transcripts were detected in all the developmental stages showing much higher expression levels in the developed stigmas of saffron and all were up-regulated by light but at different levels. By contrast, only one SOUL gene was up-regulated by light and was highly expressed in the stigma at anthesis. Both the ELIP and SOUL genes induced by light in saffron stigmas might be associated with the structural changes affecting the chromoplast of the stigma, as a result of light exposure, which promotes the development and increases the number of plastoglobules, specialized in the recruitment of specific proteins, which enables them to act in metabolite synthesis and disposal under changing environmental conditions and developmental stages.

  18. Validation of commonly used reference genes for sleep-related gene expression studies

    Directory of Open Access Journals (Sweden)

    Castro Rosa MRPS

    2009-05-01

    Full Text Available Abstract Background Sleep is a restorative process and is essential for maintenance of mental and physical health. In an attempt to understand the complexity of sleep, multidisciplinary strategies, including genetic approaches, have been applied to sleep research. Although quantitative real time PCR has been used in previous sleep-related gene expression studies, proper validation of reference genes is currently lacking. Thus, we examined the effect of total or paradoxical sleep deprivation (TSD or PSD on the expression stability of the following frequently used reference genes in brain and blood: beta-actin (b-actin, beta-2-microglobulin (B2M, glyceraldehyde-3-phosphate dehydrogenase (GAPDH, and hypoxanthine guanine phosphoribosyl transferase (HPRT. Results Neither TSD nor PSD affected the expression stability of all tested genes in both tissues indicating that b-actin, B2M, GAPDH and HPRT are appropriate reference genes for the sleep-related gene expression studies. In order to further verify these results, the relative expression of brain derived neurotrophic factor (BDNF and glycerol-3-phosphate dehydrogenase1 (GPD1 was evaluated in brain and blood, respectively. The normalization with each of four reference genes produced similar pattern of expression in control and sleep deprived rats, but subtle differences in the magnitude of expression fold change were observed which might affect the statistical significance. Conclusion This study demonstrated that sleep deprivation does not alter the expression stability of commonly used reference genes in brain and blood. Nonetheless, the use of multiple reference genes in quantitative RT-PCR is required for the accurate results.

  19. Variation-preserving normalization unveils blind spots in gene expression profiling

    Science.gov (United States)

    Roca, Carlos P.; Gomes, Susana I. L.; Amorim, Mónica J. B.; Scott-Fordsmand, Janeck J.

    2017-01-01

    RNA-Seq and gene expression microarrays provide comprehensive profiles of gene activity, but lack of reproducibility has hindered their application. A key challenge in the data analysis is the normalization of gene expression levels, which is currently performed following the implicit assumption that most genes are not differentially expressed. Here, we present a mathematical approach to normalization that makes no assumption of this sort. We have found that variation in gene expression is much larger than currently believed, and that it can be measured with available assays. Our results also explain, at least partially, the reproducibility problems encountered in transcriptomics studies. We expect that this improvement in detection will help efforts to realize the full potential of gene expression profiling, especially in analyses of cellular processes involving complex modulations of gene expression. PMID:28276435

  20. Multiscale Embedded Gene Co-expression Network Analysis.

    Directory of Open Access Journals (Sweden)

    Won-Min Song

    2015-11-01

    Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  1. Multiscale Embedded Gene Co-expression Network Analysis.

    Science.gov (United States)

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  2. SIGNATURE: A workbench for gene expression signature analysis

    Directory of Open Access Journals (Sweden)

    Chang Jeffrey T

    2011-11-01

    Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.

  3. Expression of streptavidin gene in bacteria and plants

    International Nuclear Information System (INIS)

    Guan, Xueni; Wurtele, E.S.; Nikolau, B.J.

    1990-01-01

    Six biotin-containing proteins are present in plants, representing at least four different biotin enzymes. The physiological function of these biotin enzymes is not understood. Streptavidin, a protein from Streptomyces avidinii, binds tightly and specifically to biotin causing inactivation of biotin enzymes. One approach to elucidating the physiological function of biotin enzymes in plant metabolism is to create transgenic plants expressing the streptavidin gene. A plasmid containing a fused streptavidin-beta-galactosidase gene has been expressed in E. coli. We also have constructed various fusion genes that include an altered CaMV 35S promoter, signal peptides to target the streptavidin protein to specific organelles, and the streptavidin coding gene. We are examining the expression of these genes in cells of carrot

  4. Reference gene validation for gene expression normalization in canine osteosarcoma : a geNorm algorithm approach

    NARCIS (Netherlands)

    Selvarajah, G.T.; Bonestroo, F.A.S.; Timmermans Sprang, E.P.M.; Kirpensteijn, J.|info:eu-repo/dai/nl/189846992; Mol, J.A.|info:eu-repo/dai/nl/070918775

    2017-01-01

    Background Quantitative PCR (qPCR) is a common method for quantifying mRNA expression. Given the heterogeneity present in tumor tissues, it is crucial to normalize target mRNA expression data using appropriate reference genes that are stably expressed under a variety of pathological and experimental

  5. Imaging gene expression in gene therapy

    International Nuclear Information System (INIS)

    Wiebe, Leonard I.

    1997-01-01

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on 'suicide gene therapy' of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k + ) has been use for 'suicide' in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k + gene expression where the H S V-1 t k + gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([ 18 F]F H P G; [ 18 F]-A C V), and pyrimidine- ([ 123 / 131 I]I V R F U; [ 124 / 131I ]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [ 123 / 131I ]I V R F U imaging with the H S V-1 t k + reporter gene will be presented

  6. Changes in gene expression during male meiosis in Petunia hybrida.

    Science.gov (United States)

    Cnudde, Filip; Hedatale, Veena; de Jong, Hans; Pierson, Elisabeth S; Rainey, Daphne Y; Zabeau, Marc; Weterings, Koen; Gerats, Tom; Peters, Janny L

    2006-01-01

    We analyzed changes in gene expression during male meiosis in Petunia by combining the meiotic staging of pollen mother cells from a single anther with cDNA-AFLP transcript profiling of mRNA from the synchronously developing sister anthers. The transcript profiling experiments focused on the identification of genes with a modulated expression profile during meiosis, while premeiotic archesporial cells and postmeiotic microspores served as a reference. About 8000 transcript tags, estimated at 30% of the total transcriptome, were generated, of which around 6% exhibited a modulated gene expression pattern at meiosis. Cluster analysis revealed a transcriptional cascade that coincides with the initiation and progression through all stages of the two meiotic divisions. Fragments that exhibited high expression specifically during meiosis I were characterized further by sequencing; 90 out of the 293 sequenced fragments showed homology with known genes, belonging to a wide range of gene classes, including previously characterized meiotic genes. In-situ hybridization experiments were performed to determine the spatial expression pattern for five selected transcript tags. Its concurrence with cDNA-AFLP transcript profiles indicates that this is an excellent approach to study genes involved in specialized processes such as meiosis. Our data set provides the potential to unravel unique meiotic genes that are as yet elusive to reverse genetics approaches.

  7. Comparative genomics of the relationship between gene structure and expression

    NARCIS (Netherlands)

    Ren, X.

    2006-01-01

    The relationship between the structure of genes and their expression is a relatively new aspect of genome organization and regulation. With more genome sequences and expression data becoming available, bioinformatics approaches can help the further elucidation of the relationships between gene

  8. Imaging gene expression in gene therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, Leonard I. [Alberta Univ., Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-12-31

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on `suicide gene therapy` of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k{sup +}) has been use for `suicide` in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k{sup +} gene expression where the H S V-1 t k{sup +} gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([{sup 18} F]F H P G; [{sup 18} F]-A C V), and pyrimidine- ([{sup 123}/{sup 131} I]I V R F U; [{sup 124}/{sup 131I}]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [{sup 123}/{sup 131I}]I V R F U imaging with the H S V-1 t k{sup +} reporter gene will be presented

  9. Radiation-modulated gene expression in C. elegans

    International Nuclear Information System (INIS)

    Nelson, G.A.; Bayeta, E.; Perez, C.; Lloyd, E.; Jones, T.; Smith, A.; Tian, J.

    2003-01-01

    Full text: We use the nematode C. elegans to characterize the genotoxic and cytotoxic effects of ionizing radiation with emphasis effects of charged particle radiation and have described the fluence vs. response relationships for mutation, chromosome aberration and certain developmental errors. These endpoints quantify the biological after repair and compensation pathways have completed their work. In order to address the control of these reactions we have turned to gene expression profiling to identify genes that uniquely respond to high LET species or respond differentially as a function of radiation properties. We have employed whole genome microarray methods to map gene expression following exposure to gamma rays, protons and accelerated iron ions. We found that 599 of 17871 genes analyzed showed differential expression 3 hrs after exposure to 3 Gy of at least one radiation types. 193 were up-regulated, 406 were down-regulated, and 90% were affected by only one species of radiation. Genes whose transcription levels responded significantly mapped to definite statistical clusters that were unique for each radiation type. We are now trying to establish the functional relationships of the genes their relevance to mitigation of radiation-induced damage. Three approaches are being used. First, bioinformatics tools are being used to determine the roles of genes in co-regulated gene sets. Second, we are applying the technique of RNA interference to determine whether our radiation-induced genes affect cell survival (measured in terms of embryo survival) and chromosome aberration (intestinal anaphase bridges). Finally we are focussing on the response of the most strongly-regulated gene in our data set. This is the autosomal gene, F36D3.9, whose predicted structure is that of a cysteine protease resembling cathepsin B. An enzymological approach is being used to characterize this gene at the protein level. This work was supported by NASA Cooperative Agreement NCC9-149

  10. Genetic Approaches to Study Meiosis and Meiosis-Specific Gene Expression in Saccharomyces cerevisiae.

    Science.gov (United States)

    Kassir, Yona; Stuart, David T

    2017-01-01

    The budding yeast Saccharomyces cerevisiae has a long history as a model organism for studies of meiosis and the cell cycle. The popularity of this yeast as a model is in large part due to the variety of genetic and cytological approaches that can be effectively performed with the cells. Cultures of the cells can be induced to synchronously progress through meiosis and sporulation allowing large-scale gene expression and biochemical studies to be performed. Additionally, the spore tetrads resulting from meiosis make it possible to characterize the haploid products of meiosis allowing investigation of meiotic recombination and chromosome segregation. Here we describe genetic methods for analysis progression of S. cerevisiae through meiosis and sporulation with an emphasis on strategies for the genetic analysis of regulators of meiosis-specific genes.

  11. Snapshot of the eukaryotic gene expression in muskoxen rumen--a metatranscriptomic approach.

    Directory of Open Access Journals (Sweden)

    Meng Qi

    Full Text Available BACKGROUND: Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. METHODOLOGY/PRINCIPAL FINDINGS: In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus, with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6, GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. CONCLUSIONS/SIGNIFICANCE: The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes.

  12. Snapshot of the Eukaryotic Gene Expression in Muskoxen Rumen—A Metatranscriptomic Approach

    Science.gov (United States)

    O'Toole, Nicholas; Barboza, Perry S.; Ungerfeld, Emilio; Leigh, Mary Beth; Selinger, L. Brent; Butler, Greg; Tsang, Adrian; McAllister, Tim A.; Forster, Robert J.

    2011-01-01

    Background Herbivores rely on digestive tract lignocellulolytic microorganisms, including bacteria, fungi and protozoa, to derive energy and carbon from plant cell wall polysaccharides. Culture independent metagenomic studies have been used to reveal the genetic content of the bacterial species within gut microbiomes. However, the nature of the genes encoded by eukaryotic protozoa and fungi within these environments has not been explored using metagenomic or metatranscriptomic approaches. Methodology/Principal Findings In this study, a metatranscriptomic approach was used to investigate the functional diversity of the eukaryotic microorganisms within the rumen of muskoxen (Ovibos moschatus), with a focus on plant cell wall degrading enzymes. Polyadenylated RNA (mRNA) was sequenced on the Illumina Genome Analyzer II system and 2.8 gigabases of sequences were obtained and 59129 contigs assembled. Plant cell wall degrading enzyme modules including glycoside hydrolases, carbohydrate esterases and polysaccharide lyases were identified from over 2500 contigs. These included a number of glycoside hydrolase family 6 (GH6), GH48 and swollenin modules, which have rarely been described in previous gut metagenomic studies. Conclusions/Significance The muskoxen rumen metatranscriptome demonstrates a much higher percentage of cellulase enzyme discovery and an 8.7x higher rate of total carbohydrate active enzyme discovery per gigabase of sequence than previous rumen metagenomes. This study provides a snapshot of eukaryotic gene expression in the muskoxen rumen, and identifies a number of candidate genes coding for potentially valuable lignocellulolytic enzymes. PMID:21655220

  13. Codon usage and amino acid usage influence genes expression level.

    Science.gov (United States)

    Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

    2018-02-01

    Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.

  14. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  15. Gene Expression Profiling in Fish Toxicology: A Review.

    Science.gov (United States)

    Kumar, Girish; Denslow, Nancy D

    In this review, we present an overview of transcriptomic responses to chemical exposures in a variety of fish species. We have discussed the use of several molecular approaches such as northern blotting, differential display reverse transcription-polymerase chain reaction (DDRT-PCR), suppression subtractive hybridization (SSH), real time quantitative PCR (RT-qPCR), microarrays, and next-generation sequencing (NGS) for measuring gene expression. These techniques have been mainly used to measure the toxic effects of single compounds or simple mixtures in laboratory conditions. In addition, only few studies have been conducted to examine the biological significance of differentially expressed gene sets following chemical exposure. Therefore, future studies should focus more under field conditions using a multidisciplinary approach (genomics, proteomics and metabolomics) to understand the synergetic effects of multiple environmental stressors and to determine the functional significance of differentially expressed genes. Nevertheless, recent developments in NGS technologies and decreasing costs of sequencing holds the promise to uncover the complexity of anthropogenic impacts and biological effects in wild fish populations.

  16. Gene expression profiling of chicken intestinal host responses

    NARCIS (Netherlands)

    Hemert, van S.

    2007-01-01

    Chicken lines differ in genetic disease susceptibility. The scope of the research described in this thesis was to identify genes involved in genetic disease resistance in the chicken intestine. Therefore gene expression in the jejunum was investigated using a microarray approach. An intestine

  17. Extracting biologically significant patterns from short time series gene expression data

    Directory of Open Access Journals (Sweden)

    McGinnis Thomas

    2009-08-01

    Full Text Available Abstract Background Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of few time points only, thus making the application of standard clustering techniques difficult. Results We developed two new algorithms that are capable of extracting biological patterns from short time point series gene expression data. The two algorithms, ASTRO and MiMeSR, are inspired by the rank order preserving framework and the minimum mean squared residue approach, respectively. However, ASTRO and MiMeSR differ from previous approaches in that they take advantage of the relatively few number of time points in order to reduce the problem from NP-hard to linear. Tested on well-defined short time expression data, we found that our approaches are robust to noise, as well as to random patterns, and that they can correctly detect the temporal expression profile of relevant functional categories. Evaluation of our methods was performed using Gene Ontology (GO annotations and chromatin immunoprecipitation (ChIP-chip data. Conclusion Our approaches generally outperform both standard clustering algorithms and algorithms designed specifically for clustering of short time series gene expression data. Both algorithms are available at http://www.benoslab.pitt.edu/astro/.

  18. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes

    Science.gov (United States)

    Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung

    2016-01-01

    Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of

  19. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes.

    Directory of Open Access Journals (Sweden)

    Samuel Sunghwan Cho

    Full Text Available Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs. However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods

  20. Frequency-based time-series gene expression recomposition using PRIISM

    Directory of Open Access Journals (Sweden)

    Rosa Bruce A

    2012-06-01

    . Conclusion PRIISM is a novel approach for overcoming the problem of circadian disruptions from stress treatments on plants. PRIISM can be integrated with any existing analysis approach on gene expression data to separate circadian-influenced changes in gene expression, and it can be extended to apply to any organism with regular oscillations in gene expression patterns across a large portion of the genome.

  1. Discovery of time-delayed gene regulatory networks based on temporal gene expression profiling

    Directory of Open Access Journals (Sweden)

    Guo Zheng

    2006-01-01

    Full Text Available Abstract Background It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of versatile life phenomena, to name a few, cell cycling, developmental biology, aging, and the progressive and recurrent pathogenesis of complex diseases. The vast amount of large-scale and genome-wide time-resolved data is becoming increasing available, which provides the golden opportunity to unravel the challenging reverse-engineering problem of time-delayed gene regulatory networks. Results In particular, this methodological paper aims to reconstruct regulatory networks from temporal gene expression data by using delayed correlations between genes, i.e., pairwise overlaps of expression levels shifted in time relative each other. We have thus developed a novel model-free computational toolbox termed TdGRN (Time-delayed Gene Regulatory Network to address the underlying regulations of genes that can span any unit(s of time intervals. This bioinformatics toolbox has provided a unified approach to uncovering time trends of gene regulations through decision analysis of the newly designed time-delayed gene expression matrix. We have applied the proposed method to yeast cell cycling and human HeLa cell cycling and have discovered most of the underlying time-delayed regulations that are supported by multiple lines of experimental evidence and that are remarkably consistent with the current knowledge on phase characteristics for the cell cyclings. Conclusion We established a usable and powerful model-free approach to dissecting high-order dynamic trends of gene-gene interactions. We have carefully validated the proposed algorithm by applying it to two publicly available cell cycling datasets. In addition to uncovering the time trends of gene regulations for cell cycling, this unified approach can also be used to study the complex

  2. Radiopharmaceuticals to monitor the expression of transferred genes in gene transfer therapy

    International Nuclear Information System (INIS)

    Wiebe, L. I.

    1997-01-01

    The development and application of radiopharmaceuticals has, in many instances, been based on the pharmacological properties of therapeutic agents. The molecular biology-biotechnology revolution has had an important impact on treatment of diseases, in part through the reduced toxicity of 'biologicals', in part because of their specificity for interaction at unique molecular sites and in part because of their selective delivery to the target site. Immunotherapeutic approaches include the use of monoclonal antibodies (MABs), MAB-fragments and chemotactic peptides. Such agents currently form the basis of both diagnostic and immunotherapeutic radiopharmaceuticals. More recently, gene transfer techniques have been advanced to the point that a new molecular approach, gene therapy, has become a reality. Gene therapy offers an opportunity to attack disease at its most fundamental level. The therapeutic mechanism is based on the expression of a specific gene or genes, the product of which will invoke immunological, receptor-based or enzyme-based therapeutic modalities. Several approaches to gene therapy of cancer have been envisioned, the most clinically-advanced concepts involving the introduction of genes that will encode for molecular targets nor normally found in healthy mammalian cells. A number of gene therapy clinical trials are based on the introduction of the Herpes simplex virus type-1 (HSV-1) gene that encodes for viral thymidine kinase (tk+). Once HSV-1 tk+ is expressed in the target (cancer) cell, therapy can be effected by the administration of a highly molecularly-targeted and systemically non-toxic antiviral drug such as ganciclovir. The development of radiodiagnostic imaging in gene therapy will be reviewed, using HSV-1 tk+ and radioiodinated IVFRU as a basis for development of the theme. Molecular targets that could be exploited in gene therapy, other than tk+, will be identified

  3. Radiopharmaceuticals to monitor the expression of transferred genes in gene transfer therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, L I [University of Alberta, Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-10-01

    The development and application of radiopharmaceuticals has, in many instances, been based on the pharmacological properties of therapeutic agents. The molecular biology-biotechnology revolution has had an important impact on treatment of diseases, in part through the reduced toxicity of `biologicals`, in part because of their specificity for interaction at unique molecular sites and in part because of their selective delivery to the target site. Immunotherapeutic approaches include the use of monoclonal antibodies (MABs), MAB-fragments and chemotactic peptides. Such agents currently form the basis of both diagnostic and immunotherapeutic radiopharmaceuticals. More recently, gene transfer techniques have been advanced to the point that a new molecular approach, gene therapy, has become a reality. Gene therapy offers an opportunity to attack disease at its most fundamental level. The therapeutic mechanism is based on the expression of a specific gene or genes, the product of which will invoke immunological, receptor-based or enzyme-based therapeutic modalities. Several approaches to gene therapy of cancer have been envisioned, the most clinically-advanced concepts involving the introduction of genes that will encode for molecular targets nor normally found in healthy mammalian cells. A number of gene therapy clinical trials are based on the introduction of the Herpes simplex virus type-1 (HSV-1) gene that encodes for viral thymidine kinase (tk+). Once HSV-1 tk+ is expressed in the target (cancer) cell, therapy can be effected by the administration of a highly molecularly-targeted and systemically non-toxic antiviral drug such as ganciclovir. The development of radiodiagnostic imaging in gene therapy will be reviewed, using HSV-1 tk+ and radioiodinated IVFRU as a basis for development of the theme. Molecular targets that could be exploited in gene therapy, other than tk+, will be identified

  4. Biclustering methods: biological relevance and application in gene expression analysis.

    Directory of Open Access Journals (Sweden)

    Ali Oghabian

    Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering methods where genes (or respectively samples are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes. An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1 we examine how well the considered (biclustering methods differentiate various sample types; (2 we evaluate how well the groups of genes discovered by the (biclustering methods are annotated with similar Gene Ontology categories; (3 we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study and (4 we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.

  5. Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

    Science.gov (United States)

    Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

    2017-09-01

    The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative

  6. An extended Kalman filtering approach to modeling nonlinear dynamic gene regulatory networks via short gene expression time series.

    Science.gov (United States)

    Wang, Zidong; Liu, Xiaohui; Liu, Yurong; Liang, Jinling; Vinciotti, Veronica

    2009-01-01

    In this paper, the extended Kalman filter (EKF) algorithm is applied to model the gene regulatory network from gene time series data. The gene regulatory network is considered as a nonlinear dynamic stochastic model that consists of the gene measurement equation and the gene regulation equation. After specifying the model structure, we apply the EKF algorithm for identifying both the model parameters and the actual value of gene expression levels. It is shown that the EKF algorithm is an online estimation algorithm that can identify a large number of parameters (including parameters of nonlinear functions) through iterative procedure by using a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from the viewpoint of bioinformatics.

  7. Deep convolutional neural networks for annotating gene expression patterns in the mouse brain.

    Science.gov (United States)

    Zeng, Tao; Li, Rongjian; Mukkamala, Ravi; Ye, Jieping; Ji, Shuiwang

    2015-05-07

    Profiling gene expression in brain structures at various spatial and temporal scales is essential to understanding how genes regulate the development of brain structures. The Allen Developing Mouse Brain Atlas provides high-resolution 3-D in situ hybridization (ISH) gene expression patterns in multiple developing stages of the mouse brain. Currently, the ISH images are annotated with anatomical terms manually. In this paper, we propose a computational approach to annotate gene expression pattern images in the mouse brain at various structural levels over the course of development. We applied deep convolutional neural network that was trained on a large set of natural images to extract features from the ISH images of developing mouse brain. As a baseline representation, we applied invariant image feature descriptors to capture local statistics from ISH images and used the bag-of-words approach to build image-level representations. Both types of features from multiple ISH image sections of the entire brain were then combined to build 3-D, brain-wide gene expression representations. We employed regularized learning methods for discriminating gene expression patterns in different brain structures. Results show that our approach of using convolutional model as feature extractors achieved superior performance in annotating gene expression patterns at multiple levels of brain structures throughout four developing ages. Overall, we achieved average AUC of 0.894 ± 0.014, as compared with 0.820 ± 0.046 yielded by the bag-of-words approach. Deep convolutional neural network model trained on natural image sets and applied to gene expression pattern annotation tasks yielded superior performance, demonstrating its transfer learning property is applicable to such biological image sets.

  8. Network Security via Biometric Recognition of Patterns of Gene Expression

    Science.gov (United States)

    Shaw, Harry C.

    2016-01-01

    Molecular biology provides the ability to implement forms of information and network security completely outside the bounds of legacy security protocols and algorithms. This paper addresses an approach which instantiates the power of gene expression for security. Molecular biology provides a rich source of gene expression and regulation mechanisms, which can be adopted to use in the information and electronic communication domains. Conventional security protocols are becoming increasingly vulnerable due to more intensive, highly capable attacks on the underlying mathematics of cryptography. Security protocols are being undermined by social engineering and substandard implementations by IT organizations. Molecular biology can provide countermeasures to these weak points with the current security approaches. Future advances in instruments for analyzing assays will also enable this protocol to advance from one of cryptographic algorithms to an integrated system of cryptographic algorithms and real-time expression and assay of gene expression products.

  9. A robust approach to identifying tissue-specific gene expression regulatory variants using personalized human induced pluripotent stem cells.

    Directory of Open Access Journals (Sweden)

    Je-Hyuk Lee

    2009-11-01

    Full Text Available Normal variation in gene expression due to regulatory polymorphisms is often masked by biological and experimental noise. In addition, some regulatory polymorphisms may become apparent only in specific tissues. We derived human induced pluripotent stem (iPS cells from adult skin primary fibroblasts and attempted to detect tissue-specific cis-regulatory variants using in vitro cell differentiation. We used padlock probes and high-throughput sequencing for digital RNA allelotyping and measured allele-specific gene expression in primary fibroblasts, lymphoblastoid cells, iPS cells, and their differentiated derivatives. We show that allele-specific expression is both cell type and genotype-dependent, but the majority of detectable allele-specific expression loci remains consistent despite large changes in the cell type or the experimental condition following iPS reprogramming, except on the X-chromosome. We show that our approach to mapping cis-regulatory variants reduces in vitro experimental noise and reveals additional tissue-specific variants using skin-derived human iPS cells.

  10. Towards precise classification of cancers based on robust gene functional expression profiles

    Directory of Open Access Journals (Sweden)

    Zhu Jing

    2005-03-01

    Full Text Available Abstract Background Development of robust and efficient methods for analyzing and interpreting high dimension gene expression profiles continues to be a focus in computational biology. The accumulated experiment evidence supports the assumption that genes express and perform their functions in modular fashions in cells. Therefore, there is an open space for development of the timely and relevant computational algorithms that use robust functional expression profiles towards precise classification of complex human diseases at the modular level. Results Inspired by the insight that genes act as a module to carry out a highly integrated cellular function, we thus define a low dimension functional expression profile for data reduction. After annotating each individual gene to functional categories defined in a proper gene function classification system such as Gene Ontology applied in this study, we identify those functional categories enriched with differentially expressed genes. For each functional category or functional module, we compute a summary measure (s for the raw expression values of the annotated genes to capture the overall activity level of the module. In this way, we can treat the gene expressions within a functional module as an integrative data point to replace the multiple values of individual genes. We compare the classification performance of decision trees based on functional expression profiles with the conventional gene expression profiles using four publicly available datasets, which indicates that precise classification of tumour types and improved interpretation can be achieved with the reduced functional expression profiles. Conclusion This modular approach is demonstrated to be a powerful alternative approach to analyzing high dimension microarray data and is robust to high measurement noise and intrinsic biological variance inherent in microarray data. Furthermore, efficient integration with current biological knowledge

  11. Statistical approach for selection of biologically informative genes.

    Science.gov (United States)

    Das, Samarendra; Rai, Anil; Mishra, D C; Rai, Shesh N

    2018-05-20

    Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes

  12. Screening for interaction effects in gene expression data.

    Directory of Open Access Journals (Sweden)

    Peter J Castaldi

    Full Text Available Expression quantitative trait (eQTL studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.

  13. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

    DEFF Research Database (Denmark)

    Manijak, Mieszko P.; Nielsen, Henrik Bjørn

    2011-01-01

    circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....

  14. Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

    Science.gov (United States)

    Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

    2015-01-27

    Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.

  15. Dietary approaches to stop hypertension influence on insulin receptor substrate-1gene expression: A randomized controlled clinical trial

    Directory of Open Access Journals (Sweden)

    Marzieh Kafeshani

    2015-01-01

    Full Text Available Background: Insulin receptor substrate (IRS Type 1 is a main substrate for the insulin receptor, controls insulin signaling in skeletal muscle, adipose tissue, and the vascular, so it is an important candidate gene for insulin resistance (IR. We aimed to compare the effects of the Dietary Approaches to Stop Hypertension (DASH and Usual Dietary Advices (UDA on IRS1 gene expression in women at risk for cardiovascular disease. Materials and Methods: A randomized controlled clinical trial was performed in 44 women at risk for cardiovascular disease. Participants were randomly assigned to a UDA diet or the DASH diet. The DASH diet was rich in fruits, vegetables, whole grains, and low-fat dairy products and low in saturated fat, total fat, cholesterol, refined grains, and sweets, with a total of 2400 mg/day sodium. The UDA diet was a regular diet with healthy dietary advice. Gene expression was assessed by the real-time polymerase chain reaction at the first of study and after 12 weeks. Independent sample t-test and paired-samples t-test were used to compare means of all variables within and between two groups respectively. Results: IRS1 gene expression was increased in DASH group compared with UDA diet (P = 0.00. Weight and waist circumference decreased in DASH group significantly compared to the UDA group (P < 0.05 but the results between the two groups showed no significant difference. Conclusion: DASH diet increased IRS1 gene expression and probably has beneficial effects on IR risks.

  16. Whole-body gene expression pattern registration in Platynereis larvae.

    Science.gov (United States)

    Asadulina, Albina; Panzera, Aurora; Verasztó, Csaba; Liebig, Christian; Jékely, Gáspár

    2012-12-03

    Digital anatomical atlases are increasingly used in order to depict different gene expression patterns and neuronal morphologies within a standardized reference template. In evo-devo, a discipline in which the comparison of gene expression patterns is a widely used approach, such standardized anatomical atlases would allow a more rigorous assessment of the conservation of and changes in gene expression patterns during micro- and macroevolutionary time scales. Due to its small size and invariant early development, the annelid Platynereis dumerilii is particularly well suited for such studies. Recently a reference template with registered gene expression patterns has been generated for the anterior part (episphere) of the Platynereis trochophore larva and used for the detailed study of neuronal development. Here we introduce and evaluate a method for whole-body gene expression pattern registration for Platynereis trochophore and nectochaete larvae based on whole-mount in situ hybridization, confocal microscopy, and image registration. We achieved high-resolution whole-body scanning using the mounting medium 2,2'-thiodiethanol (TDE), which allows the matching of the refractive index of the sample to that of glass and immersion oil thereby reducing spherical aberration and improving depth penetration. This approach allowed us to scan entire whole-mount larvae stained with nitroblue tetrazolium/5-bromo-4-chloro-3-indolyl phosphate (NBT/BCIP) in situ hybridization and counterstained fluorescently with an acetylated-tubulin antibody and the nuclear stain 4'6-diamidino-2-phenylindole (DAPI). Due to the submicron isotropic voxel size whole-mount larvae could be scanned in any orientation. Based on the whole-body scans, we generated four different reference templates by the iterative registration and averaging of 40 individual image stacks using either the acetylated-tubulin or the nuclear-stain signal for each developmental stage. We then registered to these templates the

  17. Whole-body gene expression pattern registration in Platynereis larvae

    Directory of Open Access Journals (Sweden)

    Asadulina Albina

    2012-12-01

    Full Text Available Abstract Background Digital anatomical atlases are increasingly used in order to depict different gene expression patterns and neuronal morphologies within a standardized reference template. In evo-devo, a discipline in which the comparison of gene expression patterns is a widely used approach, such standardized anatomical atlases would allow a more rigorous assessment of the conservation of and changes in gene expression patterns during micro- and macroevolutionary time scales. Due to its small size and invariant early development, the annelid Platynereis dumerilii is particularly well suited for such studies. Recently a reference template with registered gene expression patterns has been generated for the anterior part (episphere of the Platynereis trochophore larva and used for the detailed study of neuronal development. Results Here we introduce and evaluate a method for whole-body gene expression pattern registration for Platynereis trochophore and nectochaete larvae based on whole-mount in situ hybridization, confocal microscopy, and image registration. We achieved high-resolution whole-body scanning using the mounting medium 2,2’-thiodiethanol (TDE, which allows the matching of the refractive index of the sample to that of glass and immersion oil thereby reducing spherical aberration and improving depth penetration. This approach allowed us to scan entire whole-mount larvae stained with nitroblue tetrazolium/5-bromo-4-chloro-3-indolyl phosphate (NBT/BCIP in situ hybridization and counterstained fluorescently with an acetylated-tubulin antibody and the nuclear stain 4’6-diamidino-2-phenylindole (DAPI. Due to the submicron isotropic voxel size whole-mount larvae could be scanned in any orientation. Based on the whole-body scans, we generated four different reference templates by the iterative registration and averaging of 40 individual image stacks using either the acetylated-tubulin or the nuclear-stain signal for each developmental

  18. An Effective Tri-Clustering Algorithm Combining Expression Data with Gene Regulation Information

    Directory of Open Access Journals (Sweden)

    Ao Li

    2009-04-01

    Full Text Available Motivation: Bi-clustering algorithms aim to identify sets of genes sharing similar expression patterns across a subset of conditions. However direct interpretation or prediction of gene regulatory mechanisms may be difficult as only gene expression data is used. Information about gene regulators may also be available, most commonly about which transcription factors may bind to the promoter region and thus control the expression level of a gene. Thus a method to integrate gene expression and gene regulation information is desirable for clustering and analyzing. Methods: By incorporating gene regulatory information with gene expression data, we define regulated expression values (REV as indicators of how a gene is regulated by a specific factor. Existing bi-clustering methods are extended to a three dimensional data space by developing a heuristic TRI-Clustering algorithm. An additional approach named Automatic Boundary Searching algorithm (ABS is introduced to automatically determine the boundary threshold. Results: Results based on incorporating ChIP-chip data representing transcription factor-gene interactions show that the algorithms are efficient and robust for detecting tri-clusters. Detailed analysis of the tri-cluster extracted from yeast sporulation REV data shows genes in this cluster exhibited significant differences during the middle and late stages. The implicated regulatory network was then reconstructed for further study of defined regulatory mechanisms. Topological and statistical analysis of this network demonstrated evidence of significant changes of TF activities during the different stages of yeast sporulation, and suggests this approach might be a general way to study regulatory networks undergoing transformations.

  19. VESPUCCI: exploring patterns of gene expression in grapevine

    Directory of Open Access Journals (Sweden)

    Marco eMoretto

    2016-05-01

    Full Text Available Large-scale transcriptional studies aim to decipher the dynamic cellular responses to a stimulus, like different environmental conditions. In the era of high-throughput omics biology, the most used technologies for these purposes are microarray and RNA-Seq, whose data are usually required to be deposited in public repositories upon publication. Such repositories have the enormous potential to provide a comprehensive view of how different experimental conditions lead to expression changes, by comparing gene expression across all possible measured conditions. Unfortunately, this task is greatly impaired by differences among experimental platforms that make direct comparisons difficult.In this paper we present the Vitis Expression Studies Platform Using COLOMBOS Compendia Instances (VESPUCCI, a gene expression compendium for grapevine which was built by adapting an approach originally developed for bacteria, and show how it can be used to investigate complex gene expression patterns. We integrated nearly all publicly available microarray and RNA-Seq expression data: 1608 gene expression samples from 10 different technological platforms. Each sample has been manually annotated using a controlled vocabulary developed ad hoc to ensure both human readability and computational tractability. Expression data in the compendium can be visually explored using several tools provided by the web interface or can be programmatically accessed using the REST interface. VESPUCCI is freely accessible at http://vespucci.colombos.fmach.it.

  20. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  1. Gene expression analysis of precision-cut human liver slices indicates stable expression of ADME-Tox related genes

    NARCIS (Netherlands)

    Elferink, M. G. L.; Olinga, P.; van Leeuwen, E. M.; Bauerschmidt, S.; Polman, J.; Schoonen, W. G.; Heisterkamp, S. H.; Groothuis, G. M. M.

    2011-01-01

    In the process of drug development it is of high importance to test the safety of new drugs with predictive value for human toxicity. A promising approach of toxicity testing is based on shifts in gene expression profiling of the liver. Toxicity screening based on animal liver cells cannot be

  2. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Hu Xiaohua

    2011-07-01

    Full Text Available Abstract Background The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets. Methods Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA and nonnegative matrix factorization (NMF are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles. Results In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and

  3. Large scale gene expression meta-analysis reveals tissue-specific, sex-biased gene expression in humans

    Directory of Open Access Journals (Sweden)

    Benjamin Mayne

    2016-10-01

    Full Text Available The severity and prevalence of many diseases are known to differ between the sexes. Organ specific sex-biased gene expression may underpin these and other sexually dimorphic traits. To further our understanding of sex differences in transcriptional regulation, we performed meta-analyses of sex biased gene expression in multiple human tissues. We analysed 22 publicly available human gene expression microarray data sets including over 2500 samples from 15 different tissues and 9 different organs. Briefly, by using an inverse-variance method we determined the effect size difference of gene expression between males and females. We found the greatest sex differences in gene expression in the brain, specifically in the anterior cingulate cortex, (1818 genes, followed by the heart (375 genes, kidney (224 genes, colon (218 genes and thyroid (163 genes. More interestingly, we found different parts of the brain with varying numbers and identity of sex-biased genes, indicating that specific cortical regions may influence sexually dimorphic traits. The majority of sex-biased genes in other tissues such as the bladder, liver, lungs and pancreas were on the sex chromosomes or involved in sex hormone production. On average in each tissue, 32% of autosomal genes that were expressed in a sex-biased fashion contained androgen or estrogen hormone response elements. Interestingly, across all tissues, we found approximately two-thirds of autosomal genes that were sex-biased were not under direct influence of sex hormones. To our knowledge this is the largest analysis of sex-biased gene expression in human tissues to date. We identified many sex-biased genes that were not under the direct influence of sex chromosome genes or sex hormones. These may provide targets for future development of sex-specific treatments for diseases.

  4. Finding gene regulatory network candidates using the gene expression knowledge base.

    Science.gov (United States)

    Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

    2014-12-10

    Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.

  5. Network Security via Biometric Recognition of Patterns of Gene Expression

    Science.gov (United States)

    Shaw, Harry C.

    2016-01-01

    Molecular biology provides the ability to implement forms of information and network security completely outside the bounds of legacy security protocols and algorithms. This paper addresses an approach which instantiates the power of gene expression for security. Molecular biology provides a rich source of gene expression and regulation mechanisms, which can be adopted to use in the information and electronic communication domains. Conventional security protocols are becoming increasingly vulnerable due to more intensive, highly capable attacks on the underlying mathematics of cryptography. Security protocols are being undermined by social engineering and substandard implementations by IT (Information Technology) organizations. Molecular biology can provide countermeasures to these weak points with the current security approaches. Future advances in instruments for analyzing assays will also enable this protocol to advance from one of cryptographic algorithms to an integrated system of cryptographic algorithms and real-time assays of gene expression products.

  6. Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.

    Directory of Open Access Journals (Sweden)

    Neutelings Godfrey

    2010-04-01

    Full Text Available Abstract Background Quantitative real-time PCR (qRT-PCR is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs. Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L. Results Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups. qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59. LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both ge

  7. Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

    Science.gov (United States)

    Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

    2010-04-19

    Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and Norm

  8. Differential Gene Expression and Aging

    Directory of Open Access Journals (Sweden)

    Laurent Seroude

    2002-01-01

    Full Text Available It has been established that an intricate program of gene expression controls progression through the different stages in development. The equally complex biological phenomenon known as aging is genetically determined and environmentally modulated. This review focuses on the genetic component of aging, with a special emphasis on differential gene expression. At least two genetic pathways regulating organism longevity act by modifying gene expression. Many genes are also subjected to age-dependent transcriptional regulation. Some age-related gene expression changes are prevented by caloric restriction, the most robust intervention that slows down the aging process. Manipulating the expression of some age-regulated genes can extend an organism's life span. Remarkably, the activity of many transcription regulatory elements is linked to physiological age as opposed to chronological age, indicating that orderly and tightly controlled regulatory pathways are active during aging.

  9. Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.

    Directory of Open Access Journals (Sweden)

    Christian Müller

    Full Text Available Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data.

  10. Alpha-fetoprotein-targeted reporter gene expression imaging in hepatocellular carcinoma.

    Science.gov (United States)

    Kim, Kwang Il; Chung, Hye Kyung; Park, Ju Hui; Lee, Yong Jin; Kang, Joo Hyun

    2016-07-21

    Hepatocellular carcinoma (HCC) is one of the most common cancers in Eastern Asia, and its incidence is increasing globally. Numerous experimental models have been developed to better our understanding of the pathogenic mechanism of HCC and to evaluate novel therapeutic approaches. Molecular imaging is a convenient and up-to-date biomedical tool that enables the visualization, characterization and quantification of biologic processes in a living subject. Molecular imaging based on reporter gene expression, in particular, can elucidate tumor-specific events or processes by acquiring images of a reporter gene's expression driven by tumor-specific enhancers/promoters. In this review, we discuss the advantages and disadvantages of various experimental HCC mouse models and we present in vivo images of tumor-specific reporter gene expression driven by an alpha-fetoprotein (AFP) enhancer/promoter system in a mouse model of HCC. The current mouse models of HCC development are established by xenograft, carcinogen induction and genetic engineering, representing the spectrum of tumor-inducing factors and tumor locations. The imaging analysis approach of reporter genes driven by AFP enhancer/promoter is presented for these different HCC mouse models. Such molecular imaging can provide longitudinal information about carcinogenesis and tumor progression. We expect that clinical application of AFP-targeted reporter gene expression imaging systems will be useful for the detection of AFP-expressing HCC tumors and screening of increased/decreased AFP levels due to disease or drug treatment.

  11. Automatic Control of Gene Expression in Mammalian Cells.

    Science.gov (United States)

    Fracassi, Chiara; Postiglione, Lorena; Fiore, Gianfranco; di Bernardo, Diego

    2016-04-15

    Automatic control of gene expression in living cells is paramount importance to characterize both endogenous gene regulatory networks and synthetic circuits. In addition, such a technology can be used to maintain the expression of synthetic circuit components in an optimal range in order to ensure reliable performance. Here we present a microfluidics-based method to automatically control gene expression from the tetracycline-inducible promoter in mammalian cells in real time. Our approach is based on the negative-feedback control engineering paradigm. We validated our method in a monoclonal population of cells constitutively expressing a fluorescent reporter protein (d2EYFP) downstream of a minimal CMV promoter with seven tet-responsive operator motifs (CMV-TET). These cells also constitutively express the tetracycline transactivator protein (tTA). In cells grown in standard growth medium, tTA is able to bind the CMV-TET promoter, causing d2EYFP to be maximally expressed. Upon addition of tetracycline to the culture medium, tTA detaches from the CMV-TET promoter, thus preventing d2EYFP expression. We tested two different model-independent control algorithms (relay and proportional-integral (PI)) to force a monoclonal population of cells to express an intermediate level of d2EYFP equal to 50% of its maximum expression level for up to 3500 min. The control input is either tetracycline-rich or standard growth medium. We demonstrated that both the relay and PI controllers can regulate gene expression at the desired level, despite oscillations (dampened in the case of the PI controller) around the chosen set point.

  12. Candidate innate immune system gene expression in the ecological model Daphnia.

    Science.gov (United States)

    Decaestecker, Ellen; Labbé, Pierrick; Ellegaard, Kirsten; Allen, Judith E; Little, Tom J

    2011-10-01

    The last ten years have witnessed increasing interest in host-pathogen interactions involving invertebrate hosts. The invertebrate innate immune system is now relatively well characterised, but in a limited range of genetic model organisms and under a limited number of conditions. Immune systems have been little studied under real-world scenarios of environmental variation and parasitism. Thus, we have investigated expression of candidate innate immune system genes in the water flea Daphnia, a model organism for ecological genetics, and whose capacity for clonal reproduction facilitates an exceptionally rigorous control of exposure dose or the study of responses at many time points. A unique characteristic of the particular Daphnia clones and pathogen strain combinations used presently is that they have been shown to be involved in specific host-pathogen coevolutionary interactions in the wild. We choose five genes, which are strong candidates to be involved in Daphnia-pathogen interactions, given that they have been shown to code for immune effectors in related organisms. Differential expression of these genes was quantified by qRT-PCR following exposure to the bacterial pathogen Pasteuria ramosa. Constitutive expression levels differed between host genotypes, and some genes appeared to show correlated expression. However, none of the genes appeared to show a major modification of expression level in response to Pasteuria exposure. By applying knowledge from related genetic model organisms (e.g. Drosophila) to models for the study of evolutionary ecology and coevolution (i.e. Daphnia), the candidate gene approach is temptingly efficient. However, our results show that detection of only weak patterns is likely if one chooses target genes for study based on previously identified genome sequences by comparison to homologues from other related organisms. Future work on the Daphnia-Pasteuria system will need to balance a candidate gene approach with more comprehensive

  13. Down-Regulation of Gene Expression by RNA-Induced Gene Silencing

    Science.gov (United States)

    Travella, Silvia; Keller, Beat

    Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.

  14. Inferring Drosophila gap gene regulatory network: Pattern analysis of simulated gene expression profiles and stability analysis

    OpenAIRE

    Fomekong-Nanfack, Y.; Postma, M.; Kaandorp, J.A.

    2009-01-01

    Abstract Background Inference of gene regulatory networks (GRNs) requires accurate data, a method to simulate the expression patterns and an efficient optimization algorithm to estimate the unknown parameters. Using this approach it is possible to obtain alternative circuits without making any a priori assumptions about the interactions, which all simulate the observed patterns. It is important to analyze the properties of the circuits. Findings We have analyzed the simulated gene expression ...

  15. The AERO system: a 3D-like approach for recording gene expression patterns in the whole mouse embryo.

    Directory of Open Access Journals (Sweden)

    Hirohito Shimizu

    Full Text Available We have recently constructed a web-based database of gene expression in the mouse whole embryo, EMBRYS (http://embrys.jp/embrys/html/MainMenu.html. To allow examination of gene expression patterns to the fullest extent possible, this database provides both photo images and annotation data. However, since embryos develop via an intricate process of morphogenesis, it would be of great value to track embryonic gene expression from a three dimensional perspective. In fact, several methods have been developed to achieve this goal, but highly laborious procedures and specific operational skills are generally required. We utilized a novel microscopic technique that enables the easy capture of rotational, 3D-like images of the whole embryo. In this method, a rotary head equipped with two mirrors that are designed to obtain an image tilted at 45 degrees to the microscope stage captures serial images at 2-degree intervals. By a simple operation, 180 images are automatically collected. These 2D images obtained at multiple angles are then used to reconstruct 3D-like images, termed AERO images. By means of this system, over 800 AERO images of 191 gene expression patterns were captured. These images can be easily rotated on the computer screen using the EMBRYS database so that researchers can view an entire embryo by a virtual viewing on a computer screen in an unbiased or non-predetermined manner. The advantages afforded by this approach make it especially useful for generating data viewed in public databases.

  16. Robust assignment of cancer subtypes from expression data using a uni-variate gene expression average as classifier

    International Nuclear Information System (INIS)

    Lauss, Martin; Frigyesi, Attila; Ryden, Tobias; Höglund, Mattias

    2010-01-01

    Genome wide gene expression data is a rich source for the identification of gene signatures suitable for clinical purposes and a number of statistical algorithms have been described for both identification and evaluation of such signatures. Some employed algorithms are fairly complex and hence sensitive to over-fitting whereas others are more simple and straight forward. Here we present a new type of simple algorithm based on ROC analysis and the use of metagenes that we believe will be a good complement to existing algorithms. The basis for the proposed approach is the use of metagenes, instead of collections of individual genes, and a feature selection using AUC values obtained by ROC analysis. Each gene in a data set is assigned an AUC value relative to the tumor class under investigation and the genes are ranked according to these values. Metagenes are then formed by calculating the mean expression level for an increasing number of ranked genes, and the metagene expression value that optimally discriminates tumor classes in the training set is used for classification of new samples. The performance of the metagene is then evaluated using LOOCV and balanced accuracies. We show that the simple uni-variate gene expression average algorithm performs as well as several alternative algorithms such as discriminant analysis and the more complex approaches such as SVM and neural networks. The R package rocc is freely available at http://cran.r-project.org/web/packages/rocc/index.html

  17. Prediction of highly expressed genes in microbes based on chromatin accessibility

    Directory of Open Access Journals (Sweden)

    Ussery David W

    2007-02-01

    Full Text Available Abstract Background It is well known that gene expression is dependent on chromatin structure in eukaryotes and it is likely that chromatin can play a role in bacterial gene expression as well. Here, we use a nucleosomal position preference measure of anisotropic DNA flexibility to predict highly expressed genes in microbial genomes. We compare these predictions with those based on codon adaptation index (CAI values, and also with experimental data for 6 different microbial genomes, with a particular interest in experimental data from Escherichia coli. Moreover, position preference is examined further in 328 sequenced microbial genomes. Results We find that absolute gene expression levels are correlated with the position preference in many microbial genomes. It is postulated that in these regions, the DNA may be more accessible to the transcriptional machinery. Moreover, ribosomal proteins and ribosomal RNA are encoded by DNA having significantly lower position preference values than other genes in fast-replicating microbes. Conclusion This insight into DNA structure-dependent gene expression in microbes may be exploited for predicting the expression of non-translated genes such as non-coding RNAs that may not be predicted by any of the conventional codon usage bias approaches.

  18. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  19. Autism and increased paternal age related changes in global levels of gene expression regulation.

    Directory of Open Access Journals (Sweden)

    Mark D Alter

    2011-02-01

    Full Text Available A causal role of mutations in multiple general transcription factors in neurodevelopmental disorders including autism suggested that alterations in global levels of gene expression regulation might also relate to disease risk in sporadic cases of autism. This premise can be tested by evaluating for changes in the overall distribution of gene expression levels. For instance, in mice, variability in hippocampal-dependent behaviors was associated with variability in the pattern of the overall distribution of gene expression levels, as assessed by variance in the distribution of gene expression levels in the hippocampus. We hypothesized that a similar change in variance might be found in children with autism. Gene expression microarrays covering greater than 47,000 unique RNA transcripts were done on RNA from peripheral blood lymphocytes (PBL of children with autism (n = 82 and controls (n = 64. Variance in the distribution of gene expression levels from each microarray was compared between groups of children. Also tested was whether a risk factor for autism, increased paternal age, was associated with variance. A decrease in the variance in the distribution of gene expression levels in PBL was associated with the diagnosis of autism and a risk factor for autism, increased paternal age. Traditional approaches to microarray analysis of gene expression suggested a possible mechanism for decreased variance in gene expression. Gene expression pathways involved in transcriptional regulation were down-regulated in the blood of children with autism and children of older fathers. Thus, results from global and gene specific approaches to studying microarray data were complimentary and supported the hypothesis that alterations at the global level of gene expression regulation are related to autism and increased paternal age. Global regulation of transcription, thus, represents a possible point of convergence for multiple etiologies of autism and other

  20. Gene Therapy Approaches to Hemoglobinopathies.

    Science.gov (United States)

    Ferrari, Giuliana; Cavazzana, Marina; Mavilio, Fulvio

    2017-10-01

    Gene therapy for hemoglobinopathies is currently based on transplantation of autologous hematopoietic stem cells genetically modified with a lentiviral vector expressing a globin gene under the control of globin transcriptional regulatory elements. Preclinical and early clinical studies showed the safety and potential efficacy of this therapeutic approach as well as the hurdles still limiting its general application. In addition, for both beta-thalassemia and sickle cell disease, an altered bone marrow microenvironment reduces the efficiency of stem cell harvesting as well as engraftment. These hurdles need be addressed for gene therapy for hemoglobinopathies to become a clinical reality. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Development of Gene Expression Signatures for Practical Radiation Biodosimetry

    International Nuclear Information System (INIS)

    Paul, Sunirmal; Amundson, Sally A.

    2008-01-01

    Purpose: In a large-scale radiologic emergency, estimates of exposure doses and radiation injury would be required for individuals without physical dosimeters. Current methods are inadequate for the task, so we are developing gene expression profiles for radiation biodosimetry. This approach could provide both an estimate of physical radiation dose and an indication of the extent of individual injury or future risk. Methods and Materials: We used whole genome microarray expression profiling as a discovery platform to identify genes with the potential to predict radiation dose across an exposure range relevant for medical decision making in a radiologic emergency. Human peripheral blood from 10 healthy donors was irradiated ex vivo, and global gene expression was measured both 6 and 24 h after exposure. Results: A 74-gene signature was identified that distinguishes between four radiation doses (0.5, 2, 5, and 8 Gy) and controls. More than one third of these genes are regulated by TP53. A nearest centroid classifier using these same 74 genes correctly predicted 98% of samples taken either 6 h or 24 h after treatment as unexposed, exposed to 0.5, 2, or ≥5 Gy. Expression patterns of five genes (CDKN1A, FDXR, SESN1, BBC3, and PHPT1) from this signature were also confirmed by real-time polymerase chain reaction. Conclusion: The ability of a single gene set to predict radiation dose throughout a window of time without need for individual pre-exposure controls represents an important advance in the development of gene expression for biodosimetry

  2. Garlic Influences Gene Expression In Vivo and In Vitro.

    Science.gov (United States)

    Charron, Craig S; Dawson, Harry D; Novotny, Janet A

    2016-02-01

    There is a large body of preclinical research aimed at understanding the roles of garlic and garlic-derived preparations in the promotion of human health. Most of this research has targeted the possible functions of garlic in maintaining cardiovascular health and in preventing and treating cancer. A wide range of outcome variables has been used to investigate the bioactivity of garlic, ranging from direct measures of health status such as cholesterol concentrations, blood pressure, and changes in tumor size and number, to molecular and biochemical measures such as mRNA gene expression, protein concentration, enzyme activity, and histone acetylation status. Determination of how garlic influences mRNA gene expression has proven to be a valuable approach to elucidating the mechanisms of garlic bioactivity. Preclinical studies investigating the health benefits of garlic far outnumber human studies and have made frequent use of mRNA gene expression measurement. There is an immediate need to understand mRNA gene expression in humans as well. Although safety and ethical constraints limit the types of available human tissue, peripheral whole blood is readily accessible, and measuring mRNA gene expression in whole blood may provide a unique window to understanding how garlic intake affects human health. © 2016 American Society for Nutrition.

  3. Multi-targeted priming for genome-wide gene expression assays

    Directory of Open Access Journals (Sweden)

    Adomas Aleksandra B

    2010-08-01

    Full Text Available Abstract Background Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. Results We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Conclusions Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and

  4. Gene expression network reconstruction by convex feature selection when incorporating genetic perturbations.

    Directory of Open Access Journals (Sweden)

    Benjamin A Logsdon

    Full Text Available Cellular gene expression measurements contain regulatory information that can be used to discover novel network relationships. Here, we present a new algorithm for network reconstruction powered by the adaptive lasso, a theoretically and empirically well-behaved method for selecting the regulatory features of a network. Any algorithms designed for network discovery that make use of directed probabilistic graphs require perturbations, produced by either experiments or naturally occurring genetic variation, to successfully infer unique regulatory relationships from gene expression data. Our approach makes use of appropriately selected cis-expression Quantitative Trait Loci (cis-eQTL, which provide a sufficient set of independent perturbations for maximum network resolution. We compare the performance of our network reconstruction algorithm to four other approaches: the PC-algorithm, QTLnet, the QDG algorithm, and the NEO algorithm, all of which have been used to reconstruct directed networks among phenotypes leveraging QTL. We show that the adaptive lasso can outperform these algorithms for networks of ten genes and ten cis-eQTL, and is competitive with the QDG algorithm for networks with thirty genes and thirty cis-eQTL, with rich topologies and hundreds of samples. Using this novel approach, we identify unique sets of directed relationships in Saccharomyces cerevisiae when analyzing genome-wide gene expression data for an intercross between a wild strain and a lab strain. We recover novel putative network relationships between a tyrosine biosynthesis gene (TYR1, and genes involved in endocytosis (RCY1, the spindle checkpoint (BUB2, sulfonate catabolism (JLP1, and cell-cell communication (PRM7. Our algorithm provides a synthesis of feature selection methods and graphical model theory that has the potential to reveal new directed regulatory relationships from the analysis of population level genetic and gene expression data.

  5. Expression stability and selection of optimal reference genes for gene expression normalization in early life stage rainbow trout exposed to cadmium and copper.

    Science.gov (United States)

    Shekh, Kamran; Tang, Song; Niyogi, Som; Hecker, Markus

    2017-09-01

    Gene expression analysis represents a powerful approach to characterize the specific mechanisms by which contaminants interact with organisms. One of the key considerations when conducting gene expression analyses using quantitative real-time reverse transcription-polymerase chain reaction (qPCR) is the selection of appropriate reference genes, which is often overlooked. Specifically, to reach meaningful conclusions when using relative quantification approaches, expression levels of reference genes must be highly stable and cannot vary as a function of experimental conditions. However, to date, information on the stability of commonly used reference genes across developmental stages, tissues and after exposure to contaminants such as metals is lacking for many vertebrate species including teleost fish. Therefore, in this study, we assessed the stability of expression of 8 reference gene candidates in the gills and skin of three different early life-stages of rainbow trout after acute exposure (24h) to two metals, cadmium (Cd) and copper (Cu) using qPCR. Candidate housekeeping genes were: beta actin (b-actin), DNA directed RNA polymerase II subunit I (DRP2), elongation factor-1 alpha (EF1a), glyceraldehyde 3-phosphate dehydrogenase (GAPDH), glucose-6-phosphate dehydrogenase (G6PD), hypoxanthine phosphoribosyltransferase (HPRT), ribosomal protein L8 (RPL8), and 18S ribosomal RNA (18S). Four algorithms, geNorm, NormFinder, BestKeeper, and the comparative ΔCt method were employed to systematically evaluate the expression stability of these candidate genes under control and exposed conditions as well as across three different life-stages. Finally, stability of genes was ranked by taking geometric means of the ranks established by the different methods. Stability of reference genes was ranked in the following order (from lower to higher stability): HPRT

  6. Sequential Logic Model Deciphers Dynamic Transcriptional Control of Gene Expressions

    Science.gov (United States)

    Yeo, Zhen Xuan; Wong, Sum Thai; Arjunan, Satya Nanda Vel; Piras, Vincent; Tomita, Masaru; Selvarajoo, Kumar; Giuliani, Alessandro; Tsuchiya, Masa

    2007-01-01

    Background Cellular signaling involves a sequence of events from ligand binding to membrane receptors through transcription factors activation and the induction of mRNA expression. The transcriptional-regulatory system plays a pivotal role in the control of gene expression. A novel computational approach to the study of gene regulation circuits is presented here. Methodology Based on the concept of finite state machine, which provides a discrete view of gene regulation, a novel sequential logic model (SLM) is developed to decipher control mechanisms of dynamic transcriptional regulation of gene expressions. The SLM technique is also used to systematically analyze the dynamic function of transcriptional inputs, the dependency and cooperativity, such as synergy effect, among the binding sites with respect to when, how much and how fast the gene of interest is expressed. Principal Findings SLM is verified by a set of well studied expression data on endo16 of Strongylocentrotus purpuratus (sea urchin) during the embryonic midgut development. A dynamic regulatory mechanism for endo16 expression controlled by three binding sites, UI, R and Otx is identified and demonstrated to be consistent with experimental findings. Furthermore, we show that during transition from specification to differentiation in wild type endo16 expression profile, SLM reveals three binary activities are not sufficient to explain the transcriptional regulation of endo16 expression and additional activities of binding sites are required. Further analyses suggest detailed mechanism of R switch activity where indirect dependency occurs in between UI activity and R switch during specification to differentiation stage. Conclusions/Significance The sequential logic formalism allows for a simplification of regulation network dynamics going from a continuous to a discrete representation of gene activation in time. In effect our SLM is non-parametric and model-independent, yet providing rich biological

  7. Sequential logic model deciphers dynamic transcriptional control of gene expressions.

    Directory of Open Access Journals (Sweden)

    Zhen Xuan Yeo

    Full Text Available BACKGROUND: Cellular signaling involves a sequence of events from ligand binding to membrane receptors through transcription factors activation and the induction of mRNA expression. The transcriptional-regulatory system plays a pivotal role in the control of gene expression. A novel computational approach to the study of gene regulation circuits is presented here. METHODOLOGY: Based on the concept of finite state machine, which provides a discrete view of gene regulation, a novel sequential logic model (SLM is developed to decipher control mechanisms of dynamic transcriptional regulation of gene expressions. The SLM technique is also used to systematically analyze the dynamic function of transcriptional inputs, the dependency and cooperativity, such as synergy effect, among the binding sites with respect to when, how much and how fast the gene of interest is expressed. PRINCIPAL FINDINGS: SLM is verified by a set of well studied expression data on endo16 of Strongylocentrotus purpuratus (sea urchin during the embryonic midgut development. A dynamic regulatory mechanism for endo16 expression controlled by three binding sites, UI, R and Otx is identified and demonstrated to be consistent with experimental findings. Furthermore, we show that during transition from specification to differentiation in wild type endo16 expression profile, SLM reveals three binary activities are not sufficient to explain the transcriptional regulation of endo16 expression and additional activities of binding sites are required. Further analyses suggest detailed mechanism of R switch activity where indirect dependency occurs in between UI activity and R switch during specification to differentiation stage. CONCLUSIONS/SIGNIFICANCE: The sequential logic formalism allows for a simplification of regulation network dynamics going from a continuous to a discrete representation of gene activation in time. In effect our SLM is non-parametric and model-independent, yet

  8. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

    Directory of Open Access Journals (Sweden)

    Sahra Uygun

    2016-12-01

    Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.

  9. Elucidating gene function and function evolution through comparison of co-expression networks in plants

    Directory of Open Access Journals (Sweden)

    Marek eMutwil

    2014-08-01

    Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.

  10. Scaling of gene expression data allowing the comparison of different gene expression platforms

    NARCIS (Netherlands)

    van Ruissen, Fred; Schaaf, Gerben J.; Kool, Marcel; Baas, Frank; Ruijter, Jan M.

    2008-01-01

    Serial analysis of gene expression (SAGE) and microarrays have found a widespread application, but much ambiguity exists regarding the amalgamation of the data resulting from these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce

  11. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis

    Directory of Open Access Journals (Sweden)

    Akira Ishikawa

    2017-11-01

    Full Text Available Large numbers of quantitative trait loci (QTL affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  12. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

    Science.gov (United States)

    Ishikawa, Akira

    2017-11-27

    Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  13. Genetic architecture of gene expression in ovine skeletal muscle

    DEFF Research Database (Denmark)

    Kogelman, Lisette Johanna Antonia; Byrne, Keren; Vuocolo, Tony

    2011-01-01

    architecture to the gene expression data, which also discriminated the sire-based Estimated Breeding Value for the trait. An integrated systems biology approach was then used to identify the major functional pathways contributing to the genetics of enhanced muscling by using both Estimated Breeding Value...... has potential, amongst other mechanisms, to alter gene expression via cis- or trans-acting mechanisms in a manner that impacts the functional activities of specific pathways that contribute to muscling traits. By integrating sire-based genetic merit information for a muscling trait with progeny...

  14. Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior.

    Science.gov (United States)

    Tsoi, Lam C; Qin, Tingting; Slate, Elizabeth H; Zheng, W Jim

    2011-11-11

    To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets. We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis. CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray

  15. Cell surface expression of single chain antibodies with applications to imaging of gene expression in vivo

    International Nuclear Information System (INIS)

    Northrop, Jeffrey P.; Bednarski, Mark; Li, King C.; Barbieri, Susan O.; Lu, Amy T.; Nguyen, Dee; Varadarajan, John; Osen, Maureen; Star-Lack, Josh

    2003-01-01

    Imaging of gene expression in vivo has many potential uses for biomedical research and drug discovery, ranging from the study of gene regulation and cancer to the non-invasive assessment of gene therapies. To streamline the development of imaging marker gene technologies for nuclear medicine, we propose a new approach to the design of reporter/probe pairs wherein the reporter is a cell surface-expressed single chain antibody variable fragment that has been raised against a low molecular weight imaging probe with optimized pharmacokinetic properties. Proof of concept of the approach was achieved using a single chain antibody variable fragment that binds with high affinity to fluorescein and an imaging probe consisting of fluorescein isothiocyanate coupled to the chelator diethylene triamine penta-acetic acid labeled with the gamma-emitter 111 In. We demonstrate specific high-affinity binding of this probe to the cell surface-expressed reporter in vitro and assess the in vivo biodistribution of the probe both in wild-type mice and in mice harboring tumor xenografts expressing the reporter. Specific uptake of the probe by, and in vivo imaging of, tumors expressing the reporter are shown. Since ScFvs with high affinities can be raised to almost any protein or small molecule, the proposed methodology may offer a new flexibility in the design of imaging tracer/reporter pairs wherein both probe pharmacokinetics and binding affinities can be readily optimized. (orig.)

  16. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

    Science.gov (United States)

    Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

    2016-03-11

    Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.

  17. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.

  18. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782

  19. Visually Relating Gene Expression and in vivo DNA Binding Data

    Energy Technology Data Exchange (ETDEWEB)

    Huang, Min-Yu; Mackey, Lester; Ker?,; nen, Soile V. E.; Weber, Gunther H.; Jordan, Michael I.; Knowles, David W.; Biggin, Mark D.; Hamann, Bernd

    2011-09-20

    Gene expression and in vivo DNA binding data provide important information for understanding gene regulatory networks: in vivo DNA binding data indicate genomic regions where transcription factors are bound, and expression data show the output resulting from this binding. Thus, there must be functional relationships between these two types of data. While visualization and data analysis tools exist for each data type alone, there is a lack of tools that can easily explore the relationship between them. We propose an approach that uses the average expression driven by multiple of ciscontrol regions to visually relate gene expression and in vivo DNA binding data. We demonstrate the utility of this tool with examples from the network controlling early Drosophila development. The results obtained support the idea that the level of occupancy of a transcription factor on DNA strongly determines the degree to which the factor regulates a target gene, and in some cases also controls whether the regulation is positive or negative.

  20. Integration of gene expression and methylation to unravel biological networks in glioblastoma patients.

    Science.gov (United States)

    Gadaleta, Francesco; Bessonov, Kyrylo; Van Steen, Kristel

    2017-02-01

    The vast amount of heterogeneous omics data, encompassing a broad range of biomolecular information, requires novel methods of analysis, including those that integrate the available levels of information. In this work, we describe Regression2Net, a computational approach that is able to integrate gene expression and genomic or methylation data in two steps. First, penalized regressions are used to build Expression-Expression (EEnet) and Expression-Genomic or Expression-Methylation (EMnet) networks. Second, network theory is used to highlight important communities of genes. When applying our approach, Regression2Net to gene expression and methylation profiles for individuals with glioblastoma multiforme, we identified, respectively, 284 and 447 potentially interesting genes in relation to glioblastoma pathology. These genes showed at least one connection in the integrated networks ANDnet and XORnet derived from aforementioned EEnet and EMnet networks. Although the edges in ANDnet occur in both EEnet and EMnet, the edges in XORnet occur in EMnet but not in EEnet. In-depth biological analysis of connected genes in ANDnet and XORnet revealed genes that are related to energy metabolism, cell cycle control (AATF), immune system response, and several cancer types. Importantly, we observed significant overrepresentation of cancer-related pathways including glioma, especially in the XORnet network, suggesting a nonignorable role of methylation in glioblastoma multiforma. In the ANDnet, we furthermore identified potential glioma suppressor genes ACCN3 and ACCN4 linked to the NBPF1 neuroblastoma breakpoint family, as well as numerous ABC transporter genes (ABCA1, ABCB1) suggesting drug resistance of glioblastoma tumors. © 2016 WILEY PERIODICALS, INC.

  1. Creating and validating cis-regulatory maps of tissue-specific gene expression regulation

    Science.gov (United States)

    O'Connor, Timothy R.; Bailey, Timothy L.

    2014-01-01

    Predicting which genomic regions control the transcription of a given gene is a challenge. We present a novel computational approach for creating and validating maps that associate genomic regions (cis-regulatory modules–CRMs) with genes. The method infers regulatory relationships that explain gene expression observed in a test tissue using widely available genomic data for ‘other’ tissues. To predict the regulatory targets of a CRM, we use cross-tissue correlation between histone modifications present at the CRM and expression at genes within 1 Mbp of it. To validate cis-regulatory maps, we show that they yield more accurate models of gene expression than carefully constructed control maps. These gene expression models predict observed gene expression from transcription factor binding in the CRMs linked to that gene. We show that our maps are able to identify long-range regulatory interactions and improve substantially over maps linking genes and CRMs based on either the control maps or a ‘nearest neighbor’ heuristic. Our results also show that it is essential to include CRMs predicted in multiple tissues during map-building, that H3K27ac is the most informative histone modification, and that CAGE is the most informative measure of gene expression for creating cis-regulatory maps. PMID:25200088

  2. Prostate cancer-associated gene expression alterations determined from needle biopsies.

    Science.gov (United States)

    Qian, David Z; Huang, Chung-Ying; O'Brien, Catherine A; Coleman, Ilsa M; Garzotto, Mark; True, Lawrence D; Higano, Celestia S; Vessella, Robert; Lange, Paul H; Nelson, Peter S; Beer, Tomasz M

    2009-05-01

    To accurately identify gene expression alterations that differentiate neoplastic from normal prostate epithelium using an approach that avoids contamination by unwanted cellular components and is not compromised by acute gene expression changes associated with tumor devascularization and resulting ischemia. Approximately 3,000 neoplastic and benign prostate epithelial cells were isolated using laser capture microdissection from snap-frozen prostate biopsy specimens provided by 31 patients who subsequently participated in a clinical trial of preoperative chemotherapy. cDNA synthesized from amplified total RNA was hybridized to custom-made microarrays composed of 6,200 clones derived from the Prostate Expression Database. Expression differences for selected genes were verified using quantitative reverse transcription-PCR. Comparative analyses identified 954 transcript alterations associated with cancer (q transport. Genes down-regulated in prostate cancers were enriched in categories related to immune response, cellular responses to pathogens, and apoptosis. A heterogeneous pattern of androgen receptor expression changes was noted. In exploratory analyses, androgen receptor down-regulation was associated with a lower probability of cancer relapse after neoadjuvant chemotherapy followed by radical prostatectomy. Assessments of tumor phenotypes based on gene expression for treatment stratification and drug targeting of oncogenic alterations may best be ascertained using biopsy-based analyses where the effects of ischemia do not complicate interpretation.

  3. Building gene co-expression networks using transcriptomics data for systems biology investigations

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Watson-Haigh, Nathan S.

    2012-01-01

    Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four......) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT...... (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended....

  4. Using gene expression noise to understand gene regulation

    NARCIS (Netherlands)

    Munsky, B.; Neuert, G.; van Oudenaarden, A.

    2012-01-01

    Phenotypic variation is ubiquitous in biology and is often traceable to underlying genetic and environmental variation. However, even genetically identical cells in identical environments display variable phenotypes. Stochastic gene expression, or gene expression "noise," has been suggested as a

  5. A functional genomics approach using radiation-induced changes in gene expression to study low dose radiation effects in vitro and in vivo

    Energy Technology Data Exchange (ETDEWEB)

    Fornace, Jr, A J

    2007-03-03

    Abstract for final report for project entitled A functional genomics approach using radiation-induced changes in gene expression to study low dose radiation effects in vitro and in vivo which has been supported by the DOE Low Dose Radiation Research Program for approximately 7 years. This project has encompassed two sequential awards, ER62683 and then ER63308, in the Gene Response Section in the Center for Cancer Research at the National Cancer Institute. The project was temporarily suspended during the relocation of the Principal Investigators laboratory to the Dept. of Genetics and Complex Diseases at Harvard School of Public Health at the end of 2004. Remaining support for the final year was transferred to this new site later in 2005 and was assigned the DOE Award Number ER64065. The major aims of this project have been 1) to characterize changes in gene expression in response to low-dose radiation responses; this includes responses in human cells lines, peripheral blood lymphocytes (PBL), and in vivo after human or murine exposures, as well as the effect of dose-rate on gene responses; 2) to characterize changes in gene expression that may be involved in bystander effects, such as may be mediated by cytokines and other intercellular signaling proteins; and 3) to characterize responses in transgenic mouse models with relevance to genomic stability. A variety of approaches have been used to study transcriptional events including microarray hybridization, quantitative single-probe hybridization which was developed in this laboratory, quantitative RT-PCR, and promoter microarray analysis using genomic regulatory motifs. Considering the frequent responsiveness of genes encoding cytokines and related signaling proteins that can affect cellular metabolism, initial efforts were initiated to study radiation responses at the metabolomic level and to correlate with radiation-responsive gene expression. Productivity includes twenty-four published and in press manuscripts

  6. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

    Directory of Open Access Journals (Sweden)

    Hala Alshamlan

    2015-01-01

    Full Text Available An artificial bee colony (ABC is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR, and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO. The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  7. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

    Science.gov (United States)

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  8. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes.

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  9. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V.

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  10. Strategies used for genetically modifying bacterial genome: ite-directed mutagenesis, gene inactivation, and gene over-expression*

    Science.gov (United States)

    Xu, Jian-zhong; Zhang, Wei-guo

    2016-01-01

    With the availability of the whole genome sequence of Escherichia coli or Corynebacterium glutamicum, strategies for directed DNA manipulation have developed rapidly. DNA manipulation plays an important role in understanding the function of genes and in constructing novel engineering bacteria according to requirement. DNA manipulation involves modifying the autologous genes and expressing the heterogenous genes. Two alternative approaches, using electroporation linear DNA or recombinant suicide plasmid, allow a wide variety of DNA manipulation. However, the over-expression of the desired gene is generally executed via plasmid-mediation. The current review summarizes the common strategies used for genetically modifying E. coli and C. glutamicum genomes, and discusses the technical problem of multi-layered DNA manipulation. Strategies for gene over-expression via integrating into genome are proposed. This review is intended to be an accessible introduction to DNA manipulation within the bacterial genome for novices and a source of the latest experimental information for experienced investigators. PMID:26834010

  11. Elimination of contaminating cap genes in AAV vector virions reduces immune responses and improves transgene expression in a canine gene therapy model.

    Science.gov (United States)

    Wang, Z; Halbert, C L; Lee, D; Butts, T; Tapscott, S J; Storb, R; Miller, A D

    2014-04-01

    Animal and human gene therapy studies utilizing AAV vectors have shown that immune responses to AAV capsid proteins can severely limit transgene expression. The main source of capsid antigen is that associated with the AAV vectors, which can be reduced by stringent vector purification. A second source of AAV capsid proteins is that expressed from cap genes aberrantly packaged into AAV virions during vector production. This antigen source can be eliminated by the use of a cap gene that is too large to be incorporated into an AAV capsid, such as a cap gene containing a large intron (captron gene). Here, we investigated the effects of elimination of cap gene transfer and of vector purification by CsCl gradient centrifugation on AAV vector immunogenicity and expression following intramuscular injection in dogs. We found that both approaches reduced vector immunogenicity and that combining the two produced the lowest immune responses and highest transgene expression. This combined approach enabled the use of a relatively mild immunosuppressive regimen to promote robust micro-dystrophin gene expression in Duchenne muscular dystrophy-affected dogs. Our study shows the importance of minimizing AAV cap gene impurities and indicates that this improvement in AAV vector production may benefit human applications.

  12. Finding biological process modifications in cancer tissues by mining gene expression correlations

    Directory of Open Access Journals (Sweden)

    Storari Sergio

    2006-01-01

    Full Text Available Abstract Background Through the use of DNA microarrays it is now possible to obtain quantitative measurements of the expression of thousands of genes from a biological sample. This technology yields a global view of gene expression that can be used in several ways. Functional insight into expression profiles is routinely obtained by using Gene Ontology terms associated to the cellular genes. In this paper, we deal with functional data mining from expression profiles, proposing a novel approach that studies the correlations between genes and their relations to Gene Ontology (GO. By using this "functional correlations comparison" we explore all possible pairs of genes identifying the affected biological processes by analyzing in a pair-wise manner gene expression patterns and linking correlated pairs with Gene Ontology terms. Results We apply here this "functional correlations comparison" approach to identify the existing correlations in hepatocarcinoma (161 microarray experiments and to reveal functional differences between normal liver and cancer tissues. The number of well-correlated pairs in each GO term highlights several differences in genetic interactions between cancer and normal tissues. We performed a bootstrap analysis in order to compute false detection rates (FDR and confidence limits. Conclusion Experimental results show the main advantage of the applied method: it both picks up general and specific GO terms (in particular it shows a fine resolution in the specific GO terms. The results obtained by this novel method are highly coherent with the ones proposed by other cancer biology studies. But additionally they highlight the most specific and interesting GO terms helping the biologist to focus his/her studies on the most relevant biological processes.

  13. Use of bacterially expressed dsRNA to downregulate Entamoeba histolytica gene expression.

    Directory of Open Access Journals (Sweden)

    Carlos F Solis

    Full Text Available BACKGROUND: Modern RNA interference (RNAi methodologies using small interfering RNA (siRNA oligonucleotide duplexes or episomally synthesized hairpin RNA are valuable tools for the analysis of gene function in the protozoan parasite Entamoeba histolytica. However, these approaches still require time-consuming procedures including transfection and drug selection, or costly synthetic molecules. PRINCIPAL FINDINGS: Here we report an efficient and handy alternative for E. histolytica gene down-regulation mediated by bacterial double-stranded RNA (dsRNA targeting parasite genes. The Escherichia coli strain HT115 which is unable to degrade dsRNA, was genetically engineered to produce high quantities of long dsRNA segments targeting the genes that encode E. histolytica beta-tubulin and virulence factor KERP1. Trophozoites cultured in vitro were directly fed with dsRNA-expressing bacteria or soaked with purified dsRNA. Both dsRNA delivery methods resulted in significant reduction of protein expression. In vitro host cell-parasite assays showed that efficient downregulation of kerp1 gene expression mediated by bacterial dsRNA resulted in significant reduction of parasite adhesion and lytic capabilities, thus supporting a major role for KERP1 in the pathogenic process. Furthermore, treatment of trophozoites cultured in microtiter plates, with a repertoire of eighty-five distinct bacterial dsRNA segments targeting E. histolytica genes with unknown function, led to the identification of three genes potentially involved in the growth of the parasite. CONCLUSIONS: Our results showed that the use of bacterial dsRNA is a powerful method for the study of gene function in E. histolytica. This dsRNA delivery method is also technically suitable for the study of a large number of genes, thus opening interesting perspectives for the identification of novel drug and vaccine targets.

  14. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  15. Serious limitations of the QTL/Microarray approach for QTL gene discovery

    Directory of Open Access Journals (Sweden)

    Warden Craig H

    2010-07-01

    Full Text Available Abstract Background It has been proposed that the use of gene expression microarrays in nonrecombinant parental or congenic strains can accelerate the process of isolating individual genes underlying quantitative trait loci (QTL. However, the effectiveness of this approach has not been assessed. Results Thirty-seven studies that have implemented the QTL/microarray approach in rodents were reviewed. About 30% of studies showed enrichment for QTL candidates, mostly in comparisons between congenic and background strains. Three studies led to the identification of an underlying QTL gene. To complement the literature results, a microarray experiment was performed using three mouse congenic strains isolating the effects of at least 25 biometric QTL. Results show that genes in the congenic donor regions were preferentially selected. However, within donor regions, the distribution of differentially expressed genes was homogeneous once gene density was accounted for. Genes within identical-by-descent (IBD regions were less likely to be differentially expressed in chromosome 2, but not in chromosomes 11 and 17. Furthermore, expression of QTL regulated in cis (cis eQTL showed higher expression in the background genotype, which was partially explained by the presence of single nucleotide polymorphisms (SNP. Conclusions The literature shows limited successes from the QTL/microarray approach to identify QTL genes. Our own results from microarray profiling of three congenic strains revealed a strong tendency to select cis-eQTL over trans-eQTL. IBD regions had little effect on rate of differential expression, and we provide several reasons why IBD should not be used to discard eQTL candidates. In addition, mismatch probes produced false cis-eQTL that could not be completely removed with the current strains genotypes and low probe density microarrays. The reviewed studies did not account for lack of coverage from the platforms used and therefore removed genes

  16. Characterization of differentially expressed genes using high-dimensional co-expression networks

    DEFF Research Database (Denmark)

    Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

    2010-01-01

    We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...

  17. Regulation of eucaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Brent, R.; Ptashne, M.S

    1989-05-23

    This patent describes a method of regulating the expression of a gene in a eucaryotic cell. The method consists of: providing in the eucaryotic cell, a peptide, derived from or substantially similar to a peptide of a procaryotic cell able to bind to DNA upstream from or within the gene, the amount of the peptide being sufficient to bind to the gene and thereby control expression of the gene.

  18. Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

    Energy Technology Data Exchange (ETDEWEB)

    Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands); Pronk, Tessa E. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht (Netherlands); Brandhof, Evert-Jan van den [Centre for Environmental Quality, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Ven, Leo T.M. van der [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Piersma, Aldert H. [Centre for Health Protection, National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven (Netherlands); Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80.178, 3508 TD, Utrecht (Netherlands)

    2013-10-01

    The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol and saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.

  19. The use of molecular imaging of gene expression by radiotracers in gene therapy

    International Nuclear Information System (INIS)

    Richard-Fiardo, P.; Franken, P.R.; Harrington, K.J.; Vassaux, G.; Cambien, B.

    2011-01-01

    Introduction: Progress with gene-based therapies has been hampered by difficulties in monitoring the biodistribution and kinetics of vector-mediated gene expression. Recent developments in non-invasive imaging have allowed researchers and clinicians to assess the location, magnitude and persistence of gene expression in animals and humans. Such advances should eventually lead to improvement in the efficacy and safety of current clinical protocols for future treatments. Areas Covered: The molecular imaging techniques for monitoring gene therapy in the living subject, with a specific highlight on the key reporter gene approaches that have been developed and validated in preclinical models using the latest imaging modalities. The applications of molecular imaging to biotherapy, with a particular emphasis on monitoring of gene and vector biodistribution and on image-guided radiotherapy. Expert Opinion: Among the reporter gene/probe combinations that have been described so far, one stands out, in our view, as the most versatile and easy to implement: the Na/I symporter. This strategy, exploiting more than 50 years of experience in the treatment of differentiated thyroid carcinomas, has been validated in different types of experimental cancers and with different types of oncolytic viruses and is likely to become a key tool in the implementation of human gene therapy. (authors)

  20. Mining gene expression data by interpreting principal components

    Directory of Open Access Journals (Sweden)

    Mortazavi Ali

    2006-04-01

    Full Text Available Abstract Background There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. Results We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset. We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.. Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. Conclusion We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It

  1. Gene expression characterizes different nutritional strategies among three mixotrophic protists.

    Science.gov (United States)

    Liu, Zhenfeng; Campbell, Victoria; Heidelberg, Karla B; Caron, David A

    2016-07-01

    Mixotrophic protists, i.e. protists that can carry out both phototrophy and heterotrophy, are a group of organisms with a wide range of nutritional strategies. The ecological and biogeochemical importance of these species has recently been recognized. In this study, we investigated and compared the gene expression of three mixotrophic protists, Prymnesium parvum, Dinobyron sp. and Ochromonas sp. under light and dark conditions in the presence of prey using RNA-Seq. Gene expression of the obligately phototrophic P. parvum and Dinobryon sp. changed significantly between light and dark treatments, while that of primarily heterotrophic Ochromonas sp. was largely unchanged. Gene expression of P. parvum and Dinobryon sp. shared many similarities, especially in the expression patterns of genes related to reproduction. However, key genes involved in central carbon metabolism and phagotrophy had different expression patterns between these two species, suggesting differences in prey consumption and heterotrophic nutrition in the dark. Transcriptomic data also offered clues to other physiological traits of these organisms such as preference of nitrogen sources and photo-oxidative stress. These results provide potential target genes for further exploration of the mechanisms of mixotrophic physiology and demonstrate the potential usefulness of molecular approaches in characterizing the nutritional modes of mixotrophic protists. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  2. Geometry of the Gene Expression Space of Individual Cells.

    Directory of Open Access Journals (Sweden)

    Yael Korem

    2015-07-01

    Full Text Available There is a revolution in the ability to analyze gene expression of single cells in a tissue. To understand this data we must comprehend how cells are distributed in a high-dimensional gene expression space. One open question is whether cell types form discrete clusters or whether gene expression forms a continuum of states. If such a continuum exists, what is its geometry? Recent theory on evolutionary trade-offs suggests that cells that need to perform multiple tasks are arranged in a polygon or polyhedron (line, triangle, tetrahedron and so on, generally called polytopes in gene expression space, whose vertices are the expression profiles optimal for each task. Here, we analyze single-cell data from human and mouse tissues profiled using a variety of single-cell technologies. We fit the data to shapes with different numbers of vertices, compute their statistical significance, and infer their tasks. We find cases in which single cells fill out a continuum of expression states within a polyhedron. This occurs in intestinal progenitor cells, which fill out a tetrahedron in gene expression space. The four vertices of this tetrahedron are each enriched with genes for a specific task related to stemness and early differentiation. A polyhedral continuum of states is also found in spleen dendritic cells, known to perform multiple immune tasks: cells fill out a tetrahedron whose vertices correspond to key tasks related to maturation, pathogen sensing and communication with lymphocytes. A mixture of continuum-like distributions and discrete clusters is found in other cell types, including bone marrow and differentiated intestinal crypt cells. This approach can be used to understand the geometry and biological tasks of a wide range of single-cell datasets. The present results suggest that the concept of cell type may be expanded. In addition to discreet clusters in gene-expression space, we suggest a new possibility: a continuum of states within a

  3. Current approaches to gene regulatory network modelling

    Directory of Open Access Journals (Sweden)

    Brazma Alvis

    2007-09-01

    Full Text Available Abstract Many different approaches have been developed to model and simulate gene regulatory networks. We proposed the following categories for gene regulatory network models: network parts lists, network topology models, network control logic models, and dynamic models. Here we will describe some examples for each of these categories. We will study the topology of gene regulatory networks in yeast in more detail, comparing a direct network derived from transcription factor binding data and an indirect network derived from genome-wide expression data in mutants. Regarding the network dynamics we briefly describe discrete and continuous approaches to network modelling, then describe a hybrid model called Finite State Linear Model and demonstrate that some simple network dynamics can be simulated in this model.

  4. Spectral Analysis on Time-Course Expression Data: Detecting Periodic Genes Using a Real-Valued Iterative Adaptive Approach

    Directory of Open Access Journals (Sweden)

    Kwadwo S. Agyepong

    2013-01-01

    Full Text Available Time-course expression profiles and methods for spectrum analysis have been applied for detecting transcriptional periodicities, which are valuable patterns to unravel genes associated with cell cycle and circadian rhythm regulation. However, most of the proposed methods suffer from restrictions and large false positives to a certain extent. Additionally, in some experiments, arbitrarily irregular sampling times as well as the presence of high noise and small sample sizes make accurate detection a challenging task. A novel scheme for detecting periodicities in time-course expression data is proposed, in which a real-valued iterative adaptive approach (RIAA, originally proposed for signal processing, is applied for periodogram estimation. The inferred spectrum is then analyzed using Fisher’s hypothesis test. With a proper -value threshold, periodic genes can be detected. A periodic signal, two nonperiodic signals, and four sampling strategies were considered in the simulations, including both bursts and drops. In addition, two yeast real datasets were applied for validation. The simulations and real data analysis reveal that RIAA can perform competitively with the existing algorithms. The advantage of RIAA is manifested when the expression data are highly irregularly sampled, and when the number of cycles covered by the sampling time points is very reduced.

  5. Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer

    Science.gov (United States)

    Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash

    2016-01-01

    Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499

  6. Heart morphogenesis gene regulatory networks revealed by temporal expression analysis.

    Science.gov (United States)

    Hill, Jonathon T; Demarest, Bradley; Gorsi, Bushra; Smith, Megan; Yost, H Joseph

    2017-10-01

    During embryogenesis the heart forms as a linear tube that then undergoes multiple simultaneous morphogenetic events to obtain its mature shape. To understand the gene regulatory networks (GRNs) driving this phase of heart development, during which many congenital heart disease malformations likely arise, we conducted an RNA-seq timecourse in zebrafish from 30 hpf to 72 hpf and identified 5861 genes with altered expression. We clustered the genes by temporal expression pattern, identified transcription factor binding motifs enriched in each cluster, and generated a model GRN for the major gene batteries in heart morphogenesis. This approach predicted hundreds of regulatory interactions and found batteries enriched in specific cell and tissue types, indicating that the approach can be used to narrow the search for novel genetic markers and regulatory interactions. Subsequent analyses confirmed the GRN using two mutants, Tbx5 and nkx2-5 , and identified sets of duplicated zebrafish genes that do not show temporal subfunctionalization. This dataset provides an essential resource for future studies on the genetic/epigenetic pathways implicated in congenital heart defects and the mechanisms of cardiac transcriptional regulation. © 2017. Published by The Company of Biologists Ltd.

  7. Global gene expression analysis of the zoonotic parasite Trichinella spiralis revealed novel genes in host parasite interaction.

    Directory of Open Access Journals (Sweden)

    Xiaolei Liu

    Full Text Available BACKGROUND: Trichinellosis is a typical food-borne zoonotic disease which is epidemic worldwide and the nematode Trichinella spiralis is the main pathogen. The life cycle of T. spiralis contains three developmental stages, i.e. adult worms, new borne larva (new borne L1 larva and muscular larva (infective L1 larva. Stage-specific gene expression in the parasites has been investigated with various immunological and cDNA cloning approaches, whereas the genome-wide transcriptome and expression features of the parasite have been largely unknown. The availability of the genome sequence information of T. spiralis has made it possible to deeply dissect parasite biology in association with global gene expression and pathogenesis. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we analyzed the global gene expression patterns in the three developmental stages of T. spiralis using digital gene expression (DGE analysis. Almost 15 million sequence tags were generated with the Illumina RNA-seq technology, producing expression data for more than 9,000 genes, covering 65% of the genome. The transcriptome analysis revealed thousands of differentially expressed genes within the genome, and importantly, a panel of genes encoding functional proteins associated with parasite invasion and immuno-modulation were identified. More than 45% of the genes were found to be transcribed from both strands, indicating the importance of RNA-mediated gene regulation in the development of the parasite. Further, based on gene ontological analysis, over 3000 genes were functionally categorized and biological pathways in the three life cycle stage were elucidated. CONCLUSIONS AND SIGNIFICANCE: The global transcriptome of T. spiralis in three developmental stages has been profiled, and most gene activity in the genome was found to be developmentally regulated. Many metabolic and biological pathways have been revealed. The findings of the differential expression of several protein

  8. Evaluation of a nanotechnology-based approach to induce gene-expression in human THP-1 macrophages under inflammatory conditions.

    Science.gov (United States)

    Bernal, Laura; Alvarado-Vázquez, Abigail; Ferreira, David Wilson; Paige, Candler A; Ulecia-Morón, Cristina; Hill, Bailey; Caesar, Marina; Romero-Sandoval, E Alfonso

    2017-02-01

    Macrophages orchestrate the initiation and resolution of inflammation by producing pro- and anti-inflammatory products. An imbalance in these mediators may originate from a deficient or excessive immune response. Therefore, macrophages are valid therapeutic targets to restore homeostasis under inflammatory conditions. We hypothesize that a specific mannosylated nanoparticle effectively induces gene expression in human macrophages under inflammatory conditions without undesirable immunogenic responses. THP-1 macrophages were challenged with lipopolysaccharide (LPS, 5μg/mL). Polyethylenimine (PEI) nanoparticles grafted with a mannose receptor ligand (Man-PEI) were used as a gene delivery method. Nanoparticle toxicity, Man-PEI cellular uptake rate and gene induction efficiency (GFP, CD14 or CD68) were studied. Potential immunogenic responses were evaluated by measuring the production of tumor necrosis factor-alpha (TNF-α), Interleukin (IL)-6 and IL-10. Man-PEI did not produce cytotoxicity, and it was effectively up-taken by THP-1 macrophages (69%). This approach produced a significant expression of GFP (mRNA and protein), CD14 and CD68 (mRNA), and transiently and mildly reduced IL-6 and IL-10 levels in LPS-challenged macrophages. Our results indicate that Man-PEI is suitable for inducing an efficient gene overexpression in human macrophages under inflammatory conditions with limited immunogenic responses. Our promising results set the foundation to test this technology to induce functional anti-inflammatory genes. Copyright © 2016 Elsevier GmbH. All rights reserved.

  9. Inferring causal genomic alterations in breast cancer using gene expression data

    Science.gov (United States)

    2011-01-01

    Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811

  10. A novel approach to select differential pathways associated with hypertrophic cardiomyopathy based on gene co‑expression analysis.

    Science.gov (United States)

    Chen, Xiao-Min; Feng, Ming-Jun; Shen, Cai-Jie; He, Bin; Du, Xian-Feng; Yu, Yi-Bo; Liu, Jing; Chu, Hui-Min

    2017-07-01

    The present study was designed to develop a novel method for identifying significant pathways associated with human hypertrophic cardiomyopathy (HCM), based on gene co‑expression analysis. The microarray dataset associated with HCM (E‑GEOD‑36961) was obtained from the European Molecular Biology Laboratory‑European Bioinformatics Institute database. Informative pathways were selected based on the Reactome pathway database and screening treatments. An empirical Bayes method was utilized to construct co‑expression networks for informative pathways, and a weight value was assigned to each pathway. Differential pathways were extracted based on weight threshold, which was calculated using a random model. In order to assess whether the co‑expression method was feasible, it was compared with traditional pathway enrichment analysis of differentially expressed genes, which were identified using the significance analysis of microarrays package. A total of 1,074 informative pathways were screened out for subsequent investigations and their weight values were also obtained. According to the threshold of weight value of 0.01057, 447 differential pathways, including folding of actin by chaperonin containing T‑complex protein 1 (CCT)/T‑complex protein 1 ring complex (TRiC), purine ribonucleoside monophosphate biosynthesis and ubiquinol biosynthesis, were obtained. Compared with traditional pathway enrichment analysis, the number of pathways obtained from the co‑expression approach was increased. The results of the present study demonstrated that this method may be useful to predict marker pathways for HCM. The pathways of folding of actin by CCT/TRiC and purine ribonucleoside monophosphate biosynthesis may provide evidence of the underlying molecular mechanisms of HCM, and offer novel therapeutic directions for HCM.

  11. Identification and expression analysis of cold and freezing stress responsive genes of Brassica oleracea.

    Science.gov (United States)

    Ahmed, Nasar Uddin; Jung, Hee-Jeong; Park, Jong-In; Cho, Yong-Gu; Hur, Yoonkang; Nou, Ill-Sup

    2015-01-10

    Cold and freezing stress is a major environmental constraint to the production of Brassica crops. Enhancement of tolerance by exploiting cold and freezing tolerance related genes offers the most efficient approach to address this problem. Cold-induced transcriptional profiling is a promising approach to the identification of potential genes related to cold and freezing stress tolerance. In this study, 99 highly expressed genes were identified from a whole genome microarray dataset of Brassica rapa. Blast search analysis of the Brassica oleracea database revealed the corresponding homologous genes. To validate their expression, pre-selected cold tolerant and susceptible cabbage lines were analyzed. Out of 99 BoCRGs, 43 were differentially expressed in response to varying degrees of cold and freezing stress in the contrasting cabbage lines. Among the differentially expressed genes, 18 were highly up-regulated in the tolerant lines, which is consistent with their microarray expression. Additionally, 12 BoCRGs were expressed differentially after cold stress treatment in two contrasting cabbage lines, and BoCRG54, 56, 59, 62, 70, 72 and 99 were predicted to be involved in cold regulatory pathways. Taken together, the cold-responsive genes identified in this study provide additional direction for elucidating the regulatory network of low temperature stress tolerance and developing cold and freezing stress resistant Brassica crops. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Adaptive Evolution of Gene Expression in Drosophila.

    Science.gov (United States)

    Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

    2017-08-08

    Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Adaptive Evolution of Gene Expression in Drosophila

    Directory of Open Access Journals (Sweden)

    Armita Nourmohammad

    2017-08-01

    Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.

  14. Identification of transcription-factor genes expressed in the Arabidopsis female gametophyte

    Directory of Open Access Journals (Sweden)

    Kang Il-Ho

    2010-06-01

    study have not been reported previously as being expressed in the female gametophyte. Therefore, they might represent novel regulators and provide entry points for reverse genetic and molecular approaches to uncover the gene regulatory networks underlying female gametophyte development.

  15. iGC-an integrated analysis package of gene expression and copy number alteration.

    Science.gov (United States)

    Lai, Yi-Pin; Wang, Liang-Bo; Wang, Wei-An; Lai, Liang-Chuan; Tsai, Mong-Hsun; Lu, Tzu-Pin; Chuang, Eric Y

    2017-01-14

    With the advancement in high-throughput technologies, researchers can simultaneously investigate gene expression and copy number alteration (CNA) data from individual patients at a lower cost. Traditional analysis methods analyze each type of data individually and integrate their results using Venn diagrams. Challenges arise, however, when the results are irreproducible and inconsistent across multiple platforms. To address these issues, one possible approach is to concurrently analyze both gene expression profiling and CNAs in the same individual. We have developed an open-source R/Bioconductor package (iGC). Multiple input formats are supported and users can define their own criteria for identifying differentially expressed genes driven by CNAs. The analysis of two real microarray datasets demonstrated that the CNA-driven genes identified by the iGC package showed significantly higher Pearson correlation coefficients with their gene expression levels and copy numbers than those genes located in a genomic region with CNA. Compared with the Venn diagram approach, the iGC package showed better performance. The iGC package is effective and useful for identifying CNA-driven genes. By simultaneously considering both comparative genomic and transcriptomic data, it can provide better understanding of biological and medical questions. The iGC package's source code and manual are freely available at https://www.bioconductor.org/packages/release/bioc/html/iGC.html .

  16. Changes in gene expression linked to methamphetamine-induced dopaminergic neurotoxicity.

    Science.gov (United States)

    Xie, Tao; Tong, Liqiong; Barrett, Tanya; Yuan, Jie; Hatzidimitriou, George; McCann, Una D; Becker, Kevin G; Donovan, David M; Ricaurte, George A

    2002-01-01

    The purpose of these studies was to examine the role of gene expression in methamphetamine (METH)-induced dopamine (DA) neurotoxicity. First, the effects of the mRNA synthesis inhibitor, actinomycin-D, and the protein synthesis inhibitor, cycloheximide, were examined. Both agents afforded complete protection against METH-induced DA neurotoxicity and did so independently of effects on core temperature, DA transporter function, or METH brain levels, suggesting that gene transcription and mRNA translation play a role in METH neurotoxicity. Next, microarray technology, in combination with an experimental approach designed to facilitate recognition of relevant gene expression patterns, was used to identify gene products linked to METH-induced DA neurotoxicity. This led to the identification of several genes in the ventral midbrain associated with the neurotoxic process, including genes for energy metabolism [cytochrome c oxidase subunit 1 (COX1), reduced nicotinamide adenine dinucleotide ubiquinone oxidoreductase chain 2, and phosphoglycerate mutase B], ion regulation (members of sodium/hydrogen exchanger and sodium/bile acid cotransporter family), signal transduction (adenylyl cyclase III), and cell differentiation and degeneration (N-myc downstream-regulated gene 3 and tau protein). Of these differentially expressed genes, we elected to further examine the increase in COX1 expression, because of data implicating energy utilization in METH neurotoxicity and the known role of COX1 in energy metabolism. On the basis of time course studies, Northern blot analyses, in situ hybridization results, and temperature studies, we now report that increased COX1 expression in the ventral midbrain is linked to METH-induced DA neuronal injury. The precise role of COX1 and other genes in METH neurotoxicity remains to be elucidated.

  17. Synergistic Effect of Auto-Activation and Small RNA Regulation on Gene Expression

    Science.gov (United States)

    Xiong, Li-Ping; Ma, Yu-Qiang; Tang, Lei-Han

    2010-09-01

    Auto-activation and small ribonucleic acid (RNA)-mediated regulation are two important mechanisms in controlling gene expression. We study the synergistic effect of these two regulations on gene expression. It is found that under this combinatorial regulation, gene expression exhibits bistable behaviors at the transition regime, while each of these two regulations, if working solely, only leads to monostability. Within the stochastic framework, the base pairing strength between sRNA and mRNA plays an important role in controlling the transition time between on and off states. The noise strength of protein number in the off state approaches 1 and is smaller than that in the on state. The noise strength also depends on which parameters, the feedback strength or the synthesis rate of small RNA, are tuned in switching the gene expression on and off. Our findings may provide a new insight into gene-regulation mechanism and can be applied in synthetic biology.

  18. Synergistic Effect of Auto-Activation and Small RNA Regulation on Gene Expression

    International Nuclear Information System (INIS)

    Li-Ping, Xiong; Yu-Qiang, Ma; Lei-Han, Tang

    2010-01-01

    Auto-activation and small ribonucleic acid (RNA)-mediated regulation are two important mechanisms in controlling gene expression. We study the synergistic effect of these two regulations on gene expression. It is found that under this combinatorial regulation, gene expression exhibits bistable behaviors at the transition regime, while each of these two regulations, if working solely, only leads to monostability. Within the stochastic framework, the base pairing strength between sRNA and mRNA plays an important role in controlling the transition time between on and off states. The noise strength of protein number in the off state approaches 1 and is smaller than that in the on state. The noise strength also depends on which parameters, the feedback strength or the synthesis rate of small RNA, are tuned in switching the gene expression on and off. Our findings may provide a new insight into gene-regulation mechanism and can be applied in synthetic biology

  19. Spatial reconstruction of single-cell gene expression data.

    Science.gov (United States)

    Satija, Rahul; Farrell, Jeffrey A; Gennert, David; Schier, Alexander F; Regev, Aviv

    2015-05-01

    Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.

  20. Gene selection for the reconstruction of stem cell differentiation trees: a linear programming approach.

    Science.gov (United States)

    Ghadie, Mohamed A; Japkowicz, Nathalie; Perkins, Theodore J

    2015-08-15

    Stem cell differentiation is largely guided by master transcriptional regulators, but it also depends on the expression of other types of genes, such as cell cycle genes, signaling genes, metabolic genes, trafficking genes, etc. Traditional approaches to understanding gene expression patterns across multiple conditions, such as principal components analysis or K-means clustering, can group cell types based on gene expression, but they do so without knowledge of the differentiation hierarchy. Hierarchical clustering can organize cell types into a tree, but in general this tree is different from the differentiation hierarchy itself. Given the differentiation hierarchy and gene expression data at each node, we construct a weighted Euclidean distance metric such that the minimum spanning tree with respect to that metric is precisely the given differentiation hierarchy. We provide a set of linear constraints that are provably sufficient for the desired construction and a linear programming approach to identify sparse sets of weights, effectively identifying genes that are most relevant for discriminating different parts of the tree. We apply our method to microarray gene expression data describing 38 cell types in the hematopoiesis hierarchy, constructing a weighted Euclidean metric that uses just 175 genes. However, we find that there are many alternative sets of weights that satisfy the linear constraints. Thus, in the style of random-forest training, we also construct metrics based on random subsets of the genes and compare them to the metric of 175 genes. We then report on the selected genes and their biological functions. Our approach offers a new way to identify genes that may have important roles in stem cell differentiation. tperkins@ohri.ca Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus

    Directory of Open Access Journals (Sweden)

    Victoria L. Pritchard

    2017-01-01

    Full Text Available Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus, an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats.

  2. An integrative approach to inferring biologically meaningful gene modules

    Directory of Open Access Journals (Sweden)

    Wang Kai

    2011-07-01

    Full Text Available Abstract Background The ability to construct biologically meaningful gene networks and modules is critical for contemporary systems biology. Though recent studies have demonstrated the power of using gene modules to shed light on the functioning of complex biological systems, most modules in these networks have shown little association with meaningful biological function. We have devised a method which directly incorporates gene ontology (GO annotation in construction of gene modules in order to gain better functional association. Results We have devised a method, Semantic Similarity-Integrated approach for Modularization (SSIM that integrates various gene-gene pairwise similarity values, including information obtained from gene expression, protein-protein interactions and GO annotations, in the construction of modules using affinity propagation clustering. We demonstrated the performance of the proposed method using data from two complex biological responses: 1. the osmotic shock response in Saccharomyces cerevisiae, and 2. the prion-induced pathogenic mouse model. In comparison with two previously reported algorithms, modules identified by SSIM showed significantly stronger association with biological functions. Conclusions The incorporation of semantic similarity based on GO annotation with gene expression and protein-protein interaction data can greatly enhance the functional relevance of inferred gene modules. In addition, the SSIM approach can also reveal the hierarchical structure of gene modules to gain a broader functional view of the biological system. Hence, the proposed method can facilitate comprehensive and in-depth analysis of high throughput experimental data at the gene network level.

  3. Shaped 3D Singular Spectrum Analysis for Quantifying Gene Expression, with Application to the Early Zebrafish Embryo

    Directory of Open Access Journals (Sweden)

    Alex Shlemov

    2015-01-01

    Full Text Available Recent progress in microscopy technologies, biological markers, and automated processing methods is making possible the development of gene expression atlases at cellular-level resolution over whole embryos. Raw data on gene expression is usually very noisy. This noise comes from both experimental (technical/methodological and true biological sources (from stochastic biochemical processes. In addition, the cells or nuclei being imaged are irregularly arranged in 3D space. This makes the processing, extraction, and study of expression signals and intrinsic biological noise a serious challenge for 3D data, requiring new computational approaches. Here, we present a new approach for studying gene expression in nuclei located in a thick layer around a spherical surface. The method includes depth equalization on the sphere, flattening, interpolation to a regular grid, pattern extraction by Shaped 3D singular spectrum analysis (SSA, and interpolation back to original nuclear positions. The approach is demonstrated on several examples of gene expression in the zebrafish egg (a model system in vertebrate development. The method is tested on several different data geometries (e.g., nuclear positions and different forms of gene expression patterns. Fully 3D datasets for developmental gene expression are becoming increasingly available; we discuss the prospects of applying 3D-SSA to data processing and analysis in this growing field.

  4. A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

    Science.gov (United States)

    Zhang, Qingyang

    2018-05-16

    Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.

  5. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta

    Directory of Open Access Journals (Sweden)

    Clark Taane G

    2010-04-01

    Full Text Available Abstract Background Imprinted genes show expression from one parental allele only and are important for development and behaviour. This extreme mode of allelic imbalance has been described for approximately 56 human genes. Imprinting status is often disrupted in cancer and dysmorphic syndromes. More subtle variation of gene expression, that is not parent-of-origin specific, termed 'allele-specific gene expression' (ASE is more common and may give rise to milder phenotypic differences. Using two allele-specific high-throughput technologies alongside bioinformatics predictions, normal term human placenta was screened to find new imprinted genes and to ascertain the extent of ASE in this tissue. Results Twenty-three family trios of placental cDNA, placental genomic DNA (gDNA and gDNA from both parents were tested for 130 candidate genes with the Sequenom MassArray system. Six genes were found differentially expressed but none imprinted. The Illumina ASE BeadArray platform was then used to test 1536 SNPs in 932 genes. The array was enriched for the human orthologues of 124 mouse candidate genes from bioinformatics predictions and 10 human candidate imprinted genes from EST database mining. After quality control pruning, a total of 261 informative SNPs (214 genes remained for analysis. Imprinting with maternal expression was demonstrated for the lymphocyte imprinted gene ZNF331 in human placenta. Two potential differentially methylated regions (DMRs were found in the vicinity of ZNF331. None of the bioinformatically predicted candidates tested showed imprinting except for a skewed allelic expression in a parent-specific manner observed for PHACTR2, a neighbour of the imprinted PLAGL1 gene. ASE was detected for two or more individuals in 39 candidate genes (18%. Conclusions Both Sequenom and Illumina assays were sensitive enough to study imprinting and strong allelic bias. Previous bioinformatics approaches were not predictive of new imprinted genes

  6. Multiplex cDNA quantification method that facilitates the standardization of gene expression data

    Science.gov (United States)

    Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira

    2011-01-01

    Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because of the microarray method can provide merely the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008

  7. Identifying potential maternal genes of Bombyx mori using digital gene expression profiling

    Science.gov (United States)

    Xu, Pingzhen

    2018-01-01

    Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160

  8. A Pathway Based Classification Method for Analyzing Gene Expression for Alzheimer's Disease Diagnosis.

    Science.gov (United States)

    Voyle, Nicola; Keohane, Aoife; Newhouse, Stephen; Lunnon, Katie; Johnston, Caroline; Soininen, Hilkka; Kloszewska, Iwona; Mecocci, Patrizia; Tsolaki, Magda; Vellas, Bruno; Lovestone, Simon; Hodges, Angela; Kiddle, Steven; Dobson, Richard Jb

    2016-01-01

    Recent studies indicate that gene expression levels in blood may be able to differentiate subjects with Alzheimer's disease (AD) from normal elderly controls and mild cognitively impaired (MCI) subjects. However, there is limited replicability at the single marker level. A pathway-based interpretation of gene expression may prove more robust. This study aimed to investigate whether a case/control classification model built on pathway level data was more robust than a gene level model and may consequently perform better in test data. The study used two batches of gene expression data from the AddNeuroMed (ANM) and Dementia Case Registry (DCR) cohorts. Our study used Illumina Human HT-12 Expression BeadChips to collect gene expression from blood samples. Random forest modeling with recursive feature elimination was used to predict case/control status. Age and APOE ɛ4 status were used as covariates for all analysis. Gene and pathway level models performed similarly to each other and to a model based on demographic information only. Any potential increase in concordance from the novel pathway level approach used here has not lead to a greater predictive ability in these datasets. However, we have only tested one method for creating pathway level scores. Further, we have been able to benchmark pathways against genes in datasets that had been extensively harmonized. Further work should focus on the use of alternative methods for creating pathway level scores, in particular those that incorporate pathway topology, and the use of an endophenotype based approach.

  9. In vitro analysis of integrated global high-resolution DNA methylation profiling with genomic imbalance and gene expression in osteosarcoma.

    Directory of Open Access Journals (Sweden)

    Bekim Sadikovic

    Full Text Available Genetic and epigenetic changes contribute to deregulation of gene expression and development of human cancer. Changes in DNA methylation are key epigenetic factors regulating gene expression and genomic stability. Recent progress in microarray technologies resulted in developments of high resolution platforms for profiling of genetic, epigenetic and gene expression changes. OS is a pediatric bone tumor with characteristically high level of numerical and structural chromosomal changes. Furthermore, little is known about DNA methylation changes in OS. Our objective was to develop an integrative approach for analysis of high-resolution epigenomic, genomic, and gene expression profiles in order to identify functional epi/genomic differences between OS cell lines and normal human osteoblasts. A combination of Affymetrix Promoter Tilling Arrays for DNA methylation, Agilent array-CGH platform for genomic imbalance and Affymetrix Gene 1.0 platform for gene expression analysis was used. As a result, an integrative high-resolution approach for interrogation of genome-wide tumour-specific changes in DNA methylation was developed. This approach was used to provide the first genomic DNA methylation maps, and to identify and validate genes with aberrant DNA methylation in OS cell lines. This first integrative analysis of global cancer-related changes in DNA methylation, genomic imbalance, and gene expression has provided comprehensive evidence of the cumulative roles of epigenetic and genetic mechanisms in deregulation of gene expression networks.

  10. Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.

    Directory of Open Access Journals (Sweden)

    Mika Gustafsson

    Full Text Available BACKGROUND: To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem, however, is how to assess and discriminate among the various algorithms which have been developed for this purpose. Therefore, the DREAM project invited the year 2008 to a challenge for predicting gene expression values, and here we present the algorithm with best performance. METHODOLOGY/PRINCIPAL FINDINGS: We develop an algorithm by exploring various regression schemes with different model selection procedures. It turns out that the most effective scheme is based on least squares, with a penalty term of a recently developed form called the "elastic net". Key components in the algorithm are the integration of expression data from other experimental conditions than those presented for the challenge and the utilization of transcription factor binding data for guiding the inference process towards known interactions. Of importance is also a cross-validation procedure where each form of external data is used only to the extent it increases the expected performance. CONCLUSIONS/SIGNIFICANCE: Our algorithm proves both the possibility to extract information from large-scale expression data concerning prediction of gene levels, as well as the benefits of integrating different data sources for improving the inference. We believe the former is an important message to those still hesitating on the possibilities for computational approaches, while the latter is part of an important way forward for the future development of the field of computational systems biology.

  11. Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection

    KAUST Repository

    Dhavala, Soma S.

    2010-09-01

    Massively Parallel Signature Sequencing (MPSS) is a high-throughput, counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflatedPoisson distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using nonparametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries. This article has supplementary materials online. © 2010 American Statistical Association.

  12. Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection

    KAUST Repository

    Dhavala, Soma S.; Datta, Sujay; Mallick, Bani K.; Carroll, Raymond J.; Khare, Sangeeta; Lawhon, Sara D.; Adams, L. Garry

    2010-01-01

    Massively Parallel Signature Sequencing (MPSS) is a high-throughput, counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflatedPoisson distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using nonparametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries. This article has supplementary materials online. © 2010 American Statistical Association.

  13. Adipose gene expression prior to weight loss can differentiate and weakly predict dietary responders.

    Directory of Open Access Journals (Sweden)

    David M Mutch

    Full Text Available BACKGROUND: The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. METHODOLOGY/PRINCIPAL FINDINGS: The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8-12 kgs weight loss could always be differentiated from non-responders (<4 kgs weight loss. We also assessed whether this differentiation was sufficient for prediction. Using a bottom-up (i.e. black-box approach, standard class prediction algorithms were able to predict dietary responders with up to 61.1%+/-8.1% accuracy. Using a top-down approach (i.e. using differentially expressed genes to build a classifier improved prediction accuracy to 80.9%+/-2.2%. CONCLUSION: Adipose gene expression profiling prior to the consumption of a low-fat diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition.

  14. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  15. Expression of Sox genes in tooth development.

    Science.gov (United States)

    Kawasaki, Katsushige; Kawasaki, Maiko; Watanabe, Momoko; Idrus, Erik; Nagai, Takahiro; Oommen, Shelly; Maeda, Takeyasu; Hagiwara, Nobuko; Que, Jianwen; Sharpe, Paul T; Ohazama, Atsushi

    2015-01-01

    Members of the Sox gene family play roles in many biological processes including organogenesis. We carried out comparative in situ hybridization analysis of seventeen sox genes (Sox1-14, 17, 18, 21) during murine odontogenesis from the epithelial thickening to the cytodifferentiation stages. Localized expression of five Sox genes (Sox6, 9, 13, 14 and 21) was observed in tooth bud epithelium. Sox13 showed restricted expression in the primary enamel knots. At the early bell stage, three Sox genes (Sox8, 11, 17 and 21) were expressed in pre-ameloblasts, whereas two others (Sox5 and 18) showed expression in odontoblasts. Sox genes thus showed a dynamic spatio-temporal expression during tooth development.

  16. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis.

    Directory of Open Access Journals (Sweden)

    Gabriela D A Guardia

    Full Text Available Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.

  17. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis.

    Science.gov (United States)

    Guardia, Gabriela D A; Pires, Luís Ferreira; Vêncio, Ricardo Z N; Malmegrim, Kelen C R; de Farias, Cléver R G

    2015-01-01

    Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.

  18. Gene expression profiling--Opening the black box of plant ecosystem responses to global change

    Energy Technology Data Exchange (ETDEWEB)

    Leakey, A.D.B.; Ainsworth, E.A.; Bernard, S.M.; Markelz, R.J.C.; Ort, D.R.; Placella, S.A.P.; Rogers, A.; Smith, M.D.; Sudderth, E.A.; Weston, D.J.; Wullschleger, S.D.; Yuan, S.

    2009-11-01

    The use of genomic techniques to address ecological questions is emerging as the field of genomic ecology. Experimentation under environmentally realistic conditions to investigate the molecular response of plants to meaningful changes in growth conditions and ecological interactions is the defining feature of genomic ecology. Since the impact of global change factors on plant performance are mediated by direct effects at the molecular, biochemical and physiological scales, gene expression analysis promises important advances in understanding factors that have previously been consigned to the 'black box' of unknown mechanism. Various tools and approaches are available for assessing gene expression in model and non-model species as part of global change biology studies. Each approach has its own unique advantages and constraints. A first generation of genomic ecology studies in managed ecosystems and mesocosms have provided a testbed for the approach and have begun to reveal how the experimental design and data analysis of gene expression studies can be tailored for use in an ecological context.

  19. Profiling Gene Expression in Germinating Brassica Roots.

    Science.gov (United States)

    Park, Myoung Ryoul; Wang, Yi-Hong; Hasenstein, Karl H

    2014-01-01

    Based on previously developed solid-phase gene extraction (SPGE) we examined the mRNA profile in primary roots of Brassica rapa seedlings for highly expressed genes like ACT7 (actin7), TUB (tubulin1), UBQ (ubiquitin), and low expressed GLK (glucokinase) during the first day post-germination. The assessment was based on the mRNA load of the SPGE probe of about 2.1 ng. The number of copies of the investigated genes changed spatially along the length of primary roots. The expression level of all genes differed significantly at each sample position. Among the examined genes ACT7 expression was most even along the root. UBQ was highest at the tip and root-shoot junction (RS). TUB and GLK showed a basipetal gradient. The temporal expression of UBQ was highest in the MZ 9 h after primary root emergence and higher than at any other sample position. Expressions of GLK in EZ and RS increased gradually over time. SPGE extraction is the result of oligo-dT and oligo-dA hybridization and the results illustrate that SPGE can be used for gene expression profiling at high spatial and temporal resolution. SPGE needles can be used within two weeks when stored at 4 °C. Our data indicate that gene expression studies that are based on the entire root miss important differences in gene expression that SPGE is able to resolve for example growth adjustments during gravitropism.

  20. Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

    Directory of Open Access Journals (Sweden)

    Lucie Kosinová

    Full Text Available The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3 in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information

  1. Haloperidol induces pharmacoepigenetic response by modulating miRNA expression, global DNA methylation and expression profiles of methylation maintenance genes and genes involved in neurotransmission in neuronal cells.

    Science.gov (United States)

    Swathy, Babu; Banerjee, Moinak

    2017-01-01

    Haloperidol has been extensively used in various psychiatric conditions. It has also been reported to induce severe side effects. We aimed to evaluate whether haloperidol can influence host methylome, and if so what are the possible mechanisms for it in neuronal cells. Impact on host methylome and miRNAs can have wide spread alterations in gene expression, which might possibly help in understanding how haloperidol may impact treatment response or induce side effects. SK-N-SH, a neuroblasoma cell line was treated with haloperidol at 10μm concentration for 24 hours and global DNA methylation was evaluated. Methylation at global level is maintained by methylation maintenance machinery and certain miRNAs. Therefore, the expression of methylation maintenance genes and their putative miRNA expression profiles were assessed. These global methylation alterations could result in gene expression changes. Therefore genes expressions for neurotransmitter receptors, regulators, ion channels and transporters were determined. Subsequently, we were also keen to identify a strong candidate miRNA based on biological and in-silico approach which can reflect on the pharmacoepigenetic trait of haloperidol and can also target the altered neuroscience panel of genes used in the study. Haloperidol induced increase in global DNA methylation which was found to be associated with corresponding increase in expression of various epigenetic modifiers that include DNMT1, DNMT3A, DNMT3B and MBD2. The expression of miR-29b that is known to putatively regulate the global methylation by modulating the expression of epigenetic modifiers was observed to be down regulated by haloperidol. In addition to miR-29b, miR-22 was also found to be downregulated by haloperidol treatment. Both these miRNA are known to putatively target several genes associated with various epigenetic modifiers, pharmacogenes and neurotransmission. Interestingly some of these putative target genes involved in neurotransmission

  2. Haloperidol induces pharmacoepigenetic response by modulating miRNA expression, global DNA methylation and expression profiles of methylation maintenance genes and genes involved in neurotransmission in neuronal cells.

    Directory of Open Access Journals (Sweden)

    Babu Swathy

    Full Text Available Haloperidol has been extensively used in various psychiatric conditions. It has also been reported to induce severe side effects. We aimed to evaluate whether haloperidol can influence host methylome, and if so what are the possible mechanisms for it in neuronal cells. Impact on host methylome and miRNAs can have wide spread alterations in gene expression, which might possibly help in understanding how haloperidol may impact treatment response or induce side effects.SK-N-SH, a neuroblasoma cell line was treated with haloperidol at 10μm concentration for 24 hours and global DNA methylation was evaluated. Methylation at global level is maintained by methylation maintenance machinery and certain miRNAs. Therefore, the expression of methylation maintenance genes and their putative miRNA expression profiles were assessed. These global methylation alterations could result in gene expression changes. Therefore genes expressions for neurotransmitter receptors, regulators, ion channels and transporters were determined. Subsequently, we were also keen to identify a strong candidate miRNA based on biological and in-silico approach which can reflect on the pharmacoepigenetic trait of haloperidol and can also target the altered neuroscience panel of genes used in the study.Haloperidol induced increase in global DNA methylation which was found to be associated with corresponding increase in expression of various epigenetic modifiers that include DNMT1, DNMT3A, DNMT3B and MBD2. The expression of miR-29b that is known to putatively regulate the global methylation by modulating the expression of epigenetic modifiers was observed to be down regulated by haloperidol. In addition to miR-29b, miR-22 was also found to be downregulated by haloperidol treatment. Both these miRNA are known to putatively target several genes associated with various epigenetic modifiers, pharmacogenes and neurotransmission. Interestingly some of these putative target genes involved in

  3. Detection of growth hormone doping by gene expression profiling of peripheral blood.

    Science.gov (United States)

    Mitchell, Christopher J; Nelson, Anne E; Cowley, Mark J; Kaplan, Warren; Stone, Glenn; Sutton, Selina K; Lau, Amie; Lee, Carol M Y; Ho, Ken K Y

    2009-12-01

    GH abuse is a significant problem in many sports, and there is currently no robust test that allows detection of doping beyond a short window after administration. Our objective was to evaluate gene expression profiling in peripheral blood leukocytes in-vivo as a test for GH doping in humans. Seven men and thirteen women were administered GH, 2 mg/d sc for 8 wk. Blood was collected at baseline and at 8 wk. RNA was extracted from the white cell fraction. Microarray analysis was undertaken using Agilent 44K G4112F arrays using a two-color design. Quantitative RT-PCR using TaqMan gene expression assays was performed for validation of selected differentially expressed genes. GH induced an approximately 2-fold increase in circulating IGF-I that was maintained throughout the 8 wk of the study. GH induced significant changes in gene expression with 353 in women and 41 in men detected with a false discovery rate of less than 5%. None of the differentially expressed genes were common between men and women. The maximal changes were a doubling for up-regulated or halving for down-regulated genes, similar in magnitude to the variation between individuals. Quantitative RT-PCR for seven target genes showed good concordance between microarray and quantitative PCR data in women but not in men. Gene expression analysis of peripheral blood leukocytes is unlikely to be a viable approach for the detection of GH doping.

  4. Personality and gene expression: Do individual differences exist in the leukocyte transcriptome?

    Science.gov (United States)

    Vedhara, Kavita; Gill, Sana; Eldesouky, Lameese; Campbell, Bruce K; Arevalo, Jesusa M G; Ma, Jeffrey; Cole, Steven W

    2015-02-01

    The temporal and situational stability of personality has led generations of researchers to hypothesize that personality may have enduring effects on health, but the biological mechanisms of such relationships remain poorly understood. In the present study, we utilized a functional genomics approach to examine the relationship between the 5 major dimensions of personality and patterns of gene expression as predicted by 'behavioural immune response' theory. We specifically focussed on two sets of genes previously linked to stress, threat, and adverse socio-environmental conditions: pro-inflammatory genes and genes involved in Type I interferon and antibody responses. An opportunity sample of 121 healthy individuals was recruited (86 females; mean age 24 years). Individuals completed a validated measure of personality; questions relating to current health behaviours; and provided a 5ml sample of peripheral blood for gene expression analysis. Extraversion was associated with increased expression of pro-inflammatory genes and Conscientiousness was associated with reduced expression of pro-inflammatory genes. Both associations were independent of health behaviours, negative affect, and leukocyte subset distributions. Antiviral and antibody-related gene expression was not associated with any personality dimension. The present data shed new light on the long-observed epidemiological associations between personality, physical health, and human longevity. Further research is required to elucidate the biological mechanisms underlying these associations. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.

    Science.gov (United States)

    Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P

    2015-04-23

    With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.

  6. Comprehensive analysis of gene expression patterns of hedgehog-related genes

    Directory of Open Access Journals (Sweden)

    Baillie David

    2006-10-01

    Full Text Available Abstract Background The Caenorhabditis elegans genome encodes ten proteins that share sequence similarity with the Hedgehog signaling molecule through their C-terminal autoprocessing Hint/Hog domain. These proteins contain novel N-terminal domains, and C. elegans encodes dozens of additional proteins containing only these N-terminal domains. These gene families are called warthog, groundhog, ground-like and quahog, collectively called hedgehog (hh-related genes. Previously, the expression pattern of seventeen genes was examined, which showed that they are primarily expressed in the ectoderm. Results With the completion of the C. elegans genome sequence in November 2002, we reexamined and identified 61 hh-related ORFs. Further, we identified 49 hh-related ORFs in C. briggsae. ORF analysis revealed that 30% of the genes still had errors in their predictions and we improved these predictions here. We performed a comprehensive expression analysis using GFP fusions of the putative intergenic regulatory sequence with one or two transgenic lines for most genes. The hh-related genes are expressed in one or a few of the following tissues: hypodermis, seam cells, excretory duct and pore cells, vulval epithelial cells, rectal epithelial cells, pharyngeal muscle or marginal cells, arcade cells, support cells of sensory organs, and neuronal cells. Using time-lapse recordings, we discovered that some hh-related genes are expressed in a cyclical fashion in phase with molting during larval development. We also generated several translational GFP fusions, but they did not show any subcellular localization. In addition, we also studied the expression patterns of two genes with similarity to Drosophila frizzled, T23D8.1 and F27E11.3A, and the ortholog of the Drosophila gene dally-like, gpn-1, which is a heparan sulfate proteoglycan. The two frizzled homologs are expressed in a few neurons in the head, and gpn-1 is expressed in the pharynx. Finally, we compare the

  7. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  8. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

    2012-05-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  9. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy; Bachkirova, Elena; Rey, Michael

    2013-10-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  10. Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus.

    Science.gov (United States)

    Pritchard, Victoria L; Viitaniemi, Heidi M; McCairns, R J Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R; Leder, Erica H

    2017-01-05

    Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. Copyright © 2017 Pritchard et al.

  11. Clustering gene expression data based on predicted differential effects of GV interaction.

    Science.gov (United States)

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  12. Determinants of human adipose tissue gene expression

    DEFF Research Database (Denmark)

    Viguerie, Nathalie; Montastier, Emilie; Maoret, Jean-José

    2012-01-01

    weight maintenance diets. For 175 genes, opposite regulation was observed during calorie restriction and weight maintenance phases, independently of variations in body weight. Metabolism and immunity genes showed inverse profiles. During the dietary intervention, network-based analyses revealed strong...... interconnection between expression of genes involved in de novo lipogenesis and components of the metabolic syndrome. Sex had a marked influence on AT expression of 88 transcripts, which persisted during the entire dietary intervention and after control for fat mass. In women, the influence of body mass index...... on expression of a subset of genes persisted during the dietary intervention. Twenty-two genes revealed a metabolic syndrome signature common to men and women. Genetic control of AT gene expression by cis signals was observed for 46 genes. Dietary intervention, sex, and cis genetic variants independently...

  13. Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.

    Directory of Open Access Journals (Sweden)

    Aldrin Kay-Yuen Yim

    Full Text Available Differential gene expression profiles often provide important clues for gene functions. While reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR is an important tool, the validity of the results depends heavily on the choice of proper reference genes. In this study, we employed new and published RNA-sequencing (RNA-Seq datasets (26 sequencing libraries in total to evaluate reference genes reported in previous soybean studies. In silico PCR showed that 13 out of 37 previously reported primer sets have multiple targets, and 4 of them have amplicons with different sizes. Using a probabilistic approach, we identified new and improved candidate reference genes. We further performed 2 validation tests (with 26 RNA samples on 8 commonly used reference genes and 7 newly identified candidates, using RT-qPCR. In general, the new candidate reference genes exhibited more stable expression levels under the tested experimental conditions. The three newly identified candidate reference genes Bic-C2, F-box protein2, and VPS-like gave the best overall performance, together with the commonly used ELF1b. It is expected that the proposed probabilistic model could serve as an important tool to identify stable reference genes when more soybean RNA-Seq data from different growth stages and treatments are used.

  14. Data-driven asthma endotypes defined from blood biomarker and gene expression data.

    Directory of Open Access Journals (Sweden)

    Barbara Jane George

    Full Text Available The diagnosis and treatment of childhood asthma is complicated by its mechanistically distinct subtypes (endotypes driven by genetic susceptibility and modulating environmental factors. Clinical biomarkers and blood gene expression were collected from a stratified, cross-sectional study of asthmatic and non-asthmatic children from Detroit, MI. This study describes four distinct asthma endotypes identified via a purely data-driven method. Our method was specifically designed to integrate blood gene expression and clinical biomarkers in a way that provides new mechanistic insights regarding the different asthma endotypes. For example, we describe metabolic syndrome-induced systemic inflammation as an associated factor in three of the four asthma endotypes. Context provided by the clinical biomarker data was essential in interpreting gene expression patterns and identifying putative endotypes, which emphasizes the importance of integrated approaches when studying complex disease etiologies. These synthesized patterns of gene expression and clinical markers from our research may lead to development of novel serum-based biomarker panels.

  15. Multiple Suboptimal Solutions for Prediction Rules in Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Osamu Komori

    2013-01-01

    Full Text Available This paper discusses mathematical and statistical aspects in analysis methods applied to microarray gene expressions. We focus on pattern recognition to extract informative features embedded in the data for prediction of phenotypes. It has been pointed out that there are severely difficult problems due to the unbalance in the number of observed genes compared with the number of observed subjects. We make a reanalysis of microarray gene expression published data to detect many other gene sets with almost the same performance. We conclude in the current stage that it is not possible to extract only informative genes with high performance in the all observed genes. We investigate the reason why this difficulty still exists even though there are actively proposed analysis methods and learning algorithms in statistical machine learning approaches. We focus on the mutual coherence or the absolute value of the Pearson correlations between two genes and describe the distributions of the correlation for the selected set of genes and the total set. We show that the problem of finding informative genes in high dimensional data is ill-posed and that the difficulty is closely related with the mutual coherence.

  16. LINE FUSION GENES: a database of LINE expression in human genes

    Directory of Open Access Journals (Sweden)

    Park Hong-Seog

    2006-06-01

    Full Text Available Abstract Background Long Interspersed Nuclear Elements (LINEs are the most abundant retrotransposons in humans. About 79% of human genes are estimated to contain at least one segment of LINE per transcription unit. Recent studies have shown that LINE elements can affect protein sequences, splicing patterns and expression of human genes. Description We have developed a database, LINE FUSION GENES, for elucidating LINE expression throughout the human gene database. We searched the 28,171 genes listed in the NCBI database for LINE elements and analyzed their structures and expression patterns. The results show that the mRNA sequences of 1,329 genes were affected by LINE expression. The LINE expression types were classified on the basis of LINEs in the 5' UTR, exon or 3' UTR sequences of the mRNAs. Our database provides further information, such as the tissue distribution and chromosomal location of the genes, and the domain structure that is changed by LINE integration. We have linked all the accession numbers to the NCBI data bank to provide mRNA sequences for subsequent users. Conclusion We believe that our work will interest genome scientists and might help them to gain insight into the implications of LINE expression for human evolution and disease. Availability http://www.primate.or.kr/line

  17. Analyzing Multiple-Probe Microarray: Estimation and Application of Gene Expression Indexes

    KAUST Repository

    Maadooliat, Mehdi

    2012-07-26

    Gene expression index estimation is an essential step in analyzing multiple probe microarray data. Various modeling methods have been proposed in this area. Amidst all, a popular method proposed in Li and Wong (2001) is based on a multiplicative model, which is similar to the additive model discussed in Irizarry et al. (2003a) at the logarithm scale. Along this line, Hu et al. (2006) proposed data transformation to improve expression index estimation based on an ad hoc entropy criteria and naive grid search approach. In this work, we re-examined this problem using a new profile likelihood-based transformation estimation approach that is more statistically elegant and computationally efficient. We demonstrate the applicability of the proposed method using a benchmark Affymetrix U95A spiked-in experiment. Moreover, We introduced a new multivariate expression index and used the empirical study to shows its promise in terms of improving model fitting and power of detecting differential expression over the commonly used univariate expression index. As the other important content of the work, we discussed two generally encountered practical issues in application of gene expression index: normalization and summary statistic used for detecting differential expression. Our empirical study shows somewhat different findings from the MAQC project (MAQC, 2006).

  18. Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don.

    Science.gov (United States)

    Xiao, Zheng; Sun, Xiaobo; Liu, Xiaoqing; Li, Chang; He, Lisi; Chen, Shangping; Su, Jiale

    2016-01-01

    The quantitative real-time polymerase chain reaction (qRT-PCR) approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder, and BestKeeper. The results showed that EF1- α (elongation factor 1-alpha), 18S (18s ribosomal RNA), and RPL3 (ribosomal protein L3) were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin ( TUB ) was the least stable. ACT5 (actin), RPL3 , 18S , and EF1- α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle . Furthermore, the expression profiles of RmPSY (phytoene synthase) and RmPDS (phytoene dehydrogenase) were assessed using EF1- α, 18S , ACT5 , RPL3 , and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle .

  19. Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don

    Directory of Open Access Journals (Sweden)

    Zheng Xiao

    2016-10-01

    Full Text Available The quantitative real-time polymerase chain reaction (qRT-PCR approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder and BestKeeper. The results showed that EF1-α (elongation factor 1-alpha, 18S (18s ribosomal RNA and RPL3 (ribosomal protein L3 were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin (TUB was the least stable. ACT5 (actin, RPL3, 18S and EF1-α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle. Furthermore, the expression profiles of RmPSY (phytoene synthase and RmPDS (phytoene dehydrogenase were assessed using EF1-α, 18S, ACT5, and RPL3 and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle.

  20. FocusHeuristics - expression-data-driven network optimization and disease gene prediction.

    Science.gov (United States)

    Ernst, Mathias; Du, Yang; Warsow, Gregor; Hamed, Mohamed; Endlich, Nicole; Endlich, Karlhans; Murua Escobar, Hugo; Sklarz, Lisa-Madeleine; Sender, Sina; Junghanß, Christian; Möller, Steffen; Fuellen, Georg; Struckmann, Stephan

    2017-02-16

    To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes.

  1. Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko

    2015-12-23

    Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis-eQTLs. Expression

  2. Social Regulation of Gene Expression in Threespine Sticklebacks.

    Directory of Open Access Journals (Sweden)

    Anna K Greenwood

    Full Text Available Identifying genes that are differentially expressed in response to social interactions is informative for understanding the molecular basis of social behavior. To address this question, we described changes in gene expression as a result of differences in the extent of social interactions. We housed threespine stickleback (Gasterosteus aculeatus females in either group conditions or individually for one week, then measured levels of gene expression in three brain regions using RNA-sequencing. We found that numerous genes in the hindbrain/cerebellum had altered expression in response to group or individual housing. However, relatively few genes were differentially expressed in either the diencephalon or telencephalon. The list of genes upregulated in fish from social groups included many genes related to neural development and cell adhesion as well as genes with functions in sensory signaling, stress, and social and reproductive behavior. The list of genes expressed at higher levels in individually-housed fish included several genes previously identified as regulated by social interactions in other animals. The identified genes are interesting targets for future research on the molecular mechanisms of normal social interactions.

  3. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat.

    KAUST Repository

    Leach, Lindsey J

    2014-04-11

    BACKGROUND: Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution \\'nullisomic-tetrasomic\\' lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. RESULTS: We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. CONCLUSIONS: We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution.

  4. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat.

    KAUST Repository

    Leach, Lindsey J; Belfield, Eric J; Jiang, Caifu; Brown, Carly; Mithani, Aziz; Harberd, Nicholas P

    2014-01-01

    BACKGROUND: Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution 'nullisomic-tetrasomic' lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. RESULTS: We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. CONCLUSIONS: We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution.

  5. GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach.

    Science.gov (United States)

    Zhang, Song; Cao, Jing; Kong, Y Megan; Scheuermann, Richard H

    2010-04-01

    A typical approach for the interpretation of high-throughput experiments, such as gene expression microarrays, is to produce groups of genes based on certain criteria (e.g. genes that are differentially expressed). To gain more mechanistic insights into the underlying biology, overrepresentation analysis (ORA) is often conducted to investigate whether gene sets associated with particular biological functions, for example, as represented by Gene Ontology (GO) annotations, are statistically overrepresented in the identified gene groups. However, the standard ORA, which is based on the hypergeometric test, analyzes each GO term in isolation and does not take into account the dependence structure of the GO-term hierarchy. We have developed a Bayesian approach (GO-Bayes) to measure overrepresentation of GO terms that incorporates the GO dependence structure by taking into account evidence not only from individual GO terms, but also from their related terms (i.e. parents, children, siblings, etc.). The Bayesian framework borrows information across related GO terms to strengthen the detection of overrepresentation signals. As a result, this method tends to identify sets of closely related GO terms rather than individual isolated GO terms. The advantage of the GO-Bayes approach is demonstrated with a simulation study and an application example.

  6. Characterizing bacterial gene expression in nitrogen cycle metabolism with RT-qPCR.

    Science.gov (United States)

    Graham, James E; Wantland, Nicholas B; Campbell, Mark; Klotz, Martin G

    2011-01-01

    Recent advances in DNA sequencing have greatly accelerated our ability to obtain the raw information needed to recognize both known and potential novel modular microbial genomic capacity for nitrogen metabolism. With PCR-based approaches to quantifying microbial mRNA expression now mainstream in most laboratories, researchers can now more efficiently propose and test hypotheses on the contributions of individual microbes to the biological accessibility of nitrogen upon which all other life depends. We review known microbial roles in these key nitrogen transformations, and describe the necessary steps in carrying out relevant gene expression studies. An example experimental design is then provided characterizing Nitrosococcus oceani mRNA expression in cultures responding to ammonia. The approach described, that of assessing microbial genome inventory and testing putative modular gene expression by mRNA quantification, is likely to remain an important tool in understanding individual microbial contributions within microbial community activities that maintain the Earth's nitrogen balance. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. Signal Transduction Pathways that Regulate CAB Gene Expression

    Energy Technology Data Exchange (ETDEWEB)

    Chory, Joanne

    2004-12-31

    The process of chloroplast differentiation, involves the coordinate regulation of many nuclear and chloroplast genes. The cues for the initiation of this developmental program are both extrinsic (e.g., light) and intrinsic (cell-type and plastid signals). During this project period, we utilized a molecular genetic approach to select for Arabidopsis mutants that did not respond properly to environmental light conditions, as well as mutants that were unable to perceive plastid damage. These latter mutants, called gun mutants, define two retrograde signaling pathways that regulate nuclear gene expression in response to chloroplasts. A major finding was to identify a signal from chloroplasts that regulates nuclear gene transcription. This signal is the build-up of Mg-Protoporphyrin IX, a key intermediate of the chlorophyll biosynthetic pathway. The signaling pathways downstream of this signal are currently being studied. Completion of this project has provided an increased understanding of the input signals and retrograde signaling pathways that control nuclear gene expression in response to the functional state of chloroplasts. These studies should ultimately influence our abilities to manipulate plant growth and development, and will aid in the understanding of the developmental control of photosynthesis.

  8. Signal Transduction Pathways that Regulate CAB Gene Expression

    Energy Technology Data Exchange (ETDEWEB)

    Chory, Joanne

    2006-01-16

    The process of chloroplast differentiation, involves the coordinate regulation of many nuclear and chloroplast genes. The cues for the initiation of this developmental program are both extrinsic (e.g., light) and intrinsic (cell-type and plastid signals). During this project period, we utilized a molecular genetic approach to select for Arabidopsis mutants that did not respond properly to environmental light conditions, as well as mutants that were unable to perceive plastid damage. These latter mutants, called gun mutants, define two retrograde signaling pathways that regulate nuclear gene expression in response to chloroplasts. A major finding was to identify a signal from chloroplasts that regulates nuclear gene transcription. This signal is the build-up of Mg-Protoporphyrin IX, a key intermediate of the chlorophyll biosynthetic pathway. The signaling pathways downstream of this signal are currently being studied. Completion of this project has provided an increased understanding of the input signals and retrograde signaling pathways that control nuclear gene expression in response to the functional state of chloroplasts. These studies should ultimately influence our abilities to manipulate plant growth and development, and will aid in the understanding of the developmental control of photosynthesis.

  9. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    International Nuclear Information System (INIS)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon

    2001-01-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with 33 P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells

  10. Gene expression of panaxydol-treated human melanoma cells using radioactive cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Cho, Joong Youn; Yu, Su Jin; Soh, Jeong Won; Kim, Meyoung Kon [College of Medicine, Korea Univ., Seoul (Korea, Republic of)

    2001-07-01

    Polyacetylenic alcohols derived from Panax ginseng have been studied to be an anticancer reagent previously. One of the Panax ginseng polyacetylenic alcohols, i.e., panaxydol, has been studied to possess an antiproliferative effect on human melanoma cell line (SK-MEL-1). In ths study, radioactive cDNA microarrays enabled an efficient approach to analyze the pattern of gene expression (3.194 genes in a total) simultaneously. The bioinformatics selection of human cDNAs, which is specifically designed for immunology, apoptosis and signal transduction, were arrayed on nylon membranes. Using with {sup 33}P labeled probes, this method provided highly sensitive gene expression profiles of our interest including apoptosis, cell proliferation, cell cycle, and signal transduction. Gene expression profiles were also classified into several categories in accordance with the duration of panaxydol treatment. Consequently, the gene profiles of our interest were significantly up (199 genes, > 2.0 of Z-ratio) or down-(196 genes, < 2.0 of Z-ratio) regulated in panaxydol-treated human melanoma cells.

  11. Stably Expressed Genes Involved in Basic Cellular Functions.

    Directory of Open Access Journals (Sweden)

    Kejian Wang

    Full Text Available Stably Expressed Genes (SEGs whose expression varies within a narrow range may be involved in core cellular processes necessary for basic functions. To identify such genes, we re-analyzed existing RNA-Seq gene expression profiles across 11 organs at 4 developmental stages (from immature to old age in both sexes of F344 rats (n = 4/group; 320 samples. Expression changes (calculated as the maximum expression / minimum expression for each gene of >19000 genes across organs, ages, and sexes ranged from 2.35 to >109-fold, with a median of 165-fold. The expression of 278 SEGs was found to vary ≤4-fold and these genes were significantly involved in protein catabolism (proteasome and ubiquitination, RNA transport, protein processing, and the spliceosome. Such stability of expression was further validated in human samples where the expression variability of the homologous human SEGs was significantly lower than that of other genes in the human genome. It was also found that the homologous human SEGs were generally less subject to non-synonymous mutation than other genes, as would be expected of stably expressed genes. We also found that knockout of SEG homologs in mouse models was more likely to cause complete preweaning lethality than non-SEG homologs, corroborating the fundamental roles played by SEGs in biological development. Such stably expressed genes and pathways across life-stages suggest that tight control of these processes is important in basic cellular functions and that perturbation by endogenous (e.g., genetics or exogenous agents (e.g., drugs, environmental factors may cause serious adverse effects.

  12. Lithium ions induce prestalk-associated gene expression and inhibit prespore gene expression in Dictyostelium discoideum

    NARCIS (Netherlands)

    Peters, Dorien J.M.; Lookeren Campagne, Michiel M. van; Haastert, Peter J.M. van; Spek, Wouter; Schaap, Pauline

    1989-01-01

    We investigated the effect of Li+ on two types of cyclic AMP-regulated gene expression and on basal and cyclic AMP-stimulated inositol 1,4,5-trisphosphate (Ins(1,4,5)P3) levels. Li+ effectively inhibits cyclic AMP-induced prespore gene expression, half-maximal inhibition occurring at about 2mM-LiCl.

  13. Evaluation of gene expression profile of keratinocytes in response to JP-8 jet fuel

    International Nuclear Information System (INIS)

    Espinoza, Luis A.; Li Peijun; Lee, Richard Y.; Wang Yue; Boulares, A. Hamid; Clarke, Robert; Smulson, Mark E.

    2004-01-01

    The skin is the principal barrier against any environmental insult. Therefore, there is a high risk for a large number of military and civilian personnel exposed to jet fuel JP-8 to suffer percutaneous absorption of this fuel. This paper reports the use of cDNA microarray to identify the gene expression profile in normal human epidermal keratinocytes exposed to JP-8 for 24-h and 7-day periods. The effects of JP-8 exposure on keratinocytes at these two different periods induced a set of genes with altered expression in response to this type of insult. Microarray data were visualized using a novel algorithm based on simple statistical analyses to reduce data dimensionality and identify subsets of discriminant genes. Predictive neural networks were built using a multiplayer perceptron to carry out a proper classification task in microarray data in the untreated versus JP-8-treated samples. The pattern of expressions in response to JP-8 provides evidences that detoxificant-related and cell growth regulator genes with the most variability in the level of expression may be useful genetic markers in adverse health effects of personnel exposed to JP-8. The approaches in our analysis provide a simple, safe, novel, and effective method that is reliable in identifying and analyzing gene expression in samples treated with JP-8 or over potential toxic agents. Gene expression data from these studies can be used to build accurate predictive models that separate different molecular profiles. The data establish the use and effectiveness of these approaches for future prospective studies

  14. Cloning-free regulated monitoring of reporter and gene expression

    Directory of Open Access Journals (Sweden)

    Demirkaya Omer

    2009-03-01

    Full Text Available Abstract Background The majority of the promoters, their regulatory elements, and their variations in the human genome remain unknown. Reporter gene technology for transcriptional activity is a widely used tool for the study of promoter structure, gene regulation, and signaling pathways. Construction of transcriptional reporter vectors, including use of cis-acting sequences, requires cloning and time-demanding manipulations, particularly with introduced mutations. Results In this report, we describe a cloning-free strategy to generate transcriptionally-controllable linear reporter constructs. This approach was applied in common transcriptional models of inflammatory response and the interferon system. In addition, it was used to delineate minimal transcriptional activity of selected ribosomal protein promoters. The approach was tested for conversion of genes into TetO-inducible/repressible expression cassettes. Conclusion The simple introduction and tuning of any transcriptional control in the linear DNA product renders promoter activation and regulated gene studies simple and versatile.

  15. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Pugalendhi Ganesh Kumar

    Full Text Available This study describes a novel approach to reducing the challenges of highly nonlinear multiclass gene expression values for cancer diagnosis. To build a fruitful system for cancer diagnosis, in this study, we introduced two levels of gene selection such as filtering and embedding for selection of potential genes and the most relevant genes associated with cancer, respectively. The filter procedure was implemented by developing a fuzzy rough set (FR-based method for redefining the criterion function of f-information (FI to identify the potential genes without discretizing the continuous gene expression values. The embedded procedure is implemented by means of a water swirl algorithm (WSA, which attempts to optimize the rule set and membership function required to classify samples using a fuzzy-rule-based multiclassification system (FRBMS. Two novel update equations are proposed in WSA, which have better exploration and exploitation abilities while designing a self-learning FRBMS. The efficiency of our new approach was evaluated on 13 multicategory and 9 binary datasets of cancer gene expression. Additionally, the performance of the proposed FRFI-WSA method in designing an FRBMS was compared with existing methods for gene selection and optimization such as genetic algorithm (GA, particle swarm optimization (PSO, and artificial bee colony algorithm (ABC on all the datasets. In the global cancer map with repeated measurements (GCM_RM dataset, the FRFI-WSA showed the smallest number of 16 most relevant genes associated with cancer using a minimal number of 26 compact rules with the highest classification accuracy (96.45%. In addition, the statistical validation used in this study revealed that the biological relevance of the most relevant genes associated with cancer and their linguistics detected by the proposed FRFI-WSA approach are better than those in the other methods. The simple interpretable rules with most relevant genes and effectively

  16. Microarray meta-analysis to explore abiotic stress-specific gene expression patterns in Arabidopsis.

    Science.gov (United States)

    Shen, Po-Chih; Hour, Ai-Ling; Liu, Li-Yu Daisy

    2017-12-01

    Abiotic stresses are the major limiting factors that affect plant growth, development, yield and final quality. Deciphering the underlying mechanisms of plants' adaptations to stresses using few datasets might overlook the different aspects of stress tolerance in plants, which might be simultaneously and consequently operated in the system. Fortunately, the accumulated microarray expression data offer an opportunity to infer abiotic stress-specific gene expression patterns through meta-analysis. In this study, we propose to combine microarray gene expression data under control, cold, drought, heat, and salt conditions and determined modules (gene sets) of genes highly associated with each other according to the observed expression data. By analyzing the expression variations of the Eigen genes from different conditions, we had identified two, three, and five gene modules as cold-, heat-, and salt-specific modules, respectively. Most of the cold- or heat-specific modules were differentially expressed to a particular degree in shoot samples, while most of the salt-specific modules were differentially expressed to a particular degree in root samples. A gene ontology (GO) analysis on the stress-specific modules suggested that the gene modules exclusively enriched stress-related GO terms and that different genes under the same GO terms may be alternatively disturbed in different conditions. The gene regulatory events for two genes, DREB1A and DEAR1, in the cold-specific gene module had also been validated, as evidenced through the literature search. Our protocols study the specificity of the gene modules that were specifically activated under a particular type of abiotic stress. The biplot can also assist to visualize the stress-specific gene modules. In conclusion, our approach has the potential to further elucidate mechanisms in plants and beneficial for future experiments design under different abiotic stresses.

  17. Double-filter identification of vascular-expressed genes using Arabidopsis plants with vascular hypertrophy and hypotrophy.

    Science.gov (United States)

    Ckurshumova, Wenzislava; Scarpella, Enrico; Goldstein, Rochelle S; Berleth, Thomas

    2011-08-01

    Genes expressed in vascular tissues have been identified by several strategies, usually with a focus on mature vascular cells. In this study, we explored the possibility of using two opposite types of altered tissue compositions in combination with a double-filter selection to identify genes with a high probability of vascular expression in early organ primordia. Specifically, we generated full-transcriptome microarray profiles of plants with (a) genetically strongly reduced and (b) pharmacologically vastly increased vascular tissues and identified a reproducible cohort of 158 transcripts that fulfilled the dual requirement of being underrepresented in (a) and overrepresented in (b). In order to assess the predictive value of our identification scheme for vascular gene expression, we determined the expression patterns of genes in two unbiased subsamples. First, we assessed the expression patterns of all twenty annotated transcription factor genes from the cohort of 158 genes and found that seventeen of the twenty genes were preferentially expressed in leaf vascular cells. Remarkably, fifteen of these seventeen vascular genes were clearly expressed already very early in leaf vein development. Twelve genes with published leaf expression patterns served as a second subsample to monitor the representation of vascular genes in our cohort. Of those twelve genes, eleven were preferentially expressed in leaf vascular tissues. Based on these results we propose that our compendium of 158 genes represents a sample that is highly enriched for genes expressed in vascular tissues and that our approach is particularly suited to detect genes expressed in vascular cell lineages at early stages of their inception. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  18. Evaluation of Different Normalization and Analysis Procedures for Illumina Gene Expression Microarray Data Involving Small Changes

    Science.gov (United States)

    Johnstone, Daniel M.; Riveros, Carlos; Heidari, Moones; Graham, Ross M.; Trinder, Debbie; Berretta, Regina; Olynyk, John K.; Scott, Rodney J.; Moscato, Pablo; Milward, Elizabeth A.

    2013-01-01

    While Illumina microarrays can be used successfully for detecting small gene expression changes due to their high degree of technical replicability, there is little information on how different normalization and differential expression analysis strategies affect outcomes. To evaluate this, we assessed concordance across gene lists generated by applying different combinations of normalization strategy and analytical approach to two Illumina datasets with modest expression changes. In addition to using traditional statistical approaches, we also tested an approach based on combinatorial optimization. We found that the choice of both normalization strategy and analytical approach considerably affected outcomes, in some cases leading to substantial differences in gene lists and subsequent pathway analysis results. Our findings suggest that important biological phenomena may be overlooked when there is a routine practice of using only one approach to investigate all microarray datasets. Analytical artefacts of this kind are likely to be especially relevant for datasets involving small fold changes, where inherent technical variation—if not adequately minimized by effective normalization—may overshadow true biological variation. This report provides some basic guidelines for optimizing outcomes when working with Illumina datasets involving small expression changes. PMID:27605185

  19. Dynamic Maternal Gradients Control Timing and Shift-Rates for Drosophila Gap Gene Expression

    Science.gov (United States)

    Verd, Berta; Crombach, Anton

    2017-01-01

    Pattern formation during development is a highly dynamic process. In spite of this, few experimental and modelling approaches take into account the explicit time-dependence of the rules governing regulatory systems. We address this problem by studying dynamic morphogen interpretation by the gap gene network in Drosophila melanogaster. Gap genes are involved in segment determination during early embryogenesis. They are activated by maternal morphogen gradients encoded by bicoid (bcd) and caudal (cad). These gradients decay at the same time-scale as the establishment of the antero-posterior gap gene pattern. We use a reverse-engineering approach, based on data-driven regulatory models called gene circuits, to isolate and characterise the explicitly time-dependent effects of changing morphogen concentrations on gap gene regulation. To achieve this, we simulate the system in the presence and absence of dynamic gradient decay. Comparison between these simulations reveals that maternal morphogen decay controls the timing and limits the rate of gap gene expression. In the anterior of the embyro, it affects peak expression and leads to the establishment of smooth spatial boundaries between gap domains. In the posterior of the embryo, it causes a progressive slow-down in the rate of gap domain shifts, which is necessary to correctly position domain boundaries and to stabilise the spatial gap gene expression pattern. We use a newly developed method for the analysis of transient dynamics in non-autonomous (time-variable) systems to understand the regulatory causes of these effects. By providing a rigorous mechanistic explanation for the role of maternal gradient decay in gap gene regulation, our study demonstrates that such analyses are feasible and reveal important aspects of dynamic gene regulation which would have been missed by a traditional steady-state approach. More generally, it highlights the importance of transient dynamics for understanding complex regulatory

  20. Inferring time-varying network topologies from gene expression data.

    Science.gov (United States)

    Rao, Arvind; Hero, Alfred O; States, David J; Engel, James Douglas

    2007-01-01

    Most current methods for gene regulatory network identification lead to the inference of steady-state networks, that is, networks prevalent over all times, a hypothesis which has been challenged. There has been a need to infer and represent networks in a dynamic, that is, time-varying fashion, in order to account for different cellular states affecting the interactions amongst genes. In this work, we present an approach, regime-SSM, to understand gene regulatory networks within such a dynamic setting. The approach uses a clustering method based on these underlying dynamics, followed by system identification using a state-space model for each learnt cluster--to infer a network adjacency matrix. We finally indicate our results on the mouse embryonic kidney dataset as well as the T-cell activation-based expression dataset and demonstrate conformity with reported experimental evidence.

  1. Identification and expression analysis of genes associated with bovine blastocyst formation

    Directory of Open Access Journals (Sweden)

    Van Zeveren Alex

    2007-06-01

    Full Text Available Abstract Background Normal preimplantation embryo development encompasses a series of events including first cleavage division, activation of the embryonic genome, compaction and blastocyst formation. First lineage differentiation starts at the blastocyst stage with the formation of the trophectoderm and the inner cell mass. The main objective of this study was the detection, identification and expression analysis of genes associated with blastocyst formation in order to help us better understand this process. This information could lead to improvements of in vitro embryo production procedures. Results A subtractive cDNA library was constructed enriched for transcripts preferentially expressed at the blastocyst stage compared to the 2-cell and 8-cell stage. Sequence information was obtained for 65 randomly selected clones. The RNA expression levels of 12 candidate genes were determined throughout 3 stages of preimplantation embryo development (2-cell, 8-cell and blastocyst and compared with the RNA expression levels of in vivo "golden standard" embryos using real-time PCR. The RNA expression profiles of 9 (75% transcripts (KRT18, FN1, MYL6, ATP1B3, FTH1, HINT1, SLC25A5, ATP6V0B, RPL10 were in agreement with the subtractive cDNA cloning approach, whereas for the remaining 3 (25% (ACTN1, COPE, EEF1A1 the RNA expression level was equal or even higher at the earlier developmental stages compared to the blastocyst stage. Moreover, significant differences in RNA expression levels were observed between in vitro and in vivo produced embryos. By immunofluorescent labelling, the protein expression of KRT18, FN1 and MYL6 was determined throughout bovine preimplantation embryo development and showed the same pattern as the RNA expression analyses. Conclusion By subtractive cDNA cloning, candidate genes involved in blastocyst formation were identified. For several candidate genes, important differences in gene expression were observed between in vivo and in

  2. Cross-species global and subset gene expression profiling identifies genes involved in prostate cancer response to selenium

    Directory of Open Access Journals (Sweden)

    Dhir Rajiv

    2004-08-01

    Full Text Available Abstract Background Gene expression technologies have the ability to generate vast amounts of data, yet there often resides only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pathways or transcriptional regulatory grouping to sort genes for further study. In this paper we demonstrate a comparative genomics based method to leverage data from animal models to prioritize genes for validation. This approach allows one to develop a disease-based focus for the prioritization of gene data, a process that is essential for systems that lack significant functional pathway data yet have defined animal models. This method is made possible through the use of highly controlled spotted cDNA slide production and the use of comparative bioinformatics databases without the use of cross-species slide hybridizations. Results Using gene expression profiling we have demonstrated a similar whole transcriptome gene expression patterns in prostate cancer cells from human and rat prostate cancer cell lines both at baseline expression levels and after treatment with physiologic concentrations of the proposed chemopreventive agent Selenium. Using both the human PC3 and rat PAII prostate cancer cell lines have gone on to identify a subset of one hundred and fifty-four genes that demonstrate a similar level of differential expression to Selenium treatment in both species. Further analysis and data mining for two genes, the Insulin like Growth Factor Binding protein 3, and Retinoic X Receptor alpha, demonstrates an association with prostate cancer, functional pathway links, and protein-protein interactions that make these genes prime candidates for explaining the mechanism of Selenium's chemopreventive effect in prostate cancer. These genes are subsequently validated by western blots showing Selenium based induction and using

  3. Stochastic gene expression in Arabidopsis thaliana.

    Science.gov (United States)

    Araújo, Ilka Schultheiß; Pietsch, Jessica Magdalena; Keizer, Emma Mathilde; Greese, Bettina; Balkunde, Rachappa; Fleck, Christian; Hülskamp, Martin

    2017-12-14

    Although plant development is highly reproducible, some stochasticity exists. This developmental stochasticity may be caused by noisy gene expression. Here we analyze the fluctuation of protein expression in Arabidopsis thaliana. Using the photoconvertible KikGR marker, we show that the protein expressions of individual cells fluctuate over time. A dual reporter system was used to study extrinsic and intrinsic noise of marker gene expression. We report that extrinsic noise is higher than intrinsic noise and that extrinsic noise in stomata is clearly lower in comparison to several other tissues/cell types. Finally, we show that cells are coupled with respect to stochastic protein expression in young leaves, hypocotyls and roots but not in mature leaves. Our data indicate that stochasticity of gene expression can vary between tissues/cell types and that it can be coupled in a non-cell-autonomous manner.

  4. Deriving Trading Rules Using Gene Expression Programming

    Directory of Open Access Journals (Sweden)

    Adrian VISOIU

    2011-01-01

    Full Text Available This paper presents how buy and sell trading rules are generated using gene expression programming with special setup. Market concepts are presented and market analysis is discussed with emphasis on technical analysis and quantitative methods. The use of genetic algorithms in deriving trading rules is presented. Gene expression programming is applied in a form where multiple types of operators and operands are used. This gives birth to multiple gene contexts and references between genes in order to keep the linear structure of the gene expression programming chromosome. The setup of multiple gene contexts is presented. The case study shows how to use the proposed gene setup to derive trading rules encoded by Boolean expressions, using a dataset with the reference exchange rates between the Euro and the Romanian leu. The conclusions highlight the positive results obtained in deriving useful trading rules.

  5. Inferring Drosophila gap gene regulatory network: Pattern analysis of simulated gene expression profiles and stability analysis

    NARCIS (Netherlands)

    Fomekong-Nanfack, Y.; Postma, M.; Kaandorp, J.A.

    2009-01-01

    Background: Inference of gene regulatory networks (GRNs) requires accurate data, a method to simulate the expression patterns and an efficient optimization algorithm to estimate the unknown parameters. Using this approach it is possible to obtain alternative circuits without making any a priori

  6. Using PCR to Target Misconceptions about Gene Expression

    Directory of Open Access Journals (Sweden)

    Leslie K. Wright

    2013-02-01

    Full Text Available We present a PCR-based laboratory exercise that can be used with first- or second-year biology students to help overcome common misconceptions about gene expression. Biology students typically do not have a clear understanding of the difference between genes (DNA and gene expression (mRNA/protein and often believe that genes exist in an organism or cell only when they are expressed. This laboratory exercise allows students to carry out a PCR-based experiment designed to challenge their misunderstanding of the difference between genes and gene expression. Students first transform E. coli with an inducible GFP gene containing plasmid and observe induced and un-induced colonies. The following exercise creates cognitive dissonance when actual PCR results contradict their initial (incorrect predictions of the presence of the GFP gene in transformed cells. Field testing of this laboratory exercise resulted in learning gains on both knowledge and application questions on concepts related to genes and gene expression.

  7. Exploring gene expression changes in the amphioxus gill after poly(I:C) challenge using digital expression profiling.

    Science.gov (United States)

    Zhang, Qi-Lin; Qiu, Han-Yue; Liang, Ming-Zhong; Luo, Bang; Wang, Xiu-Qiang; Chen, Jun-Yuan

    2017-11-01

    Amphioxus, a cephalochordate, is a key model animal for studying the evolution of vertebrate immunity. Recently, studies have revealed that microRNA (miRNA) expression profiles change significantly in the amphioxus gill after immune stimulation, but it remains largely unknown how gene expression responds to immune stress. Elucidating gene expression changes in the amphioxus gill will provide a deeper understanding of the evolution of gill immunity in vertebrates. Here, we used high-throughput RNA sequencing technology (RNA-seq) to conduct tag-based digital gene expression profiling (DGE) analyses of the gills of control Branchiostoma belcheri and of those exposed to the viral mimic, poly(I:C) (pIC). Six libraries were created for the control and treatment groups including three biological replicates per group. A total of 1999 differently expressed genes (DEGs) were obtained, with 571 and 1428 DEGs showing up- or down-regulation, respectively, in the treatment group. Enrichment analysis of gene ontology (GO) terms and pathways revealed that the DEGs were primarily related to immune and defense response, apoptosis, human disease, cancer, protein metabolism, enzyme activity, and regulatory processes. In addition, eight DEGs were randomly selected to validate the RNA-seq data using real-time quantitative PCR (qRT-PCR), and the results confirmed the accuracy of the RNA-seq approach. Next, we screened eight key responding genes to examine the dynamic changes in expression levels at different time points in more detail. The results indicated that expressions of TRADD, MARCH, RNF31, NF-κb, CYP450, TNFRSF6B, IFI and LECT1 were induced to participate in the antiviral response against pIC. This study provides a valuable resource for understanding the role of the amphioxus gill in antiviral immunity and the evolution of gill immunity in vertebrates. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Differential gene expression during Trypanosoma cruzi metacyclogenesis

    Directory of Open Access Journals (Sweden)

    Marco Aurelio Krieger

    1999-09-01

    Full Text Available The transformation of epimastigotes into metacyclic trypomastigotes involves changes in the pattern of expressed genes, resulting in important morphological and functional differences between these developmental forms of Trypanosoma cruzi. In order to identify and characterize genes involved in triggering the metacyclogenesis process and in conferring to metacyclic trypomastigotes their stage specific biological properties, we have developed a method allowing the isolation of genes specifically expressed when comparing two close related cell populations (representation of differential expression or RDE. The method is based on the PCR amplification of gene sequences selected by hybridizing and subtracting the populations in such a way that after some cycles of hybridization-amplification genes specific to a given population are highly enriched. The use of this method in the analysis of differential gene expression during T. cruzi metacyclogenesis (6 hr and 24 hr of differentiation and metacyclic trypomastigotes resulted in the isolation of several clones from each time point. Northern blot analysis showed that some genes are transiently expressed (6 hr and 24 hr differentiating cells, while others are present in differentiating cells and in metacyclic trypomastigotes. Nucleotide sequencing of six clones characterized so far showed that they do not display any homology to gene sequences available in the GeneBank.

  9. Conditional gene expression in the mouse using a Sleeping Beauty gene-trap transposon

    Directory of Open Access Journals (Sweden)

    Hackett Perry B

    2006-06-01

    Full Text Available Abstract Background Insertional mutagenesis techniques with transposable elements have been popular among geneticists studying model organisms from E. coli to Drosophila and, more recently, the mouse. One such element is the Sleeping Beauty (SB transposon that has been shown in several studies to be an effective insertional mutagen in the mouse germline. SB transposon vector studies have employed different functional elements and reporter molecules to disrupt and report the expression of endogenous mouse genes. We sought to generate a transposon system that would be capable of reporting the expression pattern of a mouse gene while allowing for conditional expression of a gene of interest in a tissue- or temporal-specific pattern. Results Here we report the systematic development and testing of a transposon-based gene-trap system incorporating the doxycycline-repressible Tet-Off (tTA system that is capable of activating the expression of genes under control of a Tet response element (TRE promoter. We demonstrate that the gene trap system is fully functional in vitro by introducing the "gene-trap tTA" vector into human cells by transposition and identifying clones that activate expression of a TRE-luciferase transgene in a doxycycline-dependent manner. In transgenic mice, we mobilize gene-trap tTA vectors, discover parameters that can affect germline mobilization rates, and identify candidate gene insertions to demonstrate the in vivo functionality of the vector system. We further demonstrate that the gene-trap can act as a reporter of endogenous gene expression and it can be coupled with bioluminescent imaging to identify genes with tissue-specific expression patterns. Conclusion Akin to the GAL4/UAS system used in the fly, we have made progress developing a tool for mutating and revealing the expression of mouse genes by generating the tTA transactivator in the presence of a secondary TRE-regulated reporter molecule. A vector like the gene

  10. A comparative gene expression database for invertebrates

    Directory of Open Access Journals (Sweden)

    Ormestad Mattias

    2011-08-01

    Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN projects.

  11. Genetic Variants Contribute to Gene Expression Variability in Humans

    Science.gov (United States)

    Hulse, Amanda M.; Cai, James J.

    2013-01-01

    Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies focused on the mean of gene expression level, but not the variance of gene expression level (i.e., gene expression variability). In the present study, we systematically explore genome-wide association between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously fit the means and the variances of gene expression among the three possible genotypes of a biallelic SNP. The genomic loci showing significant association between the variances of gene expression and the genotypes are termed expression variability QTL (evQTL). Using a data set of gene expression in lymphoblastoid cell lines (LCLs) derived from 210 HapMap individuals, we identify cis-acting evQTL involving 218 distinct genes, among which 8 genes, ADCY1, CTNNA2, DAAM2, FERMT2, IL6, PLOD2, SNX7, and TNFRSF11B, are cross-validated using an extra expression data set of the same LCLs. We also identify ∼300 trans-acting evQTL between >13,000 common SNPs and 500 randomly selected representative genes. We employ two distinct scenarios, emphasizing single-SNP and multiple-SNP effects on expression variability, to explain the formation of evQTL. We argue that detecting evQTL may represent a novel method for effectively screening for genetic interactions, especially when the multiple-SNP influence on expression variability is implied. The implication of our results for revealing genetic mechanisms of gene expression variability is discussed. PMID:23150607

  12. Identification and characterization of a novel gene differentially expressed in zebrafish cross-subfamily cloned embryos

    Directory of Open Access Journals (Sweden)

    Wang Ya-Ping

    2008-03-01

    Full Text Available Abstract Background Cross-species nuclear transfer has been shown to be a potent approach to retain the genetic viability of a certain species near extinction. However, most embryos produced by cross-species nuclear transfer were compromised because that they were unable to develop to later stages. Gene expression analysis of cross-species cloned embryos will yield new insights into the regulatory mechanisms involved in cross-species nuclear transfer and embryonic development. Results A novel gene, K31, was identified as an up-regulated gene in fish cross-subfamily cloned embryos using SSH approach and RACE method. K31 complete cDNA sequence is 1106 base pairs (bp in length, with a 342 bp open reading frame (ORF encoding a putative protein of 113 amino acids (aa. Comparative analysis revealed no homologous known gene in zebrafish and other species database. K31 protein contains a putative transmembrane helix and five putative phosphorylation sites but without a signal peptide. Expression pattern analysis by real time RT-PCR and whole-mount in situ hybridization (WISH shows that it has the characteristics of constitutively expressed gene. Sub-cellular localization assay shows that K31 protein can not penetrate the nuclei. Interestingly, over-expression of K31 gene can cause lethality in the epithelioma papulosum cyprinid (EPC cells in cell culture, which gave hint to the inefficient reprogramming events occurred in cloned embryos. Conclusion Taken together, our findings indicated that K31 gene is a novel gene differentially expressed in fish cross-subfamily cloned embryos and over-expression of K31 gene can cause lethality of cultured fish cells. To our knowledge, this is the first report on the determination of novel genes involved in nucleo-cytoplasmic interaction of fish cross-subfamily cloned embryos.

  13. Correction of gene expression data

    DEFF Research Database (Denmark)

    Darbani Shirvanehdeh, Behrooz; Stewart, C. Neal, Jr.; Noeparvar, Shahin

    2014-01-01

    This report investigates for the first time the potential inter-treatment bias source of cell number for gene expression studies. Cell-number bias can affect gene expression analysis when comparing samples with unequal total cellular RNA content or with different RNA extraction efficiencies....... For maximal reliability of analysis, therefore, comparisons should be performed at the cellular level. This could be accomplished using an appropriate correction method that can detect and remove the inter-treatment bias for cell-number. Based on inter-treatment variations of reference genes, we introduce...

  14. Gene expression in colorectal cancer

    DEFF Research Database (Denmark)

    Birkenkamp-Demtroder, Karin; Christensen, Lise Lotte; Olesen, Sanne Harder

    2002-01-01

    Understanding molecular alterations in colorectal cancer (CRC) is needed to define new biomarkers and treatment targets. We used oligonucleotide microarrays to monitor gene expression of about 6,800 known genes and 35,000 expressed sequence tags (ESTs) on five pools (four to six samples in each...... pool) of total RNA from left-sided sporadic colorectal carcinomas. We compared normal tissue to carcinoma tissue from Dukes' stages A-D (noninvasive to distant metastasis) and identified 908 known genes and 4,155 ESTs that changed remarkably from normal to tumor tissue. Based on intensive filtering 226...

  15. Prostate cancer metastasis-driving genes: hurdles and potential approaches in their identification

    Directory of Open Access Journals (Sweden)

    Yan Ting Chiang

    2014-08-01

    Full Text Available Metastatic prostate cancer is currently incurable. Metastasis is thought to result from changes in the expression of specific metastasis-driving genes in nonmetastatic prostate cancer tissue, leading to a cascade of activated downstream genes that set the metastatic process in motion. Such genes could potentially serve as effective therapeutic targets for improved management of the disease. They could be identified by comparative analysis of gene expression profiles of patient-derived metastatic and nonmetastatic prostate cancer tissues to pinpoint genes showing altered expression, followed by determining whether silencing of such genes can lead to inhibition of metastatic properties. Various hurdles encountered in this approach are discussed, including (i the need for clinically relevant, nonmetastatic and metastatic prostate cancer tissues such as xenografts of patients' prostate cancers developed via subrenal capsule grafting technology and (ii limitations in the currently available methodology for identification of master regulatory genes.

  16. Shared control of gene expression in bacteria by transcription factors and global physiology of the cell.

    NARCIS (Netherlands)

    Berthoumieux, S.; Jong, H. de; Baptist, G.; Pinel, C.; Ranquet, C.; Ropers, D.; Geiselmann, J.

    2013-01-01

    Gene expression is controlled by the joint effect of (i) the global physiological state of the cell, in particular the activity of the gene expression machinery, and (ii) DNA-binding transcription factors and other specific regulators. We present a model-based approach to distinguish between these

  17. Identification of heterogeneity among soft tissue sarcomas by gene expression profiles from different tumors

    Directory of Open Access Journals (Sweden)

    Skubitz Amy PN

    2008-05-01

    Full Text Available Abstract The heterogeneity that soft tissue sarcomas (STS exhibit in their clinical behavior, even within histological subtypes, complicates patient care. Histological appearance is determined by gene expression. Morphologic features are generally good predictors of biologic behavior, however, metastatic propensity, tumor growth, and response to chemotherapy may be determined by gene expression patterns that do not correlate well with morphology. One approach to identify heterogeneity is to search for genetic markers that correlate with differences in tumor behavior. Alternatively, subsets may be identified based on gene expression patterns alone, independent of knowledge of clinical outcome. We have reported gene expression patterns that distinguish two subgroups of clear cell renal carcinoma (ccRCC, and other gene expression patterns that distinguish heterogeneity of serous ovarian carcinoma (OVCA and aggressive fibromatosis (AF. In this study, gene expression in 53 samples of STS and AF [including 16 malignant fibrous histiocytoma (MFH, 9 leiomyosarcoma, 12 liposarcoma, 4 synovial sarcoma, and 12 samples of AF] was determined at Gene Logic Inc. (Gaithersburg, MD using Affymetrix GeneChip® U_133 arrays containing approximately 40,000 genes/ESTs. Gene expression analysis was performed with the Gene Logic Genesis Enterprise System® Software and Expressionist software. Hierarchical clustering of the STS using our three previously reported gene sets, each generated subgroups within the STS that for some subtypes correlated with histology, and also suggested the existence of subsets of MFH. All three gene sets also recognized the same two subsets of the fibromatosis samples that we had found in our earlier study of AF. These results suggest that these subgroups may have biological significance, and that these gene sets may be useful for sub-classification of STS. In addition, several genes that are targets of some anti-tumor drugs were found to

  18. Stochastic modeling for the expression of a gene regulated by competing transcription factors.

    Directory of Open Access Journals (Sweden)

    Hsih-Te Yang

    Full Text Available It is widely accepted that gene expression regulation is a stochastic event. The common approach for its computer simulation requires detailed information on the interactions of individual molecules, which is often not available for the analyses of biological experiments. As an alternative approach, we employed a more intuitive model to simulate the experimental result, the Markov-chain model, in which a gene is regulated by activators and repressors, which bind the same site in a mutually exclusive manner. Our stochastic simulation in the presence of both activators and repressors predicted a Hill-coefficient of the dose-response curve closer to the experimentally observed value than the calculated value based on the simple additive effects of activators alone and repressors alone. The simulation also reproduced the heterogeneity of gene expression levels among individual cells observed by Fluorescence Activated Cell Sorting analysis. Therefore, our approach may help to apply stochastic simulations to broader experimental data.

  19. Fitness Effects of Network Non-Linearity Induced by Gene Expression Noise

    Science.gov (United States)

    Ray, Christian; Cooper, Tim; Balazsi, Gabor

    2012-02-01

    In the non-equilibrium dynamics of growing microbial cells, metabolic enzymes can create non-linearities in metabolite concentration because of non-linear degradation (utilization): an enzyme can saturate in the process of metabolite utilization. Increasing metabolite production past the saturation point then results in an ultrasensitive metabolite response. If the production rate of a metabolite depends on a second enzyme or other protein-mediated process, uncorrelated gene expression noise can thus cause transient metabolite concentration bursts. Such bursts are physiologically unnecessary and may represent a source of selection against the ultrasensitive switch, especially if the fluctuating metabolic intermediate is toxic. Selection may therefore favor correlated gene expression fluctuations for enzymes in the same pathway, such as by same-operon membership in bacteria. Using a modified experimental lac operon system, we are undertaking a combined theoretical-experimental approach to demonstrate that (i) the lac operon has an implicit ultrasensitive switch that we predict is avoided by gene expression correlations induced by same-operon membership; (ii) bacterial growth rates are sensitive to crossing the ultrasensitive threshold. Our results suggest that correlations in intrinsic gene expression noise are exploited by evolution to ameliorate the detrimental effects of nonlinearities in metabolite concentrations.

  20. Vascular Gene Expression: A Hypothesis

    Directory of Open Access Journals (Sweden)

    Angélica Concepción eMartínez-Navarro

    2013-07-01

    Full Text Available The phloem is the conduit through which photoassimilates are distributed from autotrophic to heterotrophic tissues and is involved in the distribution of signaling molecules that coordinate plant growth and responses to the environment. Phloem function depends on the coordinate expression of a large array of genes. We have previously identified conserved motifs in upstream regions of the Arabidopsis genes, encoding the homologs of pumpkin phloem sap mRNAs, displaying expression in vascular tissues. This tissue-specific expression in Arabidopsis is predicted by the overrepresentation of GA/CT-rich motifs in gene promoters. In this work we have searched for common motifs in upstream regions of the homologous genes from plants considered to possess a primitive vascular tissue (a lycophyte, as well as from others that lack a true vascular tissue (a bryophyte, and finally from chlorophytes. Both lycophyte and bryophyte display motifs similar to those found in Arabidopsis with a significantly low E-value, while the chlorophytes showed either a different conserved motif or no conserved motif at all. These results suggest that these same genes are expressed coordinately in non- vascular plants; this coordinate expression may have been one of the prerequisites for the development of conducting tissues in plants. We have also analyzed the phylogeny of conserved proteins that may be involved in phloem function and development. The presence of CmPP16, APL, FT and YDA in chlorophytes suggests the recruitment of ancient regulatory networks for the development of the vascular tissue during evolution while OPS is a novel protein specific to vascular plants.

  1. A Microchip for Integrated Single-Cell Gene Expression Profiling and Genotoxicity Detection

    Directory of Open Access Journals (Sweden)

    Hui Dong

    2016-09-01

    Full Text Available Microfluidics-based single-cell study is an emerging approach in personalized treatment or precision medicine studies. Single-cell gene expression holds a potential to provide treatment selections with maximized efficacy to help cancer patients based on a genetic understanding of their disease. This work presents a multi-layer microchip for single-cell multiplexed gene expression profiling and genotoxicity detection. Treated by three drug reagents (i.e., methyl methanesulfonate, docetaxel and colchicine with varied concentrations and time lengths, individual human cancer cells (MDA-MB-231 are lysed on-chip, and the released mRNA templates are captured and reversely transcribed into single strand DNA. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH, cyclin-dependent kinase inhibitor 1A (CDKN1A, and aurora kinase A (AURKA genes from single cells are amplified and real-time quantified through multiplex polymerase chain reaction. The microchip is capable of integrating all steps of single-cell multiplexed gene expression profiling, and providing precision detection of drug induced genotoxic stress. Throughput has been set to be 18, and can be further increased following the same approach. Numerical simulation of on-chip single cell trapping and heat transfer has been employed to evaluate the chip design and operation.

  2. The evolution of gene expression in primates

    OpenAIRE

    Tashakkori Ghanbarian, Avazeh

    2015-01-01

    The evolution of a gene’s expression profile is commonly assumed to be independent of its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between expression of neighboring genes in extant taxa. Indeed, in all eukaryotic genomes, genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their e...

  3. Low pH induces co-ordinate regulation of gene expression in oesophageal cells.

    Science.gov (United States)

    Duggan, Shane P; Gallagher, William M; Fox, Edward J P; Abdel-Latif, Mohammed M; Reynolds, John V; Kelleher, Dermot

    2006-02-01

    The development of gastro-oesophageal reflux disease (GORD) is known to be a causative risk factor in the evolution of adenocarcinoma of the oesophagus. The major component of this reflux is gastric acid. However, the impact of low pH on gene expression has not been extensively studied in oesophageal cells. This study utilizes a transcriptomic and bioinformatic approach to assess regulation of gene expression in response to low pH. In more detail, oesophageal adenocarcinoma cell lines were exposed to a range of pH environments. Affymetrix microarrays were used for gene-expression analysis and results were validated using cycle limitation and real-time RT-PCR analysis, as well as northern and western blotting. Comparative promoter transcription factor binding site (TFBS) analysis (MatInspector) of hierarchically clustered gene-expression data was employed to identify the elements which may co-ordinately regulate individual gene clusters. Initial experiments demonstrated maximal induction of EGR1 gene expression at pH 6.5. Subsequent array experimentation revealed significant induction of gene expression from such functional categories as DNA damage response (EGR1-4, ATF3) and cell-cycle control (GADD34, GADD45, p57). Changes in expression of EGR1, EGR3, ATF3, MKP-1, FOSB, CTGF and CYR61 were verified in separate experiments and in a variety of oesophageal cell lines. TFBS analysis of promoters identified transcription factors that may co-ordinately regulate gene-expression clusters, Cluster 1: Oct-1, AP4R; Cluster 2: NF-kB, EGRF; Cluster 3: IKRS, AP-1F. Low pH has the ability to induce genes and pathways which can provide an environment suitable for the progression of malignancy. Further functional analysis of the genes and clusters identified in this low pH study is likely to lead to new insights into the pathogenesis and therapeutics of GORD and oesophageal cancer.

  4. Interspecies Systems Biology Uncovers Metabolites Affecting C. elegans Gene Expression and Life History Traits

    Science.gov (United States)

    Watson, Emma; MacNeil, Lesley T.; Ritter, Ashlyn D.; Yilmaz, L. Safak; Rosebrock, Adam P.; Caudy, Amy A.; Walhout, Albertha J. M.

    2014-01-01

    SUMMARY Diet greatly influences gene expression and physiology. In mammals, elucidating the effects and mechanisms of individual nutrients is challenging due to the complexity of both the animal and its diet. Here we used an interspecies systems biology approach with Caenorhabditis elegans and two if its bacterial diets, Escherichia coli and Comamonas aquatica, to identify metabolites that affect the animal’s gene expression and physiology. We identify vitamin B12 as the major dilutable metabolite provided by Comamonas aq. that regulates gene expression, accelerates development and reduces fertility, but does not affect lifespan. We find that vitamin B12 has a dual role in the animal: it affects development and fertility via the methionine/S-Adenosylmethionine (SAM) cycle and breaks down the short-chain fatty acid propionic acid preventing its toxic buildup. Our interspecies systems biology approach provides a paradigm for understanding complex interactions between diet and physiology. PMID:24529378

  5. Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

    Science.gov (United States)

    Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

    2017-01-01

    Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677

  6. Decomposition of gene expression state space trajectories.

    Directory of Open Access Journals (Sweden)

    Jessica C Mar

    2009-12-01

    Full Text Available Representing and analyzing complex networks remains a roadblock to creating dynamic network models of biological processes and pathways. The study of cell fate transitions can reveal much about the transcriptional regulatory programs that underlie these phenotypic changes and give rise to the coordinated patterns in expression changes that we observe. The application of gene expression state space trajectories to capture cell fate transitions at the genome-wide level is one approach currently used in the literature. In this paper, we analyze the gene expression dataset of Huang et al. (2005 which follows the differentiation of promyelocytes into neutrophil-like cells in the presence of inducers dimethyl sulfoxide and all-trans retinoic acid. Huang et al. (2005 build on the work of Kauffman (2004 who raised the attractor hypothesis, stating that cells exist in an expression landscape and their expression trajectories converge towards attractive sites in this landscape. We propose an alternative interpretation that explains this convergent behavior by recognizing that there are two types of processes participating in these cell fate transitions-core processes that include the specific differentiation pathways of promyelocytes to neutrophils, and transient processes that capture those pathways and responses specific to the inducer. Using functional enrichment analyses, specific biological examples and an analysis of the trajectories and their core and transient components we provide a validation of our hypothesis using the Huang et al. (2005 dataset.

  7. Extensive changes in the expression of the opioid genes between humans and chimpanzees.

    Science.gov (United States)

    Cruz-Gordillo, Peter; Fedrigo, Olivier; Wray, Gregory A; Babbitt, Courtney C

    2010-01-01

    The various means by which the body perceives, transmits, and resolves the experiences of pain and nociception are mediated by a host of molecules, including neuropeptides within the opioid gene signaling pathway. The peptide ligands and receptors encoded by this group of genes have been linked to behavioral disorders as well as a number of psychiatric affective disorders. Our aim was to explore the recent evolutionary history of these two gene families by taking a comparative genomics approach, specifically through a comparison between humans and chimpanzees. Our analyses indicate differential expression of these genes between the two species, more than expected based on genome-wide comparisons, indicating that differential expression is pervasive among the opioid genes. Of the 8 family members, three genes showed significant expression differences (PENK, PNOC, and OPRL1), with two others marginally significant (OPRM1 and OPRD1). Accelerated substitution rates along human and chimpanzee lineages within the putative regulatory regions of OPRM1, POMC, and PDYN between the human and chimpanzee branches are consistent with positive selection. Collectively, these results suggest that there may have been a selective advantage to modulating the expression of the opioid genes in humans compared with our closest living relatives. Information about the cognitive roles mediated by these genes in humans may help to elucidate the trait consequences of these putatively adaptive expression changes. Copyright © 2010 S. Karger AG, Basel.

  8. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    International Nuclear Information System (INIS)

    Salem, Tamer Z.; Zhang, Fengrui; Thiem, Suzanne M.

    2013-01-01

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  9. Reduced expression of Autographa californica nucleopolyhedrovirus ORF34, an essential gene, enhances heterologous gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Salem, Tamer Z. [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbial Molecular Biology, AGERI, Agricultural Research Center, Giza 12619 (Egypt); Division of Biomedical Sciences, Zewail University, Zewail City of Science and Technology, Giza 12588 (Egypt); Zhang, Fengrui [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Thiem, Suzanne M., E-mail: smthiem@msu.edu [Department of Entomology, Michigan State University, East Lansing, MI 48824 (United States); Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824 (United States)

    2013-01-20

    Autographa californica multiple nucleopolyhedrovirus ORF34 is part of a transcriptional unit that includes ORF32, encoding a viral fibroblast growth factor (FGF) and ORF33. We identified ORF34 as a candidate for deletion to improve protein expression in the baculovirus expression system based on enhanced reporter gene expression in an RNAi screen of virus genes. However, ORF34 was shown to be an essential gene. To explore ORF34 function, deletion (KO34) and rescue bacmids were constructed and characterized. Infection did not spread from primary KO34 transfected cells and supernatants from KO34 transfected cells could not infect fresh Sf21 cells whereas the supernatant from the rescue bacmids transfection could recover the infection. In addition, budded viruses were not observed in KO34 transfected cells by electron microscopy, nor were viral proteins detected from the transfection supernatants by western blots. These demonstrate that ORF34 is an essential gene with a possible role in infectious virus production.

  10. CRISPR Perturbation of Gene Expression Alters Bacterial Fitness under Stress and Reveals Underlying Epistatic Constraints.

    Science.gov (United States)

    Otoupal, Peter B; Erickson, Keesha E; Escalas-Bordoy, Antoni; Chatterjee, Anushree

    2017-01-20

    The evolution of antibiotic resistance has engendered an impending global health crisis that necessitates a greater understanding of how resistance emerges. The impact of nongenetic factors and how they influence the evolution of resistance is a largely unexplored area of research. Here we present a novel application of CRISPR-Cas9 technology for investigating how gene expression governs the adaptive pathways available to bacteria during the evolution of resistance. We examine the impact of gene expression changes on bacterial adaptation by constructing a library of deactivated CRISPR-Cas9 synthetic devices to tune the expression of a set of stress-response genes in Escherichia coli. We show that artificially inducing perturbations in gene expression imparts significant synthetic control over fitness and growth during stress exposure. We present evidence that these impacts are reversible; strains with synthetically perturbed gene expression regained wild-type growth phenotypes upon stress removal, while maintaining divergent growth characteristics under stress. Furthermore, we demonstrate a prevailing trend toward negative epistatic interactions when multiple gene perturbations are combined simultaneously, thereby posing an intrinsic constraint on gene expression underlying adaptive trajectories. Together, these results emphasize how CRISPR-Cas9 can be employed to engineer gene expression changes that shape bacterial adaptation, and present a novel approach to synthetically control the evolution of antimicrobial resistance.

  11. Genes expressed in specific areas of the human fetal cerebral cortex display distinct patterns of evolution.

    Directory of Open Access Journals (Sweden)

    Nelle Lambert

    2011-03-01

    Full Text Available The developmental mechanisms through which the cerebral cortex increased in size and complexity during primate evolution are essentially unknown. To uncover genetic networks active in the developing cerebral cortex, we combined three-dimensional reconstruction of human fetal brains at midgestation and whole genome expression profiling. This novel approach enabled transcriptional characterization of neurons from accurately defined cortical regions containing presumptive Broca and Wernicke language areas, as well as surrounding associative areas. We identified hundreds of genes displaying differential expression between the two regions, but no significant difference in gene expression between left and right hemispheres. Validation by qRTPCR and in situ hybridization confirmed the robustness of our approach and revealed novel patterns of area- and layer-specific expression throughout the developing cortex. Genes differentially expressed between cortical areas were significantly associated with fast-evolving non-coding sequences harboring human-specific substitutions that could lead to divergence in their repertoires of transcription factor binding sites. Strikingly, while some of these sequences were accelerated in the human lineage only, many others were accelerated in chimpanzee and/or mouse lineages, indicating that genes important for cortical development may be particularly prone to changes in transcriptional regulation across mammals. Genes differentially expressed between cortical regions were also enriched for transcriptional targets of FoxP2, a key gene for the acquisition of language abilities in humans. Our findings point to a subset of genes with a unique combination of cortical areal expression and evolutionary patterns, suggesting that they play important roles in the transcriptional network underlying human-specific neural traits.

  12. Widespread ectopic expression of olfactory receptor genes

    Directory of Open Access Journals (Sweden)

    Yanai Itai

    2006-05-01

    Full Text Available Abstract Background Olfactory receptors (ORs are the largest gene family in the human genome. Although they are expected to be expressed specifically in olfactory tissues, some ectopic expression has been reported, with special emphasis on sperm and testis. The present study systematically explores the expression patterns of OR genes in a large number of tissues and assesses the potential functional implication of such ectopic expression. Results We analyzed the expression of hundreds of human and mouse OR transcripts, via EST and microarray data, in several dozens of human and mouse tissues. Different tissues had specific, relatively small OR gene subsets which had particularly high expression levels. In testis, average expression was not particularly high, and very few highly expressed genes were found, none corresponding to ORs previously implicated in sperm chemotaxis. Higher expression levels were more common for genes with a non-OR genomic neighbor. Importantly, no correlation in expression levels was detected for human-mouse orthologous pairs. Also, no significant difference in expression levels was seen between intact and pseudogenized ORs, except for the pseudogenes of subfamily 7E which has undergone a human-specific expansion. Conclusion The OR superfamily as a whole, show widespread, locus-dependent and heterogeneous expression, in agreement with a neutral or near neutral evolutionary model for transcription control. These results cannot reject the possibility that small OR subsets might play functional roles in different tissues, however considerable care should be exerted when offering a functional interpretation for ectopic OR expression based only on transcription information.

  13. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  14. MeSH key terms for validation and annotation of gene expression clusters

    Energy Technology Data Exchange (ETDEWEB)

    Rechtsteiner, A. (Andreas); Rocha, L. M. (Luis Mateus)

    2004-01-01

    Integration of different sources of information is a great challenge for the analysis of gene expression data, and for the field of Functional Genomics in general. As the availability of numerical data from high-throughput methods increases, so does the need for technologies that assist in the validation and evaluation of the biological significance of results extracted from these data. In mRNA assaying with microarrays, for example, numerical analysis often attempts to identify clusters of co-expressed genes. The important task to find the biological significance of the results and validate them has so far mostly fallen to the biological expert who had to perform this task manually. One of the most promising avenues to develop automated and integrative technology for such tasks lies in the application of modern Information Retrieval (IR) and Knowledge Management (KM) algorithms to databases with biomedical publications and data. Examples of databases available for the field are bibliographic databases c ntaining scientific publications (e.g. MEDLINE/PUBMED), databases containing sequence data (e.g. GenBank) and databases of semantic annotations (e.g. the Gene Ontology Consortium and Medical Subject Headings (MeSH)). We present here an approach that uses the MeSH terms and their concept hierarchies to validate and obtain functional information for gene expression clusters. The controlled and hierarchical MeSH vocabulary is used by the National Library of Medicine (NLM) to index all the articles cited in MEDLINE. Such indexing with a controlled vocabulary eliminates some of the ambiguity due to polysemy (terms that have multiple meanings) and synonymy (multiple terms have similar meaning) that would be encountered if terms would be extracted directly from the articles due to differing article contexts or author preferences and background. Further, the hierarchical organization of the MeSH terms can illustrate the conceptuallfunctional relationships of genes

  15. Network analysis of differential expression for the identification of disease-causing genes.

    Directory of Open Access Journals (Sweden)

    Daniela Nitsch

    Full Text Available Genetic studies (in particular linkage and association studies identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved. We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease causing genes.

  16. Dynamic association rules for gene expression data analysis.

    Science.gov (United States)

    Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

    2015-10-14

    The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed

  17. Gene expression in periodontal tissues following treatment

    Directory of Open Access Journals (Sweden)

    Eisenacher Martin

    2008-07-01

    Full Text Available Abstract Background In periodontitis, treatment aimed at controlling the periodontal biofilm infection results in a resolution of the clinical and histological signs of inflammation. Although the cell types found in periodontal tissues following treatment have been well described, information on gene expression is limited to few candidate genes. Therefore, the aim of the study was to determine the expression profiles of immune and inflammatory genes in periodontal tissues from sites with severe chronic periodontitis following periodontal therapy in order to identify genes involved in tissue homeostasis. Gingival biopsies from 12 patients with severe chronic periodontitis were taken six to eight weeks following non-surgical periodontal therapy, and from 11 healthy controls. As internal standard, RNA of an immortalized human keratinocyte line (HaCaT was used. Total RNA was subjected to gene expression profiling using a commercially available microarray system focusing on inflammation-related genes. Post-hoc confirmation of selected genes was done by Realtime-PCR. Results Out of the 136 genes analyzed, the 5% most strongly expressed genes compared to healthy controls were Interleukin-12A (IL-12A, Versican (CSPG-2, Matrixmetalloproteinase-1 (MMP-1, Down syndrome critical region protein-1 (DSCR-1, Macrophage inflammatory protein-2β (Cxcl-3, Inhibitor of apoptosis protein-1 (BIRC-1, Cluster of differentiation antigen 38 (CD38, Regulator of G-protein signalling-1 (RGS-1, and Finkel-Biskis-Jinkins murine osteosarcoma virus oncogene (C-FOS; the 5% least strongly expressed genes were Receptor-interacting Serine/Threonine Kinase-2 (RIP-2, Complement component 3 (C3, Prostaglandin-endoperoxide synthase-2 (COX-2, Interleukin-8 (IL-8, Endothelin-1 (EDN-1, Plasminogen activator inhibitor type-2 (PAI-2, Matrix-metalloproteinase-14 (MMP-14, and Interferon regulating factor-7 (IRF-7. Conclusion Gene expression profiles found in periodontal tissues following

  18. A novel bidirectional expression system for simultaneous expression of both the protein-coding genes and short hairpin RNAs in mammalian cells

    International Nuclear Information System (INIS)

    Hung, C.-F.; Cheng, T.-L.; Wu, R.-H.; Teng, C.-F.; Chang, W.-T.

    2006-01-01

    RNA interference (RNAi) is an extremely powerful and widely used gene silencing approach for reverse functional genomics and molecular therapeutics. In mammals, the conserved poly(ADP-ribose) polymerase 2 (PARP-2)/RNase P bidirectional control promoter simultaneously expresses both the PARP-2 protein and RNase P RNA by RNA polymerase II- and III-dependent mechanisms, respectively. To explore this unique bidirectional control system in RNAi-mediated gene silencing strategy, we have constructed two novel bidirectional expression vectors, pbiHsH1 and pbiMmH1, which contained the PARP-2/RNase P bidirectional control promoters from human and mouse, for simultaneous expression of both the protein-coding genes and short hairpin RNAs. Analyses of the dual transcriptional activities indicated that these two bidirectional expression vectors could not only express enhanced green fluorescent protein as a functional reporter but also simultaneously transcribe shLuc for inhibiting the firefly luciferase expression. In addition, to extend its utility for the establishment of inherited stable clones, we have also reconstructed this bidirectional expression system with the blasticidin S deaminase gene, an effective dominant drug resistance selectable marker, and examined both the selection and inhibition efficiencies in drug resistance and gene expression. Moreover, we have further demonstrated that this bidirectional expression system could efficiently co-regulate the functionally important genes, such as overexpression of tumor suppressor protein p53 and inhibition of anti-apoptotic protein Bcl-2 at the same time. In summary, the bidirectional expression vectors, pbiHsH1 and pbiMmH1, should provide a simple, convenient, and efficient novel tool for manipulating the gene function in mammalian cells

  19. Gene expression profiles in skeletal muscle after gene electrotransfer

    DEFF Research Database (Denmark)

    Hojman, Pernille; Zibert, John R; Gissel, Hanne

    2007-01-01

    BACKGROUND: Gene transfer by electroporation (DNA electrotransfer) to muscle results in high level long term transgenic expression, showing great promise for treatment of e.g. protein deficiency syndromes. However little is known about the effects of DNA electrotransfer on muscle fibres. We have...... caused down-regulation of structural proteins e.g. sarcospan and catalytic enzymes. Injection of DNA induced down-regulation of intracellular transport proteins e.g. sentrin. The effects on muscle fibres were transient as the expression profiles 3 weeks after treatment were closely related......) followed by a long low voltage pulse (LV, 100 V/cm, 400 ms); a pulse combination optimised for efficient and safe gene transfer. Muscles were transfected with green fluorescent protein (GFP) and excised at 4 hours, 48 hours or 3 weeks after treatment. RESULTS: Differentially expressed genes were...

  20. Comparative gene expression between two yeast species

    Directory of Open Access Journals (Sweden)

    Guan Yuanfang

    2013-01-01

    Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.

  1. Spatio Temporal Expression Pattern of an Insecticidal Gene (cry2A in Transgenic Cotton Lines

    Directory of Open Access Journals (Sweden)

    Allah BAKHSH

    2012-11-01

    Full Text Available The production of transgenic plants with stable, high-level transgene expression is important for the success of crop improvement programs based on genetic engineering. The present study was conducted to evaluate genomic integration and spatio temporal expression of an insecticidal gene (cry2A in pre-existing transgenic lines of cotton. Genomic integration of cry2A was evaluated using various molecular approaches. The expression levels of cry2A were determined at vegetative and reproductive stages of cotton at regular intervals. These lines showed a stable integration of insecticidal gene in advance lines of transgenic cotton whereas gene expression was found variable with at various growth stages as well as in different plant parts throughout the season. The leaves of transgenic cotton were found to have maximum expression of cry2A gene followed by squares, bolls, anthers and petals. The protein level in fruiting part was less as compared to other parts showing inconsistency in gene expression. It was concluded that for culturing of transgenic crops, strategies should be developed to ensure the foreign genes expression efficient, consistent and in a predictable manner.

  2. Interactive visualization of gene regulatory networks with associated gene expression time series data

    NARCIS (Netherlands)

    Westenberg, M.A.; Hijum, van S.A.F.T.; Lulko, A.T.; Kuipers, O.P.; Roerdink, J.B.T.M.; Linsen, L.; Hagen, H.; Hamann, B.

    2008-01-01

    We present GENeVis, an application to visualize gene expression time series data in a gene regulatory network context. This is a network of regulator proteins that regulate the expression of their respective target genes. The networks are represented as graphs, in which the nodes represent genes,

  3. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    Science.gov (United States)

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis

  4. Optimization of Agrobacterium-mediated transient expression of heterologous genes in spinach

    DEFF Research Database (Denmark)

    Cao, Dang Viet; Pamplona, Reniel S.; Kim, Jiwon

    2017-01-01

    The Agrobacterium-mediated transient assay is a relatively rapid technique and a promising approach for assessing the expression of a gene of interest. Despite the successful application of this transient expression system in several plant species, it is not well understood in spinach. In this st...

  5. Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

    Science.gov (United States)

    Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

    2012-02-01

    The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.

  6. Serial analysis of gene expression (SAGE)

    NARCIS (Netherlands)

    van Ruissen, Fred; Baas, Frank

    2007-01-01

    In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE

  7. Evaluation of phenoxybenzamine in the CFA model of pain following gene expression studies and connectivity mapping.

    Science.gov (United States)

    Chang, Meiping; Smith, Sarah; Thorpe, Andrew; Barratt, Michael J; Karim, Farzana

    2010-09-16

    We have previously used the rat 4 day Complete Freund's Adjuvant (CFA) model to screen compounds with potential to reduce osteoarthritic pain. The aim of this study was to identify genes altered in this model of osteoarthritic pain and use this information to infer analgesic potential of compounds based on their own gene expression profiles using the Connectivity Map approach. Using microarrays, we identified differentially expressed genes in L4 and L5 dorsal root ganglia (DRG) from rats that had received intraplantar CFA for 4 days compared to matched, untreated control animals. Analysis of these data indicated that the two groups were distinguishable by differences in genes important in immune responses, nerve growth and regeneration. This list of differentially expressed genes defined a "CFA signature". We used the Connectivity Map approach to identify pharmacologic agents in the Broad Institute Build02 database that had gene expression signatures that were inversely related ('negatively connected') with our CFA signature. To test the predictive nature of the Connectivity Map methodology, we tested phenoxybenzamine (an alpha adrenergic receptor antagonist) - one of the most negatively connected compounds identified in this database - for analgesic activity in the CFA model. Our results indicate that at 10 mg/kg, phenoxybenzamine demonstrated analgesia comparable to that of Naproxen in this model. Evaluation of phenoxybenzamine-induced analgesia in the current study lends support to the utility of the Connectivity Map approach for identifying compounds with analgesic properties in the CFA model.

  8. An Interactive Database of Cocaine-Responsive Gene Expression

    Directory of Open Access Journals (Sweden)

    Willard M. Freeman

    2002-01-01

    Full Text Available The postgenomic era of large-scale gene expression studies is inundating drug abuse researchers and many other scientists with findings related to gene expression. This information is distributed across many different journals, and requires laborious literature searches. Here, we present an interactive database that combines existing information related to cocaine-mediated changes in gene expression in an easy-to-use format. The database is limited to statistically significant changes in mRNA or protein expression after cocaine administration. The Flash-based program is integrated into a Web page, and organizes changes in gene expression based on neuroanatomical region, general function, and gene name. Accompanying each gene is a description of the gene, links to the original publications, and a link to the appropriate OMIM (Online Mendelian Inheritance in Man entry. The nature of this review allows for timely modifications and rapid inclusion of new publications, and should help researchers build second-generation hypotheses on the role of gene expression changes in the physiology and behavior of cocaine abuse. Furthermore, this method of organizing large volumes of scientific information can easily be adapted to assist researchers in fields outside of drug abuse.

  9. CDX2 gene expression in acute lymphoblastic leukemia

    International Nuclear Information System (INIS)

    Arnaoaut, H.H.; Mokhtar, D.A.; Samy, R.M.; Omar, Sh.A.; Khames, S.A.

    2014-01-01

    CDX genes are classically known as regulators of axial elongation during early embryogenesis. An unsuspected role for CDX genes has been revealed during hematopoietic development. The CDX gene family member CDX2 belongs to the most frequent aberrantly expressed proto-oncogenes in human acute leukemias and is highly leukemogenic in experimental models. We used reversed transcriptase polymerase chain reaction (RT-PCR) to determine the expression level of CDX2 gene in 30 pediatric patients with acute lymphoblastic leukemia (ALL) at diagnosis and 30 healthy volunteers. ALL patients were followed up to detect minimal residual disease (MRD) on days 15 and 42 of induction. We found that CDX2 gene was expressed in 50% of patients and not expressed in controls. Associations between gene expression and different clinical and laboratory data of patients revealed no impact on different findings. With follow up, we could not confirm that CDX2 expression had a prognostic significance.

  10. Identification of reference genes in human myelomonocytic cells for gene expression studies in altered gravity.

    Science.gov (United States)

    Thiel, Cora S; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Unverdorben, Felix; Buttron, Isabell; Lauber, Beatrice; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E; Ullrich, Oliver

    2015-01-01

    Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes ("housekeeping genes") are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity.

  11. Interspecies systems biology uncovers metabolites affecting C. elegans gene expression and life history traits.

    Science.gov (United States)

    Watson, Emma; MacNeil, Lesley T; Ritter, Ashlyn D; Yilmaz, L Safak; Rosebrock, Adam P; Caudy, Amy A; Walhout, Albertha J M

    2014-02-13

    Diet greatly influences gene expression and physiology. In mammals, elucidating the effects and mechanisms of individual nutrients is challenging due to the complexity of both the animal and its diet. Here, we used an interspecies systems biology approach with Caenorhabditis elegans and two of its bacterial diets, Escherichia coli and Comamonas aquatica, to identify metabolites that affect the animal's gene expression and physiology. We identify vitamin B12 as the major dilutable metabolite provided by Comamonas aq. that regulates gene expression, accelerates development, and reduces fertility but does not affect lifespan. We find that vitamin B12 has a dual role in the animal: it affects development and fertility via the methionine/S-Adenosylmethionine (SAM) cycle and breaks down the short-chain fatty acid propionic acid, preventing its toxic buildup. Our interspecies systems biology approach provides a paradigm for understanding complex interactions between diet and physiology. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.

    2013-07-18

    The modeling of gene networks from transcriptional expression data is an important tool in biomedical research to reveal signaling pathways and to identify treatment targets. Current gene network modeling is primarily based on the use of Gaussian graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which generate counts of mRNAtranscripts in cell samples.We propose a generalized linear model to fit the discrete gene expression data and assume that the log ratios of the mean expression levels follow a Gaussian distribution.We restrict the gene network structures to decomposable graphs and derive the graphs by selecting the covariance matrix of the Gaussian distribution with the hyper-inverse Wishart priors. Furthermore, we incorporate prior network models based on gene ontology information, which avails existing biological information on the genes of interest. We conduct simulation studies to examine the performance of our discrete graphical model and apply the method to two real datasets for gene network inference. © The Author 2013. Published by Oxford University Press. All rights reserved.

  13. Reference Gene Screening for Analyzing Gene Expression Across Goat Tissue

    Directory of Open Access Journals (Sweden)

    Yu Zhang

    2013-12-01

    Full Text Available Real-time quantitative PCR (qRT-PCR is one of the important methods for investigating the changes in mRNA expression levels in cells and tissues. Selection of the proper reference genes is very important when calibrating the results of real-time quantitative PCR. Studies on the selection of reference genes in goat tissues are limited, despite the economic importance of their meat and dairy products. We used real-time quantitative PCR to detect the expression levels of eight reference gene candidates (18S, TBP, HMBS, YWHAZ, ACTB, HPRT1, GAPDH and EEF1A2 in ten tissues types sourced from Boer goats. The optimal reference gene combination was selected according to the results determined by geNorm, NormFinder and Bestkeeper software packages. The analyses showed that tissue is an important variability factor in genes expression stability. When all tissues were considered, 18S, TBP and HMBS is the optimal reference combination for calibrating quantitative PCR analysis of gene expression from goat tissues. Dividing data set by tissues, ACTB was the most stable in stomach, small intestine and ovary, 18S in heart and spleen, HMBS in uterus and lung, TBP in liver, HPRT1 in kidney and GAPDH in muscle. Overall, this study provided valuable information about the goat reference genes that can be used in order to perform a proper normalisation when relative quantification by qRT-PCR studies is undertaken.

  14. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  15. Study of five novel non-synonymous polymorphisms in human brain-expressed genes in a Colombian sample.

    Science.gov (United States)

    Ojeda, Diego A; Forero, Diego A

    2014-10-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. To study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based in allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPSs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.

  16. A Systems’ Biology Approach to Study MicroRNA-Mediated Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Xin Lai

    2013-01-01

    Full Text Available MicroRNAs (miRNAs are potent effectors in gene regulatory networks where aberrant miRNA expression can contribute to human diseases such as cancer. For a better understanding of the regulatory role of miRNAs in coordinating gene expression, we here present a systems biology approach combining data-driven modeling and model-driven experiments. Such an approach is characterized by an iterative process, including biological data acquisition and integration, network construction, mathematical modeling and experimental validation. To demonstrate the application of this approach, we adopt it to investigate mechanisms of collective repression on p21 by multiple miRNAs. We first construct a p21 regulatory network based on data from the literature and further expand it using algorithms that predict molecular interactions. Based on the network structure, a detailed mechanistic model is established and its parameter values are determined using data. Finally, the calibrated model is used to study the effect of different miRNA expression profiles and cooperative target regulation on p21 expression levels in different biological contexts.

  17. Gene expression of the mismatch repair gene MSH2 in primary colorectal cancer

    DEFF Research Database (Denmark)

    Jensen, Lars Henrik; Kuramochi, Hidekazu; Crüger, Dorthe Gylling

    2011-01-01

    promoter was only detected in 14 samples and only at a low level with no correlation to gene expression. MSH2 gene expression was not a prognostic factor for overall survival in univariate or multivariate analysis. The gene expression of MSH2 is a potential quantitative marker ready for further clinical...

  18. GECKO: a complete large-scale gene expression analysis platform

    Directory of Open Access Journals (Sweden)

    Heuer Michael

    2004-12-01

    Full Text Available Abstract Background Gecko (Gene Expression: Computation and Knowledge Organization is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Results Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing ~ 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph, in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (~ 100 users and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. Conclusions The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.

  19. Use of keyword hierarchies to interpret gene expression patterns.

    Science.gov (United States)

    Masys, D R; Welsh, J B; Lynn Fink, J; Gribskov, M; Klacansky, I; Corbeil, J

    2001-04-01

    High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify clusters of genes that share common expression characteristics, but provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results. We describe a data mining method that uses indexing terms ('keywords') from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes.

  20. ZODET: software for the identification, analysis and visualisation of outlier genes in microarray expression data.

    Directory of Open Access Journals (Sweden)

    Daniel L Roden

    Full Text Available Complex human diseases can show significant heterogeneity between patients with the same phenotypic disorder. An outlier detection strategy was developed to identify variants at the level of gene transcription that are of potential biological and phenotypic importance. Here we describe a graphical software package (z-score outlier detection (ZODET that enables identification and visualisation of gross abnormalities in gene expression (outliers in individuals, using whole genome microarray data. Mean and standard deviation of expression in a healthy control cohort is used to detect both over and under-expressed probes in individual test subjects. We compared the potential of ZODET to detect outlier genes in gene expression datasets with a previously described statistical method, gene tissue index (GTI, using a simulated expression dataset and a publicly available monocyte-derived macrophage microarray dataset. Taken together, these results support ZODET as a novel approach to identify outlier genes of potential pathogenic relevance in complex human diseases. The algorithm is implemented using R packages and Java.The software is freely available from http://www.ucl.ac.uk/medicine/molecular-medicine/publications/microarray-outlier-analysis.

  1. The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

    Directory of Open Access Journals (Sweden)

    Jean-François Gout

    2010-05-01

    Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.

  2. Noise minimization in eukaryotic gene expression.

    Directory of Open Access Journals (Sweden)

    Hunter B Fraser

    2004-06-01

    Full Text Available All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or "noise." Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  3. Noise minimization in eukaryotic gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-15

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection.

  4. Noise minimization in eukaryotic gene expression

    International Nuclear Information System (INIS)

    Fraser, Hunter B.; Hirsh, Aaron E.; Giaever, Guri; Kumm, Jochen; Eisen, Michael B.

    2004-01-01

    All organisms have elaborate mechanisms to control rates of protein production. However, protein production is also subject to stochastic fluctuations, or noise. Several recent studies in Saccharomyces cerevisiae and Escherichia coli have investigated the relationship between transcription and translation rates and stochastic fluctuations in protein levels, or more generally, how such randomness is a function of intrinsic and extrinsic factors. However, the fundamental question of whether stochasticity in protein expression is generally biologically relevant has not been addressed, and it remains unknown whether random noise in the protein production rate of most genes significantly affects the fitness of any organism. We propose that organisms should be particularly sensitive to variation in the protein levels of two classes of genes: genes whose deletion is lethal to the organism and genes that encode subunits of multiprotein complexes. Using an experimentally verified model of stochastic gene expression in S. cerevisiae, we estimate the noise in protein production for nearly every yeast gene, and confirm our prediction that the production of essential and complex-forming proteins involves lower levels of noise than does the production of most other genes. Our results support the hypothesis that noise in gene expression is a biologically important variable, is generally detrimental to organismal fitness, and is subject to natural selection

  5. Identification and comprehensive evaluation of reference genes for RT-qPCR analysis of host gene-expression in Brassica juncea-aphid interaction using microarray data.

    Science.gov (United States)

    Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan

    2017-07-01

    Brassica juncea is a chief oil yielding crop in many parts of the world including India. With advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentations in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation is not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalization of the gene-expression data of STP4 and PR1 by the most and the least stable reference gene, respectively has demonstrated importance and applicability of the recommended reference genes in aphid infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  6. GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

    Science.gov (United States)

    Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

    2016-12-05

    Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three

  7. Positive selection on gene expression in the human brain

    DEFF Research Database (Denmark)

    Khaitovich, Philipp; Tang, Kun; Franz, Henriette

    2006-01-01

    Recent work has shown that the expression levels of genes transcribed in the brains of humans and chimpanzees have changed less than those of genes transcribed in other tissues [1] . However, when gene expression changes are mapped onto the evolutionary lineage in which they occurred, the brain...... shows more changes than other tissues in the human lineage compared to the chimpanzee lineage [1] , [2] and [3] . There are two possible explanations for this: either positive selection drove more gene expression changes to fixation in the human brain than in the chimpanzee brain, or genes expressed...... in the brain experienced less purifying selection in humans than in chimpanzees, i.e. gene expression in the human brain is functionally less constrained. The first scenario would be supported if genes that changed their expression in the brain in the human lineage showed more selective sweeps than other genes...

  8. Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2006-01-01

    Full Text Available Background: Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results: We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM with those in the probeset-level model (PSLM. Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many

  9. Assays for noninvasive imaging of reporter gene expression

    International Nuclear Information System (INIS)

    Gambhir, S.S.; Barrio, J.R.; Herschman, H.R.; Phelps, M.E.

    1999-01-01

    Repeated, noninvasive imaging of reporter gene expression is emerging as a valuable tool for monitoring the expression of genes in animals and humans. Monitoring of organ/cell transplantation in living animals and humans, and the assessment of environmental, behavioral, and pharmacologic modulation of gene expression in transgenic animals should soon be possible. The earliest clinical application is likely to be monitoring human gene therapy in tumors transduced with the herpes simplex virus type 1 thymidine kinase (HSV1-tk) suicide gene. Several candidate assays for imaging reporter gene expression have been studied, utilizing cytosine deaminase (CD), HSV1-tk, and dopamine 2 receptor (D2R) as reporter genes. For the HSV1-tk reporter gene, both uracil nucleoside derivatives (e.g., 5-iodo-2'-fluoro-2'-deoxy-1-β-D-arabinofuranosyl-5-iodouracil [FIAU] labeled with 124 I, 131 I ) and acycloguanosine derivatives {e.g., 8-[ 18 F]fluoro-9-[[2-hydroxy-1-(hydroxymethyl)ethoxy]methyl]guanine (8-[ 18 F]-fluoroganciclovir) ([ 18 F]FGCV), 9-[(3-[ 18 F]fluoro-1-hydroxy-2-propoxy)methyl]guanine ([ 18 F]FHPG)} have been investigated as reporter probes. For the D2R reporter gene, a derivative of spiperone {3-(2'-[ 18 F]-Fluoroethyl)spiperone ([ 18 F]FESP)} has been used with positron emission tomography (PET) imaging. In this review, the principles and specific assays for imaging reporter gene expression are presented and discussed. Specific examples utilizing adenoviral-mediated delivery of a reporter gene as well as tumors expressing reporter genes are discussed

  10. Gene Expression Profiling of Early Stage Non-Small Cell Lung Cancer

    NARCIS (Netherlands)

    J. Hou (Jun)

    2010-01-01

    textabstractNSCLC is a highly heterogeneous malignancy with a poor prognosis. Treatment for NSCLC is currently based on a combination of pathological staging and histological classification. Recently, gene expression-based NSCLC profiling is proven a superior approach to stratify cancer cases with

  11. PRAME gene expression profile in medulloblastoma

    Directory of Open Access Journals (Sweden)

    Tânia Maria Vulcani-Freitas

    2011-02-01

    Full Text Available Medulloblastoma is the most common malignant tumors of central nervous system in the childhood. The treatment is severe, harmful and, thus, has a dismal prognosis. As PRAME is present in various cancers, including meduloblastoma, and has limited expression in normal tissues, this antigen can be an ideal vaccine target for tumor immunotherapy. In order to find a potential molecular target, we investigated PRAME expression in medulloblastoma fragments and we compare the results with the clinical features of each patient. Analysis of gene expression was performed by real-time quantitative PCR from 37 tumor samples. The Mann-Whitney test was used to analysis the relationship between gene expression and clinical characteristics. Kaplan-Meier curves were used to evaluate survival. PRAME was overexpressed in 84% samples. But no statistical association was found between clinical features and PRAME overexpression. Despite that PRAME gene could be a strong candidate for immunotherapy since it is highly expressed in medulloblastomas.

  12. Efficient photoreceptor-targeted gene expression in vivo by recombinant adeno-associated virus.

    Science.gov (United States)

    Flannery, J G; Zolotukhin, S; Vaquero, M I; LaVail, M M; Muzyczka, N; Hauswirth, W W

    1997-06-24

    We describe a general approach for achieving efficient and cell type-specific expression of exogenous genes in photoreceptor cells of the mammalian retina. Recombinant adeno-associated virus (rAAV) vectors were used to transfer the bacterial lacZ gene or a synthetic green fluorescent protein gene (gfp) to mouse or rat retinas after injection into the subretinal space. Using a proximal murine rod opsin promoter (+86 to -385) to drive expression, reporter gene product was found exclusively in photoreceptors, not in any other retinal cell type or in the adjacent retinal pigment epithelium. GFP-expressing photoreceptors typically encompassed 10-20% of the total retinal area after a single 2-microl injection. Photoreceptors were transduced with nearly 100% efficiency in the region directly surrounding the injection site. We estimate approximately 2.5 million photoreceptors were transduced as a result of the single subretinal inoculation. This level of gene transfer and expression suggests the feasibility of genetic therapy for retinal disease. The gfp-containing rAAV stock was substantially free of both adenovirus and wild-type AAV, as judged by plaque assay and infectious center assay, respectively. Thus, highly purified, helper virus-free rAAV vectors can achieve high-frequency tissue-specific transduction of terminally differentiated, postmitotic photoreceptor cells.

  13. Mining gene expression data of multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Pi Guo

    Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.

  14. mRNA/microRNA gene expression profile in microsatellite unstable colorectal cancer

    Directory of Open Access Journals (Sweden)

    Calin George A

    2007-08-01

    Full Text Available Abstract Background Colorectal cancer develops through two main genetic instability pathways characterized by distinct pathologic features and clinical outcome. Results We investigated colon cancer samples (23 characterized by microsatellite stability, MSS, and 16 by high microsatellite instability, MSI-H for genome-wide expression of microRNA (miRNA and mRNA. Based on combined miRNA and mRNA gene expression, a molecular signature consisting of twenty seven differentially expressed genes, inclusive of 8 miRNAs, could correctly distinguish MSI-H versus MSS colon cancer samples. Among the differentially expressed miRNAs, various members of the oncogenic miR-17-92 family were significantly up-regulated in MSS cancers. The majority of protein coding genes were also up-regulated in MSS cancers. Their functional classification revealed that they were most frequently associated with cell cycle, DNA replication, recombination, repair, gastrointestinal disease and immune response. Conclusion This is the first report that indicates the existence of differences in miRNA expression between MSS versus MSI-H colorectal cancers. In addition, the work suggests that the combination of mRNA/miRNA expression signatures may represent a general approach for improving bio-molecular classification of human cancer.

  15. Modulation of gene expression in heart and liver of hibernating black bears (Ursus americanus

    Directory of Open Access Journals (Sweden)

    Yan Jun

    2011-03-01

    Full Text Available Abstract Background Hibernation is an adaptive strategy to survive in highly seasonal or unpredictable environments. The molecular and genetic basis of hibernation physiology in mammals has only recently been studied using large scale genomic approaches. We analyzed gene expression in the American black bear, Ursus americanus, using a custom 12,800 cDNA probe microarray to detect differences in expression that occur in heart and liver during winter hibernation in comparison to summer active animals. Results We identified 245 genes in heart and 319 genes in liver that were differentially expressed between winter and summer. The expression of 24 genes was significantly elevated during hibernation in both heart and liver. These genes are mostly involved in lipid catabolism and protein biosynthesis and include RNA binding protein motif 3 (Rbm3, which enhances protein synthesis at mildly hypothermic temperatures. Elevated expression of protein biosynthesis genes suggests induction of translation that may be related to adaptive mechanisms reducing cardiac and muscle atrophies over extended periods of low metabolism and immobility during hibernation in bears. Coordinated reduction of transcription of genes involved in amino acid catabolism suggests redirection of amino acids from catabolic pathways to protein biosynthesis. We identify common for black bears and small mammalian hibernators transcriptional changes in the liver that include induction of genes responsible for fatty acid β oxidation and carbohydrate synthesis and depression of genes involved in lipid biosynthesis, carbohydrate catabolism, cellular respiration and detoxification pathways. Conclusions Our findings show that modulation of gene expression during winter hibernation represents molecular mechanism of adaptation to extreme environments.

  16. Modulation of gene expression in heart and liver of hibernating black bears (Ursus americanus).

    Science.gov (United States)

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Øivind; Stewart, Nathan C; Chang, Celia; Wang, Haifang; Yan, Jun; Showe, Louise C; Showe, Michael K; Barnes, Brian M

    2011-03-31

    Hibernation is an adaptive strategy to survive in highly seasonal or unpredictable environments. The molecular and genetic basis of hibernation physiology in mammals has only recently been studied using large scale genomic approaches. We analyzed gene expression in the American black bear, Ursus americanus, using a custom 12,800 cDNA probe microarray to detect differences in expression that occur in heart and liver during winter hibernation in comparison to summer active animals. We identified 245 genes in heart and 319 genes in liver that were differentially expressed between winter and summer. The expression of 24 genes was significantly elevated during hibernation in both heart and liver. These genes are mostly involved in lipid catabolism and protein biosynthesis and include RNA binding protein motif 3 (Rbm3), which enhances protein synthesis at mildly hypothermic temperatures. Elevated expression of protein biosynthesis genes suggests induction of translation that may be related to adaptive mechanisms reducing cardiac and muscle atrophies over extended periods of low metabolism and immobility during hibernation in bears. Coordinated reduction of transcription of genes involved in amino acid catabolism suggests redirection of amino acids from catabolic pathways to protein biosynthesis. We identify common for black bears and small mammalian hibernators transcriptional changes in the liver that include induction of genes responsible for fatty acid β oxidation and carbohydrate synthesis and depression of genes involved in lipid biosynthesis, carbohydrate catabolism, cellular respiration and detoxification pathways. Our findings show that modulation of gene expression during winter hibernation represents molecular mechanism of adaptation to extreme environments.

  17. Cardiac-Specific Gene Expression Facilitated by an Enhanced Myosin Light Chain Promoter

    Directory of Open Access Journals (Sweden)

    Wolfgang Boecker

    2004-04-01

    Full Text Available Background: Adenoviral gene transfer has been shown to be effective in cardiac myocytes in vitro and in vivo. A major limitation of myocardial gene therapy is the extracardiac transgene expression. Methods: To minimize extracardiac gene expression, we have constructed a tissue-specific promoter for cardiac gene transfer, namely, the 250-bp fragment of the myosin light chain-2v (MLC-2v gene, which is known to be expressed in a tissue-specific manner in ventricular myocardium followed by a luciferase (luc reporter gene (Ad.4 × MLC250.Luc. Rat cardiomyocytes, liver and kidney cells were infected with Ad.4 × MLC.Luc or control vectors. For in vivo testing, Ad.4 × MLC250.Luc was injected into the myocardium or in the liver of rats. Kinetics of promoter activity were monitored over 8 days using a cooled CCD camera. Results: In vitro: By infecting hepatic versus cardiomyocyte cells, we found that the promoter specificity ratio (luc activity in cardiomyocytes per liver cells was 20.4 versus 0.9 (Ad.4 × MLC250.Luc vs. Ad.CMV. In vivo: Ad.4 × MLC250.Luc significantly reduced luc activity in liver (38.4-fold, lung (16.1-fold, and kidney (21.8-fold versus Ad.CMV (p = .01; whereas activity in the heart was only 3.8-fold decreased. The gene expression rate of cardiomyocytes versus hepatocytes was 7:1 (Ad.4 × MLC.Luc versus 1:1.4 (Ad.CMV.Luc. Discussion: This new vector may be useful to validate therapeutic approaches in animal disease models and offers the perspective for selective expression of therapeutic genes in the diseased heart.

  18. Plasticity-Related Gene Expression During Eszopiclone-Induced Sleep.

    Science.gov (United States)

    Gerashchenko, Dmitry; Pasumarthi, Ravi K; Kilduff, Thomas S

    2017-07-01

    Experimental evidence suggests that restorative processes depend on synaptic plasticity changes in the brain during sleep. We used the expression of plasticity-related genes to assess synaptic plasticity changes during drug-induced sleep. We first characterized sleep induced by eszopiclone in mice during baseline conditions and during the recovery from sleep deprivation. We then compared the expression of 18 genes and two miRNAs critically involved in synaptic plasticity in these mice. Gene expression was assessed in the cerebral cortex and hippocampus by the TaqMan reverse transcription polymerase chain reaction and correlated with sleep parameters. Eszopiclone reduced the latency to nonrapid eye movement (NREM) sleep and increased NREM sleep amounts. Eszopiclone had no effect on slow wave activity (SWA) during baseline conditions but reduced the SWA increase during recovery sleep (RS) after sleep deprivation. Gene expression analyses revealed three distinct patterns: (1) four genes had higher expression either in the cortex or hippocampus in the group of mice with increased amounts of wakefulness; (2) a large proportion of plasticity-related genes (7 out of 18 genes) had higher expression during RS in the cortex but not in the hippocampus; and (3) six genes and the two miRNAs showed no significant changes across conditions. Even at a relatively high dose (20 mg/kg), eszopiclone did not reduce the expression of plasticity-related genes during RS period in the cortex. These results indicate that gene expression associated with synaptic plasticity occurs in the cortex in the presence of a hypnotic medication. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.

  19. Evaluation of suitable reference genes for gene expression studies in bovine muscular tissue

    Directory of Open Access Journals (Sweden)

    Dunner Susana

    2008-09-01

    Full Text Available Abstract Background Real-time reverse transcriptase quantitative polymerase chain reaction (real-time RTqPCR is a technique used to measure mRNA species copy number as a way to determine key genes involved in different biological processes. However, the expression level of these key genes may vary among tissues or cells not only as a consequence of differential expression but also due to different factors, including choice of reference genes to normalize the expression levels of the target genes; thus the selection of reference genes is critical for expression studies. For this purpose, ten candidate reference genes were investigated in bovine muscular tissue. Results The value of stability of ten candidate reference genes included in three groups was estimated: the so called 'classical housekeeping' genes (18S, GAPDH and ACTB, a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS and a third set of novel genes (SF3A1, EEF1A2 and CASC3. Three different statistical algorithms were used to rank the genes by their stability measures as produced by geNorm, NormFinder and Bestkeeper. The three methods tend to agree on the most stably expressed genes and the least in muscular tissue. EEF1A2 and HMBS followed by SF3A1, ACTB, and CASC3 can be considered as stable reference genes, and B2M, RPII, UBC and GAPDH would not be appropriate. Although the rRNA-18S stability measure seems to be within the range of acceptance, its use is not recommended because its synthesis regulation is not representative of mRNA levels. Conclusion Based on geNorm algorithm, we propose the use of three genes SF3A1, EEF1A2 and HMBS as references for normalization of real-time RTqPCR in muscle expression studies.

  20. Effects of threshold on the topology of gene co-expression networks.

    Science.gov (United States)

    Couto, Cynthia Martins Villar; Comin, César Henrique; Costa, Luciano da Fontoura

    2017-09-26

    Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

  1. Expression profiling identifies genes involved in emphysema severity

    Directory of Open Access Journals (Sweden)

    Bowman Rayleen V

    2009-09-01

    Full Text Available Abstract Chronic obstructive pulmonary disease (COPD is a major public health problem. The aim of this study was to identify genes involved in emphysema severity in COPD patients. Gene expression profiling was performed on total RNA extracted from non-tumor lung tissue from 30 smokers with emphysema. Class comparison analysis based on gas transfer measurement was performed to identify differentially expressed genes. Genes were then selected for technical validation by quantitative reverse transcriptase-PCR (qRT-PCR if also represented on microarray platforms used in previously published emphysema studies. Genes technically validated advanced to tests of biological replication by qRT-PCR using an independent test set of 62 lung samples. Class comparison identified 98 differentially expressed genes (p p Gene expression profiling of lung from emphysema patients identified seven candidate genes associated with emphysema severity including COL6A3, SERPINF1, ZNHIT6, NEDD4, CDKN2A, NRN1 and GSTM3.

  2. Predicting spatial and temporal gene expression using an integrative model of transcription factor occupancy and chromatin state.

    Directory of Open Access Journals (Sweden)

    Bartek Wilczynski

    Full Text Available Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhancer elements converge to regulate a gene's expression remains elusive. This is in large part due to the number of different biological processes involved in mediating regulation as well as limited availability of experimental measurements for many of them. Here, we used a Bayesian approach to model diverse experimental regulatory data, leading to accurate predictions of both spatial and temporal aspects of gene expression. We integrated whole-embryo information on transcription factor recruitment to multiple cis-regulatory modules, insulator binding and histone modification status in the vicinity of individual gene loci, at a genome-wide scale during Drosophila development. The model uses Bayesian networks to represent the relation between transcription factor occupancy and enhancer activity in specific tissues and stages. All parameters are optimized in an Expectation Maximization procedure providing a model capable of predicting tissue- and stage-specific activity of new, previously unassayed genes. Performing the optimization with subsets of input data demonstrated that neither enhancer occupancy nor chromatin state alone can explain all gene expression patterns, but taken together allow for accurate predictions of spatio-temporal activity. Model predictions were validated using the expression patterns of more than 600 genes recently made available by the BDGP consortium, demonstrating an average 15-fold enrichment of genes expressed in the predicted tissue over a naïve model. We further validated the model by experimentally testing the expression of 20 predicted target genes of unknown expression, resulting in an accuracy of 95% for temporal

  3. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-05-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  4. Decoupling Linear and Nonlinear Associations of Gene Expression

    KAUST Repository

    Itakura, Alan

    2013-01-01

    The FANTOM consortium has generated a large gene expression dataset of different cell lines and tissue cultures using the single-molecule sequencing technology of HeliscopeCAGE. This provides a unique opportunity to investigate novel associations between gene expression over time and different cell types. Here, we create a MatLab wrapper for a powerful and computationally intensive set of statistics known as Maximal Information Coefficient, and then calculate this statistic for a large, comprehensive dataset containing gene expression of a variety of differentiating tissues. We then distinguish between linear and nonlinear associations, and then create gene association networks. Following this analysis, we are then able to identify clusters of linear gene associations that then associate nonlinearly with other clusters of linearity, providing insight to much more complex connections between gene expression patterns than previously anticipated.

  5. MINER: exploratory analysis of gene interaction networks by machine learning from expression data

    Directory of Open Access Journals (Sweden)

    Sivieng Jane

    2009-12-01

    Full Text Available Abstract Background The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. Results We have developed MINER (Microarray Interactive Network Exploration and Representation, an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Conclusion Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.

  6. Dynamic gene expression for metabolic engineering of mammalian cells in culture.

    Science.gov (United States)

    Le, Huong; Vishwanathan, Nandita; Kantardjieff, Anne; Doo, Inseok; Srienc, Michael; Zheng, Xiaolu; Somia, Nikunj; Hu, Wei-Shou

    2013-11-01

    Recombinant mammalian cells are the major hosts for the production of protein therapeutics. In addition to high expression of the product gene, a hyper-producer must also harbor superior phenotypic traits related to metabolism, protein secretion, and growth control. Introduction of genes endowing the relevant hyper-productivity traits is a strategy frequently used to enhance the productivity. Most of such cell engineering efforts have been performed using constitutive expression systems. However, cells respond to various environmental cues and cellular events dynamically according to cellular needs. The use of inducible systems allows for time dependent expression, but requires external manipulation. Ideally, a transgene's expression should be synchronous to the host cell's own rhythm, and at levels appropriate for the objective. To that end, we identified genes with different expression dynamics and intensity ranges using pooled transcriptome data. Their promoters may be used to drive the expression of the transgenes following the desired dynamics. We isolated the promoter of the Thioredoxin-interacting protein (Txnip) gene and demonstrated its capability to drive transgene expression in concert with cell growth. We further employed this Chinese hamster promoter to engineer dynamic expression of the mouse GLUT5 fructose transporter in Chinese hamster ovary (CHO) cells, enabling them to utilize sugar according to cellular needs rather than in excess as typically seen in culture. Thus, less lactate was produced, resulting in a better growth rate, prolonged culture duration, and higher product titer. This approach illustrates a novel concept in metabolic engineering which can potentially be used to achieve dynamic control of cellular behaviors for enhanced process characteristics. © 2013 Published by Elsevier Inc.

  7. Empirical validation of the S-Score algorithm in the analysis of gene expression data

    Directory of Open Access Journals (Sweden)

    Archer Kellie J

    2006-03-01

    Full Text Available Abstract Background Current methods of analyzing Affymetrix GeneChip® microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0 for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5. Results The S-score showed excellent sensitivity and specificity in detecting low-level gene expression changes. Rank ordering of S-Score values more accurately reflected known fold-change values compared to other algorithms. Conclusion The S-score method, utilizing probe level data directly, offers significant advantages over comparisons using only probe set expression summaries.

  8. Gene expression profile data for mouse facial development

    Directory of Open Access Journals (Sweden)

    Sonia M. Leach

    2017-08-01

    Full Text Available This article contains data related to the research articles "Spatial and Temporal Analysis of Gene Expression during Growth and Fusion of the Mouse Facial Prominences" (Feng et al., 2009 [1] and “Systems Biology of facial development: contributions of ectoderm and mesenchyme” (Hooper et al., 2017 In press [2]. Embryonic mammalian craniofacial development is a complex process involving the growth, morphogenesis, and fusion of distinct facial prominences into a functional whole. Aberrant gene regulation during this process can lead to severe craniofacial birth defects, including orofacial clefting. As a means to understand the genes involved in facial development, we had previously dissected the embryonic mouse face into distinct prominences: the mandibular, maxillary or nasal between E10.5 and E12.5. The prominences were then processed intact, or separated into ectoderm and mesenchyme layers, prior analysis of RNA expression using microarrays (Feng et al., 2009, Hooper et al., 2017 in press [1,2]. Here, individual gene expression profiles have been built from these datasets that illustrate the timing of gene expression in whole prominences or in the separated tissue layers. The data profiles are presented as an indexed and clickable list of the genes each linked to a graphical image of that gene׳s expression profile in the ectoderm, mesenchyme, or intact prominence. These data files will enable investigators to obtain a rapid assessment of the relative expression level of any gene on the array with respect to time, tissue, prominence, and expression trajectory.

  9. Integrated olfactory receptor and microarray gene expression databases

    Directory of Open Access Journals (Sweden)

    Crasto Chiquito J

    2007-06-01

    Full Text Available Abstract Background Gene expression patterns of olfactory receptors (ORs are an important component of the signal encoding mechanism in the olfactory system since they determine the interactions between odorant ligands and sensory neurons. We have developed the Olfactory Receptor Microarray Database (ORMD to house OR gene expression data. ORMD is integrated with the Olfactory Receptor Database (ORDB, which is a key repository of OR gene information. Both databases aim to aid experimental research related to olfaction. Description ORMD is a Web-accessible database that provides a secure data repository for OR microarray experiments. It contains both publicly available and private data; accessing the latter requires authenticated login. The ORMD is designed to allow users to not only deposit gene expression data but also manage their projects/experiments. For example, contributors can choose whether to make their datasets public. For each experiment, users can download the raw data files and view and export the gene expression data. For each OR gene being probed in a microarray experiment, a hyperlink to that gene in ORDB provides access to genomic and proteomic information related to the corresponding olfactory receptor. Individual ORs archived in ORDB are also linked to ORMD, allowing users access to the related microarray gene expression data. Conclusion ORMD serves as a data repository and project management system. It facilitates the study of microarray experiments of gene expression in the olfactory system. In conjunction with ORDB, ORMD integrates gene expression data with the genomic and functional data of ORs, and is thus a useful resource for both olfactory researchers and the public.

  10. Statistical Considerations for Immunohistochemistry Panel Development after Gene Expression Profiling of Human Cancers

    Science.gov (United States)

    Betensky, Rebecca A.; Nutt, Catherine L.; Batchelor, Tracy T.; Louis, David N.

    2005-01-01

    In recent years there have been a number of microarray expression studies in which different types of tumors were classified by identifying a panel of differentially expressed genes. Immunohistochemistry is a practical and robust method for extending gene expression data to common pathological specimens with the advantage of being applicable to paraffin-embedded tissues. However, the number of assays required for successful immunohistochemical classification remains unclear. We propose a simulation-based method for assessing sample size for an immunohistochemistry investigation after a promising gene expression study of human tumors. The goals of such an immunohistochemistry study would be to develop and validate a marker panel that yields improved prognostic classification of cancer patients. We demonstrate how the preliminary gene expression data, coupled with certain realistic assumptions, can be used to estimate the number of immunohistochemical assays required for development. These assumptions are more tenable than alternative assumptions that would be required for crude analytic sample size calculations and that may yield underpowered and inefficient studies. We applied our methods to the design of an immunohistochemistry study for glioma classification and estimated the number of assays required to ensure satisfactory technical and prognostic validation. Simulation approaches for computing power and sample size that are based on existing gene expression data provide a powerful tool for efficient design of follow-up genomic studies. PMID:15858152

  11. Gene expression analysis of flax seed development

    Science.gov (United States)

    2011-01-01

    Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise

  12. Gene expression analysis of flax seed development

    Directory of Open Access Journals (Sweden)

    Sharpe Andrew

    2011-04-01

    Full Text Available Abstract Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages seed coats (globular and torpedo stages and endosperm (pooled globular to torpedo stages and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST (GenBank accessions LIBEST_026995 to LIBEST_027011 were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152 had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid

  13. Validation of differential gene expression algorithms: Application comparing fold-change estimation to hypothesis testing

    Directory of Open Access Journals (Sweden)

    Bickel David R

    2010-01-01

    Full Text Available Abstract Background Sustained research on the problem of determining which genes are differentially expressed on the basis of microarray data has yielded a plethora of statistical algorithms, each justified by theory, simulation, or ad hoc validation and yet differing in practical results from equally justified algorithms. Recently, a concordance method that measures agreement among gene lists have been introduced to assess various aspects of differential gene expression detection. This method has the advantage of basing its assessment solely on the results of real data analyses, but as it requires examining gene lists of given sizes, it may be unstable. Results Two methodologies for assessing predictive error are described: a cross-validation method and a posterior predictive method. As a nonparametric method of estimating prediction error from observed expression levels, cross validation provides an empirical approach to assessing algorithms for detecting differential gene expression that is fully justified for large numbers of biological replicates. Because it leverages the knowledge that only a small portion of genes are differentially expressed, the posterior predictive method is expected to provide more reliable estimates of algorithm performance, allaying concerns about limited biological replication. In practice, the posterior predictive method can assess when its approximations are valid and when they are inaccurate. Under conditions in which its approximations are valid, it corroborates the results of cross validation. Both comparison methodologies are applicable to both single-channel and dual-channel microarrays. For the data sets considered, estimating prediction error by cross validation demonstrates that empirical Bayes methods based on hierarchical models tend to outperform algorithms based on selecting genes by their fold changes or by non-hierarchical model-selection criteria. (The latter two approaches have comparable

  14. High-level transfer and long-term expression of the human beta-globin gene in a mouse transplant model.

    Science.gov (United States)

    Raftopoulos, H; Ward, M; Bank, A

    1998-06-30

    Insertion of a normally functioning human beta-globin gene into the hematopoietic stem cells (HSC) of patients with beta-thalassemia may be an effective approach to the therapy of this disorder. Safe, efficient gene transfer and long-term, high-level expression of the transferred human beta-globin gene in animal models are prerequisites for HSC somatic gene therapy. We have recently shown for the first time that, using a modified beta-globin retroviral vector in a mouse transplant model, long-term, high-level expression of a transferred human beta-globin gene is possible. The human beta-globin gene continues to be detected up to eight months post-transplantation of beta-globin-transduced hematopoietic cells into lethally irradiated mice. The transferred human beta-globin gene is detected in three of five mice surviving long-term (> 4 months) transplanted with bone marrow cells transduced with high-titer virus. The unrearranged 5.1 kb human beta-globin gene-containing provirus is seen by Southern blotting in two of these mice. More importantly, long-term expression of the transferred gene is seen in two mice at levels of 5% and 20% that of endogenous murine beta-globin. We document stem cell transduction by showing continued high-level expression of the human beta-globin gene in secondarily transplanted recipient mice. These results provide evidence of HSC transduction with a human beta-globin gene in animals and demonstrate that retroviral-mediated unrearranged human beta-globin gene transfer leads to a high level of human beta-globin gene expression in the long term for the first time. A gene therapy strategy may be a feasible therapeutic approach to the beta-thalassemias if consistent human beta-globin gene transfer and expression into HSC can be achieved.

  15. Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli.

    Science.gov (United States)

    Ståhlberg, Anders; Elbing, Karin; Andrade-Garda, José Manuel; Sjögreen, Björn; Forootan, Amin; Kubista, Mikael

    2008-04-16

    The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains.

  16. Multiway real-time PCR gene expression profiling in yeast Saccharomyces cerevisiae reveals altered transcriptional response of ADH-genes to glucose stimuli

    Directory of Open Access Journals (Sweden)

    Andrade-Garda José

    2008-04-01

    Full Text Available Abstract Background The large sensitivity, high reproducibility and essentially unlimited dynamic range of real-time PCR to measure gene expression in complex samples provides the opportunity for powerful multivariate and multiway studies of biological phenomena. In multiway studies samples are characterized by their expression profiles to monitor changes over time, effect of treatment, drug dosage etc. Here we perform a multiway study of the temporal response of four yeast Saccharomyces cerevisiae strains with different glucose uptake rates upon altered metabolic conditions. Results We measured the expression of 18 genes as function of time after addition of glucose to four strains of yeast grown in ethanol. The data are analyzed by matrix-augmented PCA, which is a generalization of PCA for 3-way data, and the results are confirmed by hierarchical clustering and clustering by Kohonen self-organizing map. Our approach identifies gene groups that respond similarly to the change of nutrient, and genes that behave differently in mutant strains. Of particular interest is our finding that ADH4 and ADH6 show a behavior typical of glucose-induced genes, while ADH3 and ADH5 are repressed after glucose addition. Conclusion Multiway real-time PCR gene expression profiling is a powerful technique which can be utilized to characterize functions of new genes by, for example, comparing their temporal response after perturbation in different genetic variants of the studied subject. The technique also identifies genes that show perturbed expression in specific strains.

  17. Supplementary Material for: Global expression differences and tissue specific expression differences in rice evolution result in two contrasting types of differentially expressed genes

    KAUST Repository

    Horiuchi, Youko; Harushima, Yoshiaki; Fujisawa, Hironori; Mochizuki, Takako; Fujita, Masahiro; Ohyanagi, Hajime; Kurata, Nori

    2015-01-01

    Abstract Background Since the development of transcriptome analysis systems, many expression evolution studies characterized evolutionary forces acting on gene expression, without explicit discrimination between global expression differences and tissue specific expression differences. However, different types of gene expression alteration should have different effects on an organism, the evolutionary forces that act on them might be different, and different types of genes might show different types of differential expression between species. To confirm this, we studied differentially expressed (DE) genes among closely related groups that have extensive gene expression atlases, and clarified characteristics of different types of DE genes including the identification of regulating loci for differential expression using expression quantitative loci (eQTL) analysis data. Results We detected differentially expressed (DE) genes between rice subspecies in five homologous tissues that were verified using japonica and indica transcriptome atlases in public databases. Using the transcriptome atlases, we classified DE genes into two types, global DE genes and changed-tissues DE genes. Global type DE genes were not expressed in any tissues in the atlas of one subspecies, however changed-tissues type DE genes were expressed in both subspecies with different tissue specificity. For the five tissues in the two japonica-indica combinations, 4.6 ± 0.8 and 5.9 ± 1.5 % of highly expressed genes were global and changed-tissues DE genes, respectively. Changed-tissues DE genes varied in number between tissues, increasing linearly with the abundance of tissue specifically expressed genes in the tissue. Molecular evolution of global DE genes was rapid, unlike that of changed-tissues DE genes. Based on gene ontology, global and changed-tissues DE genes were different, having no common GO terms. Expression differences of most global DE genes were regulated by cis

  18. Identification of suitable reference genes for gene expression studies of shoulder instability.

    Directory of Open Access Journals (Sweden)

    Mariana Ferreira Leal

    Full Text Available Shoulder instability is a common shoulder injury, and patients present with plastic deformation of the glenohumeral capsule. Gene expression analysis may be a useful tool for increasing the general understanding of capsule deformation, and reverse-transcription quantitative polymerase chain reaction (RT-qPCR has become an effective method for such studies. Although RT-qPCR is highly sensitive and specific, it requires the use of suitable reference genes for data normalization to guarantee meaningful and reproducible results. In the present study, we evaluated the suitability of a set of reference genes using samples from the glenohumeral capsules of individuals with and without shoulder instability. We analyzed the expression of six commonly used reference genes (ACTB, B2M, GAPDH, HPRT1, TBP and TFRC in the antero-inferior, antero-superior and posterior portions of the glenohumeral capsules of cases and controls. The stability of the candidate reference gene expression was determined using four software packages: NormFinder, geNorm, BestKeeper and DataAssist. Overall, HPRT1 was the best single reference gene, and HPRT1 and B2M composed the best pair of reference genes from different analysis groups, including simultaneous analysis of all tissue samples. GenEx software was used to identify the optimal number of reference genes to be used for normalization and demonstrated that the accumulated standard deviation resulting from the use of 2 reference genes was similar to that resulting from the use of 3 or more reference genes. To identify the optimal combination of reference genes, we evaluated the expression of COL1A1. Although the use of different reference gene combinations yielded variable normalized quantities, the relative quantities within sample groups were similar and confirmed that no obvious differences were observed when using 2, 3 or 4 reference genes. Consequently, the use of 2 stable reference genes for normalization, especially

  19. Detecting microRNA activity from gene expression data

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-05-18

    Abstract Background MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. Results Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. Conclusions We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  20. Detecting microRNA activity from gene expression data.

    LENUS (Irish Health Repository)

    Madden, Stephen F

    2010-01-01

    BACKGROUND: MicroRNAs (miRNAs) are non-coding RNAs that regulate gene expression by binding to the messenger RNA (mRNA) of protein coding genes. They control gene expression by either inhibiting translation or inducing mRNA degradation. A number of computational techniques have been developed to identify the targets of miRNAs. In this study we used predicted miRNA-gene interactions to analyse mRNA gene expression microarray data to predict miRNAs associated with particular diseases or conditions. RESULTS: Here we combine correspondence analysis, between group analysis and co-inertia analysis (CIA) to determine which miRNAs are associated with differences in gene expression levels in microarray data sets. Using a database of miRNA target predictions from TargetScan, TargetScanS, PicTar4way PicTar5way, and miRanda and combining these data with gene expression levels from sets of microarrays, this method produces a ranked list of miRNAs associated with a specified split in samples. We applied this to three different microarray datasets, a papillary thyroid carcinoma dataset, an in-house dataset of lipopolysaccharide treated mouse macrophages, and a multi-tissue dataset. In each case we were able to identified miRNAs of biological importance. CONCLUSIONS: We describe a technique to integrate gene expression data and miRNA target predictions from multiple sources.

  1. Gene expression results in lipopolysaccharide-stimulated monocytes depend significantly on the choice of reference genes

    Directory of Open Access Journals (Sweden)

    Øvstebø Reidun

    2010-05-01

    Full Text Available Abstract Background Gene expression in lipopolysaccharide (LPS-stimulated monocytes is mainly studied by quantitative real-time reverse transcription PCR (RT-qPCR using GAPDH (glyceraldehyde 3-phosphate dehydrogenase or ACTB (beta-actin as reference gene for normalization. Expression of traditional reference genes has been shown to vary substantially under certain conditions leading to invalid results. To investigate whether traditional reference genes are stably expressed in LPS-stimulated monocytes or if RT-qPCR results are dependent on the choice of reference genes, we have assessed and evaluated gene expression stability of twelve candidate reference genes in this model system. Results Twelve candidate reference genes were quantified by RT-qPCR in LPS-stimulated, human monocytes and evaluated using the programs geNorm, Normfinder and BestKeeper. geNorm ranked PPIB (cyclophilin B, B2M (beta-2-microglobulin and PPIA (cyclophilin A as the best combination for gene expression normalization in LPS-stimulated monocytes. Normfinder suggested TBP (TATA-box binding protein and B2M as the best combination. Compared to these combinations, normalization using GAPDH alone resulted in significantly higher changes of TNF-α (tumor necrosis factor-alpha and IL10 (interleukin 10 expression. Moreover, a significant difference in TNF-α expression between monocytes stimulated with equimolar concentrations of LPS from N. meningitides and E. coli, respectively, was identified when using the suggested combinations of reference genes for normalization, but stayed unrecognized when employing a single reference gene, ACTB or GAPDH. Conclusions Gene expression levels in LPS-stimulated monocytes based on RT-qPCR results differ significantly when normalized to a single gene or a combination of stably expressed reference genes. Proper evaluation of reference gene stabiliy is therefore mandatory before reporting RT-qPCR results in LPS-stimulated monocytes.

  2. Differentially expressed genes in iron-induced prion protein conversion

    International Nuclear Information System (INIS)

    Kim, Minsun; Kim, Eun-hee; Choi, Bo-Ran; Woo, Hee-Jong

    2016-01-01

    The conversion of the cellular prion protein (PrP C ) to the protease-resistant isoform is the key event in chronic neurodegenerative diseases, including transmissible spongiform encephalopathies (TSEs). Increased iron in prion-related disease has been observed due to the prion protein-ferritin complex. Additionally, the accumulation and conversion of recombinant PrP (rPrP) is specifically derived from Fe(III) but not Fe(II). Fe(III)-mediated PK-resistant PrP (PrP res ) conversion occurs within a complex cellular environment rather than via direct contact between rPrP and Fe(III). In this study, differentially expressed genes correlated with prion degeneration by Fe(III) were identified using Affymetrix microarrays. Following Fe(III) treatment, 97 genes were differentially expressed, including 85 upregulated genes and 12 downregulated genes (≥1.5-fold change in expression). However, Fe(II) treatment produced moderate alterations in gene expression without inducing dramatic alterations in gene expression profiles. Moreover, functional grouping of identified genes indicated that the differentially regulated genes were highly associated with cell growth, cell maintenance, and intra- and extracellular transport. These findings showed that Fe(III) may influence the expression of genes involved in PrP folding by redox mechanisms. The identification of genes with altered expression patterns in neural cells may provide insights into PrP conversion mechanisms during the development and progression of prion-related diseases. - Highlights: • Differential genes correlated with prion degeneration by Fe(III) were identified. • Genes were identified in cell proliferation and intra- and extracellular transport. • In PrP degeneration, redox related genes were suggested. • Cbr2, Rsad2, Slc40a1, Amph and Mvd were expressed significantly.

  3. Gene expression patterns during the larval development of European sea bass (dicentrarchus labrax) by microarray analysis.

    Science.gov (United States)

    Darias, M J; Zambonino-Infante, J L; Hugot, K; Cahu, C L; Mazurais, D

    2008-01-01

    During the larval period, marine teleosts undergo very fast growth and dramatic changes in morphology, metabolism, and behavior to accomplish their metamorphosis into juvenile fish. Regulation of gene expression is widely thought to be a key mechanism underlying the management of the biological processes required for harmonious development over this phase of life. To provide an overall analysis of gene expression in the whole body during sea bass larval development, we monitored the expression of 6,626 distinct genes at 10 different points in time between 7 and 43 days post-hatching (dph) by using heterologous hybridization of a rainbow trout cDNA microarray. The differentially expressed genes (n = 485) could be grouped into two categories: genes that were generally up-expressed early, between 7 and 23 dph, and genes up-expressed between 25 and 43 dph. Interestingly, among the genes regulated during the larval period, those related to organogenesis, energy pathways, biosynthesis, and digestion were over-represented compared with total set of analyzed genes. We discuss the quantitative regulation of whole-body contents of these specific transcripts with regard to the ontogenesis and maturation of essential functions that take place over larval development. Our study is the first utilization of a transcriptomic approach in sea bass and reveals dynamic changes in gene expression patterns in relation to marine finfish larval development.

  4. Regulation of meiotic gene expression in plants

    Directory of Open Access Journals (Sweden)

    Adele eZhou

    2014-08-01

    Full Text Available With the recent advances in genomics and sequencing technologies, databases of transcriptomes representing many cellular processes have been built. Meiotic transcriptomes in plants have been studied in Arabidopsis thaliana, rice (Oryza sativa, wheat (Triticum aestivum, petunia (Petunia hybrida, sunflower (Helianthus annuus, and maize (Zea mays. Studies in all organisms, but particularly in plants, indicate that a very large number of genes are expressed during meiosis, though relatively few of them seem to be required for the completion of meiosis. In this review, we focus on gene expression at the RNA level and analyze the meiotic transcriptome datasets and explore expression patterns of known meiotic genes to elucidate how gene expression could be regulated during meiosis. We also discuss mechanisms, such as chromatin organization and non-coding RNAs, that might be involved in the regulation of meiotic transcription patterns.

  5. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  6. Transcriptional and epigenetic regulation of KIAA1199 gene expression in human breast cancer.

    Directory of Open Access Journals (Sweden)

    Cem Kuscu

    Full Text Available Emerging evidence has demonstrated that upregulated expression of KIAA1199 in human cancer bodes for poor survival. The regulatory mechanism controlling KIAA1199 expression in cancer remains to be characterized. In the present study, we have isolated and characterized the human KIAA1199 promoter in terms of regulation of KIAA1199 gene expression. A 3.3 kb fragment of human genomic DNA containing the 5'-flanking sequence of the KIAA1199 gene possesses both suppressive and activating elements. Employing a deletion mutagenesis approach, a 1.4 kb proximal region was defined as the basic KIAA1199 promoter containing a TATA-box close to the transcription start site. A combination of 5'-primer extension study with 5'RACE DNA sequencing analysis revealed one major transcription start site that is utilized in the human KIAA1199 gene. Bioinformatics analysis suggested that the 1.4 kb KIAA1199 promoter contains putative activating regulatory elements, including activator protein-1(AP-1, Twist-1, and NF-κB sites. Sequential deletion and site-direct mutagenesis analysis demonstrated that the AP-1 and distal NF-κB sites are required for KIAA1199 gene expression. Further analyses using an electrophoretic mobility-shift assay and chromatin immunoprecipitation confirmed the requirement of these cis- and trans-acting elements in controlling KIAA1199 gene expression. Finally, we found that upregulated KIAA1199 expression in human breast cancer specimens correlated with hypomethylation of the regulatory region. Involvement of DNA methylation in regulation of KIAA1199 expression was recapitulated in human breast cancer cell lines. Taken together, our study unraveled the regulatory mechanisms controlling KIAA1199 gene expression in human cancer.

  7. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  8. Relative gene expression of fatty acid synthesis genes at 60 days postpartum in bovine mammary epithelial cells of Surti and Jafarabadi buffaloes

    Directory of Open Access Journals (Sweden)

    Mamta Janmeda

    2017-05-01

    Full Text Available Aim: Aim of the study was to study the relative gene expression of genes associated with fatty acid synthesis at 60 days postpartum (pp in bovine mammary epithelial cells (MECs of Surti and Jafarabadi buffaloes. Materials and Methods: A total of 10 healthy Surti and Jafarabadi buffaloes of each breed were selected at random from Livestock Research Station, Navsari and Cattle Breeding Farm, Junagadh, Gujarat, respectively, for this study. Milk sample was collected from each selected buffalo at day 60 pp from these two breeds to study relative gene expression of major milk fat genes using non-invasive approach of obtaining primary bovine MECs (pBMEC from milk samples. Results: In this study overall, the relative expression of the six major milk lipogenic genes butyrophilin subfamily 1 member A1 (BTN1A1, stearoyl-CoA desaturase (SCD, lipoprotein lipase (LPL, glycerol-3-phosphate acyltransferase mitochondrial (GPAM, acetyl-coenzyme A carboxylase alpha (ACACA, and lipin (LPIN did not show changes in expression patterns at 60th day of lactation in both Surti and Jafarabadi buffaloes. Conclusion: The pBMEC can be successfully recovered from 1500 ml of milk of Surti and Jafarabadi buffaloes using antibody-mediated magnetic bead separation and can be further used for recovering RNA for down step quantification of major milk lipogenic gene expression. The relative expression of the six major milk lipogenic genes BTN1A1, SCD, LPL, GPAM, ACACA, and LPIN did not show changes in expression patterns in both Surti and Jafarabadi buffaloes, suggesting expression levels of lipogenic genes are maintained almost uniform till peak lactation without any significant difference.

  9. Assembly and multiple gene expression of thermophilic enzymes in Escherichia coli for in vitro metabolic engineering.

    Science.gov (United States)

    Ninh, Pham Huynh; Honda, Kohsuke; Sakai, Takaaki; Okano, Kenji; Ohtake, Hisao

    2015-01-01

    In vitro reconstitution of an artificial metabolic pathway is an emerging approach for the biocatalytic production of industrial chemicals. However, several enzymes have to be separately prepared (and purified) for the construction of an in vitro metabolic pathway, thereby limiting the practical applicability of this approach. In this study, genes encoding the nine thermophilic enzymes involved in a non-ATP-forming chimeric glycolytic pathway were assembled in an artificial operon and co-expressed in a single recombinant Escherichia coli strain. Gene expression levels of the thermophilic enzymes were controlled by their sequential order in the artificial operon. The specific activities of the recombinant enzymes in the cell-free extract of the multiple-gene-expression E. coli were 5.0-1,370 times higher than those in an enzyme cocktail prepared from a mixture of single-gene-expression strains, in each of which a single one of the nine thermophilic enzymes was overproduced. Heat treatment of a crude extract of the multiple-gene-expression cells led to the denaturation of indigenous proteins and one-step preparation of an in vitro synthetic pathway comprising only a limited number of thermotolerant enzymes. Coupling this in vitro pathway with other thermophilic enzymes including the H2 O-forming NADH oxidase or the malate/lactate dehydrogenase facilitated one-pot conversion of glucose to pyruvate or lactate, respectively. © 2014 Wiley Periodicals, Inc.

  10. Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

    Directory of Open Access Journals (Sweden)

    Qiusheng Kong

    Full Text Available Gene expression analysis in watermelon (Citrullus lanatus fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC, β-actin (ClACT, and alpha tubulin 5 (ClTUA5 as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1, a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.

  11. Evaluation of Appropriate Reference Genes for Gene Expression Normalization during Watermelon Fruit Development.

    Science.gov (United States)

    Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Liqiang; Cheng, Fei; Huang, Yuan; Bie, Zhilong

    2015-01-01

    Gene expression analysis in watermelon (Citrullus lanatus) fruit has drawn considerable attention with the availability of genome sequences to understand the regulatory mechanism of fruit development and to improve its quality. Real-time quantitative reverse-transcription PCR (qRT-PCR) is a routine technique for gene expression analysis. However, appropriate reference genes for transcript normalization in watermelon fruits have not been well characterized. The aim of this study was to evaluate the appropriateness of 12 genes for their potential use as reference genes in watermelon fruits. Expression variations of these genes were measured in 48 samples obtained from 12 successive developmental stages of parthenocarpic and fertilized fruits of two watermelon genotypes by using qRT-PCR analysis. Considering the effects of genotype, fruit setting method, and developmental stage, geNorm determined clathrin adaptor complex subunit (ClCAC), β-actin (ClACT), and alpha tubulin 5 (ClTUA5) as the multiple reference genes in watermelon fruit. Furthermore, ClCAC alone or together with SAND family protein (ClSAND) was ranked as the single or two best reference genes by NormFinder. By using the top-ranked reference genes to normalize the transcript abundance of phytoene synthase (ClPSY1), a good correlation between lycopene accumulation and ClPSY1 expression pattern was observed in ripening watermelon fruit. These validated reference genes will facilitate the accurate measurement of gene expression in the studies on watermelon fruit biology.

  12. Differential neutrophil gene expression in early bovine pregnancy

    Directory of Open Access Journals (Sweden)

    Kizaki Keiichiro

    2013-02-01

    Full Text Available Abstract Background In food production animals, especially cattle, the diagnosis of gestation is important because the timing of gestation directly affects the running of farms. Various methods have been used to detect gestation, but none of them are ideal because of problems with the timing of detection or the accuracy, simplicity, or cost of the method. A new method for detecting gestation, which involves assessing interferon-tau (IFNT-stimulated gene expression in peripheral blood leukocytes (PBL, was recently proposed. PBL fractionation methods were used to examine whether the expression profiles of various PBL populations could be used as reliable diagnostic markers of bovine gestation. Methods PBL were collected on days 0 (just before artificial insemination, 7, 14, 17, 21, and 28 of gestation. The gene expression levels of the PBL were assessed with microarray analysis and/or quantitative real-time reverse transcription (q PCR. PBL fractions were collected by flow cytometry or density gradient cell separation using Histopaque 1083 or Ficoll-Conray solutions. The expression levels of four IFNT-stimulated genes, interferon-stimulated protein 15 kDa (ISG15, myxovirus-resistance (MX 1 and 2, and 2′-5′-oligoadenylate synthetase (OAS1, were then analyzed in each fraction through day 28 of gestation using qPCR. Results Microarray analysis detected 72 and 28 genes in whole PBL that were significantly higher on days 14 and 21 of gestation, respectively, than on day 0. The upregulated genes included IFNT-stimulated genes. The expression levels of these genes increased with the progression of gestation until day 21. In flow cytometry experiments, on day 14 the expression levels of all of the genes were significantly higher in the granulocyte fraction than in the other fractions. Their expression gradually decreased through day 28 of gestation. Strong correlations were observed between the expression levels of the four genes in the granulocyte

  13. Importance of correlation between gene expression levels: application to the type I interferon signature in rheumatoid arthritis.

    Science.gov (United States)

    Reynier, Frédéric; Petit, Fabien; Paye, Malick; Turrel-Davin, Fanny; Imbert, Pierre-Emmanuel; Hot, Arnaud; Mougin, Bruno; Miossec, Pierre

    2011-01-01

    The analysis of gene expression data shows that many genes display similarity in their expression profiles suggesting some co-regulation. Here, we investigated the co-expression patterns in gene expression data and proposed a correlation-based research method to stratify individuals. Using blood from rheumatoid arthritis (RA) patients, we investigated the gene expression profiles from whole blood using Affymetrix microarray technology. Co-expressed genes were analyzed by a biclustering method, followed by gene ontology analysis of the relevant biclusters. Taking the type I interferon (IFN) pathway as an example, a classification algorithm was developed from the 102 RA patients and extended to 10 systemic lupus erythematosus (SLE) patients and 100 healthy volunteers to further characterize individuals. We developed a correlation-based algorithm referred to as Classification Algorithm Based on a Biological Signature (CABS), an alternative to other approaches focused specifically on the expression levels. This algorithm applied to the expression of 35 IFN-related genes showed that the IFN signature presented a heterogeneous expression between RA, SLE and healthy controls which could reflect the level of global IFN signature activation. Moreover, the monitoring of the IFN-related genes during the anti-TNF treatment identified changes in type I IFN gene activity induced in RA patients. In conclusion, we have proposed an original method to analyze genes sharing an expression pattern and a biological function showing that the activation levels of a biological signature could be characterized by its overall state of correlation.

  14. Gene expression in a paleopolyploid: a transcriptome resource for the ciliate Paramecium tetraurelia

    Directory of Open Access Journals (Sweden)

    Kapusta Aurélie

    2010-10-01

    Full Text Available Abstract Background The genome of Paramecium tetraurelia, a unicellular model that belongs to the ciliate phylum, has been shaped by at least 3 successive whole genome duplications (WGD. These dramatic events, which have also been documented in plants, animals and fungi, are resolved over evolutionary time by the loss of one duplicate for the majority of genes. Thanks to a low rate of large scale genome rearrangement in Paramecium, an unprecedented large number of gene duplicates of different ages have been identified, making this organism an outstanding model to investigate the evolutionary consequences of polyploidization. The most recent WGD, with 51% of pre-duplication genes still in 2 copies, provides a snapshot of a phase of rapid gene loss that is not accessible in more ancient polyploids such as yeast. Results We designed a custom oligonucleotide microarray platform for P. tetraurelia genome-wide expression profiling and used the platform to measure gene expression during 1 the sexual cycle of autogamy, 2 growth of new cilia in response to deciliation and 3 biogenesis of secretory granules after massive exocytosis. Genes that are differentially expressed during these time course experiments have expression patterns consistent with a very low rate of subfunctionalization (partition of ancestral functions between duplicated genes in particular since the most recent polyploidization event. Conclusions A public transcriptome resource is now available for Paramecium tetraurelia. The resource has been integrated into the ParameciumDB model organism database, providing searchable access to the data. The microarray platform, freely available through NimbleGen Systems, provides a robust, cost-effective approach for genome-wide expression profiling in P. tetraurelia. The expression data support previous studies showing that at short evolutionary times after a whole genome duplication, gene dosage balance constraints and not functional change are

  15. Both noncoding and protein-coding RNAs contribute to gene expression evolution in the primate brain.

    Science.gov (United States)

    Babbitt, Courtney C; Fedrigo, Olivier; Pfefferle, Adam D; Boyle, Alan P; Horvath, Julie E; Furey, Terrence S; Wray, Gregory A

    2010-01-18

    Despite striking differences in cognition and behavior between humans and our closest primate relatives, several studies have found little evidence for adaptive change in protein-coding regions of genes expressed primarily in the brain. Instead, changes in gene expression may underlie many cognitive and behavioral differences. Here, we used digital gene expression: tag profiling (here called Tag-Seq, also called DGE:tag profiling) to assess changes in global transcript abundance in the frontal cortex of the brains of 3 humans, 3 chimpanzees, and 3 rhesus macaques. A substantial fraction of transcripts we identified as differentially transcribed among species were not assayed in previous studies based on microarrays. Differentially expressed tags within coding regions are enriched for gene functions involved in synaptic transmission, transport, oxidative phosphorylation, and lipid metabolism. Importantly, because Tag-Seq technology provides strand-specific information about all polyadenlyated transcripts, we were able to assay expression in noncoding intragenic regions, including both sense and antisense noncoding transcripts (relative to nearby genes). We find that many noncoding transcripts are conserved in both location and expression level between species, suggesting a possible functional role. Lastly, we examined the overlap between differential gene expression and signatures of positive selection within putative promoter regions, a sign that these differences represent adaptations during human evolution. Comparative approaches may provide important insights into genes responsible for differences in cognitive functions between humans and nonhuman primates, as well as highlighting new candidate genes for studies investigating neurological disorders.

  16. Functional Associations by Response Overlap (FARO), a functional genomics approach matching gene expression phenotypes

    DEFF Research Database (Denmark)

    Nielsen, Henrik Bjørn; Mundy, J.; Willenbrock, Hanni

    2007-01-01

    The systematic comparison of transcriptional responses of organisms is a powerful tool in functional genomics. For example, mutants may be characterized by comparing their transcript profiles to those obtained in other experiments querying the effects on gene expression of many experimental facto...

  17. Heritability of growth traits and correlation with hepatic gene expression among hybrid striped bass exhibiting extremes in performance

    Science.gov (United States)

    We set out to better understand the genetic basis behind growth variation in hybrid striped bass (HSB) by determining whether gene expression changes could be detected between the largest and smallest HSB in a population using a global gene expression approach by RNA sequencing of liver. Fingerling...

  18. Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

    Directory of Open Access Journals (Sweden)

    Meizhen eWang

    2016-01-01

    Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.

  19. Identification of differentially expressed genes in flax (Linum usitatissimum L.) under saline-alkaline stress by digital gene expression.

    Science.gov (United States)

    Yu, Ying; Huang, Wengong; Chen, Hongyu; Wu, Guangwen; Yuan, Hongmei; Song, Xixia; Kang, Qinghua; Zhao, Dongsheng; Jiang, Weidong; Liu, Yan; Wu, Jianzhong; Cheng, Lili; Yao, Yubo; Guan, Fengzhi

    2014-10-01

    The salinization and alkalization of soil are widespread environmental problems, and alkaline salt stress is more destructive than neutral salt stress. Therefore, understanding the mechanism of plant tolerance to saline-alkaline stress has become a major challenge. However, little attention has been paid to the mechanism of plant alkaline salt tolerance. In this study, gene expression profiling of flax was analyzed under alkaline-salt stress (AS2), neutral salt stress (NSS) and alkaline stress (AS) by digital gene expression. Three-week-old flax seedlings were placed in 25 mM Na2CO3 (pH11.6) (AS2), 50mM NaCl (NSS) and NaOH (pH11.6) (AS) for 18 h. There were 7736, 1566 and 454 differentially expressed genes in AS2, NSS and AS compared to CK, respectively. The GO category gene enrichment analysis revealed that photosynthesis was particularly affected in AS2, carbohydrate metabolism was particularly affected in NSS, and the response to biotic stimulus was particularly affected in AS. We also analyzed the expression pattern of five categories of genes including transcription factors, signaling transduction proteins, phytohormones, reactive oxygen species proteins and transporters under these three stresses. Some key regulatory gene families involved in abiotic stress, such as WRKY, MAPKKK, ABA, PrxR and ion channels, were differentially expressed. Compared with NSS and AS, AS2 triggered more differentially expressed genes and special pathways, indicating that the mechanism of AS2 was more complex than NSS and AS. To the best of our knowledge, this was the first transcriptome analysis of flax in response to saline-alkaline stress. These data indicate that common and diverse features of saline-alkaline stress provide novel insights into the molecular mechanisms of plant saline-alkaline tolerance and offer a number of candidate genes as potential markers of tolerance to saline-alkaline stress. Copyright © 2014 Elsevier B.V. All rights reserved.

  20. Systematic identification and integrative analysis of novel genes expressed specifically or predominantly in mouse epididymis

    Directory of Open Access Journals (Sweden)

    Lee Hoyong

    2006-12-01

    Full Text Available Abstract Background Maturation of spermatozoa, including development of motility and the ability to fertilize the oocyte, occurs during transit through the microenvironment of the epididymis. Comprehensive understanding of sperm maturation requires identification and characterization of unique genes expressed in the epididymis. Results We systematically identified 32 novel genes with epididymis-specific or -predominant expression in the mouse epididymis UniGene library, containing 1505 gene-oriented transcript clusters, by in silico and in vitro analyses. The Northern blot analysis revealed various characteristics of the genes at the transcript level, such as expression level, size and the presence of isoform. We found that expression of the half of the genes is regulated by androgens. Further expression analyses demonstrated that the novel genes are region-specific and developmentally regulated. Computational analysis showed that 15 of the genes lack human orthologues, suggesting their implication in male reproduction unique to the mouse. A number of the novel genes are putative epididymal protease inhibitors or β-defensins. We also found that six of the genes have secretory activity, indicating that they may interact with sperm and have functional roles in sperm maturation. Conclusion We identified and characterized 32 novel epididymis-specific or -predominant genes by an integrative approach. Our study is unique in the aspect of systematic identification of novel epididymal genes and should be a firm basis for future investigation into molecular mechanisms underlying sperm maturation in the epididymis.

  1. Improved gene expression signature of testicular carcinoma in situ

    DEFF Research Database (Denmark)

    Almstrup, Kristian; Leffers, Henrik; Lothe, Ragnhild A

    2007-01-01

    on global gene expression in testicular CIS have been previously published. We have merged the two data sets on CIS samples (n = 6) and identified the shared gene expression signature in relation to expression in normal testis. Among the top-20 highest expressed genes, one-third was transcription factors...... development' were significantly altered and could collectively affect cellular pathways like the WNT signalling cascade, which thus may be disrupted in testicular CIS. The merged CIS data from two different microarray platforms, to our knowledge, provide the most precise CIS gene expression signature to date....

  2. Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

    International Nuclear Information System (INIS)

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2008-01-01

    Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may

  3. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

    Science.gov (United States)

    Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

    2011-02-01

    The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionary conserved genes in vertebrates. These genes are preferentially expressed in previtelloogenic oocytes, and thus, they

  4. MALDI-TOF mass spectrometry for quantitative gene expression analysis of acid responses in Staphylococcus aureus.

    Science.gov (United States)

    Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild

    2009-07-01

    Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.

  5. Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

    KAUST Repository

    Fujii, Chisato

    2015-04-16

    Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.

  6. The Medicago truncatula gene expression atlas web server

    Directory of Open Access Journals (Sweden)

    Tang Yuhong

    2009-12-01

    Full Text Available Abstract Background Legumes (Leguminosae or Fabaceae play a major role in agriculture. Transcriptomics studies in the model legume species, Medicago truncatula, are instrumental in helping to formulate hypotheses about the role of legume genes. With the rapid growth of publically available Affymetrix GeneChip Medicago Genome Array GeneChip data from a great range of tissues, cell types, growth conditions, and stress treatments, the legume research community desires an effective bioinformatics system to aid efforts to interpret the Medicago genome through functional genomics. We developed the Medicago truncatula Gene Expression Atlas (MtGEA web server for this purpose. Description The Medicago truncatula Gene Expression Atlas (MtGEA web server is a centralized platform for analyzing the Medicago transcriptome. Currently, the web server hosts gene expression data from 156 Affymetrix GeneChip® Medicago genome arrays in 64 different experiments, covering a broad range of developmental and environmental conditions. The server enables flexible, multifaceted analyses of transcript data and provides a range of additional information about genes, including different types of annotation and links to the genome sequence, which help users formulate hypotheses about gene function. Transcript data can be accessed using Affymetrix probe identification number, DNA sequence, gene name, functional description in natural language, GO and KEGG annotation terms, and InterPro domain number. Transcripts can also be discovered through co-expression or differential expression analysis. Flexible tools to select a subset of experiments and to visualize and compare expression profiles of multiple genes have been implemented. Data can be downloaded, in part or full, in a tabular form compatible with common analytical and visualization software. The web server will be updated on a regular basis to incorporate new gene expression data and genome annotation, and is accessible

  7. Tamoxifen-elicited uterotrophy: cross-species and cross-ligand analysis of the gene expression program

    Directory of Open Access Journals (Sweden)

    Forgacs Agnes L

    2009-04-01

    Full Text Available Abstract Background Tamoxifen (TAM is a well characterized breast cancer drug and selective estrogen receptor modulator (SERM which also has been associated with a small increase in risk for uterine cancers. TAM's partial agonist activation of estrogen receptor has been characterized for specific gene promoters but not at the genomic level in vivo.Furthermore, reducing uncertainties associated with cross-species extrapolations of pharmaco- and toxicogenomic data remains a formidable challenge. Results A comparative ligand and species analysis approach was conducted to systematically assess the physiological, morphological and uterine gene expression alterations elicited across time by TAM and ethynylestradiol (EE in immature ovariectomized Sprague-Dawley rats and C57BL/6 mice. Differential gene expression was evaluated using custom cDNA microarrays, and the data was compared to identify conserved and divergent responses. 902 genes were differentially regulated in all four studies, 398 of which exhibit identical temporal expression patterns. Conclusion Comparative analysis of EE and TAM differentially expressed gene lists suggest TAM regulates no unique uterine genes that are conserved in the rat and mouse. This demonstrates that the partial agonist activities of TAM extend to molecular targets in regulating only a subset of EE-responsive genes. Ligand-conserved, species-divergent expression of carbonic anhydrase 2 was observed in the microarray data and confirmed by real time PCR. The identification of comparable temporal phenotypic responses linked to related gene expression profiles demonstrates that systematic comparative genomic assessments can elucidate important conserved and divergent mechanisms in rodent estrogen signalling during uterine proliferation.

  8. A deep auto-encoder model for gene expression prediction.

    Science.gov (United States)

    Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

    2017-11-17

    Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.

  9. Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues.

    Science.gov (United States)

    Wheeler, Heather E; Shah, Kaanan P; Brenner, Jonathon; Garcia, Tzintzuni; Aquino-Michaels, Keston; Cox, Nancy J; Nicolae, Dan L; Im, Hae Kyung

    2016-11-01

    Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).

  10. Understanding gene expression in coronary artery disease through ...

    Indian Academy of Sciences (India)

    Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. Prathima ... Table 2. Differentially expressed genes in CAD compared to age and gender matched controls. .... Regulation of nuclear pre-mRNA domain containing 1A.

  11. UVB-induced gene expression in the skin of Xiphophorus maculatus Jp 163 B☆

    Science.gov (United States)

    Yang, Kuan; Boswell, Mikki; Walter, Dylan J.; Downs, Kevin P.; Gaston-Pravia, Kimberly; Garcia, Tzintzuni; Shen, Yingjia; Mitchell, David L.; Walter, Ronald B.

    2014-01-01

    Xiphophorus fish and interspecies hybrids represent long-standing models to study the genetics underlying spontaneous and induced tumorigenesis. The recent release of the Xiphophorus maculatus genome sequence will allow global genetic regulation studies of genes involved in the inherited susceptibility to UVB-induced melanoma within select backcross hybrids. As a first step toward this goal, we report results of an RNA-Seq approach to identify genes and pathways showing modulated transcription within the skin of X. maculatus Jp 163 B upon UVB exposure. X. maculatus Jp 163 B were exposed to various doses of UVB followed by RNA-Seq analysis at each dose to investigate overall gene expression in each sample. A total of 357 genes with a minimum expression change of 4-fold (p-adj fish skin to UVB exposure. PMID:24556253

  12. Gene expression profile of pulpitis.

    Science.gov (United States)

    Galicia, J C; Henson, B R; Parker, J S; Khan, A A

    2016-06-01

    The cost, prevalence and pain associated with endodontic disease necessitate an understanding of the fundamental molecular aspects of its pathogenesis. This study was aimed to identify the genetic contributors to pulpal pain and inflammation. Inflamed pulps were collected from patients diagnosed with irreversible pulpitis (n=20). Normal pulps from teeth extracted for various reasons served as controls (n=20). Pain level was assessed using a visual analog scale (VAS). Genome-wide microarray analysis was performed using Affymetrix GeneTitan Multichannel Instrument. The difference in gene expression levels were determined by the significance analysis of microarray program using a false discovery rate (q-value) of 5%. Genes involved in immune response, cytokine-cytokine receptor interaction and signaling, integrin cell surface interactions, and others were expressed at relatively higher levels in the pulpitis group. Moreover, several genes known to modulate pain and inflammation showed differential expression in asymptomatic and mild pain patients (⩾30 mm on VAS) compared with those with moderate to severe pain. This exploratory study provides a molecular basis for the clinical diagnosis of pulpitis. With an enhanced understanding of pulpal inflammation, future studies on treatment and management of pulpitis and on pain associated with it can have a biological reference to bridge treatment strategies with pulpal biology.

  13. Mel-18, a mammalian Polycomb gene, regulates angiogenic gene expression of endothelial cells.

    Science.gov (United States)

    Jung, Ji-Hye; Choi, Hyun-Jung; Maeng, Yong-Sun; Choi, Jung-Yeon; Kim, Minhyung; Kwon, Ja-Young; Park, Yong-Won; Kim, Young-Myeong; Hwang, Daehee; Kwon, Young-Guen

    2010-10-01

    Mel-18 is a mammalian homolog of Polycomb group (PcG) genes. Microarray analysis revealed that Mel-18 expression was induced during endothelial progenitor cell (EPC) differentiation and correlates with the expression of EC-specific protein markers. Overexpression of Mel-18 promoted EPC differentiation and angiogenic activity of ECs. Accordingly, silencing Mel-18 inhibited EC migration and tube formation in vitro. Gene expression profiling showed that Mel-18 regulates angiogenic genes including kinase insert domain receptor (KDR), claudin 5, and angiopoietin-like 2. Our findings demonstrate, for the first time, that Mel-18 plays a significant role in the angiogenic function of ECs by regulating endothelial gene expression. Copyright © 2010 Elsevier Inc. All rights reserved.

  14. Balancing gene expression without library construction via a reusable sRNA pool.

    Science.gov (United States)

    Ghodasara, Amar; Voigt, Christopher A

    2017-07-27

    Balancing protein expression is critical when optimizing genetic systems. Typically, this requires library construction to vary the genetic parts controlling each gene, which can be expensive and time-consuming. Here, we develop sRNAs corresponding to 15nt 'target' sequences that can be inserted upstream of a gene. The targeted gene can be repressed from 1.6- to 87-fold by controlling sRNA expression using promoters of different strength. A pool is built where six sRNAs are placed under the control of 16 promoters that span a ∼103-fold range of strengths, yielding ∼107 combinations. This pool can simultaneously optimize up to six genes in a system. This requires building only a single system-specific construct by placing a target sequence upstream of each gene and transforming it with the pre-built sRNA pool. The resulting library is screened and the top clone is sequenced to determine the promoter controlling each sRNA, from which the fold-repression of the genes can be inferred. The system is then rebuilt by rationally selecting parts that implement the optimal expression of each gene. We demonstrate the versatility of this approach by using the same pool to optimize a metabolic pathway (β-carotene) and genetic circuit (XNOR logic gate). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. GFP expression by intracellular gene delivery of GFP-coding fragments using nanocrystal quantum dots

    International Nuclear Information System (INIS)

    Hoshino, Akiyoshi; Manabe, Noriyoshi; Fujioka, Kouki; Hanada, Sanshiro; Yamamoto, Kenji; Yasuhara, Masato; Kondo, Akihiko

    2008-01-01

    Gene therapy is an attractive approach to supplement a deficient gene function. Although there has been some success with specific gene delivery using various methods including viral vectors and liposomes, most of these methods have a limited efficiency or also carry a risk for oncogenesis. We herein report that quantum dots (QDs) conjugated with nuclear localizing signal peptides (NLSP) successfully introduced gene-fragments with promoter elements, which promoted the expression of the enhanced green fluorescent protein (eGFP) gene in mammalian cells. The expression of eGFP protein was observed when the QD/gene-construct was added to the culture media. The gene-expression efficiency varied depending on multiple factors around QDs, such as (1) the reading direction of the gene-fragments, (2) the quantity of gene-fragments attached on the surface of the QD-constructs, (3) the surface electronic charges varied according to the structure of the QD/gene-constructs, and (4) the particle size of QD/gene complex varied according to the structure and amounts of gene-fragments. Using this QD/gene-construct system, eGFP protein could be detected 28 days after the gene-introduction whereas the fluorescence of QDs had disappeared. This system therefore provides another method for the intracellular delivery of gene-fragments without using either viral vectors or specific liposomes.

  16. A method to identify differential expression profiles of time-course gene data with Fourier transformation.

    Science.gov (United States)

    Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

    2013-10-18

    Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization.The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be

  17. Gene Expression Omnibus (GEO)

    Data.gov (United States)

    U.S. Department of Health & Human Services — Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided...

  18. Identification and validation of suitable endogenous reference genes for gene expression studies in human peripheral blood

    Directory of Open Access Journals (Sweden)

    Turner Renee J

    2009-08-01

    Full Text Available Abstract Background Gene expression studies require appropriate normalization methods. One such method uses stably expressed reference genes. Since suitable reference genes appear to be unique for each tissue, we have identified an optimal set of the most stably expressed genes in human blood that can be used for normalization. Methods Whole-genome Affymetrix Human 2.0 Plus arrays were examined from 526 samples of males and females ages 2 to 78, including control subjects and patients with Tourette syndrome, stroke, migraine, muscular dystrophy, and autism. The top 100 most stably expressed genes with a broad range of expression levels were identified. To validate the best candidate genes, we performed quantitative RT-PCR on a subset of 10 genes (TRAP1, DECR1, FPGS, FARP1, MAPRE2, PEX16, GINS2, CRY2, CSNK1G2 and A4GALT, 4 commonly employed reference genes (GAPDH, ACTB, B2M and HMBS and PPIB, previously reported to be stably expressed in blood. Expression stability and ranking analysis were performed using GeNorm and NormFinder algorithms. Results Reference genes were ranked based on their expression stability and the minimum number of genes needed for nomalization as calculated using GeNorm showed that the fewest, most stably expressed genes needed for acurate normalization in RNA expression studies of human whole blood is a combination of TRAP1, FPGS, DECR1 and PPIB. We confirmed the ranking of the best candidate control genes by using an alternative algorithm (NormFinder. Conclusion The reference genes identified in this study are stably expressed in whole blood of humans of both genders with multiple disease conditions and ages 2 to 78. Importantly, they also have different functions within cells and thus should be expressed independently of each other. These genes should be useful as normalization genes for microarray and RT-PCR whole blood studies of human physiology, metabolism and disease.

  19. Validation of reference genes for quantifying changes in gene expression in virus-infected tobacco.

    Science.gov (United States)

    Baek, Eseul; Yoon, Ju-Yeon; Palukaitis, Peter

    2017-10-01

    To facilitate quantification of gene expression changes in virus-infected tobacco plants, eight housekeeping genes were evaluated for their stability of expression during infection by one of three systemically-infecting viruses (cucumber mosaic virus, potato virus X, potato virus Y) or a hypersensitive-response-inducing virus (tobacco mosaic virus; TMV) limited to the inoculated leaf. Five reference-gene validation programs were used to establish the order of the most stable genes for the systemically-infecting viruses as ribosomal protein L25 > β-Tubulin > Actin, and the least stable genes Ubiquitin-conjugating enzyme (UCE) genes were EF1α > Cysteine protease > Actin, and the least stable genes were GAPDH genes, three defense responsive genes were examined to compare their relative changes in gene expression caused by each virus. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Full Text Available Abstract Background The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.

  1. Gene expression patterns in pancreatic tumors, cells and tissues.

    Directory of Open Access Journals (Sweden)

    Anson W Lowe

    2007-03-01

    Full Text Available Cancers of the pancreas originate from both the endocrine and exocrine elements of the organ, and represent a major cause of cancer-related death. This study provides a comprehensive assessment of gene expression for pancreatic tumors, the normal pancreas, and nonneoplastic pancreatic disease.DNA microarrays were used to assess the gene expression for surgically derived pancreatic adenocarcinomas, islet cell tumors, and mesenchymal tumors. The addition of normal pancreata, isolated islets, isolated pancreatic ducts, and pancreatic adenocarcinoma cell lines enhanced subsequent analysis by increasing the diversity in gene expression profiles obtained. Exocrine, endocrine, and mesenchymal tumors displayed unique gene expression profiles. Similarities in gene expression support the pancreatic duct as the origin of adenocarcinomas. In addition, genes highly expressed in other cancers and associated with specific signal transduction pathways were also found in pancreatic tumors.The scope of the present work was enhanced by the inclusion of publicly available datasets that encompass a wide spectrum of human tissues and enabled the identification of candidate genes that may serve diagnostic and therapeutic goals.

  2. A longitudinal study of gene expression in healthy individuals

    Directory of Open Access Journals (Sweden)

    Tessier Michel

    2009-06-01

    Full Text Available Abstract Background The use of gene expression in venous blood either as a pharmacodynamic marker in clinical trials of drugs or as a diagnostic test requires knowledge of the variability in expression over time in healthy volunteers. Here we defined a normal range of gene expression over 6 months in the blood of four cohorts of healthy men and women who were stratified by age (22–55 years and > 55 years and gender. Methods Eleven immunomodulatory genes likely to play important roles in inflammatory conditions such as rheumatoid arthritis and infection in addition to four genes typically used as reference genes were examined by quantitative reverse transcription-polymerase chain reaction (qRT-PCR, as well as the full genome as represented by Affymetrix HG U133 Plus 2.0 microarrays. Results Gene expression levels as assessed by qRT-PCR and microarray were relatively stable over time with ~2% of genes as measured by microarray showing intra-subject differences over time periods longer than one month. Fifteen genes varied by gender. The eleven genes examined by qRT-PCR remained within a limited dynamic range for all individuals. Specifically, for the seven most stably expressed genes (CXCL1, HMOX1, IL1RN, IL1B, IL6R, PTGS2, and TNF, 95% of all samples profiled fell within 1.5–2.5 Ct, the equivalent of a 4- to 6-fold dynamic range. Two subjects who experienced severe adverse events of cancer and anemia, had microarray gene expression profiles that were distinct from normal while subjects who experienced an infection had only slightly elevated levels of inflammatory markers. Conclusion This study defines the range and variability of gene expression in healthy men and women over a six-month period. These parameters can be used to estimate the number of subjects needed to observe significant differences from normal gene expression in clinical studies. A set of genes that varied by gender was also identified as were a set of genes with elevated

  3. Vaginal Gene Expression During Treatment With Aromatase Inhibitors.

    Science.gov (United States)

    Kallak, Theodora Kunovac; Baumgart, Juliane; Nilsson, Kerstin; Åkerud, Helena; Poromaa, Inger Sundström; Stavreus-Evers, Anneli

    2015-12-01

    Aromatase inhibitor (AI) treatment suppresses estrogen biosynthesis and causes genitourinary symptoms of menopause such as vaginal symptoms, ultimately affecting the quality of life for many postmenopausal women with breast cancer. Thus, the aim of this study was to examine vaginal gene expression in women during treatment with AIs compared with estrogen-treated women. The secondary aim was to study the presence and localization of vaginal aromatase. Vaginal biopsies were collected from postmenopausal women treated with AIs and from age-matched control women treated with vaginal estrogen therapy. Differential gene expression was studied with the Affymetrix Gene Chip Gene 1.0 ST Array (Affymetrix Inc, Santa Clara, CA) system, Ingenuity pathway analysis, quantitative real-time polymerase chain reaction, and immunohistochemistry. The expression of 279 genes differed between the 2 groups; AI-treated women had low expression of genes involved in cell differentiation, proliferation, and cell adhesion. Some differentially expressed genes were found to interact indirectly with the estrogen receptor alpha. In addition, aromatase protein staining was evident in the basal and the intermediate vaginal epithelium layers, and also in stromal cells with a slightly stronger staining intensity found in AI-treated women. In this study, we demonstrated that genes involved in cell differentiation, proliferation, and cell adhesion are differentially expressed in AI-treated women. The expression of vaginal aromatase suggests that this could be the result of local and systemic inhibition of aromatase. Our results emphasize the role of estrogen for vaginal cell differentiation and proliferation and future drug candidates should be aimed at improving cell differentiation and proliferation. Copyright © 2015 Elsevier Inc. All rights reserved.

  4. Gene-expression profiling after exposure to C-ion beams

    International Nuclear Information System (INIS)

    Saegusa, Kumiko; Furuno, Aki; Ishikawa, Kenichi; Ishikawa, Atsuko; Ohtsuka, Yoshimi; Kawai, Seiko; Imai, Takashi; Nojima, Kumie

    2005-01-01

    It is recognized that carbon-ion beam kills cancer cells more efficiently than X-ray. In this study we have compared cellular gene expression response after carbon-ion beam exposure with that after X-ray exposure. Gene expression profiles of cultured neonatal human dermal fibroblasts (NHDF) at 0, 1, 3, 6, 12, 18, and 24 hr after exposure to 0.1, 2 and 5 Gy of X-ray or carbon-ion beam were obtained using 22K oligonucleotide microarray. N-way ANOVA analysis of whole gene expression data sets selected 960 genes for carbon-ion beam and 977 genes for X-ray, respectively. Interestingly, majority of these genes (91% for carbon-ion beam and 88% for X-ray, respectively) were down regulated. The selected genes were further classified by their dose-dependence or time-dependence of gene expression change (fold change>1.5). It was revealed that genes involved in cell proliferation had tendency to show time-dependent up regulation by carbon-ion beam. Another N-way ANOVA analysis was performed to select 510 genes, and further selection was made to find 70 genes that showed radiation species-dependent gene expression change (fold change>1.25). These genes were then categorized by the K-Mean clustering method into 4 clusters. Each cluster showed tendency to contain genes involved in cell cycle regulation, cell death, responses to stress and metabolisms, respectively. (author)

  5. AffyMiner: mining differentially expressed genes and biological knowledge in GeneChip microarray data

    Directory of Open Access Journals (Sweden)

    Xia Yuannan

    2006-12-01

    Full Text Available Abstract Background DNA microarrays are a powerful tool for monitoring the expression of tens of thousands of genes simultaneously. With the advance of microarray technology, the challenge issue becomes how to analyze a large amount of microarray data and make biological sense of them. Affymetrix GeneChips are widely used microarrays, where a variety of statistical algorithms have been explored and used for detecting significant genes in the experiment. These methods rely solely on the quantitative data, i.e., signal intensity; however, qualitative data are also important parameters in detecting differentially expressed genes. Results AffyMiner is a tool developed for detecting differentially expressed genes in Affymetrix GeneChip microarray data and for associating gene annotation and gene ontology information with the genes detected. AffyMiner consists of the functional modules, GeneFinder for detecting significant genes in a treatment versus control experiment and GOTree for mapping genes of interest onto the Gene Ontology (GO space; and interfaces to run Cluster, a program for clustering analysis, and GenMAPP, a program for pathway analysis. AffyMiner has been used for analyzing the GeneChip data and the results were presented in several publications. Conclusion AffyMiner fills an important gap in finding differentially expressed genes in Affymetrix GeneChip microarray data. AffyMiner effectively deals with multiple replicates in the experiment and takes into account both quantitative and qualitative data in identifying significant genes. AffyMiner reduces the time and effort needed to compare data from multiple arrays and to interpret the possible biological implications associated with significant changes in a gene's expression.

  6. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  7. Regulation of Gene Expression in Protozoa Parasites

    Directory of Open Access Journals (Sweden)

    Consuelo Gomez

    2010-01-01

    Full Text Available Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  8. Regulation of gene expression in protozoa parasites.

    Science.gov (United States)

    Gomez, Consuelo; Esther Ramirez, M; Calixto-Galvez, Mercedes; Medel, Olivia; Rodríguez, Mario A

    2010-01-01

    Infections with protozoa parasites are associated with high burdens of morbidity and mortality across the developing world. Despite extensive efforts to control the transmission of these parasites, the spread of populations resistant to drugs and the lack of effective vaccines against them contribute to their persistence as major public health problems. Parasites should perform a strict control on the expression of genes involved in their pathogenicity, differentiation, immune evasion, or drug resistance, and the comprehension of the mechanisms implicated in that control could help to develop novel therapeutic strategies. However, until now these mechanisms are poorly understood in protozoa. Recent investigations into gene expression in protozoa parasites suggest that they possess many of the canonical machineries employed by higher eukaryotes for the control of gene expression at transcriptional, posttranscriptional, and epigenetic levels, but they also contain exclusive mechanisms. Here, we review the current understanding about the regulation of gene expression in Plasmodium sp., Trypanosomatids, Entamoeba histolytica and Trichomonas vaginalis.

  9. Spatial gene expression quantification: a tool for analysis of in situ hybridizations in sea anemone Nematostella vectensis

    Directory of Open Access Journals (Sweden)

    Botman Daniel

    2012-10-01

    Full Text Available Abstract Background Spatial gene expression quantification is required for modeling gene regulation in developing organisms. The fruit fly Drosophila melanogaster is the model system most widely applied for spatial gene expression analysis due to its unique embryonic properties: the shape does not change significantly during its early cleavage cycles and most genes are differentially expressed along a straight axis. This system of development is quite exceptional in the animal kingdom. In the sea anemone Nematostella vectensis the embryo changes its shape during early development; there are cell divisions and cell movement, like in most other metazoans. Nematostella is an attractive case study for spatial gene expression since its transparent body wall makes it accessible to various imaging techniques. Findings Our new quantification method produces standardized gene expression profiles from raw or annotated Nematostella in situ hybridizations by measuring the expression intensity along its cell layer. The procedure is based on digital morphologies derived from high-resolution fluorescence pictures. Additionally, complete descriptions of nonsymmetric expression patterns have been constructed by transforming the gene expression images into a three-dimensional representation. Conclusions We created a standard format for gene expression data, which enables quantitative analysis of in situ hybridizations from embryos with various shapes in different developmental stages. The obtained expression profiles are suitable as input for optimization of gene regulatory network models, and for correlation analysis of genes from dissimilar Nematostella morphologies. This approach is potentially applicable to many other metazoan model organisms and may also be suitable for processing data from three-dimensional imaging techniques.

  10. Differential gene expression of cardiac ion channels in human dilated cardiomyopathy.

    Directory of Open Access Journals (Sweden)

    Maria Micaela Molina-Navarro

    Full Text Available BACKGROUND: Dilated cardiomyopathy (DCM is characterized by idiopathic dilation and systolic contractile dysfunction of the cardiac chambers. The present work aimed to study the alterations in gene expression of ion channels involved in cardiomyocyte function. METHODS AND RESULTS: Microarray profiling using the Affymetrix Human Gene® 1.0 ST array was performed using 17 RNA samples, 12 from DCM patients undergoing cardiac transplantation and 5 control donors (CNT. The analysis focused on 7 cardiac ion channel genes, since this category has not been previously studied in human DCM. SCN2B was upregulated, while KCNJ5, KCNJ8, CLIC2, CLCN3, CACNB2, and CACNA1C were downregulated. The RT-qPCR (21 DCM and 8 CNT samples validated the gene expression of SCN2B (p < 0.0001, KCNJ5 (p < 0.05, KCNJ8 (p < 0.05, CLIC2 (p < 0.05, and CACNB2 (p < 0.05. Furthermore, we performed an IPA analysis and we found a functional relationship between the different ion channels studied in this work. CONCLUSION: This study shows a differential expression of ion channel genes involved in cardiac contraction in DCM that might partly underlie the changes in left ventricular function observed in these patients. These results could be the basis for new genetic therapeutic approaches.

  11. Divergent and nonuniform gene expression patterns in mouse brain

    Science.gov (United States)

    Morris, John A.; Royall, Joshua J.; Bertagnolli, Darren; Boe, Andrew F.; Burnell, Josh J.; Byrnes, Emi J.; Copeland, Cathy; Desta, Tsega; Fischer, Shanna R.; Goldy, Jeff; Glattfelder, Katie J.; Kidney, Jolene M.; Lemon, Tracy; Orta, Geralyn J.; Parry, Sheana E.; Pathak, Sayan D.; Pearson, Owen C.; Reding, Melissa; Shapouri, Sheila; Smith, Kimberly A.; Soden, Chad; Solan, Beth M.; Weller, John; Takahashi, Joseph S.; Overly, Caroline C.; Lein, Ed S.; Hawrylycz, Michael J.; Hohmann, John G.; Jones, Allan R.

    2010-01-01

    Considerable progress has been made in understanding variations in gene sequence and expression level associated with phenotype, yet how genetic diversity translates into complex phenotypic differences remains poorly understood. Here, we examine the relationship between genetic background and spatial patterns of gene expression across seven strains of mice, providing the most extensive cellular-resolution comparative analysis of gene expression in the mammalian brain to date. Using comprehensive brainwide anatomic coverage (more than 200 brain regions), we applied in situ hybridization to analyze the spatial expression patterns of 49 genes encoding well-known pharmaceutical drug targets. Remarkably, over 50% of the genes examined showed interstrain expression variation. In addition, the variability was nonuniformly distributed across strain and neuroanatomic region, suggesting certain organizing principles. First, the degree of expression variance among strains mirrors genealogic relationships. Second, expression pattern differences were concentrated in higher-order brain regions such as the cortex and hippocampus. Divergence in gene expression patterns across the brain could contribute significantly to variations in behavior and responses to neuroactive drugs in laboratory mouse strains and may help to explain individual differences in human responsiveness to neuroactive drugs. PMID:20956311

  12. Rethinking cell-cycle-dependent gene expression in Schizosaccharomyces pombe.

    Science.gov (United States)

    Cooper, Stephen

    2017-11-01

    Three studies of gene expression during the division cycle of Schizosaccharomyces pombe led to the proposal that a large number of genes are expressed at particular times during the S. pombe cell cycle. Yet only a small fraction of genes proposed to be expressed in a cell-cycle-dependent manner are reproducible in all three published studies. In addition to reproducibility problems, questions about expression amplitudes, cell-cycle timing of expression, synchronization artifacts, and the problem with methods for synchronizing cells must be considered. These problems and complications prompt the idea that caution should be used before accepting the conclusion that there are a large number of genes expressed in a cell-cycle-dependent manner in S. pombe.

  13. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Directory of Open Access Journals (Sweden)

    Lemay Danielle G

    2012-09-01

    Full Text Available Abstract Background In previous studies, gene neighborhoods—spatial clusters of co-expressed genes in the genome—have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The

  14. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    Science.gov (United States)

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly

  15. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Science.gov (United States)

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  16. Identification of imprinted genes subject to parent-of-origin specific expression in Arabidopsis thaliana seeds

    LENUS (Irish Health Repository)

    McKeown, Peter C

    2011-08-12

    confirmed via allele-specific transcript analysis across a range of different accessions. Differentially methylated regions were identified adjacent to ATCDC48 and PDE120, which may represent candidate imprinting control regions. Finally, we demonstrate that expression levels of these three genes in vegetative tissues are MET1-dependent, while their uniparental maternal expression in the seed is not dependent on MET1. Conclusions Using a cDNA-AFLP transcriptome profiling approach, we have identified three genes, ATCDC48, PDE120 and MS5-like which represent novel maternally expressed imprinted genes in the Arabidopsis thaliana seed. The extent of overlap between our cDNA-AFLP screen for maternally expressed imprinted genes, and other screens for imprinted and endosperm-expressed genes is discussed.

  17. Identification of imprinted genes subject to parent-of-origin specific expression in Arabidopsis thaliana seeds

    Directory of Open Access Journals (Sweden)

    Wennblom Trevor J

    2011-08-01

    seeds was confirmed via allele-specific transcript analysis across a range of different accessions. Differentially methylated regions were identified adjacent to ATCDC48 and PDE120, which may represent candidate imprinting control regions. Finally, we demonstrate that expression levels of these three genes in vegetative tissues are MET1-dependent, while their uniparental maternal expression in the seed is not dependent on MET1. Conclusions Using a cDNA-AFLP transcriptome profiling approach, we have identified three genes, ATCDC48, PDE120 and MS5-like which represent novel maternally expressed imprinted genes in the Arabidopsis thaliana seed. The extent of overlap between our cDNA-AFLP screen for maternally expressed imprinted genes, and other screens for imprinted and endosperm-expressed genes is discussed.

  18. Selection for the compactness of highly expressed genes in Gallus gallus

    Directory of Open Access Journals (Sweden)

    Zhou Ming

    2010-05-01

    Full Text Available Abstract Background Coding sequence (CDS length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. Melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between expression level and gene size, CDS length as well as intron length. Different models such as selection for economy model, genomic design model, and mutational bias hypotheses have been proposed to explain such observation. The debate of which model is a superior one to explain the observation has not been settled down. The chicken (Gallus gallus is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. As D. Melanogaster, chicken has a larger effective population size, selection for chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern upon selection pressure. Results Based on different technologies, we gathered expression data for nuclear protein coding, single-splicing genes from Gallus gallus genome and compared them with gene parameters. We found that gene size, CDS length, first intron length, average intron length, and total intron length are negatively correlated with expression level and expression breadth significantly. The tissue specificity is positively correlated with the first intron length but negatively correlated with the average intron length, and not correlated with the CDS length and protein domain numbers. Comparison analyses showed that ubiquitously expressed genes and narrowly expressed genes with the similar expression levels do not differ in compactness. Our data provided evidence that the genomic design model can not, at least in part, explain our observations. We grouped all somatic-tissue-specific genes

  19. Co-expression Network Approach to Studying the Effects of Botulinum Neurotoxin-A.

    Science.gov (United States)

    Mukund, Kavitha; Ward, Samuel R; Lieber, Richard L; Subramaniam, Shankar

    2017-10-16

    Botulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous workBotulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules

  20. Predictive modelling of gene expression from transcriptional regulatory elements.

    Science.gov (United States)

    Budden, David M; Hurley, Daniel G; Crampin, Edmund J

    2015-07-01

    Predictive modelling of gene expression provides a powerful framework for exploring the regulatory logic underpinning transcriptional regulation. Recent studies have demonstrated the utility of such models in identifying dysregulation of gene and miRNA expression associated with abnormal patterns of transcription factor (TF) binding or nucleosomal histone modifications (HMs). Despite the growing popularity of such approaches, a comparative review of the various modelling algorithms and feature extraction methods is lacking. We define and compare three methods of quantifying pairwise gene-TF/HM interactions and discuss their suitability for integrating the heterogeneous chromatin immunoprecipitation (ChIP)-seq binding patterns exhibited by TFs and HMs. We then construct log-linear and ϵ-support vector regression models from various mouse embryonic stem cell (mESC) and human lymphoblastoid (GM12878) data sets, considering both ChIP-seq- and position weight matrix- (PWM)-derived in silico TF-binding. The two algorithms are evaluated both in terms of their modelling prediction accuracy and ability to identify the established regulatory roles of individual TFs and HMs. Our results demonstrate that TF-binding and HMs are highly predictive of gene expression as measured by mRNA transcript abundance, irrespective of algorithm or cell type selection and considering both ChIP-seq and PWM-derived TF-binding. As we encourage other researchers to explore and develop these results, our framework is implemented using open-source software and made available as a preconfigured bootable virtual environment. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  1. Genome-wide profiling of 24 hr diel rhythmicity in the water flea, Daphnia pulex: network analysis reveals rhythmic gene expression and enhances functional gene annotation.

    Science.gov (United States)

    Rund, Samuel S C; Yoo, Boyoung; Alam, Camille; Green, Taryn; Stephens, Melissa T; Zeng, Erliang; George, Gary F; Sheppard, Aaron D; Duffield, Giles E; Milenković, Tijana; Pfrender, Michael E

    2016-08-18

    Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel

  2. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-05-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  3. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-01-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  4. Aging and Gene Expression in the Primate Brain

    Energy Technology Data Exchange (ETDEWEB)

    Fraser, Hunter B.; Khaitovich, Philipp; Plotkin, Joshua B.; Paabo, Svante; Eisen, Michael B.

    2005-02-18

    It is well established that gene expression levels in many organisms change during the aging process, and the advent of DNA microarrays has allowed genome-wide patterns of transcriptional changes associated with aging to be studied in both model organisms and various human tissues. Understanding the effects of aging on gene expression in the human brain is of particular interest, because of its relation to both normal and pathological neurodegeneration. Here we show that human cerebral cortex, human cerebellum, and chimpanzee cortex each undergo different patterns of age-related gene expression alterations. In humans, many more genes undergo consistent expression changes in the cortex than in the cerebellum; in chimpanzees, many genes change expression with age in cortex, but the pattern of changes in expression bears almost no resemblance to that of human cortex. These results demonstrate the diversity of aging patterns present within the human brain, as well as how rapidly genome-wide patterns of aging can evolve between species; they may also have implications for the oxidative free radical theory of aging, and help to improve our understanding of human neurodegenerative diseases.

  5. Aging and gene expression in the primate brain.

    Directory of Open Access Journals (Sweden)

    Hunter B Fraser

    2005-09-01

    Full Text Available It is well established that gene expression levels in many organisms change during the aging process, and the advent of DNA microarrays has allowed genome-wide patterns of transcriptional changes associated with aging to be studied in both model organisms and various human tissues. Understanding the effects of aging on gene expression in the human brain is of particular interest, because of its relation to both normal and pathological neurodegeneration. Here we show that human cerebral cortex, human cerebellum, and chimpanzee cortex each undergo different patterns of age-related gene expression alterations. In humans, many more genes undergo consistent expression changes in the cortex than in the cerebellum; in chimpanzees, many genes change expression with age in cortex, but the pattern of changes in expression bears almost no resemblance to that of human cortex. These results demonstrate the diversity of aging patterns present within the human brain, as well as how rapidly genome-wide patterns of aging can evolve between species; they may also have implications for the oxidative free radical theory of aging, and help to improve our understanding of human neurodegenerative diseases.

  6. Clock Genes Influence Gene Expression in Growth Plate and Endochondral Ossification in Mice*

    Science.gov (United States)

    Takarada, Takeshi; Kodama, Ayumi; Hotta, Shogo; Mieda, Michihiro; Shimba, Shigeki; Hinoi, Eiichi; Yoneda, Yukio

    2012-01-01

    We have previously shown transient promotion by parathyroid hormone of Period-1 (Per1) expression in cultured chondrocytes. Here we show the modulation by clock genes of chondrogenic differentiation through gene transactivation of the master regulator of chondrogenesis Indian hedgehog (IHH) in chondrocytes of the growth plate. Several clock genes were expressed with oscillatory rhythmicity in cultured chondrocytes and rib growth plate in mice, whereas chondrogenesis was markedly inhibited in stable transfectants of Per1 in chondrocytic ATDC5 cells and in rib growth plate chondrocytes from mice deficient of brain and muscle aryl hydrocarbon receptor nuclear translocator-like (BMAL1). Ihh promoter activity was regulated by different clock gene products, with clear circadian rhythmicity in expression profiles of Ihh in the growth plate. In BMAL1-null mice, a predominant decrease was seen in Ihh expression in the growth plate with a smaller body size than in wild-type mice. BMAL1 deficit led to disruption of the rhythmic expression profiles of both Per1 and Ihh in the growth plate. A clear rhythmicity was seen with Ihh expression in ATDC5 cells exposed to dexamethasone. In young mice defective of BMAL1 exclusively in chondrocytes, similar abnormalities were found in bone growth and Ihh expression. These results suggest that endochondral ossification is under the regulation of particular clock gene products expressed in chondrocytes during postnatal skeletogenesis through a mechanism relevant to the rhythmic Ihh expression. PMID:22936800

  7. A Gene Expression Classifier of Node-Positive Colorectal Cancer

    Directory of Open Access Journals (Sweden)

    Paul F. Meeh

    2009-10-01

    Full Text Available We used digital long serial analysis of gene expression to discover gene expression differences between node-negative and node-positive colorectal tumors and developed a multigene classifier able to discriminate between these two tumor types. We prepared and sequenced long serial analysis of gene expression libraries from one node-negative and one node-positive colorectal tumor, sequenced to a depth of 26,060 unique tags, and identified 262 tags significantly differentially expressed between these two tumors (P < 2 x 10-6. We confirmed the tag-to-gene assignments and differential expression of 31 genes by quantitative real-time polymerase chain reaction, 12 of which were elevated in the node-positive tumor. We analyzed the expression levels of these 12 upregulated genes in a validation panel of 23 additional tumors and developed an optimized seven-gene logistic regression classifier. The classifier discriminated between node-negative and node-positive tumors with 86% sensitivity and 80% specificity. Receiver operating characteristic analysis of the classifier revealed an area under the curve of 0.86. Experimental manipulation of the function of one classification gene, Fibronectin, caused profound effects on invasion and migration of colorectal cancer cells in vitro. These results suggest that the development of node-positive colorectal cancer occurs in part through elevated epithelial FN1 expression and suggest novel strategies for the diagnosis and treatment of advanced disease.

  8. Gene expression analysis to identify molecular correlates of pre- and post-conditioning derived neuroprotection.

    Science.gov (United States)

    Prasad, Shiv S; Russell, Marsha; Nowakowska, Margeryta; Williams, Andrew; Yauk, Carole

    2012-06-01

    Mild ischaemic exposures before or after severe injurious ischaemia that elicit neuroprotective responses are referred to as preconditioning and post-conditioning. The corresponding molecular mechanisms of neuroprotection are not completely understood. Identification of the genes and associated pathways of corresponding neuroprotection would provide insight into neuronal survival, potential therapeutic approaches and assessments of therapies for stroke. The objectives of this study were to use global gene expression approach to infer the molecular mechanisms in pre- and post-conditioning-derived neuroprotection in cortical neurons following oxygen and glucose deprivation (OGD) in vitro and then to apply these findings to predict corresponding functional pathways. To this end, microarray analysis was applied to rat cortical neurons with or without the pre- and post-conditioning treatments at 3-h post-reperfusion, and differentially expressed transcripts were subjected to statistical, hierarchical clustering and pathway analyses. The expression patterns of 3,431 genes altered under all conditions of ischaemia (with and without pre- or post-conditioning). We identified 1,595 genes that were commonly regulated within both the pre- and post-conditioning treatments. Cluster analysis revealed that transcription profiles clustered tightly within controls, non-conditioned OGD and neuroprotected groups. Two clusters defining neuroprotective conditions associated with up- and downregulated genes were evident. The five most upregulated genes within the neuroprotective clusters were Tagln, Nes, Ptrf, Vim and Adamts9, and the five most downregulated genes were Slc7a3, Bex1, Brunol4, Nrxn3 and Cpne4. Pathway analysis revealed that the intracellular and second messenger signalling pathways in addition to cell death were predominantly associated with downregulated pre- and post-conditioning associated genes, suggesting that modulation of cell death and signal transduction pathways

  9. Rhythmic diel pattern of gene expression in juvenile maize leaf.

    Directory of Open Access Journals (Sweden)

    Maciej Jończyk

    Full Text Available BACKGROUND: Numerous biochemical and physiological parameters of living organisms follow a circadian rhythm. Although such rhythmic behavior is particularly pronounced in plants, which are strictly dependent on the daily photoperiod, data on the molecular aspects of the diurnal cycle in plants is scarce and mostly concerns the model species Arabidopsis thaliana. Here we studied the leaf transcriptome in seedlings of maize, an important C4 crop only distantly related to A. thaliana, throughout a cycle of 10 h darkness and 14 h light to look for rhythmic patterns of gene expression. RESULTS: Using DNA microarrays comprising ca. 43,000 maize-specific probes we found that ca. 12% of all genes showed clear-cut diel rhythms of expression. Cluster analysis identified 35 groups containing from four to ca. 1,000 genes, each comprising genes of similar expression patterns. Perhaps unexpectedly, the most pronounced and most common (concerning the highest number of genes expression maxima were observed towards and during the dark phase. Using Gene Ontology classification several meaningful functional associations were found among genes showing similar diel expression patterns, including massive induction of expression of genes related to gene expression, translation, protein modification and folding at dusk and night. Additionally, we found a clear-cut tendency among genes belonging to individual clusters to share defined transcription factor-binding sequences. CONCLUSIONS: Co-expressed genes belonging to individual clusters are likely to be regulated by common mechanisms. The nocturnal phase of the diurnal cycle involves gross induction of fundamental biochemical processes and should be studied more thoroughly than was appreciated in most earlier physiological studies. Although some general mechanisms responsible for the diel regulation of gene expression might be shared among plants, details of the diurnal regulation of gene expression seem to differ

  10. Molecular transformation, gene cloning, and gene expression systems for filamentous fungi

    Science.gov (United States)

    Gold, Scott E.; Duick, John W.; Redman, Regina S.; Rodriguez, Rusty J.

    2001-01-01

    This chapter discusses the molecular transformation, gene cloning, and gene expression systems for filamentous fungi. Molecular transformation involves the movement of discrete amounts of DNA into cells, the expression of genes on the transported DNA, and the sustainable replication of the transforming DNA. The ability to transform fungi is dependent on the stable replication and expression of genes located on the transforming DNA. Three phenomena observed in bacteria, that is, competence, plasmids, and restriction enzymes to facilitate cloning, were responsible for the development of molecular transformation in fungi. Initial transformation success with filamentous fungi, involving the complementation of auxotrophic mutants by exposure to sheared genomic DNA or RNA from wt isolates, occurred with low transformation efficiencies. In addition, it was difficult to retrieve complementing DNA fragments and isolate genes of interest. This prompted the development of transformation vectors and methods to increase efficiencies. The physiological studies performed with fungi indicated that the cell wall could be removed to generate protoplasts. It was evident that protoplasts could be transformed with significantly greater efficiencies than walled cells.

  11. Imaging gene expression in real-time using aptamers

    Energy Technology Data Exchange (ETDEWEB)

    Shin, Il Chung [Iowa State Univ., Ames, IA (United States)

    2011-01-01

    Signal transduction pathways are usually activated by external stimuli and are transient. The downstream changes such as transcription of the activated genes are also transient. Real-time detection of promoter activity is useful for understanding changes in gene expression, especially during cell differentiation and in development. A simple and reliable method for viewing gene expression in real time is not yet available. Reporter proteins such as fluorescent proteins and luciferase allow for non-invasive detection of the products of gene expression in living cells. However, current reporter systems do not provide for real-time imaging of promoter activity in living cells. This is because of the long time period after transcription required for fluorescent protein synthesis and maturation. We have developed an RNA reporter system for imaging in real-time to detect changes in promoter activity as they occur. The RNA reporter uses strings of RNA aptamers that constitute IMAGEtags (Intracellular MultiAptamer GEnetic tags), which can be expressed from a promoter of choice. The tobramycin, neomycin and PDC RNA aptamers have been utilized for this system and expressed in yeast from the GAL1 promoter. The IMAGEtag RNA kinetics were quantified by RT-qPCR. In yeast precultured in raffinose containing media the GAL1 promoter responded faster than in yeast precultured in glucose containing media. IMAGEtag RNA has relatively short half-life (5.5 min) in yeast. For imaging, the yeast cells are incubated with their ligands that are labeled with fluorescent dyes. To increase signal to noise, ligands have been separately conjugated with the FRET (Förster resonance energy transfer) pairs, Cy3 and Cy5. With these constructs, the transcribed aptamers can be imaged after activation of the promoter by galactose. FRET was confirmed with three different approaches, which were sensitized emission, acceptor photobleaching and donor lifetime by FLIM (fluorescence lifetime imaging

  12. Imaging gene expression in real-time using aptamers

    Energy Technology Data Exchange (ETDEWEB)

    Shin, Ilchung [Iowa State Univ., Ames, IA (United States)

    2012-01-01

    Signal transduction pathways are usually activated by external stimuli and are transient. The downstream changes such as transcription of the activated genes are also transient. Real-time detection of promoter activity is useful for understanding changes in gene expression, especially during cell differentiation and in development. A simple and reliable method for viewing gene expression in real time is not yet available. Reporter proteins such as fluorescent proteins and luciferase allow for non-invasive detection of the products of gene expression in living cells. However, current reporter systems do not provide for real-time imaging of promoter activity in living cells. This is because of the long time period after transcription required for fluorescent protein synthesis and maturation. We have developed an RNA reporter system for imaging in real-time to detect changes in promoter activity as they occur. The RNA reporter uses strings of RNA aptamers that constitute IMAGEtags (Intracellular MultiAptamer GEnetic tags), which can be expressed from a promoter of choice. The tobramycin, neomycin and PDC RNA aptamers have been utilized for this system and expressed in yeast from the GAL1 promoter. The IMAGEtag RNA kinetics were quantified by RT-qPCR. In yeast precultured in raffinose containing media the GAL1 promoter responded faster than in yeast precultured in glucose containing media. IMAGEtag RNA has relatively short half-life (5.5 min) in yeast. For imaging, the yeast cells are incubated with their ligands that are labeled with fluorescent dyes. To increase signal to noise, ligands have been separately conjugated with the FRET (Förster resonance energy transfer) pairs, Cy3 and Cy5. With these constructs, the transcribed aptamers can be imaged after activation of the promoter by galactose. FRET was confirmed with three different approaches, which were sensitized emission, acceptor photobleaching and donor lifetime by FLIM (fluorescence lifetime imaging

  13. With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

    Science.gov (United States)

    Chapman, Joanne R; Waldenström, Jonas

    2015-01-01

    The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.

  14. Gene expression variability in human hepatic drug metabolizing enzymes and transporters.

    Directory of Open Access Journals (Sweden)

    Lun Yang

    Full Text Available Interindividual variability in the expression of drug-metabolizing enzymes and transporters (DMETs in human liver may contribute to interindividual differences in drug efficacy and adverse reactions. Published studies that analyzed variability in the expression of DMET genes were limited by sample sizes and the number of genes profiled. We systematically analyzed the expression of 374 DMETs from a microarray data set consisting of gene expression profiles derived from 427 human liver samples. The standard deviation of interindividual expression for DMET genes was much higher than that for non-DMET genes. The 20 DMET genes with the largest variability in the expression provided examples of the interindividual variation. Gene expression data were also analyzed using network analysis methods, which delineates the similarities of biological functionalities and regulation mechanisms for these highly variable DMET genes. Expression variability of human hepatic DMET genes may affect drug-gene interactions and disease susceptibility, with concomitant clinical implications.

  15. Mutants of Agrobacterium tumefaciens with elevated vir gene expression

    International Nuclear Information System (INIS)

    Pazour, G.J.; Ta, C.N.; Das, A.

    1991-01-01

    Expression of Agrobacterium tumefaciens virulence (vir) genes requires virA, virG, and a plant-derived inducing compound such as acetosyringone. To identify the critical functional domains of virA and virG, a mutational approach was used. Agrobacterium A136 harboring plasmid pGP159, which contains virA, virG, and a reporter virB:lacZ gene fusion, was mutagenized with UV light or nitrosoguanidine. Survivors that formed blue colonies on a plate containing 5-bromo-4-chloro-3-indolyl beta-D-galactoside were isolated and analyzed. Quantification of beta-galactosidase activity in liquid assays identified nine mutant strains. By plasmid reconstruction and other procedures, all mutations mapped to the virA locus. These mutations caused an 11- to 560-fold increase in the vegetative level of virB:lacZ reporter gene expression. DNA sequence analysis showed that the mutations are located in four regions of VirA: transmembrane domain one, the active site, a glycine-rich region with homology to ATP-binding sites, and a region at the C terminus that has homology to the N terminus of VirG

  16. Gravity-regulated gene expression in Arabidopsis thaliana

    Science.gov (United States)

    Sederoff, Heike; Brown, Christopher S.; Heber, Steffen; Kajla, Jyoti D.; Kumar, Sandeep; Lomax, Terri L.; Wheeler, Benjamin; Yalamanchili, Roopa

    Plant growth and development is regulated by changes in environmental signals. Plants sense environmental changes and respond to them by modifying gene expression programs to ad-just cell growth, differentiation, and metabolism. Functional expression of genes comprises many different processes including transcription, translation, post-transcriptional and post-translational modifications, as well as the degradation of RNA and proteins. Recently, it was discovered that small RNAs (sRNA, 18-24 nucleotides long), which are heritable and systemic, are key elements in regulating gene expression in response to biotic and abiotic changes. Sev-eral different classes of sRNAs have been identified that are part of a non-cell autonomous and phloem-mobile network of regulators affecting transcript stability, translational kinetics, and DNA methylation patterns responsible for heritable transcriptional silencing (epigenetics). Our research has focused on gene expression changes in response to gravistimulation of Arabidopsis roots. Using high-throughput technologies including microarrays and 454 sequencing, we iden-tified rapid changes in transcript abundance of genes as well as differential expression of small RNA in Arabidopsis root apices after minutes of reorientation. Some of the differentially regu-lated transcripts are encoded by genes that are important for the bending response. Functional mutants of those genes respond faster to reorientation than the respective wild type plants, indicating that these proteins are repressors of differential cell elongation. We compared the gravity responsive sRNAs to the changes in transcript abundances of their putative targets and identified several potential miRNA: target pairs. Currently, we are using mutant and transgenic Arabidopsis plants to characterize the function of those miRNAs and their putative targets in gravitropic and phototropic responses in Arabidopsis.

  17. Stunned Silence: Gene Expression Programs in Human Cells Infected with Monkeypox or Vaccinia Virus

    Science.gov (United States)

    Rubins, Kathleen H.; Hensley, Lisa E.; Relman, David A.; Brown, Patrick O.

    2011-01-01

    Poxviruses use an arsenal of molecular weapons to evade detection and disarm host immune responses. We used DNA microarrays to investigate the gene expression responses to infection by monkeypox virus (MPV), an emerging human pathogen, and Vaccinia virus (VAC), a widely used model and vaccine organism, in primary human macrophages, primary human fibroblasts and HeLa cells. Even as the overwhelmingly infected cells approached their demise, with extensive cytopathic changes, their gene expression programs appeared almost oblivious to poxvirus infection. Although killed (gamma-irradiated) MPV potently induced a transcriptional program characteristic of the interferon response, no such response was observed during infection with either live MPV or VAC. Moreover, while the gene expression response of infected cells to stimulation with ionomycin plus phorbol 12-myristate 13-acetate (PMA), or poly (I-C) was largely unimpaired by infection with MPV, a cluster of pro-inflammatory genes were a notable exception. Poly(I-C) induction of genes involved in alerting the innate immune system to the infectious threat, including TNF-alpha, IL-1 alpha and beta, CCL5 and IL-6, were suppressed by infection with live MPV. Thus, MPV selectively inhibits expression of genes with critical roles in cell-signaling pathways that activate innate immune responses, as part of its strategy for stealthy infection. PMID:21267444

  18. Acute Vhl gene inactivation induces cardiac HIF-dependent erythropoietin gene expression.

    Directory of Open Access Journals (Sweden)

    Marta Miró-Murillo

    Full Text Available Von Hippel Lindau (Vhl gene inactivation results in embryonic lethality. The consequences of its inactivation in adult mice, and of the ensuing activation of the hypoxia-inducible factors (HIFs, have been explored mainly in a tissue-specific manner. This mid-gestation lethality can be also circumvented by using a floxed Vhl allele in combination with an ubiquitous tamoxifen-inducible recombinase Cre-ER(T2. Here, we characterize a widespread reduction in Vhl gene expression in Vhl(floxed-UBC-Cre-ER(T2 adult mice after dietary tamoxifen administration, a convenient route of administration that has yet to be fully characterized for global gene inactivation. Vhl gene inactivation rapidly resulted in a marked splenomegaly and skin erythema, accompanied by renal and hepatic induction of the erythropoietin (Epo gene, indicative of the in vivo activation of the oxygen sensing HIF pathway. We show that acute Vhl gene inactivation also induced Epo gene expression in the heart, revealing cardiac tissue to be an extra-renal source of EPO. Indeed, primary cardiomyocytes and HL-1 cardiac cells both induce Epo gene expression when exposed to low O(2 tension in a HIF-dependent manner. Thus, as well as demonstrating the potential of dietary tamoxifen administration for gene inactivation studies in UBC-Cre-ER(T2 mouse lines, this data provides evidence of a cardiac oxygen-sensing VHL/HIF/EPO pathway in adult mice.

  19. Gene expression, nucleotide composition and codon usage bias of genes associated with human Y chromosome.

    Science.gov (United States)

    Choudhury, Monisha Nath; Uddin, Arif; Chakraborty, Supriyo

    2017-06-01

    Analysis of codon usage pattern is important to understand the genetic and evolutionary characteristics of genomes. We have used bioinformatic approaches to analyze the codon usage bias (CUB) of the genes located in human Y chromosome. Codon bias index (CBI) indicated that the overall extent of codon usage bias was low. The relative synonymous codon usage (RSCU) analysis suggested that approximately half of the codons out of 59 synonymous codons were most frequently used, and possessed a T or G at the third codon position. The codon usage pattern was different in different genes as revealed from correspondence analysis (COA). A significant correlation between effective number of codons (ENC) and various GC contents suggests that both mutation pressure and natural selection affect the codon usage pattern of genes located in human Y chromosome. In addition, Y-linked genes have significant difference in GC contents at the second and third codon positions, expression level, and codon usage pattern of some codons like the SPANX genes in X chromosome.

  20. A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.

    Science.gov (United States)

    Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C

    2017-10-01

    Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.

  1. Hepatocyte specific expression of human cloned genes

    Energy Technology Data Exchange (ETDEWEB)

    Cortese, R

    1986-01-01

    A large number of proteins are specifically synthesized in the hepatocyte. Only the adult liver expresses the complete repertoire of functions which are required at various stages during development. There is therefore a complex series of regulatory mechanisms responsible for the maintenance of the differentiated state and for the developmental and physiological variations in the pattern of gene expression. Human hepatoma cell lines HepG2 and Hep3B display a pattern of gene expression similar to adult and fetal liver, respectively; in contrast, cultured fibroblasts or HeLa cells do not express most of the liver specific genes. They have used these cell lines for transfection experiments with cloned human liver specific genes. DNA segments coding for alpha1-antitrypsin and retinol binding protein (two proteins synthesized both in fetal and adult liver) are expressed in the hepatoma cell lines HepG2 and Hep3B, but not in HeLa cells or fibroblasts. A DNA segment coding for haptoglobin (a protein synthesized only after birth) is only expressed in the hepatoma cell line HepG2 but not in Hep3B nor in non hepatic cell lines. The information for tissue specific expression is located in the 5' flanking region of all three genes. In vivo competition experiments show that these DNA segments bind to a common, apparently limiting, transacting factor. Conventional techniques (Bal deletions, site directed mutagenesis, etc.) have been used to precisely identify the DNA sequences responsible for these effects. The emerging picture is complex: they have identified multiple, separate transcriptional signals, essential for maximal promoter activation and tissue specific expression. Some of these signals show a negative effect on transcription in fibroblast cell lines.

  2. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

    Science.gov (United States)

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928

  3. Characterization of the MLO gene family in Rosaceae and gene expression analysis in Malus domestica.

    Science.gov (United States)

    Pessina, Stefano; Pavan, Stefano; Catalano, Domenico; Gallotta, Alessandra; Visser, Richard G F; Bai, Yuling; Malnoy, Mickael; Schouten, Henk J

    2014-07-22

    Powdery mildew (PM) is a major fungal disease of thousands of plant species, including many cultivated Rosaceae. PM pathogenesis is associated with up-regulation of MLO genes during early stages of infection, causing down-regulation of plant defense pathways. Specific members of the MLO gene family act as PM-susceptibility genes, as their loss-of-function mutations grant durable and broad-spectrum resistance. We carried out a genome-wide characterization of the MLO gene family in apple, peach and strawberry, and we isolated apricot MLO homologs through a PCR-approach. Evolutionary relationships between MLO homologs were studied and syntenic blocks constructed. Homologs that are candidates for being PM susceptibility genes were inferred by phylogenetic relationships with functionally characterized MLO genes and, in apple, by monitoring their expression following inoculation with the PM causal pathogen Podosphaera leucotricha. Genomic tools available for Rosaceae were exploited in order to characterize the MLO gene family. Candidate MLO susceptibility genes were identified. In follow-up studies it can be investigated whether silencing or a loss-of-function mutations in one or more of these candidate genes leads to PM resistance.

  4. Vascular Gene Expression in Nonneoplastic and Malignant Brain

    Science.gov (United States)

    Madden, Stephen L.; Cook, Brian P.; Nacht, Mariana; Weber, William D.; Callahan, Michelle R.; Jiang, Yide; Dufault, Michael R.; Zhang, Xiaoming; Zhang, Wen; Walter-Yohrling, Jennifer; Rouleau, Cecile; Akmaev, Viatcheslav R.; Wang, Clarence J.; Cao, Xiaohong; St. Martin, Thia B.; Roberts, Bruce L.; Teicher, Beverly A.; Klinger, Katherine W.; Stan, Radu-Virgil; Lucey, Brenden; Carson-Walter, Eleanor B.; Laterra, John; Walter, Kevin A.

    2004-01-01

    Malignant gliomas are uniformly lethal tumors whose morbidity is mediated in large part by the angiogenic response of the brain to the invading tumor. This profound angiogenic response leads to aggressive tumor invasion and destruction of surrounding brain tissue as well as blood-brain barrier breakdown and life-threatening cerebral edema. To investigate the molecular mechanisms governing the proliferation of abnormal microvasculature in malignant brain tumor patients, we have undertaken a cell-specific transcriptome analysis from surgically harvested nonneoplastic and tumor-associated endothelial cells. SAGE-derived endothelial cell gene expression patterns from glioma and nonneoplastic brain tissue reveal distinct gene expression patterns and consistent up-regulation of certain glioma endothelial marker genes across patient samples. We define the G-protein-coupled receptor RDC1 as a tumor endothelial marker whose expression is distinctly induced in tumor endothelial cells of both brain and peripheral vasculature. Further, we demonstrate that the glioma-induced gene, PV1, shows expression both restricted to endothelial cells and coincident with endothelial cell tube formation. As PV1 provides a framework for endothelial cell caveolar diaphragms, this protein may serve to enhance glioma-induced disruption of the blood-brain barrier and transendothelial exchange. Additional characterization of this extensive brain endothelial cell gene expression database will provide unique molecular insights into vascular gene expression. PMID:15277233

  5. Studies of Wilms’ Tumor (WT1 Gene Expression in Adult Acute Leukemias in Singapore

    Directory of Open Access Journals (Sweden)

    Che Kang Lim

    2007-01-01

    Full Text Available Biomarkers provide certain values for diagnosis, monitor treatment effi cacy, or for the development of novel therapeutic approach for particular diseases. Thus, the identifi cation of specifi c of biomarkers for specifi c medical problems, including malignant diseases may be valuable in medical practice. In the study, we have used the Wilms’ tumor gene (WT1 as a biomarker to evaluate its expression in local adult patients with newly diagnosed acute leukemia, including both acute myeloid and lymphoid leukemias (AML and ALL.Aim: To investigate WT1 gene expression in adult patients with acute leukemia at diagnosis.Methods: Eighteen patients with acute leukemia diagnosed at Singapore General Hospital, Singapore, between September, 2004 and July, 2005 were included in this study. There were fifteen AML and three ALL cases aged from 18 to 71 years old. Total RNA and DNA was extracted from peripheral blood mononuclear cells (PBMCs. Expression of WT1 was detected by nested reverse-transcription polymerase chain reaction (Nested RT-PCR. K562, and 3T3 cells were used as positive- and negative-controls. The results were revalidated using real-time PCR. HLA-A genotyping was performed using sequence specific oligonucleotide polymorphism (SSOP analysis.Results: WT1 gene was exclusively expressed in all eighteen, including three ALL and fi fteen AML, patients. In contrast with WT1 gene, the HLA-A genotyping was remarkably heterogeneous in these patients.Conclusions: WT1 gene expression was observed in local patients with acute leukemia at diagnosis. It may be used as a potential molecular marker for diagnosis, clinical progression of the diseases or monitoring the response to treatment, as well as a target for the development of novel therapeutic approaches.

  6. Electrotransfer parameters as a tool for controlled and targeted gene expression in skin

    Directory of Open Access Journals (Sweden)

    Spela Kos

    2016-01-01

    Full Text Available Skin is an attractive target for gene electrotransfer. It consists of different cell types that can be transfected, leading to various responses to gene electrotransfer. We demonstrate that these responses could be controlled by selecting the appropriate electrotransfer parameters. Specifically, the application of low or high electric pulses, applied by multi-electrode array, provided the possibility to control the depth of the transfection in the skin, the duration and the level of gene expression, as well as the local or systemic distribution of the transgene. The influence of electric pulse type was first studied using a plasmid encoding a reporter gene (DsRed. Then, plasmids encoding therapeutic genes (IL-12, shRNA against endoglin, shRNA against melanoma cell adhesion molecule were used, and their effects on wound healing and cutaneous B16F10 melanoma tumors were investigated. The high-voltage pulses resulted in gene expression that was restricted to superficial skin layers and induced a local response. In contrast, the low-voltage electric pulses promoted transfection into the deeper skin layers, resulting in prolonged gene expression and higher transgene production, possibly with systemic distribution. Therefore, in the translation into the clinics, it will be of the utmost importance to adjust the electrotransfer parameters for different therapeutic approaches and specific mode of action of the therapeutic gene.

  7. GeneCAT--novel webtools that combine BLAST and co-expression analyses

    DEFF Research Database (Denmark)

    Mutwil, Marek; Obro, Jens; Willats, William G T

    2008-01-01

    The gene co-expression analysis toolbox (GeneCAT) introduces several novel microarray data analyzing tools. First, the multigene co-expression analysis, combined with co-expressed gene networks, provides a more powerful data mining technique than standard, single-gene co-expression analysis. Second...... orthologs in the plant model organisms Arabidopsis thaliana and Hordeum vulgare (Barley). GeneCAT is equipped with expression data for the model plant A. thaliana, and first to introduce co-expression mining tools for the monocot Barley. GeneCAT is available at http://genecat.mpg.de....

  8. Automated discovery of functional generality of human gene expression programs.

    Directory of Open Access Journals (Sweden)

    Georg K Gerber

    2007-08-01

    Full Text Available An important research problem in computational biology is the identification of expression programs, sets of co-expressed genes orchestrating normal or pathological processes, and the characterization of the functional breadth of these programs. The use of human expression data compendia for discovery of such programs presents several challenges including cellular inhomogeneity within samples, genetic and environmental variation across samples, uncertainty in the numbers of programs and sample populations, and temporal behavior. We developed GeneProgram, a new unsupervised computational framework based on Hierarchical Dirichlet Processes that addresses each of the above challenges. GeneProgram uses expression data to simultaneously organize tissues into groups and genes into overlapping programs with consistent temporal behavior, to produce maps of expression programs, which are sorted by generality scores that exploit the automatically learned groupings. Using synthetic and real gene expression data, we showed that GeneProgram outperformed several popular expression analysis methods. We applied GeneProgram to a compendium of 62 short time-series gene expression datasets exploring the responses of human cells to infectious agents and immune-modulating molecules. GeneProgram produced a map of 104 expression programs, a substantial number of which were significantly enriched for genes involved in key signaling pathways and/or bound by NF-kappaB transcription factors in genome-wide experiments. Further, GeneProgram discovered expression programs that appear to implicate surprising signaling pathways or receptor types in the response to infection, including Wnt signaling and neurotransmitter receptors. We believe the discovered map of expression programs involved in the response to infection will be useful for guiding future biological experiments; genes from programs with low generality scores might serve as new drug targets that exhibit minimal

  9. Blood cell gene expression profiling in rheumatoid arthritis. Discriminative genes and effect of rheumatoid factor

    DEFF Research Database (Denmark)

    Bovin, Lone Frier; Rieneck, Klaus; Workman, Christopher

    2004-01-01

    To study the pathogenic importance of the rheumatoid factor (RF) in rheumatoid arthritis (RA) and to identify genes differentially expressed in patients and healthy individuals, total RNA was isolated from peripheral blood mononuclear cells (PBMC) from eight RF-positive and six RF-negative RA...... patients, and seven healthy controls. Gene expression of about 10,000 genes were examined using oligonucleotide-based DNA chip microarrays. The analyses showed no significant differences in PBMC expression patterns from RF-positive and RF-negative patients. However, comparisons of gene expression patterns...

  10. A compendium of canine normal tissue gene expression.

    Directory of Open Access Journals (Sweden)

    Joseph Briggs

    Full Text Available BACKGROUND: Our understanding of disease is increasingly informed by changes in gene expression between normal and abnormal tissues. The release of the canine genome sequence in 2005 provided an opportunity to better understand human health and disease using the dog as clinically relevant model. Accordingly, we now present the first genome-wide, canine normal tissue gene expression compendium with corresponding human cross-species analysis. METHODOLOGY/PRINCIPAL FINDINGS: The Affymetrix platform was utilized to catalogue gene expression signatures of 10 normal canine tissues including: liver, kidney, heart, lung, cerebrum, lymph node, spleen, jejunum, pancreas and skeletal muscle. The quality of the database was assessed in several ways. Organ defining gene sets were identified for each tissue and functional enrichment analysis revealed themes consistent with known physio-anatomic functions for each organ. In addition, a comparison of orthologous gene expression between matched canine and human normal tissues uncovered remarkable similarity. To demonstrate the utility of this dataset, novel canine gene annotations were established based on comparative analysis of dog and human tissue selective gene expression and manual curation of canine probeset mapping. Public access, using infrastructure identical to that currently in use for human normal tissues, has been established and allows for additional comparisons across species. CONCLUSIONS/SIGNIFICANCE: These data advance our understanding of the canine genome through a comprehensive analysis of gene expression in a diverse set of tissues, contributing to improved functional annotation that has been lacking. Importantly, it will be used to inform future studies of disease in the dog as a model for human translational research and provides a novel resource to the community at large.

  11. Evaluation of suitable reference genes for gene expression studies ...

    Indian Academy of Sciences (India)

    2011-12-14

    Dec 14, 2011 ... MADS family of TFs control floral organ identity within each whorl of the flower by activating downstream genes. Measuring gene expression in different tissue types and developmental stages is of fundamental importance in TFs functional research. In last few years, quantitative real-time. PCR (qRT-PCR) ...

  12. Identification of differentially expressed genes in cucumber (Cucumis sativus L.) root under waterlogging stress by digital gene expression profile.

    Science.gov (United States)

    Qi, Xiao-Hua; Xu, Xue-Wen; Lin, Xiao-Jian; Zhang, Wen-Jie; Chen, Xue-Hao

    2012-03-01

    High-throughput tag-sequencing (Tag-seq) analysis based on the Solexa Genome Analyzer platform was applied to analyze the gene expression profiling of cucumber plant at 5 time points over a 24h period of waterlogging treatment. Approximately 5.8 million total clean sequence tags per library were obtained with 143013 distinct clean tag sequences. Approximately 23.69%-29.61% of the distinct clean tags were mapped unambiguously to the unigene database, and 53.78%-60.66% of the distinct clean tags were mapped to the cucumber genome database. Analysis of the differentially expressed genes revealed that most of the genes were down-regulated in the waterlogging stages, and the differentially expressed genes mainly linked to carbon metabolism, photosynthesis, reactive oxygen species generation/scavenging, and hormone synthesis/signaling. Finally, quantitative real-time polymerase chain reaction using nine genes independently verified the tag-mapped results. This present study reveals the comprehensive mechanisms of waterlogging-responsive transcription in cucumber. Copyright © 2011 Elsevier Inc. All rights reserved.

  13. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  14. A systems genetics approach identifies CXCL14, ITGAX, and LPCAT2 as novel aggressive prostate cancer susceptibility genes.

    Directory of Open Access Journals (Sweden)

    Kendra A Williams

    2014-11-01

    Full Text Available Although prostate cancer typically runs an indolent course, a subset of men develop aggressive, fatal forms of this disease. We hypothesize that germline variation modulates susceptibility to aggressive prostate cancer. The goal of this work is to identify susceptibility genes using the C57BL/6-Tg(TRAMP8247Ng/J (TRAMP mouse model of neuroendocrine prostate cancer. Quantitative trait locus (QTL mapping was performed in transgene-positive (TRAMPxNOD/ShiLtJ F2 intercross males (n = 228, which facilitated identification of 11 loci associated with aggressive disease development. Microarray data derived from 126 (TRAMPxNOD/ShiLtJ F2 primary tumors were used to prioritize candidate genes within QTLs, with candidate genes deemed as being high priority when possessing both high levels of expression-trait correlation and a proximal expression QTL. This process enabled the identification of 35 aggressive prostate tumorigenesis candidate genes. The role of these genes in aggressive forms of human prostate cancer was investigated using two concurrent approaches. First, logistic regression analysis in two human prostate gene expression datasets revealed that expression levels of five genes (CXCL14, ITGAX, LPCAT2, RNASEH2A, and ZNF322 were positively correlated with aggressive prostate cancer and two genes (CCL19 and HIST1H1A were protective for aggressive prostate cancer. Higher than average levels of expression of the five genes that were positively correlated with aggressive disease were consistently associated with patient outcome in both human prostate cancer tumor gene expression datasets. Second, three of these five genes (CXCL14, ITGAX, and LPCAT2 harbored polymorphisms associated with aggressive disease development in a human GWAS cohort consisting of 1,172 prostate cancer patients. This study is the first example of using a systems genetics approach to successfully identify novel susceptibility genes for aggressive prostate cancer. Such

  15. Fear conditioning leads to alteration in specific genes expression in cortical and thalamic neurons that project to the lateral amygdala.

    Science.gov (United States)

    Katz, Ira K; Lamprecht, Raphael

    2015-02-01

    RNA transcription is needed for memory formation. However, the ability to identify genes whose expression is altered by learning is greatly impaired because of methodological difficulties in profiling gene expression in specific neurons involved in memory formation. Here, we report a novel approach to monitor the expression of genes after learning in neurons in specific brain pathways needed for memory formation. In this study, we aimed to monitor gene expression after fear learning. We retrogradely labeled discrete thalamic neurons that project to the lateral amygdala (LA) of rats. The labeled neurons were dissected, using laser microdissection microscopy, after fear conditioning learning or unpaired training. The RNAs from the dissected neurons were subjected to microarray analysis. The levels of selected RNAs detected by the microarray analysis to be altered by fear conditioning were also assessed by nanostring analysis. We observed that the expression of genes involved in the regulation of translation, maturation and degradation of proteins was increased 6 h after fear conditioning compared to unpaired or naïve trained rats. These genes were not expressed 24 h after training or in cortical neurons that project to the LA. The expression of genes involved in transcription regulation and neuronal development was altered after fear conditioning learning in the cortical-LA pathway. The present study provides key information on the identity of genes expressed in discrete thalamic and cortical neurons that project to the LA after fear conditioning. Such an approach could also serve to identify gene products as targets for the development of a new generation of therapeutic agents that could be aimed to functionally identified brain circuits to treat memory-related disorders. © 2014 International Society for Neurochemistry.

  16. Multiplex preamplification of specific cDNA targets prior to gene expression analysis by TaqMan Arrays

    Directory of Open Access Journals (Sweden)

    Ribal María

    2008-06-01

    Full Text Available Abstract Background An accurate gene expression quantification using TaqMan Arrays (TA could be limited by the low RNA quantity obtained from some clinical samples. The novel cDNA preamplification system, the TaqMan PreAmp Master Mix kit (TPAMMK, enables a multiplex preamplification of cDNA targets and therefore, could provide a sufficient amount of specific amplicons for their posterior analysis on TA. Findings A multiplex preamplification of 47 genes was performed in 22 samples prior to their analysis by TA, and relative gene expression levels of non-preamplified (NPA and preamplified (PA samples were compared. Overall, the mean cycle threshold (CT decrement in the PA genes was 3.85 (ranging from 2.07 to 5.01. A high correlation (r between the gene expression measurements of NPA and PA samples was found (mean r = 0.970, ranging from 0.937 to 0.994; p Conclusion We demonstrate that cDNA preamplification using the TPAMMK before TA analysis is a reliable approach to simultaneously measure gene expression of multiple targets in a single sample. Moreover, this procedure was validated in genes from degraded RNA samples and low abundance expressed genes. This combined methodology could have wide applications in clinical research, where scarce amounts of degraded RNA are usually obtained and several genes need to be quantified in each sample.

  17. Gene expression profiling of resting and activated vascular smooth muscle cells by serial analysis of gene expression and clustering analysis

    NARCIS (Netherlands)

    Beauchamp, Nicholas J.; van Achterberg, Tanja A. E.; Engelse, Marten A.; Pannekoek, Hans; de Vries, Carlie J. M.

    2003-01-01

    Migration and proliferation of vascular smooth muscle cells (SMCs) are key events in atherosclerosis. However, little is known about alterations in gene expression upon transition of the quiescent, contractile SMC to the proliferative SMC. We performed serial analysis of gene expression (SAGE) of

  18. Oxygen and tissue culture affect placental gene expression.

    Science.gov (United States)

    Brew, O; Sullivan, M H F

    2017-07-01

    Placental explant culture is an important model for studying placental development and functions. We investigated the differences in placental gene expression in response to tissue culture, atmospheric and physiologic oxygen concentrations. Placental explants were collected from normal term (38-39 weeks of gestation) placentae with no previous uterine contractile activity. Placental transcriptomic expressions were evaluated with GeneChip ® Human Genome U133 Plus 2.0 arrays (Affymetrix). We uncovered sub-sets of genes that regulate response to stress, induction of apoptosis programmed cell death, mis-regulation of cell growth, proliferation, cell morphogenesis, tissue viability, and protection from apoptosis in cultured placental explants. We also identified a sub-set of genes with highly unstable pattern of expression after exposure to tissue culture. Tissue culture irrespective of oxygen concentration induced dichotomous increase in significant gene expression and increased enrichment of significant pathways and transcription factor targets (TFTs) including HIF1A. The effect was exacerbated by culture at atmospheric oxygen concentration, where further up-regulation of TFTs including PPARA, CEBPD, HOXA9 and down-regulated TFTs such as JUND/FOS suggest intrinsic heightened key biological and metabolic mechanisms such as glucose use, lipid biosynthesis, protein metabolism; apoptosis, inflammatory responses; and diminished trophoblast proliferation, differentiation, invasion, regeneration, and viability. These findings demonstrate that gene expression patterns differ between pre-culture and cultured explants, and the gene expression of explants cultured at atmospheric oxygen concentration favours stressed, pro-inflammatory and increased apoptotic transcriptomic response. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. VH gene expression and regulation in the mutant Alicia rabbit. Rescue of VHa2 allotype expression.

    Science.gov (United States)

    Chen, H T; Alexander, C B; Young-Cooper, G O; Mage, R G

    1993-04-01

    Rabbits of the Alicia strain, derived from rabbits expressing the VHa2 allotype, have a mutation in the H chain locus that has a cis effect upon the expression of VHa2 and VHa- genes. A small deletion at the most J-proximal (3') end of the VH locus leads to low expression of all the genes on the entire chromosome in heterozygous ali mutants and altered relative expression of VH genes in homozygotes. To study VH gene expression and regulation, we used the polymerase chain reaction to amplify the VH genes expressed in spleens of young and adult wild-type and mutant Alicia rabbits. The cDNA from reverse transcription of splenic mRNA was amplified and polymerase chain reaction libraries were constructed and screened with oligonucleotides from framework regions 1 and 3, as well as JH. Thirty-three VH-positive clones were sequenced and analyzed. We found that in mutant Alicia rabbits, products of the first functional VH gene (VH4a2), (or VH4a2-like genes) were expressed in 2- to 8-wk-olds. Expression of both the VHx and VHy types of VHa- genes was also elevated but the relative proportions of VHx and VHy, especially VHx, decreased whereas the relative levels of expression of VH4a2 or VH4a2-like genes increased with age. Our results suggest that the appearance of sequences resembling that of the VH1a2, which is deleted in the mutant ali rabbits, could be caused by alterations of the sequences of the rearranged VH4a2 genes by gene conversions and/or rearrangement of upstream VH1a2-like genes later in development.

  20. The human cumulus--oocyte complex gene-expression profile

    Science.gov (United States)

    Assou, Said; Anahory, Tal; Pantesco, Véronique; Le Carrour, Tanguy; Pellestor, Franck; Klein, Bernard; Reyftmann, Lionel; Dechaud, Hervé; De Vos, John; Hamamah, Samir

    2006-01-01

    BACKGROUND The understanding of the mechanisms regulating human oocyte maturation is still rudimentary. We have identified transcripts differentially expressed between immature and mature oocytes, and cumulus cells. METHODS Using oligonucleotides microarrays, genome wide gene expression was studied in pooled immature and mature oocytes or cumulus cells from patients who underwent IVF. RESULTS In addition to known genes such as DAZL, BMP15 or GDF9, oocytes upregulated 1514 genes. We show that PTTG3 and AURKC are respectively the securin and the Aurora kinase preferentially expressed during oocyte meiosis. Strikingly, oocytes overexpressed previously unreported growth factors such as TNFSF13/APRIL, FGF9, FGF14, and IL4, and transcription factors including OTX2, SOX15 and SOX30. Conversely, cumulus cells, in addition to known genes such as LHCGR or BMPR2, overexpressed cell-tocell signaling genes including TNFSF11/RANKL, numerous complement components, semaphorins (SEMA3A, SEMA6A, SEMA6D) and CD genes such as CD200. We also identified 52 genes progressively increasing during oocyte maturation, comprising CDC25A and SOCS7. CONCLUSION The identification of genes up and down regulated during oocyte maturation greatly improves our understanding of oocyte biology and will provide new markers that signal viable and competent oocytes. Furthermore, genes found expressed in cumulus cells are potential markers of granulosa cell tumors. PMID:16571642

  1. Neonatal Gene Therapy for Hemophilia B by a Novel Adenovirus Vector Showing Reduced Leaky Expression of Viral Genes.

    Science.gov (United States)

    Iizuka, Shunsuke; Sakurai, Fuminori; Tachibana, Masashi; Ohashi, Kazuo; Mizuguchi, Hiroyuki

    2017-09-15

    Gene therapy during neonatal and infant stages is a promising approach for hemophilia B, a congenital disorder caused by deficiency of blood coagulation factor IX (FIX). An adenovirus (Ad) vector has high potential for use in neonatal or infant gene therapy for hemophilia B due to its superior transduction properties; however, leaky expression of Ad genes often reduces the transduction efficiencies by Ad protein-mediated tissue damage. Here, we used a novel Ad vector, Ad-E4-122aT, which exhibits a reduction in the leaky expression of Ad genes in liver, in gene therapy studies for neonatal hemophilia B mice. Ad-E4-122aT exhibited significantly higher transduction efficiencies than a conventional Ad vector in neonatal mice. In neonatal hemophilia B mice, a single neonatal injection of Ad-E4-122aT expressing human FIX (hFIX) (Ad-E4-122aT-AHAFIX) maintained more than 6% of the normal plasma hFIX activity levels for approximately 100 days. Sequential administration of Ad-E4-122aT-AHAFIX resulted in more than 100% of the plasma hFIX activity levels for more than 100 days and rescued the bleeding phenotypes of hemophilia B mice. In addition, immunotolerance to hFIX was induced by Ad-E4-122aT-AHAFIX administration in neonatal hemophilia B mice. These results indicated that Ad-E4-122aT is a promising gene delivery vector for neonatal or infant gene therapy for hemophilia B.

  2. Accurate Gene Expression-Based Biodosimetry Using a Minimal Set of Human Gene Transcripts

    Energy Technology Data Exchange (ETDEWEB)

    Tucker, James D., E-mail: jtucker@biology.biosci.wayne.edu [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Joiner, Michael C. [Department of Radiation Oncology, Wayne State University, Detroit, Michigan (United States); Thomas, Robert A.; Grever, William E.; Bakhmutsky, Marina V. [Department of Biological Sciences, Wayne State University, Detroit, Michigan (United States); Chinkhota, Chantelle N.; Smolinski, Joseph M. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States); Divine, George W. [Department of Public Health Sciences, Henry Ford Hospital, Detroit, Michigan (United States); Auner, Gregory W. [Department of Electrical and Computer Engineering, Wayne State University, Detroit, Michigan (United States)

    2014-03-15

    Purpose: Rapid and reliable methods for conducting biological dosimetry are a necessity in the event of a large-scale nuclear event. Conventional biodosimetry methods lack the speed, portability, ease of use, and low cost required for triaging numerous victims. Here we address this need by showing that polymerase chain reaction (PCR) on a small number of gene transcripts can provide accurate and rapid dosimetry. The low cost and relative ease of PCR compared with existing dosimetry methods suggest that this approach may be useful in mass-casualty triage situations. Methods and Materials: Human peripheral blood from 60 adult donors was acutely exposed to cobalt-60 gamma rays at doses of 0 (control) to 10 Gy. mRNA expression levels of 121 selected genes were obtained 0.5, 1, and 2 days after exposure by reverse-transcriptase real-time PCR. Optimal dosimetry at each time point was obtained by stepwise regression of dose received against individual gene transcript expression levels. Results: Only 3 to 4 different gene transcripts, ASTN2, CDKN1A, GDF15, and ATM, are needed to explain ≥0.87 of the variance (R{sup 2}). Receiver-operator characteristics, a measure of sensitivity and specificity, of 0.98 for these statistical models were achieved at each time point. Conclusions: The actual and predicted radiation doses agree very closely up to 6 Gy. Dosimetry at 8 and 10 Gy shows some effect of saturation, thereby slightly diminishing the ability to quantify higher exposures. Analyses of these gene transcripts may be advantageous for use in a field-portable device designed to assess exposures in mass casualty situations or in clinical radiation emergencies.

  3. Platelet-derived growth factor (PDGF) B-chain gene expression by activated blood monocytes precedes the expression of the PDGF A-chain gene

    International Nuclear Information System (INIS)

    Martinet, Y.; Jaffe, H.A.; Yamauchi, K.; Betsholtz, C.; Westermark, B.; Heldin, C.H.; Crystal, R.G.

    1987-01-01

    When activated, normal human blood monocytes are known to express the c-sis proto-oncogene coding for PDGF B-chain. Since normal human platelet PDGF molecules are dimers of A and B chains and platelets and monocytes are derived from the same marrow precursors, activated blood monocytes were simultaneously evaluated for their expression of PDGF A and B chain genes. Human blood monocytes were purified by adherence, cultured with or without activation by lipopolysaccharide and poly(A)+ RNA evaluated using Northern analysis and 32 P-labeled A-chain and B-chain (human c-sis) probes. Unstimulated blood monocytes did not express either A-chain or B-chain genes. In contrast, activated monocytes expressed a 4.2 kb mRNA B-chain transcript at 4 hr, but the B-chain mRNA levels declined significantly over the next 18 hr. In comparison, activated monocytes expressed very little A-chain mRNA at 4 hr, but at 12 hr 1.9, 2.3, and 2.8 kb transcripts were observed and persisted through 24 hr. Thus, activation of blood monocytes is followed by PDGF B-chain gene expression preceding PDGF A-chain gene expression, suggesting a difference in the regulation of the expression of the genes for these two chains by these cells

  4. Variation in gene expression within clones of the earthworm Dendrobaena octaedra.

    Directory of Open Access Journals (Sweden)

    Marina Mustonen

    Full Text Available Gene expression is highly plastic, which can help organisms to both acclimate and adapt to changing environments. Possible variation in gene expression among individuals with the same genotype (among clones is not widely considered, even though it could impact the results of studies that focus on gene expression phenotypes, for example studies using clonal lines. We examined the extent of within and between clone variation in gene expression in the earthworm Dendrobaena octaedra, which reproduces through apomictic parthenogenesis. Five microsatellite markers were developed and used to confirm that offspring are genetic clones of their parent. After that, expression of 12 genes was measured from five individuals each from six clonal lines after exposure to copper contaminated soil. Variation in gene expression was higher over all genotypes than within genotypes, as initially assumed. A subset of the genes was also examined in the offspring of exposed individuals in two of the clonal lines. In this case, variation in gene expression within genotypes was as high as that observed over all genotypes. One gene in particular (chymotrypsin inhibitor also showed significant differences in the expression levels among genetically identical individuals. Gene expression can vary considerably, and the extent of variation may depend on the genotypes and genes studied. Ensuring a large sample, with many different genotypes, is critical in studies comparing gene expression phenotypes. Researchers should be especially cautious inferring gene expression phenotypes when using only a single clonal or inbred line, since the results might be specific to only certain genotypes.

  5. Expression and functional assessment of candidate type 2 diabetes susceptibility genes identify four new genes contributing to human insulin secretion

    Directory of Open Access Journals (Sweden)

    Fatou K. Ndiaye

    2017-06-01

    Full Text Available Objectives: Genome-wide association studies (GWAS have identified >100 loci independently contributing to type 2 diabetes (T2D risk. However, translational implications for precision medicine and for the development of novel treatments have been disappointing, due to poor knowledge of how these loci impact T2D pathophysiology. Here, we aimed to measure the expression of genes located nearby T2D associated signals and to assess their effect on insulin secretion from pancreatic beta cells. Methods: The expression of 104 candidate T2D susceptibility genes was measured in a human multi-tissue panel, through PCR-free expression assay. The effects of the knockdown of beta-cell enriched genes were next investigated on insulin secretion from the human EndoC-βH1 beta-cell line. Finally, we performed RNA-sequencing (RNA-seq so as to assess the pathways affected by the knockdown of the new genes impacting insulin secretion from EndoC-βH1, and we analyzed the expression of the new genes in mouse models with altered pancreatic beta-cell function. Results: We found that the candidate T2D susceptibility genes' expression is significantly enriched in pancreatic beta cells obtained by laser capture microdissection or sorted by flow cytometry and in EndoC-βH1 cells, but not in insulin sensitive tissues. Furthermore, the knockdown of seven T2D-susceptibility genes (CDKN2A, GCK, HNF4A, KCNK16, SLC30A8, TBC1D4, and TCF19 with already known expression and/or function in beta cells changed insulin secretion, supporting our functional approach. We showed first evidence for a role in insulin secretion of four candidate T2D-susceptibility genes (PRC1, SRR, ZFAND3, and ZFAND6 with no previous knowledge of presence and function in beta cells. RNA-seq in EndoC-βH1 cells with decreased expression of PRC1, SRR, ZFAND6, or ZFAND3 identified specific gene networks related to T2D pathophysiology. Finally, a positive correlation between the expression of Ins2 and the

  6. Dlx homeobox gene family expression in osteoclasts.

    Science.gov (United States)

    Lézot, F; Thomas, B L; Blin-Wakkach, C; Castaneda, B; Bolanos, A; Hotton, D; Sharpe, P T; Heymann, D; Carles, G F; Grigoriadis, A E; Berdal, A

    2010-06-01

    Skeletal growth and homeostasis require the finely orchestrated secretion of mineralized tissue matrices by highly specialized cells, balanced with their degradation by osteoclasts. Time- and site-specific expression of Dlx and Msx homeobox genes in the cells secreting these matrices have been identified as important elements in the regulation of skeletal morphology. Such specific expression patterns have also been reported in osteoclasts for Msx genes. The aim of the present study was to establish the expression patterns of Dlx genes in osteoclasts and identify their function in regulating skeletal morphology. The expression patterns of all Dlx genes were examined during the whole osteoclastogenesis using different in vitro models. The results revealed that Dlx1 and Dlx2 are the only Dlx family members with a possible function in osteoclastogenesis as well as in mature osteoclasts. Dlx5 and Dlx6 were detected in the cultures but appear to be markers of monocytes and their derivatives. In vivo, Dlx2 expression in osteoclasts was examined using a Dlx2/LacZ transgenic mouse. Dlx2 is expressed in a subpopulation of osteoclasts in association with tooth, brain, nerve, and bone marrow volumetric growths. Altogether the present data suggest a role for Dlx2 in regulation of skeletal morphogenesis via functions within osteoclasts. (c) 2010 Wiley-Liss, Inc.

  7. PRAME Gene Expression in Acute Leukemia and Its Clinical Significance

    International Nuclear Information System (INIS)

    Ding, Kai; Wang, Xiao-ming; Fu, Rong; Ruan, Er-bao; Liu, Hui; Shao, Zong-hong

    2012-01-01

    To investigate the expression of the preferentially expressed antigen of melanoma (PRAME) gene in acute leukemia and its clinical significance. The level of expressed PRAME mRNA in bone marrow mononuclear cells from 34 patients with acute leukemia (AL) and in 12 bone marrow samples from healthy volunteers was measured via RT-PCR. Correlation analyses between PRAME gene expression and the clinical characteristics (gender, age, white blood count, immunophenotype of leukemia, percentage of blast cells, and karyotype) of the patients were performed. The PRAME gene was expressed in 38.2% of all 34 patients, in 40.7% of the patients with acute myelogenous leukemia (AML, n=27), and in 28.6% of the patients with acute lymphoblastic leukemia (ALL, n=7), but was not expressed in the healthy volunteers. The difference in the expression levels between AML and ALL patients was statistically significant. The rate of gene expression was 80% in M 3 , 33.3% in M 2 , and 28.6% in M 5 . Gene expression was also found to be correlated with CD15 and CD33 expression and abnormal karyotype, but not with age, gender, white blood count or percentage of blast cells. The PRAME gene is highly expressed in acute leukemia and could be a useful marker to monitor minimal residual disease. This gene is also a candidate target for the immunotherapy of acute leukemia

  8. An information-theoretic machine learning approach to expression QTL analysis.

    Directory of Open Access Journals (Sweden)

    Tao Huang

    Full Text Available Expression Quantitative Trait Locus (eQTL analysis is a powerful tool to study the biological mechanisms linking the genotype with gene expression. Such analyses can identify genomic locations where genotypic variants influence the expression of genes, both in close proximity to the variant (cis-eQTL, and on other chromosomes (trans-eQTL. Many traditional eQTL methods are based on a linear regression model. In this study, we propose a novel method by which to identify eQTL associations with information theory and machine learning approaches. Mutual Information (MI is used to describe the association between genetic marker and gene expression. MI can detect both linear and non-linear associations. What's more, it can capture the heterogeneity of the population. Advanced feature selection methods, Maximum Relevance Minimum Redundancy (mRMR and Incremental Feature Selection (IFS, were applied to optimize the selection of the affected genes by the genetic marker. When we applied our method to a study of apoE-deficient mice, it was found that the cis-acting eQTLs are stronger than trans-acting eQTLs but there are more trans-acting eQTLs than cis-acting eQTLs. We compared our results (mRMR.eQTL with R/qtl, and MatrixEQTL (modelLINEAR and modelANOVA. In female mice, 67.9% of mRMR.eQTL results can be confirmed by at least two other methods while only 14.4% of R/qtl result can be confirmed by at least two other methods. In male mice, 74.1% of mRMR.eQTL results can be confirmed by at least two other methods while only 18.2% of R/qtl result can be confirmed by at least two other methods. Our methods provide a new way to identify the association between genetic markers and gene expression. Our software is available from supporting information.

  9. Epigenetic regulation on the gene expression signature in esophagus adenocarcinoma.

    Science.gov (United States)

    Xi, Ting; Zhang, Guizhi

    2017-02-01

    Understanding the molecular mechanisms represents an important step in the development of diagnostic and therapeutic measures of esophagus adenocarcinoma (NOS). The objective of this study is to identify the epigenetic regulation on gene expression in NOS, shedding light on the molecular mechanisms of NOS. In this study, 78 patients with NOS were included and the data of mRNA, miRNA and DNA methylation of were downloaded from The Cancer Genome Atlas (TCGA). Differential analysis between NOS and controls was performed in terms of gene expression, miRNA expression, and DNA methylation. Bioinformatic analysis was followed to explore the regulation mechanisms of miRNA and DNA methylationon gene expression. Totally, up to 1320 differentially expressed genes (DEGs) and 32 differentially expressed miRNAs were identified. 240 DEGs that were not only the target genes but also negatively correlated with the screened differentially expressed miRNAs. 101 DEGs were found to be highlymethylated in CpG islands. Then, 8 differentially methylated genes (DMGs) were selected, which showed down-regulated expression in NOS. Among of these genes, 6 genes including ADHFE1, DPP6, GRIA4, CNKSR2, RPS6KA6 and ZNF135 were target genes of differentially expressed miRNAs (hsa-mir-335, hsa-mir-18a, hsa-mir-93, hsa-mir-106b and hsa-mir-21). The identified altered miRNA, genes and DNA methylation site may be applied as biomarkers for diagnosis and prognosis of NOS. Copyright © 2016 Elsevier GmbH. All rights reserved.

  10. Impacts of Neanderthal-Introgressed Sequences on the Landscape of Human Gene Expression.

    Science.gov (United States)

    McCoy, Rajiv C; Wakefield, Jon; Akey, Joshua M

    2017-02-23

    Regulatory variation influencing gene expression is a key contributor to phenotypic diversity, both within and between species. Unfortunately, RNA degrades too rapidly to be recovered from fossil remains, limiting functional genomic insights about our extinct hominin relatives. Many Neanderthal sequences survive in modern humans due to ancient hybridization, providing an opportunity to assess their contributions to transcriptional variation and to test hypotheses about regulatory evolution. We developed a flexible Bayesian statistical approach to quantify allele-specific expression (ASE) in complex RNA-seq datasets. We identified widespread expression differences between Neanderthal and modern human alleles, indicating pervasive cis-regulatory impacts of introgression. Brain regions and testes exhibited significant downregulation of Neanderthal alleles relative to other tissues, consistent with natural selection influencing the tissue-specific regulatory landscape. Our study demonstrates that Neanderthal-inherited sequences are not silent remnants of ancient interbreeding but have measurable impacts on gene expression that contribute to variation in modern human phenotypes. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Gene expression profiling via LongSAGE in a non-model plant species: a case study in seeds of Brassica napus

    Directory of Open Access Journals (Sweden)

    Friedt Wolfgang

    2009-07-01

    Full Text Available Abstract Background Serial analysis of gene expression (LongSAGE was applied for gene expression profiling in seeds of oilseed rape (Brassica napus ssp. napus. The usefulness of this technique for detailed expression profiling in a non-model organism was demonstrated for the highly complex, neither fully sequenced nor annotated genome of B. napus by applying a tag-to-gene matching strategy based on Brassica ESTs and the annotated proteome of the closely related model crucifer A. thaliana. Results Transcripts from 3,094 genes were detected at two time-points of seed development, 23 days and 35 days after pollination (DAP. Differential expression showed a shift from gene expression involved in diverse developmental processes including cell proliferation and seed coat formation at 23 DAP to more focussed metabolic processes including storage protein accumulation and lipid deposition at 35 DAP. The most abundant transcripts at 23 DAP were coding for diverse protease inhibitor proteins and proteases, including cysteine proteases involved in seed coat formation and a number of lipid transfer proteins involved in embryo pattern formation. At 35 DAP, transcripts encoding napin, cruciferin and oleosin storage proteins were most abundant. Over both time-points, 18.6% of the detected genes were matched by Brassica ESTs identified by LongSAGE tags in antisense orientation. This suggests a strong involvement of antisense transcript expression in regulatory processes during B. napus seed development. Conclusion This study underlines the potential of transcript tagging approaches for gene expression profiling in Brassica crop species via EST matching to annotated A. thaliana genes. Limits of tag detection for low-abundance transcripts can today be overcome by ultra-high throughput sequencing approaches, so that tag-based gene expression profiling may soon become the method of choice for global expression profiling in non-model species.

  12. Profile of the genes expressed in the human peripheral retina, macula, and retinal pigment epithelium determined through serial analysis of gene expression (SAGE)

    Science.gov (United States)

    Sharon, Dror; Blackshaw, Seth; Cepko, Constance L.; Dryja, Thaddeus P.

    2002-01-01

    We used the serial analysis of gene expression (SAGE) technique to catalogue and measure the relative levels of expression of the genes expressed in the human peripheral retina, macula, and retinal pigment epithelium (RPE) from one or both of two humans, aged 88 and 44 years. The cone photoreceptor contribution to all transcription in the retina was found to be similar in the macula versus the retinal periphery, whereas the rod contribution was greater in the periphery versus the macula. Genes encoding structural proteins for axons were found to be expressed at higher levels in the macula versus the retinal periphery, probably reflecting the large proportion of ganglion cells in the central retina. In comparison with the younger eye, the peripheral retina of the older eye had a substantially higher proportion of mRNAs from genes encoding proteins involved in iron metabolism or protection against oxidative damage and a substantially lower proportion of mRNAs from genes encoding proteins involved in rod phototransduction. These differences may reflect the difference in age between the two donors or merely interindividual variation. The RPE library had numerous previously unencountered tags, suggesting that this cell type has a large, idiosyncratic repertoire of expressed genes. Comparison of these libraries with 100 reported nonocular SAGE libraries revealed 89 retina-specific or enriched genes expressed at substantial levels, of which 14 are known to cause a retinal disease and 53 are RPE-specific genes. We expect that these libraries will serve as a resource for understanding the relative expression levels of genes in the retina and the RPE and for identifying additional disease genes. PMID:11756676

  13. Gene Regulation, Modulation, and Their Applications in Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Mario Flores

    2013-01-01

    Full Text Available Common microarray and next-generation sequencing data analysis concentrate on tumor subtype classification, marker detection, and transcriptional regulation discovery during biological processes by exploring the correlated gene expression patterns and their shared functions. Genetic regulatory network (GRN based approaches have been employed in many large studies in order to scrutinize for dysregulation and potential treatment controls. In addition to gene regulation and network construction, the concept of the network modulator that has significant systemic impact has been proposed, and detection algorithms have been developed in past years. Here we provide a unified mathematic description of these methods, followed with a brief survey of these modulator identification algorithms. As an early attempt to extend the concept to new RNA regulation mechanism, competitive endogenous RNA (ceRNA, into a modulator framework, we provide two applications to illustrate the network construction, modulation effect, and the preliminary finding from these networks. Those methods we surveyed and developed are used to dissect the regulated network under different modulators. Not limit to these, the concept of “modulation” can adapt to various biological mechanisms to discover the novel gene regulation mechanisms.

  14. Disease gene characterization through large-scale co-expression analysis.

    Directory of Open Access Journals (Sweden)

    Allen Day

    2009-12-01

    Full Text Available In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET.Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2 and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders.UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.

  15. Amniotic fluid RNA gene expression profiling provides insights into the phenotype of Turner syndrome.

    Science.gov (United States)

    Massingham, Lauren J; Johnson, Kirby L; Scholl, Thomas M; Slonim, Donna K; Wick, Heather C; Bianchi, Diana W

    2014-09-01

    Turner syndrome is a sex chromosome aneuploidy with characteristic malformations. Amniotic fluid, a complex biological material, could contribute to the understanding of Turner syndrome pathogenesis. In this pilot study, global gene expression analysis of cell-free RNA in amniotic fluid supernatant was utilized to identify specific genes/organ systems that may play a role in Turner syndrome pathophysiology. Cell-free RNA from amniotic fluid of five mid-trimester Turner syndrome fetuses and five euploid female fetuses matched for gestational age was extracted, amplified, and hybridized onto Affymetrix(®) U133 Plus 2.0 arrays. Significantly differentially regulated genes were identified using paired t tests. Biological interpretation was performed using Ingenuity Pathway Analysis and BioGPS gene expression atlas. There were 470 statistically significantly differentially expressed genes identified. They were widely distributed across the genome. XIST was significantly down-regulated (p Turner syndrome transcriptome from other aneuploidies we previously studied. Manual curation of the differentially expressed gene list identified genes of possible pathologic significance, including NFATC3, IGFBP5, and LDLR. Transcriptomic differences in the amniotic fluid of Turner syndrome fetuses are due to genome-wide dysregulation. The hematologic/immune system differences may play a role in early-onset autoimmune dysfunction. Other genes identified with possible pathologic significance are associated with cardiac and skeletal systems, which are known to be affected in females with Turner syndrome. The discovery-driven approach described here may be useful in elucidating novel mechanisms of disease in Turner syndrome.

  16. Molecular Imaging of Gene Expression and Efficacy following Adenoviral-Mediated Brain Tumor Gene Therapy

    Directory of Open Access Journals (Sweden)

    Alnawaz Rehemtulla

    2002-01-01

    Full Text Available Cancer gene therapy is an active area of research relying upon the transfer and subsequent expression of a therapeutic transgene into tumor cells in order to provide for therapeutic selectivity. Noninvasive assessment of therapeutic response and correlation of the location, magnitude, and duration of transgene expression in vivo would be particularly useful in the development of cancer gene therapy protocols by facilitating optimization of gene transfer protocols, vector development, and prodrug dosing schedules. In this study, we developed an adenoviral vector containing both the therapeutic transgene yeast cytosine deaminase (yCD along with an optical reporter gene (luciferase. Following intratumoral injection of the vector into orthotopic 9L gliomas, anatomical and diffusion-weighted MR images were obtained over time in order to provide for quantitative assessment of overall therapeutic efficacy and spatial heterogeneity of cell kill, respectively. In addition, bioluminescence images were acquired to assess the duration and magnitude of gene expression. MR images revealed significant reduction in tumor growth rates associated with yCD/5-fluorocytosine (5FC gene therapy. Significant increases in mean tumor diffusion values were also observed during treatment with 5FC. Moreover, spatial heterogeneity in tumor diffusion changes were also observed revealing that diffusion magnetic resonance imaging could detect regional therapeutic effects due to the nonuniform delivery and/or expression of the therapeutic yCD transgene within the tumor mass. In addition, in vivo bioluminescence imaging detected luciferase gene expression, which was found to decrease over time during administration of the prodrug providing a noninvasive surrogate marker for monitoring gene expression. These results demonstrate the efficacy of the yCD/5FC strategy for the treatment of brain tumors and reveal the feasibility of using multimodality molecular and functional imaging

  17. From gene networks to drugs: systems pharmacology approaches for AUD.

    Science.gov (United States)

    Ferguson, Laura B; Harris, R Adron; Mayfield, Roy Dayne

    2018-06-01

    The alcohol research field has amassed an impressive number of gene expression datasets spanning key brain areas for addiction, species (humans as well as multiple animal models), and stages in the addiction cycle (binge/intoxication, withdrawal/negative effect, and preoccupation/anticipation). These data have improved our understanding of the molecular adaptations that eventually lead to dysregulation of brain function and the chronic, relapsing disorder of addiction. Identification of new medications to treat alcohol use disorder (AUD) will likely benefit from the integration of genetic, genomic, and behavioral information included in these important datasets. Systems pharmacology considers drug effects as the outcome of the complex network of interactions a drug has rather than a single drug-molecule interaction. Computational strategies based on this principle that integrate gene expression signatures of pharmaceuticals and disease states have shown promise for identifying treatments that ameliorate disease symptoms (called in silico gene mapping or connectivity mapping). In this review, we suggest that gene expression profiling for in silico mapping is critical to improve drug repurposing and discovery for AUD and other psychiatric illnesses. We highlight studies that successfully apply gene mapping computational approaches to identify or repurpose pharmaceutical treatments for psychiatric illnesses. Furthermore, we address important challenges that must be overcome to maximize the potential of these strategies to translate to the clinic and improve healthcare outcomes.

  18. Tracking Differential Gene Expression in MRL/MpJ Versus C57BL/6 Anergic B Cells: Molecular Markers of Autoimmunity

    Directory of Open Access Journals (Sweden)

    Amy G. Clark

    2008-01-01

    Full Text Available Background: Anergy is a key mechanism controlling expression of autoreactive B cells and a major site for failed regulation in autoimmune diseases. Yet the molecular basis for this differentiated cell state remains poorly understood. The current lack of well-characterized surface or molecular markers hinders the isolation of anergic cells for further study. Global gene profiling recently identified transcripts whose expression differentiates anergic from naïve B cells in model mouse systems. The objective of the current study was to evaluate the molecular and cellular processes that differentiate anergic cells that develop in the healthy C57BL/6 (B6 milieu from those that develop in the autoimmune-prone MRL/MpJ (MRL background. This approach takes advantage of B6 and MRL mice bearing an anti-laminin Ig transgene with a well characterized anergic B cell phenotype.Results: Global gene expression was evaluated in purified transgenic B cells using Operon version 3.0 oligonucleotide microarray assaying 31,000 oligoprobes. Genes with a 2-fold expression difference in B6 as compared to MRL anergic B cells were identified. Expression of selected genes was confirmed using quantitative RT-PCR. This approach identified 43 probes corresponding to 37 characterized genes, including Ptpn22, CD74, Birc1f/Naip, and Ctla4, as differentially expressed in anergic B cells in the two strains. Gene Ontology classification identified differentiation, cell cycle, proliferation, development, apoptosis, and cell death as prominently represented ontology groups. Ingenuity Pathway Analysis identified two major networks incorporating 27 qualifying genes. Network 1 centers on beta-estradiol and TP53, and Network 2 encompasses RB1, p38 MAPK, and NFkB cell growth, proliferation, and cell cycle signaling pathways.Conclusion: Using microarray analysis we identified 37 characterized genes and two functional pathways engaged in maintenance of B cell anergy for which expression is

  19. Anterior-posterior regionalized gene expression in the Ciona notochord.

    Science.gov (United States)

    Reeves, Wendy; Thayer, Rachel; Veeman, Michael

    2014-04-01

    In the simple ascidian chordate Ciona, the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape, and behavior vary consistently along the anterior-posterior (AP) axis. Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL, and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. Copyright © 2013 Wiley Periodicals, Inc.

  20. Changes in gene expression following androgen receptor blockade ...

    Indian Academy of Sciences (India)

    Madhu urs

    of gene expression in the ventral prostate, it is not clear whether all the gene expression ... These include clusterin, methionine adenosyl transferase IIα, and prostate-specific ..... MAGEE1 melanoma antigen and no similarity was found with the ...

  1. Selection of Reference Genes for Expression Studies in Diaphorina citri (Hemiptera: Liviidae).

    Science.gov (United States)

    Bassan, Meire Menezes; Angelotti-Mendonc A, Je Ssika; Alves, Gustavo Rodrigues; Yamamoto, Pedro Takao; Moura O Filho, Francisco de Assis Alves

    2017-12-05

    The Asian citrus psyllid, Diaphorina citri Kuwayama (Hemiptera: Liviidae), is considered the main vector of the bacteria associated with huanglongbing, a very serious disease that has threatened the world citrus industry. The absence of efficient control management protocols, including a lack of resistant cultivars, has led to the development of different approaches to study this pathosystem. The production of resistant genotypes relies on D. citri gene expression analyses by RT-qPCR to assess control of the vector population. High-quality, reliable RT-qPCR analyses depend upon proper reference gene selection and validation. However, adequate D. citri reference genes have not yet been identified. In the present study, we evaluated the genes EF 1-α, ACT, GAPDH, RPL7, RPL17, and TUB as candidate reference genes for this insect. Gene expression stability was evaluated using the mathematical algorithms deltaCt, NormFinder, BestKeeper, and geNorm, at five insect developmental stages, grown on two different plant hosts [Citrus sinensis (L.) Osbeck (Sapindales: Rutaceae) and Murraya paniculata (L.) Jack (Sapindales: Rutaceae)]. The final gene ranking was calculated using RefFinder software, and the V-ATPase-A gene was selected for validation. According to our results, two reference genes are recommended when different plant hosts and developmental stages are considered. Considering gene expression studies in D. citri grown on M. paniculata, regardless of the insect developmental stage, GAPDH and RPL7 have the best fit as reference genes in RT-qPCR analyses, whereas GAPDH and EF 1-α are recommended as reference genes in insect studies using C. sinensis. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  2. Expression and clinical significance of Pax6 gene in retinoblastoma

    Directory of Open Access Journals (Sweden)

    Hai-Dong Huang

    2013-07-01

    Full Text Available AIM: To discuss the expression and clinical significance of Pax6 gene in retinoblastoma(Rb. METHODS: Totally 15 cases of fresh Rb organizations were selected as observation group and 15 normal retinal organizations as control group. Western-Blot and reverse transcriptase polymerase chain reaction(RT-PCRmethods were used to detect Pax6 protein and Pax6 mRNA expressions of the normal retina organizations and Rb organizations. At the same time, Western Blot method was used to detect the Pax6 gene downstream MATH5 and BRN3b differentiation gene protein level expression. After the comparison between two groups, the expression and clinical significance of Pax6 gene in Rb were discussed. RESULTS: In the observation group, average value of mRNA expression of Pax6 gene was 0.99±0.03; average value of Pax6 gene protein expression was 2.07±0.15; average value of BRN3b protein expression was 0.195±0.016; average value of MATH5 protein expression was 0.190±0.031. They were significantly higher than the control group, and the differences were statistically significant(PCONCLUSION: Abnormal expression of Pax6 gene is likely to accelerate the occurrence of Rb.

  3. Observation of intermittency in gene expression on cDNA microarrays

    CERN Document Server

    Peterson, L E

    2002-01-01

    We used scaled factorial moments to search for intermittency in the log expression ratios (LERs) for thousands of genes spotted on cDNA microarrays (gene chips). Results indicate varying levels of intermittency in gene expression. The observation of intermittency in the data analyzed provides a complimentary handle on moderately expressed genes, generally not tackled by conventional techniques.

  4. Gene expression during testis development in Duroc boars

    DEFF Research Database (Denmark)

    Lervik, Siri; Kristoffersen, Anja Bråthen; Conley, Lene

    2015-01-01

    . Nine clusters of genes with significant differential expression over time and 49 functional charts were found in the analysed testis samples. Prominent pathways in the prepubertal testis were associated with tissue renewal, cell respiration and increased endocytocis. E-cadherines may be associated...... with the onset of pubertal development. With elevated steroidogenesis (weeks 16 to 27), there was an increase in the expression of genes in the MAPK pathway, STAR and its analogue STARD6. A pubertal shift in genes coding for cellular cholesterol transport was observed. Increased expression of meiotic pathways...

  5. Equivalent Gene Expression Profiles between Glatopa™ and Copaxone®.

    Directory of Open Access Journals (Sweden)

    Josephine S D'Alessandro

    Full Text Available Glatopa™ is a generic glatiramer acetate recently approved for the treatment of patients with relapsing forms of multiple sclerosis. Gene expression profiling was performed as a means to evaluate equivalence of Glatopa and Copaxone®. Microarray analysis containing 39,429 unique probes across the entire genome was performed in murine glatiramer acetate--responsive Th2-polarized T cells, a test system highly relevant to the biology of glatiramer acetate. A closely related but nonequivalent glatiramoid molecule was used as a control to establish assay sensitivity. Multiple probe-level (Student's t-test and sample-level (principal component analysis, multidimensional scaling, and hierarchical clustering statistical analyses were utilized to look for differences in gene expression induced by the test articles. The analyses were conducted across all genes measured, as well as across a subset of genes that were shown to be modulated by Copaxone. The following observations were made across multiple statistical analyses: the expression of numerous genes was significantly changed by treatment with Copaxone when compared against media-only control; gene expression profiles induced by Copaxone and Glatopa were not significantly different; and gene expression profiles induced by Copaxone and the nonequivalent glatiramoid were significantly different, underscoring the sensitivity of the test system and the multiple analysis methods. Comparative analysis was also performed on sets of transcripts relevant to T-cell biology and antigen presentation, among others that are known to be modulated by glatiramer acetate. No statistically significant differences were observed between Copaxone and Glatopa in the expression levels (magnitude and direction of these glatiramer acetate-regulated genes. In conclusion, multiple methods consistently supported equivalent gene expression profiles between Copaxone and Glatopa.

  6. Using RNA-Seq data to select refence genes for normalizing gene expression in apple roots

    Science.gov (United States)

    Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for t...

  7. Comparative gene expression of intestinal metabolizing enzymes.

    Science.gov (United States)

    Shin, Ho-Chul; Kim, Hye-Ryoung; Cho, Hee-Jung; Yi, Hee; Cho, Soo-Min; Lee, Dong-Goo; Abd El-Aty, A M; Kim, Jin-Suk; Sun, Duxin; Amidon, Gordon L

    2009-11-01

    The purpose of this study was to compare the expression profiles of drug-metabolizing enzymes in the intestine of mouse, rat and human. Total RNA was isolated from the duodenum and the mRNA expression was measured using Affymetrix GeneChip oligonucleotide arrays. Detected genes from the intestine of mouse, rat and human were ca. 60% of 22690 sequences, 40% of 8739 and 47% of 12559, respectively. Total genes of metabolizing enzymes subjected in this study were 95, 33 and 68 genes in mouse, rat and human, respectively. Of phase I enzymes, the mouse exhibited abundant gene expressions for Cyp3a25, Cyp4v3, Cyp2d26, followed by Cyp2b20, Cyp2c65 and Cyp4f14, whereas, the rat showed higher expression profiles of Cyp3a9, Cyp2b19, Cyp4f1, Cyp17a1, Cyp2d18, Cyp27a1 and Cyp4f6. However, the highly expressed P450 enzymes were CYP3A4, CYP3A5, CYP4F3, CYP2C18, CYP2C9, CYP2D6, CYP3A7, CYP11B1 and CYP2B6 in the human. For phase II enzymes, glucuronosyltransferase Ugt1a6, glutathione S-transferases Gstp1, Gstm3 and Gsta2, sulfotransferase Sult1b1 and acyltransferase Dgat1 were highly expressed in the mouse. The rat revealed predominant expression of glucuronosyltransferases Ugt1a1 and Ugt1a7, sulfotransferase Sult1b1, acetyltransferase Dlat and acyltransferase Dgat1. On the other hand, in human, glucuronosyltransferases UGT2B15 and UGT2B17, glutathione S-transferases MGST3, GSTP1, GSTA2 and GSTM4, sulfotransferases ST1A3 and SULT1A2, acetyltransferases SAT1 and CRAT, and acyltransferase AGPAT2 were dominantly detected. Therefore, current data indicated substantial interspecies differences in the pattern of intestinal gene expression both for P450 enzymes and phase II drug-metabolizing enzymes. This genomic database is expected to improve our understanding of interspecies variations in estimating intestinal prehepatic clearance of oral drugs.

  8. A role for gene duplication and natural variation of gene expression in the evolution of metabolism.

    Directory of Open Access Journals (Sweden)

    Daniel J Kliebenstein

    Full Text Available BACKGROUND: Most eukaryotic genomes have undergone whole genome duplications during their evolutionary history. Recent studies have shown that the function of these duplicated genes can diverge from the ancestral gene via neo- or sub-functionalization within single genotypes. An additional possibility is that gene duplicates may also undergo partitioning of function among different genotypes of a species leading to genetic differentiation. Finally, the ability of gene duplicates to diverge may be limited by their biological function. METHODOLOGY/PRINCIPAL FINDINGS: To test these hypotheses, I estimated the impact of gene duplication and metabolic function upon intraspecific gene expression variation of segmental and tandem duplicated genes within Arabidopsis thaliana. In all instances, the younger tandem duplicated genes showed higher intraspecific gene expression variation than the average Arabidopsis gene. Surprisingly, the older segmental duplicates also showed evidence of elevated intraspecific gene expression variation albeit typically lower than for the tandem duplicates. The specific biological function of the gene as defined by metabolic pathway also modulated the level of intraspecific gene expression variation. The major energy metabolism and biosynthetic pathways showed decreased variation, suggesting that they are constrained in their ability to accumulate gene expression variation. In contrast, a major herbivory defense pathway showed significantly elevated intraspecific variation suggesting that it may be under pressure to maintain and/or generate diversity in response to fluctuating insect herbivory pressures. CONCLUSION: These data show that intraspecific variation in gene expression is facilitated by an interaction of gene duplication and biological activity. Further, this plays a role in controlling diversity of plant metabolism.

  9. Gene expression studies of reference genes for quantitative real-time PCR: an overview in insects.

    Science.gov (United States)

    Shakeel, Muhammad; Rodriguez, Alicia; Tahir, Urfa Bin; Jin, Fengliang

    2018-02-01

    Whenever gene expression is being examined, it is essential that a normalization process is carried out to eliminate non-biological variations. The use of reference genes, such as glyceraldehyde-3-phosphate dehydrogenase, actin, and ribosomal protein genes, is the usual method of choice for normalizing gene expression. Although reference genes are used to normalize target gene expression, a major problem is that the stability of these genes differs among tissues, developmental stages, species, and responses to abiotic factors. Therefore, the use and validation of multiple reference genes are required. This review discusses the reasons that why RT-qPCR has become the preferred method for validating results of gene expression profiles, the use of specific and non-specific dyes and the importance of use of primers and probes for qPCR as well as to discuss several statistical algorithms developed to help the validation of potential reference genes. The conflicts arising in the use of classical reference genes in gene normalization and their replacement with novel references are also discussed by citing the high stability and low stability of classical and novel reference genes under various biotic and abiotic experimental conditions by employing various methods applied for the reference genes amplification.

  10. GOBO: gene expression-based outcome for breast cancer online.

    Directory of Open Access Journals (Sweden)

    Markus Ringnér

    Full Text Available Microarray-based gene expression analysis holds promise of improving prognostication and treatment decisions for breast cancer patients. However, the heterogeneity of breast cancer emphasizes the need for validation of prognostic gene signatures in larger sample sets stratified into relevant subgroups. Here, we describe a multifunctional user-friendly online tool, GOBO (http://co.bmc.lu.se/gobo, allowing a range of different analyses to be performed in an 1881-sample breast tumor data set, and a 51-sample breast cancer cell line set, both generated on Affymetrix U133A microarrays. GOBO supports a wide range of applications including: 1 rapid assessment of gene expression levels in subgroups of breast tumors and cell lines, 2 identification of co-expressed genes for creation of potential metagenes, 3 association with outcome for gene expression levels of single genes, sets of genes, or gene signatures in multiple subgroups of the 1881-sample breast cancer data set. The design and implementation of GOBO facilitate easy incorporation of additional query functions and applications, as well as additional data sets irrespective of tumor type and array platform.

  11. Regulation of mitochondrial gene expression, the epigenetic enigma

    NARCIS (Netherlands)

    Mposhi, Archibold; van der Wijst, Monique G. P.; Faber, Klaas Nico; Rots, Marianne G.

    2017-01-01

    Epigenetics provides an important layer of information on top of the DNA sequence and is essential for establishing gene expression profiles. Extensive studies have shown that nuclear DNA methylation and histone modifications influence nuclear gene expression. However, it remains unclear whether

  12. Ebola virus infection induces irregular dendritic cell gene expression.

    Science.gov (United States)

    Melanson, Vanessa R; Kalina, Warren V; Williams, Priscilla

    2015-02-01

    Filoviruses subvert the human immune system in part by infecting and replicating in dendritic cells (DCs). Using gene arrays, a phenotypic profile of filovirus infection in human monocyte-derived DCs was assessed. Monocytes from human donors were cultured in GM-CSF and IL-4 and were infected with Ebola virus Kikwit variant for up to 48 h. Extracted DC RNA was analyzed on SuperArray's Dendritic and Antigen Presenting Cell Oligo GEArray and compared to uninfected controls. Infected DCs exhibited increased expression of cytokine, chemokine, antiviral, and anti-apoptotic genes not seen in uninfected controls. Significant increases of intracellular antiviral and MHC I and II genes were also noted in EBOV-infected DCs. However, infected DCs failed to show any significant difference in co-stimulatory T-cell gene expression from uninfected DCs. Moreover, several chemokine genes were activated, but there was sparse expression of chemokine receptors that enabled activated DCs to home to lymph nodes. Overall, statistically significant expression of several intracellular antiviral genes was noted, which may limit viral load but fails to stop replication. EBOV gene expression profiling is of vital importance in understanding pathogenesis and devising novel therapeutic treatments such as small-molecule inhibitors.

  13. A network approach to predict pathogenic genes for Fusarium graminearum.

    Science.gov (United States)

    Liu, Xiaoping; Tang, Wei-Hua; Zhao, Xing-Ming; Chen, Luonan

    2010-10-04

    Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB), which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN) of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other pathogenic fungi, which

  14. A network approach to predict pathogenic genes for Fusarium graminearum.

    Directory of Open Access Journals (Sweden)

    Xiaoping Liu

    Full Text Available Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB, which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other

  15. Gene expression analysis of embryonic stem cells expressing VE-cadherin (CD144 during endothelial differentiation

    Directory of Open Access Journals (Sweden)

    Libermann Towia

    2008-05-01

    Full Text Available Abstract Background Endothelial differentiation occurs during normal vascular development in the developing embryo. This process is recapitulated in the adult when endothelial progenitor cells are generated in the bone marrow and can contribute to vascular repair or angiogenesis at sites of vascular injury or ischemia. The molecular mechanisms of endothelial differentiation remain incompletely understood. Novel approaches are needed to identify the factors that regulate endothelial differentiation. Methods Mouse embryonic stem (ES cells were used to further define the molecular mechanisms of endothelial differentiation. By flow cytometry a population of VEGF-R2 positive cells was identified as early as 2.5 days after differentiation of ES cells, and a subset of VEGF-R2+ cells, that were CD41 positive at 3.5 days. A separate population of VEGF-R2+ stem cells expressing the endothelial-specific marker CD144 (VE-cadherin was also identified at this same time point. Channels lined by VE-cadherin positive cells developed within the embryoid bodies (EBs formed by differentiating ES cells. VE-cadherin and CD41 expressing cells differentiate in close proximity to each other within the EBs, supporting the concept of a common origin for cells of hematopoietic and endothelial lineages. Results Microarray analysis of >45,000 transcripts was performed on RNA obtained from cells expressing VEGF-R2+, CD41+, and CD144+ and VEGF-R2-, CD41-, and CD144-. All microarray experiments were performed in duplicate using RNA obtained from independent experiments, for each subset of cells. Expression profiling confirmed the role of several genes involved in hematopoiesis, and identified several putative genes involved in endothelial differentiation. Conclusion The isolation of CD144+ cells during ES cell differentiation from embryoid bodies provides an excellent model system and method for identifying genes that are expressed during endothelial differentiation and that

  16. A resampling-based meta-analysis for detection of differential gene expression in breast cancer

    International Nuclear Information System (INIS)

    Gur-Dedeoglu, Bala; Konu, Ozlen; Kir, Serkan; Ozturk, Ahmet Rasit; Bozkurt, Betul; Ergul, Gulusan; Yulug, Isik G

    2008-01-01

    proposed meta-analysis approach has the ability to detect a set of differentially expressed genes with the least amount of within-group variability, thus providing highly stable gene lists for class prediction. Increased statistical power and stringent filtering criteria used in the present study also make identification of novel candidate genes possible and may provide further insight to improve our understanding of breast cancer development

  17. A resampling-based meta-analysis for detection of differential gene expression in breast cancer

    Directory of Open Access Journals (Sweden)

    Ergul Gulusan

    2008-12-01

    -time qRT-PCR supported the meta-analysis results. Conclusion The proposed meta-analysis approach has the ability to detect a set of differentially expressed genes with the least amount of within-group variability, thus providing highly stable gene lists for class prediction. Increased statistical power and stringent filtering criteria used in the present study also make identification of novel candidate genes possible and may provide further insight to improve our understanding of breast cancer development.

  18. Xylella fastidiosa gene expression analysis by DNA microarrays

    Directory of Open Access Journals (Sweden)

    Regiane F. Travensolo

    2009-01-01

    Full Text Available Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE. All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others. The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.

  19. Clinicopathologic and gene expression parameters predict liver cancer prognosis

    International Nuclear Information System (INIS)

    Hao, Ke; Zhong, Hua; Greenawalt, Danielle; Ferguson, Mark D; Ng, Irene O; Sham, Pak C; Poon, Ronnie T; Molony, Cliona; Schadt, Eric E; Dai, Hongyue; Luk, John M; Lamb, John; Zhang, Chunsheng; Xie, Tao; Wang, Kai; Zhang, Bin; Chudin, Eugene; Lee, Nikki P; Mao, Mao

    2011-01-01

    The prognosis of hepatocellular carcinoma (HCC) varies following surgical resection and the large variation remains largely unexplained. Studies have revealed the ability of clinicopathologic parameters and gene expression to predict HCC prognosis. However, there has been little systematic effort to compare the performance of these two types of predictors or combine them in a comprehensive model. Tumor and adjacent non-tumor liver tissues were collected from 272 ethnic Chinese HCC patients who received curative surgery. We combined clinicopathologic parameters and gene expression data (from both tissue types) in predicting HCC prognosis. Cross-validation and independent studies were employed to assess prediction. HCC prognosis was significantly associated with six clinicopathologic parameters, which can partition the patients into good- and poor-prognosis groups. Within each group, gene expression data further divide patients into distinct prognostic subgroups. Our predictive genes significantly overlap with previously published gene sets predictive of prognosis. Moreover, the predictive genes were enriched for genes that underwent normal-to-tumor gene network transformation. Previously documented liver eSNPs underlying the HCC predictive gene signatures were enriched for SNPs that associated with HCC prognosis, providing support that these genes are involved in key processes of tumorigenesis. When applied individually, clinicopathologic parameters and gene expression offered similar predictive power for HCC prognosis. In contrast, a combination of the two types of data dramatically improved the power to predict HCC prognosis. Our results also provided a framework for understanding the impact of gene expression on the processes of tumorigenesis and clinical outcome

  20. Gene expression and adaptive noncoding changes during human evolution.

    Science.gov (United States)

    Babbitt, Courtney C; Haygood, Ralph; Nielsen, William J; Wray, Gregory A

    2017-06-05

    Despite evidence for adaptive changes in both gene expression and non-protein-coding, putatively regulatory regions of the genome during human evolution, the relationship between gene expression and adaptive changes in cis-regulatory regions remains unclear. Here we present new measurements of gene expression in five tissues of humans and chimpanzees, and use them to assess this relationship. We then compare our results with previous studies of adaptive noncoding changes, analyzing correlations at the level of gene ontology groups, in order to gain statistical power to detect correlations. Consistent with previous studies, we find little correlation between gene expression and adaptive noncoding changes at the level of individual genes; however, we do find significant correlations at the level of biological function ontology groups. The types of function include processes regulated by specific transcription factors, responses to genetic or chemical perturbations, and differentiation of cell types within the immune system. Among functional categories co-enriched with both differential expression and noncoding adaptation, prominent themes include cancer, particularly epithelial cancers, and neural development and function.

  1. Real-time PCR expression profiling of genes encoding potential virulence factors in Candida albicans biofilms: identification of model-dependent and -independent gene expression

    Directory of Open Access Journals (Sweden)

    Řičicová Markéta

    2010-04-01

    Full Text Available Abstract Background Candida albicans infections are often associated with biofilm formation. Previous work demonstrated that the expression of HWP1 (hyphal wall protein and of genes belonging to the ALS (agglutinin-like sequence, SAP (secreted aspartyl protease, PLB (phospholipase B and LIP (lipase gene families is associated with biofilm growth on mucosal surfaces. We investigated using real-time PCR whether genes encoding potential virulence factors are also highly expressed in biofilms associated with abiotic surfaces. For this, C. albicans biofilms were grown on silicone in microtiter plates (MTP or in the Centres for Disease Control (CDC reactor, on polyurethane in an in vivo subcutaneous catheter rat (SCR model, and on mucosal surfaces in the reconstituted human epithelium (RHE model. Results HWP1 and genes belonging to the ALS, SAP, PLB and LIP gene families were constitutively expressed in C. albicans biofilms. ALS1-5 were upregulated in all model systems, while ALS9 was mostly downregulated. ALS6 and HWP1 were overexpressed in all models except in the RHE and MTP, respectively. The expression levels of SAP1 were more pronounced in both in vitro models, while those of SAP2, SAP4 and SAP6 were higher in the in vivo model. Furthermore, SAP5 was highly upregulated in the in vivo and RHE models. For SAP9 and SAP10 similar gene expression levels were observed in all model systems. PLB genes were not considerably upregulated in biofilms, while LIP1-3, LIP5-7 and LIP9-10 were highly overexpressed in both in vitro models. Furthermore, an elevated lipase activity was detected in supernatans of biofilms grown in the MTP and RHE model. Conclusions Our findings show that HWP1 and most of the genes belonging to the ALS, SAP and LIP gene families are upregulated in C. albicans biofilms. Comparison of the fold expression between the various model systems revealed similar expression levels for some genes, while for others model-dependent expression

  2. Population density approach for discrete mRNA distributions in generalized switching models for stochastic gene expression.

    Science.gov (United States)

    Stinchcombe, Adam R; Peskin, Charles S; Tranchina, Daniel

    2012-06-01

    We present a generalization of a population density approach for modeling and analysis of stochastic gene expression. In the model, the gene of interest fluctuates stochastically between an inactive state, in which transcription cannot occur, and an active state, in which discrete transcription events occur; and the individual mRNA molecules are degraded stochastically in an independent manner. This sort of model in simplest form with exponential dwell times has been used to explain experimental estimates of the discrete distribution of random mRNA copy number. In our generalization, the random dwell times in the inactive and active states, T_{0} and T_{1}, respectively, are independent random variables drawn from any specified distributions. Consequently, the probability per unit time of switching out of a state depends on the time since entering that state. Our method exploits a connection between the fully discrete random process and a related continuous process. We present numerical methods for computing steady-state mRNA distributions and an analytical derivation of the mRNA autocovariance function. We find that empirical estimates of the steady-state mRNA probability mass function from Monte Carlo simulations of laboratory data do not allow one to distinguish between underlying models with exponential and nonexponential dwell times in some relevant parameter regimes. However, in these parameter regimes and where the autocovariance function has negative lobes, the autocovariance function disambiguates the two types of models. Our results strongly suggest that temporal data beyond the autocovariance function is required in general to characterize gene switching.

  3. DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

    Directory of Open Access Journals (Sweden)

    Vingron Martin

    2011-06-01

    Full Text Available Abstract Background The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time. Results Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters. The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets. Conclusions We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms.

  4. Gene expression profiling of placentas affected by pre-eclampsia

    DEFF Research Database (Denmark)

    Hoegh, Anne Mette; Borup, Rehannah; Nielsen, Finn Cilius

    2010-01-01

    Several studies point to the placenta as the primary cause of pre-eclampsia. Our objective was to identify placental genes that may contribute to the development of pre-eclampsia. RNA was purified from tissue biopsies from eleven pre-eclamptic placentas and eighteen normal controls. Messenger RNA...... expression from pooled samples was analysed by microarrays. Verification of the expression of selected genes was performed using real-time PCR. A surprisingly low number of genes (21 out of 15,000) were identified as differentially expressed. Among these were genes not previously associated with pre-eclampsia...... as bradykinin B1 receptor and a 14-3-3 protein, but also genes that have already been connected with pre-eclampsia, for example, inhibin beta A subunit and leptin. A low number of genes were repeatedly identified as differentially expressed, because they may represent the endpoint of a cascade of events...

  5. Over-expression of Eph and ephrin genes in advanced ovarian cancer: ephrin gene expression correlates with shortened survival

    Directory of Open Access Journals (Sweden)

    Lincoln Douglas

    2006-06-01

    Full Text Available Abstract Background Increased expression of Eph receptor tyrosine kinases and their ephrin ligands has been implicated in tumor progression in a number of malignancies. This report describes aberrant expression of these genes in ovarian cancer, the commonest cause of death amongst gynaecological malignancies. Methods Eph and ephrin expression was determined using quantitative real time RT-PCR. Correlation of gene expression was measured using Spearman's rho statistic. Survival was analysed using log-rank analysis and (was visualised by Kaplan-Meier survival curves. Results Greater than 10 fold over-expression of EphA1 and a more modest over-expression of EphA2 were observed in partially overlapping subsets of tumors. Over-expression of EphA1 strongly correlated (r = 0.801; p Conclusion These data imply that increased levels of ephrins A1 and A5 in the presence of high expression of Ephs A1 and A2 lead to a more aggressive tumor phenotype. The known functions of Eph/ephrin signalling in cell de-adhesion and movement may explain the observed correlation of ephrin expression with poor prognosis.

  6. Stunned silence: gene expression programs in human cells infected with monkeypox or vaccinia virus.

    Directory of Open Access Journals (Sweden)

    Kathleen H Rubins

    2011-01-01

    Full Text Available Poxviruses use an arsenal of molecular weapons to evade detection and disarm host immune responses. We used DNA microarrays to investigate the gene expression responses to infection by monkeypox virus (MPV, an emerging human pathogen, and Vaccinia virus (VAC, a widely used model and vaccine organism, in primary human macrophages, primary human fibroblasts and HeLa cells. Even as the overwhelmingly infected cells approached their demise, with extensive cytopathic changes, their gene expression programs appeared almost oblivious to poxvirus infection. Although killed (gamma-irradiated MPV potently induced a transcriptional program characteristic of the interferon response, no such response was observed during infection with either live MPV or VAC. Moreover, while the gene expression response of infected cells to stimulation with ionomycin plus phorbol 12-myristate 13-acetate (PMA, or poly (I-C was largely unimpaired by infection with MPV, a cluster of pro-inflammatory genes were a notable exception. Poly(I-C induction of genes involved in alerting the innate immune system to the infectious threat, including TNF-alpha, IL-1 alpha and beta, CCL5 and IL-6, were suppressed by infection with live MPV. Thus, MPV selectively inhibits expression of genes with critical roles in cell-signaling pathways that activate innate immune responses, as part of its strategy for stealthy infection.

  7. Gene co-expression networks shed light into diseases of brain iron accumulation.

    Science.gov (United States)

    Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M; Botía, Juan A; Collingwood, Joanna F; Hardy, John; Milward, Elizabeth A; Ryten, Mina; Houlden, Henry

    2016-03-01

    Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  8. Gene duplication, tissue-specific gene expression and sexual conflict in stalk-eyed flies (Diopsidae).

    Science.gov (United States)

    Baker, Richard H; Narechania, Apurva; Johns, Philip M; Wilkinson, Gerald S

    2012-08-19

    Gene duplication provides an essential source of novel genetic material to facilitate rapid morphological evolution. Traits involved in reproduction and sexual dimorphism represent some of the fastest evolving traits in nature, and gene duplication is intricately involved in the origin and evolution of these traits. Here, we review genomic research on stalk-eyed flies (Diopsidae) that has been used to examine the extent of gene duplication and its role in the genetic architecture of sexual dimorphism. Stalk-eyed flies are remarkable because of the elongation of the head into long stalks, with the eyes and antenna laterally displaced at the ends of these stalks. Many species are strongly sexually dimorphic for eyespan, and these flies have become a model system for studying sexual selection. Using both expressed sequence tag and next-generation sequencing, we have established an extensive database of gene expression in the developing eye-antennal imaginal disc, the adult head and testes. Duplicated genes exhibit narrower expression patterns than non-duplicated genes, and the testes, in particular, provide an abundant source of gene duplication. Within somatic tissue, duplicated genes are more likely to be differentially expressed between the sexes, suggesting gene duplication may provide a mechanism for resolving sexual conflict.

  9. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  10. Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Bing Jiang

    2017-01-01

    Full Text Available Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.

  11. Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

    Science.gov (United States)

    Moon, Myungjin; Nakai, Kenta

    2018-04-01

    Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.

  12. Spatial reconstruction of single-cell gene expression

    Science.gov (United States)

    Satija, Rahul; Farrell, Jeffrey A.; Gennert, David; Schier, Alexander F.; Regev, Aviv

    2015-01-01

    Spatial localization is a key determinant of cellular fate and behavior, but spatial RNA assays traditionally rely on staining for a limited number of RNA species. In contrast, single-cell RNA-seq allows for deep profiling of cellular gene expression, but established methods separate cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos, inferring a transcriptome-wide map of spatial patterning. We confirmed Seurat’s accuracy using several experimental approaches, and used it to identify a set of archetypal expression patterns and spatial markers. Additionally, Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems. PMID:25867923

  13. An Improved Biclustering Algorithm and Its Application to Gene Expression Spectrum Analysis

    OpenAIRE

    Qu, Hua; Wang, Liu-Pu; Liang, Yan-Chun; Wu, Chun-Guo

    2016-01-01

    Cheng and Church algorithm is an important approach in biclustering algorithms. In this paper, the process of the extended space in the second stage of Cheng and Church algorithm is improved and the selections of two important parameters are discussed. The results of the improved algorithm used in the gene expression spectrum analysis show that, compared with Cheng and Church algorithm, the quality of clustering results is enhanced obviously, the mining expression models are better, and the d...

  14. The ordering of expression among a few genes can provide simple cancer biomarkers and signal BRCA1 mutations

    Directory of Open Access Journals (Sweden)

    Parmigiani Giovanni

    2009-08-01

    Full Text Available Abstract Background A major challenge in computational biology is to extract knowledge about the genetic nature of disease from high-throughput data. However, an important obstacle to both biological understanding and clinical applications is the "black box" nature of the decision rules provided by most machine learning approaches, which usually involve many genes combined in a highly complex fashion. Achieving biologically relevant results argues for a different strategy. A promising alternative is to base prediction entirely upon the relative expression ordering of a small number of genes. Results We present a three-gene version of "relative expression analysis" (RXA, a rigorous and systematic comparison with earlier approaches in a variety of cancer studies, a clinically relevant application to predicting germline BRCA1 mutations in breast cancer and a cross-study validation for predicting ER status. In the BRCA1 study, RXA yields high accuracy with a simple decision rule: in tumors carrying mutations, the expression of a "reference gene" falls between the expression of two differentially expressed genes, PPP1CB and RNF14. An analysis of the protein-protein interactions among the triplet of genes and BRCA1 suggests that the classifier has a biological foundation. Conclusion RXA has the potential to identify genomic "marker interactions" with plausible biological interpretation and direct clinical applicability. It provides a general framework for understanding the roles of the genes involved in decision rules, as illustrated for the difficult and clinically relevant problem of identifying BRCA1 mutation carriers.

  15. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  16. Effects of BST and high energy diet on gene expression in mammary parenchyma of dairy heifers

    Directory of Open Access Journals (Sweden)

    Betina Joyce Lew

    2013-07-01

    Full Text Available The objective of this study was to determine the effects of dietary energy and recombinant bovine somatotropin (bST injection to identify genes that might control mammogenesis. Total RNA was extracted from the parenchymal tissue of 32 heifers randomly assigned to one of four treatments: two diets (a standard diet and a high energy, high protein diet, each with or without bST. To perform microarray experiments, RNA samples were pooled (2 animals/pool before reverse transcription and labeling with Cy3 or Cy5. A 4-node loop design was used to examine the differential gene expression among treatments using a bovine-specific cDNA microarray (National Bovine Functional Genomics Consortium Library, NBFGC containing 18,263 unique expressed sequence tags (EST. Significance levels of differential gene expression among treatments were assessed using a mixed model approach. Injection of bST altered the expression of 12 % of the genes on NBFGC slide related to tissue development, whereas 6% were altered by diet. Administration of bST increases the expression of genes positively related to cell proliferation and mammary parenchyma to a greater extent than a high energy diet.

  17. A Marfan syndrome gene expression phenotype in cultured skin fibroblasts

    Directory of Open Access Journals (Sweden)

    Emond Mary

    2007-09-01

    Full Text Available Abstract Background Marfan syndrome (MFS is a heritable connective tissue disorder caused by mutations in the fibrillin-1 gene. This syndrome constitutes a significant identifiable subtype of aortic aneurysmal disease, accounting for over 5% of ascending and thoracic aortic aneurysms. Results We used spotted membrane DNA macroarrays to identify genes whose altered expression levels may contribute to the phenotype of the disease. Our analysis of 4132 genes identified a subset with significant expression differences between skin fibroblast cultures from unaffected controls versus cultures from affected individuals with known fibrillin-1 mutations. Subsequently, 10 genes were chosen for validation by quantitative RT-PCR. Conclusion Differential expression of many of the validated genes was associated with MFS samples when an additional group of unaffected and MFS affected subjects were analyzed (p-value -6 under the null hypothesis that expression levels in cultured fibroblasts are unaffected by MFS status. An unexpected observation was the range of individual gene expression. In unaffected control subjects, expression ranges exceeding 10 fold were seen in many of the genes selected for qRT-PCR validation. The variation in expression in the MFS affected subjects was even greater.

  18. Volatile Gas Production by Methyl Halide Transferase: An In Situ Reporter Of Microbial Gene Expression In Soil.

    Science.gov (United States)

    Cheng, Hsiao-Ying; Masiello, Caroline A; Bennett, George N; Silberg, Jonathan J

    2016-08-16

    Traditional visual reporters of gene expression have only very limited use in soils because their outputs are challenging to detect through the soil matrix. This severely restricts our ability to study time-dependent microbial gene expression in one of the Earth's largest, most complex habitats. Here we describe an approach to report on dynamic gene expression within a microbial population in a soil under natural water levels (at and below water holding capacity) via production of methyl halides using a methyl halide transferase. As a proof-of-concept application, we couple the expression of this gas reporter to the conjugative transfer of a bacterial plasmid in a soil matrix and show that gas released from the matrix displays a strong correlation with the number of transconjugant bacteria that formed. Gas reporting of gene expression will make possible dynamic studies of natural and engineered microbes within many hard-to-image environmental matrices (soils, sediments, sludge, and biomass) at sample scales exceeding those used for traditional visual reporting.

  19. HD CAG-correlated gene expression changes support a simple dominant gain of function

    Science.gov (United States)

    Jacobsen, Jessie C.; Gregory, Gillian C.; Woda, Juliana M.; Thompson, Morgan N.; Coser, Kathryn R.; Murthy, Vidya; Kohane, Isaac S.; Gusella, James F.; Seong, Ihn Sik; MacDonald, Marcy E.; Shioda, Toshi; Lee, Jong-Min

    2011-01-01

    Huntington's disease is initiated by the expression of a CAG repeat-encoded polyglutamine region in full-length huntingtin, with dominant effects that vary continuously with CAG size. The mechanism could involve a simple gain of function or a more complex gain of function coupled to a loss of function (e.g. dominant negative-graded loss of function). To distinguish these alternatives, we compared genome-wide gene expression changes correlated with CAG size across an allelic series of heterozygous CAG knock-in mouse embryonic stem (ES) cell lines (HdhQ20/7, HdhQ50/7, HdhQ91/7, HdhQ111/7), to genes differentially expressed between Hdhex4/5/ex4/5 huntingtin null and wild-type (HdhQ7/7) parental ES cells. The set of 73 genes whose expression varied continuously with CAG length had minimal overlap with the 754-member huntingtin-null gene set but the two were not completely unconnected. Rather, the 172 CAG length-correlated pathways and 238 huntingtin-null significant pathways clustered into 13 shared categories at the network level. A closer examination of the energy metabolism and the lipid/sterol/lipoprotein metabolism categories revealed that CAG length-correlated genes and huntingtin-null-altered genes either were different members of the same pathways or were in unique, but interconnected pathways. Thus, varying the polyglutamine size in full-length huntingtin produced gene expression changes that were distinct from, but related to, the effects of lack of huntingtin. These findings support a simple gain-of-function mechanism acting through a property of the full-length huntingtin protein and point to CAG-correlative approaches to discover its effects. Moreover, for therapeutic strategies based on huntingtin suppression, our data highlight processes that may be more sensitive to the disease trigger than to decreased huntingtin levels. PMID:21536587

  20. Gene expression signature analysis identifies vorinostat as a candidate therapy for gastric cancer.

    Directory of Open Access Journals (Sweden)

    Sofie Claerhout

    Full Text Available Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future.Using microarray technology, we generated a gene expression profile of human gastric cancer-specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern.We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment.